{"id":4561,"date":"2022-02-02T11:32:51","date_gmt":"2022-02-02T16:32:51","guid":{"rendered":"https:\/\/portfolios.cs.earlham.edu\/?page_id=4561"},"modified":"2025-04-14T15:37:11","modified_gmt":"2025-04-14T19:37:11","slug":"andrew-strozewski","status":"publish","type":"page","link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/","title":{"rendered":"Andrew Strozewski"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">Introduction<\/h2>\n\n\n\n<p>Hello, I am a senior Computer Science major graduating in May 2022. This page will host my capstone project. My primary interests are cybersecurity, machine learning, and system administration. My project is using machine learning to identify malicious URLs.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Abstract<\/h2>\n\n\n\n<p>A Uniform Resource Locator (URL) is a link to a website. A malicious URL leads to a website that has been designed or is used for the sole purpose of causing harm to the user. In recent years, malicious website attacks have been ranked first in the top 10 cyber-attack techniques. Before the advent of machine learning, the common method for identifying and limiting access to malicious websites was a blacklist. In the early days of the internet, with fewer websites, crowdsourcing these blacklists was an efficient and robust solution. Now it is impossible to maintain an exhaustive list. Other methods have been implemented as extensions of blacklisting, such as heuristics, where a signature is assigned to each common attack type and web scanners then look for these signatures. As the technology for finding attacks has advanced, so have the attacks. Google estimates that 30 trillion unique URLs are currently in existence. This sheer number of URLs, combined with evolving attack techniques, has shown blacklisting to be slow and rigid. 
Previous studies on using machine learning to identify malicious URLs have primarily focused on batch learning methods. While batch learning allows quicker processing of large amounts of data, it can be limited in identifying new attack types and can be circumvented by attackers disguising their attacks. This project will use online learning to be more adaptable and preprocessing techniques to help counter attacker obfuscation.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"582\" height=\"720\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram.jpg\" alt=\"\" class=\"wp-image-4794\" srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram.jpg 582w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram-243x300.jpg 243w\" sizes=\"auto, (max-width: 582px) 100vw, 582px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Application user interface<\/h2>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"939\" height=\"657\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/GUI_Image.png\" alt=\"\" class=\"wp-image-4927\" srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/GUI_Image.png 939w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/GUI_Image-300x210.png 300w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/GUI_Image-768x537.png 768w\" sizes=\"auto, (max-width: 939px) 100vw, 939px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Research Poster<\/h2>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"878\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Slide1-1024x878.png\" alt=\"\" class=\"wp-image-4930\" 
srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Slide1-1024x878.png 1024w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Slide1-300x257.png 300w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Slide1-768x658.png 768w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Slide1-1536x1317.png 1536w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">GitLab<\/h2>\n\n\n\n<p><a href=\"https:\/\/code.cs.earlham.edu\/ajstroz18\/detecting-urls-with-machine-learning\">https:\/\/code.cs.earlham.edu\/ajstroz18\/detecting-urls-with-machine-learning<\/a><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Software Demonstration video<\/h2>\n\n\n\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"Software demonstration video\" width=\"940\" height=\"529\" src=\"https:\/\/www.youtube.com\/embed\/obAjRp1s6Uk?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Software Paper<\/h2>\n\n\n\n<p><a href=\"https:\/\/drive.google.com\/drive\/u\/0\/folders\/1SR9ahLO1fvuuEER0o7-1Ij1_1UBllplQ\">https:\/\/drive.google.com\/drive\/u\/0\/folders\/1SR9ahLO1fvuuEER0o7-1Ij1_1UBllplQ<\/a><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Papers and Articles I&#8217;ve been reading<\/h2>\n\n\n\n<h2 class=\"wp-block-heading\">Other things I&#8217;ve been up to<\/h2>\n\n\n\n<p><a href=\"https:\/\/portswigger.net\/web-security\">https:\/\/portswigger.net\/web-security<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/pentesterlab.com\/exercises\">https:\/\/pentesterlab.com\/exercises<\/a><\/p>\n\n\n\n<p><a 
href=\"https:\/\/www.hackerone.com\/\">https:\/\/www.hackerone.com\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction Hello, I am a senior Computer Science major graduating in May 2022. This page will host my capstone project. My primary interests are cybersecurity, machine learning, and system administration. My project is using machine learning to identify malicious URLs. &hellip; <a href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/\">Read More<\/a><\/p>\n","protected":false},"author":109,"featured_media":0,"parent":4565,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-4561","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Andrew Strozewski - CS\/DS Student Portfolios<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Andrew Strozewski - CS\/DS Student Portfolios\" \/>\n<meta property=\"og:description\" content=\"Introduction Hello, I am a senior Computer Science major graduating in May 2022. This page will host my capstone project. My primary interests are cybersecurity, machine learning, and system administration. My project is using machine learning to identify malicious URLs. 
&hellip; Read More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/\" \/>\n<meta property=\"og:site_name\" content=\"CS\/DS Student Portfolios\" \/>\n<meta property=\"article:modified_time\" content=\"2025-04-14T19:37:11+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"582\" \/>\n\t<meta property=\"og:image:height\" content=\"720\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/\",\"name\":\"Andrew Strozewski - CS\\\/DS Student 
Portfolios\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2022\\\/04\\\/data-diagram.jpg\",\"datePublished\":\"2022-02-02T16:32:51+00:00\",\"dateModified\":\"2025-04-14T19:37:11+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/#primaryimage\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2022\\\/04\\\/data-diagram.jpg\",\"contentUrl\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2022\\\/04\\\/data-diagram.jpg\",\"width\":582,\"height\":720},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/andrew-strozewski\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Students\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"2022\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/\"},{
\"@type\":\"ListItem\",\"position\":4,\"name\":\"CS488\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/cs488\\\/\"},{\"@type\":\"ListItem\",\"position\":5,\"name\":\"Andrew Strozewski\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\",\"name\":\"CS\\\/DS Student Portfolios\",\"description\":\"AI and ML, Image Classification, Arduino\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Andrew Strozewski - CS\/DS Student Portfolios","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/","og_locale":"en_US","og_type":"article","og_title":"Andrew Strozewski - CS\/DS Student Portfolios","og_description":"Introduction Hello, I am a senior Computer Science major graduating in May 2022. This page will host my capstone project. My primary interests are cybersecurity, machine learning, and system administration. My project is using machine learning to identify malicious URLs. 
&hellip; Read More","og_url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/","og_site_name":"CS\/DS Student Portfolios","article_modified_time":"2025-04-14T19:37:11+00:00","og_image":[{"width":582,"height":720,"url":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/","url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/","name":"Andrew Strozewski - CS\/DS Student Portfolios","isPartOf":{"@id":"https:\/\/portfolios.cs.earlham.edu\/#website"},"primaryImageOfPage":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/#primaryimage"},"image":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/#primaryimage"},"thumbnailUrl":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram.jpg","datePublished":"2022-02-02T16:32:51+00:00","dateModified":"2025-04-14T19:37:11+00:00","breadcrumb":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/#primaryimage","url":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram.jpg","contentUrl":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/04\/data-diagram.jpg","width":582,"height":720},{"@type":"Breadcrum
bList","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/andrew-strozewski\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/portfolios.cs.earlham.edu\/"},{"@type":"ListItem","position":2,"name":"Students","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/"},{"@type":"ListItem","position":3,"name":"2022","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/"},{"@type":"ListItem","position":4,"name":"CS488","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/cs488\/"},{"@type":"ListItem","position":5,"name":"Andrew Strozewski"}]},{"@type":"WebSite","@id":"https:\/\/portfolios.cs.earlham.edu\/#website","url":"https:\/\/portfolios.cs.earlham.edu\/","name":"CS\/DS Student Portfolios","description":"AI and ML, Image Classification, Arduino","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/portfolios.cs.earlham.edu\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"rttpg_featured_image_url":null,"rttpg_author":{"display_name":"Strozewski","author_link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/author\/ajstroz18\/"},"rttpg_comment":0,"rttpg_category":null,"rttpg_excerpt":"Introduction Hello, I am a senior Computer Science major graduating in May 2022. This page will host my capstone project. My primary interests are cybersecurity, machine learning, and system administration. My project is using machine learning to identify malicious URLs. 
&hellip; Read More","_links":{"self":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4561","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/users\/109"}],"replies":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/comments?post=4561"}],"version-history":[{"count":18,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4561\/revisions"}],"predecessor-version":[{"id":9854,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4561\/revisions\/9854"}],"up":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4565"}],"wp:attachment":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/media?parent=4561"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}