{"id":4575,"date":"2022-02-02T11:39:40","date_gmt":"2022-02-02T16:39:40","guid":{"rendered":"https:\/\/portfolios.cs.earlham.edu\/?page_id=4575"},"modified":"2025-04-15T13:50:37","modified_gmt":"2025-04-15T17:50:37","slug":"khoa-nguyen","status":"publish","type":"page","link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/","title":{"rendered":"Khoa Nguyen"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">About me<\/h2>\n\n\n\n<p>My name is Khoa, and I am a senior, double-major in Quantitative Economics and Data Science. My senior capstone project is about computational drug development, focusing on the application of deep learning in predicting the solubility of drug compounds from the public ZINC online database.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">My project<\/h2>\n\n\n\n<h3 class=\"wp-block-heading\">Abstract<\/h3>\n\n\n\n<p>Drug discovery and development is a costly and time-consuming process, taking up to billions of dollars and 12-15 years from basic research to FDA approval. Early stage discovery involves intensive search through an enormous database of molecules and analysis of their quantitative structure-activity relationships to determine their physicochemical properties. Important features like absorption, distribution, metabolism, and excretion (ADME) are extracted to measure how these compounds interact with the human bodies. At its root, this is an optimization problem in which researchers try to identify the \u201cbest\u201d compounds with desired properties to be qualified for clinical development to produce a safe and cost-effective drug. Nowadays, with stronger computation power, the process can be sped up significantly with artificial intelligence. Many deep learning models have demonstrated highly accurate predictions on the ADME properties of drug-like small molecules. In particular, graph neural networks (GNN) are shown to learn effectively graph-based molecular representation. This paper examines the feasibility of several state-of-the-art graph neural networks on predicting the solubility of commercially available compounds in the ZINC database. The experiment indicated that each model&#8217;s performance was significantly improved through training. The results suggested promising applications of deep learning in reducing the time and cost of the drug development process in the foreseeable future.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Data diagram<\/h3>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"620\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram-1024x620.png\" alt=\"\" class=\"wp-image-4890\" srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram-1024x620.png 1024w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram-300x182.png 300w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram-768x465.png 768w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram-1536x930.png 1536w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><a href=\"https:\/\/code.cs.earlham.edu\/kanguyen18\/ds-senior-capstone\">Gitlab project<\/a><\/h3>\n\n\n\n<h3 class=\"wp-block-heading\"><a href=\"https:\/\/drive.google.com\/file\/d\/1JtlH_As6Do4D5LM-J29pgpAKtFlyscxf\/view?usp=sharing\">Paper<\/a><\/h3>\n\n\n\n<h3 class=\"wp-block-heading\">Software demonstration video<\/h3>\n\n\n\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-4-3 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"Final SDV - DS Capstone - Khoa Nguyen\" width=\"940\" height=\"705\" src=\"https:\/\/www.youtube.com\/embed\/tG2B7mEo-zA?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\">Poster<\/h3>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"878\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Final_Poster-DS_Capstone-Khoa_Nguyen-1-1024x878.png\" alt=\"\" class=\"wp-image-4894\" srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Final_Poster-DS_Capstone-Khoa_Nguyen-1-1024x878.png 1024w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Final_Poster-DS_Capstone-Khoa_Nguyen-1-300x257.png 300w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/Final_Poster-DS_Capstone-Khoa_Nguyen-1-768x658.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>About me My name is Khoa, and I am a senior, double-major in Quantitative Economics and Data Science. My senior capstone project is about computational drug development, focusing on the application of deep learning in predicting the solubility of drug &hellip; <a href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/\">Read More<\/a><\/p>\n","protected":false},"author":107,"featured_media":0,"parent":4563,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-4575","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Khoa Nguyen - CS\/DS Student Portfolios<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Khoa Nguyen - CS\/DS Student Portfolios\" \/>\n<meta property=\"og:description\" content=\"About me My name is Khoa, and I am a senior, double-major in Quantitative Economics and Data Science. My senior capstone project is about computational drug development, focusing on the application of deep learning in predicting the solubility of drug &hellip; Read More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/\" \/>\n<meta property=\"og:site_name\" content=\"CS\/DS Student Portfolios\" \/>\n<meta property=\"article:modified_time\" content=\"2025-04-15T17:50:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram.png\" \/>\n\t<meta property=\"og:image:width\" content=\"4849\" \/>\n\t<meta property=\"og:image:height\" content=\"2937\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/\",\"name\":\"Khoa Nguyen - CS\\\/DS Student Portfolios\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2022\\\/05\\\/complex_data_diagram-1024x620.png\",\"datePublished\":\"2022-02-02T16:39:40+00:00\",\"dateModified\":\"2025-04-15T17:50:37+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/#primaryimage\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2022\\\/05\\\/complex_data_diagram.png\",\"contentUrl\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2022\\\/05\\\/complex_data_diagram.png\",\"width\":4849,\"height\":2937},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/khoa-nguyen\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Students\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"2022\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"DS488\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2022-2\\\/ds488\\\/\"},{\"@type\":\"ListItem\",\"position\":5,\"name\":\"Khoa Nguyen\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\",\"name\":\"CS\\\/DS Student Portfolios\",\"description\":\"AI and ML, Image Classification, Arduino\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Khoa Nguyen - CS\/DS Student Portfolios","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/","og_locale":"en_US","og_type":"article","og_title":"Khoa Nguyen - CS\/DS Student Portfolios","og_description":"About me My name is Khoa, and I am a senior, double-major in Quantitative Economics and Data Science. My senior capstone project is about computational drug development, focusing on the application of deep learning in predicting the solubility of drug &hellip; Read More","og_url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/","og_site_name":"CS\/DS Student Portfolios","article_modified_time":"2025-04-15T17:50:37+00:00","og_image":[{"width":4849,"height":2937,"url":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram.png","type":"image\/png"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/","url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/","name":"Khoa Nguyen - CS\/DS Student Portfolios","isPartOf":{"@id":"https:\/\/portfolios.cs.earlham.edu\/#website"},"primaryImageOfPage":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/#primaryimage"},"image":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/#primaryimage"},"thumbnailUrl":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram-1024x620.png","datePublished":"2022-02-02T16:39:40+00:00","dateModified":"2025-04-15T17:50:37+00:00","breadcrumb":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/#primaryimage","url":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram.png","contentUrl":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2022\/05\/complex_data_diagram.png","width":4849,"height":2937},{"@type":"BreadcrumbList","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/khoa-nguyen\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/portfolios.cs.earlham.edu\/"},{"@type":"ListItem","position":2,"name":"Students","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/"},{"@type":"ListItem","position":3,"name":"2022","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/"},{"@type":"ListItem","position":4,"name":"DS488","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2022-2\/ds488\/"},{"@type":"ListItem","position":5,"name":"Khoa Nguyen"}]},{"@type":"WebSite","@id":"https:\/\/portfolios.cs.earlham.edu\/#website","url":"https:\/\/portfolios.cs.earlham.edu\/","name":"CS\/DS Student Portfolios","description":"AI and ML, Image Classification, Arduino","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/portfolios.cs.earlham.edu\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"rttpg_featured_image_url":null,"rttpg_author":{"display_name":"Khoa Nguyen","author_link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/author\/kanguyen18\/"},"rttpg_comment":0,"rttpg_category":null,"rttpg_excerpt":"About me My name is Khoa, and I am a senior, double-major in Quantitative Economics and Data Science. My senior capstone project is about computational drug development, focusing on the application of deep learning in predicting the solubility of drug &hellip; Read More","_links":{"self":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4575","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/users\/107"}],"replies":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/comments?post=4575"}],"version-history":[{"count":14,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4575\/revisions"}],"predecessor-version":[{"id":9888,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4575\/revisions\/9888"}],"up":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/4563"}],"wp:attachment":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/media?parent=4575"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}