{"id":3972,"date":"2020-08-19T22:00:23","date_gmt":"2020-08-20T02:00:23","guid":{"rendered":"https:\/\/portfolios.cs.earlham.edu\/?p=3972"},"modified":"2020-08-20T22:10:50","modified_gmt":"2020-08-21T02:10:50","slug":"pitches","status":"publish","type":"post","link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/","title":{"rendered":"Pitches"},"content":{"rendered":"\n<p><\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Use deep reinforcement learning to tune the hyperparameters (learning rate, lambda \u2013 regularization parameter, number of layers, number of units in each layer, different activation functions) of a Neural Network. The overall cost function of RL agent will include the metrics such as accuracy of the NN (or F1 score) on training and validation sets, time taken to learn, the measures of over\/underfitting. This network would be trained on different types of problems.<\/li><li>For this idea, I\u2019m using the game of Pong (ATARI) as a test environment. My plan is to introduce a specific pipeline in training the AI agent to play the game. Instead of directly using the Policy Gradients, I will train the agent to guess the next frames in the game. First, I will use RNN to learn (approximate) the transition function in an unknown environment. The transition function, modeled by a Recurrent Neural Network, will take previous n states of the game(in raw pixel form) and agent\u2019s action, and output the state representation that corresponds to the future state of the environment. The intuition behind this is that the agent will first learn the \u2018laws of physics\u2019 of a certain environment (exploration) and this will help the agent learn how to play the game more efficiently. After learning the weights of the transition function, I will implement the Reinforcement Learning algorithm (Policy Gradients) that reuses the learned weights (transfer learning) and train this deep neural network by letting in play a number of games and learn from experience.<\/li><li>I will train a CNN to be able to verify, given the images of handwritten text, if two handwritings belong to the same person. In order to generate more labeled data, I will use a dataset with images of handwritten texts and break up each image into the windows containing a few words. I will assume that each word written on a single image belongs to one person.<\/li><\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Use deep reinforcement learning to tune the hyperparameters (learning rate, lambda \u2013 regularization parameter, number of layers, number of units in each layer, different activation functions) of a Neural Network. The overall cost function of RL agent will include the &hellip; <a href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/\">Read More<\/a><\/p>\n","protected":false},"author":94,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[101,126,19],"tags":[],"class_list":["post-3972","post","type-post","status-publish","format-standard","hentry","category-101","category-davit-kvartskhava","category-student"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Pitches - CS\/DS Student Portfolios<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Pitches - CS\/DS Student Portfolios\" \/>\n<meta property=\"og:description\" content=\"Use deep reinforcement learning to tune the hyperparameters (learning rate, lambda \u2013 regularization parameter, number of layers, number of units in each layer, different activation functions) of a Neural Network. The overall cost function of RL agent will include the &hellip; Read More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/\" \/>\n<meta property=\"og:site_name\" content=\"CS\/DS Student Portfolios\" \/>\n<meta property=\"article:published_time\" content=\"2020-08-20T02:00:23+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2020-08-21T02:10:50+00:00\" \/>\n<meta name=\"author\" content=\"Davit Kvartskhava\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Davit Kvartskhava\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/\"},\"author\":{\"name\":\"Davit Kvartskhava\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#\\\/schema\\\/person\\\/aa3cef0df71b3630feb0c5b3b30fc641\"},\"headline\":\"Pitches\",\"datePublished\":\"2020-08-20T02:00:23+00:00\",\"dateModified\":\"2020-08-21T02:10:50+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/\"},\"wordCount\":324,\"commentCount\":0,\"articleSection\":[\"2021\",\"Davit Kvartskhava\",\"Student\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/\",\"name\":\"Pitches - CS\\\/DS Student Portfolios\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\"},\"datePublished\":\"2020-08-20T02:00:23+00:00\",\"dateModified\":\"2020-08-21T02:10:50+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#\\\/schema\\\/person\\\/aa3cef0df71b3630feb0c5b3b30fc641\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/2020\\\/08\\\/19\\\/pitches\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Pitches\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\",\"name\":\"CS\\\/DS Student Portfolios\",\"description\":\"AI and ML, Image Classification, Arduino\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#\\\/schema\\\/person\\\/aa3cef0df71b3630feb0c5b3b30fc641\",\"name\":\"Davit Kvartskhava\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/241c8a5c3abead1328f06c7b7fc3c16cec883a822c6a566f75ef981d1d82b044?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/241c8a5c3abead1328f06c7b7fc3c16cec883a822c6a566f75ef981d1d82b044?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/241c8a5c3abead1328f06c7b7fc3c16cec883a822c6a566f75ef981d1d82b044?s=96&d=mm&r=g\",\"caption\":\"Davit Kvartskhava\"},\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/author\\\/dkvart17\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Pitches - CS\/DS Student Portfolios","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/","og_locale":"en_US","og_type":"article","og_title":"Pitches - CS\/DS Student Portfolios","og_description":"Use deep reinforcement learning to tune the hyperparameters (learning rate, lambda \u2013 regularization parameter, number of layers, number of units in each layer, different activation functions) of a Neural Network. The overall cost function of RL agent will include the &hellip; Read More","og_url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/","og_site_name":"CS\/DS Student Portfolios","article_published_time":"2020-08-20T02:00:23+00:00","article_modified_time":"2020-08-21T02:10:50+00:00","author":"Davit Kvartskhava","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Davit Kvartskhava","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/#article","isPartOf":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/"},"author":{"name":"Davit Kvartskhava","@id":"https:\/\/portfolios.cs.earlham.edu\/#\/schema\/person\/aa3cef0df71b3630feb0c5b3b30fc641"},"headline":"Pitches","datePublished":"2020-08-20T02:00:23+00:00","dateModified":"2020-08-21T02:10:50+00:00","mainEntityOfPage":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/"},"wordCount":324,"commentCount":0,"articleSection":["2021","Davit Kvartskhava","Student"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/","url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/","name":"Pitches - CS\/DS Student Portfolios","isPartOf":{"@id":"https:\/\/portfolios.cs.earlham.edu\/#website"},"datePublished":"2020-08-20T02:00:23+00:00","dateModified":"2020-08-21T02:10:50+00:00","author":{"@id":"https:\/\/portfolios.cs.earlham.edu\/#\/schema\/person\/aa3cef0df71b3630feb0c5b3b30fc641"},"breadcrumb":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/2020\/08\/19\/pitches\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/portfolios.cs.earlham.edu\/"},{"@type":"ListItem","position":2,"name":"Pitches"}]},{"@type":"WebSite","@id":"https:\/\/portfolios.cs.earlham.edu\/#website","url":"https:\/\/portfolios.cs.earlham.edu\/","name":"CS\/DS Student Portfolios","description":"AI and ML, Image Classification, Arduino","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/portfolios.cs.earlham.edu\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/portfolios.cs.earlham.edu\/#\/schema\/person\/aa3cef0df71b3630feb0c5b3b30fc641","name":"Davit Kvartskhava","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/241c8a5c3abead1328f06c7b7fc3c16cec883a822c6a566f75ef981d1d82b044?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/241c8a5c3abead1328f06c7b7fc3c16cec883a822c6a566f75ef981d1d82b044?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/241c8a5c3abead1328f06c7b7fc3c16cec883a822c6a566f75ef981d1d82b044?s=96&d=mm&r=g","caption":"Davit Kvartskhava"},"url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/author\/dkvart17\/"}]}},"rttpg_featured_image_url":null,"rttpg_author":{"display_name":"Davit Kvartskhava","author_link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/author\/dkvart17\/"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/category\/student\/2021\/\" rel=\"category tag\">2021<\/a> <a href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/category\/student\/2021\/davit-kvartskhava\/\" rel=\"category tag\">Davit Kvartskhava<\/a> <a href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/category\/student\/\" rel=\"category tag\">Student<\/a>","rttpg_excerpt":"Use deep reinforcement learning to tune the hyperparameters (learning rate, lambda \u2013 regularization parameter, number of layers, number of units in each layer, different activation functions) of a Neural Network. The overall cost function of RL agent will include the &hellip; Read More","_links":{"self":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/posts\/3972","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/users\/94"}],"replies":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/comments?post=3972"}],"version-history":[{"count":1,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/posts\/3972\/revisions"}],"predecessor-version":[{"id":3973,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/posts\/3972\/revisions\/3973"}],"wp:attachment":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/media?parent=3972"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/categories?post=3972"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/tags?post=3972"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}