{"id":10236,"date":"2025-09-08T13:40:30","date_gmt":"2025-09-08T17:40:30","guid":{"rendered":"https:\/\/portfolios.cs.earlham.edu\/?page_id=10236"},"modified":"2025-12-14T12:11:35","modified_gmt":"2025-12-14T17:11:35","slug":"helena-jose","status":"publish","type":"page","link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/","title":{"rendered":"Helena Aleluya Jose &#8211; Quantifying Themes in Theatre Using Topic Modeling (LDA)"},"content":{"rendered":"\n<h6 class=\"has-text-align-center wp-block-heading\"><\/h6>\n\n\n\n<h2 class=\"has-text-align-center wp-block-heading\">Introduction<\/h2>\n\n\n\n<p>My name is Helena Aleluya Jos\u00e9, and I\u2019m an international student from Angola. I\u2019m also a Computer Science and Theater double major, and an African and African-American Studies and Creative Writing double minor. As both a computer scientist and a playwright, I wanted to create a project that reflects the intersection of my passions. For my senior capstone, I developed a framework that applies topic modeling techniques (LDA) to a corpus of dramatic texts in order to identify and compare thematic patterns across plays.<\/p>\n\n\n\n<h3 class=\"has-text-align-center wp-block-heading\">Abstract<\/h3>\n\n\n\n<p>This capstone project aims to develop a novel approach for analyzing and comparing playwrights\u2019 unique voices and styles by applying topic modeling techniques, specifically Latent Dirichlet Allocation (LDA), on a corpus of theater texts. By leveraging advanced natural language processing (NLP) methods, the project will preprocess and prepare a diverse collection of plays for topic modeling, implement custom algorithms tailored for dramatic literature, and analyze the discovered topics to identify recurring themes, character archetypes, and narrative structures prevalent across different playwrights\u2019 works. Interactive visualization tools will be developed to facilitate the exploration and interpretation of these insights, enabling literary scholars and critics to understand the creative processes better and the influences that shape dramatists\u2019 voices.<\/p>\n\n\n\n<p><strong>Keywords<\/strong>&nbsp;<\/p>\n\n\n\n<p>Topic modeling techniques, textual analysis, LDA model, Digital Humanities<\/p>\n\n\n\n<h3 class=\"has-text-align-center wp-block-heading\"><strong>Data Architecture Diagram<\/strong><\/h3>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1-1024x420.png\" alt=\"\" class=\"wp-image-10973\" width=\"758\" height=\"310\" srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1-1024x420.png 1024w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1-300x123.png 300w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1-768x315.png 768w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1-1536x631.png 1536w\" sizes=\"auto, (max-width: 758px) 100vw, 758px\" \/><\/figure><\/div>\n\n\n\n<p><\/p>\n\n\n\n<h3 class=\"has-text-align-center wp-block-heading\"><strong>Poster<\/strong><\/h3>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Poster-v1-HAJ-1024x768.jpg\" alt=\"\" class=\"wp-image-10974\" width=\"712\" height=\"534\" srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Poster-v1-HAJ-1024x768.jpg 1024w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Poster-v1-HAJ-300x225.jpg 300w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Poster-v1-HAJ-768x576.jpg 768w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Poster-v1-HAJ-1536x1152.jpg 1536w\" sizes=\"auto, (max-width: 712px) 100vw, 712px\" \/><\/figure><\/div>\n\n\n\n<p><\/p>\n\n\n\n<h3 class=\"has-text-align-center wp-block-heading\">Paper<\/h3>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Capturartttt.jpg\" alt=\"\" class=\"wp-image-10980\" width=\"741\" height=\"845\" srcset=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Capturartttt.jpg 709w, https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Capturartttt-263x300.jpg 263w\" sizes=\"auto, (max-width: 741px) 100vw, 741px\" \/><\/figure><\/div>\n\n\n\n<div class=\"wp-block-file\"><a href=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Technical-Report-Final-HAJ.pdf\">Technical-Report-Final-HAJ<\/a><a href=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Technical-Report-Final-HAJ.pdf\" class=\"wp-block-file__button\" download>Download<\/a><\/div>\n\n\n\n<h3 class=\"has-text-align-center wp-block-heading\"><strong>Youtube Video<\/strong><\/h3>\n\n\n\n<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"Senior Capstone Project- Quantifying Themes Using Topic Modeling\" width=\"940\" height=\"529\" src=\"https:\/\/www.youtube.com\/embed\/5eDCKhzUIRA?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h3 class=\"has-text-align-center wp-block-heading\">GitHub Link<\/h3>\n\n\n\n<ul class=\"wp-block-social-links is-layout-flex wp-block-social-links-is-layout-flex\">\n\n\n\n\n\n\n\n\n\n<li class=\"wp-social-link wp-social-link-github  wp-block-social-link\"><a href=\"https:\/\/code.cs.earlham.edu\/Helena\/cs-senior-capstone-2025\/\" class=\"wp-block-social-link-anchor\"><svg width=\"24\" height=\"24\" viewBox=\"0 0 24 24\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\"><path d=\"M12,2C6.477,2,2,6.477,2,12c0,4.419,2.865,8.166,6.839,9.489c0.5,0.09,0.682-0.218,0.682-0.484 c0-0.236-0.009-0.866-0.014-1.699c-2.782,0.602-3.369-1.34-3.369-1.34c-0.455-1.157-1.11-1.465-1.11-1.465 c-0.909-0.62,0.069-0.608,0.069-0.608c1.004,0.071,1.532,1.03,1.532,1.03c0.891,1.529,2.341,1.089,2.91,0.833 c0.091-0.647,0.349-1.086,0.635-1.337c-2.22-0.251-4.555-1.111-4.555-4.943c0-1.091,0.39-1.984,1.03-2.682 C6.546,8.54,6.202,7.524,6.746,6.148c0,0,0.84-0.269,2.75,1.025C10.295,6.95,11.15,6.84,12,6.836 c0.85,0.004,1.705,0.114,2.504,0.336c1.909-1.294,2.748-1.025,2.748-1.025c0.546,1.376,0.202,2.394,0.1,2.646 c0.64,0.699,1.026,1.591,1.026,2.682c0,3.841-2.337,4.687-4.565,4.935c0.359,0.307,0.679,0.917,0.679,1.852 c0,1.335-0.012,2.415-0.012,2.741c0,0.269,0.18,0.579,0.688,0.481C19.138,20.161,22,16.416,22,12C22,6.477,17.523,2,12,2z\"><\/path><\/svg><span class=\"wp-block-social-link-label screen-reader-text\">GitHub<\/span><\/a><\/li><\/ul>\n\n\n\n<p><a href=\"https:\/\/code.cs.earlham.edu\/Helena\/cs-senior-capstone-2025\/\">https:\/\/code.cs.earlham.edu\/Helena\/cs-senior-capstone-2025\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction My name is Helena Aleluya Jos\u00e9, and I\u2019m an international student from Angola. I\u2019m also a Computer Science and Theater double major, and an African and African-American Studies and Creative Writing double minor. As both a computer scientist and &hellip; <a href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/\">Read More<\/a><\/p>\n","protected":false},"author":167,"featured_media":0,"parent":8534,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-10236","page","type-page","status-publish","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Helena Aleluya Jose - Quantifying Themes in Theatre Using Topic Modeling (LDA) - CS\/DS Student Portfolios<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Helena Aleluya Jose - Quantifying Themes in Theatre Using Topic Modeling (LDA) - CS\/DS Student Portfolios\" \/>\n<meta property=\"og:description\" content=\"Introduction My name is Helena Aleluya Jos\u00e9, and I\u2019m an international student from Angola. I\u2019m also a Computer Science and Theater double major, and an African and African-American Studies and Creative Writing double minor. As both a computer scientist and &hellip; Read More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/\" \/>\n<meta property=\"og:site_name\" content=\"CS\/DS Student Portfolios\" \/>\n<meta property=\"article:modified_time\" content=\"2025-12-14T17:11:35+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"3225\" \/>\n\t<meta property=\"og:image:height\" content=\"1324\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/\",\"name\":\"Helena Aleluya Jose - Quantifying Themes in Theatre Using Topic Modeling (LDA) - CS\\\/DS Student Portfolios\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2025\\\/12\\\/Blank-diagram-1-1024x420.png\",\"datePublished\":\"2025-09-08T17:40:30+00:00\",\"dateModified\":\"2025-12-14T17:11:35+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/#primaryimage\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2025\\\/12\\\/Blank-diagram-1.png\",\"contentUrl\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/wp-content\\\/uploads\\\/2025\\\/12\\\/Blank-diagram-1.png\",\"width\":3225,\"height\":1324},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/helena-jose\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Students\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"2024\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"CS488\",\"item\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/index.php\\\/students\\\/2024-2\\\/cs488\\\/\"},{\"@type\":\"ListItem\",\"position\":5,\"name\":\"Helena Aleluya Jose &#8211; Quantifying Themes in Theatre Using Topic Modeling (LDA)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/#website\",\"url\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/\",\"name\":\"CS\\\/DS Student Portfolios\",\"description\":\"AI and ML, Image Classification, Arduino\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/portfolios.cs.earlham.edu\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Helena Aleluya Jose - Quantifying Themes in Theatre Using Topic Modeling (LDA) - CS\/DS Student Portfolios","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/","og_locale":"en_US","og_type":"article","og_title":"Helena Aleluya Jose - Quantifying Themes in Theatre Using Topic Modeling (LDA) - CS\/DS Student Portfolios","og_description":"Introduction My name is Helena Aleluya Jos\u00e9, and I\u2019m an international student from Angola. I\u2019m also a Computer Science and Theater double major, and an African and African-American Studies and Creative Writing double minor. As both a computer scientist and &hellip; Read More","og_url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/","og_site_name":"CS\/DS Student Portfolios","article_modified_time":"2025-12-14T17:11:35+00:00","og_image":[{"width":3225,"height":1324,"url":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1.png","type":"image\/png"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/","url":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/","name":"Helena Aleluya Jose - Quantifying Themes in Theatre Using Topic Modeling (LDA) - CS\/DS Student Portfolios","isPartOf":{"@id":"https:\/\/portfolios.cs.earlham.edu\/#website"},"primaryImageOfPage":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/#primaryimage"},"image":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/#primaryimage"},"thumbnailUrl":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1-1024x420.png","datePublished":"2025-09-08T17:40:30+00:00","dateModified":"2025-12-14T17:11:35+00:00","breadcrumb":{"@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/#primaryimage","url":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1.png","contentUrl":"https:\/\/portfolios.cs.earlham.edu\/wp-content\/uploads\/2025\/12\/Blank-diagram-1.png","width":3225,"height":1324},{"@type":"BreadcrumbList","@id":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/helena-jose\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/portfolios.cs.earlham.edu\/"},{"@type":"ListItem","position":2,"name":"Students","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/"},{"@type":"ListItem","position":3,"name":"2024","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/"},{"@type":"ListItem","position":4,"name":"CS488","item":"https:\/\/portfolios.cs.earlham.edu\/index.php\/students\/2024-2\/cs488\/"},{"@type":"ListItem","position":5,"name":"Helena Aleluya Jose &#8211; Quantifying Themes in Theatre Using Topic Modeling (LDA)"}]},{"@type":"WebSite","@id":"https:\/\/portfolios.cs.earlham.edu\/#website","url":"https:\/\/portfolios.cs.earlham.edu\/","name":"CS\/DS Student Portfolios","description":"AI and ML, Image Classification, Arduino","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/portfolios.cs.earlham.edu\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"rttpg_featured_image_url":null,"rttpg_author":{"display_name":"Helena Jose","author_link":"https:\/\/portfolios.cs.earlham.edu\/index.php\/author\/hcjose22\/"},"rttpg_comment":0,"rttpg_category":null,"rttpg_excerpt":"Introduction My name is Helena Aleluya Jos\u00e9, and I\u2019m an international student from Angola. I\u2019m also a Computer Science and Theater double major, and an African and African-American Studies and Creative Writing double minor. As both a computer scientist and &hellip; Read More","_links":{"self":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/10236","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/users\/167"}],"replies":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/comments?post=10236"}],"version-history":[{"count":16,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/10236\/revisions"}],"predecessor-version":[{"id":11037,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/10236\/revisions\/11037"}],"up":[{"embeddable":true,"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/pages\/8534"}],"wp:attachment":[{"href":"https:\/\/portfolios.cs.earlham.edu\/index.php\/wp-json\/wp\/v2\/media?parent=10236"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}