I have been taking notes and learning about different tree indexing structures. I currently have 6 pages of notes and understand how R-trees work and how X-trees use techniques from both R-trees and B-trees. I need … Read More
After filtering through many different project ideas, I finally decided to create an aesthetically pleasing data visualization of pollution on Earlham’s campus. I have written a literature review of 15 sources on data visualizations, artistic factors that influence … Read More
This week I finished writing the literature review and looking in depth into the literature available for my two topics: RFID in libraries and sentiment analysis for stock prediction. I believe I will proceed with RFID technology and research more … Read More
Finished the machine learning pipeline with actual experimentation. Having issues getting Flask set up and have been in touch with the SysAdmins. Halfway through the first draft of the paper. Made a new design for the system architecture.
Finished literature review. Selected 1 topic from the remaining two for Quiz 3 (proposal)
I am still in the process of writing my literature reviews. I have completed one and am currently outlining the second. I am learning a lot of new information from my new sources. Since I know which of the … Read More
This week I filled out the quiz and started looking into what resources I’ll need for my project, image stitching. At the least I’ll need OpenCV, but probably ImageMagick as well. Starting to think about how I should organize my … Read More
For the past week, I have been looking at some available corpora found online. I was able to get one for free, namely the Open American National Corpus. It seems like it might be a better fit than the Blogs … Read More
Last week I was able to set up a two-node blockchain on the Earlham Cluster, after working through technical issues with the help of Craig. However, I discovered several problems with this structure this week. I couldn’t re-access an already … Read More
This week I dedicated my time towards designing my proposal idea. After my literature review I decided I wanted to continue my audiovisual project. I read through the documentation for the Jitter programming language to see what would be feasible … Read More
This week I’ve focused primarily on drafting my final paper. So far I have a general outline of my paper (abstract –> introduction –> related work –> framework and design –> implementation –> conclusions and future work), and I’ve fully … Read More
During the past week I have: – Cleaned up the data, converted it to a CSV file, and tried feeding it to the MLP classifier. However, I’m running into an error when scaling it before feeding the data to the MLP. I … Read More
This week I’m focusing on my project, which is to build a music recommendation method using collaborative filtering. To do that I am currently reading papers about collaborative filtering in more detail and comparing different techniques for collaborative … Read More
After reading more literature on stock prediction with sentiment analysis, I came to the conclusion that it is very difficult to obtain text data that is relevant for my research for free. I looked at the Twitter streaming API and … Read More
I ran an experiment with scikit-learn. The program’s run time was more than 2 hours. Testing multiple datasets has been an issue lately.
Progress on the draft of the paper. The related work section is almost complete.
This week I am writing my literature review for Indexing Time Series Databases. It is going well. Because this is an area that has been researched for over two decades, there is a lot of background information I will be … Read More
This week I made good progress in my research. Here are some of the things I have done: -Met with Forrest Tobey to discuss his past research involving motion tracking virtual instruments as well as visual capabilities of the Max/MSP/Jitter … Read More
For the past week, I worked on literature reviews on two topics: stock prediction with natural language processing and stock prediction with machine learning. One of the most important findings is that each researcher used different datasets to predict different … Read More
Making heavy headway into the literature review, but still adjusting to Overleaf. My final topics are predicting matches in the Premier League and building software that combines several CV methods in a robust wrapper for easy use. Found eight sources … Read More
Started 2nd pass for the papers. Continued work on literature review.
Last week I was having a problem with a deprecation warning. I looked into that and I thought I fixed it. It seemed to be working fine, but I got the same error at a different point. So I am just … Read More
This week I started playing around with creating the visualization tool piece of my project. In doing that, I learned a lot of the more finicky details of NVDA. It uses an internal installation of Python, which makes sense for … Read More
During the past week I have: – Gotten close to getting the data to work. Right now the data is divided into leaves and nodes. I’ve been able to get the node-level labels but not the leaf-level labels. Once that … Read More
For this week I am working on the literature review for the two projects that I’m most interested in working on. I am also reading more papers related to my topics.
Worked on the first draft of the paper. Focusing on the related works and findings currently.
For the past week, I did the second reading for the 15 papers for the annotated bibliography and also prepared the presentation for the class. I narrowed down my topics into two: stock prediction with sentiment analysis and stock prediction … Read More
After talking to Ajit about the problems I have been running into with my virtual machines I’ve made the decision to switch to using the Earlham servers for the purpose of running my experiments and collecting data. There’s only one … Read More
Finished bibliography. Selected two topics and prepared topics presentation. Started literature review.
In the past week, I have been working on the related work for my three project ideas. I wrote an annotated bibliography for each topic with five different sources. This week I got rid of one of the ideas and decided to … Read More
Prepared a presentation for class on my final two topics, image stitching and predicting Premier League games. Started searching for additional topics so I can meet the requirement of 8 for the imminent lit review.
I found some quirks and issues with the way NVDA works in different settings. I can now get a transcript with the appropriate details by using object review mode in Firefox, but not by navigating the objects normally or while … Read More
During the past week, I was able to query the data from the IBC and looked more into it. It has come to my attention that the IBC was labeled both on the word level (leaf) and the sentence level … Read More
Continuing from last week, I was able to change the implementation of the Scikit-Learn wrapper for TSVM from Python2 to Python3. I am currently getting a new deprecation warning caused by TSVM. It seems like the code for TSVM assumes I … Read More
Built a rough machine learning pipeline for testing. Worked with Ajit to update the timeline. Started on the first draft of the paper.
I am certain at this point that I will be going in a direction where I will be doing research on a database. My first thought was to explore how to store tree data structures in PostgreSQL, which led me … Read More
For my project, optimizing an AI agent for Dungeon Crawl Stone Soup, I’ve recently changed the scope of my project somewhat. Instead of trying to improve the AI agent’s chance of winning a game of DCSS, I’m now working to … Read More
Nothing much to report; I worked on my annotated bibliography and am narrowing down my interest to just two of my three ideas. Currently leaning towards 2D stitching, potentially expanding it to include some 3D point cloud work as well, but … Read More
I’ve laid down the bare bones for my thesis paper. With the main project, however, I keep running into problem after problem with my virtual machines and blockchain code. I plan … Read More
Continued working on the annotated bibliography: found more papers on the topics and did the first-pass reading for all of them.
There is no big update on my work for the past week. I will be working on the annotated bibliography due this Friday.
This week I have been reading the research papers that are relevant to my topic and taking notes on them. I still need to work on narrowing down the scope of my topics as a couple of them may … Read More
During the past week I have: – Started writing the introduction of my thesis paper – Started implementing the first module of the project (word matrices initialization) – Looked into the IBC corpus data, whose sentences are implemented as a … Read More
This week I’ve gotten minorly stuck. I know that NVDA is able to grab — and speak — certain information about the data visualizations I’ve created (it might say, for example, “graphic New York” when I navigate to a point representing … Read More
During the past week, I ran into a few problems. I kept getting a PendingDeprecationWarning coming from one of the methods used from scikit-learn. It seems that something has changed or is currently changing in NumPy that affects scikit-learn modules. … Read More
Created a smaller dataset using pySpark for training and testing the fake news model.
I’ve been researching Voronoi graph regions and their usage in recommendation systems. I’m particularly interested in finding an open-source implementation that uses Voronoi graph regions. I’ve also been reading and researching FiveThirtyEight’s gerrymandering maps and how they created their maps of … Read More
Read the papers and summarized them. I am leaning more towards the gesture control system idea, but at the moment it seems like I keep finding more and more challenges to overcome with this project. Typing with gestures may not … Read More
For my senior project, I’m working on optimizing the performance of qw, an AI Agent developed to play (and sometimes win) the game of Dungeon Crawl Stone Soup. To reduce the search space and make optimization with machine learning faster, … Read More
My project ideas have been evolving since starting to research articles. My main fields of research have been in audiovisual synthesis, motion control for virtual instruments, and algorithmic music composition. I have found lots of articles including a good one … Read More
Met with Andy Moore and talked about projects regarding natural disasters. Realized that many of the ideas were too big for a semester, and started researching Earthquake Early Warning systems. Worked on Quiz 2, and collected papers for each … Read More
Realized two of my ideas don’t really work at all, so I need to search for new ones. Learned that I really need to work on something I’m interested in because if I’m not it’s going to be hell … Read More
I have found working software for my virtual machines and the blockchain program to run on them; it was harder to find than I thought it would be. I have my timeline finished and solidified thanks to my talk with Ajit, … Read More
I spent the past week reading papers related to my three ideas. It helped me gain a better understanding of the scope and challenges associated with each potential project. I also met with Ajit and discussed the projects. I believe … Read More
This week I finished making an additional data visualization: a Sankey diagram. I looked into NVDA’s built-in logging system and decided I didn’t want to use it to create my logs because it includes — even at the bare … Read More
This past week: Looked into the SKL issue; I now understand what’s happening. Had to change a few things – LabelPropagation only accepts NumPy arrays. When making the pipeline as a structure, I need to do it differently than what … Read More
Over the past week, I was able to make the following progress with my project: – Came up with a detailed design of my project, which lists all the modular components and their functions. These components might change as the … Read More
After skimming through 15 papers over three different topics, I am still most interested in the topic of “generate sentiment-based stock trading signals through NLP.” Since last week, I started taking a Coursera course called Applied Text Mining in Python. … Read More
Completed corpus creation. Filtered the dataset and collected data from Facebook. Automated the process using PySpark. Reduced the 40 GB file to 9 GB, which still needed to be cleaned for machine learning processing.
Worked on setting up scikit-learn and a testing environment. Got Craig to give me access to Pollock and Bronte.
Read the papers about how to read a paper. Met with Michael Lerner regarding one of the strategies talked about in his research last year. Found 5-7 papers related to each of the three topic areas. Attempted the Quiz 1 … Read More
Met with my advisor twice, worked on an updated timeline. Worked out a design framework and prepared the presentation slides.
This week, I decided to change my first project (soccer coaching tool) and do something different. I hadn’t really thought to utilize the knowledge I’ve gained from classes, and I want to do some type of AI or ML project. … Read More
This week I revised my proposal, framework and timeline. These revisions are summed up in my presentation for this week (slides can be found here).
- I met with Kendall to get my ACM membership. - I continued refining my ideas. - I talked with music professor Forrest Tobey about my ideas and projects that the music department is interested in.
For the past week, I did the following: – Found research papers on my topics – Skimmed through several papers and gained a better understanding of the topics – After skimming through papers on my possible three topics, I am … Read More
For this week, I plan to: finalize the project timeline; look into how to plug the sample data into the classifier and link this functionality to the Chrome extension; take the Udacity crash course on supervised learning; set up PyTorch … Read More
In the last week I have: reached out to Ajit, who is now my project adviser; contacted the IBC author to request the dataset; started reading about PyTorch and how to set it up; started looking into online crash courses about CNN … Read More
This past week I have run into difficulties: the Blogs corpus compiler isn’t working. Looking into making a small implementation of semi-supervised learning; this seems to be working. Trying to do the same with the SVM one from GitHub. We … Read More
This past week: Got blogs extraction going. Looked at algorithms. Have found ones that seem to work in a similar way to scikit-learn modules. Generative adversarial networks? Label propagation from scikit-learn. Found the representations to use. This coming week: Get … Read More
Retrieved the FakeNewsCorpus dataset from Kaggle. The file size is 40 GB and I am thinking about selecting a subset of it to create a smaller dataset. scikit-learn cannot load the data on my computer and I need to use … Read More
Improved Earlham Directory: I want to work on a system that manages courses, students and registration similar to Earlham directory but with better management and user interface. The project will entail a large scale design and breakdown of the problem into … Read More
I talked with Dave about my senior project ideas as I had a concern that former students have done similar research previously. The following are some of the takeaways from the discussion: Finding a good niche within a field is … Read More
Python Module for Image Processing • Is your proposed topic clearly a research activity? Is it consistent with the aims and purposes of research? Yes. I plan to create something new and make some small breakthrough. • How is your … Read More
Met with Ajit to filter ideas regarding parallel computing, and machine learning. Emailed Andy Moore in Geology to talk about Earthquake and Tsunami predictions. Emailed Charlie for suggestions regarding my Structure From Motion idea. Searched for more specific details on work … Read More
Have set up and arranged weekly meeting times with Ajit as my adviser. We’ll be meeting weekly at 4:00pm on Wednesday. I have better evaluated what pitfalls and goals I will need to be aware of for my research, and … Read More
Deploying Software Defined Networking (SDN) for cloud endpoints Topic Description: Using the cloud infrastructure, a Software Defined Networking (SDN) can be deployed for cloud endpoints. SDNs are used to connect data centers to public cloud providers to create a hybrid … Read More
Idea 1: Soccer Coaching Tool Did research to find similar software for coaching. Idea 2: Hand/Finger Gesture Sensor for Browser Control and Navigation Read research papers regarding similar ideas about monitoring hands and fingers. Found devices that provide the sensors … Read More
In this past week, I’ve found and simplified 2 types of data visualizations that I think would be useful for testing purposes: a slopegraph and a bubble graph. I’ve run some preliminary tests with NVDA on them to see how … Read More
Started the project pipeline for Fake News Detection.
For next week, I plan to: set up the environment on my computer for scikit-learn (or potentially PyTorch); collect data (the IBC) by emailing the authors; read through documentation and familiarize myself with scikit-learn and supervised learning. Reach … Read More
Looked for three general areas that I want to do my research in, namely: Structure from Motion, disaster prep and management, and parallel computing. Searched for some related work that has been done in these areas.
Ideas: Pillow AI: I am thinking of having an Arduino device built into the pillow, which can be charged and has heart-rate sensors to record heart rate while the person is sleeping. With this data, I could determine sleeping patterns and find the Light Sleep … Read More
Computer Echolocation For my project, I would like to do something where I can use both hardware and software. An idea I am interested in that contains the intersection between hardware and software would be creating a robot that uses … Read More
Idea 1: A web application that lets users test the performance of systematic trading strategies with user-selected parameters. The purpose of this application is to let users try out different inputs and test how their trading strategy would have … Read More
1) A piece of software that takes aspects of a soccer team and suggests players to sign or release based on statistics found online. This will include physical attributes, such as height, weight, 40 time, as well as statistics like shots … Read More
For the next week, I plan to: extract data from the blog and Brown corpora; identify which algorithms we’ll be using; find where these are available; identify parts of existing work that’ll be useful.
These are super vague ideas, and I don’t know if they’re doable, but that’s what I came up with: an app/game for experiencing different cultures; a tone analyzer; a text editor extension for good code-writing habits.
This past week I reached out to Dave, who agreed to be my mentor for this project, and I prepared my presentation slides based on my project proposal. I dug into the NVDA developer’s guide and walked through some basic … Read More
Poster3 36×42 Final poster on sensing the real-world environment and predicting using machine learning.
Vitalii Poster Final
Minh Vo – Poster Presentation
The finalized poster for EPIC Expo is attached here, on the addition of new training samples to a decision tree.
The finalized poster is attached to this post. CS488Poster_Niraj
This week I acquired a Linux ISO file that I will run using VirtualBox, so I will have something to put Singularity on, and reviewed how to use Singularity and DOCK. Things I have to do for the … Read More