Hi 👋 I'm Rachael.

A data science major in Smith College who is passionate about working with numbers, collecting data, and finding trends and patterns. I enjoy working on data analytics projects using various tools and techniques. This site is where I keep track of my personal projects.


Data Analytics Projects


Here, you can find an overview of personal data analytics projects that I have worked on to further develop my Data Science skills. You also can find all projects on Github.


Seattle Weather Forecasting Permalink

Assess four time-series statistical forecasting methods to determine the most accurate predictive model for daily temperature patterns in Seattle, Washington.

LEGO® Price Analysis Permalink

Investigate the relationship between the prices of LEGO® sets and the number of pieces in the set and its theme to gain a better understanding of where consumer's money is going and identify which specific franchises within LEGO® have the best price-to-piece ratio.

Fenceline Community Permalink

Analyze multiple data sources to profile a "fenceline community" in the US, the risks individuals in these communities are exposed to and the extent to which certain demographics are disproportionately exposed to those risks.

Quantium's Retail Strategy Analytics Permalink

Conduct analysis on the transactional and customer data for a large supermarket brand to generate highly valued insights and provide recommendations to help the business make strategic decisions.

College Scorecard Data Visualization Permalink

Use College Scorecard data to visualize the relationship between student enrollment and the distribution of degree areas based on the type and level of institution in Massachusetts.

Open Payment Data Wrangling Permalink

Wrangle the Open Payment dataset in order to reveal findings about the financial relationships between drug and device companies and certain healthcare providers in Massachusetts.

Machine Learning Projects


Explore a selection of my personal machine learning projects, showcasing my skills in building and evaluating models, and my expertise in using statistical tools for model selection, regularization, and cross-validation. You also can find all projects on Github.


Multiple Regression Model Permalink

A machine learning model leveraging multiple regression techniques to forecast house prices based on a range of predictor variables, with RMSLE used for accuracy evaluation.

CART Model Permalink

This machine learning model utilizes Classification and Regression Trees (CART) to estimate house prices by analyzing several predictors, with RMSLE assessing the model’s precision.

Cross-Validation Accuracy Permalink

An evaluation of 10-fold cross-validation accuracy estimates in predicting Kaggle competition scores using K-Nearest Neighbors, multiple regression, and CART models.

Logistic Regression Model Permalink

A machine learning model applying logistic regression to predict the likelihood of financial distress within two years, using factors like age and credit history, with accuracy gauged by AUC.

LASSO Regression Model Permalink

Employing LASSO regression, this machine learning model predicts house prices by focusing on key predictor variables, with RMSLE measuring performance.

Random Forest Model Permalink

A random forests machine learning algorithm is used here to predict house prices based on a diverse set of predictors, with RMSLE ensuring model accuracy.

Database Projects


Explore an overview of my personal database project, crafted to showcase my proficiency in database design, SQL, and data management. You also can find all projects on Github.


Non-profit Database Design Permalink

A comprehensive relational database design that centralizes information, enabling more effective resource allocation, volunteer deployment, fundraising efforts, and community engagement for non-profit organizations.

Java Projects


Discover an overview of my personal Java projects, which I have undertaken to enhance my problem-solving abilities. You also can find all projects on Github.


Criminal Justice Permalink

This project models and replicates the ProPublica’s analysis that highlights the racial biases in the COMPAS recidivism algorithm.

Spell Checking Permalink

A spell-checking program that stores its dictionary in a hash table checks whether a given word is spelled correctly and provides a method to suggest possible alternatives in the event of a misspelled word.

Postfix/Infix Calculator Permalink

This project uses an ArrayDeque as a stack to compute the result of expressions written in both postfix notation and infix notation by implementing the Shunting-yard algorithm.

Recursive Maze Solver Permalink

A program for solving mazes using recursion and animating the solution.

Linked List Implementation Permalink

My own implementation of singly linked lists and doubly linked lists.

Card Sorting Permalink

An animated card sorter that uses the Insertion Sort, Selection Sort, Merge Sort, and Quicksort algorithms on Linked Lists and compares their running times.

Decision Trees Permalink

This project creates a DecisionTree class that inherits from the BinaryTree class, and utilizes it to develop a program that plays a "20 Questions" style guessing game.

Potpourri of Mini Projects Permalink

A collection of mini projects including a recursive program that generates fractal patterns, an enhanced BST, algorithms for converting array to binary tree and tree to DLL, and an in-place heap sorting algorithm.