Jake Haines


I'm a prospective data engineer and seasoned creative. My visionary mindset and deep understanding of sound, automotive, and technology position me to streamline data pipelines and data manipulation.

Portfolio


Data

All projects related to data science, engineering, operations, and analysis.


Research

Formal end-to-end research projects, generally drawing on a broader set of skills.


Music

Musical projects including original songs, remixes, press features, and tutorials.

Data Portfolio


Autoproxy: rotating proxy for macOS

Python script that iterates through the proxies available from Proxyscrape. For web scrapers and macOS.


Gas price prediction using linear regression

Gas prices in the United States have consistently been an area of interest due to heavy reliance on...


Apartment search using pure mathematics

Have you ever been apartment searching, and had a large, comprehensive list of places you...


Tracking mental health using data analysis

Using Python, I used one year of user data exported from mobile apps to explore several different...


Mini-golf performance analysis

I recently went on a mini-golf date, which was my first time playing mini-golf in years. With present...


Restructuring inventory data using ETL

Desperate for a summer job to add more data science work experience for my resume, I...


Nickel and cobalt sourcing analysis

With demand for EV batteries skyrocketing, demand for lithium-based batteries follows. Demand...


Determining ideal lithium sources for EV batteries

With increasing incentive and collective interest in electric vehicles mostly in the automobile...

Research Portfolio


Traffic flow in university residential parking

Shortly after the start of the Fall 2020 semester, my university cut off access to half of the resident...

Music Portfolio


Original music


Remixes


Featured on

Indie Shuffle
Indie Shuffle is a music discovery platform founded in 2009 featuring independent and emerging artists, with a diverse user base of music enthusiasts from around the world.


Stereofox
Stereofox is a music discovery platform and blog that curates a variety of music in indie, electronic, and alternative genres, attracting a dedicated community of music enthusiasts.


Futuremag
Futuremag Music is an online music publication based in Australia, founded in 2015. It focuses on broadcasting emerging artists with incredible talent worldwide.


The Groove Cartel
The Groove Cartel is a Chilean blog founded in 2014, chosen as one of the best music blogs by Sound-unsound and Movie Hustle in 2019. It receives over 80k unique visitors per month.


Endorsed by

Nathaniel Drew, Laszlo, Nigel Good, inverness, Justin Hawkes

Traffic flow in university residential parking

Skills used: Microsoft SQL Server, Docker, Python (pandas, tensorflow), Tableau, Raspberry Pi, Linux, Unix, bash, SSIS, IoT, machine learning, computer vision, CNNs, infrastructure, data engineering, data visualization, ETL


Motivation: Shortly after the start of the Fall 2020 semester, my university cut off access to half of the residential parking spaces available near my dorm. Finding parking as a resident on campus was hard enough before that. When I contacted the university to ask why the parking deck was closed off, I did not receive a transparent response. With time, I noticed clear trends in when parking spaces were available.

The goal of the project was to gain more insight into the flow of traffic entering and exiting residential parking areas, so I could plan around the inconvenience of not having parking nearby when I needed it. To put it simply, I was trying to hack the inconvenience.

This project was presented at the Statistics Symposium at the University of North Carolina at Asheville and the Undergraduate STEM Research Symposium at Wake Forest University.

Determining ideal lithium sources for EV batteries

Skills used: Tableau, data analysis, data visualization


Motivation: With increasing incentive and collective interest in electric vehicles, mostly in the automobile industry (but scaling beyond that), demand for lithium-ion batteries is increasing and will likely continue to grow at unprecedented rates.

The goal of this analysis was to give anyone who needs to source lithium-ion batteries insight into preparing for shortages.

Nickel and cobalt sourcing analysis

Skills used: Tableau, data analysis, data visualization


Motivation: With demand for EV batteries skyrocketing, demand for lithium-based batteries follows. Demand for chemical sourcing optimization to ensure the longevity, performance, and cost-efficiency of the batteries increases as well. Two common supplemental agents in batteries are nickel and cobalt. Which is the more cost-effective option moving forward?

This project visualizes global supply chain data and provides predictions for future prospects.

Autoproxy: rotating proxy tool for macOS

Skills used: Python, APIs


Motivation: When scraping websites for data (within policy), it is easy for the client and/or its IP to be blacklisted. Testing and setting proxies over and over becomes tedious, and often requires certain scrapers to be restarted.

Autoproxy is a very small tool that lets its users run their web scrapers without being stopped. It is also useful for users on macOS as a proxy service. Autoproxy works by using the Proxyscrape API to continuously fetch and test proxy addresses, then activating the first working proxy within the current environment. It is easily usable as a Python module by web scrapers, or can be run standalone.
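The fetch, test, and activate loop described above could look roughly like the sketch below. This is not Autoproxy's actual code: the function names (`first_working_proxy`, `activate`, `rotate`) are hypothetical, the candidate list and health check are passed in by the caller rather than fetched from Proxyscrape, and "activating within the current environment" is assumed here to mean setting the `HTTP_PROXY` and `HTTPS_PROXY` variables.

```python
import os
from typing import Callable, Iterable, MutableMapping, Optional


def first_working_proxy(
    candidates: Iterable[str],
    is_alive: Callable[[str], bool],
) -> Optional[str]:
    """Return the first candidate that passes the health check, else None."""
    for proxy in candidates:
        if is_alive(proxy):
            return proxy
    return None


def activate(proxy: str, env: MutableMapping[str, str] = os.environ) -> None:
    """Assumption: activation means exporting the usual proxy variables."""
    env["HTTP_PROXY"] = f"http://{proxy}"
    env["HTTPS_PROXY"] = f"http://{proxy}"


def rotate(
    candidates: Iterable[str],
    is_alive: Callable[[str], bool],
    env: MutableMapping[str, str] = os.environ,
) -> Optional[str]:
    """Pick the first working proxy and activate it; return it (or None)."""
    proxy = first_working_proxy(candidates, is_alive)
    if proxy is not None:
        activate(proxy, env)
    return proxy
```

A scraper could call `rotate(...)` again whenever a request starts failing, passing a fresh candidate list and its own health check (for example, a short timed request through the proxy).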

Gas price prediction using linear regression

Skills used: Python (pandas, plotly, scikit-learn, BeautifulSoup), statistical tests, machine learning


Motivation: Gas prices in the United States have consistently been an area of interest due to heavy reliance on automotive transportation. Despite EVs, gas expenses remain an issue for those who commute in internal combustion vehicles.

This project is a paper whose goal is to make a small contribution to optimizing currently used modes of transportation. Writing it also let me explore the process of building a regression model.

Apartment search using pure mathematics

Skills used: linear algebra, statistics


Motivation: Have you ever been apartment searching with a large, comprehensive list of places you were looking at? It can get overwhelming, and it is often hard to decide what your best option is because you feel ambivalent towards most if not all apartments on your list (this is especially true in the Bay Area).

The goal of this project was to help me rank the apartments I was looking at by several quantitative parameters, including rent, size, and a self-rated "general feeling" about the place and its area. Each parameter was weighted, and both rankings and parameters were normalized.

Overall, this little project helped organize all the rental listings so they were easier to process and keep track of.
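As an illustration of weighted, normalized ranking, here is a minimal sketch. It is not the method from the project itself: it assumes min-max normalization, a weighted sum, and that rent is the only parameter where lower is better. The listings, weights, and function names are made up for the example.

```python
def min_max(values):
    """Scale values to [0, 1]; a constant column maps to 0.5."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.5 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def rank_apartments(apartments, weights, lower_is_better=("rent",)):
    """Return (name, score) pairs sorted from best to worst."""
    names = list(apartments)
    total = sum(weights.values())  # normalize weights so they sum to 1
    scores = {name: 0.0 for name in names}
    for param, weight in weights.items():
        column = min_max([apartments[n][param] for n in names])
        if param in lower_is_better:
            column = [1.0 - c for c in column]  # cheap rent scores high
        for name, value in zip(names, column):
            scores[name] += (weight / total) * value
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical listings and weights, purely for illustration.
apartments = {
    "A": {"rent": 2000, "size": 600, "feeling": 3},
    "B": {"rent": 3000, "size": 900, "feeling": 5},
    "C": {"rent": 2500, "size": 750, "feeling": 1},
}
weights = {"rent": 0.5, "size": 0.25, "feeling": 0.25}

ranking = rank_apartments(apartments, weights)
```

With these made-up numbers, apartment A ranks first: its low rent outweighs its small size because rent carries half of the total weight.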

Tracking mental health using data analysis

Skills used: Python (pandas, numpy, plotly, researchpy, scipy), statistical testing, data visualization, ETL


Motivation: Mental health and well-being are often hard to quantify, especially outside clinical environments. To combat this and make healthcare more accessible, there is an abundance of apps that allow users to keep track of their mental health. However, the apps don't always provide their users with multiple data points or useful analysis dashboards.

The goal of this project was to centralize data exported from multiple apps used by the same individual and run analyses to draw conclusions that the individual and others may benefit from.

Restructuring inventory data using ETL

Skills used: Python (pandas, regex), data cleaning, data transformation


Motivation: Desperate for a summer job to add more data science work experience to my resume, I accepted an offer from my brother to work for a company specializing in golf cart part distribution. It was a data entry role: take around 10,000 rows of messy, manually entered inventory data and move them to a more organized CSV readable by new software.

This project is a sample of the work I did during this internship and shows the exact methods I used.

Mini-golf performance analysis

Skills used: Python (pandas, plotly), data visualization


Motivation: After playing my first game of mini-golf in many years, I couldn't help but notice that the scorecard looked like a dataset. I noticed an improvement in my performance as the game went on, so I figured I'd make a small story using the data I had.

This was a fun project to unlock cool insights on my game of mini-golf.

About Me


From a young age, I've been filled with curiosity, creativity, and imagination, coming from an innate ability to vividly visualize scenarios. I delved into the complexities of producing electronic music, a genre that demands precision and selectivity in sound engineering and mix engineering. My work involves separating essential details from the non-essential, requiring surgical problem-solving. These skills, honed over a decade, are a cornerstone of my technical acumen.
Data Engineering
In the realm of data engineering, my approach is unconventional yet deliberate, emphasizing stability and meticulous attention to detail. I prioritize setting well-thought-out, achievable goals, delivering high-quality, efficient architecture that's primed for continuous improvement. My professional journey includes internships at Tesla, where I contributed to groundbreaking projects in future vehicle technology and big data operations.
Entrepreneurship
My entrepreneurial spirit shines through in my co-founding of Myndmap, a UK-based SaaS incorporation featured on There's An AI For That, garnering over 400 waitlist signups in its first month. This app empowers individuals with ADHD through research-backed tools addressing executive functioning deficits.
Music & Sound
At the intersection of technology and creativity, my music production journey has earned recognition from reputable independent blogs and support from established artists. My deep understanding of sound, from harmonics to frequency modulation, positions me to innovate in companies known for groundbreaking audio technologies.
Automotive
Within the automotive field, my internships at Tesla exposed me to future vehicle technology development, involving engineering proof-of-concept research and the integration of cutting-edge components.
Interests
Beyond my technical prowess, my passion for photography, nature, and travel enriches my creativity and problem-solving abilities, complementing my diverse interests in psychology, neuroscience, sustainability, and more. This multifaceted perspective fuels my drive to make a meaningful impact in the world.