Data Analytics

Welcome to the data analytics portion of my WordPress site.


MLB: Correlating Runs Scored

Correlating runs scored to AVG, SLG, OBP, and OPS. Which metric best correlates to runs scored?

Click here to view my analysis.


Iris Dataset

Here is my foray into the famous iris dataset. Besides the usual EDA, I perform a number of supervised and unsupervised machine learning models on the data. Because it is used extensively in ML training models, it seems appropriate to spend some time to learn this data set in full.

Click here to view my analysis.


MLB No-Hitters and the Exponential Distribution

A review the exponential distribution of major league no-hitters.

Click here to view my analysis.


Rainfall in Austin, Texas

Analysis of the average rainfall in my home city of Austin, Texas.

Click here to view my analysis.


Literacy vs Fertility

Analysis of literacy vs fertility throughout the world using linear regression and bootstrap sampling.

Click here to view my analysis.


Canelo Álvarez vs. Gennady Golovkin II

Who really won the Álvarez vs Golovkin II match? Here I perform an analysis of the CompuBox stats.

Click here to view my analysis.


Dice Roll Game

A fun dice roll game where I run 10,000 simulations and perform a statistical analysis of my results.

Click here to view my analysis.


High Low Card Game

A fun card game where I run 1 million simulations and perform a statistical analysis of my results.

Click here to view my analysis.


Greatest NY Yankee

For this analysis I look at the career totals of 4 of the greatest Yankees to play the game.

Click here to view my analysis.


Happy coding!