Welcome to the data analytics portion of my WordPress site.
MLB: Correlating Runs Scored
Correlating runs scored to AVG, SLG, OBP, and OPS. Which metric best correlates to runs scored?
Here is my foray into the famous iris dataset. Besides the usual EDA, I perform a number of supervised and unsupervised machine learning models on the data. Because it is used extensively in ML training models, it seems appropriate to spend some time to learn this data set in full.
MLB No-Hitters and the Exponential Distribution
A review the exponential distribution of major league no-hitters.
Rainfall in Austin, Texas
Analysis of the average rainfall in my home city of Austin, Texas.
Literacy vs Fertility
Analysis of literacy vs fertility throughout the world using linear regression and bootstrap sampling.
Canelo Álvarez vs. Gennady Golovkin II
Who really won the Álvarez vs Golovkin II match? Here I perform an analysis of the CompuBox stats.
Dice Roll Game
A fun dice roll game where I run 10,000 simulations and perform a statistical analysis of my results.
High Low Card Game
A fun card game where I run 1 million simulations and perform a statistical analysis of my results.
Greatest NY Yankee
For this analysis I look at the career totals of 4 of the greatest Yankees to play the game.