MLB DATA ANALYSIS
For our project we decided to analyze Major League Baseball data by year for all MLB teams. The dataset contained statistics for each team from 1876-2020. We made the decision to focus only on baseball played in the modern era, post 1945. From our dataset we dropped the years that had columns that were null (certain stats were not tracked in this dataset prior to 1970 such as "batters hit by pitch" and "sacrifice flies"). We also decided to drop the year 2020 because of the shortened season (only 60 games were played) due to the COVID-19 pandemic. On the remaining data we used both supervised and unsupervised machine learning algorithms to help us analyze the data.
Image Source: mlb.com