I created a Python notebook titled “Premier League: Goalies EDA for the 22/23 season,” which offers a comprehensive exploratory data analysis (EDA) of goalkeeper data from the 2022/2023 Premier League season. There is more to explore, and I would love comments on things you might want to see!
Here’s a summary of the findings and visualizations presented in the notebook:
- Data Overview: The notebook begins with loading the dataset containing player information for the 2022/2023 season, focusing on relevant fields for goalkeepers such as appearances, assists, goals conceded, clean sheets, and annual salary.
- Filtering Goalkeeper Data: The dataset is filtered to include only goalkeeper information, resulting in a subset of data specifically about goalkeepers.
- Descriptive Statistics: Descriptive statistics are computed for goalkeeper data, including counts, means, standard deviations, and quartiles for various attributes such as appearances, assists, goals conceded, clean sheets, and annual salary.
- Data Exploration: The notebook explores the distribution of goalkeeper appearances using histograms, revealing insights into the frequency of goalkeepers playing specific numbers of games throughout the season.
- Analysis by Club: The club analyzes Goalkeeper appearances, highlighting which clubs utilized their goalkeepers more frequently throughout the season.
- Goalkeeper Appearances by Name and Club: A bar chart visualizes the number of appearances by each goalkeeper for their respective clubs, providing insights into goalkeeper rotation and usage patterns.
- Specific Club Analysis (e.g., Manchester City): Individual analyses are conducted for particular clubs, such as Manchester City, to delve deeper into the performance and appearances of their goalkeepers.
- Clean Sheets Statistics: Clean sheets statistics are explored, including the number of clean sheets achieved by each goalkeeper in both home and away matches.
- Visualization with Seaborn: Seaborn is used to create visualizations, such as bar plots, to present clean sheet statistics by a goalkeeper, providing a clear comparison of clean sheet performances.
Overall, the notebook offers a detailed exploration of goalkeeper data from the 2022/2023 Premier League season, providing valuable insights for further analysis and modeling efforts.