Grouping
Explore how to use pandas groupby to organize MLB data by year and team, calculate total and average statistics, and filter grouped data effectively.
We'll cover the following...
We'll cover the following...
Chapter Goals:
- Learn how to group DataFrames by columns
- Write code to retrieve home run statistics through DataFrame grouping
A. Grouping by column
When dealing with large amounts of data, it is usually a good idea to group the data by common categories. For example, we could group a large dataset of MLB player statistics by year, so we can deal with each year's data separately.
With pandas DataFrames, we can perform dataset grouping with the groupby function. A common usage of the function is to group a DataFrame by values from a particular column, e.g. a column representing years.
The code below shows how to use the groupby function, with ...