Problem 1

Import NumPy under the alias np.

In [1]:
import numpy as np

Problem 2

Import pandas under the alias pd.

In [2]:
import pandas as pd

Problem 3

We will again be using salesperson data to test your knowledge of the groupby method. Given the dataset data, print a new DataFrame that shows the mean sales per salesperson, grouped by Organization.

In [3]:
data = pd.DataFrame([ ['Coca-Cola', 'Nick', 200],

                    ['Coca-Cola', 'Joel', 120],

                    ['Pepsi','Taylor', 125],

                    ['Pepsi','Josiah', 250],

                    ['Dr. Pepper','Josh', 150],

                    ['Dr. Pepper','Micaiah', 500]], 
                    columns = ['Organization', 'Salesperson Name', 'Sales'])

data
Out[3]:
Organization Salesperson Name Sales
0 Coca-Cola Nick 200
1 Coca-Cola Joel 120
2 Pepsi Taylor 125
3 Pepsi Josiah 250
4 Dr. Pepper Josh 150
5 Dr. Pepper Micaiah 500
In [4]:
#Solution goes here
data.groupby('Organization').mean()
Out[4]:
Sales
Organization
Coca-Cola 160.0
Dr. Pepper 325.0
Pepsi 187.5

Problem 4

Given the dataset data, print a new DataFrame that shows the total sales for each Organization.

In [5]:
data.groupby('Organization').sum()
Out[5]:
Sales
Organization
Coca-Cola 320
Dr. Pepper 650
Pepsi 375

Problem 5

Given the dataset data, print a new DataFrame that applies the describe method to each organization.

In [6]:
data.groupby('Organization').describe()
Out[6]:
Sales
count mean std min 25% 50% 75% max
Organization
Coca-Cola 2.0 160.0 56.568542 120.0 140.00 160.0 180.00 200.0
Dr. Pepper 2.0 325.0 247.487373 150.0 237.50 325.0 412.50 500.0
Pepsi 2.0 187.5 88.388348 125.0 156.25 187.5 218.75 250.0
In [ ]:
 
In [ ]:
 
In [ ]: