Problem 1

Import NumPy under the alias np.

In [1]:
import numpy as np

Problem 2

Import pandas under the alias pd.

In [2]:
import pandas as pd

Problem 3

Given the following NumPy array data, create a pandas DataFrame named first_data_frame that contains the same elements. Print the DataFrame to make sure the operation executed successfully.

In [8]:
data = np.round(np.random.randn(5,5),1)
In [9]:
#Solution goes here
first_data_frame = pd.DataFrame(data)
first_data_frame

Problem 4

Assign the values of row_labels to the index of first_data_frame. Print the DataFrame to make sure the operation executed successfully.

Hint: It will be easier to overwrite first_data_frame by using another pd.DataFrame method.

In [10]:
row_labels = ['one','two','three','four','five']
In [14]:
#Solution goes here
first_data_frame = pd.DataFrame(data,row_labels)
first_data_frame
Out[14]:
0 1 2 3 4
one -1.9 1.8 0.6 0.0 1.1
two 2.3 -2.1 -0.3 -1.8 -0.6
three 0.7 0.4 -1.0 -0.7 0.3
four 0.8 0.5 -0.7 0.0 0.2
five 1.7 1.0 -1.3 1.8 1.3

Problem 5

Assign the values of column_labels to the columns of first_data_frame. Note that there are two main ways to do this - you are free to chose the method of your choice. Print the DataFrame to make sure the operation executed successfully.

In [16]:
column_labels = ['alpha','beta','charlie','delta','echo']
In [17]:
#Solution goes here
first_data_frame.columns = column_labels
first_data_frame
Out[17]:
alpha beta charlie delta echo
one -1.9 1.8 0.6 0.0 1.1
two 2.3 -2.1 -0.3 -1.8 -0.6
three 0.7 0.4 -1.0 -0.7 0.3
four 0.8 0.5 -0.7 0.0 0.2
five 1.7 1.0 -1.3 1.8 1.3

Problem 6

Create a pandas Series named my_series that contains the values from row alpha of first_data_frame. Print my_series to make sure the operation executed successfully.

In [18]:
#Solution goes here
my_series = first_data_frame['alpha']
my_series
Out[18]:
one     -1.9
two      2.3
three    0.7
four     0.8
five     1.7
Name: alpha, dtype: float64

Problem 7

Create a new DataFrame called second_data_frame that is equal to first_data_frame but without row one. Print second_data_frame to make sure the operation executed successfully.

In [19]:
#Solution goes here
second_data_frame = first_data_frame.drop('one')
second_data_frame
Out[19]:
alpha beta charlie delta echo
two 2.3 -2.1 -0.3 -1.8 -0.6
three 0.7 0.4 -1.0 -0.7 0.3
four 0.8 0.5 -0.7 0.0 0.2
five 1.7 1.0 -1.3 1.8 1.3

Problem 8

Create a new DataFrame called third_data_frame that is equal to second_data_frame, but without row charlie. Print third_data_frame to make sure the operation executed successfully.

In [21]:
#Solution goes here
third_data_frame = second_data_frame.drop('charlie', axis=1)
third_data_frame
Out[21]:
alpha beta delta echo
two 2.3 -2.1 -1.8 -0.6
three 0.7 0.4 -0.7 0.3
four 0.8 0.5 0.0 0.2
five 1.7 1.0 1.8 1.3

Problem 9

Create a variable called row_two that is equal to row two from third_data_frame. Print row_two to make sure the operation executed successfully.

In [22]:
#Solution goes here
row_two = third_data_frame.loc['two']
row_two
Out[22]:
alpha    2.3
beta    -2.1
delta   -1.8
echo    -0.6
Name: two, dtype: float64

Problem 10

Print the shape of new_data.

In [24]:
new_data = np.round(np.random.randn(5,5),1)
In [26]:
#Solution goes here
new_data.shape
Out[26]:
(5, 5)

Problem 11

Print a DataFrame that contains boolean values that indicate whether the elements of new_data are greater than 1.

In [29]:
pd.DataFrame(new_data > 1)
Out[29]:
0 1 2 3 4
0 True True True False False
1 False False False False True
2 False False False True False
3 True False False False False
4 False False False True False

Problem 12

Print a NumPy array that contains only the elements of new_data that are greater than 1.

In [32]:
new_data[new_data > 1]
Out[32]:
array([2.3, 1.5, 1.6, 1.2, 2.2, 1.1, 1.7])
In [ ]:
 
In [ ]:
 
In [ ]: