NumPy tutorial

Python 3

Since the focus of this notebook is math, we note only one difference between Python 2 and 3: in Python 3, dividing two integers with / performs true division and returns a float, instead of rounding down as in Python 2:

In [1]:

1/2

Out[1]:

0.5
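
If you do want the old rounding-down behavior, Python 3 provides the floor-division operator //. A minimal sketch contrasting the two (the variable names are illustrative only):

```python
# True division always returns a float, even when the result is whole.
true_quotient = 7 / 2       # 3.5
whole_quotient = 4 / 2      # 2.0 (a float, not the int 2)

# Floor division rounds toward negative infinity.
floor_quotient = 7 // 2     # 3
negative_floor = -7 // 2    # -4, not -3

print(true_quotient, whole_quotient, floor_quotient, negative_floor)
```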

OK, that's it for Python 3!

NumPy

NumPy is the standard package for operations on multidimensional arrays. Be aware that it originated as a merge of two earlier packages and has evolved considerably over time, so it contains some features that you should avoid. For example, there is a matrix class (np.matrix), but the general consensus is that you should not use it.

The array class (that is, the one that you should use) is called ndarray. Every array has a shape, which is a tuple of nonnegative sizes, one for each dimension of the array. For example, a 2x3 matrix has shape (2,3).

Dimensions in NumPy are also called axes. Note that they are not the same as the dimensions of a vector space in linear algebra.
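
To make the distinction concrete, here is a small sketch (the names m and v are illustrative) that inspects the shape and the number of axes of a matrix and of a vector. A vector with 5 components lives in a 5-dimensional vector space, yet as an array it has only one axis:

```python
import numpy as np

m = np.zeros((2, 3))    # a 2x3 matrix
print(m.shape)          # (2, 3)
print(m.ndim)           # 2: the number of axes

v = np.zeros(5)         # a vector with 5 components
print(v.shape)          # (5,)
print(v.ndim)           # 1: a single axis of size 5
```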

Creating arrays

In [2]:

import numpy as np

In [3]:

np.zeros((2, 3))

Out[3]:

array([[ 0.,  0.,  0.],
       [ 0.,  0.,  0.]])

In [4]:

np.random.uniform(0, 1, (2, 3))

Out[4]:

array([[ 0.41841411,  0.18258245,  0.62852128],
       [ 0.34715359,  0.61273193,  0.08825056]])

In [5]:

np.array([[0., 1., 2.], [3., 4., 5.]]) # default is row-major order

Out[5]:

array([[ 0.,  1.,  2.],
       [ 3.,  4.,  5.]])

Indexing and slicing

In [6]:

a = np.array([[0., 1., 2.], [3., 4., 5.]])

In [7]:

a[1, 2]

Out[7]:

5.0

In [8]:

a[0]

Out[8]:

array([ 0.,  1.,  2.])

In [9]:

a[0,:] # synonymous with a[0]

Out[9]:

array([ 0.,  1.,  2.])

In [10]:

a[:,0]

Out[10]:

array([ 0.,  3.])

Note that, unlike slices of lists (which are copies), slices of arrays are views: they "point" back to the original array, so writing to a slice modifies the original:

In [11]:

b = a[0]
b[0] = 6.
a

Out[11]:

array([[ 6.,  1.,  2.],
       [ 3.,  4.,  5.]])
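
The sketch below puts the list and array behaviors side by side and shows one way to get an independent array, namely the copy() method. All variable names are illustrative:

```python
import numpy as np

# Slicing a list copies the selected elements.
lst = [0., 1., 2.]
lst_slice = lst[0:2]
lst_slice[0] = 6.
print(lst)                  # the original list is unchanged

# Slicing an array returns a view onto the same memory.
arr = np.array([0., 1., 2.])
view = arr[0:2]
view[0] = 6.
print(arr)                  # the original array now starts with 6.

# An explicit copy is independent of the original.
arr2 = np.array([0., 1., 2.])
independent = arr2[0:2].copy()
independent[0] = 6.
print(arr2)                 # unchanged
```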

Elementwise operations

In [12]:

a = np.array([[0., 1., 2.], [3., 4., 5.]])
b = np.array([[6., 7., 8.], [9., 10., 11.]])

In [13]:

a + b

Out[13]:

array([[  6.,   8.,  10.],
       [ 12.,  14.,  16.]])

In [14]:

a - b

Out[14]:

array([[-6., -6., -6.],
       [-6., -6., -6.]])

In [15]:

a * b

Out[15]:

array([[  0.,   7.,  16.],
       [ 27.,  40.,  55.]])

In [16]:

a / b

Out[16]:

array([[ 0.        ,  0.14285714,  0.25      ],
       [ 0.33333333,  0.4       ,  0.45454545]])

In [17]:

np.exp(a)

Out[17]:

array([[   1.        ,    2.71828183,    7.3890561 ],
       [  20.08553692,   54.59815003,  148.4131591 ]])

In [18]:

np.log(b) # natural log

Out[18]:

array([[ 1.79175947,  1.94591015,  2.07944154],
       [ 2.19722458,  2.30258509,  2.39789527]])

In [19]:

np.tanh(a) # yes, we'll actually use this

Out[19]:

array([[ 0.        ,  0.76159416,  0.96402758],
       [ 0.99505475,  0.9993293 ,  0.9999092 ]])

Broadcasting

In some cases, it's possible to apply a binary elementwise operation (like +) to two arrays with different shapes. Namely, if an axis has size 1 in one array, it can be "broadcast" (stretched) to match the size of that axis in the other array. This is easier to demonstrate by example.

In [20]:

a = np.zeros((2, 3))
b = np.array([[1., 2., 3.]])
b.shape

Out[20]:

(1, 3)

In [21]:

a + b

Out[21]:

array([[ 1.,  2.,  3.],
       [ 1.,  2.,  3.]])

In [22]:

b = np.array([[1.],[2.]])
b.shape

Out[22]:

(2, 1)

In [23]:

a + b

Out[23]:

array([[ 1.,  1.,  1.],
       [ 2.,  2.,  2.]])

If one array has fewer axes than the other, its shape is left-padded with ones:

In [24]:

b = np.array([1., 2., 3.])
b.shape

Out[24]:

(3,)

In [25]:

a + b

Out[25]:

array([[ 1.,  2.,  3.],
       [ 1.,  2.,  3.]])
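
Both rules can apply at once, and sizes that differ without either being 1 are rejected. A minimal sketch, with illustrative names col and row:

```python
import numpy as np

col = np.array([[1.], [2.]])        # shape (2, 1)
row = np.array([10., 20., 30.])     # shape (3,), treated as (1, 3)

# Both size-1 axes are stretched, so the result has shape (2, 3).
total = col + row
print(total.shape)
print(total)

# (2, 3) and (2,) do not match: (2,) is padded to (1, 2), and 3 != 2.
try:
    np.zeros((2, 3)) + np.zeros((2,))
    mismatch_raised = False
except ValueError:
    mismatch_raised = True
print(mismatch_raised)
```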

Reductions

Reductions apply an operation along an axis and collapse that axis: for example, axis=0 combines the rows of a matrix, giving one result per column, and axis=1 combines the columns, giving one result per row. (If you don't specify an axis, the operation will be performed on the entire array.)

In [26]:

a

Out[26]:

array([[ 0.,  0.,  0.],
       [ 0.,  0.,  0.]])

In [27]:

np.sum(a, axis=0) # sum over axis 0 (the rows), giving one sum per column

Out[27]:

array([ 0.,  0.,  0.])

In [28]:

np.sum(a, axis=1) # sum over axis 1 (the columns), giving one sum per row

Out[28]:

array([ 0.,  0.])

In [29]:

np.max(a, axis=0)

Out[29]:

array([ 0.,  0.,  0.])

In [30]:

np.min(a, axis=0)

Out[30]:

array([ 0.,  0.,  0.])

In [31]:

np.argmax(a, axis=0) # row index of each column's max (the first one on ties)

Out[31]:

array([0, 0, 0])

In [32]:

np.argmin(a, axis=0) # row index of each column's min (the first one on ties)

Out[32]:

array([0, 0, 0])
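
Because the array above is all zeros, the results are not very revealing. The following sketch repeats some of the reductions on a hypothetical array x with distinct entries. The keepdims=True argument is an extra not used elsewhere in this notebook: it keeps the reduced axis with size 1 so that the result can be broadcast against the original array.

```python
import numpy as np

x = np.array([[0., 1., 2.], [3., 4., 5.]])

col_sums = np.sum(x, axis=0)        # one sum per column: [3., 5., 7.]
row_sums = np.sum(x, axis=1)        # one sum per row: [3., 12.]
total = np.sum(x)                   # no axis: sum of everything, 15.0
col_argmax = np.argmax(x, axis=0)   # row index of each column's max: [1, 1, 1]

# keepdims=True gives shape (2, 1) instead of (2,), which broadcasts
# against x so that each row can be divided by its own sum.
row_sums_kept = np.sum(x, axis=1, keepdims=True)
normalized = x / row_sums_kept

print(col_sums, row_sums, total, col_argmax)
print(row_sums_kept.shape)
print(np.sum(normalized, axis=1))   # each row now sums to 1
```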

Transposing axes

In [33]:

a.T

Out[33]:

array([[ 0.,  0.],
       [ 0.,  0.],
       [ 0.,  0.]])

Transpose operations don't copy the data; they return views of the original array.

In [34]:

b = a.T
b[0,1] = 7.
a

Out[34]:

array([[ 0.,  0.,  0.],
       [ 7.,  0.,  0.]])

In [35]:

c = np.zeros((2,3,4,5))

You will hopefully not need them, but more complex rearrangements of axes are possible too:

In [36]:

d = np.moveaxis(c, 1, 2) # move axis 1 to become axis 2
d.shape

Out[36]:

(2, 4, 3, 5)
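
The same rearrangement can also be written as an explicit permutation with np.transpose. This is a sketch of an alternative, not something the rest of the notebook relies on:

```python
import numpy as np

c = np.zeros((2, 3, 4, 5))

# The tuple lists, for each new axis, which old axis it comes from.
d = np.transpose(c, (0, 2, 1, 3))
print(d.shape)          # (2, 4, 3, 5), the same as np.moveaxis(c, 1, 2)

# With no permutation given, .T reverses the order of all axes.
print(c.T.shape)        # (5, 4, 3, 2)

# Like .T, the result is a view on the same data.
print(np.shares_memory(c, d))
```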

Changing shape

Arrays can be reshaped arbitrarily, but you will probably only ever need to add or remove axes of size one in order to make broadcasting work the way you want.

In [37]:

e = np.expand_dims(c, 0) # add new axis 0 (with size 1)
e.shape

Out[37]:

(1, 2, 3, 4, 5)

In [38]:

f = np.squeeze(e, 0) # remove axis 0 (which must have size 1)
f.shape

Out[38]:

(2, 3, 4, 5)

Again, these operations don't copy the data; they return views.
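
One way to check this is to write through the reshaped array and look at the original, or to ask np.shares_memory directly. The sketch also shows indexing with np.newaxis, a common alternative spelling of expand_dims that is not used elsewhere in this notebook:

```python
import numpy as np

c = np.zeros((2, 3, 4, 5))
e = np.expand_dims(c, 0)        # shape (1, 2, 3, 4, 5)
e2 = c[np.newaxis, ...]         # same shape, written as an index
f = np.squeeze(e, 0)            # back to shape (2, 3, 4, 5)

# Writing through the view changes the original array.
e[0, 1, 2, 3, 4] = 9.
print(c[1, 2, 3, 4])            # 9.0

print(e2.shape)
print(np.shares_memory(c, e), np.shares_memory(c, f))
```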

Vector and matrix multiplication

To do matrix multiplication, don't use * (which is elementwise); instead use the @ operator or np.dot:

In [39]:

a = np.random.uniform(0, 1, (2,3))
b = np.random.uniform(0, 1, (3,4))

In [40]:

a @ b # requires Python 3.5+ and NumPy 1.10+

Out[40]:

array([[ 0.35177723,  0.29972786,  0.50640218,  0.4781246 ],
       [ 0.77974407,  0.44074346,  0.57848922,  0.84393576]])

In [41]:

np.dot(a, b) # All versions

Out[41]:

array([[ 0.35177723,  0.29972786,  0.50640218,  0.4781246 ],
       [ 0.77974407,  0.44074346,  0.57848922,  0.84393576]])

These two functions (@ calls np.matmul) behave differently for arrays with more than 2 axes. Hopefully, you will not need to know the difference for this class.
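
For the curious, the difference shows up in the result shapes. In this sketch (the names batched and paired are illustrative), @ treats the leading axis as a batch of independent matrix products, while np.dot combines every matrix in the first array with every matrix in the second:

```python
import numpy as np

a = np.ones((2, 3, 4))
b = np.ones((2, 4, 5))

# matmul: two independent (3, 4) @ (4, 5) products, stacked.
batched = a @ b
print(batched.shape)    # (2, 3, 5)

# dot: sums over the last axis of a and the second-to-last axis of b,
# keeping all remaining axes of both arrays.
paired = np.dot(a, b)
print(paired.shape)     # (2, 3, 2, 5)
```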

The same operator/function works for vector-vector dot (inner) products, matrix-vector products, and (row) vector-matrix products.
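
A short sketch of the three cases, using small hypothetical arrays A, u, and w. A one-axis array acts as a row or column vector depending on which side of @ it appears:

```python
import numpy as np

A = np.array([[1., 2., 3.], [4., 5., 6.]])  # shape (2, 3)
u = np.array([1., 0., 1.])                  # shape (3,)
w = np.array([1., 2.])                      # shape (2,)

inner = u @ u       # vector-vector inner product: 2.0
mat_vec = A @ u     # matrix-vector product, shape (2,): [4., 10.]
vec_mat = w @ A     # (row) vector-matrix product, shape (3,): [9., 12., 15.]

print(inner, mat_vec, vec_mat)
```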