Introducción a Python¶

en construcción, revisado el 17/05/17
Alberto Ruiz, DIS-UM

Python es un lenguaje de propósito general que permite acceder cómodamente a una amplísima colección de bibliotecas útiles en todos los campos de la informática. Su uso en ciencia y tecnología es cada vez mayor.

Es un lenguaje interpretado, con tipos de datos dinámicos y manejo automático de memoria, que puede utilizarse tanto para escribir programas de forma tradicional como para experimentar en un entorno interactivo. Incluye las construcciones más importantes de programación funcional y también admite programación orientada a objetos.

Aunque su sintaxis es sencilla e intuitiva tiene algunas diferencias con respecto a otros lenguajes usados en el ámbito científico que inicialmente pueden inducir a confusión:

Los bloques de instrucciones de las construcciones condicionales, bucles y funciones se delimitan mediante la "indentación del código": no se utiliza "end" ni { }.
Los índices para acceder a los arrays o listas comienzan en 0 y acaban en tamaño-1. Las secuencias (range) usadas en bucles o list comprehensions no incluyen el límite superior.
Algunas funciones tienen la sintaxis tradicional f(x), g(x,a), mientras que otras se expresan como x.f(), x.g(a), etc., indicando que el "objeto" x se modifica de alguna forma.
Los arrays y las listas son "mutables": su asignación a otra variable no crea una copia del objeto original sino una "referencia" a través de la cual se puede modificar la estructura original.
Las funciones pueden leer directamente el valor de variables globales, pero para modificarlas hay que declararlas como global. La asignación de variables dentro de una función crea variables locales.

tipos simples¶

Cadenas de caracteres:

In [1]:

s = 'Hola' 

In [2]:

Out[2]:

'Hola'

In [3]:

print(s)

Hola

In [4]:

type(s)

Out[4]:

str

Se admiten diferentes tipos de delimitadores y cadenas multilínea.

In [5]:

type("Hola ")

Out[5]:

str

In [6]:

type('''Hola''')

Out[6]:

str

Variables lógicas:

In [7]:

c = 3 < 4

In [8]:

type(c)

Out[8]:

bool

In [9]:

c and (2==1+1) or not (3 != 5)

Out[9]:

True

Números reales aproximados con coma flotante de doble precisión:

In [10]:

x = 3.5

In [11]:

type(x)

Out[11]:

float

Los enteros tienen tamaño ilimitado:

In [12]:

x = 20

In [13]:

type(x)

Out[13]:

int

In [14]:

x**x

Out[14]:

104857600000000000000000000

Variable compleja:

In [15]:

(1+1j)*(1-1j)

Out[15]:

(2+0j)

In [16]:

import cmath

cmath.sqrt(-1)

Out[16]:

1j

tuplas¶

Se usan normalmente para almacenar unos pocos datos, posiblemente de diferentes tipos:

In [17]:

t = (2,'rojo')

In [18]:

Out[18]:

(2, 'rojo')

In [19]:

t[0]

Out[19]:

Son inmutables.

listas¶

In [20]:

l = [1,2,3,4,5]

In [21]:

type(l)

Out[21]:

list

In [22]:

l[0]

Out[22]:

In [23]:

l[-1]

Out[23]:

In [24]:

len(l)

Out[24]:

In [25]:

l[1:3]

Out[25]:

[2, 3]

In [26]:

[1,2,3] + [4,5]

Out[26]:

[1, 2, 3, 4, 5]

Lo normal es guardar datos del mismo tipo aunque no es obligatorio.

In [27]:

s =[1,2,3,"hola",True]
type(s)

Out[27]:

list

Las listas y arrays son referencias a objetos mutables:

In [28]:

x = 2
y = x
y = 100

x

Out[28]:

In [29]:

x = [1,2]
y = x
y[0] = 100

x

Out[29]:

[100, 2]

Para evitar este comportamiento podemos hacer una copia:

In [30]:

x = [1,2]
y = x.copy()
y[0] = 100

x

Out[30]:

[1, 2]

In [31]:

import numpy as np

a = np.array([1,2,3])
b = a
c = a.copy()
b[1] = 100

a, c

Out[31]:

(array([  1, 100,   3]), array([1, 2, 3]))

Relacionado con esto, algunas operaciones modifican la estructura original y otras no:

In [32]:

m = np.array(range(10))

m.resize([2,5])

In [33]:

Out[33]:

array([[0, 1, 2, 3, 4],
       [5, 6, 7, 8, 9]])

In [34]:

m = np.array(range(10))

np.resize(m,[2,5])

Out[34]:

array([[0, 1, 2, 3, 4],
       [5, 6, 7, 8, 9]])

In [35]:

Out[35]:

array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

In [36]:

m = np.array(range(10))

m.reshape(2,5)

Out[36]:

array([[0, 1, 2, 3, 4],
       [5, 6, 7, 8, 9]])

In [37]:

Out[37]:

array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

dictionaries¶

Es un array asociativo (el índice puede ser cualquier tipo (inmutable)). Es una estructura muy utilizada en Python.

In [38]:

d = {'lunes': 8, 'martes' : [1,2,3], 3: 5}

In [39]:

d['martes']

Out[39]:

[1, 2, 3]

In [40]:

d.keys()

Out[40]:

dict_keys([3, 'lunes', 'martes'])

In [41]:

d.values()

Out[41]:

dict_values([5, 8, [1, 2, 3]])

control¶

Condiciones:

In [42]:

k = 7

if k%2 == 0:
    print(k," es par")
else:
    print(k," es impar")
    print("me gustan los impares")

7  es impar
me gustan los impares

En Python 3 las secuencias entregan los elementos "bajo demanda", no crean una lista completa.

In [43]:

range(3,8)

Out[43]:

range(3, 8)

In [44]:

list(range(3,8))

Out[44]:

[3, 4, 5, 6, 7]

Bucles:

In [45]:

for k in [1,2,3]:
    print(k)

1
2
3

In [46]:

for k in range(5):
    print(k)

In [47]:

k = 1
p = 1
while k < 5:
    p = p*k
    k = k+1
p

Out[47]:

Hay muchos tipos de contenedores que pueden recorrerse con bucles: listas, arrays, diccionarios, etc.

funciones¶

In [48]:

def sp(n):
    r = n**2+n+41
    return r

In [49]:

sp(5)

Out[49]:

Se pueden devolver varios resultados en una tupla:

In [50]:

import math

def ecsec(a,b,c):
    d = math.sqrt(b**2- 4*a*c)
    s1 = (-b+d)/2/a
    s2 = (-b-d)/2/a
    return (s1,s2)

In [51]:

ecsec(2,-6,4)

Out[51]:

(2.0, 1.0)

Los paréntesis de la tupla son opcionales.

In [52]:

a,b = ecsec(1,-3,2)

b

Out[52]:

1.0

Las variables globales son visibles dentro de las funciones y las asignaciones crean variables locales (a menos que el nombre se declare global).

In [53]:

a = 5

b = 8

def f(x):
    b = a+1
    return b

print(f(3))
print(b)

6
8

Argumentos por omisión:

In [54]:

def incre(x,y=1):
    return x + y

print(incre(5))
print(incre(5,3))

6
8

Argumentos por nombre:

In [55]:

incre(y=3, x=2)

Out[55]:

Documentación:

In [56]:

# ? sum
help(sum)

Help on built-in function sum in module builtins:

sum(iterable, start=0, /)
    Return the sum of a 'start' value (default: 0) plus an iterable of numbers
    
    When the iterable is empty, return the start value.
    This function is intended specifically for use with numeric values and may
    reject non-numeric types.

In [57]:

def fun(n):
    """Una función muy simple que calcula el triple de su argumento."""
    return 3*n

In [58]:

help(fun)

Help on function fun in module __main__:

fun(n)
    Una función muy simple que calcula el triple de su argumento.

bibliotecas¶

Las funciones definidas en un archivo se pueden utilizar directamente haciendo un import. Existe una convención para definir una función main que se ejecuta cuando el archivo se arranca como programa y suele usarse para ejecutar tests.

programación funcional¶

En Python 3 las construcciones funcionales crean secuencias "bajo demanda".

In [59]:

map(sp,range(5))

Out[59]:

<map at 0x7f768c0d2128>

In [60]:

for k in map(sp,range(5)):
    print(k)

In [61]:

list(map(sp,range(5)))

Out[61]:

[41, 43, 47, 53, 61]

In [62]:

list(filter(lambda x: x%2 == 1, range(10)))

Out[62]:

[1, 3, 5, 7, 9]

List comprehensions:

In [63]:

[k**2 for k in range(10) if k >5 ]

Out[63]:

[36, 49, 64, 81]

In [64]:

def divis(n):
    return [k for k in range(2,n) if n%k==0]

In [65]:

divis(12)

Out[65]:

[2, 3, 4, 6]

In [66]:

divis(1001)

Out[66]:

[7, 11, 13, 77, 91, 143]

In [67]:

def perfect(n):
    return sum(divis(n)) + 1 == n

In [68]:

perfect(4)

Out[68]:

False

In [69]:

perfect(6)

Out[69]:

True

In [70]:

def prime(n):
    return divis(n)==[]

In [71]:

[k for k in range(2,21) if prime(k)]

Out[71]:

[2, 3, 5, 7, 11, 13, 17, 19]

Función que construye funciones:

In [72]:

def mkfun(y):
    return lambda x: x+y

In [73]:

f = mkfun(1)
g = mkfun(5)

print(f(10))
print(g(10))

11
15

In [74]:

fs = list(map(mkfun,range(1,6)))

print(fs[0](10))
print(fs[4](10))

11
15

Existe reduce (fold) pero parece que no es muy "idiomático".

gráficas¶

Hay muchos paquetes para producir gráficos espectaculares. Uno de los más conocidos es matplotlib, que puede utilizarse con un interfaz muy parecido al de Matlab/Octave.

In [75]:

import numpy as np
import matplotlib.pyplot as plt

%matplotlib inline

In [76]:

x=np.linspace(0,2*np.pi,200)

In [77]:

plt.plot(np.sin(x))

Out[77]:

[<matplotlib.lines.Line2D at 0x7f7678b9f2e8>]

In [78]:

plt.plot(np.cos(x),np.sin(x)); plt.axis('equal')

Out[78]:

(-1.0998691589935401,
 1.0999937694758828,
 -1.099965731583572,
 1.099965731583572)

In [79]:

plt.plot(x,np.sin(x), x,np.cos(x));

In [80]:

plt.plot(x,np.sin(x),'r',x,np.sin(2*x),'k',[1,2.5],[-0.5,0],'bo');
plt.legend(['hola','bla','bla bla']);
plt.xlabel('x'); plt.ylabel('y'); plt.title('bonito plot'); plt.axis('tight');

In [81]:

plt.plot(x,np.exp(x)); plt.axis([0,3,-1,5]);

In [82]:

for k in [1,2,3]:
    plt.plot(x,np.sin(k*x))
plt.grid()

In [83]:

def espiral(n):
    t = np.linspace(0,n*2*np.pi,1000)
    r = 3 * t
    x = r * np.cos(t)
    y = r * np.sin(t)
    plt.plot(x,y)
    plt.axis('equal')
    plt.axis('off')

espiral(4)

In [84]:

import numpy.random as rnd

def randwalk(n,s):
    p = s*rnd.randn(n,2)
    r = np.cumsum(p,axis=0)
    x = r[:,0]
    y = r[:,1]
    plt.plot(x,y)
    plt.axis('equal');

In [85]:

import matplotlib.pylab as pylab
print(pylab.rcParams['figure.figsize'])
pylab.rcParams['figure.figsize'] = 5, 5

[6.0, 4.0]

In [86]:

randwalk(1000,1)

In [87]:

pylab.rcParams['figure.figsize'] = 8, 8
x = np.linspace(0,6*np.pi,100);

plt.subplot(2,2,1)
plt.plot(x,np.sin(x),'r')

plt.subplot(2,2,2)
plt.plot(x,np.cos(x))

plt.subplot(2,2,3)
plt.plot(x,np.sin(2*x))

plt.subplot(2,2,4)
plt.plot(x,np.cos(2*x),'g');

In [88]:

pylab.rcParams['figure.figsize'] = 6, 6

In [89]:

x,y = np.mgrid[-3:3:0.2,-3:3:0.2]

plt.contour(x,y, x**2-y**2-1, 0, colors=['k']);
plt.axis('equal');

In [90]:

from mpl_toolkits.mplot3d import Axes3D
from matplotlib import cm

fig = plt.figure()
ax = fig.add_subplot(111, projection='3d')

ax.plot_surface(x,y,x**2-y**2, cmap=cm.coolwarm, linewidth=0.5, rstride=2, cstride=2);

In [91]:

pylab.rcParams['figure.figsize'] = 6, 4

números pseudoaleatorios y estadística elemental¶

In [92]:

import numpy.random as rnd

rnd.randn(2,3)

Out[92]:

array([[ 1.10109629, -0.72138553, -1.2202826 ],
       [-0.53595346, -0.44037795, -0.98122434]])

In [93]:

r = 2+3*rnd.randn(100)

plt.hist(r,10);

In [94]:

np.mean(r)

Out[94]:

2.2170177852015147

In [95]:

np.std(r)

Out[95]:

2.938161931731035

In [96]:

np.median(r)

Out[96]:

2.44634684308995

In [97]:

plt.plot(np.sort(r),range(1,101));

vectores, matrices y arrays n-dimensionales¶

In [98]:

np.matrix('1 2; 3 5')

Out[98]:

matrix([[1, 2],
        [3, 5]])

In [99]:

m = np.matrix([[1, 2, 3, 4]
              ,[5, 6, 7, 8]
              ,[9,10,11,12]])

In [100]:

Out[100]:

matrix([[ 1,  2,  3,  4],
        [ 5,  6,  7,  8],
        [ 9, 10, 11, 12]])

In [101]:

m.transpose()

Out[101]:

matrix([[ 1,  5,  9],
        [ 2,  6, 10],
        [ 3,  7, 11],
        [ 4,  8, 12]])

In [102]:

m.T

Out[102]:

matrix([[ 1,  5,  9],
        [ 2,  6, 10],
        [ 3,  7, 11],
        [ 4,  8, 12]])

En los arrays de un índice (vectores) no se distinguen filas y columnas:

In [103]:

np.array([1,2,3]).T

Out[103]:

array([1, 2, 3])

pero en los de dos índices (matrices), sí:

In [104]:

np.array([[1,2,3]]).T

Out[104]:

array([[1],
       [2],
       [3]])

Matrices por bloques

In [105]:

np.bmat([[m,         np.zeros([3,3])]
        ,[np.eye(3), np.ones([3,4]) ]])

Out[105]:

matrix([[  1.,   2.,   3.,   4.,   0.,   0.,   0.],
        [  5.,   6.,   7.,   8.,   0.,   0.,   0.],
        [  9.,  10.,  11.,  12.,   0.,   0.,   0.],
        [  1.,   0.,   0.,   1.,   1.,   1.,   1.],
        [  0.,   1.,   0.,   1.,   1.,   1.,   1.],
        [  0.,   0.,   1.,   1.,   1.,   1.,   1.]])

"Automatic broadcasting"

In [106]:

m + np.array([[10],[20],[30]])

Out[106]:

matrix([[11, 12, 13, 14],
        [25, 26, 27, 28],
        [39, 40, 41, 42]])

Extracción de elementos y submatrices

In [107]:

Out[107]:

matrix([[ 1,  2,  3,  4],
        [ 5,  6,  7,  8],
        [ 9, 10, 11, 12]])

In [108]:

m[1,2]

Out[108]:

In [109]:

m[2,1:4]

Out[109]:

matrix([[10, 11, 12]])

In [110]:

m[:2, 2:]

Out[110]:

matrix([[3, 4],
        [7, 8]])

In [111]:

m[[1,0,0,2,1],:]

Out[111]:

matrix([[ 5,  6,  7,  8],
        [ 1,  2,  3,  4],
        [ 1,  2,  3,  4],
        [ 9, 10, 11, 12],
        [ 5,  6,  7,  8]])

Extracción de elementos que cumplen una condición:

In [112]:

n = np.array(range(1,11))

n

Out[112]:

array([ 1,  2,  3,  4,  5,  6,  7,  8,  9, 10])

In [113]:

n < 5

Out[113]:

array([ True,  True,  True,  True, False, False, False, False, False, False], dtype=bool)

In [114]:

n[n<5]

Out[114]:

array([1, 2, 3, 4])

In [115]:

k = np.linspace(1,100,100)

(k ** 2)[np.logical_and(k>10 , k**3 < 2000)]

Out[115]:

array([ 121.,  144.])

Más ejemplos:

In [116]:

s = np.array([1,2,3,4])

In [117]:

type(s)

Out[117]:

numpy.ndarray

In [118]:

np.ndim(s)

Out[118]:

In [119]:

np.shape(s)

Out[119]:

(4,)

In [120]:

sum(s)

Out[120]:

In [121]:

x = np.array([k for k in range(1,11)])

In [122]:

print(x)

[ 1  2  3  4  5  6  7  8  9 10]

In [123]:

np.append(x,[13,14])

Out[123]:

array([ 1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 13, 14])

In [124]:

np.size(x)

Out[124]:

In [125]:

for x in np.linspace(0,1,10):
    print(x)

0.0
0.111111111111
0.222222222222
0.333333333333
0.444444444444
0.555555555556
0.666666666667
0.777777777778
0.888888888889
1.0

In [126]:

np.sqrt(4)

Out[126]:

2.0

álgebra lineal¶

In [127]:

import numpy as np
import scipy.linalg as la

Resuelve

$$ \begin{align*} x + 2y &= 3\\ 3x+4y &= 5 \end{align*} $$

In [128]:

m = np.matrix([[1,2],[3,4]])

In [129]:

Out[129]:

matrix([[1, 2],
        [3, 4]])

In [130]:

np.shape(m)

Out[130]:

(2, 2)

In [131]:

la.inv(m)

Out[131]:

array([[-2. ,  1. ],
       [ 1.5, -0.5]])

In [132]:

x = la.solve(m,[[3],[5]])

x

Out[132]:

array([[-1.],
       [ 2.]])

In [133]:

np.dot(m,x)

Out[133]:

matrix([[ 3.],
        [ 5.]])

In [134]:

m @ x

Out[134]:

matrix([[ 3.],
        [ 5.]])

In [135]:

np.shape(x)

Out[135]:

(2, 1)

In [136]:

np.ndim(x)

Out[136]:

Como ejemplo de uso de las herramientas de álgebra lineal realizaremos el ajuste de un modelo polinomial a unas observaciones ficticias. Encontraremos la solución de mínimo error cuadrático a un sistema de ecuaciones sobredeterminado.

En primer lugar generamos unos datos de prueba artificiales que simulan observaciones contaminadas con ruido de una función no lineal.

In [137]:

x = np.linspace(0,2,30)

y = np.sin(x) + 0.05*rnd.randn(x.size)

plt.plot(x,y,'.');

Vamos a ajustar un modelo del tipo $y = ax^2 + bx + c$. Los coeficientes desconocidos $a$, $b$ y $c$ se pueden obtener resolviendo un sistema de ecuaciones lineales.

La matriz de coeficientes tiene potencias de $x$ hasta el grado que nos interesa.

In [138]:

A = np.vstack([x**2, x, np.ones(x.size)]).transpose()

A

Out[138]:

array([[ 0.        ,  0.        ,  1.        ],
       [ 0.00475624,  0.06896552,  1.        ],
       [ 0.01902497,  0.13793103,  1.        ],
       [ 0.04280618,  0.20689655,  1.        ],
       [ 0.07609988,  0.27586207,  1.        ],
       [ 0.11890606,  0.34482759,  1.        ],
       [ 0.17122473,  0.4137931 ,  1.        ],
       [ 0.23305589,  0.48275862,  1.        ],
       [ 0.30439952,  0.55172414,  1.        ],
       [ 0.38525565,  0.62068966,  1.        ],
       [ 0.47562426,  0.68965517,  1.        ],
       [ 0.57550535,  0.75862069,  1.        ],
       [ 0.68489893,  0.82758621,  1.        ],
       [ 0.80380499,  0.89655172,  1.        ],
       [ 0.93222354,  0.96551724,  1.        ],
       [ 1.07015458,  1.03448276,  1.        ],
       [ 1.2175981 ,  1.10344828,  1.        ],
       [ 1.3745541 ,  1.17241379,  1.        ],
       [ 1.54102259,  1.24137931,  1.        ],
       [ 1.71700357,  1.31034483,  1.        ],
       [ 1.90249703,  1.37931034,  1.        ],
       [ 2.09750297,  1.44827586,  1.        ],
       [ 2.3020214 ,  1.51724138,  1.        ],
       [ 2.51605232,  1.5862069 ,  1.        ],
       [ 2.73959572,  1.65517241,  1.        ],
       [ 2.97265161,  1.72413793,  1.        ],
       [ 3.21521998,  1.79310345,  1.        ],
       [ 3.46730083,  1.86206897,  1.        ],
       [ 3.72889417,  1.93103448,  1.        ],
       [ 4.        ,  2.        ,  1.        ]])

En realidad es una matriz de Vandermonde:

In [139]:

np.vander(x,3)

Out[139]:

array([[ 0.        ,  0.        ,  1.        ],
       [ 0.00475624,  0.06896552,  1.        ],
       [ 0.01902497,  0.13793103,  1.        ],
       [ 0.04280618,  0.20689655,  1.        ],
       [ 0.07609988,  0.27586207,  1.        ],
       [ 0.11890606,  0.34482759,  1.        ],
       [ 0.17122473,  0.4137931 ,  1.        ],
       [ 0.23305589,  0.48275862,  1.        ],
       [ 0.30439952,  0.55172414,  1.        ],
       [ 0.38525565,  0.62068966,  1.        ],
       [ 0.47562426,  0.68965517,  1.        ],
       [ 0.57550535,  0.75862069,  1.        ],
       [ 0.68489893,  0.82758621,  1.        ],
       [ 0.80380499,  0.89655172,  1.        ],
       [ 0.93222354,  0.96551724,  1.        ],
       [ 1.07015458,  1.03448276,  1.        ],
       [ 1.2175981 ,  1.10344828,  1.        ],
       [ 1.3745541 ,  1.17241379,  1.        ],
       [ 1.54102259,  1.24137931,  1.        ],
       [ 1.71700357,  1.31034483,  1.        ],
       [ 1.90249703,  1.37931034,  1.        ],
       [ 2.09750297,  1.44827586,  1.        ],
       [ 2.3020214 ,  1.51724138,  1.        ],
       [ 2.51605232,  1.5862069 ,  1.        ],
       [ 2.73959572,  1.65517241,  1.        ],
       [ 2.97265161,  1.72413793,  1.        ],
       [ 3.21521998,  1.79310345,  1.        ],
       [ 3.46730083,  1.86206897,  1.        ],
       [ 3.72889417,  1.93103448,  1.        ],
       [ 4.        ,  2.        ,  1.        ]])

El lado derecho del sistema es directamente el vector con los valores de $y$, la variable independiente del modelo.

In [140]:

B = np.matrix(y).T

B

Out[140]:

matrix([[-0.03017562],
        [ 0.12102307],
        [ 0.20472809],
        [ 0.22336368],
        [ 0.29015562],
        [ 0.28786901],
        [ 0.39513457],
        [ 0.46121004],
        [ 0.49673906],
        [ 0.55230101],
        [ 0.65127628],
        [ 0.65792914],
        [ 0.77057348],
        [ 0.79241806],
        [ 0.76122737],
        [ 0.94307161],
        [ 0.8812106 ],
        [ 0.89471824],
        [ 0.96952483],
        [ 0.95853218],
        [ 0.99921169],
        [ 0.96480888],
        [ 1.00485747],
        [ 0.94585026],
        [ 1.07448525],
        [ 0.94431881],
        [ 0.96184194],
        [ 0.96520362],
        [ 0.96336003],
        [ 0.93290306]])

El sistema que hay que resolver está sobredeterminado: tiene solo tres incógnitas y tantas ecuaciones como observaciones de la función.

$$A \begin{bmatrix}a\\b\\c\end{bmatrix}= B$$

La solución de mínimo error cuadrático para los coeficientes del modelo se obtiene de manera directa con el operador de división matricial.

In [141]:

sol = la.lstsq(A,B)[0]

sol

Out[141]:

array([[-0.36807329],
       [ 1.21964859],
       [-0.01910038]])

In [142]:

#ye = np.dot(A,sol)

ye = A @ sol

plt.plot(x,y,'.',x,ye,'r');

Se puede experimentar con polinomios de mayor o menor grado.

Otra forma de llegar a esta solución es resolver las "ecuaciones normales" del sistema $Ax=b$. Como está sobredeterminado, vamos a minimizar el error cuadrático $E=||Ax-b||^2$, que es matemáticamente tratable. Hay que resolver

$$\frac{\partial E}{\partial x_k} =0 $$

Se puede comprobar que la solución es

$$x = A^+ b$$

donde $A^+$ es la "pseudoinversa" de $A$:

$$A^+ = (A^TA)^{-1}A^T$$

In [143]:

la.inv(A.T @ A) @ A.T @ B

Out[143]:

matrix([[-0.36807329],
        [ 1.21964859],
        [-0.01910038]])

In [144]:

la.pinv(A) @ B

Out[144]:

matrix([[-0.36807329],
        [ 1.21964859],
        [-0.01910038]])

solución numérica de ecuaciones no lineales¶

Resuelve

$$x^4=16$$

In [145]:

import scipy as sci

sci.roots([1,0,0,0,-16])

Out[145]:

array([ -2.00000000e+00+0.j,   1.66533454e-16+2.j,   1.66533454e-16-2.j,
         2.00000000e+00+0.j])

Resuelve

$$sin(x)+cos(2x)=0$$

In [146]:

import scipy.optimize as opt

opt.fsolve(lambda x: sci.sin(x) + sci.cos(2*x), 0)

Out[146]:

array([-0.52359878])

Resuelve

$$ \begin{align*} x^2 - 3y &= 10\\ sin(x)+y &= 5 \end{align*} $$

In [147]:

def fun(z):
    x,y = z
    return [ x**2 - 3*y - 10
           , sci.sin(x) + y - 5]

opt.fsolve(fun,[0.1,-0.1])

Out[147]:

array([ 5.2511881 ,  5.85832548])

minimización¶

Encuentra $(x,y)$ que minimiza $(x-1)^2 + (y-2)^2-x+3y$

In [148]:

def fun(z):
    x,y = z
    return (x-1)**2 + (y-2)**2 - x + 3*y

opt.minimize(fun,[0.1,-0.1])

Out[148]:

      fun: 2.500000000000014
 hess_inv: array([[ 0.57758622, -0.18103452],
       [-0.18103452,  0.92241375]])
      jac: array([  0.00000000e+00,  -2.38418579e-07])
  message: 'Optimization terminated successfully.'
     nfev: 12
      nit: 2
     njev: 3
   status: 0
  success: True
        x: array([ 1.49999999,  0.49999988])

In [149]:

r = np.linspace(0,2,30)
x,y = np.meshgrid(r,r)

fig = plt.figure()
ax = fig.add_subplot(111, projection='3d')

z = fun((x,y))

ax.plot_surface(x,y,z, cmap=cm.coolwarm, linewidth=0.5, rstride=2, cstride=2);

derivación numérica¶

Calcula una aproximación numérica para $f'(2)$ cuando $f(x) = \sin(2x)*\exp(\cos(x))$

In [150]:

from scipy.misc import derivative

derivative(lambda x: sci.sin(2*x)*sci.exp(sci.cos(x)),2,1E-6)

Out[150]:

-0.40836700757052036

In [151]:

(lambda x: (-np.sin(x)*np.sin(2*x) + 2*np.cos(2*x))*np.exp(np.cos(x)))(2)

Out[151]:

-0.40836700756782335

integración numérica¶

Calcula una aproximación numérica a la integral definida

$$\int_0^1 \frac{4}{1+x^2}dx$$

In [152]:

from scipy.integrate import quad

quad(lambda x: 4/(1+x**2),0,1)

Out[152]:

(3.1415926535897936, 3.4878684980086326e-14)

ecuaciones diferenciales¶

Resuelve

$$\ddot{x}+0.95x+0.1\dot{x}=0$$

para $x(0)=10$, $\dot{x}(0)=0, t\in[0,20]$

In [153]:

from scipy.integrate import odeint

def xdot(z,t):
    x,v = z
    return [v,-0.95*x-0.1*v]

t = np.linspace(0,20,1000)
r = odeint(xdot,[10,0],t)
# plt.plot(r);
plt.plot(t,r[:,0],t,r[:,1]);

In [154]:

plt.plot(r[:,0],r[:,1]);

cálculo simbólico¶

sympy

estilo¶

Podemos "tunearlo":

In [155]:

from IPython.display import HTML

HTML(open('../nb1.css').read())

Out[155]: