The gradient for this tensor will be accumulated into the .grad attribute.
import torch
x = torch.ones(1, requires_grad=True); x
tensor([ 1.])
y = x**2
z = x**3
First y.backward(), then z.backward():
y.backward()
x.grad
tensor([ 2.])
z.backward()
x.grad
tensor([ 5.])
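The 5 is just the two derivatives accumulated: dy/dx = 2x = 2 and dz/dx = 3*x**2 = 3 at x = 1. A quick check in the same session (a sketch reusing the x above; detach() keeps the check out of the graph):
(2*x + 3*x**2).detach()
tensor([ 5.])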
Now the other order: z.backward() first, then y.backward():
x = torch.ones(1, requires_grad=True)
y = x**2
z = x**3
z.backward()
x.grad
tensor([ 3.])
y.backward()
x.grad
tensor([ 5.])
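Either order ends at the same total, because backward() only ever adds into .grad. To get each gradient on its own without mutating .grad, torch.autograd.grad returns it directly; a minimal sketch with fresh tensors:
import torch
x = torch.ones(1, requires_grad=True)
y = x**2
z = x**3
gy, = torch.autograd.grad(y, x)  # dy/dx = 2x -> tensor([ 2.])
gz, = torch.autograd.grad(z, x)  # dz/dx = 3*x**2 -> tensor([ 3.])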
Finally, y.backward(), then reset the gradient with x.grad.zero_() before calling z.backward():
x = torch.ones(1, requires_grad=True)
y = x**2
z = x**3
y.backward()
x.grad
tensor([ 2.])
x.grad.zero_()
tensor([ 0.])
z.backward()
x.grad
tensor([ 3.])
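This manual zero_() is what a training loop does on every iteration, usually through the optimizer; a minimal sketch with a toy quadratic loss (the loss and learning rate here are illustrative, not from the original):
import torch
x = torch.ones(1, requires_grad=True)
opt = torch.optim.SGD([x], lr=0.1)
for _ in range(3):
    opt.zero_grad()      # reset x.grad so steps do not mix
    loss = (x**2).sum()  # toy scalar loss
    loss.backward()      # writes d(loss)/dx into x.grad
    opt.step()           # x <- x - lr * x.grad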
The same accumulation happens element-wise. When y is not a scalar, backward() needs an explicit gradient argument of the same shape as y:
x = torch.rand((2, 1), requires_grad=True); x
tensor([[ 0.3725], [ 0.4378]])
y = torch.zeros(3, 1)
y[0] = x[0]**2
y[1] = x[1]**3
y[2] = x[1]**4
y.backward(gradient=torch.ones(y.size()))
Cumulative grad of x[0] and x[1], respectively:
x.grad
tensor([[ 0.7450], [ 0.9105]])
Check against the analytic derivatives:
2*x[0], 3*x[1]**2, 4*x[1]**3
(tensor([ 0.7450]), tensor([ 0.5749]), tensor([ 0.3356]))
2*x[0], 3*x[1]**2 + 4*x[1]**3
(tensor([ 0.7450]), tensor([ 0.9105]))
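backward(gradient=v) computes the vector-Jacobian product v^T J, not the full Jacobian; passing all ones therefore sums each output's gradient into x.grad, which is exactly the accumulation above. A one-hot v isolates a single output; a sketch under the same setup:
import torch
x = torch.rand((2, 1), requires_grad=True)
y = torch.zeros(3, 1)
y[0] = x[0]**2
y[1] = x[1]**3
y[2] = x[1]**4
v = torch.zeros(3, 1)
v[1] = 1.0              # one-hot: select y[1] only
y.backward(gradient=v)
x.grad                  # [[ 0.], [ 3*x[1]**2 ]]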