File I/O¶

In [1]:

import torch
from torch import nn
from torch.nn import functional as F

Loading and Saving Tensors

In [3]:

x = torch.arange(4)
torch.save(x, 'x-file')

x2 = torch.load('x-file')
x2

Out[3]:

tensor([0, 1, 2, 3])

Store a list of tensors and read them back into memory

In [4]:

y = torch.zeros(4)
torch.save([x, y],'x-files')
x2, y2 = torch.load('x-files')
(x2, y2)

Out[4]:

(tensor([0, 1, 2, 3]), tensor([0., 0., 0., 0.]))

Write and read a dictionary that maps from strings to tensors

In [5]:

mydict = {'x': x, 'y': y}
torch.save(mydict, 'mydict')
mydict2 = torch.load('mydict')
mydict2

Out[5]:

{'x': tensor([0, 1, 2, 3]), 'y': tensor([0., 0., 0., 0.])}

Loading and Saving Model Parameters Let's start with our familiar MLP

In [6]:

class MLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.hidden = nn.LazyLinear(256)
        self.output = nn.LazyLinear(10)

    def forward(self, x):
        return self.output(F.relu(self.hidden(x)))

net = MLP()
X = torch.randn(size=(2, 20))
Y = net(X)

Store the parameters of the model as a file

In [7]:

torch.save(net.state_dict(), 'mlp.params')

Read the parameters stored in the file directly

In [8]:

clone = MLP()
clone.load_state_dict(torch.load('mlp.params'))
clone.eval()

Out[8]:

MLP(
  (hidden): LazyLinear(in_features=0, out_features=256, bias=True)
  (output): LazyLinear(in_features=0, out_features=10, bias=True)
)

In [9]:

Y_clone = clone(X)
Y_clone == Y

Out[9]:

tensor([[True, True, True, True, True, True, True, True, True, True],
        [True, True, True, True, True, True, True, True, True, True]])