.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "tutorials/data_fashion.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_tutorials_data_fashion.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_tutorials_data_fashion.py:


Using TensorDict for datasets
=============================

.. GENERATED FROM PYTHON SOURCE LINES 7-12

In this tutorial we demonstrate how ``TensorDict`` can be used to
efficiently and transparently load and manage data inside a training
pipeline. The tutorial is based heavily on the `PyTorch Quickstart
Tutorial <https://pytorch.org/tutorials/beginner/basics/quickstart_tutorial.html>`__,
but modified to demonstrate use of ``TensorDict``.

.. GENERATED FROM PYTHON SOURCE LINES 12-26

.. code-block:: Python


    import torch
    import torch.nn as nn

    from tensordict import MemoryMappedTensor, TensorDict
    from torch.utils.data import DataLoader
    from torchvision import datasets
    from torchvision.transforms import ToTensor

    device = "cuda" if torch.cuda.is_available() else "cpu"
    print(f"Using device: {device}")


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Using device: cpu


.. GENERATED FROM PYTHON SOURCE LINES 27-31

The ``torchvision.datasets`` module contains a number of convenient pre-prepared
datasets. In this tutorial we'll use the relatively simple FashionMNIST dataset. Each
image is an item of clothing, the objective is to classify the type of clothing in
the image (e.g. "Bag", "Sneaker" etc.).

.. GENERATED FROM PYTHON SOURCE LINES 31-45

.. code-block:: Python


    training_data = datasets.FashionMNIST(
        root="data",
        train=True,
        download=True,
        transform=ToTensor(),
    )
    test_data = datasets.FashionMNIST(
        root="data",
        train=False,
        download=True,
        transform=ToTensor(),
    )


.. GENERATED FROM PYTHON SOURCE LINES 46-52

We will create two tensordicts, one each for the training and test data. We create
memory-mapped tensors to hold the data. This will allow us to efficiently load
batches of transformed data from disk rather than repeatedly load and transform
individual images.

First we create the :class:`~tensordict.MemoryMappedTensor` containers.

.. GENERATED FROM PYTHON SOURCE LINES 52-76

.. code-block:: Python


    training_data_td = TensorDict(
        {
            "images": MemoryMappedTensor.empty(
                (len(training_data), *training_data[0][0].squeeze().shape),
                dtype=torch.float32,
            ),
            "targets": MemoryMappedTensor.empty((len(training_data),), dtype=torch.int64),
        },
        batch_size=[len(training_data)],
        device=device,
    )
    test_data_td = TensorDict(
        {
            "images": MemoryMappedTensor.empty(
                (len(test_data), *test_data[0][0].squeeze().shape), dtype=torch.float32
            ),
            "targets": MemoryMappedTensor.empty((len(test_data),), dtype=torch.int64),
        },
        batch_size=[len(test_data)],
        device=device,
    )


.. GENERATED FROM PYTHON SOURCE LINES 77-80

Then we can iterate over the data to populate the memory-mapped tensors. This takes a
bit of time, but performing the transforms up-front will save repeated effort during
training later.

.. GENERATED FROM PYTHON SOURCE LINES 80-87

.. code-block:: Python


    for i, (img, label) in enumerate(training_data):
        training_data_td[i] = TensorDict({"images": img, "targets": label}, [])

    for i, (img, label) in enumerate(test_data):
        test_data_td[i] = TensorDict({"images": img, "targets": label}, [])


.. GENERATED FROM PYTHON SOURCE LINES 88-98

DataLoaders
----------------

We'll create DataLoaders from the ``torchvision``-provided Datasets, as well as from
our memory-mapped TensorDicts.

Since ``TensorDict`` implements ``__len__`` and ``__getitem__`` (and also
``__getitems__``) we can use it like a map-style Dataset and create a ``DataLoader``
directly from it. Note that because ``TensorDict`` can already handle batched indices,
there is no need for collation, so we pass the identity function as ``collate_fn``.

.. GENERATED FROM PYTHON SOURCE LINES 98-111

.. code-block:: Python


    batch_size = 64

    train_dataloader = DataLoader(training_data, batch_size=batch_size)  # noqa: TOR401
    test_dataloader = DataLoader(test_data, batch_size=batch_size)  # noqa: TOR401

    train_dataloader_td = DataLoader(  # noqa: TOR401
        training_data_td, batch_size=batch_size, collate_fn=lambda x: x
    )
    test_dataloader_td = DataLoader(  # noqa: TOR401
        test_data_td, batch_size=batch_size, collate_fn=lambda x: x
    )


.. GENERATED FROM PYTHON SOURCE LINES 112-118

Model
-------

We use the same model from the
`Quickstart Tutorial <https://pytorch.org/tutorials/beginner/basics/quickstart_tutorial.html>`__.


.. GENERATED FROM PYTHON SOURCE LINES 118-142

.. code-block:: Python


    class Net(nn.Module):
        def __init__(self):
            super().__init__()
            self.flatten = nn.Flatten()
            self.linear_relu_stack = nn.Sequential(
                nn.Linear(28 * 28, 512),
                nn.ReLU(),
                nn.Linear(512, 512),
                nn.ReLU(),
                nn.Linear(512, 10),
            )

        def forward(self, x):
            x = self.flatten(x)
            logits = self.linear_relu_stack(x)
            return logits


    model = Net().to(device)
    model_td = Net().to(device)
    model, model_td


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    (Net(
      (flatten): Flatten(start_dim=1, end_dim=-1)
      (linear_relu_stack): Sequential(
        (0): Linear(in_features=784, out_features=512, bias=True)
        (1): ReLU()
        (2): Linear(in_features=512, out_features=512, bias=True)
        (3): ReLU()
        (4): Linear(in_features=512, out_features=10, bias=True)
      )
    ), Net(
      (flatten): Flatten(start_dim=1, end_dim=-1)
      (linear_relu_stack): Sequential(
        (0): Linear(in_features=784, out_features=512, bias=True)
        (1): ReLU()
        (2): Linear(in_features=512, out_features=512, bias=True)
        (3): ReLU()
        (4): Linear(in_features=512, out_features=10, bias=True)
      )
    ))


.. GENERATED FROM PYTHON SOURCE LINES 143-149

Optimizing the parameters
---------------------------------

We'll optimise the parameters of the model using stochastic gradient descent and
cross-entropy loss.


.. GENERATED FROM PYTHON SOURCE LINES 149-174

.. code-block:: Python


    loss_fn = nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
    optimizer_td = torch.optim.SGD(model_td.parameters(), lr=1e-3)


    def train(dataloader, model, loss_fn, optimizer):
        size = len(dataloader.dataset)
        model.train()

        for batch, (X, y) in enumerate(dataloader):
            X, y = X.to(device), y.to(device)

            pred = model(X)
            loss = loss_fn(pred, y)

            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

            if batch % 100 == 0:
                loss, current = loss.item(), batch * len(X)
                print(f"loss: {loss:>7f} [{current:>5d}/{size:>5d}]")


.. GENERATED FROM PYTHON SOURCE LINES 175-178

The training loop for our ``TensorDict``-based DataLoader is very similar, we just
adjust how we unpack the data to the more explicit key-based retrieval offered by
``TensorDict``. The ``.contiguous()`` method loads the data stored in the memmap tensor.

.. GENERATED FROM PYTHON SOURCE LINES 178-264

.. code-block:: Python


    def train_td(dataloader, model, loss_fn, optimizer):
        size = len(dataloader.dataset)
        model.train()

        for batch, data in enumerate(dataloader):
            X, y = data["images"].contiguous(), data["targets"].contiguous()

            pred = model(X)
            loss = loss_fn(pred, y)

            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

            if batch % 100 == 0:
                loss, current = loss.item(), batch * len(X)
                print(f"loss: {loss:>7f} [{current:>5d}/{size:>5d}]")


    def test(dataloader, model, loss_fn):
        size = len(dataloader.dataset)
        num_batches = len(dataloader)
        model.eval()
        test_loss, correct = 0, 0
        with torch.no_grad():
            for X, y in dataloader:
                X, y = X.to(device), y.to(device)

                pred = model(X)

                test_loss += loss_fn(pred, y).item()
                correct += (pred.argmax(1) == y).type(torch.float).sum().item()

        test_loss /= num_batches
        correct /= size

        print(
            f"Test Error: \n Accuracy: {(100 * correct):>0.1f}%, Avg loss: {test_loss:>8f} \n"
        )


    def test_td(dataloader, model, loss_fn):
        size = len(dataloader.dataset)
        num_batches = len(dataloader)
        model.eval()
        test_loss, correct = 0, 0
        with torch.no_grad():
            for batch in dataloader:
                X, y = batch["images"].contiguous(), batch["targets"].contiguous()

                pred = model(X)

                test_loss += loss_fn(pred, y).item()
                correct += (pred.argmax(1) == y).type(torch.float).sum().item()

        test_loss /= num_batches
        correct /= size

        print(
            f"Test Error: \n Accuracy: {(100 * correct):>0.1f}%, Avg loss: {test_loss:>8f} \n"
        )


    for d in train_dataloader_td:
        print(d)
        break

    import time

    t0 = time.time()
    epochs = 5
    for t in range(epochs):
        print(f"Epoch {t + 1}\n-------------------------")
        train_td(train_dataloader_td, model_td, loss_fn, optimizer_td)
        test_td(test_dataloader_td, model_td, loss_fn)
    print(f"TensorDict training done! time: {time.time() - t0: 4.4f} s")

    t0 = time.time()
    epochs = 5
    for t in range(epochs):
        print(f"Epoch {t + 1}\n-------------------------")
        train(train_dataloader, model, loss_fn, optimizer)
        test(test_dataloader, model, loss_fn)
    print(f"Training done! time: {time.time() - t0: 4.4f} s")


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    TensorDict(
        fields={
            images: Tensor(shape=torch.Size([64, 28, 28]), device=cpu, dtype=torch.float32, is_shared=False),
            targets: Tensor(shape=torch.Size([64]), device=cpu, dtype=torch.int64, is_shared=False)},
        batch_size=torch.Size([64]),
        device=cpu,
        is_shared=False)
    Epoch 1
    -------------------------
    loss: 2.300006 [    0/60000]
    loss: 2.285770 [ 6400/60000]
    loss: 2.273347 [12800/60000]
    loss: 2.267766 [19200/60000]
    loss: 2.252256 [25600/60000]
    loss: 2.225189 [32000/60000]
    loss: 2.224440 [38400/60000]
    loss: 2.200019 [44800/60000]
    loss: 2.186608 [51200/60000]
    loss: 2.161410 [57600/60000]
    Test Error: 
     Accuracy: 48.0%, Avg loss: 2.156438 

    Epoch 2
    -------------------------
    loss: 2.157808 [    0/60000]
    loss: 2.149556 [ 6400/60000]
    loss: 2.098034 [12800/60000]
    loss: 2.112672 [19200/60000]
    loss: 2.057320 [25600/60000]
    loss: 1.999760 [32000/60000]
    loss: 2.020871 [38400/60000]
    loss: 1.951849 [44800/60000]
    loss: 1.948923 [51200/60000]
    loss: 1.876193 [57600/60000]
    Test Error: 
     Accuracy: 58.0%, Avg loss: 1.880293 

    Epoch 3
    -------------------------
    loss: 1.905531 [    0/60000]
    loss: 1.879487 [ 6400/60000]
    loss: 1.767206 [12800/60000]
    loss: 1.805723 [19200/60000]
    loss: 1.689224 [25600/60000]
    loss: 1.644271 [32000/60000]
    loss: 1.661640 [38400/60000]
    loss: 1.577187 [44800/60000]
    loss: 1.597238 [51200/60000]
    loss: 1.487815 [57600/60000]
    Test Error: 
     Accuracy: 61.6%, Avg loss: 1.513000 

    Epoch 4
    -------------------------
    loss: 1.574449 [    0/60000]
    loss: 1.546283 [ 6400/60000]
    loss: 1.397252 [12800/60000]
    loss: 1.465600 [19200/60000]
    loss: 1.347920 [25600/60000]
    loss: 1.343926 [32000/60000]
    loss: 1.354034 [38400/60000]
    loss: 1.292370 [44800/60000]
    loss: 1.324001 [51200/60000]
    loss: 1.220044 [57600/60000]
    Test Error: 
     Accuracy: 63.2%, Avg loss: 1.251282 

    Epoch 5
    -------------------------
    loss: 1.323417 [    0/60000]
    loss: 1.312863 [ 6400/60000]
    loss: 1.144016 [12800/60000]
    loss: 1.245970 [19200/60000]
    loss: 1.128958 [25600/60000]
    loss: 1.148559 [32000/60000]
    loss: 1.164331 [38400/60000]
    loss: 1.114616 [44800/60000]
    loss: 1.150293 [51200/60000]
    loss: 1.061169 [57600/60000]
    Test Error: 
     Accuracy: 64.5%, Avg loss: 1.088204 

    TensorDict training done! time:  8.5924 s
    Epoch 1
    -------------------------
    loss: 2.301796 [    0/60000]
    loss: 2.296222 [ 6400/60000]
    loss: 2.274664 [12800/60000]
    loss: 2.271216 [19200/60000]
    loss: 2.244895 [25600/60000]
    loss: 2.202356 [32000/60000]
    loss: 2.224274 [38400/60000]
    loss: 2.176206 [44800/60000]
    loss: 2.179168 [51200/60000]
    loss: 2.142767 [57600/60000]
    Test Error: 
     Accuracy: 30.6%, Avg loss: 2.141736 

    Epoch 2
    -------------------------
    loss: 2.151084 [    0/60000]
    loss: 2.152220 [ 6400/60000]
    loss: 2.086757 [12800/60000]
    loss: 2.107814 [19200/60000]
    loss: 2.052995 [25600/60000]
    loss: 1.971206 [32000/60000]
    loss: 2.017657 [38400/60000]
    loss: 1.923631 [44800/60000]
    loss: 1.940881 [51200/60000]
    loss: 1.863185 [57600/60000]
    Test Error: 
     Accuracy: 45.4%, Avg loss: 1.867750 

    Epoch 3
    -------------------------
    loss: 1.901070 [    0/60000]
    loss: 1.880820 [ 6400/60000]
    loss: 1.754982 [12800/60000]
    loss: 1.804825 [19200/60000]
    loss: 1.693071 [25600/60000]
    loss: 1.634619 [32000/60000]
    loss: 1.671293 [38400/60000]
    loss: 1.562100 [44800/60000]
    loss: 1.604850 [51200/60000]
    loss: 1.497533 [57600/60000]
    Test Error: 
     Accuracy: 58.4%, Avg loss: 1.514449 

    Epoch 4
    -------------------------
    loss: 1.582409 [    0/60000]
    loss: 1.553058 [ 6400/60000]
    loss: 1.396988 [12800/60000]
    loss: 1.474941 [19200/60000]
    loss: 1.361917 [25600/60000]
    loss: 1.348914 [32000/60000]
    loss: 1.369752 [38400/60000]
    loss: 1.284704 [44800/60000]
    loss: 1.332423 [51200/60000]
    loss: 1.236008 [57600/60000]
    Test Error: 
     Accuracy: 63.2%, Avg loss: 1.256972 

    Epoch 5
    -------------------------
    loss: 1.332691 [    0/60000]
    loss: 1.318964 [ 6400/60000]
    loss: 1.147880 [12800/60000]
    loss: 1.258536 [19200/60000]
    loss: 1.141736 [25600/60000]
    loss: 1.156333 [32000/60000]
    loss: 1.180326 [38400/60000]
    loss: 1.107780 [44800/60000]
    loss: 1.158038 [51200/60000]
    loss: 1.077620 [57600/60000]
    Test Error: 
     Accuracy: 64.8%, Avg loss: 1.093794 

    Training done! time:  34.9433 s


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 56.763 seconds)


.. _sphx_glr_download_tutorials_data_fashion.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: data_fashion.ipynb <data_fashion.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: data_fashion.py <data_fashion.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: data_fashion.zip <data_fashion.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_