TensorStorage

class torchrl.data.replay_buffers.TensorStorage(storage, max_size=None, *, device: device = 'cpu', ndim: int = 1, compilable: bool = False)

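A storage for tensors and tensordicts.

Parameters:

storage (tensor or TensorDict) – the data buffer to be used.
max_size (int) – the size of the storage, i.e. the maximum number of elements stored in the buffer.

Keyword Arguments:

device (torch.device, optional) – the device on which the sampled tensors are stored and to which they are sent. Defaults to torch.device("cpu"). If "auto" is passed, the device is inferred from the first batch of data passed to the storage. This is not enabled by default, to avoid data being placed on the GPU by mistake and causing out-of-memory (OOM) errors.
ndim (int, optional) – the number of leading dimensions counted when measuring the storage size. For instance, a storage of shape [3, 4] has a capacity of 3 if ndim=1 and 12 if ndim=2. Defaults to 1.
compilable (bool, optional) – whether the storage is compilable. If True, the writer cannot be shared between multiple processes. Defaults to False.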
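The capacity rule described for ndim can be checked by hand with a few lines of plain Python. This is only an illustrative sketch of the arithmetic, not TorchRL's internal code: the helper name storage_capacity is hypothetical, and it assumes that the capacity is the product of the first ndim dimensions of the storage shape, as the [3, 4] example suggests.

```python
from math import prod


def storage_capacity(shape, ndim=1):
    """Return the number of indexable elements when the first ``ndim``
    dimensions of ``shape`` are counted (hypothetical helper)."""
    if not 1 <= ndim <= len(shape):
        raise ValueError(f"ndim must be between 1 and {len(shape)}, got {ndim}")
    # Multiply the sizes of the leading ``ndim`` dimensions.
    return prod(shape[:ndim])


# The example from the parameter description: a storage of shape [3, 4].
cap_1d = storage_capacity((3, 4), ndim=1)
cap_2d = storage_capacity((3, 4), ndim=2)
print(cap_1d, cap_2d)
```

With ndim=1 only the first dimension is indexable, so the remaining dimensions are treated as part of each stored element.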

Examples

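>>> import torch
>>> from tensordict import TensorDict
>>> from torchrl.data.replay_buffers import TensorStorage
>>> data = TensorDict({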
...     "some data": torch.randn(10, 11),
...     ("some", "nested", "data"): torch.randn(10, 11, 12),
... }, batch_size=[10, 11])
>>> storage = TensorStorage(data)
>>> len(storage)  # only the first dimension is considered as indexable
10
>>> storage.get(0)
TensorDict(
    fields={
        some data: Tensor(shape=torch.Size([11]), device=cpu, dtype=torch.float32, is_shared=False),
        some: TensorDict(
            fields={
                nested: TensorDict(
                    fields={
                        data: Tensor(shape=torch.Size([11, 12]), device=cpu, dtype=torch.float32, is_shared=False)},
                    batch_size=torch.Size([11]),
                    device=None,
                    is_shared=False)},
            batch_size=torch.Size([11]),
            device=None,
            is_shared=False)},
    batch_size=torch.Size([11]),
    device=None,
    is_shared=False)
>>> storage.set(0, storage.get(0).zero_()) # zeros the data along index ``0``

This class also supports tensorclass data.

Examples

>>> from tensordict import tensorclass
>>> @tensorclass
... class MyClass:
...     foo: torch.Tensor
...     bar: torch.Tensor
>>> data = MyClass(foo=torch.randn(10, 11), bar=torch.randn(10, 11, 12), batch_size=[10, 11])
>>> storage = TensorStorage(data)
>>> storage.get(0)
MyClass(
    bar=Tensor(shape=torch.Size([11, 12]), device=cpu, dtype=torch.float32, is_shared=False),
    foo=Tensor(shape=torch.Size([11]), device=cpu, dtype=torch.float32, is_shared=False),
    batch_size=torch.Size([11]),
    device=None,
    is_shared=False)

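attach(buffer: Any) → None

This function attaches a sampler to this storage.

Buffers that read from this storage must be registered as attached entities by calling this method. This guarantees that when data in the storage changes, components are made aware of the change even if the storage is shared with other buffers (e.g. priority samplers).

Parameters: buffer – the object that reads data from this storage.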
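The notification mechanism described for attach() resembles an observer pattern, which the following self-contained sketch illustrates. It does not use TorchRL: ToyStorage, ToySampler and the mark_update hook are hypothetical names chosen for illustration, and the real classes may notify their attached entities differently.

```python
class ToySampler:
    """Hypothetical reader that records which indices were overwritten."""

    def __init__(self):
        self.updated = []

    def mark_update(self, index):
        # Called by the storage whenever the data at ``index`` changes.
        self.updated.append(index)


class ToyStorage:
    """Hypothetical list-backed storage that notifies attached readers."""

    def __init__(self, size):
        self._data = [None] * size
        self._attached = []

    def attach(self, buffer):
        # Register each reader once, even if attach() is called repeatedly.
        if buffer not in self._attached:
            self._attached.append(buffer)

    def set(self, index, value):
        self._data[index] = value
        # Every attached reader learns about the write, even when the
        # storage is shared between several of them.
        for entity in self._attached:
            entity.mark_update(index)

    def get(self, index):
        return self._data[index]


storage = ToyStorage(4)
sampler_a, sampler_b = ToySampler(), ToySampler()
storage.attach(sampler_a)
storage.attach(sampler_b)
storage.attach(sampler_a)  # second call is a no-op
storage.set(2, "transition")
print(sampler_a.updated, sampler_b.updated)
```

Because both samplers are attached to the same storage, a single write is reported to each of them, which is the property a priority sampler sharing a storage would rely on.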

dump(*args, **kwargs): Alias for dumps().

load(*args, **kwargs): Alias for loads().

save(*args, **kwargs): Alias for dumps().
