OpenMLEnv

torchrl.envs.OpenMLEnv(*args, **kwargs)[source]

An environment interface to OpenML datasets, intended for bandit settings.

Documentation: https://www.openml.org/search?type=data

Scikit-learn interface: https://scikit-learn.cn/stable/modules/generated/sklearn.datasets.fetch_openml.html

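Parameters:

dataset_name (str) – the name of the dataset to load. The following datasets are supported: "adult_num", "adult_onehot", "mushroom_num", "mushroom_onehot", "covertype", "shuttle" and "magic".
device (torch.device or compatible, optional) – the device on which the input and output data are expected to live. Defaults to "cpu".
batch_size (torch.Size or compatible, optional) – the batch size of the environment, i.e. the number of elements sampled and returned when reset() is called. Defaults to an empty batch size, i.e. one element is sampled at a time.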

Variables:

available_envs (List[str]) – the list of environments that can be built by this class.

Examples

>>> from torchrl.envs import OpenMLEnv
>>> env = OpenMLEnv("adult_onehot", batch_size=[2, 3])
>>> print(env.reset())
TensorDict(
    fields={
        done: Tensor(shape=torch.Size([2, 3, 1]), device=cpu, dtype=torch.bool, is_shared=False),
        observation: Tensor(shape=torch.Size([2, 3, 106]), device=cpu, dtype=torch.float32, is_shared=False),
        reward: Tensor(shape=torch.Size([2, 3, 1]), device=cpu, dtype=torch.float32, is_shared=False),
        y: Tensor(shape=torch.Size([2, 3]), device=cpu, dtype=torch.int64, is_shared=False)},
    batch_size=torch.Size([2, 3]),
    device=cpu,
    is_shared=False)
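
The shapes in the output above follow a simple pattern: the batch size is prepended to each entry's own trailing dimensions. The sketch below reproduces that bookkeeping in plain Python, with no TorchRL dependency. `expected_shapes` is a hypothetical helper and not part of TorchRL. The feature count of 106 is read off the "adult_onehot" output above, and other datasets would have a different value. The result for an empty batch size is an assumption extrapolated from the same pattern.

```python
import math


def expected_shapes(batch_size, n_features):
    """Hypothetical helper (not a TorchRL API).

    Mirrors the shape pattern visible in the reset() output above:
    every entry starts with the batch dimensions.
    """
    batch = tuple(batch_size)
    return {
        # Number of dataset elements sampled per reset();
        # math.prod(()) is 1, matching "one element at a time".
        "n_samples": math.prod(batch),
        "observation": batch + (n_features,),
        "reward": batch + (1,),
        "done": batch + (1,),
        "y": batch,
    }


# 106 features is taken from the "adult_onehot" example output.
shapes = expected_shapes([2, 3], n_features=106)
print(shapes["n_samples"])
print(shapes["observation"])
```

With batch_size=[2, 3], a single reset() therefore draws 2 × 3 = 6 elements from the dataset.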
