NormalParamWrapper¶
- class torchrl.modules.NormalParamWrapper(operator: Module, scale_mapping: str = 'biased_softplus_1.0', scale_lb: Number = 0.0001)[source]¶
正态分布参数的包装器。
- 参数:
operator (nn.Module) – 运算符,其输出将在位置和尺度参数中进行转换
scale_mapping (str, optional) – 与标准差一起使用的正向映射函数。默认值 = “biased_softplus_1.0”(即 softplus 映射,具有偏置,使得 fn(0.0) = 1.0)选项:“softplus”、“exp”、“relu”、“biased_softplus_1”;
scale_lb (Number, optional) – 方差可以采用的最小值。默认值为 1e-4。
示例
>>> from torch import nn >>> import torch >>> module = nn.Linear(3, 4) >>> module_normal = NormalParamWrapper(module) >>> tensor = torch.randn(3) >>> loc, scale = module_normal(tensor) >>> print(loc.shape, scale.shape) torch.Size([2]) torch.Size([2]) >>> assert (scale > 0).all() >>> # with modules that return more than one tensor >>> module = nn.LSTM(3, 4) >>> module_normal = NormalParamWrapper(module) >>> tensor = torch.randn(4, 2, 3) >>> loc, scale, others = module_normal(tensor) >>> print(loc.shape, scale.shape) torch.Size([4, 2, 2]) torch.Size([4, 2, 2]) >>> assert (scale > 0).all()