快捷方式

choose_qparams_and_quantize_affine_hqq

torchao.quantization.choose_qparams_and_quantize_affine_hqq(tensor: ~torch.Tensor, nbits: float = 4, group_size: int = 64, optimize: bool = True, axis: int = 1, compute_dtype: ~torch.dtype = torch.float16, device: str = 'cuda', verbose: bool = False, raw_output: bool = False, optimize_weights: ~typing.Callable = <function optimize_weights_proximal_legacy>) tuple[source]

文档

访问 PyTorch 的综合开发者文档

查看文档

教程

获取面向初学者和高级开发者的深入教程

查看教程

资源

查找开发资源并获得解答

查看资源