Shortcuts

choose_qparams_and_quantize_affine_hqq

torchao.quantization.choose_qparams_and_quantize_affine_hqq(tensor: ~torch.Tensor, nbits: float = 4, group_size: int = 64, optimize: bool = True, axis: int = 1, compute_dtype: ~torch.dtype = torch.float16, device: str = 'cuda', verbose: bool = False, raw_output: bool = False, optimize_weights: ~typing.Callable = <function optimize_weights_proximal_legacy>) tuple[source]

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources