torchao.utils¶
Tensor Subclass Utils¶
A util tensor subclass that provides commonly used functions |
torchao.quantization.quantize_.common¶
quantize_ API Common Utils¶
Enum for specifying the groups of kernels that's used for quantization, matrix multiplication or other compute ops for quantized tensor |
|
Packing format for quantized data in Tensor subclasses in torchao, represents how the values are packed and laid out in the quantized data. |
|
Base class for keyword argument container for quantized tensor creation. |
|
Given a tensor and a kwargs container, chooses a derived dtype (float8, int8, etc) to quantize tensor to, based on the type of quant_kwargs quantizes tensor to the derived dtype chosen in (1) This is needed to support flexible quantization of activation to various derived dtypes. |