torchao.utils#

Created On: Jan 08, 2026 | Last Updated On: Jan 08, 2026

Tensor Subclass Utils#

A util tensor subclass that provides commonly used functions

`KernelPreference`	Enum for specifying the groups of kernels that's used for quantization, matrix multiplication or other compute ops for quantized tensor
`PackingFormat`	Packing format for quantized data in Tensor subclasses in torchao, represents how the values are packed and laid out in the quantized data.
`QuantizeTensorKwargs`	Base class for keyword argument container for quantized tensor creation.
`_choose_quant_func_and_quantize_tensor`	Given a tensor and a kwargs container, chooses a derived dtype (float8, int8, etc) to quantize tensor to, based on the type of quant_kwargs quantizes tensor to the derived dtype chosen in (1) This is needed to support flexible quantization of activation to various derived dtypes.