torchao.dtypes¶
NF4Tensor class for converting a weight to the QLoRA NF4 format |
|
Affine quantized tensor subclass. Affine quantization means we quantize the floating point tensor with an affine transformation: |
NF4Tensor class for converting a weight to the QLoRA NF4 format |
|
Affine quantized tensor subclass. Affine quantization means we quantize the floating point tensor with an affine transformation: |