int8_weight_only

torchao.quantization.int8_weight_only(group_size=None)

Applies int8 weight-only symmetric per-channel quantization to linear layers.
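
To make "symmetric per-channel" concrete, here is a minimal dependency-free sketch of the arithmetic. It is not torchao's implementation: the helper names `quantize_per_channel_int8` and `dequantize` are hypothetical, the weight is a plain list of rows standing in for a linear layer's (out_features, in_features) matrix, and mapping the largest magnitude in each row to 127 is an assumption (the library's exact scale formula and rounding may differ).

```python
def quantize_per_channel_int8(weight):
    """Symmetric per-channel int8 quantization of a 2-D weight.

    `weight` is a list of rows; each row is treated as one output channel.
    Returns (int8 values as Python ints, one scale per row).
    Illustrative only; not torchao's code path.
    """
    q_rows, scales = [], []
    for row in weight:
        max_abs = max(abs(v) for v in row)
        # Symmetric scheme: the zero point is fixed at 0, so only a scale
        # is stored. Assumption: max |w| in the row maps to 127.
        scale = max_abs / 127.0 if max_abs > 0 else 1.0
        # Round to the nearest integer and clamp to the int8 range.
        q_rows.append([max(-128, min(127, round(v / scale))) for v in row])
        scales.append(scale)
    return q_rows, scales


def dequantize(q_rows, scales):
    """Recover approximate float weights: w is roughly q * scale, per row."""
    return [[q * s for q in row] for row, s in zip(q_rows, scales)]


weight = [[0.5, -1.0, 0.25],
          [2.0, 0.0, -0.5]]
q, scales = quantize_per_channel_int8(weight)
approx = dequantize(q, scales)
```

Each row gets its own scale, so a channel with small weights is not forced to share the range of a channel with large ones. In torchao itself, the object returned by `int8_weight_only()` is normally passed to `torchao.quantization.quantize_` together with the model, rather than being applied by hand as above.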
