Shortcuts

int8_dynamic_activation_int8_weight

torchao.quantization.int8_dynamic_activation_int8_weight(layout=PlainLayout(), act_mapping_type=MappingType.SYMMETRIC, weight_only_decode=False)[source]

Applies int8 dynamic symmetric per-token activation and int8 per-channel weight quantization to linear layers

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources