Shortcuts

int8_dynamic_activation_int8_semi_sparse_weight

torchao.sparsity.int8_dynamic_activation_int8_semi_sparse_weight()[source]

Applies int8 dnynamic symmetric per-token activation and int8 per-channel weight quantization + 2:4 sparsity to linear layers.

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources