Table of Contents

Shortcuts

int_scaled_matmul¶

torchao.quantization.int_scaled_matmul(a: Tensor, b: Tensor, scales1: Tensor) → Tensor[source]¶

Performs scaled integer matrix multiplication.

Parameters:

a (torch.Tensor) – The first matrix to multiply.
b (torch.Tensor) – The second matrix to multiply.
scales1 (torch.Tensor) – The scaling factors for the rows of the result.

Returns:

The result of the scaled matrix multiplication.

Return type:

Raises:

AssertionError – If the dimensions of the input tensors do not match the expected shapes.

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources