int_scaled_matmul

torchao.quantization.int_scaled_matmul(a: Tensor, b: Tensor, scales1: Tensor) → Tensor

Performs scaled integer matrix multiplication.

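Parameters:

a (Tensor) – The first matrix to multiply.
b (Tensor) – The second matrix to multiply.
scales1 (Tensor) – The scaling factors applied to the result.

Returns: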

The result of the scaled matrix multiplication.

Return type:

torch.Tensor

Raises:

AssertionError – If the dimensions of the input tensors do not match the expected shapes.
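
The following is a minimal pure-PyTorch sketch of what a scaled integer matrix multiplication computes; it is not the torchao implementation. The helper name reference_int_scaled_matmul is hypothetical, and the shape contract it checks (a of shape (M, K), b of shape (K, N), scales1 of shape (M, 1) or (1, 1)) is an assumption, since this page does not state the expected shapes.

```python
import torch


def reference_int_scaled_matmul(a, b, scales1):
    """Hypothetical reference: integer matmul followed by scaling."""
    M, K = a.shape
    K2, N = b.shape
    # Assumed shape contract (not taken from the documentation above).
    assert K == K2, "inner dimensions of a and b must match"
    assert scales1.shape in ((M, 1), (1, 1)), "scales1 must be (M, 1) or (1, 1)"
    # Accumulate in int32 so that sums of int8 products do not overflow.
    acc = a.to(torch.int32) @ b.to(torch.int32)
    # Broadcast the scales across the columns of each output row.
    return acc * scales1


a = torch.tensor([[1, 2], [3, 4]], dtype=torch.int8)
b = torch.tensor([[5, 6], [7, 8]], dtype=torch.int8)
scales1 = torch.tensor([[0.5], [2.0]])

out = reference_int_scaled_matmul(a, b, scales1)
print(out)  # floating-point result with one scale applied per row
```

Multiplying the int32 accumulator by floating-point scales promotes the result to the dtype of the scales, so the output here is a float32 tensor. A reference like this can be useful for checking an optimized kernel on small inputs.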
