Shortcuts

swap_linear_with_smooth_fq_linear

torchao.quantization.swap_linear_with_smooth_fq_linear(model, skip_fqn_list=None, cur_fqn='', alpha=0.5) None[source]

Replaces linear layers in the model with their SmoothFakeDynamicallyQuantizedLinear equivalents.

Parameters:
  • model (torch.nn.Module) – The model containing linear layers to be replaced.

  • skip_fqn_list (list of str, optional) – List of fully qualified names to skip during replacement. Defaults to None.

  • cur_fqn (str, optional) – The current fully qualified name of the module being processed. Defaults to “”.

  • alpha (float, optional) – The scaling factor for SmoothQuant. Defaults to 0.5.

Returns:

None

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources