Tutorials
Created On: Jan 29, 2026 | Last Updated On: Jan 29, 2026
Tutorials for quantization using eager mode execution.
- First Quantization Example
- (Part 1) Pre-training with float8
- (Part 2) Fine-tuning with QAT, QLoRA, and float8
- (Part 3) Serving on vLLM, SGLang, ExecuTorch
- Integration with vLLM: Architecture and Usage Guide
- Hugging Face Integration
- Serialization
- Static Quantization
- Writing Your Own Quantized Tensor
- Writing Your Own Quantized Tensor (advanced)