
Torch-TensorRT



Contributing

Guides for contributing to Torch-TensorRT: writing converters and lowering passes, understanding the system architecture, and managing resources.

Architecture

  • System Overview
  • Compiler Phases
  • Built-in Dynamo Lowering Passes
  • Observer / Callback System
  • Writing TorchScript Converters
  • Useful Links for Torch-TensorRT Development

Feature Design

  • Engine Caching
  • Weight Refit
  • CUDAGraphs
  • Dynamic Shape Support
  • Complex Number Support
  • Dynamic Memory Allocation
  • Autocast and Precision Management
  • Export and Serialization


© Copyright 2024, NVIDIA Corporation.