Shortcuts

Common Components

Base classes and common utilities for all loss modules.

LossModule(*args, **kwargs)

A parent class for RL losses.

add_random_module(module)

Adds a random module to the list of modules that will be detected by vmap_randomness() as random.

Value Estimators

ValueEstimatorBase(*args, **kwargs)

An abstract parent class for value function modules.

TD0Estimator(*args, **kwargs)

Temporal Difference (TD(0)) estimate of advantage function.

TD1Estimator(*args, **kwargs)

\(\infty\)-Temporal Difference (TD(1)) estimate of advantage function.

TDLambdaEstimator(*args, **kwargs)

TD(\(\lambda\)) estimate of advantage function.

GAE(*args, **kwargs)

A class wrapper around the generalized advantage estimate functional.

ValueEstimators(value[, names, module, ...])

Value function enumerator for custom-built estimators.

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources