Shortcuts

Policy Gradient Methods

Loss modules for policy gradient algorithms.

PPOLoss(*args, **kwargs)

A parent PPO loss class.

ClipPPOLoss(*args, **kwargs)

Clipped PPO loss.

KLPENPPOLoss(*args, **kwargs)

KL Penalty PPO loss.

A2CLoss(*args, **kwargs)

TorchRL implementation of the A2C loss.

ReinforceLoss(*args, **kwargs)

Reinforce loss module.

Docs

Access comprehensive developer documentation for PyTorch

View Docs

Tutorials

Get in-depth tutorials for beginners and advanced developers

View Tutorials

Resources

Find development resources and get your questions answered

View Resources