Trainer Basics
Core trainer classes and builder utilities.
Trainer and hooks
A generic Trainer class.
An abstract hooking class for the torchrl Trainer class.
Algorithm-specific trainers
PPO (Proximal Policy Optimization) trainer implementation.
A trainer class for the Soft Actor-Critic (SAC) algorithm.
Builders
Returns a data collector for off-policy sota-implementations.
Makes a collector in on-policy settings.
Builds the DQN loss module.
Builds a replay buffer using the config built from ReplayArgsConfig.
Builds a target network weight update object.
Creates a Trainer instance given its constituents.
Returns a parallel environment from an argparse.Namespace built with the appropriate parser constructor.
Runs asynchronous collectors, each running synchronous environments.
Runs synchronous collectors, each running synchronous environments.
Returns an environment creator from an argparse.Namespace built with the appropriate parser constructor.
Utils
Corrects the arguments for the input frame_skip by dividing every argument that represents a frame count by frame_skip.
Gathers stats (loc and scale) from an environment using random rollouts.
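
The two utilities above can be illustrated with a minimal, library-independent sketch. Everything in it is an assumption made for illustration: `FrameConfig`, its field names, `correct_frame_counts`, and `random_rollout_stats` are hypothetical and are not torchrl's API. It also assumes that "loc" means the sample mean and "scale" the sample standard deviation, and it stands in for an environment with a plain function that returns one scalar observation per call.

```python
from dataclasses import dataclass, replace
import random
import statistics


@dataclass(frozen=True)
class FrameConfig:
    """Hypothetical config; field names are illustrative, not the library's."""
    frame_skip: int = 1
    total_frames: int = 1_000_000      # counted in raw environment frames
    frames_per_batch: int = 1_000      # counted in raw environment frames
    init_random_frames: int = 10_000   # counted in raw environment frames


def correct_frame_counts(cfg: FrameConfig) -> FrameConfig:
    """Divide every frame-count field by frame_skip (integer division)."""
    skip = cfg.frame_skip
    if skip < 1:
        raise ValueError("frame_skip must be a positive integer")
    return replace(
        cfg,
        total_frames=cfg.total_frames // skip,
        frames_per_batch=cfg.frames_per_batch // skip,
        init_random_frames=cfg.init_random_frames // skip,
    )


def random_rollout_stats(step_fn, n_steps: int, seed: int = 0):
    """Estimate loc (mean) and scale (population standard deviation) of the
    scalar observations returned by step_fn over n_steps random steps."""
    rng = random.Random(seed)
    observations = [step_fn(rng) for _ in range(n_steps)]
    loc = statistics.fmean(observations)
    scale = statistics.pstdev(observations)
    return loc, scale


# With frame_skip=4, each frame-count field shrinks by a factor of 4.
corrected = correct_frame_counts(FrameConfig(frame_skip=4))
print(corrected.total_frames, corrected.frames_per_batch, corrected.init_random_frames)
# prints: 250000 250 2500

# Stand-in "environment": observations drawn uniformly from [-1, 1].
loc, scale = random_rollout_stats(lambda rng: rng.uniform(-1.0, 1.0), n_steps=10_000)
print(round(loc, 2), round(scale, 2))
```

Integer division is one possible rounding choice; a real implementation may round differently or validate that the counts are divisible by frame_skip.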