Training Hooks¶
Hooks for customizing the training loop at various points.
|
Data subsampler for online RL sota-implementations. |
|
Clears cuda cache at a given interval. |
|
A frame counter hook. |
|
Generic scalar logger hook for any tensor values in the batch. |
|
Add an optimizer for one or more loss components. |
|
Recorder hook for |
|
Replay buffer hook provider. |
|
Reward normalizer hook. |
|
Selects keys in a TensorDict batch. |
|
A collector weights update hook class. |
|
A hook for target parameters update. |
|
Hook for logging Update-to-Data (UTD) ratio during async collection. |