Common Components¶
Base classes and common utilities for all loss modules.
|
A parent class for RL losses. |
|
Adds a random module to the list of modules that will be detected by |
Value Estimators¶
|
An abstract parent class for value function modules. |
|
Temporal Difference (TD(0)) estimate of advantage function. |
|
\(\infty\)-Temporal Difference (TD(1)) estimate of advantage function. |
|
TD(\(\lambda\)) estimate of advantage function. |
|
A class wrapper around the generalized advantage estimate functional. |
|
Value function enumerator for custom-built estimators. |