torchrl.modules.mcts package¶
This module provides Monte Carlo Tree Search (MCTS) components, including score computation modules for balancing exploration and exploitation in tree search algorithms.
MCTS Scores¶
|
Abstract base class for MCTS score computation modules. |
|
Computes the PUCT (Polynomial Upper Confidence Trees) score for MCTS. |
|
Computes the UCB (Upper Confidence Bound) score, specifically UCB1, for MCTS. |
|
Computes action selection probabilities for the EXP3 algorithm in MCTS. |
|
Computes the UCB1-Tuned score for MCTS, using variance estimation. |
|
Enum providing factory functions for common MCTS score configurations. |