MultiProcessedWeightUpdater¶
- class torchrl.collectors.MultiProcessedWeightUpdater(*, get_server_weights: Callable[[], TensorDictBase] | None, policy_weights: dict[torch.device, TensorDictBase])[source]¶
A remote weight updater for synchronizing policy weights across multiple processes or devices.
The MultiProcessedWeightUpdater class provides a mechanism for updating the weights of a policy across multiple inference workers in a multiprocessed environment. It is designed to handle the distribution of weights from a central server to various devices or processes that are running the policy. This class is typically used in multiprocessed data collectors where each process or device requires an up-to-date copy of the policy weights.
- Keyword Arguments:
get_server_weights (Callable[[], TensorDictBase] | None) – A callable that retrieves the latest policy weights from the server or another centralized source.
policy_weights (Dict[torch.device, TensorDictBase]) – A dictionary mapping each device or process to its current policy weights, which will be updated.
Note
This class assumes that the server weights can be directly applied to the workers without any additional processing. If your use case requires more complex weight mapping or synchronization logic, consider extending WeightUpdaterBase with a custom implementation.
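A minimal construction sketch, assuming the server weights are extracted with TensorDict.from_module and a single CPU worker. In practice the multiprocessed collectors typically build this updater for you, so the names below (server_policy, server_weights) are purely illustrative:

```python
import torch
from tensordict import TensorDict
from torch import nn

from torchrl.collectors import MultiProcessedWeightUpdater

# Server-side policy acting as the source of truth for the weights.
server_policy = nn.Linear(4, 2)
server_weights = TensorDict.from_module(server_policy)

# One weight copy per worker, keyed by the device the worker runs on.
policy_weights = {torch.device("cpu"): server_weights.clone()}

updater = MultiProcessedWeightUpdater(
    get_server_weights=lambda: server_weights,
    policy_weights=policy_weights,
)
```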
- all_worker_ids() → list[int] | list[torch.device] [source]¶
Gets the list of all worker IDs.
Returns None by default. Subclasses should override to return actual worker IDs.
- Returns:
List of worker IDs or None.
- Return type:
list[int] | list[torch.device] | None
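For an updater such as this one, a natural override is to report the devices it holds weights for. A hedged sketch of such an override in a custom WeightUpdaterBase subclass (the class name and the _policy_weights attribute are illustrative, not part of the library):

```python
import torch
from tensordict import TensorDictBase

from torchrl.collectors import WeightUpdaterBase


class DeviceKeyedUpdater(WeightUpdaterBase):
    """Illustrative subclass that tracks one weight copy per device."""

    def __init__(self, policy_weights: dict[torch.device, TensorDictBase]):
        super().__init__()
        self._policy_weights = policy_weights

    def all_worker_ids(self) -> list[torch.device]:
        # Report the devices this updater manages as its worker IDs.
        return list(self._policy_weights)
```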
- property collector: torchrl.collectors.DataCollectorBase¶
The collector or container of the receiver.
Returns None if the container is out-of-scope or not set.
- classmethod from_policy(policy: TensorDictModuleBase) → WeightUpdaterBase | None¶
Optional classmethod to create a weight updater instance from a policy.
This method can be implemented by subclasses to provide custom initialization logic based on the policy. If implemented, this method will be called before falling back to the default constructor when initializing a weight updater in a collector.
- Parameters:
policy (TensorDictModuleBase) – The policy to create the weight updater from.
- Returns:
An instance of the weight updater, or None if the policy cannot be used to create an instance.
- Return type:
WeightUpdaterBase | None
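A hedged sketch of how a subclass might implement this hook, snapshotting the policy parameters with TensorDict.from_module and returning None when that fails. The subclass and its constructor are illustrative, not part of the library:

```python
from tensordict import TensorDict
from tensordict.nn import TensorDictModuleBase

from torchrl.collectors import WeightUpdaterBase


class PolicyAwareUpdater(WeightUpdaterBase):
    """Illustrative subclass that can be built directly from a policy."""

    def __init__(self, weights):
        super().__init__()
        self._weights = weights

    @classmethod
    def from_policy(cls, policy: TensorDictModuleBase) -> "PolicyAwareUpdater | None":
        try:
            # Snapshot the policy parameters as a TensorDict.
            return cls(TensorDict.from_module(policy))
        except Exception:
            # Returning None tells the collector to fall back to the default constructor.
            return None
```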
- increment_version()¶
Increment the policy version.
- init(*args, **kwargs)¶
Initialize the weight updater with custom arguments.
This method can be overridden by subclasses to handle custom initialization. By default, this is a no-op.
- Parameters:
*args – Positional arguments for initialization
**kwargs – Keyword arguments for initialization
- property post_hooks: list[Callable[[], NoneType]]¶
The list of post-hooks registered to the weight updater.
- push_weights(policy_or_weights: TensorDictModuleBase | TensorDictBase | dict | None = None, worker_ids: torch.device | int | list[int] | list[torch.device] | None = None)¶
Updates the policy weights on the specified workers, or on all remote workers if none are specified.
- Parameters:
policy_or_weights – The source to get weights from. Can be:
- TensorDictModuleBase: A policy module whose weights will be extracted
- TensorDictBase: A TensorDict containing weights
- dict: A regular dict containing weights
- None: Will try to get weights from the server using _get_server_weights()
worker_ids – An optional list of workers (devices or indices) to update. If omitted, all workers are updated.
- Returns:
nothing.
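A usage sketch, reusing updater and server_weights from the construction example near the top of this page; a bare call pulls whatever the server callable returns and fans it out to every worker, while an explicit TensorDict and worker list target a single device:

```python
# Fetch the latest weights from the server callable and update all workers.
updater.push_weights()

# Push an explicit TensorDict of weights to a single worker.
updater.push_weights(
    policy_or_weights=server_weights,
    worker_ids=[torch.device("cpu")],
)
```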
- register_collector(collector: DataCollectorBase)¶
Register a collector in the updater.
Once registered, the updater will not accept another collector.
- Parameters:
collector (DataCollectorBase) – The collector to register.
- register_post_hook(hook: Callable[[], None])¶
Registers a post-hook to be called after weights are updated.
- Parameters:
hook (Callable[[], None]) – The post-hook to register.
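A brief sketch, reusing the updater from the construction example above; hooks take no arguments, so any state they need must be captured by closure or module globals (the counter below is purely illustrative):

```python
push_count = 0


def log_weight_push() -> None:
    # Post-hook: runs after each weight update completes.
    global push_count
    push_count += 1
    print(f"weights pushed {push_count} time(s)")


updater.register_post_hook(log_weight_push)
```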