VLLMDoubleBufferWeightSender¶
- class torchrl.weight_update.llm.VLLMDoubleBufferWeightSender(scheme: VLLMDoubleBufferSyncScheme)[source]¶
Sends weights to vLLM workers using double-buffered storage.
This sender extracts weights from a training model and writes them to a shared directory using TensorDict.memmap.
Example
>>> sender = scheme.create_sender() >>> sender.register_model(policy_model) >>> >>> # During training loop >>> sender.update_weights() # Writes current weights to shared storage