torch.cuda.comm.gather
- torch.cuda.comm.gather(tensors, dim=0, destination=None, *, out=None)
- Gathers tensors from multiple GPU devices.
- Parameters
- tensors (Iterable[Tensor]) – an iterable of tensors to gather. Tensor sizes in all dimensions other than dim have to match.
- dim (int, optional) – a dimension along which the tensors will be concatenated. Default: 0.
- destination (torch.device, str, or int, optional) – the output device. Can be CPU or CUDA. Default: the current CUDA device. 
- out (Tensor, optional, keyword-only) – the tensor in which to store the gather result. Its sizes must match those of tensors, except for dim, where the size must equal sum(tensor.size(dim) for tensor in tensors). Can be on CPU or CUDA.
 
- Note
- destination must not be specified when out is specified.
- Returns
- If destination is specified,
- a tensor located on the destination device, that is the result of concatenating tensors along dim.
 
- If out is specified,
- the out tensor, now containing the result of concatenating tensors along dim.
 
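The two calling modes described above (destination versus out) can be sketched as follows. This is a minimal illustration, not part of the official documentation: the helper name gather_demo, the chunk shapes, and the fill values are hypothetical, and the example assumes that gathering works when several chunks happen to live on the same GPU. Because gather expects CUDA tensors, the sketch falls back to torch.cat on machines without CUDA, purely to show the equivalent concatenation result.

```python
import torch
import torch.cuda.comm as comm


def gather_demo():
    """Hypothetical helper: gather two (2, 3) chunks along dim 0 onto the CPU."""
    if torch.cuda.is_available():
        n = torch.cuda.device_count()
        # Spread the chunks across the available GPUs (all on cuda:0 if only one).
        chunks = [
            torch.full((2, 3), float(i), device=f"cuda:{i % n}") for i in range(2)
        ]

        # Mode 1: name a destination device; a new tensor is returned there.
        result = comm.gather(chunks, dim=0, destination=torch.device("cpu"))

        # Mode 2: supply a preallocated output instead. Its size along dim must
        # equal the sum of the chunk sizes along dim (2 + 2 = 4 here), and
        # destination must be left unspecified.
        out = torch.empty(4, 3)
        comm.gather(chunks, dim=0, out=out)
        assert torch.equal(result, out)
    else:
        # Assumption: no CUDA device present. torch.cat produces the same
        # concatenated values, which is enough to illustrate the semantics.
        chunks = [torch.full((2, 3), float(i)) for i in range(2)]
        result = torch.cat(chunks, dim=0)
    return result


result = gather_demo()
print(result.shape)  # torch.Size([4, 3])
```

Passing destination is convenient when a fresh tensor is acceptable; passing out avoids an extra allocation when a buffer of the right shape already exists.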