1 parent c86564b commit 529031a
vllm/distributed/device_communicators/cuda_communicator.py
@@ -225,7 +225,7 @@ def reduce_scatterv(
             output_shape, dtype=input_tensor.dtype, device=input_tensor.device
         )
 
-        if sizes is not None:
+        if sizes is not None and sizes.count(sizes[0]) != len(sizes):
             pynccl_comm.reduce_scatterv(output, input_tensor, sizes=sizes)
         else:
             pynccl_comm.reduce_scatter(output, input_tensor)
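The changed condition appears to route a call to the variable-size collective only when the per-rank sizes actually differ, and to fall back to the plain `reduce_scatter` when every entry in `sizes` is the same. The sketch below isolates that predicate so it can be checked without a GPU or NCCL. The helper names `needs_variable_reduce_scatter` and `pick_collective` are hypothetical and are not part of vLLM, and the empty-list guard is an added assumption: the expression in the diff reads `sizes[0]` and would raise `IndexError` on an empty list.

```python
from typing import Optional, Sequence


def needs_variable_reduce_scatter(sizes: Optional[Sequence[int]]) -> bool:
    """Return True only when sizes is given and its entries are not all equal.

    Hypothetical helper mirroring the condition added in the diff.
    """
    if sizes is None:
        return False
    if len(sizes) == 0:
        # Assumption: treat an empty list as uniform. The diff's expression
        # indexes sizes[0] and would raise IndexError in this case.
        return False
    # Same test as the diff: uniform iff the first entry fills the whole list.
    return sizes.count(sizes[0]) != len(sizes)


def pick_collective(sizes: Optional[Sequence[int]]) -> str:
    """Name the collective the patched branch would dispatch to (illustrative)."""
    if needs_variable_reduce_scatter(sizes):
        return "reduce_scatterv"
    return "reduce_scatter"


if __name__ == "__main__":
    for candidate in (None, [4, 4, 4], [4, 2, 4]):
        print(candidate, "->", pick_collective(candidate))
```

With uniform sizes, each rank receives an equally sized slice, which is the layout the plain `reduce_scatter` already assumes, so the fallback should leave the result unchanged while avoiding the variable-size path.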