You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When the input tensor is not on device 0, histogram causes an illegal memory access which prevents indices_and_bins from being computed correctly on a model & inputs which aren't on device zero.
Traceback (most recent call last):
File "/home/ubuntu/test_mb.py", line 8, in <module>
result = megablocks.ops.histogram(test_tensor, 1).cpu()
File "/home/ubuntu/.local/lib/python3.10/site-packages/torch/autograd/function.py", line 598, in apply
return super().apply(*args, **kwargs) # type: ignore[misc]
File "/home/ubuntu/.local/lib/python3.10/site-packages/megablocks/ops/histogram.py", line 17, in forward
return ops.histogram(x, max_val)
RuntimeError: an illegal memory access was encountered
whereas when idx is set to 0 the correct values are computed. Quite confused as to how this might be possible.
When the input tensor is not on device 0,
histogram
causes an illegal memory access which preventsindices_and_bins
from being computed correctly on a model & inputs which aren't on device zero.Reproduction:
when run with
CUDA_LAUNCH_BLOCKING=1
we getwhereas when
idx
is set to0
the correct values are computed. Quite confused as to how this might be possible.I'm on megablocks
0.5.1
,Cuda = 12.1 and reproduced on 2 A100's.
The text was updated successfully, but these errors were encountered: