[BUG]: Uninitialized __global__ memory read of size 4 bytes in thrust::inclusive_scan #2104
Closed
1 task done
Labels
bug
Something isn't working right.
Is this a duplicate?
Type of Bug
Silent Failure
Component
Thrust
Describe the bug
The similar issue appears in several Thrust/CUB APIs (#1891, #1790, #1889), and some of them might be due to the same root cause, as @elstehle mentioned in the issue comment about the decoupled look-back algorithm.
It would be great to find a way to appease the compute-sanitizer so that CUDA programmers can trust the NVIDIA compute-sanitizer to identify any potential bugs while using the NVIDIA CCCL library. Thank you so much. 👍
How to Reproduce
Step 1. nvcc --extended-lambda -G main.cu
Step 2. compute-sanitizer --tool initcheck --check-device-heap yes --leak-check full --padding 512 --track-stream-ordered-races all --check-warpgroup-mma yes --require-cuda-init no --check-exit-code yes --error-exitcode 1 --nvtx true ./a.out
Expected behavior
No diagnostics emitted.
Reproduction link
No response
Operating System
Ubuntu Linux 22.04.4 LTS
nvidia-smi output
NVCC version
The text was updated successfully, but these errors were encountered: