Skip to content

Conversation

@RiverDave
Copy link
Collaborator

@RiverDave RiverDave commented Nov 20, 2025

The tests for all kernell calls are originally located at: https://github.com/llvm/clangir/blob/main/clang/test/CodeGenCUDA/kernel-call.cu we might need to re-organize all of our already existing tests for parity and correctness in the future. If anybody's interested in what stream-per-thread actually means: https://developer.nvidia.com/blog/gpu-pro-tip-cuda-7-streams-simplify-concurrency/

@RiverDave RiverDave changed the title [CIR][CUDA] Support stream per thread [CIR][CUDA] Add Support for stream per thread Nov 20, 2025
@RiverDave RiverDave force-pushed the users/riverdave/cuda-per-thread-stream branch from 9949967 to 4069ee9 Compare November 20, 2025 23:04
@github-actions
Copy link

github-actions bot commented Nov 20, 2025

✅ With the latest revision this PR passed the C/C++ code formatter.

Copy link
Member

@bcardosolopes bcardosolopes left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM minor pending nits. (@koparasy, any feedback?)

@koparasy
Copy link
Contributor

LGTM, please add the HIP test as well

@RiverDave RiverDave changed the title [CIR][CUDA] Add Support for stream per thread [CIR][CUDA][HIP] Add Support for stream per thread Nov 21, 2025
@RiverDave RiverDave force-pushed the users/riverdave/cuda-per-thread-stream branch from 09bf69e to fe68dbb Compare November 21, 2025 17:57
@RiverDave RiverDave force-pushed the users/riverdave/cuda-per-thread-stream branch from fe68dbb to 88cbb36 Compare November 21, 2025 18:00
@bcardosolopes bcardosolopes merged commit 39d59b6 into main Nov 21, 2025
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants