Skip to content
Discussion options

You must be logged in to vote

Sorry for late reply. Starting the introduction of the Hopper GPU (cc 9.0), the CUDA programming model gains a new level in the thread hierarchy called "thread block clusters." The new hierarchy goes like this: a grid can have one or more clusters, a cluster can have one or more blocks, and a block can have one or more threads.

It presents a new challenge to the traditional CUDA C++ triple chevron syntax, because it does not allow simultaneously specifying all hierarchical information at once; that is, <<<grid, cluster, block>>>, where grid, cluster, and block are all dim3 objects with integer-overloads (so N means (N, 1, 1)), is not supported due to the ambiguity in overload resolution. …

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by richardhboyd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants