CUDA Kernels for cpm.
- CUDA 10.1 - 11.8
- CUDA 12.0 - 12.6
- Pascal (GTX 10 series): sm_61, sm_62
- Volta (V100, Titan V): sm_70, sm_72
- Turing (RTX 20 series, T4): sm_75
- Ampere (RTX 30 series, A100): sm_80, sm_86
- Orin (Jetson AGX Orin): sm_87
- Ada Lovelace (RTX 40 series): sm_89
- Hopper (H100): sm_90