When TORCH_CUDA_ARCH_LIST is not empty, device_capability is set to an empty value. Shouldn't the value of TORCH_CUDA_ARCH_LIST be assigned to device_capability instead? Otherwise, specifying TORCH_CUDA_ARCH_LIST has no effect on the build.

$ TORCH_CUDA_ARCH_LIST=8.0 GROUPED_GEMM_CUTLASS=1 pip install .
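
To make the suggestion concrete, here is a rough sketch of the behavior I would expect. This is not the project's actual build code: `resolve_device_capability` is a hypothetical helper, and I am assuming device_capability is a compact string such as "80". It only handles numeric entries like "8.0" or "9.0+PTX", not named architectures.

```python
import os


def resolve_device_capability(env=None, detected=""):
    """Hypothetical helper: prefer TORCH_CUDA_ARCH_LIST over the detected value.

    `detected` stands in for whatever the build would otherwise use
    (assumed here to be a compact string such as "80").
    """
    env = os.environ if env is None else env
    arch_list = env.get("TORCH_CUDA_ARCH_LIST", "").strip()
    if not arch_list:
        # Nothing requested by the user: keep the detected capability.
        return detected

    # Entries may be separated by spaces or semicolons, e.g. "8.0;9.0+PTX".
    first = arch_list.replace(";", " ").split()[0]
    # Drop an optional "+PTX" suffix, then turn "8.0" into "80".
    major, minor = first.split("+")[0].split(".")
    return f"{major}{minor}"


if __name__ == "__main__":
    print(resolve_device_capability({"TORCH_CUDA_ARCH_LIST": "8.0"}, detected="90"))
```

With this approach, the command above would build for the requested architecture rather than ignoring the variable. The sketch uses only the first entry for simplicity; a real fix would probably need to handle every entry in the list.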