I'm trying to compile this for an AMD 6900XT, on an AMD CPU running Arch Linux. PyTorch works properly with ROCm.
Here are some of the linker errors I'm getting:
/usr/bin/ld: CMakeFiles/ctranslate2.dir/src/ops/quantize_gpu.cu.o: in function `__float22bfloat162_rn(HIP_vector_type<float, 2u>)':
quantize_gpu.cu:(.text+0xa0): multiple definition of `__float22bfloat162_rn(HIP_vector_type<float, 2u>)'; CMakeFiles/ctranslate2.dir/src/cuda/primitives.cu.o:primitives.cu:(.text+0xa0): first defined here
/usr/bin/ld: CMakeFiles/ctranslate2.dir/src/ops/quantize_gpu.cu.o: in function `__high2float(__hip_bfloat162)':
quantize_gpu.cu:(.text+0x120): multiple definition of `__high2float(__hip_bfloat162)'; CMakeFiles/ctranslate2.dir/src/cuda/primitives.cu.o:primitives.cu:(.text+0x120): first defined here
/usr/bin/ld: CMakeFiles/ctranslate2.dir/src/ops/quantize_gpu.cu.o: in function `__low2float(__hip_bfloat162)':
quantize_gpu.cu:(.text+0x130): multiple definition of `__low2float(__hip_bfloat162)'; CMakeFiles/ctranslate2.dir/src/cuda/primitives.cu.o:primitives.cu:(.text+0x130): first defined here
clang++: error: linker command failed with exit code 1 (use -v to see invocation)
Would appreciate any help you can give here.