Open
Description
Describe the bug
The performance test SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp
is failing on native_cpu. Upon resolution, please remove XFAIL from test source.
To reproduce
FAIL: SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp (8 of 8)
******************** TEST 'SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp' FAILED ********************
Exit Code: 1
Command Output (stdout):
--
# RUN: at line 1
/__w/llvm/llvm/toolchain/bin//clang++ -Werror -fsycl -fsycl-targets=spir64,native_cpu /__w/llvm/llvm/llvm/sycl/test-e2e/PerformanceTests/Reduction/reduce_over_sub_group.cpp -o /__w/llvm/llvm/build-e2e/PerformanceTests/Reduction/Output/reduce_over_sub_group.cpp.tmp.out
# executed command: /__w/llvm/llvm/toolchain/bin//clang++ -Werror -fsycl -fsycl-targets=spir64,native_cpu /__w/llvm/llvm/llvm/sycl/test-e2e/PerformanceTests/Reduction/reduce_over_sub_group.cpp -o /__w/llvm/llvm/build-e2e/PerformanceTests/Reduction/Output/reduce_over_sub_group.cpp.tmp.out
# .---command stderr------------
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIhEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xbb): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xd6): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testItEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x208): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x226): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIjEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x358): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x373): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testImEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x4a8): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x4c3): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIaEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x5fb): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x616): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIsEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x748): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x766): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIiEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x898): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x8b3): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIlEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x9e8): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xa03): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIN4sycl3_V16detail9half_impl4halfEEvvENKUlRNS1_7handlerEE_clES6_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xb46): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xb6b): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIfEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xe67): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xe83): undefined reference to `__mux_sub_group_broadcast_f32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIdEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xfe7): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x1003): undefined reference to `__mux_sub_group_broadcast_f64'
# | clang++: error: linker command failed with exit code 1 (use -v to see invocation)
# `-----------------------------
# error: command failed with exit status: 1
Environment
- OS: Any
- Target device and vendor: native_cpu
- DPC++ version: da317d04df8c6ffc873dfe1943c1c7c26e8de1e5
- Dependencies version:
- 2024-12-17T21:51:33.5779673Z �[36;1msycl-ls --verbose�[0m
2024-12-17T21:51:33.5780436Z shell: sh -e {0}
2024-12-17T21:51:33.5780864Z env:
2024-12-17T21:51:33.5781299Z LIT_FILTER: PerformanceTests/
2024-12-17T21:51:33.5781958Z CMAKE_PREFIX_PATH: /runtimes/oneapi-tbb/env/..
2024-12-17T21:51:33.5782676Z CPATH: /runtimes/oneapi-tbb/env/../include
2024-12-17T21:51:33.5783693Z LD_LIBRARY_PATH: /__w/llvm/llvm/toolchain/lib/:/runtimes/oneapi-tbb/env/../lib/intel64/gcc4.8
2024-12-17T21:51:33.5784819Z LIBRARY_PATH: /runtimes/oneapi-tbb/env/../lib/intel64/gcc4.8
2024-12-17T21:51:33.5785683Z PKG_CONFIG_PATH: /runtimes/oneapi-tbb/env/../lib/pkgconfig
2024-12-17T21:51:33.5786445Z TBBROOT: /runtimes/oneapi-tbb/env/..
2024-12-17T21:51:33.5787440Z PATH: /__w/llvm/llvm/toolchain/bin/:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
2024-12-17T21:51:33.5788417Z ##[endgroup]
2024-12-17T21:51:33.6363228Z [level_zero:gpu][level_zero:0] Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6363940Z [opencl:gpu][opencl:0] Intel(R) OpenCL Graphics, Intel(R) Iris(R) Xe Graphics OpenCL 3.0 NEO [24.39.31294.12]
2024-12-17T21:51:33.6364412Z [opencl:cpu][opencl:1] Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6364945Z [opencl:fpga][opencl:2] Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2 [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6365391Z [native_cpu:cpu][native_cpu:0] SYCL_NATIVE_CPU, SYCL Native CPU 0.1 [0.0.0]
2024-12-17T21:51:33.6365584Z
2024-12-17T21:51:33.6366425Z Platforms: 5
2024-12-17T21:51:33.6366566Z Platform [#1]:
2024-12-17T21:51:33.6366703Z Version : 1.6
2024-12-17T21:51:33.6366924Z Name : Intel(R) oneAPI Unified Runtime over Level-Zero
2024-12-17T21:51:33.6367155Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6367338Z Devices : 1
2024-12-17T21:51:33.6367477Z Device [#0]:
2024-12-17T21:51:33.6367627Z Type : gpu
2024-12-17T21:51:33.6367795Z Version : 12.0.0
2024-12-17T21:51:33.6367990Z Name : Intel(R) Iris(R) Xe Graphics
2024-12-17T21:51:33.6368208Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6368406Z Driver : 1.6.31294.120000
2024-12-17T21:51:33.6368594Z UUID : 13412873154100002000000
2024-12-17T21:51:33.6368785Z DeviceID : 39497
2024-12-17T21:51:33.6368958Z Num SubDevices : 0
2024-12-17T21:51:33.6369124Z Num SubSubDevices : 0
2024-12-17T21:51:33.6370909Z Aspects : gpu fp16 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_pci_address ext_intel_gpu_eu_count ext_intel_gpu_eu_simd_width ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice atomic64 ext_intel_device_info_uuid ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_memory_clock_rate ext_intel_memory_bus_width ext_intel_legacy_image ext_intel_esimd ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_limited_graph ext_oneapi_private_alloca ext_oneapi_queue_profiling_tag ext_oneapi_virtual_mem ext_oneapi_virtual_functions
2024-12-17T21:51:33.6374960Z info::device::sub_group_sizes: 8 16 32
2024-12-17T21:51:33.6375204Z Architecture: intel_gpu_tgllp
2024-12-17T21:51:33.6375396Z Platform [#2]:
2024-12-17T21:51:33.6375542Z Version : OpenCL 3.0
2024-12-17T21:51:33.6375717Z Name : Intel(R) OpenCL Graphics
2024-12-17T21:51:33.6375928Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6376101Z Devices : 1
2024-12-17T21:51:33.6376228Z Device [#0]:
2024-12-17T21:51:33.6376363Z Type : gpu
2024-12-17T21:51:33.6376679Z Version : OpenCL 3.0 NEO
2024-12-17T21:51:33.6376879Z Name : Intel(R) Iris(R) Xe Graphics
2024-12-17T21:51:33.6377079Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6377257Z Driver : 24.39.31294.12
2024-12-17T21:51:33.6377432Z UUID : 13412873154100002000000
2024-12-17T21:51:33.6377605Z DeviceID : 39497
2024-12-17T21:51:33.6377771Z Num SubDevices : 0
2024-12-17T21:51:33.6377928Z Num SubSubDevices : 0
2024-12-17T21:51:33.6379165Z Aspects : gpu fp16 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_pci_address ext_intel_gpu_eu_count ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice atomic64 ext_intel_device_info_uuid ext_oneapi_srgb ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_legacy_image ext_intel_esimd ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_private_alloca ext_oneapi_atomic16 ext_oneapi_virtual_functions
2024-12-17T21:51:33.6380418Z info::device::sub_group_sizes: 8 16 32
2024-12-17T21:51:33.6380610Z Architecture: intel_gpu_tgllp
2024-12-17T21:51:33.6380773Z Platform [#3]:
2024-12-17T21:51:33.6380902Z Version : OpenCL 3.0 LINUX
2024-12-17T21:51:33.6381057Z Name : Intel(R) OpenCL
2024-12-17T21:51:33.6381215Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6381375Z Devices : 1
2024-12-17T21:51:33.6381500Z Device [#1]:
2024-12-17T21:51:33.6381631Z Type : cpu
2024-12-17T21:51:33.6381791Z Version : OpenCL 3.0 (Build 0)
2024-12-17T21:51:33.6382078Z Name : 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz
2024-12-17T21:51:33.6382287Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6382469Z Driver : 2024.18.10.0.08_160000
2024-12-17T21:51:33.6382649Z DeviceID : 526017
2024-12-17T21:51:33.6543853Z Num SubDevices : 0
2024-12-17T21:51:33.6544081Z Num SubSubDevices : 0
2024-12-17T21:51:33.6545480Z Aspects : cpu fp16 fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations usm_system_allocations ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice usm_atomic_host_allocations usm_atomic_shared_allocations atomic64 ext_oneapi_srgb ext_oneapi_native_assert ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_legacy_image ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_private_alloca ext_oneapi_virtual_functions
2024-12-17T21:51:33.6546895Z info::device::sub_group_sizes: 4 8 16 32 64
2024-12-17T21:51:33.6547098Z Architecture: x86_64
2024-12-17T21:51:33.6547261Z Platform [#4]:
2024-12-17T21:51:33.6547473Z Version : OpenCL 1.2 Intel(R) FPGA SDK for OpenCL(TM), Version 20.3
2024-12-17T21:51:33.6547755Z Name : Intel(R) FPGA Emulation Platform for OpenCL(TM)
2024-12-17T21:51:33.6547975Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6548146Z Devices : 1
2024-12-17T21:51:33.6548284Z Device [#2]:
2024-12-17T21:51:33.6548429Z Type : fpga
2024-12-17T21:51:33.6548599Z Version : OpenCL 1.2
2024-12-17T21:51:33.6548800Z Name : Intel(R) FPGA Emulation Device
2024-12-17T21:51:33.6549007Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6549204Z Driver : 2024.18.10.0.08_160000
2024-12-17T21:51:33.6549390Z DeviceID : 526017
2024-12-17T21:51:33.6752831Z Num SubDevices : 0
2024-12-17T21:51:33.6754179Z Num SubSubDevices : 0
2024-12-17T21:51:33.6762258Z Aspects : accelerator fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice usm_atomic_host_allocations usm_atomic_shared_allocations ext_oneapi_srgb ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_intel_fpga_task_sequence ext_oneapi_private_alloca
2024-12-17T21:51:33.6769624Z info::device::sub_group_sizes: 4 8 16 32 64
2024-12-17T21:51:33.6770811Z Architecture: unknown
2024-12-17T21:51:33.6771610Z Platform [#5]:
2024-12-17T21:51:33.6772245Z Version : 0.1
2024-12-17T21:51:33.6772914Z Name : SYCL_NATIVE_CPU
2024-12-17T21:51:33.6773714Z Vendor : tbd
2024-12-17T21:51:33.6774373Z Devices : 1
2024-12-17T21:51:33.6774999Z Device [#0]:
2024-12-17T21:51:33.6775737Z Type : cpu
2024-12-17T21:51:33.6776593Z Version : 0.1
2024-12-17T21:51:33.6777421Z Name : SYCL Native CPU
2024-12-17T21:51:33.6778340Z Vendor : Intel(R) Corporation
2024-12-17T21:51:33.6779241Z Driver : 0.0.0
2024-12-17T21:51:33.6780130Z DeviceID : UNKNOWN
2024-12-17T21:51:33.6781039Z Num SubDevices : 0
2024-12-17T21:51:33.6781927Z Num SubSubDevices : 0
2024-12-17T21:51:33.6784998Z Aspects : cpu fp16 fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations usm_system_allocations usm_atomic_host_allocations usm_atomic_shared_allocations atomic64
2024-12-17T21:51:33.6788095Z info::device::sub_group_sizes: 1
2024-12-17T21:51:33.6789016Z Architecture: unknown
2024-12-17T21:51:33.6791320Z default_selector() : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6794207Z accelerator_selector() : fpga, Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2 [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6796865Z cpu_selector() : cpu, Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6799087Z gpu_selector() : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6801120Z custom_selector(gpu) : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6803352Z custom_selector(cpu) : cpu, Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6805946Z custom_selector(acc) : fpga, Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2 [2024.18.10.0.08_160000]
Additional context
No response