Skip to content

reduce_over_sub_group.cpp failing on native_cpu #16415

Open
@lbushi25

Description

@lbushi25

Describe the bug

The performance test SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp is failing on native_cpu. Upon resolution, please remove XFAIL from test source.

To reproduce

FAIL: SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp (8 of 8)
******************** TEST 'SYCL :: PerformanceTests/Reduction/reduce_over_sub_group.cpp' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
# RUN: at line 1
/__w/llvm/llvm/toolchain/bin//clang++  -Werror  -fsycl -fsycl-targets=spir64,native_cpu  /__w/llvm/llvm/llvm/sycl/test-e2e/PerformanceTests/Reduction/reduce_over_sub_group.cpp -o /__w/llvm/llvm/build-e2e/PerformanceTests/Reduction/Output/reduce_over_sub_group.cpp.tmp.out
# executed command: /__w/llvm/llvm/toolchain/bin//clang++ -Werror -fsycl -fsycl-targets=spir64,native_cpu /__w/llvm/llvm/llvm/sycl/test-e2e/PerformanceTests/Reduction/reduce_over_sub_group.cpp -o /__w/llvm/llvm/build-e2e/PerformanceTests/Reduction/Output/reduce_over_sub_group.cpp.tmp.out
# .---command stderr------------
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIhEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xbb): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xd6): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testItEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x208): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x226): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIjEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x358): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x373): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testImEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x4a8): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x4c3): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIaEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x5fb): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x616): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIsEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x748): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x766): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIiEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x898): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x8b3): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIlEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0x9e8): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xa03): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIN4sycl3_V16detail9half_impl4halfEEvvENKUlRNS1_7handlerEE_clES6_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xb46): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xb6b): undefined reference to `__mux_sub_group_broadcast_i32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIfEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xe67): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0xe83): undefined reference to `__mux_sub_group_broadcast_f32'
# | /usr/bin/ld: /tmp/lit-tmp-co4cvrku/reduce_over_sub_group-1ac783.o: in function `_ZTSZZ4testIdEvvENKUlRN4sycl3_V17handlerEE_clES3_EUlNS1_7nd_itemILi1EEEE_.NativeCPUKernel':
# | llvm-link:(.text+0xfe7): undefined reference to `__mux_sub_group_broadcast_i64'
# | /usr/bin/ld: llvm-link:(.text+0x1003): undefined reference to `__mux_sub_group_broadcast_f64'
# | clang++: error: linker command failed with exit code 1 (use -v to see invocation)
# `-----------------------------
# error: command failed with exit status: 1

Environment

  • OS: Any
  • Target device and vendor: native_cpu
  • DPC++ version: da317d04df8c6ffc873dfe1943c1c7c26e8de1e5
  • Dependencies version:
- 2024-12-17T21:51:33.5779673Z �[36;1msycl-ls --verbose�[0m
2024-12-17T21:51:33.5780436Z shell: sh -e {0}
2024-12-17T21:51:33.5780864Z env:
2024-12-17T21:51:33.5781299Z   LIT_FILTER: PerformanceTests/
2024-12-17T21:51:33.5781958Z   CMAKE_PREFIX_PATH: /runtimes/oneapi-tbb/env/..
2024-12-17T21:51:33.5782676Z   CPATH: /runtimes/oneapi-tbb/env/../include
2024-12-17T21:51:33.5783693Z   LD_LIBRARY_PATH: /__w/llvm/llvm/toolchain/lib/:/runtimes/oneapi-tbb/env/../lib/intel64/gcc4.8
2024-12-17T21:51:33.5784819Z   LIBRARY_PATH: /runtimes/oneapi-tbb/env/../lib/intel64/gcc4.8
2024-12-17T21:51:33.5785683Z   PKG_CONFIG_PATH: /runtimes/oneapi-tbb/env/../lib/pkgconfig
2024-12-17T21:51:33.5786445Z   TBBROOT: /runtimes/oneapi-tbb/env/..
2024-12-17T21:51:33.5787440Z   PATH: /__w/llvm/llvm/toolchain/bin/:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
2024-12-17T21:51:33.5788417Z ##[endgroup]
2024-12-17T21:51:33.6363228Z [level_zero:gpu][level_zero:0] Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6363940Z [opencl:gpu][opencl:0] Intel(R) OpenCL Graphics, Intel(R) Iris(R) Xe Graphics OpenCL 3.0 NEO  [24.39.31294.12]
2024-12-17T21:51:33.6364412Z [opencl:cpu][opencl:1] Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6364945Z [opencl:fpga][opencl:2] Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2  [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6365391Z [native_cpu:cpu][native_cpu:0] SYCL_NATIVE_CPU, SYCL Native CPU 0.1 [0.0.0]
2024-12-17T21:51:33.6365584Z 
2024-12-17T21:51:33.6366425Z Platforms: 5
2024-12-17T21:51:33.6366566Z Platform [#1]:
2024-12-17T21:51:33.6366703Z     Version  : 1.6
2024-12-17T21:51:33.6366924Z     Name     : Intel(R) oneAPI Unified Runtime over Level-Zero
2024-12-17T21:51:33.6367155Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6367338Z     Devices  : 1
2024-12-17T21:51:33.6367477Z         Device [#0]:
2024-12-17T21:51:33.6367627Z         Type              : gpu
2024-12-17T21:51:33.6367795Z         Version           : 12.0.0
2024-12-17T21:51:33.6367990Z         Name              : Intel(R) Iris(R) Xe Graphics
2024-12-17T21:51:33.6368208Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6368406Z         Driver            : 1.6.31294.120000
2024-12-17T21:51:33.6368594Z         UUID              : 13412873154100002000000
2024-12-17T21:51:33.6368785Z         DeviceID          : 39497
2024-12-17T21:51:33.6368958Z         Num SubDevices    : 0
2024-12-17T21:51:33.6369124Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6370909Z         Aspects           : gpu fp16 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_pci_address ext_intel_gpu_eu_count ext_intel_gpu_eu_simd_width ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice atomic64 ext_intel_device_info_uuid ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_memory_clock_rate ext_intel_memory_bus_width ext_intel_legacy_image ext_intel_esimd ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_limited_graph ext_oneapi_private_alloca ext_oneapi_queue_profiling_tag ext_oneapi_virtual_mem ext_oneapi_virtual_functions
2024-12-17T21:51:33.6374960Z         info::device::sub_group_sizes: 8 16 32
2024-12-17T21:51:33.6375204Z         Architecture: intel_gpu_tgllp
2024-12-17T21:51:33.6375396Z Platform [#2]:
2024-12-17T21:51:33.6375542Z     Version  : OpenCL 3.0 
2024-12-17T21:51:33.6375717Z     Name     : Intel(R) OpenCL Graphics
2024-12-17T21:51:33.6375928Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6376101Z     Devices  : 1
2024-12-17T21:51:33.6376228Z         Device [#0]:
2024-12-17T21:51:33.6376363Z         Type              : gpu
2024-12-17T21:51:33.6376679Z         Version           : OpenCL 3.0 NEO 
2024-12-17T21:51:33.6376879Z         Name              : Intel(R) Iris(R) Xe Graphics
2024-12-17T21:51:33.6377079Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6377257Z         Driver            : 24.39.31294.12
2024-12-17T21:51:33.6377432Z         UUID              : 13412873154100002000000
2024-12-17T21:51:33.6377605Z         DeviceID          : 39497
2024-12-17T21:51:33.6377771Z         Num SubDevices    : 0
2024-12-17T21:51:33.6377928Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6379165Z         Aspects           : gpu fp16 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_pci_address ext_intel_gpu_eu_count ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice atomic64 ext_intel_device_info_uuid ext_oneapi_srgb ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_legacy_image ext_intel_esimd ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_private_alloca ext_oneapi_atomic16 ext_oneapi_virtual_functions
2024-12-17T21:51:33.6380418Z         info::device::sub_group_sizes: 8 16 32
2024-12-17T21:51:33.6380610Z         Architecture: intel_gpu_tgllp
2024-12-17T21:51:33.6380773Z Platform [#3]:
2024-12-17T21:51:33.6380902Z     Version  : OpenCL 3.0 LINUX
2024-12-17T21:51:33.6381057Z     Name     : Intel(R) OpenCL
2024-12-17T21:51:33.6381215Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6381375Z     Devices  : 1
2024-12-17T21:51:33.6381500Z         Device [#1]:
2024-12-17T21:51:33.6381631Z         Type              : cpu
2024-12-17T21:51:33.6381791Z         Version           : OpenCL 3.0 (Build 0)
2024-12-17T21:51:33.6382078Z         Name              : 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz
2024-12-17T21:51:33.6382287Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6382469Z         Driver            : 2024.18.10.0.08_160000
2024-12-17T21:51:33.6382649Z         DeviceID          : 526017
2024-12-17T21:51:33.6543853Z         Num SubDevices    : 0
2024-12-17T21:51:33.6544081Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6545480Z         Aspects           : cpu fp16 fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations usm_system_allocations ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice usm_atomic_host_allocations usm_atomic_shared_allocations atomic64 ext_oneapi_srgb ext_oneapi_native_assert ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_intel_legacy_image ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_oneapi_private_alloca ext_oneapi_virtual_functions
2024-12-17T21:51:33.6546895Z         info::device::sub_group_sizes: 4 8 16 32 64
2024-12-17T21:51:33.6547098Z         Architecture: x86_64
2024-12-17T21:51:33.6547261Z Platform [#4]:
2024-12-17T21:51:33.6547473Z     Version  : OpenCL 1.2 Intel(R) FPGA SDK for OpenCL(TM), Version 20.3
2024-12-17T21:51:33.6547755Z     Name     : Intel(R) FPGA Emulation Platform for OpenCL(TM)
2024-12-17T21:51:33.6547975Z     Vendor   : Intel(R) Corporation
2024-12-17T21:51:33.6548146Z     Devices  : 1
2024-12-17T21:51:33.6548284Z         Device [#2]:
2024-12-17T21:51:33.6548429Z         Type              : fpga
2024-12-17T21:51:33.6548599Z         Version           : OpenCL 1.2 
2024-12-17T21:51:33.6548800Z         Name              : Intel(R) FPGA Emulation Device
2024-12-17T21:51:33.6549007Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6549204Z         Driver            : 2024.18.10.0.08_160000
2024-12-17T21:51:33.6549390Z         DeviceID          : 526017
2024-12-17T21:51:33.6752831Z         Num SubDevices    : 0
2024-12-17T21:51:33.6754179Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6762258Z         Aspects           : accelerator fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations ext_intel_gpu_slices ext_intel_gpu_subslices_per_slice ext_intel_gpu_eu_count_per_subslice usm_atomic_host_allocations usm_atomic_shared_allocations ext_oneapi_srgb ext_intel_gpu_hw_threads_per_eu ext_intel_device_id ext_oneapi_ballot_group ext_oneapi_fixed_size_group ext_oneapi_opportunistic_group ext_oneapi_tangle_group ext_intel_fpga_task_sequence ext_oneapi_private_alloca
2024-12-17T21:51:33.6769624Z         info::device::sub_group_sizes: 4 8 16 32 64
2024-12-17T21:51:33.6770811Z         Architecture: unknown
2024-12-17T21:51:33.6771610Z Platform [#5]:
2024-12-17T21:51:33.6772245Z     Version  : 0.1
2024-12-17T21:51:33.6772914Z     Name     : SYCL_NATIVE_CPU
2024-12-17T21:51:33.6773714Z     Vendor   : tbd
2024-12-17T21:51:33.6774373Z     Devices  : 1
2024-12-17T21:51:33.6774999Z         Device [#0]:
2024-12-17T21:51:33.6775737Z         Type              : cpu
2024-12-17T21:51:33.6776593Z         Version           : 0.1
2024-12-17T21:51:33.6777421Z         Name              : SYCL Native CPU
2024-12-17T21:51:33.6778340Z         Vendor            : Intel(R) Corporation
2024-12-17T21:51:33.6779241Z         Driver            : 0.0.0
2024-12-17T21:51:33.6780130Z         DeviceID          : UNKNOWN
2024-12-17T21:51:33.6781039Z         Num SubDevices    : 0
2024-12-17T21:51:33.6781927Z         Num SubSubDevices : 0
2024-12-17T21:51:33.6784998Z         Aspects           : cpu fp16 fp64 online_compiler online_linker queue_profiling usm_device_allocations usm_host_allocations usm_shared_allocations usm_system_allocations usm_atomic_host_allocations usm_atomic_shared_allocations atomic64
2024-12-17T21:51:33.6788095Z         info::device::sub_group_sizes: 1
2024-12-17T21:51:33.6789016Z         Architecture: unknown
2024-12-17T21:51:33.6791320Z default_selector()      : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6794207Z accelerator_selector()  : fpga, Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2  [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6796865Z cpu_selector()          : cpu, Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6799087Z gpu_selector()          : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6801120Z custom_selector(gpu)    : gpu, Intel(R) oneAPI Unified Runtime over Level-Zero, Intel(R) Iris(R) Xe Graphics 12.0.0 [1.6.31294.120000]
2024-12-17T21:51:33.6803352Z custom_selector(cpu)    : cpu, Intel(R) OpenCL, 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80GHz OpenCL 3.0 (Build 0) [2024.18.10.0.08_160000]
2024-12-17T21:51:33.6805946Z custom_selector(acc)    : fpga, Intel(R) FPGA Emulation Platform for OpenCL(TM), Intel(R) FPGA Emulation Device OpenCL 1.2  [2024.18.10.0.08_160000]

Additional context

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    Need infoSome clarifications are needed from the reporterbugSomething isn't workingconfirmednative-cpu

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions