add EESSI/2025.06 module file (+ populate scripts for all CPU targets in 2025.06 software layer) #30

Open
wants to merge 8 commits into
base: main
from

bedroge
Contributor

@bedroge bedroge commented Jul 7, 2025

This is basically the same as #6, but with an additional change on top: it moves the easystacks of the GPU support scripts to a version-specific subdirectory (2023.06), and modifies the script accordingly. Instead of using a hardcoded list of easystacks, it will just find all files named like eessi-${EESSI_VERSION}-*-CUDA-host-injections.yml (maybe it could even just find *.yml?) in easystacks/$EESSI_VERSION.
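As a rough illustration of the lookup described above, here is a minimal Python sketch. The script changed in this PR is a shell script, so this is not its actual code; the function name `find_easystacks` and the `easystacks_root` argument are assumptions made for the example. It collects every file in `easystacks/<version>` whose name matches the version-specific pattern, instead of relying on a hardcoded list.

```python
from pathlib import Path


def find_easystacks(easystacks_root, eessi_version):
    """Return sorted host-injection easystack paths for one EESSI version.

    Hypothetical helper that mirrors the glob
    eessi-${EESSI_VERSION}-*-CUDA-host-injections.yml in easystacks/$EESSI_VERSION.
    """
    version_dir = Path(easystacks_root) / eessi_version
    if not version_dir.is_dir():
        # No version-specific subdirectory: nothing to pick up.
        return []
    pattern = f"eessi-{eessi_version}-*-CUDA-host-injections.yml"
    # Sorting makes the processing order deterministic.
    return sorted(version_dir.glob(pattern))


if __name__ == "__main__":
    # "easystacks" is an assumed relative path, used only for this demo.
    for path in find_easystacks("easystacks", "2023.06"):
        print(path)
```

Switching the pattern to `*.yml`, as floated above, would only require changing the `pattern` line, at the cost of also picking up any unrelated YAML file placed in that directory.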

@bedroge
Contributor Author

bedroge commented Jul 7, 2025

Let's give it a try:

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen2
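For readers unfamiliar with the command format, the following Python sketch shows how such a `bot: build` line breaks down into `key:value` arguments. It is only an illustration and not the bot's own parser; the function name `parse_build_command` is hypothetical.

```python
def parse_build_command(comment_line):
    """Split a 'bot: build key:value ...' line into a dict of its arguments.

    Illustrative only; the real bot may parse and validate commands differently.
    """
    prefix = "bot: build"
    line = comment_line.strip()
    if not line.startswith(prefix):
        raise ValueError(f"not a build command: {comment_line!r}")
    args = {}
    for token in line[len(prefix):].split():
        # Split on the first colon only, so values may contain further colons.
        key, sep, value = token.partition(":")
        if not sep or not value:
            raise ValueError(f"expected key:value, got {token!r}")
        args[key] = value
    return args


if __name__ == "__main__":
    cmd = ("bot: build repo:eessi.io-2023.06-software "
           "instance:eessi-bot-mc-aws arch:x86_64/amd/zen2")
    print(parse_build_command(cmd))
```

Here `repo` selects the target repository, `instance` the bot instance, and `arch` the CPU target; replacing `/` with `-` in the `arch` value gives the `x86_64-amd-zen2` form that appears in the bot's report.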

@eessi-bot-aws

eessi-bot-aws bot commented Jul 7, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2025.07/pr_30/74512

date job status comment
Jul 07 12:19:26 UTC 2025 submitted job id 74512 awaits release by job manager
Jul 07 12:19:37 UTC 2025 released job awaits launch by Slurm scheduler
Jul 07 12:25:39 UTC 2025 running job 74512 is running
Jul 07 12:29:43 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-74512.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-17518911420.tar.gz
size: 0 MiB (3132 bytes)
entries: 1
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
2023.06/init/modules/EESSI/2023.06.lua
Jul 07 12:29:43 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:x86_64_amd_zen2+default
P: perf: 425.676 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:x86_64_amd_zen2+default
P: perf: 446.811 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:x86_64_amd_zen2+default
P: latency: 1.87 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 1.82 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 4.06 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 6.31 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:x86_64_amd_zen2+default
P: latency: 0.59 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:x86_64_amd_zen2+default
P: latency: 0.57 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:x86_64_amd_zen2+default
P: bandwidth: 7370.89 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:x86_64_amd_zen2+default
P: bandwidth: 7237.64 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-74512.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
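The build-step checks in the report above (no `FATAL:`, `ERROR:`, `FAILED:` or `required modules missing:` message, plus the presence of `No missing installations` and `.tar.gz created!`) can be pictured with a small Python sketch. It assumes plain substring matching on the job output, which may differ from what the bot actually does; `check_build_output` is a hypothetical name.

```python
# Strings taken from the build-step checks listed in the bot report.
MUST_NOT_MATCH = ["FATAL:", "ERROR:", "FAILED:", "required modules missing:"]
MUST_MATCH = ["No missing installations", ".tar.gz created!"]


def check_build_output(text):
    """Map each check description to True (passed) or False (failed).

    Assumption: every check is a literal substring test on the job output.
    """
    results = {}
    for needle in MUST_NOT_MATCH:
        results[f"no message matching {needle}"] = needle not in text
    for needle in MUST_MATCH:
        results[f"found message matching {needle}"] = needle in text
    return results


if __name__ == "__main__":
    sample = "No missing installations\nsomething.tar.gz created!\n"
    for description, passed in check_build_output(sample).items():
        print("OK  " if passed else "FAIL", description)
```

Under this reading, a build counts as successful only if every entry in the returned dict is `True`.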

@bedroge
Contributor Author

bedroge commented Jul 7, 2025

Oops, that was built for the wrong repo... (though we did add a comment to the 2023.06 module file, so in the end we may want to build for both versions). Let's try 2025.06 now.

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws arch:x86_64/amd/zen2

@eessi-bot-aws

eessi-bot-aws bot commented Jul 7, 2025

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 for repository eessi.io-2025.06-software in job dir /project/def-users/SHARED/jobs/2025.07/pr_30/74513

date job status comment
Jul 07 13:16:23 UTC 2025 submitted job id 74513 awaits release by job manager
Jul 07 13:16:49 UTC 2025 released job awaits launch by Slurm scheduler
Jul 07 13:17:51 UTC 2025 running job 74513 is running
Jul 07 13:20:54 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-74513.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-17518943520.tar.gz
size: 13 MiB (14287477 bytes)
entries: 16634
modules under 2025.06/software/linux/x86_64/amd/zen2/modules/all
EasyBuild/5.1.1.lua
EESSI-extend/2025.06-easybuild.lua
software under 2025.06/software/linux/x86_64/amd/zen2/software
EasyBuild/5.1.1
EESSI-extend/2025.06-easybuild
other under 2025.06/software/linux/x86_64/amd/zen2
2025.06/init/arch_specs/eessi_arch_arm.spec
2025.06/init/arch_specs/eessi_arch_ppc.spec
2025.06/init/arch_specs/eessi_arch_riscv.spec
2025.06/init/arch_specs/eessi_arch_x86.spec
2025.06/init/bash
2025.06/init/easybuild/eb_hooks.py
2025.06/init/eessi_archdetect.sh
2025.06/init/eessi_defaults
2025.06/init/eessi_environment_variables
2025.06/init/eessi_software_subdir_for_host.py
2025.06/init/lmod/bash
2025.06/init/lmod/csh
2025.06/init/lmod_eessi_archdetect_wrapper_accel.sh
2025.06/init/lmod_eessi_archdetect_wrapper.sh
2025.06/init/lmod/fish
2025.06/init/lmod/ksh
2025.06/init/lmod/zsh
2025.06/init/Magic_Castle/bash
2025.06/init/Magic_Castle/eessi_python3
2025.06/init/minimal_eessi_env
2025.06/init/modules/EESSI/2025.06.lua
2025.06/init/README.md
2025.06/init/test.py
2025.06/scripts/gpu_support/nvidia/install_cuda_and_libraries.sh
2025.06/scripts/gpu_support/nvidia/install_cuda_host_injections.sh
2025.06/scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh
2025.06/scripts/utils.sh
.lmod/lmodrc.lua
.lmod/SitePackage.lua
Jul 07 13:20:54 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
✅ job output file slurm-74513.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Contributor Author

bedroge commented Jul 7, 2025

The 2025.06 tarball no longer contains any 2023.06 host-injection easystacks (unlike the build in #6 (comment), which did), so I think this looks good.
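A check like this could also be scripted rather than done by reading the file listing. The Python sketch below is one possible way, using the standard `tarfile` module; the function name `host_injection_easystacks` is hypothetical, and the filename pattern is the one given in the PR description.

```python
import tarfile
from fnmatch import fnmatchcase


def host_injection_easystacks(tarball_path, eessi_version):
    """List tarball members that look like host-injection easystacks
    for the given EESSI version (illustrative helper, not part of this PR)."""
    # Leading '*' lets the pattern match at any directory depth.
    pattern = f"*eessi-{eessi_version}-*-CUDA-host-injections.yml"
    with tarfile.open(tarball_path, "r:gz") as tar:
        return [name for name in tar.getnames() if fnmatchcase(name, pattern)]


if __name__ == "__main__":
    import sys

    # Usage: python check_tarball.py <tarball> <version>
    for member in host_injection_easystacks(sys.argv[1], sys.argv[2]):
        print(member)
```

An empty result for version `2023.06` on the 2025.06 tarball would correspond to the observation above.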

2 participants