Skip to content

Test utils funtion nvidia #49

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 8 commits into
base: main
Choose a base branch
from

Conversation

laraPPr
Copy link
Contributor

@laraPPr laraPPr commented Aug 1, 2025

No description provided.

@laraPPr
Copy link
Contributor Author

laraPPr commented Aug 1, 2025

bot: build instance:eessi-bot-vsc-ugent repo:eessi.io-2023.06-software arch:zen3 accel:nvidia/cc80

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Aug 1, 2025

New job on instance eessi-bot-vsc-ugent for CPU micro-architecture x86_64-amd-zen3 and accelerator nvidia/cc80 for repository eessi.io-2023.06-softwarein job dir/scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.08/pr_49/15518539`

date job status comment
Aug 01 09:31:09 UTC 2025 submitted job id 15518539 awaits release by job manager
Aug 01 09:32:12 UTC 2025 released job awaits launch by Slurm scheduler
Aug 01 09:42:16 UTC 2025 running job 15518539 is running
Aug 01 09:44:17 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-15518539.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-17540413190.tar.gzsize: 0 MiB (419088 bytes)
entries: 40
modules under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/modules/all
pmt/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
software under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software
pmt/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
reprod directories under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80
2023.06/scripts/utils.sh
Aug 01 09:44:17 UTC 2025 test result
😢 FAILURE (click triangle for details)
Reason
EESSI test suite produced failures.
ReFrame Summary
[ FAILED ] Ran 1/9 test case(s) from 9 check(s) (1 failure(s), 8 skipped, 0 aborted)
Details
✅ job output file slurm-15518539.out
❌ found message matching ERROR:
❌ found message matching [\s*FAILED\s*].*Ran .* test case

@laraPPr
Copy link
Contributor Author

laraPPr commented Aug 1, 2025

Command 'nvidia-smi' found. Installing NVIDIA drivers for use in prefix shell...

/kyukon/scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.08/pr_49/event_3ffa3520-6eba-11f0-99f9-9a7e2153aa74/run_000/linux_x86_64_amd_zen3/eessi.io-2023.06-software/scripts/utils.sh: line 156: /scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh: No such file or directory

Ok so #22 is broken. It does try to run /scripts/gpu_support/nvidia/link_nvidia_host_libraries.sh but cannot find it.

@laraPPr
Copy link
Contributor Author

laraPPr commented Aug 1, 2025

bot: build instance:eessi-bot-vsc-ugent repo:eessi.io-2023.06-software arch:zen3 accel:nvidia/cc80

@laraPPr
Copy link
Contributor Author

laraPPr commented Aug 1, 2025

bot: help instance:eessi-bot-vsc-ugent

@eessi-bot-aws
Copy link

eessi-bot-aws bot commented Aug 1, 2025

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command help instance:eessi-bot-vsc-ugent from laraPPr

    • expanded format: help instance:eessi-bot-vsc-ugent
  • handling command help instance:eessi-bot-vsc-ugent resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-deucalion
Copy link

eessi-bot-deucalion bot commented Aug 1, 2025

Updates by the bot instance eessi-bot-deucalion (click for details)
  • received bot command help instance:eessi-bot-vsc-ugent from laraPPr

    • expanded format: help instance:eessi-bot-vsc-ugent
  • handling command help instance:eessi-bot-vsc-ugent resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-surf
Copy link

eessi-bot-surf bot commented Aug 1, 2025

Updates by the bot instance eessi-bot-surf (click for details)
  • received bot command help instance:eessi-bot-vsc-ugent from laraPPr

    • expanded format: help instance:eessi-bot-vsc-ugent
  • handling command help instance:eessi-bot-vsc-ugent resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@eessi-bot-jsc
Copy link

eessi-bot-jsc bot commented Aug 1, 2025

Updates by the bot instance eessi-bot-jsc (click for details)
  • received bot command help instance:eessi-bot-vsc-ugent from laraPPr

    • expanded format: help instance:eessi-bot-vsc-ugent
  • handling command help instance:eessi-bot-vsc-ugent resulted in:
    How to send commands to bot instances

    • Commands must be sent with a new comment (edits of existing comments are ignored).
    • A comment may contain multiple commands, one per line.
    • Every command begins at the start of a line and has the syntax bot: COMMAND [ARGUMENTS]*
    • Currently supported COMMANDs are: help, build, show_config, status

    For more information, see https://www.eessi.io/docs/bot

@laraPPr
Copy link
Contributor Author

laraPPr commented Aug 1, 2025

bot: build instance:eessi-bot-vsc-ugent repo:eessi.io-2023.06-software arch:zen3 accel:nvidia/cc80

@gpu-bot-ugent
Copy link

gpu-bot-ugent bot commented Aug 1, 2025

New job on instance eessi-bot-vsc-ugent for CPU micro-architecture x86_64-amd-zen3 and accelerator nvidia/cc80 for repository eessi.io-2023.06-softwarein job dir/scratch/gent/vo/002/gvo00211/SHARED/jobs/2025.08/pr_49/15518563`

date job status comment
Aug 01 12:09:52 UTC 2025 submitted job id 15518563 awaits release by job manager
Aug 01 12:10:27 UTC 2025 released job awaits launch by Slurm scheduler
Aug 01 12:12:30 UTC 2025 running job 15518563 is running
Aug 01 12:14:32 UTC 2025 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-15518563.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-17540503150.tar.gzsize: 0 MiB (418473 bytes)
entries: 40
modules under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/modules/all
pmt/1.2.0-GCCcore-12.3.0-CUDA-12.1.1.lua
software under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software
pmt/1.2.0-GCCcore-12.3.0-CUDA-12.1.1
reprod directories under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/reprod
no reprod directories in tarball
other under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80
2023.06/scripts/utils.sh
Aug 01 12:14:32 UTC 2025 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ SKIP ] (1/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (2/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (3/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (4/9) Skipping GPU test : only 1 GPU available for this test case
[ SKIP ] (5/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (6/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (7/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ SKIP ] (8/9) Skipping test : 1 GPU(s) available for this test case, need exactly 2
[ OK ] (9/9) EESSI_LAMMPS_lj %device_type=gpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos-CUDA-12.1.1 %scale=1_4_node /497af4b1 @BotBuildTests:x86_64_amd_zen3_accel_nvidia_cc80+default
P: perf: 4149.921 timesteps/s (r:0, l:None, u:None)
[ PASSED ] Ran 1/9 test case(s) from 9 check(s) (0 failure(s), 8 skipped, 0 aborted)
Details
✅ job output file slurm-15518563.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants