Skip to content

Generating MFC Images and Testing Them on OSPool #935

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 94 commits into
base: master
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
94 commits
Select commit Hold shift + click to select a range
bb15510
just experimenting (#654)
Jun 15, 2025
d4875b8
updated sing version
Jun 15, 2025
e0dbc1c
corrected and directed images to be stored in images folder
Jun 15, 2025
eeb1706
corrected path
Jun 15, 2025
56ba5a4
chaning path
Jun 15, 2025
77c5ce0
another path correction
Jun 15, 2025
a12263e
added cloning of PR
Jun 15, 2025
6c64ac0
adjusted images commands
Jun 15, 2025
5234667
changed folder images=>image
Jun 15, 2025
23fff81
removed items from Build singularity Images
Jun 15, 2025
f6faafe
added --fakeroot
Jun 15, 2025
91b0e58
commenting to only generate one image
Jun 15, 2025
07ad245
added --sandbox
Jun 15, 2025
3204a2a
root path
Jun 15, 2025
999eaf6
apptainer
Jun 15, 2025
7162b30
removed stuff
Jun 15, 2025
303660b
removed fakeroot flag
Jun 15, 2025
7500195
using apptainer instead of github action
Jun 15, 2025
cac8ca9
ensuring all images are saved
Jun 15, 2025
aade580
user privileges
Jun 15, 2025
1ccbb7a
singularity files
Jun 15, 2025
65cff8f
corrected path (#654)
Jun 15, 2025
5940002
unified singularity files to avoid errors
Jun 15, 2025
b70278e
some changes
Jun 16, 2025
0728f78
experimenet with the workflow file
Jun 20, 2025
09cf1d7
removed all workflow files except containerization
Jun 20, 2025
a86aafd
edit container-image.yml
Jun 20, 2025
edde446
small edit
Jun 20, 2025
e0e1106
small edit
Jun 20, 2025
165970f
testing current workflow
Malmahrouqi3 Jul 10, 2025
d33ff5d
only cpu image
Malmahrouqi3 Jul 10, 2025
5c0eb49
another edit
Malmahrouqi3 Jul 10, 2025
d94a8c8
corrected permissions
Malmahrouqi3 Jul 10, 2025
e18a72e
figuring out runscript
Malmahrouqi3 Jul 10, 2025
35846d2
runscript edit
Malmahrouqi3 Jul 10, 2025
ab9ac8f
fixing runscript
Malmahrouqi3 Jul 10, 2025
9e1ad00
another fix to runscript
Malmahrouqi3 Jul 10, 2025
ce3214f
just checking
Malmahrouqi3 Jul 10, 2025
007f351
working concept pushed for testing
Jul 11, 2025
dcb169b
updating workflow file
Jul 11, 2025
7d2a89d
another approach: build and store then test
Jul 11, 2025
dbbc147
added double CI execution on base and fork
Malmahrouqi3 Jul 11, 2025
2798c9d
reorganized packages
Malmahrouqi3 Jul 11, 2025
a5ec16e
recent layer of nvhpc
Malmahrouqi3 Jul 11, 2025
8733924
HTConder job submission files
Malmahrouqi3 Jul 11, 2025
644f278
recnet changes
Malmahrouqi3 Jul 11, 2025
a3fc14b
modified requirements
Malmahrouqi3 Jul 18, 2025
41a6590
added ssh-keyscan
Malmahrouqi3 Jul 18, 2025
68df3d3
condor_wait to wait for all jobs to finish
Malmahrouqi3 Jul 18, 2025
55abc0d
build, store, and test all in Access Point
Malmahrouqi3 Jul 18, 2025
769ed4d
quick fix
Malmahrouqi3 Jul 18, 2025
b78c61f
added recursive flag
Malmahrouqi3 Jul 18, 2025
bc2121b
removed sudo
Malmahrouqi3 Jul 18, 2025
37ea3a6
copy all files
Malmahrouqi3 Jul 18, 2025
53cd06b
dumping logs rather than dislaying
Malmahrouqi3 Jul 18, 2025
d9c7501
updated steup instructions to remove pre-exisiting files and logs
Malmahrouqi3 Jul 18, 2025
517dafb
remove installed packages
Malmahrouqi3 Jul 18, 2025
d6af295
Migrate Images to ODSF
Malmahrouqi3 Jul 18, 2025
1c9479a
migrate images prior to testing
Malmahrouqi3 Jul 18, 2025
3932e8c
corrected mv
Malmahrouqi3 Jul 18, 2025
f9544f3
made image builds occur on gh runner
Malmahrouqi3 Jul 18, 2025
72172ab
corrected image dir
Malmahrouqi3 Jul 18, 2025
0958266
corrected mv to be inside ssh
Malmahrouqi3 Jul 18, 2025
028a116
cleanup added to clear some space
Malmahrouqi3 Jul 18, 2025
13b0c20
testing out two images only
Malmahrouqi3 Jul 18, 2025
30d8a5e
checking gpu image
Malmahrouqi3 Jul 18, 2025
ad8673d
clear up some space
Malmahrouqi3 Jul 18, 2025
f55ff1b
modified gpu images to compatible base container
Malmahrouqi3 Jul 19, 2025
88e6aae
using nv runtime base container instead of devel cuda
Malmahrouqi3 Jul 20, 2025
64ec9c6
added python package
Malmahrouqi3 Jul 20, 2025
06d9412
update list of python packages
Malmahrouqi3 Jul 20, 2025
055a148
just experimenting with py packages
Malmahrouqi3 Jul 20, 2025
8352bbd
just experimenting with stuff
Malmahrouqi3 Jul 20, 2025
92f79fa
new gpu container
Malmahrouqi3 Jul 20, 2025
7064e36
removed stuff
Malmahrouqi3 Jul 20, 2025
308a21c
compiler vars set
Malmahrouqi3 Jul 20, 2025
6eb8b1d
just debugging env var paths
Malmahrouqi3 Jul 20, 2025
bf6014e
updated gpu image and experimenting
Malmahrouqi3 Jul 20, 2025
5bb059a
updated stuff
Malmahrouqi3 Jul 20, 2025
b1bbe27
env into workflow file
Malmahrouqi3 Jul 20, 2025
4734eb6
modified Singularity.gpu
Malmahrouqi3 Jul 20, 2025
a7eb958
new gpu base container nvhpc:23.11-devel-cuda12.3-ubuntu22.04
Malmahrouqi3 Jul 20, 2025
51c6584
updated environment
Malmahrouqi3 Jul 20, 2025
cec680d
cleanup into workflow
Malmahrouqi3 Jul 21, 2025
af7b85d
added Free up disk space
Malmahrouqi3 Jul 21, 2025
20c6cc2
more cleanup
Malmahrouqi3 Jul 22, 2025
8769729
included --tmpdir /tmp/mfc_tmp
Malmahrouqi3 Jul 22, 2025
ba0adfd
restoring all commented commands
Malmahrouqi3 Jul 22, 2025
9e0d458
added hpl
Malmahrouqi3 Jul 22, 2025
fc71dcb
added mkdir mfc_tmp for each image
Malmahrouqi3 Jul 22, 2025
8a07f7d
mkdir MFC
Jul 22, 2025
a3e96b4
added hpl package
Jul 22, 2025
9e4360c
simplified yaml file
Jul 22, 2025
4d41695
something
Jul 22, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
68 changes: 0 additions & 68 deletions .github/workflows/bench.yml

This file was deleted.

127 changes: 0 additions & 127 deletions .github/workflows/cleanliness.yml

This file was deleted.

76 changes: 76 additions & 0 deletions .github/workflows/container-image.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,76 @@
name: Build Singularity Images

on:
push:

jobs:
Build-singularity-images:
name: Build Singularity Images
runs-on: ubuntu-latest
steps:
- name: Clone - PR
uses: actions/checkout@v4
with:
path: pr

- name: Build & Store Images
run: |
sudo apt-get update
sudo apt-get install -y software-properties-common openssh-client
sudo add-apt-repository -y ppa:apptainer/ppa
sudo apt-get install -y apptainer
sudo apptainer config fakeroot --enable $(whoami)

sudo apt-get clean
sudo rm -rf /tmp/* /var/lib/apt/lists/* /opt/hostedtoolcache /usr/local/lib/android /usr/share/dotnet /opt/ghc /usr/local/share/boost /usr/lib/jvm /home/runner/.cache /home/runner/work/_temp
docker system prune -af || true

mkdir -p ~/.ssh
echo "${{secrets.SSH_PRIVATE_KEY}}" >~/.ssh/id_rsa
chmod 600 ~/.ssh/id_rsa
ssh-keyscan -H ap40.uw.osg-htc.org >> ~/.ssh/known_hosts

mkdir -p /tmp/mfc_tmp
ssh ${{secrets.SSH_USER}} "mkdir -p MFC"

declare -a images=("cpu" "cpu_bench" "gpu" "gpu_bench")
for img in "${images[@]}"; do
sif="mfc_${img}.sif"
sub="mfc_${img}.sub"
def="Singularity.${img}"
(cd pr/.github/workflows/images && sudo apptainer build --force --tmpdir /tmp/mfc_tmp "$sif" "$def")
scp "pr/.github/workflows/images/$sif" "pr/.github/workflows/images/$sub" ${{secrets.SSH_USER}}:MFC/
ssh ${{secrets.SSH_USER}} "mv /home/mohammed.al-mahrouqi/MFC/$sif /ospool/ap40/data/mohammed.al-mahrouqi"
rm -rf "pr/.github/workflows/images/$sif"
rm -rf ~/.apptainer
sudo rm -rf /tmp/* /var/lib/apt/lists/* /opt/hostedtoolcache /usr/local/lib/android /usr/share/dotnet /opt/ghc /usr/local/share/boost /usr/lib/jvm /home/runner/.cache /home/runner/work/_temp
sudo apt-get clean
mkdir -p /tmp/mfc_tmp
done

scp pr/.github/workflows/images/run_mfc.sh ${{secrets.SSH_USER}}:MFC/run_mfc.sh

- name: Test Images
run: |
scp pr/.github/workflows/images/*.sub ${{secrets.SSH_USER}}:MFC/

ssh ${{secrets.SSH_USER}} "
condor_submit MFC/mfc_cpu.sub
condor_submit MFC/mfc_cpu_bench.sub
condor_submit MFC/mfc_gpu.sub
condor_submit MFC/mfc_gpu_bench.sub

condor_wait logfile
"

- name: Migrate Images from ODSF
run: |
scp -r ${{secrets.SSH_USER}}:/ospool/ap40/data/mohammed.al-mahrouqi/* pr/.github/workflows/images/

- name: Upload Artifacts
uses: actions/upload-artifact@v4
if: always()
with:
name: singularity-images
path: |
pr/.github/workflows/images/*
48 changes: 0 additions & 48 deletions .github/workflows/coverage.yml

This file was deleted.

Loading