Skip to content

Fix skypilot archive mount bug#288

Merged
hemildesai merged 1 commit intoNVIDIA-NeMo:mainfrom
ri-roee:skypilot-mount-fix
Jul 10, 2025
Merged

Fix skypilot archive mount bug#288
hemildesai merged 1 commit intoNVIDIA-NeMo:mainfrom
ri-roee:skypilot-mount-fix

Conversation

@ri-roee
Copy link
Copy Markdown
Contributor

@ri-roee ri-roee commented Jul 10, 2025

The symptom of this bug was the following error: (head, rank=0, pid=3432) bash: cd: /nemo_run/code: No such file or directory. This happens because the archived file is not mounted correctly, which is what this PR aims to fix.

I've tested this with a Run/Skypilot setup and by monkey-patching my local copy of Run was able to bypass the issue.

Signed-off-by: Roee Landesman <roeeland@cisco.com>
@ri-roee ri-roee force-pushed the skypilot-mount-fix branch from 725e6e3 to 71adf45 Compare July 10, 2025 19:03
Copy link
Copy Markdown
Contributor

@hemildesai hemildesai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the contribution 🎉

@hemildesai hemildesai merged commit 6550ff6 into NVIDIA-NeMo:main Jul 10, 2025
19 of 21 checks passed
@ri-roee
Copy link
Copy Markdown
Contributor Author

ri-roee commented Jul 10, 2025

Thanks for the contribution 🎉

Sure thing! Anecdotally, I also had to monkey-patch this bug to get skypilot running locally with nemo run (arm64 mac) NVIDIA-NeMo/NeMo#13936

@hemildesai
Copy link
Copy Markdown
Contributor

Thanks for the contribution 🎉

Sure thing! Anecdotally, I also had to monkey-patch this bug to get skypilot running locally with nemo run (arm64 mac) NVIDIA/NeMo#13936

Thanks for the info, we will try to merge that PR soon.

zoeyz101 pushed a commit to zoeyz101/NeMo-Run that referenced this pull request Nov 12, 2025
Signed-off-by: Roee Landesman <roeeland@cisco.com>
Signed-off-by: Zoey Zhang <zozhang@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants