Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

build: Remove libnccl-net before creating symlink #818

Merged
merged 1 commit into from
Mar 21, 2025

Conversation

bwbarrett
Copy link
Contributor

f71aea2 changed the default library name from libnccl-net.so to libnccl-net-ofi.so, and then created a symlink from libnccl-net-ofi.so to libnccl-net.so. The commit missed the upgrade path, where the install directory already contained a libnccl-net.so file, causing the symlink creation to fail.

This commit removes the libnccl-net.so file/symlink before creating the symlink, fixing the upgrade path.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

f71aea2 changed the default library name from libnccl-net.so to
libnccl-net-ofi.so, and then created a symlink from libnccl-net-ofi.so
to libnccl-net.so.  The commit missed the upgrade path, where the
install directory already contained a libnccl-net.so file, causing
the symlink creation to fail.

This commit removes the libnccl-net.so file/symlink before creating
the symlink, fixing the upgrade path.

Signed-off-by: Brian Barrett <[email protected]>
@bwbarrett
Copy link
Contributor Author

Sigh; this was a dumb bug. Sorry everyone.

@bwbarrett bwbarrett marked this pull request as ready for review March 21, 2025 01:25
@bwbarrett bwbarrett requested a review from a team as a code owner March 21, 2025 01:26
@bwbarrett bwbarrett merged commit b23102f into aws:master Mar 21, 2025
23 checks passed
@bwbarrett bwbarrett deleted the bugfix/update-across-symlink-fix branch March 21, 2025 01:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants