Add monarch distributed tutorial #3613
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/3613
Note: Links to docs will display an error until the docs builds have been completed.
❗ 2 Active SEVs
There are 2 currently active SEVs. If your PR is affected, please view them below:
✅ No Failures
As of commit 3a7f607 with merge base 3469d47.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Approved, but please wait for @svekars to take a look at the comment in case she has anything.
Added a couple of minor comments but LGTM overall.
One question: we don't want this to be executable, right? And should it have a notebook?
Thanks for the quick review! Addressed your feedback. There's a notebook linked at the bottom of the tutorial.
39c7dcb to 479d02d
479d02d to 1a58734
Description
This adds a tutorial demonstrating how to use Monarch with TorchTitan to spin up distributed jobs from a single controller.
We would like to link to this as part of the Monarch pytorch.org blog post.