Skip to content

Conversation

@cijose
Copy link
Contributor

@cijose cijose commented Oct 21, 2025

Summary

We were not passing process group when loading teacher checkpoints for distillation and this caused a bug when distilling from unsharded checkpoints. This PR fixes that.

Test plan

Tested with the multi-distillation example run from README and verified that we can load unsharded as well as sharded checkpoints for distillation

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 21, 2025
Copy link
Contributor

@MichaelRamamonjisoa MichaelRamamonjisoa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tested, LGTM!

@cijose cijose merged commit 0e35cfd into main Oct 21, 2025
1 of 2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants