You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Enabling rope_fusion with cp > 1 gives incorrect loss for Qwen3 MoE 30b for long context training. Workaround for now is to disable rope_fusion for cp > 1, need to investigate root cause for proper fix.