Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

USP method with para_attn of Chengzy in Flux #474

Open
TuanNT-ZenAI opened this issue Mar 6, 2025 · 1 comment
Open

USP method with para_attn of Chengzy in Flux #474

TuanNT-ZenAI opened this issue Mar 6, 2025 · 1 comment

Comments

@TuanNT-ZenAI
Copy link

TuanNT-ZenAI commented Mar 6, 2025

In Para_Attn, Chengzy use hybird Ring and Ulysses, why in this repo use only Ulysses (with 2 GPU)?

@feifeibear
Copy link
Collaborator

The USP (Unified Sequence Parallelism) approach in xDiT is flexible, allowing you to configure it as needed.

For example, you can use Ring parallelism on 2 GPUs by setting ulysses_degree=1 and ring_degree=2. This flexibility enables users to adapt the parallelism strategy based on their specific hardware and performance requirements.

Also, we are happy to notice that USP which is first proposed by the team of xDiT has been widely adopted in other repos.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants