You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In addition to parallel degree setting constrain, there are performance issues with SP implementation.
On an L40 machine, using two GPUs is slower than using 1 GPU.
feifeibear
changed the title
Contrains on Parallel Setting for CogVideoX
Contrains on Parallel Setting for CogVideoX and Performance issue
Sep 13, 2024
Facing the restrictions on the choice of SP parallelism degree, we developed CFG parallel to improve the inference efficiency. We also run experiments on our final versions. The updated results can be found in docs/performance/cogvideo.md, where the parallel versions show reasonable speedup.
xDiT currently implements the sequential parallel version of CogVideoX. However, there are restrictions when using it:
head_num
(30 here) %ulysses_degree
== 0height
%sp_degree
== 0--height 640 --width 720
withsp_degree = 8
(uly=2, ring=4), the VAE decoder throws an error.The text was updated successfully, but these errors were encountered: