Conversation

@jordgedu

No description provided.

@jordgedu (Author)

When I tested it, I found that this abnormal value caused a huge amount of GPU memory to be allocated.

@tgale96 (Contributor) commented Jan 2, 2024

Ah yes, a while back we were specifying the capacity factor in terms of tokens rather than as a multiple of the expected number of tokens per expert. We must have missed updating this when we changed it :)

Would you mind updating the other moe scripts as well? Thanks!
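For context, the semantic change described above can be sketched as follows. This is an illustrative function, not MegaBlocks' actual API; the name and signature are assumptions. Under the new convention, the capacity factor scales each expert's expected share of the tokens, so a value like 1.25 means "25% headroom over a perfectly balanced assignment", whereas the old token-count convention would interpret the same number as an absolute capacity:

```python
def expert_capacity(num_tokens: int, num_experts: int, capacity_factor: float) -> int:
    """Tokens each expert is sized to hold, with capacity_factor given as a
    multiple of the expected tokens per expert (not a raw token count)."""
    expected_tokens_per_expert = num_tokens / num_experts
    return int(capacity_factor * expected_tokens_per_expert)

# With 4096 tokens routed across 8 experts and capacity_factor=1.25,
# each expert is sized for 1.25x its expected share: 640 tokens.
print(expert_capacity(4096, 8, 1.25))
```

This also suggests why a leftover token-count-style value could blow up GPU memory: a number that was reasonable as an absolute capacity (e.g. hundreds or thousands) becomes a huge multiplier under the new convention.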

@tgale96 (Contributor) commented Jan 2, 2024

Also, out of curiosity - why are you using MoE, as opposed to dMoE?
