Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[405b-SUT] Max number of output tokens #2060

Open
attafosu opened this issue Jan 28, 2025 · 1 comment
Open

[405b-SUT] Max number of output tokens #2060

attafosu opened this issue Jan 28, 2025 · 1 comment

Comments

@attafosu
Copy link
Contributor

For 405B the sampling parameter config sets the max output tokens to be 20k.
However, given the reference output distribution with max output length of 1.7k, I don't think we should set this parameter in the sampler that high.
@nvzhihanj @arjunsuresh @mrmhodak

@nvzhihanj
Copy link
Contributor

nvzhihanj commented Jan 28, 2025

max_new_tokens should be 2000 (max input length is 20000), this looks like a typo. Can you help submit a PR to patch it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants