-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Issues: huggingface/text-generation-inference
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
NotImplementedError: Vlm do not work with prefix caching yet
#3110
opened Mar 13, 2025 by
AndriiBihun
2 of 4 tasks
Sharding Error with max_total_tokens and max_input_tokens options in Gemma3-27B-it model
#3104
opened Mar 13, 2025 by
calycekr
[Upstream dependence changes] The behavior about env var in
hf-hub
has changed.
#3088
opened Mar 8, 2025 by
HairlessVillager
Running container rootless does not work anymore
#3082
opened Mar 7, 2025 by
scriptator
2 of 4 tasks
Llama 3.3 70B Weird , gibberish outputs in production setup
#3043
opened Feb 20, 2025 by
andresC98
2 of 4 tasks
TGI metrics don't have model_name label to indicate which model the metrics belong to
wontfix
This will not be worked on
#3026
opened Feb 17, 2025 by
yashaswipiplani
WARN text_generation_launcher: Unkown compute for card nvidia-geforce-rtx-3090
#3014
opened Feb 11, 2025 by
bmilesp
Resource underutilization, thread thrashing: CPU affinity ignores allowed CPUs and cannot be switched off
#3011
opened Feb 11, 2025 by
askervin
3 of 4 tasks
Nonsense responses with n-gram speculative decoding
#2997
opened Feb 6, 2025 by
olliestanley
1 of 4 tasks
Request failed during generation: Server error: Value out of range: -29146814772
#2994
opened Feb 5, 2025 by
AlperYildirim1
2 of 4 tasks
Mistral Small 3 : chat template with python functions causes error
#2987
opened Feb 3, 2025 by
v3ss0n
2 tasks done
Previous Next
ProTip!
Follow long discussions with comments:>50.