Skip to content

Conversation

BasicCoder
Copy link

Try to get a valid output of the model, i.e. do not include the input of the model in the case of non-streaming

@Columpio
Copy link

@BasicCoder just pass exclude_input_in_output:True to tensorrt_llm/config.pbtxt.

Still you can transform this PR into making the above option default as many people struggle with it being disabled by default.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants