-
Notifications
You must be signed in to change notification settings - Fork 11k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tool-call
: fix Qwen 2.5 Coder support, add micro benchmarks, support trigger patterns for lazy grammars
#12034
Open
ochafik
wants to merge
36
commits into
ggml-org:master
Choose a base branch
from
ochafik:tool-bench-prod
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from all commits
Commits
Show all changes
36 commits
Select commit
Hold shift + click to select a range
b37779b
sampler: turn lazy grammar trigger words to regexes
ochafik a456911
add scripts/tool_bench.sh & .py
ochafik 14a4388
optionally allow any spaces in json schema grammars (useful for llama…
ochafik e2ca8be
constrain llama json output regardless of function name if matches at…
ochafik 53266f9
better error when wrong function called
ochafik 7833c16
improve error message in weather test
ochafik 0e1a00e
add more models to tool_bench.sh
ochafik 44740f7
benchmark other sizes of qwen 2.5 coder
ochafik dd6eb97
rm duplicate in tool_bench.sh
ochafik 0fc6218
add missing <variant> include
ochafik 6fd4972
fix lints
ochafik 2e656f9
improve "bad" qwen triggers
ochafik fbd3c19
add cast to please some gccs
ochafik 62a1416
ditch server test request retry logic
ochafik 596ff7f
fix flake8 lints
ochafik fe6968f
nits
ochafik 1caacd5
remove any_spaces grammar option, allow extra line for airy llama jso…
ochafik 789a3e1
Update test_tool_call.py
ochafik 6493a14
test w/ beefier qwen 2.5 coder 3b
ochafik cc817a0
revert some test_hello_world diffs
ochafik ead02c6
diff
ochafik d7acf2c
Update test_tool_call.py
ochafik 0db4073
add requirements for tool_bench
ochafik 0ce606b
fix test_thoughts deepseek test expectation
ochafik a3cde16
Update README.md
ochafik 79ad623
update relaxed newline space rule in grammar tests
ochafik 3fe208a
support add_generation_prompt query parameter (useful for /apply_temp…
ochafik fe8c79b
Merge remote-tracking branch 'origin/master' into tool-bench-prod
ochafik 99d2d80
token cast tweak for gcc
ochafik c7fa19a
fix warning on gcc13 w/ uninitialized variant
ochafik 6e5a830
fix python lints
ochafik 0b5d105
fix gcc13 warning
ochafik 7bcc5af
fix pyright lints in tool_bench.py
ochafik d1f48d0
Merge remote-tracking branch 'origin/master' into tool-bench-prod
ochafik fc19192
update readme w/ link to tool call
ochafik 60f28ef
tool-bench: add --ctk, --ctv, --fa flags
ochafik File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A bit hesitating about the usage of
std::variant
here. Overall, it's OK, but it's the first time we use it in the codebase, and I'm not sure it is worth introducing it as a pattern. Having 2 separate membersvalue_token
andvalue_string
is the alternative.