Pull requests: groq/openbench
Pull requests list
chore: prune README, move extra info to docs
#336
opened Dec 4, 2025 by
lee-groq
1 of 23 tasks
feat: add chat-completions/ and responses/ provider prefixes
#335
opened Dec 4, 2025 by
lee-groq
12 of 23 tasks
feat: upgraded InspectAI, OpenAI and MCP versions to support Gemini 3 Pro Preview
#333
opened Dec 2, 2025 by
geelen
7 of 23 tasks
feat: Add extract_eval_stats.py script for eval statistics extraction
#332
opened Dec 1, 2025 by
geelen
6 of 23 tasks
chore(deps): bump the actions group across 1 directory with 6 updates
dependencies
#331
opened Dec 1, 2025 by
dependabot
bot
feat: add CI/CD workflow to validate tagging of MCQEval tasks
#330
opened Dec 1, 2025 by
nmayorga7
8 of 19 tasks
feat: implement model-graded flag for use with MCQEvals
#329
opened Dec 1, 2025 by
nmayorga7
10 of 21 tasks
feat(exercism): adding codex, claude, gemini and hidden tests flag
#325
opened Nov 25, 2025 by
lvogel04
10 of 23 tasks
feat(prbench): add PRBench integration
#311
opened Nov 16, 2025 by
aaditgupta21
9 of 23 tasks
✨ feat: add BurnCloud as new AI model provider
#308
opened Nov 16, 2025 by
rustburn
4 tasks done
feat(cli): add --system-prompt argument for controlling model behavior
Stale
#248
opened Oct 14, 2025 by
n33levo
build(deps): bump the python-dependencies group across 1 directory with 2 updates
#152
opened Sep 8, 2025 by
dependabot
bot
feat(slide_vqa): add SlideVQA support
Stale
#148
opened Sep 4, 2025 by
damoonsh
10 of 23 tasks
feat(benchmark): add support for GSM8K benchmark, solves #143
Stale
#144
opened Sep 3, 2025 by
ceferisbarov
9 of 23 tasks