Skip to content

Activity

Deleted branch

baberabbdeleted feature/#2733 • 
21 minutes ago

add humaneval+ and mbpp+ (#2734)

Pull request merge
baberabbpushed 1 commit to main • 9b29da0…86bbf6a • 
21 minutes ago

Fix the import source for eval_logger (#2735)

Pull request merge
baberabbpushed 1 commit to main • 2b2fa97…9b29da0 • 
28 minutes ago

add newline at end of file

bzantiumpushed 1 commit to feature/#2733 • bef5d07…6f7af78 • 
8 hours ago

add humaneval+ and mbpp+

bzantiumcreated feature/#2733 • bef5d07 • 
8 hours ago

add cocoteros_es dataset (#2721)

Pull request merge
baberabbpushed 1 commit to main • 2f403fa…2b2fa97 • 
13 hours ago

add Basque translation of ARC and PAWS to BasqueBench (#2732)

Pull request merge
baberabbpushed 1 commit to main • 01849b4…2f403fa • 
13 hours ago

Merge branch 'main' into longcxt

baberabbpushed 6 commits to longcxt • ccf4a58…1183252 • 
14 hours ago

add o3-mini support (#2697)

Pull request merge
baberabbpushed 1 commit to main • a9a0e3c…01849b4 • 
14 hours ago

Added IberoBench citation info (https://aclanthology.org/2025.coling-…

Pull request merge
baberabbpushed 1 commit to main • 5e0b6f1…a9a0e3c • 
yesterday

Deleted branch

baberabbdeleted ifeval_log • 
yesterday

remove unused import (#2728)

Pull request merge
baberabbpushed 1 commit to main • 0bf9f4e…5e0b6f1 • 
yesterday

remove unused import

baberabbcreated ifeval_log • 09da619 • 
yesterday

fix missing dataset repo (#2719)

Pull request merge
baberabbpushed 1 commit to main • 1ba35e6…0bf9f4e • 
3 days ago

Deleted branch

baberabbdeleted logging-best-practices • 
4 days ago

Logging (#2203)

Pull request merge
baberabbpushed 1 commit to main • 358adaf…1ba35e6 • 
4 days ago

refactor setup_logging to utils

baberabbpushed 1 commit to logging-best-practices • e3e6b3c…2f11680 • 
4 days ago

add logging to docs

baberabbpushed 1 commit to logging-best-practices • fea7594…e3e6b3c • 
4 days ago

Deleted branch

baberabbdeleted maths • 
4 days ago

add math_verify to some tasks (#2686)

Pull request merge
baberabbpushed 1 commit to main • 52df63b…358adaf • 
4 days ago

increment version

baberabbpushed 1 commit to maths • cd0ddaf…1015205 • 
4 days ago

add math_verify to minerva_math and leaderboard_math

baberabbpushed 1 commit to qwen_math • 47051bd…adaa79e • 
4 days ago

nit

baberabbpushed 1 commit to qwen_math • 55020c9…47051bd • 
4 days ago

nit

baberabbpushed 1 commit to qwen_math • 6e7c789…55020c9 • 
4 days ago

nit

baberabbpushed 1 commit to qwen_math • f8e9fa1…6e7c789 • 
4 days ago

add math_verify to pyproject

baberabbcreated qwen_math • f8e9fa1 • 
4 days ago

add ruler

baberabbpushed 1 commit to longcxt • 527a435…ccf4a58 • 
4 days ago

Merge branch 'main' into longcxt

baberabbpushed 33 commits to longcxt • 6042f62…527a435 • 
4 days ago

add ruler

baberabbpushed 3 commits to longcxt • 77356fb…6042f62 • 
4 days ago

nit

baberabbpushed 1 commit to longcxt • a2bc624…77356fb • 
4 days ago