add humaneval+ and mbpp+ (#2734)
Pull request merge
baberabb pushed 1 commit to main • 9b29da0…86bbf6a • 21 minutes ago
Fix the import source for eval_logger (#2735)
Pull request merge
baberabb pushed 1 commit to main • 2b2fa97…9b29da0 • 28 minutes ago
add newline at end of file
add cocoteros_es dataset (#2721)
Pull request merge
baberabb pushed 1 commit to main • 2f403fa…2b2fa97 • 13 hours ago
add Basque translation of ARC and PAWS to BasqueBench (#2732)
Pull request merge
baberabb pushed 1 commit to main • 01849b4…2f403fa • 13 hours ago
Merge branch 'main' into longcxt
add o3-mini support (#2697)
Pull request merge
baberabb pushed 1 commit to main • a9a0e3c…01849b4 • 14 hours ago
baberabb pushed 1 commit to main • 5e0b6f1…a9a0e3c • yesterday
remove unused import (#2728)
Pull request merge
baberabb pushed 1 commit to main • 0bf9f4e…5e0b6f1 • yesterday
fix missing dataset repo (#2719)
Pull request merge
baberabb pushed 1 commit to main • 1ba35e6…0bf9f4e • 3 days ago
baberabb pushed 1 commit to main • 358adaf…1ba35e6 • 4 days ago
refactor setup_logging to utils
add math_verify to some tasks (#2686)
Pull request merge
baberabb pushed 1 commit to main • 52df63b…358adaf • 4 days ago
add math_verify to minerva_math and leaderboard_math
add math_verify to pyproject
Merge branch 'main' into longcxt