Skip to content

Pull requests: huggingface/open-r1

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Update grpo.py
#325 opened Feb 14, 2025 by tpoisonooo Loading…
add text similarity for more common accuracy reward
#322 opened Feb 14, 2025 by sungatetop Loading…
fix: sft fix
#307 opened Feb 13, 2025 by pointerhacker Loading…
Fix eval max length
#297 opened Feb 12, 2025 by Some-random Loading…
[rewards] use dense rep penalty
#296 opened Feb 12, 2025 by kashif Loading…
Update README.md
#291 opened Feb 12, 2025 by tpoisonooo Loading…
Performance improvements of reward calculation
#286 opened Feb 11, 2025 by saidineshpola Loading…
fix: easier environment setup; pin trl, transformers
#199 opened Feb 6, 2025 by ctjlewis Loading…
2
6
[Feat] Adding minimal training for multimodal model
#136 opened Jan 31, 2025 by kcz358 Loading…
Create data from remote api.
#102 opened Jan 29, 2025 by PoTaTo-Mika Loading…
feat: Added reward model according to paper.
#78 opened Jan 27, 2025 by ahmeterdempmk Loading…
Add Environment Test Script
#52 opened Jan 26, 2025 by sambhavnoobcoder Loading…
Add devcontainer configuration for VS Code
#33 opened Jan 25, 2025 by bhack Loading…
ProTip! Updated in the last three days: updated:>2025-02-11.