huggingface / open-r1 Public

Fork 1.7k
Star 19.8k

Pull requests: huggingface/open-r1

Labels 11 Milestones 0

24 Open 114 Closed

Pull requests list

Update grpo.py

#325 opened Feb 14, 2025 by tpoisonooo

add text similarity for more common accuracy reward

#322 opened Feb 14, 2025 by sungatetop

fix: sft fix

#307 opened Feb 13, 2025 by pointerhacker

Fix: Default value of cosine_min_value_wrong parameter

#305 opened Feb 13, 2025 by zhangsheng377

Simplified installation requirements to support more accelerators

#303 opened Feb 13, 2025 by ji-huazhong

2

Fix eval max length

#297 opened Feb 12, 2025 by Some-random

[rewards] use dense rep penalty

#296 opened Feb 12, 2025 by kashif

Update README.md

#291 opened Feb 12, 2025 by tpoisonooo

Performance improvements of reward calculation

#286 opened Feb 11, 2025 by saidineshpola

[GRPO] generate with prompt containing the first <think> tag

#283 opened Feb 11, 2025 by kashif

2

Fix: Avoid empty keyword argument in VLLMModelConfig from Makefile

#246 opened Feb 8, 2025 by mattdepaolis

1

fix: easier environment setup; pin trl, transformers

#199 opened Feb 6, 2025 by ctjlewis

2

6

Replace the base model deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B to Qwen/Qwen2.5-1.5B-Instruct in GRPO

#198 opened Feb 5, 2025 by DVampire

Feature: Add SGLang Support to backend

#166 opened Feb 2, 2025 by jhinpan • Draft

6

Update: pinned lighteval reference to allow PyTorch 2.5+

#142 opened Jan 31, 2025 by ATaylorAerospace

[Feat] Adding minimal training for multimodal model

#136 opened Jan 31, 2025 by kcz358

1

Replace static plan of action image with dynamic mermaid file

#111 opened Jan 29, 2025 by INF800

[NOT MEANT TO MERG!] GRPO reward func for coding dataset

#105 opened Jan 29, 2025 by August-murr

1

Create data from remote api.

#102 opened Jan 29, 2025 by PoTaTo-Mika

2

Solution for Potential Inflation of Reward Metrics for Unparseable Go…

#87 opened Jan 28, 2025 by agulati18

4

feat: Added reward model according to paper.

#78 opened Jan 27, 2025 by ahmeterdempmk

4

Add Environment Test Script

#52 opened Jan 26, 2025 by sambhavnoobcoder

Add devcontainer configuration for VS Code

#33 opened Jan 25, 2025 by bhack

chore: update trl to grpo_vllm branch, move lighteval to uv

#30 opened Jan 25, 2025 by gerred

7
