@ShaneIsley
Owner

Adds an example demonstrating recursive RLM rollouts on the Oolong
benchmark. Previously, no example covered recursive calls.

  • Loads the context and question from the oolongbench/oolong-real dataset
  • Runs an RLM completion with logging enabled
  • Validates the response against the expected answer
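
For readers skimming the description, the three steps above can be sketched roughly as follows. This is not the code from the PR: the row field names (`context`, `question`, `answer`), the `complete` callable standing in for the RLM completion call, and the substring-based check are all assumptions made for illustration, and the real example may load, prompt, and validate differently.

```python
from typing import Callable, Mapping


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so formatting differences don't fail the check."""
    return " ".join(text.lower().split())


def run_example(row: Mapping[str, str], complete: Callable[[str], str]) -> bool:
    """Build a prompt from one dataset row, run the completion, and validate it.

    `row` is assumed to expose "context", "question", and "answer" keys
    (hypothetical names), and `complete` is a stand-in for whatever call
    actually performs the RLM completion.
    """
    prompt = f"{row['context']}\n\nQuestion: {row['question']}"
    response = complete(prompt)
    # Assumed validation rule: the expected answer appears in the response.
    return normalize(row["answer"]) in normalize(response)


if __name__ == "__main__":
    # Toy row and a fake completion function, used only to exercise the flow.
    toy_row = {
        "context": "Paris is the capital of France.",
        "question": "What is the capital of France?",
        "answer": "Paris",
    }
    print(run_example(toy_row, lambda prompt: "The answer is Paris."))
```

Passing the completion function in as a parameter keeps the sketch runnable without a model backend or network access; in the actual example the RLM client and the loaded dataset row would take the place of the toy values.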

Original-Author: alt-glitch (balyan.sid@gmail.com)
Upstream-PR: alexzhang13#34

@ShaneIsley ShaneIsley merged commit 767cf3d into main Jan 14, 2026
2 of 3 checks passed
@ShaneIsley ShaneIsley deleted the claude/apply-upstream-pr-bQxtq branch January 14, 2026 22:54