Skip to content

Update CLAUDE.md: target 1.0781 BPB, single-agent protocol#3

Closed
dhruvjatkar wants to merge 1 commit intomainfrom
worktree-agent-aa9dbf33
Closed

Update CLAUDE.md: target 1.0781 BPB, single-agent protocol#3
dhruvjatkar wants to merge 1 commit intomainfrom
worktree-agent-aa9dbf33

Conversation

@dhruvjatkar
Copy link
Copy Markdown
Owner

Updates CLAUDE.md to target 1.0781 BPB (PR openai#672), adds single-agent protocol, marks cron as non-functional, adds operational lessons, updates milestones.

…al lessons

- New target: 1.0781 BPB (PR openai#672, TTT_EPOCHS=30 Cosine TTT)
- Merged SOTA kept as 1.1194 for context
- Add single-agent protocol (one agent on cluster at a time)
- Add operational lessons from March 2026
- Mark crontab auto-submitter as non-functional
- Update milestones relative to 1.0781
- Update preferred source script to PR672 baseline

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
dhruvjatkar pushed a commit that referenced this pull request Mar 25, 2026
PR openai#672 maxes TTT at 30 epochs (590s/600s eval budget), so all future
improvements must be orthogonal to TTT. This update:
- Sets 1.0781 BPB (PR openai#672) as the new target to beat
- Reorders Top 8 directions: XSA-all confirmed at #1, Full GPTQ #2,
  SwiGLU #3, Muon-VS #4, aggressive quant #5, MASA openai#6,
  depth recurrence openai#7 with int6 risk warning, AdEMAMix openai#8
- Deprioritizes TTT-related directions already exploited by PR openai#672
- Collapses ~1000 lines of stale Round 0-3.9 session logs into a
  concise historical summary
- Removes resolved blockers (flash_attn, SSH hangs, local runtime)
- Adds fresh Round 1 section with 5 submitted experiments

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
dhruvjatkar pushed a commit that referenced this pull request Mar 25, 2026
PR openai#672 maxes TTT at 30 epochs (590s/600s eval budget), so all future
improvements must be orthogonal to TTT. This update:
- Sets 1.0781 BPB (PR openai#672) as the new target to beat
- Reorders Top 8 directions: XSA-all confirmed at #1, Full GPTQ #2,
  SwiGLU #3, Muon-VS #4, aggressive quant #5, MASA openai#6,
  depth recurrence openai#7 with int6 risk warning, AdEMAMix openai#8
- Deprioritizes TTT-related directions already exploited by PR openai#672
- Collapses ~1000 lines of stale Round 0-3.9 session logs into a
  concise historical summary
- Removes resolved blockers (flash_attn, SSH hangs, local runtime)
- Adds fresh Round 1 section with 5 submitted experiments

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@dhruvjatkar
Copy link
Copy Markdown
Owner Author

Merged directly to main via cherry-pick

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant