Update plan.md: reset baseline to 1.0781 BPB, reprioritize directions by dhruvjatkar · Pull Request #4 · dhruvjatkar/parameter-golf

dhruvjatkar · 2026-03-25T05:36:37Z

Summary

Sets 1.0781 BPB (PR Record: 30ep Cosine TTT on LeakyReLU² stack (3-seed mean val_bpb=1.0781) openai/parameter-golf#672, unmerged) as the new target to beat. Merged SOTA remains 1.1194.
Reorders Top 8 research directions around the constraint that TTT is maxed at 30 epochs (590s/600s eval budget). All priorities are now orthogonal to TTT.
Collapses ~1000 lines of stale Round 0-3.9 session logs into a concise historical summary with key findings.
Removes resolved blockers (flash_attn, SSH hangs, local runtime).
Adds fresh Round 1 section tracking 5 currently submitted experiments.

Test plan

Grep for 1.1194: only appears as merged SOTA context (2 occurrences)
Grep for 1.0781: primary target in 6 locations
Top 8 priorities reordered: XSA-all Add PR #672 baseline (Cosine TTT30, 1.0781 BPB) #1, Full GPTQ Update AGENTS.md: new baseline, single-agent protocol #2, SwiGLU Update CLAUDE.md: target 1.0781 BPB, single-agent protocol #3, Muon-VS Update plan.md: reset baseline to 1.0781 BPB, reprioritize directions #4
No stale Round 0/2 logs remain
Resolved blockers removed

PR openai#672 maxes TTT at 30 epochs (590s/600s eval budget), so all future improvements must be orthogonal to TTT. This update: - Sets 1.0781 BPB (PR openai#672) as the new target to beat - Reorders Top 8 directions: XSA-all confirmed at #1, Full GPTQ #2, SwiGLU #3, Muon-VS #4, aggressive quant #5, MASA openai#6, depth recurrence openai#7 with int6 risk warning, AdEMAMix openai#8 - Deprioritizes TTT-related directions already exploited by PR openai#672 - Collapses ~1000 lines of stale Round 0-3.9 session logs into a concise historical summary - Removes resolved blockers (flash_attn, SSH hangs, local runtime) - Adds fresh Round 1 section with 5 submitted experiments Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

dhruvjatkar · 2026-03-25T05:38:43Z

Merged directly to main via cherry-pick

dhruvjatkar closed this Mar 25, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update plan.md: reset baseline to 1.0781 BPB, reprioritize directions#4

Update plan.md: reset baseline to 1.0781 BPB, reprioritize directions#4
dhruvjatkar wants to merge 1 commit intomainfrom
worktree-agent-ae360f06

dhruvjatkar commented Mar 25, 2026

Uh oh!

dhruvjatkar commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dhruvjatkar commented Mar 25, 2026

Summary

Test plan

Uh oh!

dhruvjatkar commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant