
non-record: LR warmdown on 1x A40 (1.723 bpb, 8.40MB) #313

Open

my-sonicase wants to merge 1 commit into openai:main from my-sonicase:submit-lr-warmdown-a40

Conversation

@my-sonicase

This PR adds a non-record submission under track_10min_16mb.

Summary:
This submission improves over a local MLX baseline (~1.87 bpb) by roughly 0.15 bpb, demonstrating that schedule tuning alone yields a meaningful gain under the 16MB constraint.

  • baseline architecture
  • no tokenizer or dataset changes
  • schedule tuning only:
    • WARMDOWN_ITERS=3600
    • MATRIX_LR=0.06
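The two knobs above describe a linear LR warmdown: hold the matrix LR at its peak, then decay it to zero over the final `WARMDOWN_ITERS` steps. A minimal sketch of that shape, assuming a decay-to-zero linear ramp (the repo's actual scheduler may differ; `lr_at` and `total_iters` are illustrative names, not code from this PR):

```python
# Values from this submission; the schedule shape is an assumed
# linear decay-to-zero warmdown, not code taken from the repo.
WARMDOWN_ITERS = 3600   # steps over which LR decays to zero
MATRIX_LR = 0.06        # peak LR for matrix parameters

def lr_at(step: int, total_iters: int) -> float:
    """Matrix LR at a given step under a linear warmdown."""
    warmdown_start = total_iters - WARMDOWN_ITERS
    if step < warmdown_start:
        return MATRIX_LR  # constant phase before the warmdown
    # Linear decay from MATRIX_LR down to 0 over the warmdown window.
    frac = (total_iters - step) / WARMDOWN_ITERS
    return MATRIX_LR * frac
```

In practice the warmdown length trades off against the constant-LR phase: a longer warmdown smooths the final loss at the cost of fewer high-LR steps.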

Result:

  • final int8+zlib roundtrip val_bpb: 1.7232
  • total submission size int8+zlib: 8,397,395 bytes
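The reported size is the weights after int8 quantization and zlib compression. A rough sketch of how such a byte count could be computed, assuming per-tensor absmax scaling over flat float lists (`int8_zlib_size` is a hypothetical helper, not the track's actual scoring code):

```python
import zlib

def int8_zlib_size(tensors) -> int:
    """Total zlib-compressed byte count of int8-quantized tensors.

    Each tensor is a flat list of floats; quantization here is a
    simple per-tensor absmax scale to [-127, 127] (an assumption,
    not necessarily the scheme the track uses).
    """
    total = 0
    for t in tensors:
        absmax = max((abs(x) for x in t), default=0.0) or 1.0
        q = bytes((round(x / absmax * 127) & 0xFF) for x in t)
        total += len(zlib.compress(q))
    return total
```

The "roundtrip" in the val_bpb figure means the model is evaluated after quantize → compress → decompress, so the reported bpb reflects the submitted artifact rather than the full-precision weights.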

Hardware:

  • 1x A40
  • 600s wallclock-capped run

This is a reproducible non-record submission demonstrating a simple improvement from training schedule tuning under the 16MB constraint.

