Skip to content

Record: Cosine TTT scheduling with per-layer lr (mean val_bpb=1.0970,…

c8e034d
Select commit
Loading
Failed to load commit list.
Closed

Record: Cosine TTT scheduling with per-layer lr — mean val_bpb=1.0970 (3 seeds) #481

Record: Cosine TTT scheduling with per-layer lr (mean val_bpb=1.0970,…
c8e034d
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs