Research / Features Backlog - Potential Future Use #60

vukrosic · 2025-12-20T08:32:19Z

vukrosic
Dec 20, 2025
Maintainer

Rsearch that didn't accelerate the training speed but shows promise and code that will be used if the need arises (to avoid bloating the codebase)

Research

TurboMuon - #65

Unet connection of shallow and deep layers - PR

This code achieves a bit better (lower) loss for a worse (longer) training time. For now we decided it's not worthed the extra training time and additional code complexity. It will be reconsidered with further improvements.

Baseline: 16m 44s 139ms
With unet: 18m 38s 397ms

Features

Adding Docker - Pull Request

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Research / Features Backlog - Potential Future Use #60

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Research / Features Backlog - Potential Future Use #60

Uh oh!

Uh oh!

vukrosic Dec 20, 2025 Maintainer

Research

TurboMuon - #65

Unet connection of shallow and deep layers - PR

Features

Replies: 0 comments

vukrosic
Dec 20, 2025
Maintainer