Skip to content

Conversation

@pull
Copy link

@pull pull bot commented Nov 12, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

qiacheng and others added 4 commits November 12, 2025 15:21
IPEX is no longer needed for Intel Arc GPUs.  Removing instruction to setup ipex.
The partial unloader path in model re-use flow skips straight to the
actual unload without any check of the patching UUID. This means that
if you do an upscale flow with a model patch on an existing model, it
will not apply your patchings.

Fix by delaying the partial_unload until after the uuid checks. This
is done by making partial_unload a model of partial_load where extra_mem
is -ve.
Clean up a bunch of stacked and no-longer-needed tensors on the QWEN
VRAM peak (currently FFN).

With this I go from OOMing at B=37x1328x1328 to being able to
succesfully run B=47 (RTX5090).
@pull pull bot locked and limited conversation to collaborators Nov 12, 2025
@pull pull bot added the ⤵️ pull label Nov 12, 2025
@pull pull bot merged commit 8b0b93d into code:master Nov 12, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants