[loading] Re-add and improve disk offloading support #42242
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Force-pushed from c5d1a03 to 937e7e4
ArthurZucker
left a comment
For the weight renaming, IMO we should just use the prepared model state dict instead of the weights.
```python
# Offloading support
if param_device == "disk":
    missing_keys.discard(target_name)
    # If not already offloaded, or if we applied any special Operation, we need to re-save
    if target_name not in disk_offload_index or len(operations) > 0:
        disk_offload_index = offload_weight(
            param, target_name, disk_offload_folder, disk_offload_index
        )
```
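The diff above skips re-saving a weight unless it is new to the index or was transformed by an Operation during loading. A minimal, stdlib-only sketch of that pattern is below; the helper name `offload_weight` mirrors the diff, but the body here (JSON files, plain lists) is hypothetical, while the real implementation writes `safetensors` files:

```python
import json
import os
import tempfile


def offload_weight(param, name, folder, index):
    """Save a weight to disk and record its location in the offload index.

    Sketch only: `param` is stored as a JSON list here, whereas the real
    code serializes tensors to safetensors files. Returns the updated index.
    """
    path = os.path.join(folder, f"{name}.json")
    with open(path, "w") as f:
        json.dump(param, f)
    index[name] = {"path": path}
    return index


# Usage: mirror the diff's "skip re-saving unless needed" logic.
disk_offload_folder = tempfile.mkdtemp()
disk_offload_index = {}
operations = []  # special Operations applied during loading, if any

target_name = "model.layers.0.weight"
param = [0.1, 0.2, 0.3]

if target_name not in disk_offload_index or len(operations) > 0:
    disk_offload_index = offload_weight(
        param, target_name, disk_offload_folder, disk_offload_index
    )
```

Because the index is both the lookup table and the "already offloaded" marker, a second pass over the same weight with no pending operations is a no-op.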
This can be done inside `set_param` for the module, as it's kinda related.
I preferred to keep it outside for now, as it's actually not setting the param! (It just saves it to disk and skips loading.) So it would be a bit weird IMO to do that inside `set_param`, as it does not set it.
Force-pushed from c634b9a to f2cd562
Force-pushed from 49d6b3a to 9a0675a
ArthurZucker
left a comment
Very nice thanks!
I think there were other tests I skipped with a TODO @Cyrilvallez, but feel free to check.
Indeed I missed a few! Nice catch!
[For maintainers] Suggested jobs to run (before merge): run-slow: deepseek_vl_hybrid, glm4_moe
What does this PR do?
This PR re-adds support for weight offloading to disk via the `device_map`, which was temporarily dropped in #41580 to simplify that PR (because weights need to be re-saved correctly after performing custom Ops during dynamic loading). It further improves the offloading mechanism: everything is now offloaded in `safetensors` format, instead of the `numpy` format originally used with `accelerate`, which is much simpler and more efficient. Slow tests are exactly the same as before the big weight-loading refactor (10 failing across all models).
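The claim that a single `safetensors`-style format is simpler and more efficient than per-weight `numpy` files can be illustrated with a stdlib-only sketch: one JSON index maps each weight name to an offset and length inside a single binary blob, so any weight can be read back lazily with one seek, instead of opening one `.npy` file per weight. All names here are hypothetical and the layout only loosely mimics safetensors:

```python
import json
import os
import struct
import tempfile


def write_offloaded(weights, folder):
    """Write all weights into one binary blob plus a JSON index.

    Each index entry records the byte offset and length of a weight
    (stored as packed float32 values) inside the blob.
    """
    blob_path = os.path.join(folder, "weights.bin")
    index = {}
    with open(blob_path, "wb") as f:
        for name, values in weights.items():
            data = struct.pack(f"{len(values)}f", *values)
            index[name] = {"offset": f.tell(), "length": len(data)}
            f.write(data)
    with open(os.path.join(folder, "index.json"), "w") as f:
        json.dump(index, f)
    return blob_path, index


def read_offloaded(blob_path, index, name):
    """Lazily read back a single weight by seeking into the blob."""
    entry = index[name]
    with open(blob_path, "rb") as f:
        f.seek(entry["offset"])
        data = f.read(entry["length"])
    return list(struct.unpack(f"{entry['length'] // 4}f", data))


folder = tempfile.mkdtemp()
weights = {"w1": [1.0, 2.0], "w2": [3.0]}
blob, index = write_offloaded(weights, folder)
```

The single-blob-plus-index design is what makes on-demand loading cheap: fetching one offloaded weight touches only the bytes it owns, not the whole file.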