Skip loading external weight data during static analysis. by chinazhangchao · Pull Request #379 · microsoft/winml-cli

chinazhangchao · 2026-04-22T07:26:51Z

Problem

Running winml analyze on large models with external data (e.g., Qwen3-8B with a 30.5 GB .data sidecar) causes the process to consume all available memory and disk, hanging indefinitely. The analyzer called onnx.load(path, load_external_data=True), loading the entire weight file into RAM despite never inspecting weight values.

Root Cause

The static analyzer only needs graph structure (operator types, shapes, connectivity, and small embedded constants) to perform op-support checks. Three call sites were loading or attempting to access the full weight tensors unnecessarily:

ONNXStaticAnalyzer.analyze() — explicitly passed load_external_data=True
ONNXLoader.load() — called bare onnx.load() which defaults to load_external_data=True
RuntimeCheckerQuery — called numpy_helper.to_array() on every initializer and embedded full TensorProtos into single-node models

Changes

| File | Change |
| --- | --- |
| analyzer.py | `load_external_data=True` → `False` |
| onnx_loader.py | Add explicit `load_external_data=False` |
| runtime_checker_query.py | For external-data initializers: extract shape from `dims` instead of `to_array()`, and emit graph inputs instead of embedding empty tensors in single-node models |

Impact

Models with external data (Qwen3-8B, Llama, etc.) now use on the order of megabytes of RAM instead of 30+ GB
No behavior change for models with inline weights (the data_location != EXTERNAL path is unchanged)
849 unit tests pass, 0 failures

Performance (39.65% improvement)

| Model | Operators | Time |
| --- | --- | --- |
| Qwen/Qwen3-8B | 5333 | 57.36 s |
| dbmdz/bert-large-cased-finetuned-conll03-english | 2663 | 28.395 s |

DingmaomaoBJTU

Overall good fix for a real and impactful problem — static analysis never needs multi-GB weight tensors, and the three-site fix is comprehensive. A few items to address before merging.

DingmaomaoBJTU

Good work addressing the previous round of feedback — the is_constant=False fix, test coverage additions, and the raw_data comment all look solid. The three-site fix is correct and comprehensive. A few new items for this round:

Non-inline note — _collect_node_tags / ALL_INPUTS_CONSTANT:
External-data initializers are still in self.initializers, so a node whose inputs are all unloaded external weights would be tagged ALL_INPUTS_CONSTANT even though weight data is unavailable. This is informational-only and doesn't affect runtime check results, but could confuse future debugging. Consider filtering external-data initializers without loaded data in that check.

chinazhangchao added 2 commits April 22, 2026 15:26

Do not load external data when analyze

b4b5415

Merge branch 'main' into chao/largemodel

37063e7

chinazhangchao changed the title ~~Do not load external data when analyze~~ Skip loading external weight data during static analysis. Apr 22, 2026

chinazhangchao marked this pull request as ready for review April 22, 2026 07:37

chinazhangchao requested a review from a team as a code owner April 22, 2026 07:37

chinazhangchao requested review from DingmaomaoBJTU and vortex-captain April 22, 2026 07:37

vortex-captain reviewed Apr 22, 2026

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py Outdated

vortex-captain reviewed Apr 22, 2026

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py Outdated

vortex-captain reviewed Apr 22, 2026

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py

DingmaomaoBJTU requested changes Apr 22, 2026

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py Outdated

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py

chinazhangchao and others added 5 commits April 23, 2026 14:03

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

ee1350f

…to chao/largemodel

Update src/winml/modelkit/analyze/core/runtime_checker_query.py

3e812e9

Co-authored-by: vortex-captain <75063846+vortex-captain@users.noreply.github.com>

Update src/winml/modelkit/analyze/core/runtime_checker_query.py

5e0089a

Co-authored-by: vortex-captain <75063846+vortex-captain@users.noreply.github.com>

fix comments

289404c

fix comments

e22dcf1

chinazhangchao requested review from DingmaomaoBJTU and vortex-captain April 23, 2026 09:55

github-advanced-security AI found potential problems Apr 23, 2026

Comment thread tests/unit/analyze/core/test_runtime_checker.py Fixed

Comment thread tests/unit/analyze/core/test_runtime_checker_query_helpers.py Dismissed

chinazhangchao and others added 8 commits April 23, 2026 03:05

Potential fix for pull request finding 'CodeQL / Module is imported w…

889c70a

…ith 'import' and 'import from'' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

Potential fix for pull request finding 'CodeQL / Module is imported w…

b52d269

…ith 'import' and 'import from'' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

Merge branch 'main' into chao/largemodel

7cc4544

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

876e8f0

…to chao/largemodel

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

d8492dd

…to chao/largemodel

fix test

09ef033

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

6363edc

…to chao/largemodel

fix lint

4fb32c6

DingmaomaoBJTU reviewed Apr 27, 2026

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py

Comment thread src/winml/modelkit/analyze/core/runtime_checker_query.py

chinazhangchao added 2 commits April 27, 2026 16:26

fix comments

b62c552

Merge branch 'main' of https://github.com/microsoft/WinML-ModelKit in…

4bdc6d0

…to chao/largemodel

chinazhangchao requested a review from DingmaomaoBJTU April 27, 2026 08:27

Merge branch 'main' into chao/largemodel

f92fea3

vortex-captain approved these changes Apr 28, 2026

chinazhangchao enabled auto-merge (squash) April 28, 2026 02:51

chinazhangchao disabled auto-merge April 28, 2026 02:51

DingmaomaoBJTU approved these changes Apr 28, 2026

chinazhangchao enabled auto-merge (squash) April 28, 2026 02:55

chinazhangchao merged commit 1877cd4 into main Apr 28, 2026
9 checks passed

chinazhangchao deleted the chao/largemodel branch April 28, 2026 02:56

xieofxie mentioned this pull request Apr 28, 2026

fix(pattern): skip ONNX validation when external data is unloaded #412

Closed

chinazhangchao merged 18 commits into main from chao/largemodel

4 participants
