fix: [Memory Estimator] Fix text config attribute fetching for new multimodal models (e.g., Qwen-3-VL)#67

Merged
ISEEKYAN merged 2 commits into ISEEKYAN:main from LZY2275:main
Jan 20, 2026

Conversation

LZY2275 (Contributor) commented Jan 20, 2026

Background
Most modern multimodal models built on top of text models (e.g., Qwen-3-VL) encapsulate core text-related attributes (vocab_size, max_position_embeddings, hidden_size, etc.) in hf_config.text_config. However, some older multimodal models (e.g., Qwen2-VL-7B-Instruct) keep these attributes at the top level of the config.

The original code fetched these attributes only from the top-level config, which worked for older models like Qwen2-VL but failed for newer ones like Qwen-3-VL, whose text attributes are nested in text_config.

Changes Made
Add support for attributes nested in text_config:

  • Fetch vocab_size and max_position_embeddings from either the top-level config or the text_config sub-object
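The fallback described above can be sketched as a small helper. This is an illustrative sketch, not the PR's actual code: the helper name, the lookup order (text_config first, then top level), and the sample attribute values are all assumptions for demonstration.

```python
from types import SimpleNamespace

def get_text_attr(hf_config, name, default=None):
    """Fetch a text-model attribute from an HF config.

    Prefers the nested text_config (newer multimodal models such as
    Qwen-3-VL), falling back to the top-level config (older models
    such as Qwen2-VL). Lookup order is an assumption for this sketch.
    """
    text_config = getattr(hf_config, "text_config", None)
    if text_config is not None and hasattr(text_config, name):
        return getattr(text_config, name)
    return getattr(hf_config, name, default)

# Mock configs with made-up values, standing in for real HF configs.
old_style = SimpleNamespace(vocab_size=152064, max_position_embeddings=32768)
new_style = SimpleNamespace(
    text_config=SimpleNamespace(vocab_size=151936, max_position_embeddings=262144)
)

print(get_text_attr(old_style, "vocab_size"))  # 152064 (top level)
print(get_text_attr(new_style, "vocab_size"))  # 151936 (from text_config)
```

With a helper like this, the memory estimator can treat both config layouts uniformly instead of special-casing each model family.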

@LZY2275 changed the title from "fix: Fix text config attribute fetching for new multimodal models (e.g., Qwen-3-VL)" to "fix: [Memory Estimator] Fix text config attribute fetching for new multimodal models (e.g., Qwen-3-VL)" on Jan 20, 2026
ISEEKYAN (Owner) commented:
Thank you @LZY2275 for your interest in the memory estimator. I have not tested it on Qwen-VL models, and no vision part has been developed; I think only the language part can currently be precisely estimated. Do you have any estimation results on Qwen-VL? Do you think it is necessary to precisely estimate the vision part? Feel free to discuss with me!

@ISEEKYAN ISEEKYAN merged commit 63bd59a into ISEEKYAN:main Jan 20, 2026
