[Fix] support converting torch_dist to hf for qwen3vl dense model #1491

p1k0pan · 2026-01-25T14:13:54Z

After training Qwen3-VL-8B with Megatron, it was unable to convert torch_dist to hf. Adding convert code.
Tested on Qwen3-VL-8B, not sure whether suitable for Qwen3-VL moe model.

support converting torch_dist to hf for qwen3vl dense model

50e2adc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Fix] support converting torch_dist to hf for qwen3vl dense model #1491

[Fix] support converting torch_dist to hf for qwen3vl dense model #1491

Uh oh!

p1k0pan commented Jan 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[Fix] support converting torch_dist to hf for qwen3vl dense model #1491

Are you sure you want to change the base?

[Fix] support converting torch_dist to hf for qwen3vl dense model #1491

Uh oh!

Conversation

p1k0pan commented Jan 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant