Commit 2624b3f
[Bugfix] Resolve MTP > 1 issue when lm head tp > 1
Previously, the dummy run executed compute_logits only once, regardless of num_speculative_tokens. As a result, execute_model hung on compute_logits when the lm head tensor-parallel size exceeded 1. The fix makes the dummy run execute compute_logits so that its calls match num_speculative_tokens.
Signed-off-by: Jade Zheng <[email protected]>
1 parent fff258b commit 2624b3f
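The hang described in the commit message can be illustrated with a toy model. It is a sketch, not the code changed in this commit. It assumes that compute_logits involves a collective operation across the lm head tensor-parallel ranks, so every rank must call it the same number of times per step. The function names, the `fixed` flag, and the one-call-per-speculative-token count are hypothetical and used only for illustration.

```python
def compute_logits_calls(num_speculative_tokens: int, is_dummy: bool, fixed: bool) -> int:
    """Number of compute_logits calls one rank makes in a step (toy model)."""
    if is_dummy and not fixed:
        # Behaviour described as the bug: a single call, whatever the setting.
        return 1
    # Assumption for this sketch: one call per speculative token.
    return num_speculative_tokens


def ranks_agree(num_speculative_tokens: int, fixed: bool) -> bool:
    """True if a real rank and a dummy-run rank issue the same number of calls.

    Under the collective assumption, a mismatch leaves one rank waiting
    for a call that its peer never makes.
    """
    real = compute_logits_calls(num_speculative_tokens, is_dummy=False, fixed=fixed)
    dummy = compute_logits_calls(num_speculative_tokens, is_dummy=True, fixed=fixed)
    return real == dummy
```

In this model the call counts only diverge when num_speculative_tokens is greater than 1, which is consistent with the "MTP > 1" condition in the commit title.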
File tree
3 files changed (+19, -9 lines)
- vllm_ascend
- spec_decode
- worker