Commit f1d41a1
Support rope cache indexing using positions (#2112)
Add support for indexing the rope cache using `position_ids`. This may be
needed during:
1. inference, where `position_ids` is passed into the transformer forward
2. CP load balancing, where the rope cache must be indexed by the given
position IDs
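
The idea can be illustrated with a minimal, framework-free sketch. It is not the code from this commit: the function names `precompute_rope_cache` and `select_rope_cache`, the nested-list layout, and the default `theta` are assumptions made for illustration only. Without `position_ids`, the first `seq_len` rows of the cache are shared by every sequence; with `position_ids`, one cache row is gathered per token.

```python
import cmath


def precompute_rope_cache(head_dim, max_seq_len, theta=10000.0):
    """Hypothetical helper: cache[pos][i] = exp(1j * pos * theta**(-2i/head_dim))."""
    freqs = [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]
    return [[cmath.exp(1j * pos * f) for f in freqs] for pos in range(max_seq_len)]


def select_rope_cache(cache, seq_len, position_ids=None):
    """Hypothetical helper: pick the cache rows used to rotate each token.

    position_ids is None: return rows 0..seq_len-1 once, shared across the batch.
    position_ids given:   a list of per-sequence position lists; gather one
                          cache row per token, so positions may be arbitrary.
    """
    if position_ids is None:
        return [cache[:seq_len]]
    return [[cache[p] for p in row] for row in position_ids]


cache = precompute_rope_cache(head_dim=4, max_seq_len=8)

# Default path: contiguous positions 0, 1, 2.
default_rows = select_rope_cache(cache, seq_len=3)

# Explicit positions: the first sequence holds tokens at positions 5 and 6,
# the second at positions 0 and 1 (e.g. non-contiguous shards of a sequence).
picked_rows = select_rope_cache(cache, seq_len=2, position_ids=[[5, 6], [0, 1]])
```

In a tensor framework the same gather would typically be a single indexing operation on the cache tensor rather than nested list comprehensions.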
Test:
Ran DeepSeek-V3 16B base:
<img width="489" height="286" alt="image"
src="https://github.com/user-attachments/assets/6f463d65-a0de-413d-ab19-770db9983dbb"
/>
Also tested in https://github.com/wwwjn/torchtitan/pull/1/files when
passing `position_ids`:
<img width="665" height="269" alt="image"
src="https://github.com/user-attachments/assets/70e4bddc-0334-4dbf-b00d-6e4b49a94655"
/>
---------
Co-authored-by: JessicaZhong <[email protected]>
1 parent 1ebd914 commit f1d41a1
File tree
8 files changed, +226 -47 lines changed
- torchtitan/models
  - deepseek_v3
    - infra
    - model
  - llama3
    - infra
    - model
  - llama4
    - infra
    - model
  - qwen3
    - infra
    - model