Commit 2b31193
add pipeline benchmark for kvzch
Summary:
# context
* modify the kvzch benchmark configs to better represent the real use case
* add config pass-in to the test models
* fix small bugs and do minor refactoring
# changes
* the previous kv-zch embedding table was too small, so the prefetch process was too short; after this change (increased table size) the prefetch process is longer
{F1983784711} {F1983784733}
# benchmark
|short name |GPU Runtime (P90)|CPU Runtime (P90)|GPU Peak Mem alloc (P90)|GPU Peak Mem reserved (P90)|GPU Mem used (P90)|Malloc retries (P50/P90/P100)|CPU Peak RSS (P90)|
|--|--|--|--|--|--|--|--|
|regular-base |9864.51 ms |9403.68 ms |33.77 GB |49.66 GB |50.71 GB |0.0 / 0.0 / 0.0 |30.65 GB |
|kvzch-base |18804.26 ms |44245.82 ms |25.28 GB |36.33 GB |37.38 GB |0.0 / 0.0 / 0.0 |31.18 GB |
|base-inplace |20141.71 ms |46805.58 ms |25.28 GB |34.39 GB |35.44 GB |0.0 / 0.0 / 0.0 |31.19 GB |
|kvzch-sdd |20382.59 ms |45647.02 ms |33.42 GB |47.52 GB |48.56 GB |0.0 / 0.0 / 0.0 |31.13 GB |
|kvzch-prefetch |17951.19 ms |38598.57 ms |33.45 GB |47.16 GB |48.21 GB |0.0 / 0.0 / 0.0 |30.83 GB |
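To make the table easier to compare, here is a minimal sketch that computes the percentage change in P90 runtime of each configuration relative to a chosen baseline. The runtimes are copied from the table above; the helper names `relative_change` and `compare_to` are hypothetical and are not part of the benchmark code in this commit.

```python
# GPU and CPU P90 runtimes in milliseconds, copied from the benchmark table.
runtimes_ms = {
    "regular-base": (9864.51, 9403.68),
    "kvzch-base": (18804.26, 44245.82),
    "base-inplace": (20141.71, 46805.58),
    "kvzch-sdd": (20382.59, 45647.02),
    "kvzch-prefetch": (17951.19, 38598.57),
}


def relative_change(value, baseline):
    """Percentage change of value versus baseline (negative means faster)."""
    return (value - baseline) / baseline * 100.0


def compare_to(baseline_name, table=runtimes_ms):
    """Map each config name to its (GPU %, CPU %) change versus the baseline."""
    base_gpu, base_cpu = table[baseline_name]
    return {
        name: (
            round(relative_change(gpu, base_gpu), 2),
            round(relative_change(cpu, base_cpu), 2),
        )
        for name, (gpu, cpu) in table.items()
    }


if __name__ == "__main__":
    # Hypothetical helper output: one row per config, relative to kvzch-base.
    for name, (gpu_pct, cpu_pct) in compare_to("kvzch-base").items():
        print(f"{name:15s} GPU {gpu_pct:+7.2f}%  CPU {cpu_pct:+7.2f}%")
```

Relative to `kvzch-base`, the `kvzch-prefetch` row works out to roughly 4.5% lower GPU runtime and 12.8% lower CPU runtime, while `base-inplace` and `kvzch-sdd` are both slower on GPU.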
Differential Revision: D842683611
parent 7b3effd, commit 2b31193
File tree
4 files changed, +34 -15 lines changed
- torchrec/distributed
- benchmark
- yaml
- test_utils