Commit 27a942f

Review relaeted updates.
1 parent 2c35b95 commit 27a942f

2 files changed: +9 -3 lines changed

docs/debugging.md

Lines changed: 4 additions & 0 deletions
@@ -93,7 +93,11 @@ CONFIG_FILE="./torchtitan/models/llama3/train_configs/debug_model.toml" ./run_tr
 - Sets deterministic workspace configuration for CuBLAS operations
 - **Note:** This will significantly reduce training performance but ensures exact reproducibility

+<<<<<<< HEAD
 Use `--debug.deterministic_warn_only` to only warn about (not stop running) kernel without deterministic implementation.
+=======
+Use --debug.deterministic.warn_only to only warn about (not stop running) kernel without deterministic implementation.
+>>>>>>> 0b9a2b71 (Review relaeted updates.)

 ### Activation Checkipointing Debugging ###

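Review note: the hunk above adds `<<<<<<< HEAD`, `=======`, and `>>>>>>> 0b9a2b71` lines to docs/debugging.md, which look like unresolved merge conflict markers, and the two sides name the flag differently (`--debug.deterministic_warn_only` versus `--debug.deterministic.warn_only`). A minimal sketch of a check that could catch this before committing is shown here. The function name `find_conflict_markers` is hypothetical and not part of torchtitan, and the pattern is an assumption about the usual marker shape (exactly seven characters at the start of a line).

```python
import re

# Assumed marker shape: exactly seven '<', '=' or '>' at the start of a line,
# followed by a space or the end of the line.
CONFLICT_MARKER = re.compile(r"^(<{7}|={7}|>{7})( |$)")


def find_conflict_markers(text: str) -> list[tuple[int, str]]:
    """Return (1-based line number, line) pairs that look like conflict markers."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        if CONFLICT_MARKER.match(line):
            hits.append((lineno, line))
    return hits


if __name__ == "__main__":
    sample = "\n".join(
        [
            "<<<<<<< HEAD",
            "Use `--debug.deterministic_warn_only` ...",
            "=======",
            "Use --debug.deterministic.warn_only ...",
            ">>>>>>> 0b9a2b71 (Review relaeted updates.)",
        ]
    )
    for lineno, line in find_conflict_markers(sample):
        print(f"line {lineno}: {line}")
```

A Markdown setext heading underlined with exactly seven `=` characters would also match, so hits from this sketch need a manual look.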
torchtitan/models/llama4/train_configs/llama4_17bx16e.toml

Lines changed: 5 additions & 3 deletions
@@ -70,8 +70,10 @@ filter_fqns = ["output", "router.gate"]
 filter_fqns = ["output", "router.gate"]

 [debug]
-torch_deterministic = false
-torch_deterministic_warn_only = false
-torch_preserve_rng_state = false
+#seed = <integer>
+deterministic = false
+deterministic_warn_only = false
+ac_preserve_rng_state = false
 ac_determinism_check = "default"
 ac_debug = false
+moe_force_load_balance = false
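
Review note: the `[debug]` hunk above renames three keys. For configs that still carry the old names, the rename could be sketched as follows. This is an illustration only: `LEGACY_DEBUG_KEYS` and `migrate_debug_section` are hypothetical names, not torchtitan APIs, and the mapping is taken solely from the removed and added lines in this diff. The input is assumed to be the `[debug]` table already parsed into a plain dict.

```python
# Old -> new key names, read off the removed/added lines of this diff.
LEGACY_DEBUG_KEYS = {
    "torch_deterministic": "deterministic",
    "torch_deterministic_warn_only": "deterministic_warn_only",
    "torch_preserve_rng_state": "ac_preserve_rng_state",
}


def migrate_debug_section(debug: dict) -> dict:
    """Return a copy of a parsed [debug] table with legacy keys renamed.

    Raises ValueError if an old key and its new name are both present,
    since it is unclear which value should win.
    """
    migrated = {}
    for key, value in debug.items():
        new_key = LEGACY_DEBUG_KEYS.get(key, key)
        if new_key in migrated:
            raise ValueError(f"both legacy and new spellings set for {new_key!r}")
        migrated[new_key] = value
    return migrated


if __name__ == "__main__":
    old_debug = {
        "torch_deterministic": False,
        "torch_deterministic_warn_only": False,
        "torch_preserve_rng_state": False,
        "ac_determinism_check": "default",
        "ac_debug": False,
    }
    print(migrate_debug_section(old_debug))
```

Keys that the diff leaves unchanged, such as `ac_determinism_check` and `ac_debug`, pass through untouched.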
