benchmark_litgpt - update low_precision_mode accepted values #1779

Status: Open. Wants to merge 1 commit into base: main.
Conversation

kshitij12345 (Collaborator):

With Transformer Engine (TE) 2.0, the default FP8 recipe is chosen based on the platform:

https://github.com/NVIDIA/TransformerEngine/blob/b39397c541292f336c5964dd1661d80c08dc4c78/transformer_engine/pytorch/fp8.py#L46-L51

- For B200 (Blackwell), it is MXFP8BlockScaling.
- For H100 and others, it is DelayedScaling.
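The platform-based selection described above can be sketched in terms of CUDA compute capability (a minimal illustration; the function name and the exact threshold check are assumptions for this sketch, not TE's actual code):

```python
def default_fp8_recipe_name(compute_capability):
    """Return the default FP8 recipe name for a (major, minor)
    CUDA compute capability tuple, mirroring the platform-based
    selection described above (hypothetical sketch, not TE code)."""
    major, _minor = compute_capability
    if major >= 10:  # Blackwell, e.g. B200 (sm_100)
        return "MXFP8BlockScaling"
    return "DelayedScaling"  # Hopper (H100, sm_90) and earlier
```

In practice the capability would come from `torch.cuda.get_device_capability()`; passing it in explicitly keeps the sketch testable without a GPU.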

This PR updates the benchmark_litgpt.py script so that the accepted values for --low-precision-mode are renamed from fp8-delayed-te and fp8-delayed-te-wo_layernorm to fp8-default-te and fp8-default-te-wo_layernorm, hinting that the recipe is now the platform-dependent TE default rather than always DelayedScaling.
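As a rough illustration, the renamed choices might be declared like this in an argparse-based CLI (the flag and value names are taken from the description above; the parser setup itself and the "none" default are hypothetical, and the actual script may define its options differently):

```python
import argparse

parser = argparse.ArgumentParser(description="LitGPT benchmark (sketch)")
# Hypothetical flag definition showing the renamed accepted values.
parser.add_argument(
    "--low-precision-mode",
    choices=[
        "none",
        "fp8-default-te",
        "fp8-default-te-wo_layernorm",
    ],
    default="none",
    help="FP8 mode; the TE recipe used is the platform default.",
)

# argparse maps --low-precision-mode to args.low_precision_mode.
args = parser.parse_args(["--low-precision-mode", "fp8-default-te"])
```

Passing the old value fp8-delayed-te would now fail argparse's choices validation, surfacing the rename to users immediately.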

@kshitij12345 kshitij12345 changed the title update: low_precision_mode accepted values benchmark_litgpt - update low_precision_mode accepted values Feb 18, 2025
kshitij12345 (Collaborator, Author):

cc: @AdamRajfer

@kshitij12345 kshitij12345 marked this pull request as ready for review March 3, 2025 19:16