LLM Training on Llama2-70B (64 x H100-8)

Best iteration time: 7.8 s (Megatron-LM)

Updated 5mo ago

Evaluation Results

Method	Iteration Time (s)		Links
Megatron-LM 2025.07	7.8	47.2	-
AXLearn 2025.07	9.2	40.0	-
MaxText 2025.07	9.4	39.1	-
PyTorch FSDP 2025.07	10.6	34.7	-