Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dynamic Mixture of Experts (DynMoE) implementation on New-feature tasks
Loading...
60.4
Session Duration
PithTrain
57.192
78.846
100.5
122.154
May 29, 2026
Session Duration
Active GPU Time
Agent Turns
Context Size (K Tokens)
Output Tokens (K)
Updated 2d ago
Evaluation Results
Method
Method
Links
Session Duration
Active GPU Time
Agent Turns
Context Size (K Tokens)
Output Tokens (K)
PithTrain
Framework=PithTrain
2026.05
60.4
41.9
76
146
76.4
PithTrain
Framework=PithTrain
2026.05
63
39.9
90
176.6
107.7
TorchTitan
Framework=TorchTitan
2026.05
71.4
51.9
87
164.6
85.3
Megatron-LM
Framework=Megatron-LM
2026.05
83.8
49.1
199
208
115.2
Megatron-LM
Framework=Megatron-LM
2026.05
88.5
58.7
145
188.7
117
TorchTitan
Framework=TorchTitan
2026.05
140.6
94.4
197
228.8
161.3
Feedback
Search any
task
Search any
task