General Reasoning on AGIEval (en)
[Chart: Speedup Ratio over time. Top entry: MTP-D ensemble, 2.132 (Mar 25, 2026). Updated 23d ago.]
Evaluation Results
| Method | Details | Date | Speedup Ratio |
|---|---|---|---|
| MTP-D ensemble | Loop strategy=4 to 8,... | 2026.03 | 2.132 |
| MTP-D | Loop strategy=4 to 8,... | 2026.03 | 2.071 |
| MTP-D | Loop strategy=4 to 8,... | 2026.03 | 2.068 |
| MTP-D | Loop strategy=1 to 8,... | 2026.03 | 1.964 |
| MTP-D | Loop strategy=4, Train... | 2026.03 | 1.94 |
| MTP | Loop strategy=4 to 8,... | 2026.03 | 1.87 |
| MTP-D | Loop strategy=4 to 8,... | 2026.03 | 1.851 |
| MTP-D | Loop strategy=4 to 16,... | 2026.03 | 1.766 |
| MTP | Loop strategy=1 to 8,... | 2026.03 | 1.764 |
| MTP-D | Loop strategy=4 to 8 t... | 2026.03 | 1.735 |
| MTP-D | Loop strategy=1 to 16,... | 2026.03 | 1.535 |
| MTP-D | Loop strategy=4 to 16,... | 2026.03 | 1.51 |
| MTP-D | Loop strategy=1 to 8,... | 2026.03 | 1.337 |
| MTP-D | Loop strategy=1, Train... | 2026.03 | 1.128 |
| MTP-D | Loop strategy=1 to 16,... | 2026.03 | 1.062 |