MLP Evaluation on Synthetic Arithmetic Tasks
[Chart: Success Rate over time. Top result: Global-Local Pipeline, 100 (Jun 30, 2025).]
Evaluation Results
| Method | Setting | Date | Success Rate |
|---|---|---|---|
| Global-Local Pipeline | Initialization=P_s^-,... | 2025.06 | 100 |
| Global-Local Pipeline | Initialization=P_b^-,... | 2025.06 | 100 |
| Global-Local Pipeline | Initialization=P_b^-,... | 2025.06 | 100 |
| Global-Local Pipeline | Initialization=P_b^-,... | 2025.06 | 100 |
| Global-Local Pipeline | Initialization=P_s^-,... | 2025.06 | 89.4 |
| Global-Local Pipeline | Initialization=P_r^-,... | 2025.06 | 38.2 |
| Global-Local Pipeline | Initialization=P_r^-,... | 2025.06 | 10.8 |
| Reverse Order | Target Length=L = 10 | 2025.06 | 10.4 |
| Reverse Order | Target Length=L = 20 | 2025.06 | 9.4 |
| Reverse Order | Target Length=L = 13 | 2025.06 | 8.2 |
| Global-Local Pipeline | Initialization=P_r^-,... | 2025.06 | 5.9 |
| Reverse Order | Target Length=L = 30 | 2025.06 | 3.8 |
| Reverse Order | Target Length=L = 40 | 2025.06 | 0.8 |
| Global-Local Pipeline | Initialization=P_s^-,... | 2025.06 | 0.4 |