Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Error Forecasting on Who&When
Loading...
100
Eta (%)
ALL-at-Once
23.8616
43.6283
63.395
83.1617
Mar 12, 2026
Eta (%)
Step Accuracy
Updated 2mo ago
Evaluation Results
Method
Method
Links
Eta (%)
Step Accuracy
ALL-at-Once
Look-ahead=Full
2026.03
100
10.37
AgenTracer
Look-ahead=Full
2026.03
100
31.89
Famas
Look-ahead=Full
2026.03
100
29.35
Random
Look-ahead=None
2026.03
50
14.36
MASC
Look-ahead=None
2026.03
42.19
21.62
PROMAS
Look-ahead=None
2026.03
26.79
22.97
Feedback
Search any
task
Search any
task