Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Scaling Model Validation on BrowseComp-Plus Out-of-sample (val)
Loading...
0.071
MAE
predictive framework for agentic scaling
0.06745
0.069225
0.071
0.072775
Dec 9, 2025
MAE
MAPE
Normalized MAE
Qualitative Validation Score
Kendall's τ (Ranking)
Updated 4d ago
Evaluation Results
Method
Method
Links
MAE
MAPE
Normalized MAE
Qualitative Validation Score
Kendall's τ (Ranking)
predictive framework for agentic scaling
Model=GPT-5.2, Intelli...
2025.12
0.071
15.8
0.045
-
0.2
Feedback
Search any
task
Search any
task