
Scaling Model Validation on BrowseComp-Plus Out-of-sample (val)

0.071 MAE

predictive framework for agentic scaling

[Chart: MAE (y-axis ticks 0.06745, 0.069225, 0.071, 0.072775) by date; Dec 9, 2025]
Updated 1mo ago

Evaluation Results

Method | Links
2025.12
0.071 | 15.8 | 0.045 | -0.2