Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Web-agent QA on BrowseComp

2.7F1 (Avg)

ReAct + Tree-GRPO

1.141.5451.952.355Sep 25, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.09
2.7
2025.09
2.6
2025.09
2.4
2025.09
2.4
2025.09
2.3
2025.09
2.2
2025.09
1.3
2025.09
1.2