Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DeepSearchQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question AnsweringDeepSearchQA
Accuracy82.67
19
DeepSearchQADeepSearchQA
Accuracy76.67
19
Long-horizon agentic taskDeepSearchQA
Performance66
18
Search-based Question AnsweringDeepSearchQA
Pass64
4
Showing 4 of 4 rows