X-Bench

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Deep Search | X-Bench | Score (%): 75 | 14 |
| Deep search and information seeking | x-bench DeepSearch-2510 | Accuracy: 73 | 6 |