Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Expert-Level Reasoning on XBench-DeepSearch 1.0 (test)

0.9Inference Accuracy

ReThinker

0.4840.5920.70.808Feb 4, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.02
0.9
2026.02
0.87
2026.02
0.78
2026.02
0.778
2026.02
0.75
2026.02
0.71
2026.02
0.706
2026.02
0.7
2026.02
0.69
2026.02
0.66
2026.02
0.537
2026.02
0.5