Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Information Retrieval and Question Answering on DeepSearch-QA

91.3Avg@3 Score

Claude-4.6-Opus

59.68467.89276.184.308Mar 16, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
91.3----
2026.03
80.6----
2026.03
80----
2026.03
79----
2026.03
77.4----
2026.03
77.1----
2026.03
76.9----
2026.03
72.1----
2026.03
67.9----
2026.03
60.9----
2026.02
-40.0025895.882.8
2026.02
-160.0109351.765.5
2026.02
-220.0408522.3610
2026.02
-200.0263437.069.9
2026.02
-280.0454519.9110.8
2026.02
-280.0034548.74
2026.02
-300.0191428.636.9
2026.02
-420.0495875.2611.7