Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Deep Research on SQA v2

88.3Score

DR Tulu-8B (RL)

19.55637.40355.2573.097Nov 24, 2025
Updated 16d ago

Evaluation Results

MethodLinks
2025.11
88.3
2025.11
87.7
2025.11
79.6
2025.11
74.8
2025.11
72.3
2025.11
69.8
2025.11
67.3
2025.11
61.1
2025.11
57.2
2025.11
46.7
2025.11
46.5
2025.11
45.2
2025.11
42.5
2025.11
41.9
2025.11
40.4
2025.11
32.9
2025.11
26.9
2025.11
22.2