Deep Research on DeepResearch Bench

53.08 RACE Overall

DualGraph

Feb 14, 2026
Updated 1mo ago

Evaluation Results

Method    RACE Overall                                                  Links
2026.02   53.08         53.78   54.3    52.48   49.81   57.55   79.65
2026.02   52.54         52.65   53.2    51.82   51.83   72.11   53.38
2026.02   51.5          52.07   52      51.41   49.3    56.23   76.33
2026.02   51.17         51.69   51.79   51.16   48.75   61.15   66.06
2026.02   49.18         50.13   48.41   50.06   47.35   59.96   62.96
2026.02   46.68         47.27   45.55   48      45.93   73.18   31.7
2026.02   46.62         47.67   45.21   48.16   45.07   51.87   35.12
2026.02   46.5          47.06   45.67   47.53   45.47   20.8    2.3
2026.02   43.08         42.85   41.96   44.62   43.77   85.753.84
          42.92         41.95   44.63   43.75   43.08   59.83   61.29
2026.02   42.48         42.01   41.81   43.8    43.01   46.01   4.41
2026.02   42.44         42.36   40.85   44.41   43.11   76.03   21.69
2026.02   41.56         41.42   39.99   43.8    42.02   87.32   24.51
2026.02   41.01         40.55   38.82   43.56   42.83   68.477.28
2026.02   40.95         39.19   43.65   42.88   41.27   52.238.39
          40.74         40      39.06   43.14   42.3    37.316.72
2026.02   40.62         38.63   43.92   42.58   41.03   -       -
          40.53         39.73   38.87   42.79   42.26   51.19   11.6
2026.02   40.31         39.76   38.35   42.68   42.02   76.64   13.06
2026.02   37.29         36.26   34.51   40.46   40.61   69.598.11
2026.02   34.32         33.43   32.37   36.98   36.34   93.8    8.96
          34.21         33.36   30.52   38.18   37.64   82.634.08