Long-form deep research on DeepResearch Bench (test)

Overall Score: 48.24

Gemini-2.5-Pro Deep Research

[Chart: Overall Score, Jan 25, 2026]
Updated 4d ago

Evaluation Results

Method    Links
2026.01   48.24   48.24   47.62   48.74   48.97   85.7127
2026.01   47.2    47.07   45.84   48.98   47.64   65.334.15
2026.01   45.63   43.53   44.32   48.17   48.14   35.9    6.69
2026.01   45.47   43.5    44.07   48.21   48.08   24.8    2.51
2026.01   44.33   43.14   42.28   47.61   45.97   25.492.19
2026.01   43.89   43.61   42.19   47.3    45.07   53.5231.17
2026.01   43.88   42.3    42      47.35   45.78   17.8    1.86
2026.01   43.69   42.07   42.76   46.15   45.14   -       -
2026.01   42.77   41.34   40.09   46.82   45.15   32.722.33
2026.01   42.29   41.45   43.78   44.53   42.23   89.5110.39
2026.01   41.83   39.85   40.27   44.74   44.44   60.638.3
2026.01   41.79   40.04   38.95   46.22   43.97   19.611.56
2026.01   40.51   39.15   37.71   43.98   43.421.691.36