Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Task on ResearchQA

73.7Score

DR-Rubric-8B (GPT-5)

62.67665.53868.471.262May 31, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.05
73.7
2026.05
72.4
2026.05
71.7
2026.05
69.9
2026.05
67.1
2026.05
66.7
2026.05
66.6
2026.05
66.1
2026.05
65.5
2026.05
63.1