Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Task on DRBench

43Score

DR-Rubric-8B (GPT-5)

30.20833.52936.8540.171May 31, 2026
Updated 1d ago

Evaluation Results

MethodLinks
2026.05
43
2026.05
41.5
2026.05
39.8
2026.05
39.5
2026.05
39.4
2026.05
38.7
2026.05
37.3
2026.05
35.5
2026.05
33.6
2026.05
30.7