Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agentic Task on DRBench
Loading...
43
Score
DR-Rubric-8B (GPT-5)
30.208
33.529
36.85
40.171
May 31, 2026
Score
Updated 1d ago
Evaluation Results
Method
Method
Links
Score
DR-Rubric-8B (GPT-5)
Training=SFT+RL, 1K
2026.05
43
DR-Rubric-8B (Gemini)
Training=SFT+RL, 1K
2026.05
41.5
DR-Tulu-SFT-8B
Training=SFT, 16K
2026.05
39.8
DR-Rubric-8B (BS-3)
Training=SFT+RL, 3K
2026.05
39.5
Qwen3-8B-SFT
Training=SFT, 1K
2026.05
39.4
Qwen3-8B
2026.05
38.7
WebExplorer-8B
Training=SFT+RL, 25K
2026.05
37.3
DR-Tulu-RL-8B
Training=SFT+RL, 25K
2026.05
35.5
Search-R1-7B
Training=RL, 90K
2026.05
33.6
Qwen2.5-7B
2026.05
30.7
Feedback
Search any
task
Search any
task