Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Feedback Evaluation on DRS-BENCH IDD
Loading...
0.742
Human Score
AGENTIC-DRS
0.73992
0.74046
0.741
0.74154
Aug 14, 2025
Human Score
AIMSim Score
AIMGemini Score
AIMGPT-4o Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Human Score
AIMSim Score
AIMGemini Score
AIMGPT-4o Score
AGENTIC-DRS
Backbone=Gemini
2025.08
0.742
0.834
0.802
0.785
AGENTIC-DRS
Backbone=GPT-4o
2025.08
0.74
0.837
0.791
0.804
Feedback
Search any
task
Search any
task