Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Feedback Evaluation on DRS-BENCH GDE
Loading...
0.762
Human Score
AGENTIC-DRS
0.74328
0.74814
0.753
0.75786
Aug 14, 2025
Human Score
AIMSim Score
AIMGemini Score
AIMGPT-4o Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Human Score
AIMSim Score
AIMGemini Score
AIMGPT-4o Score
AGENTIC-DRS
Backbone=GPT-4o
2025.08
0.762
0.881
0.829
0.832
AGENTIC-DRS
Backbone=Gemini
2025.08
0.744
0.851
0.792
0.795
Feedback
Search any
task
Search any
task