Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Feedback Evaluation on DRS-BENCH Infographic
Loading...
70.8
Human Score
AGENTIC-DRS
68.096
68.798
69.5
70.202
Aug 14, 2025
Human Score
AIMSim Score
Gemini Score
GPT-4o Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Human Score
AIMSim Score
Gemini Score
GPT-4o Score
AGENTIC-DRS
Backbone=GPT-4o
2025.08
70.8
83.5
77.4
76.3
AGENTIC-DRS
Backbone=Gemini
2025.08
68.2
80.6
75.8
73.6
Feedback
Search any
task
Search any
task