Feedback Evaluation on DRS-BENCH Afixa
[Figure: scores over time. Data point shown: AGENTIC-DRS, Human Score 73.6, Aug 14, 2025. Legend: Human Score, AIMSim Score, AIMGemini Score, AIMGPT-4o Score.]
Updated 1mo ago
Evaluation Results
| Method | Backbone | Date | Human Score | AIMSim Score | AIMGemini Score | AIMGPT-4o Score |
|---|---|---|---|---|---|---|
| AGENTIC-DRS | GPT-4o | 2025.08 | 73.6 | 81.7 | 79.4 | 80.3 |
| AGENTIC-DRS | Gemini | 2025.08 | 72 | 82.1 | 76.9 | 76.2 |