Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Explanation Quality Evaluation on DFD 100 randomly selected samples (test)
Loading...
7.55
GPT-4o Score
VRAG-DFD
4.482
5.2785
6.075
6.8715
Apr 15, 2026
GPT-4o Score
Gemini 2.5 Pro Score
Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
GPT-4o Score
Gemini 2.5 Pro Score
Average Score
VRAG-DFD
2026.04
7.55
7.78
7.66
Gemini 2.5 Pro
2026.04
7.31
7.02
7.16
GPT-4o
2026.04
4.6
3.25
3.93
Feedback
Search any
task
Search any
task