Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Depth Evaluation on Ancient Rome
Loading...
5.36
EVD Score
Model B
4.0184
4.3667
4.715
5.0633
Mar 5, 2026
EVD Score
Max D
Accuracy
Common Accuracy
Text Accuracy
Proficiency Score
Updated 2mo ago
Evaluation Results
Method
Method
Links
EVD Score
Max D
Accuracy
Common Accuracy
Text Accuracy
Proficiency Score
Model B
Model ID=B
2026.03
5.36
8
84
100
68
40
Model D
Model ID=D
2026.03
5.24
8
82
99
67
47
Model A
Model ID=A
2026.03
4.96
7
85
100
67
47
Model C
Model ID=C
2026.03
4.79
8
81
97
68
38
Model E
Model ID=E
2026.03
4.07
7
80
97
66
30
Feedback
Search any
task
Search any
task