Hallucination Evaluation on HaluEval (Avg. Metric)
[Chart: Average Score. Highlighted entry: Seed 2.0 Pro, 23.5, Apr 3, 2026. Updated 13d ago.]
Evaluation Results

Method             Latency (s)   Date      Average Score
Seed 2.0 Pro       3.8           2026.04   23.5
DeepSeek V3.2      5.6           2026.04   22.2
Gemini 3.1 Pro     2.8           2026.04   20.1
GPT-5.4            3.2           2026.04   19.1
Claude Opus 4.6    4.1           2026.04   16.7
Council Mode       8.4           2026.04   10.7