Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Hallucination prediction on ParaRel original
Loading...
82.29
Accuracy
Probe_dnn
65.0884
69.5542
74.02
78.4858
Sep 18, 2025
Accuracy
A(phi(sM))
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
A(phi(sM))
Probe_dnn
Model Size=70B
2025.09
82.29
5.61
Conf + Probe
Model Size=70B
2025.09
82.18
5.5
Conf + Probe (SCAO)
Model Size=70B
2025.09
81.84
5.16
Conf + Probe (SCAO)
Model Size=8B
2025.09
80.92
7.66
Conf + Probe
Model Size=8B
2025.09
80.64
7.38
Probe_dnn
Model Size=8B
2025.09
80.52
7.26
Conf
Model Size=8B
2025.09
67.58
-
Conf (SCAO)
Model Size=70B
2025.09
66.83
-
Conf (SCAO)
Model Size=8B
2025.09
66.67
-
Conf
Model Size=70B
2025.09
65.75
-
Feedback
Search any
task
Search any
task