Hallucination Detection on MME (val)
[Chart: AUPRC by date; highlighted point: Lyapunov Probes, 97.57 AUPRC, Mar 6, 2026]
Updated 2mo ago
Evaluation Results
Method            Model         Date      AUPRC
Lyapunov Probes   Qwen-2.5-VL   2026.03   97.57
Probe             Qwen-2.5-VL   2026.03   96.32
Lyapunov Probes   LLaVA-1.5     2026.03   95.18
Probe             LLaVA-1.5     2026.03   93.61