Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hallucination Detection on RSHalluEval 1.0 (test)
Loading...
1.2396
ES_IA
Qwen2-VL
0.811224
0.922437
1.03365
1.144863
Feb 11, 2026
ES_IA
ES_IS
ES_OE
ES_OA
ES_OR
ES_all
Updated 4d ago
Evaluation Results
Method
Method
Links
ES_IA
ES_IS
ES_OE
ES_OA
ES_OR
ES_all
Qwen2-VL
Mode=Fine-tuned, Refer...
2026.02
1.2396
0.2653
0.2766
0.4557
0.1833
0.2844
mPLUG-Owl3
Mode=Fine-tuned, Refer...
2026.02
0.8277
0.3084
0.3303
0.6019
0.2053
0.3598
Feedback
Search any
task
Search any
task