Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Relational Hallucination Evaluation on Reefknot
Loading...
67.9
F1 Score
Teacher (LLaVA-4B)
59.996
62.048
64.1
66.152
Oct 14, 2025
F1 Score
Updated 9d ago
Evaluation Results
Method
Method
Links
F1 Score
Teacher (LLaVA-4B)
Backbone=LLaVA-4B, Rol...
2025.10
67.9
CompoDistill-2B
Method Variant=CompoDi...
2025.10
66.7
LLaVA-MoD-2B
Method Variant=LLaVA-M...
2025.10
63.4
Student (LLaVA-2B)
Backbone=LLaVA-2B, Rol...
2025.10
61.3
LLaVA-KD-2B
Method Variant=LLaVA-K...
2025.10
60.3
Feedback
Search any
task
Search any
task