Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-hop Fact Verification on HoVer few-shot
Loading...
56
Recall
LLaDA (Public)
53.192
53.921
54.65
55.379
Apr 4, 2026
Recall
F1 Score
Accuracy
Updated 11d ago
Evaluation Results
Method
Method
Links
Recall
F1 Score
Accuracy
LLaDA (Public)
Model configuration=Pu...
2026.04
56
26.8
43.5
LLaDA (FS)
Model configuration=Fu...
2026.04
55.6
27.9
55
LLaDA (FS+RO)
Model configuration=FS...
2026.04
54.7
21.1
60.4
LLaDA (RO)
Model configuration=Re...
2026.04
53.3
16.3
50.3
Feedback
Search any
task
Search any
task