Knowledge-intensive reasoning on Generalization Verification
Best result: 99.18 Hits@1 (KDCM + Code Module)
[Chart: Hits@1, Hits@3, and Hits@5 over time; data point at Jan 6, 2026]
Evaluation Results
Method               Date      Hits@1   Hits@3   Hits@5
KDCM + Code Module   2026.01   99.18    97.64    95.12
LLM-SubKG-Sum        2026.01   92.36    90.11    86.25
RAG                  2026.01   90.36    90.25    91.09
Self-Check           2026.01   90.28    91.41    91.26
KG-LLM-PR            2026.01   90.26    88.52    86.47
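
For readers unfamiliar with the metric: Hits@k is conventionally the percentage of queries whose correct answer appears among a method's top k ranked candidates. The sketch below illustrates that conventional definition only. The function name `hits_at_k` and the example ranks are hypothetical, and this page does not state the exact evaluation protocol behind the scores listed here, so the benchmark's own computation may differ.

```python
from typing import Sequence


def hits_at_k(ranks: Sequence[int], k: int) -> float:
    """Return the percentage of queries whose gold answer is ranked in the top k.

    `ranks` holds the 1-indexed rank of the correct answer for each query.
    """
    if not ranks:
        raise ValueError("ranks must be non-empty")
    if k < 1:
        raise ValueError("k must be at least 1")
    hits = sum(1 for r in ranks if r <= k)
    return 100.0 * hits / len(ranks)


# Hypothetical ranks for five queries (illustrative only, not benchmark data).
example_ranks = [1, 2, 1, 4, 7]

for k in (1, 3, 5):
    print(f"Hits@{k}: {hits_at_k(example_ranks, k):.2f}")
```

With these made-up ranks the script reports 40.00, 60.00, and 80.00 for k = 1, 3, and 5.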