Generalization Knowledge Base Question Answering on GrailQA
[Chart: Hit@1 over time, Jun 2, 2025 to Apr 14, 2026. Top result: LMP, 89.3 Hit@1.]
Evaluation Results
| Method | Details | Date | Hit@1 |
|---|---|---|---|
| LMP | LLM=GPT-4, Size=- | 2026.04 | 89.3 |
| KG-Agent | LLM=LLaMA3, Size=7B | 2026.04 | 86.1 |
| ToG | | 2025.06 | 81.4 |
| ToG | LLM=GPT-4, Size=- | 2026.04 | 81.4 |
| KG-Reasoner | LLM=Qwen3, Size=30B | 2026.04 | 76.86 |
| iQUEST | Backbone=GPT-4o | 2025.06 | 73.52 |
| iQUEST | LLM=GPT-4o, Size=- | 2026.04 | 73.5 |
| KBQA-o1 | LLM=LLaMA3, Size=70B | 2026.04 | 72.9 |
| FlexKBQA | | 2025.06 | 68.9 |