Knowledge Base Question Answering on WebQuestion Freebase (test)
[Chart: Hits@1 Accuracy over time. Top result: GPT-4o-mini + KG, 81.02, Mar 22, 2026. Updated 25d ago.]
Evaluation Results
Method                               Details                    Date      Hits@1 Accuracy
GPT-4o-mini + KG                     Strategy=KG-Augmented...   2026.03   81.02
DeepSeek-R1-Distill-Llama-70B + KG   Size=70B, Strategy=KG-...  2026.03   75.8
LLaMA-3.3-70B + KG                   Size=70B, Strategy=KG-...  2026.03   72.6
DeepSeek-R1-Distill-Llama-70B        Size=70B, Strategy=LLM...  2026.03   68.84
KG-Hopper w/Qwen-2.5-7B              Size=7B, Strategy=KG-A...  2026.03   66.9
KG-CoT w/GPT 3.5-Turbo               Strategy=KG-Augmented...   2026.03   66.5
GPT-4o                               Strategy=LLM Prompting...  2026.03   64.79
Qwen-2.5-7B (SFT) + KG               Size=7B, Strategy=KG-A...  2026.03   61.42
LLaMA-3.1-8B (SFT) + KG              Size=8B, Strategy=KG-A...  2026.03   60
LLaMA-3.3-70B                        Size=70B, Strategy=LLM...  2026.03   59.73
GPT-4o-mini                          Strategy=LLM Prompting...  2026.03   57.26
LLaMA-3.1-8B + KG                    Size=8B, Strategy=KG-A...  2026.03   57.05
Qwen-2.5-7B + KG                     Size=7B, Strategy=KG-A...  2026.03   56.33
LLaMA-3.1-8B                         Size=8B, Strategy=LLM...   2026.03   45.88
Qwen-2.5-7B                          Size=7B, Strategy=LLM...   2026.03   44.23
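
For readers unfamiliar with the metric, Hits@1 is commonly defined as the percentage of questions whose top-ranked predicted answer appears in the gold answer set. The sketch below illustrates that common definition only; the function name `hits_at_1`, the lowercase-and-strip normalization, and the toy data are assumptions for illustration, not the evaluation script behind the scores listed here.

```python
from typing import Sequence, Set


def _normalize(answer: str) -> str:
    # Assumed normalization: case-insensitive match with surrounding
    # whitespace removed. Real evaluation scripts may differ.
    return answer.strip().lower()


def hits_at_1(
    predictions: Sequence[Sequence[str]],
    gold: Sequence[Set[str]],
) -> float:
    """Return the percentage of questions whose top-ranked prediction
    is contained in that question's gold answer set."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        return 0.0

    hits = 0
    for ranked, answers in zip(predictions, gold):
        normalized_gold = {_normalize(a) for a in answers}
        # Only the first (top-ranked) prediction counts for Hits@1.
        if ranked and _normalize(ranked[0]) in normalized_gold:
            hits += 1
    return 100.0 * hits / len(gold)


if __name__ == "__main__":
    # Hypothetical toy data: two questions, one answered correctly at rank 1.
    toy_predictions = [["Paris", "Lyon"], ["Berlin", "Vienna"]]
    toy_gold = [{"paris"}, {"Vienna"}]
    print(hits_at_1(toy_predictions, toy_gold))
```

On the toy data, the second question has the correct answer only at rank 2, so it does not count as a hit.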