Knowledge Base Question Answering on WebQuestion Freebase (test)

81.02 Hits@1 Accuracy

GPT-4o-mini + KG

Updated 4mo ago

Evaluation Results

Method	Date	Hits@1 Accuracy
GPT-4o-mini + KG	2026.03	81.02
DeepSeek-R1-Distill-Llama-70B + KG	2026.03	75.8
LLaMA-3.3-70B + KG	2026.03	72.6
DeepSeek-R1-Distill-Llama-70B	2026.03	68.84
KG-Hopper w/Qwen-2.5-7B	2026.03	66.9
KG-CoT w/GPT 3.5-Turbo	2026.03	66.5
GPT-4o	2026.03	64.79
Qwen-2.5-7B (SFT) + KG	2026.03	61.42
LLaMA-3.1-8B (SFT) + KG	2026.03	60
LLaMA-3.3-70B	2026.03	59.73
GPT-4o-mini	2026.03	57.26
LLaMA-3.1-8B + KG	2026.03	57.05
Qwen-2.5-7B + KG	2026.03	56.33
LLaMA-3.1-8B	2026.03	45.88
Qwen-2.5-7B	2026.03	44.23