Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering on QALD en (WikiData) v10 (test)

83.73Hits@1

KG-Reasoner with Qwen-2.5-7B

Updated 3mo ago

Evaluation Results

Method	Links
KG-Reasoner with Qwen-2.5-7B 2026.04		83.73
GPT-4o-mini + KG 2026.04		72.86
LLaMA-3.3-70B + KG 2026.04		71.78
DeepSeek-R1-Distill-Llama-70B + KG 2026.04		66.1
Qwen-2.5-7B (SFT) + KG 2026.04		65.18
LLaMA-3.1-8B (SFT) + KG 2026.04		59.63
Qwen-2.5-7B + KG 2026.04		56.54
GPT-4o 2026.04		56.2
LLaMA-3.3-70B 2026.04		56
LLaMA-3.1-8B + KG 2026.04		55.73
ToG-2.0 (ICL)(GPT-3.5-turbo) 2026.04		54.1
ToG with GPT 4 2026.04		53.8
GPT-4o-mini 2026.04		51.98
ToG with GPT 3.5-Turbo 2026.04		50.2
DeepSeek-R1-Distill-Llama-70B 2026.04		43.1
Qwen-2.5-7B 2026.04		41.88
LLaMA-3.1-8B 2026.04		40.25