Knowledge Base Question Answering on T-REx WikiData (test)
[Chart: Hits@1. Top entry: KG-Hopper w/Qwen-2.5-7B, 72.14 Hits@1, Mar 22, 2026.]
Evaluation Results
Method                              Details                    Date     Hits@1
----------------------------------  -------------------------  -------  ------
KG-Hopper w/Qwen-2.5-7B             Size=7B, Strategy=KG-A...  2026.03  72.14
DeepSeek-R1-Distill-Llama-70B + KG  Size=70B, Strategy=KG-...  2026.03  72.04
LLaMA-3.1-8B (SFT) + KG             Size=8B, Strategy=KG-A...  2026.03  70.23
GPT-4o-mini + KG                    Strategy=KG-Augmented...   2026.03  69.7
Qwen-2.5-7B (SFT) + KG              Size=7B, Strategy=KG-A...  2026.03  68.77
LLaMA-3.1-8B + KG                   Size=8B, Strategy=KG-A...  2026.03  65.8
LLaMA-3.3-70B + KG                  Size=70B, Strategy=KG-...  2026.03  65.42
Qwen-2.5-7B + KG                    Size=7B, Strategy=KG-A...  2026.03  64.21
GPT-4o                              Strategy=LLM Prompting...  2026.03  44.46
DeepSeek-R1-Distill-Llama-70B       Size=70B, Strategy=LLM...  2026.03  34.21
Qwen-2.5-7B                         Size=7B, Strategy=LLM...   2026.03  31.15
GPT-4o-mini                         Strategy=LLM Prompting...  2026.03  26.9
LLaMA-3.1-8B                        Size=8B, Strategy=LLM...   2026.03  23
LLaMA-3.3-70B                       Size=70B, Strategy=LLM...  2026.03  20.12