Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CWQ

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge Graph Question AnsweringCWQ
Hit@179.3
166
Knowledge Graph Question AnsweringCWQ (test)
Hits@176.9
100
Multi-Hop Knowledge Graph Question AnsweringCWQ
Hits@181.4
46
Knowledge Base Question AnsweringCWQ (test)
F1 Score81.3
42
Knowledge Base Question AnsweringCWQ Freebase (test)
Hits@186
38
Question AnsweringCWQ
Accuracy23.62
30
Discriminative EvaluationCWQ (test)
Binary Accuracy92.88
24
Knowledge Base Question AnsweringCWQ
Answer F151.74
18
Question AnsweringCWQ
Hits@172.5
17
Knowledge Base CompletionCWQ 50% KB
MRR61.4
16
Knowledge Base CompletionCWQ (30% KB)
MRR58.8
16
Knowledge Base Question AnsweringCWQ 50% KB
Hits@150.8
12
Knowledge Base Question AnsweringCWQ 30% KB
Hits@150.2
12
Multi-hop ReasoningCWQ
Hits@182.2
10
Knowledge Base Question AnsweringCWQ (hidden test)
Accuracy67.1
7
Complex Question AnsweringCWQ
Total Score14.85
4
Knowledge Base Question AnsweringCWQ w/o KB
Hits@146.4
3
Showing 17 of 17 rows