
AMBIGQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Question Answering | AmbigQA | Cover EM: 60 | 18 |
| Question Answering | AmbigQA | EM: 61.3 | 11 |
| Disambiguation and completeness | AmbigQA | Personalization Bias: 0.113 | 9 |
| Question Answering with Clarification | AmbigQA Unambiguous queries (dev) | Reward: 42.05 | 8 |
| Question Answering with Clarification | AmbigQA Ambiguous queries (dev) | Reward: 15.81 | 8 |
| Question Answering | AmbigQA | Accuracy: 59.8 | 7 |
| Open-Domain QA | AmbigQA Nq=300 | Acc: 0.473 | 6 |
| Question Answering | AmbigQA | Helpfulness: 4.96 | 5 |
| Question Answering | AmbigQA (sampled) | Accuracy: 65.5 | 4 |
| Multi-answer Question Answering | AMBIGQA (test) | F1 (All Questions): 46.2 | 3 |
| Multi-answer Question Answering | AMBIGQA (dev) | F1 (all questions): 52.1 | 3 |