Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ASQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Long-form Question Answering with CitationsASQA
EM45.01
37
Question AnsweringASQA (test)
Correctness EM Recall40.05
29
Question AnsweringASQA
StrEM44.73
27
Attributed Text GenerationASQA
Correctness (EM Rec.)50.1
19
Long-form Question AnsweringASQA
str-em51.3
15
Question AnsweringASQA (in-domain)
EM47.21
12
CompletenessASQA
Kendall's Tau0.54
11
Retrieval-Augmented GenerationASQA
str-EM42.44
11
Sentence-level attributionASQA (test)
Citation Recall87.2
10
RAG-CompletenessASQA (test)
Kendall's Tau0.54
6
Long-form Question Answering refinementASQA (test)
Error Rate (%)16.63
5
Open-Domain Question AnsweringASQA (dev)
STR-EM37.22
4
Knowledge-grounded GenerationASQA ALCE (test)
Correctness31.8
4
Attributed Question AnsweringASQA ALCE (dev)
FSupp88.58
3
Showing 14 of 14 rows