Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ALCE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Attributed Question AnsweringALCE
STRREC Score41.023
24
Long-form Ambiguous Question AnsweringALCE ASQA
str-em47.51
17
AttributionALCE Average
Avg. F153.2
15
RelevanceALCE
Kendall's Tau0.61
15
Citation-aware Question AnsweringALCE ASQA
EM Recall43.1
13
Citation-aware Question AnsweringALCE ELI5
EM Recall21.4
12
CompletenessALCE
Kendall's Tau0.47
11
Citation-augmented GenerationALCE (test)
Support74.8
9
Long-form Question AnsweringALCE LFQA
ROUGE-L38.6
7
RAG-CompletenessALCE (test)
Mean Kendall's Tau0.47
6
Showing 10 of 10 rows