Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MultiDocQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-document Question AnsweringMultidocQA
HotpotQA Accuracy67.78
14
Long-form Answer GenerationMultiDocQA
Spearman Correlation0.392
8
Showing 2 of 2 rows