
HotPot

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Abstention | Hotpot (test) | AUARC 60.9 | 25 |
| Question Answering | ADVHOTPOT | Accuracy 82.4 | 12 |
| Selective Question Answering | HOTPOT | Area under Coverage-F1 92.5 | 12 |
| Retrieval Question Answering | HotPot | MRR 47.7 | 6 |
| Information Retrieval | Hotpot BEIR | nDCG 0.687 | 5 |
| Retrieval Question Answering | HotPot (in-domain) | MRR 63.8 | 4 |