Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

QReCC

Benchmarks

Task NameDataset NameSOTA ResultTrend
Conversational RetrievalQReCC (test)
Recall@1075.8
43
Conversational Query RetrievalQReCC
R@1076.5
29
Conversational SearchQReCC (test)
MRR57.4
16
Answer GenerationQReCC
F1 Score31
16
Conversational response generationQReCC
F1 Score31
15
Conversational Information RetrievalQReCC (test)
R@1077.2
13
Conversational Question AnsweringQReCC (test)
EM (%)120
12
Conversational Response GenerationQReCC (test)
F1 Score26.3
10
Question RewritingQRECC Mean/Overall 1.0 (test)
BLEU64.7
9
Question RewritingQRECC Easy 1.0 (test)
BLEU82.79
9
Question RewritingQRECC Medium subset 1.0 (test)
BLEU Score63.17
9
Question RewritingQRECC Hard subset 1.0 (test)
BLEU0.4948
9
RetrievalQReCC
NDCG@339.6
8
Conversational RetrievalQReCC
Top-1 Recall53.37
7
Knowledge-intensive dialog attributionQReCC (dev)
Auto AIS (before)19.1
3
Conversational SearchQReCC large (test)
Recall@1063.7
2
Showing 16 of 16 rows