Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CompAct: Compressing Retrieved Documents Actively for Question Answering

About

Retrieval-augmented generation supports language models to strengthen their factual groundings by providing external contexts. However, language models often face challenges when given extensive information, diminishing their effectiveness in solving questions. Context compression tackles this issue by filtering out irrelevant information, but current methods still struggle in realistic scenarios where crucial information cannot be captured with a single-step approach. To overcome this limitation, we introduce CompAct, a novel framework that employs an active strategy to condense extensive documents without losing key information. Our experiments demonstrate that CompAct brings significant improvements in both performance and compression rate on multi-hop question-answering benchmarks. CompAct flexibly operates as a cost-efficient plug-in module with various off-the-shelf retrievers or readers, achieving exceptionally high compression rates (47x).

Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang• 2024

Related benchmarks

TaskDatasetResultRank
Multi-hop Question AnsweringHotpotQA
F1 Score42.42
221
Question AnsweringHotpotQA
F147.5
114
Question Answering2WikiMultihopQA
EM32.2
73
Question AnsweringNQ (test)
EM Accuracy35.71
66
Question AnsweringTriviaQA (TQA)
EM67.7
56
End-to-end Open-Domain Question AnsweringNQ (test)
Exact Match (EM)38.67
50
Open-domain Question AnsweringWQ (test)
EM33.07
37
Question AnsweringAVG
EM43.7
28
Question AnsweringHQA
EM0.338
28
Question AnsweringNQ (Natural Questions)
EM37.9
28
Showing 10 of 25 rows

Other info

Follow for update