Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CREAM: Continual Retrieval on Dynamic Streaming Corpora with Adaptive Soft Memory

About

Information retrieval (IR) in dynamic data streams is a crucial task, as shifts in data distribution degrade the performance of AI-powered IR systems. To mitigate this issue, memory-based continual learning has been widely adopted for IR. However, existing methods rely on a fixed set of queries with ground-truth documents, which limits generalization to unseen data, making them impractical for real-world applications. To enable more effective learning with unseen topics of a new corpus without ground-truth labels, we propose CREAM, a self-supervised framework for memory-based continual retrieval. CREAM captures the evolving semantics of streaming queries and documents into dynamically structured soft memory and leverages it to adapt to both seen and unseen topics in an unsupervised setting. We realize this through three key techniques: fine-grained similarity estimation, regularized cluster prototyping, and stratified coreset sampling. Experiments on two benchmark datasets demonstrate that CREAM exhibits superior adaptability and retrieval accuracy, outperforming the strongest method in a label-free setting by 27.79% in Success@5 and 44.5% in Recall@10 on average, and achieving performance comparable to or even exceeding that of supervised methods.

HuiJeong Son, Hyeongu Kang, Sunho Kim, Subeen Ho, SeongKu Kang, Dongha Lee, Susik Yoon• 2026

Related benchmarks

TaskDatasetResultRank
Continual RetrievalLoTTE
Success@50.682
14
Continual RetrievalMSMARCO streaming topic-clustered (Session 3)
Success@568.9
14
Continual RetrievalMSMARCO streaming topic-clustered (Session 8)
Success@592.6
14
RetrievalLoTTE Session 0
Success@537.2
14
RetrievalLoTTE Session 1
Success@547.4
14
RetrievalLoTTE Session 2
Success@547.8
14
RetrievalLoTTE Session 3
Success@50.578
14
RetrievalLoTTE Session 4
Success@558.5
14
RetrievalLoTTE Session 8
Success@566.7
14
RetrievalLoTTE Average across Sessions 0-9
Success@548.6
14
Showing 10 of 24 rows

Other info

Follow for update