Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ClusterRAG: Cluster-Based Collaborative Filtering for Personalized Retrieval-Augmented Generation

About

Personalized Retrieval-Augmented Generation (RAG) relies on accurately selecting user-relevant documents. In practice, existing RAG approaches often suffer from high retrieval costs and overlook that collaborative signals from similar users can enhance personalized generation for the current user. We propose ClusterRAG, a Cluster-Based Collaborative Filtering for Personalized Retrieval-Augmented Generation. ClusterRAG represents users through their profile documents, organizes users into semantically coherent clusters using density-based clustering, and performs retrieval at both the cluster and document levels via cluster-level similarity and fine-grained ranking. Extensive experiments on the LaMP benchmark demonstrate that jointly leveraging the target user's profile and profiles from top similar users consistently yields the best performance across diverse tasks. Further analysis shows that ClusterRAG integrates seamlessly with different dense retrievers and rankers, and remains effective when paired with both fine-tuned and zero-shot language models.

Gibson Nkhata, Uttamasha Anjally Oyshi, Quan Mai, Susan Gauch• 2026

Related benchmarks

TaskDatasetResultRank
Categorical classificationLaMP-2 (test)
Accuracy66.1
16
Personalized ClassificationLaMP 1 (test)
Accuracy69
7
Personalized GenerationLaMP-4 (test)
R-1 Score19
7
Personalized GenerationLaMP-5 (test)
R-1 Score49
7
Personalized GenerationLaMP-7 (test)
ROUGE-152.1
7
Personalized RegressionLaMP-3 (test)
MAE0.27
7
Showing 6 of 6 rows

Other info

Follow for update