Long Context Modeling with Ranked Memory-Augmented Retrieval
About
Effective long-term memory management is crucial for language models handling extended contexts. We introduce the Enhanced Ranked Memory Augmented Retrieval (ERMAR) framework, which dynamically ranks memory entries based on relevance. Unlike prior models, ERMAR employs a novel relevance scoring mechanism and a pointwise re-ranking model for key-value embeddings, inspired by learning-to-rank techniques in information retrieval. By integrating historical usage patterns and adaptive retrieval, ERMAR achieves state-of-the-art results on standard benchmarks, demonstrating superior scalability and performance in long-context tasks.
Ghadir Alselwi, Hao Xue, Shoaib Jameel, Basem Suleiman, Flora D. Salim, Imran Razzak• 2025
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Language Modeling | WikiText-103 (test) | Perplexity7.88 | 703 | |
| Language Modeling | PG-19 | Perplexity9.75 | 206 | |
| Language Modeling | PG-19 (test) | Perplexity9.765 | 112 | |
| Language Modeling | Proof-pile | Perplexity2.98 | 92 | |
| Language Modeling | WikiText-103 | Perplexity (PPL)7.61 | 43 | |
| Text Classification | NLU Tasks (SST-2, MR, Subj, SST-5, MPQA) | SST-2 Accuracy94.7 | 13 |
Showing 6 of 6 rows