Long Context Modeling with Ranked Memory-Augmented Retrieval

About

Effective long-term memory management is crucial for language models handling extended contexts. We introduce the Enhanced Ranked Memory Augmented Retrieval (ERMAR) framework, which dynamically ranks memory entries based on relevance. Unlike prior models, ERMAR employs a novel relevance scoring mechanism and a pointwise re-ranking model for key-value embeddings, inspired by learning-to-rank techniques in information retrieval. By integrating historical usage patterns and adaptive retrieval, ERMAR achieves state-of-the-art results on standard benchmarks, demonstrating superior scalability and performance in long-context tasks.

Ghadir Alselwi, Hao Xue, Shoaib Jameel, Basem Suleiman, Flora D. Salim, Imran Razzak• 2025

Related benchmarks

Task	Dataset	Result
Language Modeling	WikiText-103 (test)	Perplexity7.88	773
Language Modeling	PG-19	Perplexity9.75	244
Language Modeling	Proof-pile	Perplexity2.98	130
Language Modeling	PG-19 (test)	Perplexity9.765	112
Language Modeling	WikiText-103	Perplexity (PPL)7.61	43
Text Classification	NLU Tasks (SST-2, MR, Subj, SST-5, MPQA)	SST-2 Accuracy94.7	29

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord