Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning

About

Processing long contexts is increasingly important for Large Language Models (LLMs) in tasks like multi-turn dialogues, code generation, and document summarization. This paper addresses the challenges of achieving high long-context performance, low computational complexity, and compatibility with pretrained models -- collectively termed the ``impossible triangle''. We introduce E2LLM (Encoder Elongated Large Language Models), a novel approach that effectively navigates this paradox. E2LLM divides long contexts into chunks, compresses each into soft prompts using a pretrained text encoder, and aligns these representations with a decoder-only LLM via an adapter. To enhance the LLM's reasoning with these soft prompts, we employ two training objectives: encoder output reconstruction and long-context instruction fine-tuning. Extensive experiments reveal that E2LLM not only outperforms 8 state-of-the-art (SOTA) methods in effectiveness and efficiency for document summarization and question answering, but also achieves the best performance on LongBench v2 among models of comparable size.

Zihan Liao, Jun Wang, Hang Yu, Lingxiao Wei, Jianguo Li, Jun Wang, Wei Zhang• 2024

Related benchmarks

TaskDatasetResultRank
Long-context Language UnderstandingRULER 32k context length
VT Score8.6
33
Long-context Language UnderstandingRULER 16k context length--
16
Multiple-choice Question AnsweringLongBench v2 (val)
Overall Accuracy31.8
15
Long-context Language UnderstandingRULER 4k context length
VT Score7.2
10
Document SummarizationQMSum
G-mean15.47
9
Document SummarizationGovReport
G-mean18.43
9
Long-context Language UnderstandingRULER 64k context length
QA Score34.5
9
Question AnsweringTriviaQA
F1 Score38.57
8
Long-context UnderstandingRULER 8k context
NIAH60.66
7
Long-context UnderstandingRULER 128K context
NIAH48.38
5
Showing 10 of 11 rows

Other info

Follow for update