Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

About

Context faithfulness is essential for reliable reasoning in context-dependent scenarios. However, large language models often struggle to ground their outputs in the provided context, resulting in irrelevant responses. Inspired by the emergent expert specialization observed in mixture-of-experts architectures, this work investigates whether certain experts exhibit specialization in context utilization, offering a potential pathway toward targeted optimization for improved context faithfulness. To explore this, we propose Router Lens, a method that accurately identifies context-faithful experts. Our analysis reveals that these experts progressively amplify attention to relevant contextual information, thereby enhancing context grounding. Building on this insight, we introduce Context-faithful Expert Fine-Tuning (CEFT), a lightweight optimization approach that selectively fine-tunes context-faithful experts. Experiments across a wide range of benchmarks and models demonstrate that CEFT matches or surpasses the performance of full fine-tuning while being significantly more efficient.

Jun Bai, Minghao Tong, Yang Liu, Zixia Jia, Zilong Zheng • 2025
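
The two-step idea in the abstract (identify context-faithful experts, then fine-tune only those experts) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes, purely for illustration, that experts are ranked by how often a router selects them on context-dependent inputs, and that parameters follow a hypothetical naming scheme `layers.<L>.experts.<E>.<rest>`. The function names `select_experts` and `trainable_names` are also hypothetical.

```python
from collections import Counter


def select_experts(routing_log, top_k):
    """Pick the top_k most frequently routed experts in each layer.

    routing_log maps a layer index to the list of expert ids the router
    chose, one entry per routed token (an assumed logging format).
    """
    selected = {}
    for layer, chosen in routing_log.items():
        counts = Counter(chosen)
        # Rank by descending count; break ties by ascending expert id.
        ranked = sorted(counts, key=lambda e: (-counts[e], e))
        selected[layer] = sorted(ranked[:top_k])
    return selected


def trainable_names(param_names, selected):
    """Return the parameter names that belong to a selected expert.

    Assumes the hypothetical naming scheme 'layers.<L>.experts.<E>.<rest>';
    every other parameter (routers, embeddings, ...) is left frozen.
    """
    keep = []
    for name in param_names:
        parts = name.split(".")
        if len(parts) >= 4 and parts[0] == "layers" and parts[2] == "experts":
            layer, expert = int(parts[1]), int(parts[3])
            if expert in selected.get(layer, []):
                keep.append(name)
    return keep


# Toy routing log: two layers, expert ids chosen for six tokens each.
routing_log = {0: [1, 1, 2, 3, 1, 2], 1: [0, 0, 3, 3, 3, 2]}
selected = select_experts(routing_log, top_k=2)

names = [
    "layers.0.experts.1.w1",
    "layers.0.experts.3.w1",
    "layers.0.router.weight",
    "layers.1.experts.0.w1",
    "embed.weight",
]
to_train = trainable_names(names, selected)
```

In a real training loop, the names in `to_train` would be the only parameters left unfrozen, which is where the efficiency gain over full fine-tuning would come from.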

Related benchmarks

Task	Dataset	Result
Object Hallucination Evaluation	POPE	Accuracy 87.1	2056
Diagram Understanding	AI2D	Accuracy 66.4	377
Multi-modal Evaluation	MME	MME Score 1.51e+3	240
Multimodal Understanding	MMMU (val)	--	211
Multimodal Benchmarking	MMBench	Accuracy 74.9	168
Multi-modal Reasoning	MMVet	Score 43.5	81
Image Understanding	SEED-IMG	Accuracy 71.8	17
