Xetrieval: Mechanistically Explaining Dense Retrieval

About

Explaining why dense retrievers assign high relevance scores remains challenging because retrieval decisions are made through opaque high-dimensional embeddings. Existing explanations often focus on surface signals, such as lexical matches, token alignments, or post-hoc textual rationales, and thus provide limited insight into the latent factors that shape dense retrieval behavior at the embedding level. We propose \textit{Xetrieval}, an embedding-level mechanistic framework for explaining dense retrieval. \textit{Xetrieval} first introduces a lightweight reasoning internalizer that approximates Chain-of-Thought reasoning directly in the embedding space with a single forward pass, enriching sentence embeddings with reasoning-oriented information while avoiding expensive autoregressive generation. It then decomposes these reasoning-enhanced embeddings into sparse, human-interpretable features, each associated with a coherent natural language description. By aggregating sparse feature overlaps across multiple document-side views, \textit{Xetrieval} provides feature-level explanations of individual retrieval decisions. Experiments on diverse retrievers and benchmarks show that \textit{Xetrieval} uncovers coherent interpretable features, yields stronger pair-level intervention effects, and supports task-level feature steering. The project page and source code are available at https://hihiczx.github.io/Xetrieval .

Zhixin Cai, Jun Bai, Yang Liu, Jiaqi Li, Yichi Zhang, Taichuan Li, Zhuofan Chen, Zixia Jia, Zilong Zheng, Wenge Rong• 2026

Related benchmarks

Task	Dataset	Result
Information Retrieval	BRIGHT	Mean nDCG@1054.8	94
Information Retrieval	Robust04	--	72
Information Retrieval	ArguAna	nDCG@1050.7	31
Information Retrieval	TREC DL	NDCG@1093.4	25
Information Retrieval	MuTual	NDCG@1047.1	12
Information Retrieval	Signal1m	NDCG@1074.6	12

Showing 6 of 6 rows

Other info

GitHub

Follow for update

@wizwand_team Discord