Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MambaLRP: Explaining Selective State Space Sequence Models

About

Recent sequence modeling approaches using selective state space sequence models, referred to as Mamba models, have seen a surge of interest. These models allow efficient processing of long sequences in linear time and are rapidly being adopted in a wide range of applications such as language modeling, demonstrating promising performance. To foster their reliable use in real-world scenarios, it is crucial to augment their transparency. Our work bridges this critical gap by bringing explainability, particularly Layer-wise Relevance Propagation (LRP), to the Mamba architecture. Guided by the axiom of relevance conservation, we identify specific components in the Mamba architecture, which cause unfaithful explanations. To remedy this issue, we propose MambaLRP, a novel algorithm within the LRP framework, which ensures a more stable and reliable relevance propagation through these components. Our proposed method is theoretically sound and excels in achieving state-of-the-art explanation performance across a diverse range of models and datasets. Moreover, MambaLRP facilitates a deeper inspection of Mamba architectures, uncovering various biases and evaluating their significance. It also enables the analysis of previous speculations regarding the long-range capabilities of Mamba models.

Farnoush Rezaei Jafari, Gr\'egoire Montavon, Klaus-Robert M\"uller, Oliver Eberle• 2024

Related benchmarks

TaskDatasetResultRank
Word AlignmentRWTH Gold Alignment de-en (test)
AER0.47
31
Explanation FaithfulnessMed-BIOS
Delta AF7.083
24
Explanation FaithfulnessSNLI
Delta AF0.989
24
Explanation FaithfulnessEmotion
Delta AF Score5.637
24
Explanation FaithfulnessSST-2
Delta AF1.157
24
Token AlignmentIWSLT DE→EN 2017 (test)
AER0.65
22
Token AlignmentIWSLT Fr-En 2017 (test)
AER65
22
CopyingCopying task
AUC40
11
Explanation FaithfulnessImageNet
Delta AF4.715
8
Showing 9 of 9 rows

Other info

Code

Follow for update