Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

InfoMamba: An Attention-Free Hybrid Mamba-Transformer Model

About

Balancing fine-grained local modeling with long-range dependency capture under computational constraints remains a central challenge in sequence modeling. While Transformers provide strong token mixing, they suffer from quadratic complexity, whereas Mamba-style selective state-space models (SSMs) scale linearly but often struggle to capture high-rank and synchronous global interactions. We present a consistency boundary analysis that characterizes when diagonal short-memory SSMs can approximate causal attention and identifies structural gaps that remain. Motivated by this analysis, we propose InfoMamba, an attention-free hybrid architecture. InfoMamba replaces token-level self-attention with a concept bottleneck linear filtering layer that serves as a minimal-bandwidth global interface and integrates it with a selective recurrent stream through information-maximizing fusion (IMF). IMF dynamically injects global context into the SSM dynamics and encourages complementary information usage through a mutual-information-inspired objective. Extensive experiments on classification, dense prediction, and non-vision tasks show that InfoMamba consistently outperforms strong Transformer and SSM baselines, achieving competitive accuracy-efficiency trade-offs while maintaining near-linear scaling.

Youjin Wang, Jiaqiao Zhao, Rong Fu, Run Zhou, Ruizhe Zhang, Jiani Liang, Suisuai Cao, Feng Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Automatic Speech RecognitionLibriSpeech clean (test)
WER1.1
1156
Automatic Speech RecognitionLibriSpeech (test-other)
WER4.1
1151
Image ClassificationFood-101--
542
Object DetectionMS-COCO
AP55.3
120
Instance SegmentationMS-COCO
mAP Mask47.9
60
SegmentationADE20K
mIoU53
59
SegmentationCityscapes
mIoU84.3
37
Sentiment AnalysisIMDB
Accuracy85.1
13
Natural Language UnderstandingAGNews
Accuracy89.1
9
Image ClassificationFood-11
Top-1 Accuracy91
5
Showing 10 of 10 rows

Other info

Follow for update