
MixerCSeg: An Efficient Mixer Architecture for Crack Segmentation via Decoupled Mamba Attention

About

Feature encoders play a key role in pixel-level crack segmentation by shaping the representation of fine textures and thin structures. Existing CNN-, Transformer-, and Mamba-based models each capture only part of the required spatial or structural information, leaving clear gaps in modeling complex crack patterns. To address this, we present MixerCSeg, a mixer architecture designed like a coordinated team of specialists, where CNN-like pathways focus on local textures, Transformer-style paths capture global dependencies, and Mamba-inspired flows model sequential context within a single encoder. At the core of MixerCSeg is the TransMixer, which explores Mamba's latent attention behavior while establishing dedicated pathways that naturally express both locality and global awareness. To further enhance structural fidelity, we introduce a spatial block processing strategy and a Direction-guided Edge Gated Convolution (DEGConv) that strengthens edge sensitivity under irregular crack geometries with minimal computational overhead. A Spatial Refinement Multi-Level Fusion (SRF) module is then employed to refine multi-scale details without increasing complexity. Extensive experiments on multiple crack segmentation benchmarks show that MixerCSeg achieves state-of-the-art performance with only 2.05 GFLOPs and 2.54 M parameters, demonstrating both efficiency and strong representational capability. The code is available at https://github.com/spiderforest/MixerCSeg.
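The three-pathway design described above can be sketched in miniature. The toy below is an illustrative assumption, not the authors' implementation: on a 1-D signal, a CNN-like pathway applies a local 3-tap convolution, a Transformer-style pathway applies softmax self-attention over scalar tokens, and a Mamba-inspired pathway runs a linear state-space recurrence; the mixer fuses them by summation. All function names, the fusion rule, and the 1-D setting are hypothetical simplifications of the single-encoder mixer idea.

```python
import math

def local_pathway(x, kernel=(0.25, 0.5, 0.25)):
    """CNN-like: 3-tap convolution capturing local texture (edge-replicate padding)."""
    n = len(x)
    pad = [x[0]] + list(x) + [x[-1]]
    return [sum(k * pad[i + j] for j, k in enumerate(kernel)) for i in range(n)]

def global_pathway(x):
    """Transformer-style: softmax self-attention with scalar queries/keys/values."""
    out = []
    for q in x:
        scores = [q * k for k in x]            # dot-product attention scores
        m = max(scores)
        w = [math.exp(s - m) for s in scores]  # numerically stable softmax
        z = sum(w)
        out.append(sum(wi * v for wi, v in zip(w, x)) / z)
    return out

def sequential_pathway(x, decay=0.9):
    """Mamba-inspired: linear recurrence h_t = a * h_{t-1} + (1 - a) * x_t."""
    h, out = 0.0, []
    for v in x:
        h = decay * h + (1.0 - decay) * v
        out.append(h)
    return out

def mixer_block(x):
    """Fuse the local, global, and sequential pathways by summation."""
    return [l + g + s for l, g, s in
            zip(local_pathway(x), global_pathway(x), sequential_pathway(x))]

signal = [0.0, 0.0, 1.0, 1.0, 0.0, 0.0]  # a toy step-edge "crack" profile
mixed = mixer_block(signal)
print(len(mixed))  # one fused feature per input position
```

In the full model, each pathway would operate on 2-D feature maps and the fusion would be learned (e.g., the SRF module refining multi-scale detail); the sketch only shows how three complementary context types can coexist in one block.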

Zilong Zhao, Zhengming Ding, Pei Niu, Wenhao Sun, Feng Guo• 2026

Related benchmarks

Task                 Dataset              Result        Rank
Crack Segmentation   CRACK500 (test)      mIoU 78.24    20
Crack Segmentation   DeepCrack (test)     mIoU 91.51    16
Crack Segmentation   CamCrack789 (test)   mIoU 84.09    16
Crack Segmentation   CrackMap (test)      mIoU 81.23    16
