EAAI Journal 2026 Journal Article
A vision mamba-enhanced network with frequency-directional feature fusion for pavement crack segmentation
- Xi Li
- Yuqi Wang
- Qiang Zhou
- Jianhui Zhan
- Deng Zuo
- Weichao Chen
Driven by the rapid advancement of intelligent transportation and infrastructure digitalization, pavement crack detection has emerged as a research hotspot bridging civil engineering and computer vision. It plays a vital role in improving road safety and optimizing maintenance operations. However, the irregular morphology and susceptibility to background interference pose significant challenges to achieving accurate and robust automatic detection. To effectively address these issues, a Vision Mamba-Enhanced Network with Frequency-Directional Feature Fusion, VMFDF-Net, was proposed in this work for attaining efficient crack segmentation. The network integrates global modelling and local feature extraction capabilities, combining a Vision Mamba-inspired mechanism with convolutional neural networks (CNNs). Multi-frequency, multi-directional feature modelling, and an adaptive gated fusion module were introduced to enhance crack representation. The proposed Vision Mamba Convolutional (VMambaConv) module integrates local texture enhancement with long-range dependency modelling to improve crack feature extraction. In the bottleneck layer, a Wavelet and Directional-aware feature Cascade (WDcascade) module adaptively fuses multi-source features through a gated mechanism, enhancing the representation of diverse crack patterns in complex backgrounds. The effectiveness of the developed model was evaluated on four publicly available crack datasets: CRACK500, DeepCrack, CFD, and EdmCrack600. Six evaluation metrics were used for fair and objective assessment. The experimental results demonstrated that the model can generally achieve superior performance. Our code is open source on GitHub: https: //github. com/Acruelsummer/VMFDF-Net/tree/main.