Dual-path network combining CNN and transformer for pavement crack segmentation
Abstract Cracks are among the most common forms of pavement surface distress, and timely repair is essential to prevent a substantial reduction in the pavement’s service life. Crack segmentation remains difficult, however, because cracks are thin and shallow, backgrounds are cluttered, and foreground distractors are present. To address these challenges, a dual-path network for pavement crack segmentation is introduced that combines a Convolutional Neural Network (CNN) with a transformer. First, a lightweight CNN encoder extracts local features, while a novel transformer encoder, which integrates a fully convolutional high-low frequency attention (FCHiLo) mechanism and an efficient feedforward network, extracts global features. Second, a complementary fusion module (CFM) aggregates the intermediate features extracted by both encoders. The multi-scale fusion outputs are passed to the decoder, which gradually recovers the image and produces the segmentation result. Evaluation on three publicly available datasets (DeepCrack, CrackForest, and CrackTree 260) shows that the proposed network outperforms ten established models.
Highlights
Proposed a dual-path network for pavement crack segmentation.
Proposed a lightweight CNN encoder and a novel transformer encoder.
Proposed a complementary fusion module for pavement crack segmentation.
Conducted experiments to verify our performance on three public datasets.
Performed ablation experiments to validate the effectiveness of each component.
Wang, Jin (author) / Zeng, Zhigao (author) / Sharma, Pradip Kumar (author) / Alfarraj, Osama (author) / Tolba, Amr (author) / Zhang, Jianming (author) / Wang, Lei (author)
2023-11-22
Article (Journal)
Electronic Resource
English