Automatic crack segmentation model based on multi-branch aggregation transformer: Fid-Bau Portal

Fachinformationsdienst BAUdigital

A platform for research: civil engineering, architecture and urbanism

Automatic crack segmentation model based on multi-branch aggregation transformer

Wang, Jin / Zeng, Zhigao / Wang, Jianxin / Zhang, Jianming / Zhou, Siyuan

Crack detection plays a crucial role in evaluating the safety and durability of civil infrastructure. However, detecting cracks of uneven intensity in complex backgrounds is challenging. To overcome this problem, we propose a dual decoder network (CSMT) based on a multi-branch aggregation Transformer, which uses residual atrous spatial pyramid pooling (RASPP) and Transformer dual decoding branches to extract local and global features of different structures. To enhance global feature extraction, we designed a multi-branch aggregation Transformer (MAT) that adaptively weights the features of two attention heads from spatial and channel dimensions to achieve intra block feature aggregation between dimensions. Meanwhile, to obtain multi-scale semantic information, we constructed a new decoding branch, RASPP, which embeds a squeeze-and-excitation (SE) module and residual structures into standard ASPP. Finally, we propose a feature adaptive fusion module (FAM) to enhance feature fusion between adjacent layers and codec layers. Many experiments on three benchmark datasets have shown that the proposed CSMT segmentation network provides excellent performance in a variety of complex scenarios.

Access

Check availability in my library

Order at Subito €

Page navigation

Document information

Export, share and cite

Automatic crack segmentation model based on multi-branch aggregation transformer

Wang, Jin / Zeng, Zhigao / Wang, Jianxin / Zhang, Jianming / Zhou, Siyuan

Crack detection plays a crucial role in evaluating the safety and durability of civil infrastructure. However, detecting cracks of uneven intensity in complex backgrounds is challenging. To overcome this problem, we propose a dual decoder network (CSMT) based on a multi-branch aggregation Transformer, which uses residual atrous spatial pyramid pooling (RASPP) and Transformer dual decoding branches to extract local and global features of different structures. To enhance global feature extraction, we designed a multi-branch aggregation Transformer (MAT) that adaptively weights the features of two attention heads from spatial and channel dimensions to achieve intra block feature aggregation between dimensions. Meanwhile, to obtain multi-scale semantic information, we constructed a new decoding branch, RASPP, which embeds a squeeze-and-excitation (SE) module and residual structures into standard ASPP. Finally, we propose a feature adaptive fusion module (FAM) to enhance feature fusion between adjacent layers and codec layers. Many experiments on three benchmark datasets have shown that the proposed CSMT segmentation network provides excellent performance in a variety of complex scenarios.

Automatic crack segmentation model based on multi-branch aggregation transformer

Wang, Jin / Zeng, Zhigao / Wang, Jianxin / Zhang, Jianming / Zhou, Siyuan

Crack detection plays a crucial role in evaluating the safety and durability of civil infrastructure. However, detecting cracks of uneven intensity in complex backgrounds is challenging. To overcome this problem, we propose a dual decoder network (CSMT) based on a multi-branch aggregation Transformer, which uses residual atrous spatial pyramid pooling (RASPP) and Transformer dual decoding branches to extract local and global features of different structures. To enhance global feature extraction, we designed a multi-branch aggregation Transformer (MAT) that adaptively weights the features of two attention heads from spatial and channel dimensions to achieve intra block feature aggregation between dimensions. Meanwhile, to obtain multi-scale semantic information, we constructed a new decoding branch, RASPP, which embeds a squeeze-and-excitation (SE) module and residual structures into standard ASPP. Finally, we propose a feature adaptive fusion module (FAM) to enhance feature fusion between adjacent layers and codec layers. Many experiments on three benchmark datasets have shown that the proposed CSMT segmentation network provides excellent performance in a variety of complex scenarios.

Access

Check availability in my library

Order at Subito €

Page navigation

Document information

Export, share and cite

Document information

Title:

Automatic crack segmentation model based on multi-branch aggregation transformer

Contributors:

Wang, Jin (author) / Zeng, Zhigao (author) / Wang, Jianxin (author) / Zhang, Jianming (author) / Zhou, Siyuan (author)

Published in:

Advances in Structural Engineering ; 27 ; 2289-2302

Publication date:

2024-10-01

ISSN:

1369-4332 , 2048-4011

DOI:

https://doi.org/10.1177/13694332241266538

Type of media:

Article (Journal)

Type of material:

Electronic Resource

Language:

English

Keywords:

crack segmentation , feature adaptation module , squeeze-and-excitation , multi-branch aggregation transformer , atrous spatial pyramid pooling

Similar titles

Crack Detection, Classification, and Segmentation on Road Pavement Material Using Multi-Scale Feature Aggregation and Transformer-Based Attention Mechanisms

Arselan Ashraf / Ali Sophian / Ali Aryo Bawono | DOAJ | 2024

Free access

Vison Transformer-Based Automatic Crack Detection on Dam Surface

Jian Zhou / Guochuan Zhao / Yonglong Li | DOAJ | 2024

Free access

CGV-Net: Tunnel Lining Crack Segmentation Method Based on Graph Convolution Guided Transformer

Kai Liu / Tao Ren / Zhangli Lan et al. | DOAJ | 2025

Free access

Asphalt Pavement Crack Image Screening by Transformer-Based Model

Ziwei, Wang / Jun, Feng / Tian, Zhang | IEEE | 2022

Asphalt Pavement Crack Identification based on Two-Stage Training and Multi-Branch Model Integrating Multiple Attention

Wang, Yue / Mo, Jiadi / Yu, Zhi et al. | IEEE | 2022