[1]PANG Rong,YANG Yan,LENG Xiongjin,et al.Segmentation model of pavement diseases based on semantic priori of double-branched point flow[J].CAAI Transactions on Intelligent Systems,2024,19(1):153-164.[doi:10.11992/tis.202306037]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 1
Page number:
153-164
Column:
学术论文—智能系统
Public date:
2024-01-05
- Title:
-
Segmentation model of pavement diseases based on semantic priori of double-branched point flow
- Author(s):
-
PANG Rong1; 2; 3; 4; YANG Yan1; 2; LENG Xiongjin1; 2; ZHANG Peng3; 4; LIU Yan1; 2
-
1. School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu 611756, China;
2. Engineering Research Center of Sustainable Urban Intelligent Transportation, Ministry of Education, Chengdu 611756, China;
3. China Merchants Chongqing Road Engineering Inspection Center Co., Ltd, Chongqing 400067, China;
4. Mountain Highway Engineering Technology Research Center, Chongqing 400067, China
-
- Keywords:
-
semantic priori information; efficient attention mechanism; cross-covariance image transformers attention mechanism; sparse subject sampling point flow; category imbalance; semantic segmentation; pavement diseases; deep learning
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202306037
- Abstract:
-
At present, the main problems faced by real road disease image recognition algorithms based on deep learning include serious imbalance in categories caused by different proportions of complex road background and foreground of diseases, and small disease scales. What’s more, the inconspicuous contrast between pavement diseases and the geometric structure characteristics of roads leads to their difficulty in recognition. To address the above issues, we propose a semantic prior two-branch network to guide Transformer’s backbone feature network in mining the complex relationship between background and foreground of pavement disease. It uses high-efficiency self-attention mechanism and cross-covariance image transformers(XCiT) to extract semantic features from two-dimensional space and feature channels, respectively, and a semantic locally-enhanced feed-forward (SLeff) module to improve the ability of local feature aggregation. We also propose a new sparse subject sampling point stream module, which is combined with the traditional FPN structure to further alleviate the category imbalance problem of pavement diseases. Finally, we constructed the road disease segmentation dataset based on real scene and compared it with multiple baseline models on this dataset and public dataset. The experimental results demonstrated effectiveness of this model.