[1]LU Bin,YANG Zhenyu,SUN Yang,et al.3D object detection algorithm with multi-channel cross attention fusion[J].CAAI Transactions on Intelligent Systems,2024,19(4):885-897.[doi:10.11992/tis.202305029]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 4
Page number:
885-897
Column:
学术论文—机器感知与模式识别
Public date:
2024-07-05
- Title:
-
3D object detection algorithm with multi-channel cross attention fusion
- Author(s):
-
LU Bin1; 2; YANG Zhenyu1; 2; SUN Yang1; 2; LIU Yawei1; 2; WANG Minghan1; 2
-
1. School of Control and Compute Engineering, North China Electric Power University, Baoding 071000 China;
2. Hebei Key Laboratory of Knowledge Computing for Energy & Power, North China Electric Power University, Baoding 071000, China
-
- Keywords:
-
3D point cloud; autonomous driving; LiDAR; deep learning; 3D object detection; pillar; cross attention; single-stage algorithm
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202305029
- Abstract:
-
To solve the problems that the existing single-stage 3D object detection algorithm utilizes point cloud downsampling features in a single way and the degree of aggregation of features for the long-range contextual information cannot meet the requirement of enhancing the algorithm performance, we propose a single-stage 3D object detection algorithm based on multi-channel cross attention fusion. First, the channel-wise cross attention module is designed to fuse the down sampled features, which can enhance the expression ability of multi-scale features for the long-range spatial information under different receptive field based on the cross attention mechanism. Then, a cascade feature excitation module is proposed to combine the original downsampling features to cascade channel-wise cross attention weighted features to enhance the algorithm’s learning ability for key spatial features. Extensive experiments were conducted on the public autonomous driving dataset KITTI and compared with mainstream algorithms. As a single-stage algorithm, the detection accuracy was 91.34%, 79.85% and 75.98% for the three difficulty levels of car categories, which were 4.83%, 3.26% and 3.32% better than the baseline algorithm. The experimental results demonstrate the effectiveness and advancement of the algorithm and the proposed modules for 3D object detection task.