[1]LU Jun,LU Linchao,ZHAI Xiaoyang,et al.High-efficiency 3D object detection for road traffic scenes[J].CAAI Transactions on Intelligent Systems,2025,20(1):91-100.[doi:10.11992/tis.202311013]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
20
Number of periods:
2025 1
Page number:
91-100
Column:
学术论文—机器感知与模式识别
Public date:
2025-01-05
- Title:
-
High-efficiency 3D object detection for road traffic scenes
- Author(s):
-
LU Jun; LU Linchao; ZHAI Xiaoyang; LIU Shuang
-
College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China
-
- Keywords:
-
deep learning; 3D object detection; point cloud; random sampling; local feature aggregation; attention mechanism; autonomous driving
- CLC:
-
TP391
- DOI:
-
10.11992/tis.202311013
- Abstract:
-
Based on the 3D object proposal generation and detection from pointcloud, namely PointRCNN network, this study proposes an RandLA-RCNN architecture to address the issues of high time cost and inefficiency in the point cloud downsampling stage of the current two-stage point cloud object detection algorithm. Firstly, by taking advantage of the efficiency of random sampling method, the large-scale point cloud data are downsampled to handle massive point cloud data. Then, the spatial positions of each neighboring point of the input point cloud are encoded to effectively enhance the ability of each point to extract local features from the neighborhood. Attention-based pooling rules are used to aggregate local feature vectors and obtain global features. Finally, an extended residual module formed by stacking multiple local spatial encoding units and attention pooling units is used to further enhance the global features of each point and avoid the loss of key point information. Experimental results show that this detection algorithm retains the advantages of PointRCNN network in detecting 3D objects, while achieves nearly twice the detection speed compared with PointRCNN, reaching an inference speed of 16 frames per second.