分割
计算机科学
点云
云计算
比例(比率)
人工智能
网(多面体)
数据挖掘
地理
数学
地图学
几何学
操作系统
出处
期刊:IEEE Transactions on Intelligent Transportation Systems
[Institute of Electrical and Electronics Engineers]
日期:2024-03-27
卷期号:25 (8): 9153-9167
被引量:2
标识
DOI:10.1109/tits.2024.3373507
摘要
Accurate understanding of 3D objects in complex scenes plays essential roles in the fields of intelligent transportation and autonomous driving technology. Recent deep neural networks have made significant progress in 3D visual tasks by using point cloud data. However, the acquisition of geometric features and the expression of local fine-grained features in point clouds are still not sufficient for the classification and segmentation tasks. Inspired by the application of transformer structures in 2D and 3D computer vision tasks, in this paper, a multi-scale neighborhood aggregation transformer network (MNAT-Net) is proposed for point cloud classification and segmentation, which captures the global semantic information and local geometric structure features of point clouds by aggregating the receptive field and node weights. MNAT-Net consists of three key components, namely the multi-scale neighborhood feature aggregation module, the global transformer module and the category-weighted focal loss. The neighborhood features learned by the MNAT-Net network is sent to the global transformer module to fully enrich the contextual representation. Experimental results show that MNAT-Net achieves competitive performance on publicly available ModelNet40, ShapeNet, S3DIS and SemanticKITTI data sets in comparison to related methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI