计算机科学
人工智能
计算机视觉
目标检测
点云
对象(语法)
特征(语言学)
RGB颜色模型
投影(关系代数)
点(几何)
模式识别(心理学)
算法
几何学
数学
语言学
哲学
作者
Zhoutao Wang,Qian Xie,Mingqiang Wei,Kun Long,Jun Wang
摘要
In this article, we propose a Multi-feature Fusion VoteNet (MFFVoteNet) framework for improving the 3D object detection performance in cluttered and heavily occluded scenes. Our method takes the point cloud and the synchronized RGB image as inputs to provide object detection results in 3D space. Our detection architecture is built on VoteNet with three key designs. First, we augment the VoteNet input with point color information to enhance the difference of various instances in a scene. Next, we integrate an image feature module into the VoteNet to provide a strong object class signal that can facilitate deterministic detections in occlusion. Moreover, we propose a Projection Non-Maximum Suppression (PNMS) method in 3D object detection to eliminate redundant proposals and hence provide more accurate positioning of 3D objects. We evaluate the proposed MFFVoteNet on two challenging 3D object detection datasets, i.e., ScanNetv2 and SUN RGB-D. Extensive experiments show that our framework can effectively improve the performance of 3D object detection.
科研通智能强力驱动
Strongly Powered by AbleSci AI