点云
计算机科学
云计算
人工智能
杠杆(统计)
计算机视觉
瓶颈
特征(语言学)
图像融合
数据挖掘
图像(数学)
语言学
操作系统
哲学
嵌入式系统
作者
ZhaoWen Li,Shujin Lin,Fan Zhou
标识
DOI:10.1109/smc53992.2023.10394659
摘要
The task of image-guided point cloud completion aims to leverage information from images to address uncertainty issues in the completion inference of point clouds. The key challenge in this setting lies in how to effectively combine features extracted from both modalities. Due to the large domain discrepancy between the image and point cloud, existing methods that use cross-modal attention to directly fuse features have increased attention on redundant information and noise from different modalities, resulting in poor feature fusion performance. Hence, by introducing multi-modal fusion transformers that use bottleneck tokens, we enabled point cloud feature to learn image feature through information bridges, leading to improved point cloud completion performance. Our method can not only benefit from RGB images, but also from sketches with less feature information but more emphasis on edge information. Extensive experiments demonstrate that our proposed method enhances the quality of point cloud completion and outperforms other state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI