点云
计算机科学
人工智能
特征(语言学)
特征学习
立体视觉
计算机视觉
关系(数据库)
块(置换群论)
稳健性(进化)
模式识别(心理学)
数据挖掘
数学
基因
化学
几何学
生物化学
语言学
哲学
作者
Rong Zhao,Xie Han,Xindong Guo,Liqun Kuang,Xiaowen Yang,Fusheng Sun
出处
期刊:IEEE Transactions on Circuits and Systems for Video Technology
[Institute of Electrical and Electronics Engineers]
日期:2023-04-17
卷期号:33 (11): 6747-6763
被引量:5
标识
DOI:10.1109/tcsvt.2023.3267457
摘要
Learning-based multi-view stereo (MVS) is gaining prominence as a method for 3D reconstruction. However, existing methods in the process of feature learning fail to focus on the structural information implied in the scene. This oversight prevents the network from perceiving the geometric properties of the scene and weakens the generalizability of the network. Therefore, we propose a novel framework named Point Feature Relation Network for Multi-view Stereo (PFR-MVSNet), which is composed of a Dynamic Structure Perception (DSP) module, an Adaptive Feature Exploration (AFE) module, and a Point Transformer Block (PTB) module, to solve the problems caused by the oversight. The DSP module first augments the feature of the 3D point cloud from multi-view 2D features, then establishes spatial structure relations within local regions on the point cloud and guides the feature learning of points through the aggregated structure information. After the network has fully learned the intra-region structure features, the AFE module repartitions perception regions with similar features. The point features within the regions are further learned by the PTB module. We evaluate our method on three benchmark datasets: DTU, Tanks & Temples, and ETH3D. The experimental results show that our method achieves superior accuracy of 0.289 mm on the DTU dataset and exhibits more robust generalization on the Tanks & Temples and ETH3D datasets compared with other learning-based MVS methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI