Computer science
Object (grammar)
Artificial intelligence
Information retrieval
Computer vision
Authors
Maria Pegia,Björn Þór Jónsson,Anastasia Moumtzidou,Sotiris Diplaris,Ilias Gialampoukidis,Stefanos Vrochidis,Ioannis Kompatsiaris
Identifier
DOI:10.1007/978-3-031-53302-0_14
Abstract
Three-dimensional (3D) retrieval of objects and models plays a crucial role in many application areas, such as industrial design, medical imaging, gaming and virtual and augmented reality. Such 3D retrieval involves storing and retrieving different representations of single objects, such as images, meshes or point clouds. Early approaches considered only one such representation modality, but recently the CMCL method has been proposed, which considers multimodal representations. Multimodal retrieval, meanwhile, has recently seen significant interest in the image retrieval domain. In this paper, we explore the application of state-of-the-art multimodal image representations to 3D retrieval, in comparison to existing 3D approaches. In a detailed study over two benchmark 3D datasets, we show that the MuseHash approach from the image domain outperforms other approaches, improving recall over the CMCL approach by about 11% for unimodal retrieval and 9% for multimodal retrieval.