计算机科学
跳跃式监视
光学(聚焦)
人工智能
目标检测
对象(语法)
计算机视觉
最小边界框
增强现实
视觉对象识别的认知神经科学
数字图像
图像(数学)
模式识别(心理学)
图像处理
光学
物理
作者
Vladislav Li,Barbara Villarini,Jean‐Christophe Nebel,Θωμάς Λάγκας,Panagiotis Sarigiannidis,Vasileios Argyriou
标识
DOI:10.1109/dcoss-iot58021.2023.00058
摘要
The objective of augmented reality (AR) is to add digital content to natural images and videos to create an interactive experience between the user and the environment. Scene analysis and object recognition play a crucial role in AR, as they must be performed quickly and accurately. In this study, a new approach is proposed that involves using oriented bounding boxes with a detection and recognition deep network to improve performance and processing time. The approach is evaluated using two datasets: a real image dataset (DOTA dataset) commonly used for computer vision tasks, and a synthetic dataset that simulates different environmental, lighting, and acquisition conditions. The focus of the evaluation is on small objects, which are difficult to detect and recognise. The results indicate that the proposed approach tends to produce better Average Precision and greater accuracy for small objects in most of the tested conditions.
科研通智能强力驱动
Strongly Powered by AbleSci AI