计算机科学
人工智能
透视图(图形)
计算机视觉
对象(语法)
软件可移植性
代表(政治)
目标检测
图像(数学)
深度学习
水准点(测量)
坐标系
模式识别(心理学)
地理
政治
程序设计语言
法学
政治学
大地测量学
作者
Md. Rasel Hossain,Md. Mahbubur Rahman,Mohammad Rezaul Karim,Md. Jaki Al Amin,Chinmay Bepery
标识
DOI:10.1109/ccwc54503.2022.9720795
摘要
We can capture an image or record a video by using a camera, but these frames take the two-dimensional representation of the objects. The main drawback of two-dimensional representation is the depth or perspective view between objects is unknown, sometimes an object is overlapped with another and could not be captured. This drawback can be solved if we could represent the objects by giving them three-dimensional coordinates. This paper proposes a lightweight algorithm that can distinguish different objects and can provide a three-dimensional coordinate to an object. This solution is developed using You Only Look Once (YOLO) for object detection and classification with the COCO dataset, and two cameras for observing the perspective between the objects. The system can provide portability and is easy to use. The algorithm can be used with different types of deep learning models.
科研通智能强力驱动
Strongly Powered by AbleSci AI