卷积神经网络
计算机科学
人工智能
分类
对象(语法)
鉴定(生物学)
目标检测
失真(音乐)
视觉对象识别的认知神经科学
模式识别(心理学)
计算机视觉
算法
深度学习
机器学习
生物
放大器
植物
带宽(计算)
计算机网络
作者
T J Nandhini,K. Thinakaran
标识
DOI:10.1109/aisp57993.2023.10134980
摘要
Object detection algorithms must first identify all the objects inside an image before machine vision can properly categorize and localize them. Many methods have been proposed to handle this problem, with most of the motivation coming from computer vision and deep learning methods. However, prevailing technologies have never effectively recognized tiny, dense things and often failed to detect objects that have undergone random geometric alterations. We analyze the current state of the art in object identification and propose a deformable convolutional network with adjustable depths to address these concerns. The results of our research suggest that they are better than the current best practices, blend deep convolutional networks with flexible convolutional structures to account for geometric variations, and get multi-scaled features. Next, we perform the remaining phases of object identification and region regress by up-sampling the fusion of multi-scaled elements. Experimental validation of our proposed framework demonstrates a considerable improvement in accuracy relative to time spent recognizing small target objects with geometric distortion.
科研通智能强力驱动
Strongly Powered by AbleSci AI