计算机科学
人工智能
卷积神经网络
超参数
深度学习
行人检测
机器学习
元启发式
行人
模式识别(心理学)
计算机视觉
运输工程
工程类
作者
Deepak Kumar Jain,Xudong Zhao,Germán González-Almagro,Chenquan Gan,Ketan Kotecha
标识
DOI:10.1016/j.inffus.2023.02.014
摘要
Pedestrian detection (PD) is a vital computer vision (CV) problem that is highly employed in several real-time applications, namely autonomous driving methods, robotics, and security observing methods. Simulated by deep learning (DL) approaches to the recognition of generic objects, several investigation mechanisms have attained maximum recognition accuracy for acceptable scale and non-blocked pedestrians. However, the detection efficiency needed to be improved for complex cases like rare pose samples, crowd scenes, and cases with worse visibility due to daytime or weather. Therefore, this study develops a multimodal pedestrian detection system in crowded scenes using metaheuristics and a deep convolutional neural network (MMPD-MDCNN) technique. The MMPD-MDCNN technique's goal is to identify pedestrians in crowd scenes using different deep-learning models effectively. The proposed MMPD-MDCNN technique integrates three deep learning models: the residual network (ResNet-50), Inception v3, and the capsule network (CapsNet). In addition, the Harris Hawks Optimization (HHO) algorithm is applied for optimal hyperparameter tuning of the deep learning models. For pedestrian detection, the MMPD-MDCNN technique uses the long short-term memory (LSTM) model, and its hyperparameters can be adjusted by the shark smell optimization (SSO) algorithm. To demonstrate the superior performance of the MMPD-MDCNN approach, A comprehensive set of simulations on the INRIA and UCSD datasets was performed to illustrate the superior performance of the MMPD-MDCNN approach. The experimental results suggest that the MMPD-MDCNN model performs well on both datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI