卷积(计算机科学)
计算机科学
动作识别
动作(物理)
残余物
构造(python库)
卷积神经网络
透视图(图形)
人工智能
模式识别(心理学)
算法
计算机网络
人工神经网络
量子力学
物理
班级(哲学)
作者
Haoqi Duan,Chao Zhang,Zhao Lv
标识
DOI:10.1109/cis54983.2021.00028
摘要
With the number of motor vehicles rising year by year, traffic safety problems are becoming increasingly serious. In this paper, we study the application of convolutional networks to driving action recognition from the perspective of the driver's action in a car. Due to the lack of current driving action datasets, we construct a driving action video dataset DAD-10 containing the entire upper body of the driver. We compare and analyze the effectiveness of 2D convolution and 3D convolution in processing the driving action video data, and based on this, we design a network structure called M(2-1)D that mixes 2D convolution and 3D convolution. The M(2-1)D network uses residual learning and utilizes convolution splitting to reduce the number of network parameters. The network structure achieves a recognition accuracy of 98.5% on the DAD-10 dataset, which has a very high application value.
科研通智能强力驱动
Strongly Powered by AbleSci AI