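Skeleton (computer programming)
Computer science
Action recognition
Artificial intelligence
Enhanced Data Rates for GSM Evolution
Human skeleton
Action (physics)
Pattern recognition (psychology)
Distillation
Machine learning
Edge detection
Computer vision
Image (mathematics)
Image processing
Chemistry
Chromatography
Programming language
Class (philosophy)
Physics
Quantum mechanics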
Authors
Cheng Dai,Shoupeng Lu,Chuanjie Liu,Bing Guo
Identifier
DOI:10.1016/j.asoc.2023.111166
Abstract
Skeleton-based human action recognition has become one of the most important applications in multimedia IoT systems. However, training a deep model with a large number of parameters requires extensive computational resources, including high-performance computing units and large memory, which seriously limits its effectiveness and efficiency for edge-intelligence multimedia IoT applications. In this paper, a knowledge-distillation-based lightweight deep model is proposed for skeleton-based human action recognition to meet the needs of edge multimedia IoT applications. It achieves competitive recognition accuracy when an AI model is combined with edge surveillance equipment. On the one hand, to achieve desirable accuracy, we propose a deep pose-transition image representation method based on a two-stream spatial–temporal architecture, which mines the hidden features of color texture images in the spatial and temporal domains and fuses them for comprehensive discrimination before final classification. On the other hand, to improve knowledge transfer to the student model on the edge device, we use Tucker decomposition to weaken the teacher model during the knowledge transfer learning process. Finally, we conducted extensive experiments to validate the effectiveness of the proposed approach. The experimental results demonstrate that our proposal can miniaturize the deep model to meet the requirements of edge multimedia IoT systems while achieving competitive performance.
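The abstract does not spell out the training objective, so the following is only a minimal sketch of a generic temperature-scaled distillation loss of the kind commonly used to transfer knowledge from a teacher to a smaller student. It is not the authors' formulation. The function names `softmax` and `distillation_loss`, the `temperature` and `alpha` parameters, and the example logits are all assumptions made for illustration, and the Tucker-decomposition step mentioned in the abstract is not modeled here.

```python
import numpy as np


def softmax(logits, temperature=1.0):
    """Row-wise softmax with a temperature (hypothetical helper)."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Generic distillation objective (an assumption, not the paper's loss).

    Combines a KL term between softened teacher and student distributions
    with an ordinary cross-entropy term on the ground-truth labels.
    """
    eps = 1e-12
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL(teacher || student), averaged over the batch
    kl = np.sum(p_teacher * (np.log(p_teacher + eps) - np.log(p_student + eps)),
                axis=-1).mean()
    # Cross-entropy of the student (temperature 1) against the hard labels
    p_hard = softmax(student_logits)
    ce = -np.log(p_hard[np.arange(len(labels)), labels] + eps).mean()
    # The T^2 factor keeps the soft-target gradients on a comparable scale
    return alpha * temperature ** 2 * kl + (1.0 - alpha) * ce


# Illustrative inputs only: two samples, three action classes
teacher = np.array([[2.0, 1.0, 0.1], [0.5, 2.5, 0.3]])
student = np.array([[1.0, 1.2, 0.3], [0.2, 1.0, 0.9]])
labels = np.array([0, 1])
loss = distillation_loss(student, teacher, labels)
```

With `alpha=1.0` the loss reduces to the soft-target term alone, which is zero when the student reproduces the teacher's logits exactly; lower values of `alpha` give more weight to the ground-truth labels.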