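Computer science
Encoder
Sequence (biology)
Multipath propagation
Action (physics)
Artificial intelligence
Pattern recognition (psychology)
Telecommunications
Channel (broadcasting)
Genetics
Physics
Quantum mechanics
Biology
Operating system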
Authors
Y H Qiu, Li Niu, Feng Sha
Identifier
DOI:10.1016/j.eswa.2024.123760
Abstract
Counting repetitive actions is important in work and daily life. Automated counting using deep learning provides a more efficient and accurate alternative to manual counting, which is tedious and error-prone. Deep-learning models have been proposed to automatically count repetitive actions in video content. However, for these models to be applied to realistic scenes, high-quality performance and generalization to multiple environments, particularly for long videos, are essential. To address these challenges, we propose a new model, ME-RAC, which includes a multipath 3D-Conv encoder module, and we also propose a temporal-sequence random-combination data augmentation to improve counting performance and prevent model over-fitting during training. Additionally, we propose the temporal-sequence-decision (TSD) framework to realize long repetitive-action counting in complex realistic scenes. We conducted experiments to validate that our proposed methods perform better than comparable methods and that our TSD framework achieves unique performance in long repetitive-action-counting tasks.