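Quaternion
Keyword spotting
Computer science
Convolutional neural network
Footprint
Inference
Artificial intelligence
Speech recognition
Mathematics
Geometry
Biology
Paleontology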
Authors
Aryan Chaudhary, Vinayak Abrol
Identifier
DOI:10.1109/waspaa58266.2023.10248052
Abstract
On-device keyword spotting (KWS) is an essential component for wake-up and user interaction on smart edge devices. Existing low-footprint models are mainly based on 2D and 1D convolutions, where the former is better at capturing invariances while the latter enables faster inference. In this work, we explore quaternion neural models as an alternative for effective acoustic modeling for the KWS task. Quaternion models can embed various facets of input features within the multiple dimensions of the quaternion space. This leads to smaller and more efficient models than their conventional counterparts. We demonstrate this using quaternion versions of the popular KWS models on the Google Command V2 dataset, where our models achieve comparable performance to existing ones. In addition, we also provide an extensive analysis of the learning behavior in the quaternion network to motivate their use in other speech/audio tasks.
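To make the size argument in the abstract concrete, the following is a minimal NumPy sketch of the Hamilton product and of the weight sharing it induces in a quaternion linear layer. It is not the authors' implementation: the function names (`hamilton_product`, `quaternion_weight_matrix`, `parameter_counts`) are hypothetical, and the layer is a generic quaternion dense layer rather than any specific KWS model from the paper.

```python
import numpy as np


def hamilton_product(p, q):
    """Hamilton product of two quaternions stored as (r, i, j, k)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return np.array([
        a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,  # real part
        a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,  # i part
        a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,  # j part
        a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2,  # k part
    ])


def quaternion_weight_matrix(r, i, j, k):
    """Real (4*out, 4*in) matrix built from four (out, in) blocks.

    Multiplying it by an input laid out as [r; i; j; k] is equivalent to
    left-multiplying each input quaternion by the weight quaternion.
    """
    return np.block([
        [r, -i, -j, -k],
        [i,  r, -k,  j],
        [j,  k,  r, -i],
        [k, -j,  i,  r],
    ])


def parameter_counts(out_q, in_q):
    """Free parameters: quaternion layer vs. dense real layer of equal shape."""
    quaternion_params = 4 * out_q * in_q         # four shared blocks
    real_params = (4 * out_q) * (4 * in_q)       # every entry independent
    return quaternion_params, real_params


# Hypothetical layer size, chosen only for illustration.
rng = np.random.default_rng(0)
blocks = [rng.standard_normal((8, 16)) for _ in range(4)]
W = quaternion_weight_matrix(*blocks)            # shape (32, 64)
q_params, r_params = parameter_counts(out_q=8, in_q=16)
```

Under these assumptions the real-valued matrix `W` has 32 × 64 entries, but only the four 8 × 16 blocks are free, so the quaternion layer stores a quarter of the weights of an unconstrained real layer with the same input and output width.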