光谱图
计算机科学
判别式
语音识别
卷积(计算机科学)
卷积神经网络
乌兹别克语
人工智能
模式识别(心理学)
深度学习
人工神经网络
图像(数学)
语言学
哲学
作者
Muhammadjon Musaev,Ilyos Khujayorov,Mannon Ochilov
标识
DOI:10.1145/3386164.3389100
摘要
In this paper has been discussed about speech recognition using spectrogram images and deep convolution neural network(CNN) of Uzbek spoken digits. Spectrogram images from speech signal were generated and it were used for deep CNN training. Presented CNN model contains 3 convolution layers and 2 fully connected layers that discriminative features can be divided and estimated of spectrogram images by those layers. In current research period, dataset of Uzbek spoken digits were made and in based on presented CNN model they were trained. Testing results shows that, proposed approach for Uzbek spoken digits classified 100% accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI