光谱图
计算机科学
卷积神经网络
超参数
人工智能
模式识别(心理学)
人工神经网络
直线(几何图形)
领域(数学)
语音识别
计算机视觉
数学
几何学
纯数学
作者
Agnes Incze,Henrietta-Bernadett Jancso,Zoltán Szilágyi,Attila Farkas,Csaba Sulyok
标识
DOI:10.1109/sisy.2018.8524677
摘要
Convolutional neural networks (CNNs) are powerful toolkits of machine learning which have proven efficient in the field of image processing and sound recognition. In this paper, a CNN system classifying bird sounds is presented and tested through different configurations and hyperparameters. The MobileNet pre-trained CNN model is fine-tuned using a dataset acquired from the Xeno-canto bird song sharing portal, which provides a large collection of labeled and categorized recordings. Spectrograms generated from the downloaded data represent the input of the neural network. The attached experiments compare various configurations including the number of classes (bird species) and the color scheme of the spectrograms. Results suggest that choosing a color map in line with the images the network has been pre-trained with provides a measurable advantage. The presented system is viable only for a low number of classes.
科研通智能强力驱动
Strongly Powered by AbleSci AI