Mel倒谱
预处理器
持续时间(音乐)
计算机科学
期限(时间)
语音识别
短时记忆
模式识别(心理学)
人工智能
数据预处理
倒谱
特征提取
声学
人工神经网络
物理
量子力学
循环神经网络
作者
Nantalira Niar Wijaya,De Rosal Ignatius Moses Setiadi,Ahmad Rofiqul Muslikh
摘要
Music genre classification is one part of the music recommendation process, which is a challenging job. This research proposes the classification of music genres using Bidirectional Long Short-Term Memory (BiLSTM) and Mel-Frequency Cepstral Coefficients (MFCC) extraction features. This method was tested on the GTZAN and ISMIR2004 datasets, specifically on the IS-MIR2004 dataset, a duration cutting operation was carried out, which was only taken from seconds 31 to 60 so that it had the same duration as GTZAN, namely 30 seconds. Preprocessing operations by removing silent parts and stretching are also performed at the preprocessing stage to obtain normalized input. Based on the test results, the performance of the proposed method is able to produce accuracy on testing data of 93.10% for GTZAN and 93.69% for the ISMIR2004 dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI