Effective Infant Cry Signal Analysis and Reasoning using IARO based Leaky Bi-LSTM Model

计算机科学 信号(编程语言) 人工智能 程序设计语言
作者
B.M. Mala,Smita Chormunge
出处
期刊:Computer Speech & Language [Elsevier]
卷期号:: 101621-101621
标识
DOI:10.1016/j.csl.2024.101621
摘要

In the present scenario, the recognition of particular emotions or needs from an infant's cry is a difficult process in the field of pattern recognition as it does not have any verbal information. In this article, an automated model is introduced for an effective recognition of infant cries. At first, the infant cry signals are collected from the Baby Chillanto (BC) dataset and the Donate a Cry Corpus (DCC) dataset. These acquired signals are converted into feature vectors by employing nine techniques namely, Zero Crossing Rate (ZCR), acoustic features, audio features, amplitude, energy, Root Mean Square (RMS), statistical moments, autocorrelation, and Mel-Frequency Cepstral Coefficients (MFCCs). These obtained feature vectors are multi-dimensional; therefore, a Simulated Annealing Algorithm (SAA) is employed to select informative feature vectors. The selected informative feature vectors are passed to the leaky Bi-directional Long Short Term Memory (Bi-LSTM) model for classifying the types of infant cries. Specifically, in the leaky Bi-LSTM model, the conventional activation functions (Tangent (Tanh) and sigmoid) are replaced with the leaky Rectified Linear Unit (leaky ReLU) activation function. This process significantly mitigates the vanishing gradient problem and improves convergence during data training, which is vital for signal classification tasks. Furthermore, an Improved Artificial Rabbit's Optimization (IARO) algorithm is proposed to choose optimal hyper-parameters in the leaky Bi-LSTM model, where this mechanism reduces the complexity and training time of the classification model. In the IARO algorithm, selective opposition and Lévy flight strategies are integrated with the conventional ARO algorithm to enhance the dynamics and diversity of the population, along with the model's tracking efficiency. The empirical investigation denotes that the proposed IARO based leaky Bi-LSTM model achieves 99.66% and 95.92% of classification accuracy on the BC and DCC datasets, respectively. The proposed IARO based leaky Bi-LSTM model achieves maximum classification results when related to the conventional recognition models.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
qin发布了新的文献求助20
刚刚
小二郎应助foolingtheblind采纳,获得10
1秒前
bkagyin应助小米采纳,获得10
3秒前
我d温柔乡完成签到,获得积分20
6秒前
7秒前
郝好完成签到 ,获得积分10
7秒前
8秒前
橘子海完成签到,获得积分20
9秒前
方格格格完成签到,获得积分20
10秒前
动心忍性关注了科研通微信公众号
12秒前
ajiduo发布了新的文献求助10
14秒前
翻翻发布了新的文献求助10
16秒前
坚定的羽毛完成签到,获得积分10
17秒前
18秒前
19秒前
慕青应助DocRivers采纳,获得10
19秒前
方格格格发布了新的文献求助30
21秒前
叶楠完成签到,获得积分10
22秒前
sahjdkah发布了新的文献求助20
24秒前
丘比特应助可可杨采纳,获得10
27秒前
快哒哒哒完成签到 ,获得积分10
28秒前
烟花应助Sunech采纳,获得10
29秒前
领导范儿应助美满电灯胆采纳,获得10
30秒前
淘淘发布了新的文献求助10
30秒前
34秒前
Ava应助结实星星采纳,获得10
35秒前
星辰大海应助寂寞的安筠采纳,获得10
36秒前
脑洞疼应助sahjdkah采纳,获得10
37秒前
37秒前
38秒前
博大精森完成签到,获得积分10
38秒前
40秒前
可可杨发布了新的文献求助10
41秒前
42秒前
大模型应助瀚海的雄狮采纳,获得10
44秒前
sahjdkah完成签到,获得积分10
44秒前
44秒前
yujing发布了新的文献求助30
45秒前
46秒前
22完成签到,获得积分10
46秒前
高分求助中
Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 1600
Exploring Mitochondrial Autophagy Dysregulation in Osteosarcoma: Its Implications for Prognosis and Targeted Therapy 1500
LNG地下式貯槽指針(JGA指-107) 1000
LNG地上式貯槽指針 (JGA指 ; 108) 1000
QMS18Ed2 | process management. 2nd ed 600
LNG as a marine fuel—Safety and Operational Guidelines - Bunkering 560
Clinical Interviewing, 7th ed 400
热门求助领域 (近24小时)
化学 医学 材料科学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 物理化学 催化作用 免疫学 细胞生物学 电极
热门帖子
关注 科研通微信公众号,转发送积分 2940137
求助须知:如何正确求助?哪些是违规求助? 2597822
关于积分的说明 6996141
捐赠科研通 2240088
什么是DOI,文献DOI怎么找? 1189412
版权声明 590152
科研通“疑难数据库(出版商)”最低求助积分说明 582311