EmoMusicTV: Emotion-Conditioned Symbolic Music Generation With Hierarchical Transformer VAE

计算机科学 自编码 人工智能 语音识别 Chord(对等) 自然语言处理 深度学习 分布式计算
作者
Shulei Ji,Xinyu Yang
出处
期刊:IEEE Transactions on Multimedia [Institute of Electrical and Electronics Engineers]
卷期号:26: 1076-1088 被引量:7
标识
DOI:10.1109/tmm.2023.3276177
摘要

Emotion is one of the most crucial attributes of music. However, due to the scarcity of emotional music datasets, emotion-conditioned symbolic music generation using deep learning techniques has not been investigated in depth. In particular, no study explores conditional music generation with the guidance of emotion, and few studies adopt time-varying emotional conditions. To address these issues, first, we endow three public lead sheet datasets with fine-grained emotions by automatically computing the valence labels from the chord progressions. Second, we propose a novel and effective encoder-decoder architecture named EmoMusicTV to explore the impact of emotional conditions on multiple music generation tasks and to capture the rich variability of musical sequences. EmoMusicTV is a transformer-based variational autoencoder (VAE) that contains a hierarchical latent variable structure to model holistic properties of the music segments and short-term variations within bars. The piece-level and bar-level emotional labels are embedded in their corresponding latent spaces to guide music generation. Third, we pretrain EmoMusicTV with the lead sheet continuation task to further improve its performance on conditional melody or harmony generation. Experimental results demonstrate that EmoMusicTV outperforms previous methods on three tasks, i.e., melody harmonization, melody generation given harmony, and lead sheet generation. Ablation studies verify the significant roles of emotional conditions and hierarchical latent variable structure on conditional music generation. Human listening shows that the lead sheets generated by EmoMusicTV are closer to the ground truth (GT) and perform slightly worse than the GT in conveying emotional polarity.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
万能图书馆应助何想采纳,获得10
刚刚
清漪完成签到,获得积分10
1秒前
星辰大海应助QWE采纳,获得10
1秒前
黄小翰完成签到,获得积分10
1秒前
1秒前
酷酷世德发布了新的文献求助10
1秒前
2秒前
煤灰发布了新的文献求助10
2秒前
2秒前
清漪发布了新的文献求助30
3秒前
4秒前
5秒前
所所应助tsngl采纳,获得10
5秒前
微澜发布了新的文献求助30
5秒前
6秒前
6秒前
Huobol完成签到,获得积分10
7秒前
安益平完成签到,获得积分10
8秒前
sheldoo完成签到 ,获得积分10
8秒前
彳亍发布了新的文献求助10
8秒前
左丘以云完成签到,获得积分10
8秒前
8秒前
认真的连虎完成签到,获得积分10
8秒前
9秒前
9秒前
汉堡包应助小张采纳,获得10
9秒前
丘比特应助淘气科研采纳,获得10
9秒前
9秒前
哭泣的翠丝完成签到,获得积分10
9秒前
zhouxinxiao发布了新的文献求助10
9秒前
飞翔的葡萄籽完成签到,获得积分10
10秒前
Tingshan完成签到,获得积分10
10秒前
论文仙人兔一乐完成签到,获得积分10
10秒前
安益平发布了新的文献求助50
11秒前
11秒前
左丘以云发布了新的文献求助10
11秒前
11秒前
12秒前
12秒前
高分求助中
Picture Books with Same-sex Parented Families: Unintentional Censorship 1000
A new approach to the extrapolation of accelerated life test data 1000
ACSM’s Guidelines for Exercise Testing and Prescription, 12th edition 500
Nucleophilic substitution in azasydnone-modified dinitroanisoles 500
不知道标题是什么 500
Indomethacinのヒトにおける経皮吸収 400
Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 370
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 冶金 细胞生物学 免疫学
热门帖子
关注 科研通微信公众号,转发送积分 3978493
求助须知:如何正确求助?哪些是违规求助? 3522581
关于积分的说明 11213889
捐赠科研通 3260014
什么是DOI,文献DOI怎么找? 1799712
邀请新用户注册赠送积分活动 878604
科研通“疑难数据库(出版商)”最低求助积分说明 807002