一致性(知识库)
计算机科学
瓶颈
编码(集合论)
航程(航空)
信息丢失
语音识别
算法
人工智能
集合(抽象数据类型)
工程类
程序设计语言
航空航天工程
嵌入式系统
作者
Haoran Sun,Dong Wang,Lantian Li,Chen Chen,Thomas Fang Zheng
标识
DOI:10.1109/tpami.2023.3257839
摘要
Speech disentanglement aims to decompose independent causal factors of speech signals into separate codes. Perfect disentanglement benefits to a broad range of speech processing tasks. This paper presents a simple but effective disentanglement approach based on cycle consistency loss and random factor substitution. This leads to a novel random cycle (RC) loss that enforces analysis-and-resynthesis consistency, a main principle of reductionism. We theoretically demonstrate that the proposed RC loss can achieve independent codes if well optimized, which in turn leads to superior disentanglement when combined with information bottleneck (IB). Extensive simulation experiments were conducted to understand the properties of the RC loss, and experimental results on voice conversion further demonstrate the practical merit of the proposal. Source code and audio samples can be found on the webpage http://rc.cslt.org .
科研通智能强力驱动
Strongly Powered by AbleSci AI