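Computer science
Pronunciation
Speech recognition
Stress (linguistics)
Reduction (mathematics)
Speech synthesis
Intonation (linguistics)
Spectrogram
Formant
Prosody
Artificial intelligence
Linguistics
Vowel
Mathematics
Geometry
Philosophy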
Authors
Sixuan Zhao,Soo Ngee Koh,Soon Ing Yann,Kang Kwong Luke
Identifier
DOI:10.1109/icassp.2013.6639265
Abstract
This paper considers the generation of feedback utterances for training the speaking skills of non-native English learners. The proposed feedback combines the learner's voice with the linguistic gestures, i.e., the prosody or pronunciation, of a native speaker. Both an accent reduction method and a voice conversion method are employed to generate feedback stimuli. For accent reduction, three speech synthesis methods, namely pitch-synchronous overlap and add (PSOLA), harmonic stochastic model (HSM), and speech transformation and representation by adaptive interpolation of weighted spectrogram (STRAIGHT), are used to reduce the accent in the utterances of English learners. For voice conversion, the teacher's voice is converted to that of the learner, and the converted speech is used as feedback. Objective measurements are employed to assess the nativeness and acoustic quality of the generated stimuli. A feedback scheme that combines the accent reduction and voice conversion methods is also proposed.