压力(语言学)
计算机科学
自然语言处理
人工智能
语言学
语音识别
哲学
作者
Maria Luisa García Lecumberri,Roberto Barra-Chicote,Rubén Pérez Ramón,Junichi Yamagishi,Martin Cooke
标识
DOI:10.21437/interspeech.2014-324
摘要
For most of us, speaking in a non-native language involves deviating to some extent from native pronunciation norms. However, the detailed basis for foreign accent (FA) remains elusive, in part due to methodological challenges in isolating segmental from suprasegmental factors. The current study examines the role of segmental features in conveying FA through the use of a generative approach in which accent is localised to single consonantal segments. Three techniques are evaluated: the first requires a highly-proficiency bilingual to produce words with isolated accented segments; the second uses cross-splicing of context-dependent consonants from the non-native language into native words; the third employs hidden Markov model synthesis to blend voice models for both languages. Using English and Spanish as the native/non-native languages respectively, listener cohorts from both languages identified words and rated their degree of FA. All techniques were capable of generating accented words, but to differing degrees. Naturally-produced speech led to the strongest FA ratings and synthetic speech the weakest, which we interpret as the outcome of over-smoothing. Nevertheless, the flexibility offered by synthesising localised accent encourages further development of the method. Index Terms: Foreign accent, speech synthesis, splicing
科研通智能强力驱动
Strongly Powered by AbleSci AI