计算机科学
词典
电话
资源(消歧)
抄写(语言学)
语音语料库
自然语言处理
中国
集合(抽象数据类型)
钥匙(锁)
语言学
语音识别
人工智能
语音合成
历史
考古
哲学
程序设计语言
计算机安全
计算机网络
作者
Guanyu Li,Hongzhi Yu,Thomas Fang Zheng,Jinghao Yan,Shipeng Xu
标识
DOI:10.1109/apsipa.2017.8282130
摘要
Tibetan is an important low-resource language in China. A key factor that hinders the speech and language research for Tibetan is the lack of resources, particularly free ones. This paper describes our recent progression on Tibetan resource construction supported by the NSFC M2ASR project, including the phone set, lexicon, as well as the transcription of a large scale speech corpus. Following the M2ASR free data program, all the resources are publicly available and free for researchers. We also release a small Tibetan speech database that can be used to build a proto type Tibetan speech recognition system.
科研通智能强力驱动
Strongly Powered by AbleSci AI