A Dialectal Chinese Speech Recognition Framework

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:a370412412
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
A framework for dialectal Chinese speech recognition is proposed and studied, in which a relatively small dialectal Chinese (or in other words Chinese influenced by the native dialect) speech corpus and dialect-related knowledge are adopted to transform a standard Chinese (or Putonghua, abbreviated as PTH) speech recognizer into a dialectal Chinese speech recognizer. Two kinds of knowledge sources are explored: one is expert knowledge and the other is a small dialectal Chinese corpus. These knowledge sources provide information at four levels: phonetic level, lexicon level, language level,and acoustic decoder level. This paper takes Wu dialectal Chinese (WDC) as an example target language. The goal is to establish a WDC speech recognizer from an existing PTH speech recognizer based on the Initial-Final structure of the Chinese language and a study of how dialectal Chinese speakers speak Putonghua. The authors propose to use contextindependent PTH-IF mappings (where IF means either a Chinese Initial or a Chinese Final), context-independent WDC-IF mappings, and syllable-dependent WDC-IF mappings (obtained from either experts or data), and combine them with the supervised maximum likelihood linear regression (MLLR) acoustic model adaptation method. To reduce the size of the multipronunciation lexicon introduced by the IF mappings, which might also enlarge the lexicon confusion and hence lead to the performance degradation, a Multi-Pronunciation Expansion (MPE) method based on the accumulated uni-gram probability (AUP) is proposed. In addition, some commonly used WDC words are selected and added to the lexicon. Compared with the original PTH speech recognizer, the resulting WDC speech recognizer achieves 10-18% absolute Character Error Rate (CER) reduction when recognizing WDC, with only a 0.62% CER increase when recognizing PTH. The proposed framework and methods are expected to work not only for Wu dialectal Chinese but also for other dialectal Chinese languages and even other languages.
其他文献
目的:了解新疆维吾尔族乡村已婚育龄期妇女生活质量现状及影响因素。方法:用SF-36量表对新疆612名维吾尔族乡村已婚育龄期妇女进行问卷调查。结果:新疆维吾尔族已婚育龄期妇
注射用硫普宁又名凯西莱,主要成分为硫普罗定,主要用于改善各类急慢性肝炎病人的肝功能;用于脂肪肝、酒精肝、药物性肝损伤的治疗以及重金属的解毒;降低放疗、化疗所致的毒副
采用均匀颗粒成型法 (UniformDropletSpray -UDS)制备Fe基合金粉末。该方法制备的颗粒粒度分布均匀 ,性能一致 ,大大优于传统的粉末制备方法 (如破碎法、气雾化 ) ,具有广阔
期刊
浓缩饲料是由蛋白质、矿物质、微量元素、维生素和非营养性添加剂等成分按一定比例配制而成的混合物.使用时,只要将其按一定比例掺入由玉米、麸皮、高粱、大麦等原料配制成的
颈动脉的左右两侧的解剖结构存在差异.左侧颈总动脉直接起自主动脉弓,而右侧颈总动脉起自头臂干.两侧颈总动脉均在胸锁乳突肌的深面平对甲状软骨上缘处分为颈内动脉和颈外动
我国胎儿出生缺陷的发生率约占所有新生儿的4%~6%,每年大约有80~100万的畸形儿出生,给家庭和社会带来巨大的精神和沉重的经济负担.超声作为一项无创伤的技术,在产前诊断胎儿畸
期刊