A Dialectal Chinese Speech Recognition Framework

来源 :计算机科学技术学报（英文版） | 被引量 : 0次 | 上传用户：a370412412

【摘要】

：

A framework for dialectal Chinese speech recognition is proposed and studied, in which a relatively small dialectal Chinese (or in other words Chinese influence

【作者】

：

Jing Li Thomas Fang Zheng William Byrne Dan Jurafsky

【机构】

：

Center for Speech Technology, State Key Laboratory of Intelligent Technology and Systems Department

【出处】

：

计算机科学技术学报（英文版）

【发表日期】

：

2006年1期

【关键词】

：

dialectal Chinese speech recognition initial or final (IF) IF-mapping rule pronu

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

A framework for dialectal Chinese speech recognition is proposed and studied, in which a relatively small dialectal Chinese (or in other words Chinese influenced by the native dialect) speech corpus and dialect-related knowledge are adopted to transform a standard Chinese (or Putonghua, abbreviated as PTH) speech recognizer into a dialectal Chinese speech recognizer. Two kinds of knowledge sources are explored: one is expert knowledge and the other is a small dialectal Chinese corpus. These knowledge sources provide information at four levels: phonetic level, lexicon level, language level,and acoustic decoder level. This paper takes Wu dialectal Chinese (WDC) as an example target language. The goal is to establish a WDC speech recognizer from an existing PTH speech recognizer based on the Initial-Final structure of the Chinese language and a study of how dialectal Chinese speakers speak Putonghua. The authors propose to use contextindependent PTH-IF mappings (where IF means either a Chinese Initial or a Chinese Final), context-independent WDC-IF mappings, and syllable-dependent WDC-IF mappings (obtained from either experts or data), and combine them with the supervised maximum likelihood linear regression (MLLR) acoustic model adaptation method. To reduce the size of the multipronunciation lexicon introduced by the IF mappings, which might also enlarge the lexicon confusion and hence lead to the performance degradation, a Multi-Pronunciation Expansion (MPE) method based on the accumulated uni-gram probability (AUP) is proposed. In addition, some commonly used WDC words are selected and added to the lexicon. Compared with the original PTH speech recognizer, the resulting WDC speech recognizer achieves 10-18% absolute Character Error Rate (CER) reduction when recognizing WDC, with only a 0.62% CER increase when recognizing PTH. The proposed framework and methods are expected to work not only for Wu dialectal Chinese but also for other dialectal Chinese languages and even other languages.

其他文献

新疆维吾尔族乡村已婚育龄期妇女生命质量调查研究

目的:了解新疆维吾尔族乡村已婚育龄期妇女生活质量现状及影响因素。方法:用SF-36量表对新疆612名维吾尔族乡村已婚育龄期妇女进行问卷调查。结果:新疆维吾尔族已婚育龄期妇

期刊

新疆维吾尔族育龄期生活质量妇女生活生命质量社会功能思想教育活动情感职能精神健康维度得分

1例凯西莱致过敏性休克病人的护理

注射用硫普宁又名凯西莱,主要成分为硫普罗定,主要用于改善各类急慢性肝炎病人的肝功能;用于脂肪肝、酒精肝、药物性肝损伤的治疗以及重金属的解毒;降低放疗、化疗所致的毒副

一种制备磁性粉末的新技术--均匀颗粒成型法

采用均匀颗粒成型法 (UniformDropletSpray -UDS)制备Fe基合金粉末。该方法制备的颗粒粒度分布均匀 ,性能一致 ,大大优于传统的粉末制备方法 (如破碎法、气雾化 ) ,具有广阔

期刊

均匀颗粒成型法Fe基合金粉末

基于环形振荡器的绑定前硅通孔测试

期刊

浓缩饲料的配制及使用方法

浓缩饲料是由蛋白质、矿物质、微量元素、维生素和非营养性添加剂等成分按一定比例配制而成的混合物.使用时,只要将其按一定比例掺入由玉米、麸皮、高粱、大麦等原料配制成的

期刊

浓缩饲料原料配制能量饲料非营养性添加剂畜禽饲养全价配合饲料蛋白质饲料运输费用营养需要营养不足微量元素推广应用推广使用饲料原料饲料成本

健康人颈动脉超声结构和功能双侧对比的研究

颈动脉的左右两侧的解剖结构存在差异.左侧颈总动脉直接起自主动脉弓,而右侧颈总动脉起自头臂干.两侧颈总动脉均在胸锁乳突肌的深面平对甲状软骨上缘处分为颈内动脉和颈外动

期刊

健康人颈动脉超声结构和功能颈总动脉血流动力学胸锁乳突肌统计学意义颈动脉弹性甲状软骨上主动脉弓血管病变颈外动脉颈内动脉解剖结构对正常人

意大利:免费短信提示菜价助省钱/科学家发现可利用线虫控制作物害虫/2009年韩国可能减少玉米进口

期刊

意大利短信提示科学家利用线虫控制作物害虫韩国玉米

超声在产前胎儿畸形诊断中的应用

我国胎儿出生缺陷的发生率约占所有新生儿的4%～6%,每年大约有80～100万的畸形儿出生,给家庭和社会带来巨大的精神和沉重的经济负担.超声作为一项无创伤的技术,在产前诊断胎儿畸

期刊

超声诊断产前诊断胎儿畸形出生缺陷应用价值统计资料经济负担对比分析新生儿无创伤畸形儿分娩后发生率引产显示家庭技术

草粒饲料压制机;颗粒产品包装机;双螺杆万能膨化机;定量灌装机;全自动回转式杀菌锅;多功能热灌装机组;电控电动磨粉机;魔芋精炼机;拉袋粒类自动包装机;煎炸油过滤机;多功能粉皮生产线

期刊

饲料压制机颗粒产品自动包装机双螺杆膨化机定量灌装机全自动回转式杀菌锅多功能热灌装机组电控电动磨粉机魔芋精炼机煎炸油过滤机粉皮

转炉上料除尘系统技改工程

期刊

A Dialectal Chinese Speech Recognition Framework

与本文相关的学术论文