Stream Weight Training Based on MCE for Audio-Visual LVCSR

来源 :Tsinghua Science and Technology | 被引量 : 0次 | 上传用户:fei000chong
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present the lattice re- scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental re- sults show that in the case of clean audio, the system performance can be improved by 36.1% in relative word error rate reduction when using state-based stream weights trained by a Viterbi approach, compared to an audio only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides significant enhancement of robustness in noisy environments. In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present the lattice re- scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental re- sults show that in the case of clean audio, the system performance can be improved by 36.1% in relative word error rate reduction when using state- based stream weights trained by a Viterbi approach, compared to an audio only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides significant enhancement of robustness in noisy environments.
其他文献
美国著名杂志,曾经选出年度好莱坞花钱最小气的明星,简称“省长”。结果“省长”前四名全由美女包办:No.1凯瑟琳·丽塔琼斯、No.2朱丽叶·罗伯兹、No.3费唐娜薇、No.4凯丽·斯塔
数值仿真不同治疗参数条件下高强度聚焦超声(high intensity focused ultrasound,HIFU)可治疗区域的变化,对HIFU治疗剂量的确定具有重要的指导意义。本文采用Westervelt方程
目的 研究多潘立酮联合复方氢氧化铝治疗胆汁反流性胃炎的临床效果.方法 随机抽取我院自2018年5月至2020年5月两年期间中收治的86例胆汁反流性胃炎患者,以奇偶分组法将其分为
针对克拉玛依农业灌溉区农户用水情况,详细阐述应用超声波流量计仪表来准确计量及远程监测系统,实时掌握并合理安排灌溉方式,为供水部门计划用水、科学管水提供依据.为做好水
计算机技术的自主更新与各行业信息化安全的不断需求,推动了企业安全技术的发展.本文以系统与网络安全技术在勘探决策系统中的应用为多个实例,对当今较为流行的电源系统优化
本文分析了油田勘探综合研究过程中数据应用现状及存在的突出问题,探讨了建立勘探项目数据库的思路和实施方案.通过建立勘探项目数据库,为勘探综合研究工作提供丰富基础数据,
目的:本次研究的目的为分析宫外孕治疗中开腹手术与腹腔镜手术效果对比.方法:本次将选择2018年6月至2019年6月期间前来我院就诊的宫外孕患者80位.采用随机分组法将患者平均分
目的:分析卡前列素氨丁三醇治疗产妇产后出血的临床疗效.方法:回顾分析我院2019年5月至2020年4月期间收治产妇产后出血患者80例,按治疗方式分组,其中40例接受缩宫素治疗(对照
目的:分析循证药学在临床用药干预中的应用效果.方法:随机选取本院2019年2月-2020年1月期间收治的79例循证药学干预的患者进行此次研究,总结循证药学干预的过程,分析其应用效