Stream Weight Training Based on MCE for Audio-Visual LVCSR

来源 :Tsinghua Science and Technology | 被引量 : 0次 | 上传用户：fei000chong

【摘要】

：

In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training ba

【作者】

：

刘鹏王作英

【机构】

：

Department of Electronic Engineering, Tsinghua University, Beijing 100084, China,Department of Elect

【出处】

：

Tsinghua Science and Technology

【发表日期】

：

2005年02期

【关键词】

：

Training aligned dimensions sentences deletion otherwise alter

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present the lattice re- scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental re- sults show that in the case of clean audio, the system performance can be improved by 36.1% in relative word error rate reduction when using state-based stream weights trained by a Viterbi approach, compared to an audio only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides significant enhancement of robustness in noisy environments. In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present the lattice re- scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental re- sults show that in the case of clean audio, the system performance can be improved by 36.1% in relative word error rate reduction when using state- based stream weights trained by a Viterbi approach, compared to an audio only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides significant enhancement of robustness in noisy environments.

其他文献

美国明星“省长”大比拼

美国著名杂志，曾经选出年度好莱坞花钱最小气的明星，简称“省长”。结果“省长”前四名全由美女包办：No.1凯瑟琳·丽塔琼斯、No.2朱丽叶·罗伯兹、No.3费唐娜薇、No.4凯丽·斯塔

期刊

言承旭郭富城割地赔款凯丽明日之星千里之外光鲜亮丽亲笔签名刘田给你

高强度聚焦超声可治疗区域的仿真研究

数值仿真不同治疗参数条件下高强度聚焦超声(high intensity focused ultrasound,HIFU)可治疗区域的变化,对HIFU治疗剂量的确定具有重要的指导意义。本文采用Westervelt方程

期刊

治疗区域高强度聚焦超声声学特性组织声学特性治疗剂量照射时间声强生物热传导方程时域有限差分法最高温升

多潘立酮联合复方氢氧化铝治疗胆汁反流性胃炎的临床分析

目的研究多潘立酮联合复方氢氧化铝治疗胆汁反流性胃炎的临床效果.方法随机抽取我院自2018年5月至2020年5月两年期间中收治的86例胆汁反流性胃炎患者,以奇偶分组法将其分为

期刊

多潘立酮复方氢氧化铝胆汁反流性胃炎临床疗效不良反应

计量及远程监测在农业灌溉中的应用

针对克拉玛依农业灌溉区农户用水情况,详细阐述应用超声波流量计仪表来准确计量及远程监测系统,实时掌握并合理安排灌溉方式,为供水部门计划用水、科学管水提供依据.为做好水

会议

计量远程监测系统农业灌溉合理调配水资源超声波流量计水平衡测试用水情况克拉玛依

头发怎样阐释健康如何养护

期刊

头发阐释健康

勘探决策系统中的多种安全技术研究

计算机技术的自主更新与各行业信息化安全的不断需求,推动了企业安全技术的发展.本文以系统与网络安全技术在勘探决策系统中的应用为多个实例,对当今较为流行的电源系统优化

会议

油田勘探决策系统网络安全技术行业信息化网络数据备份计算机技术应用效率研究成果

勘探项目数据库建设技术研究

本文分析了油田勘探综合研究过程中数据应用现状及存在的突出问题,探讨了建立勘探项目数据库的思路和实施方案.通过建立勘探项目数据库,为勘探综合研究工作提供丰富基础数据,

会议

油田勘探项目数据库综合应用现状研究效果实施方案基础数据工作周期

开腹宫外孕手术与腹腔镜下宫外孕手术的临床效果分析

目的:本次研究的目的为分析宫外孕治疗中开腹手术与腹腔镜手术效果对比.方法:本次将选择2018年6月至2019年6月期间前来我院就诊的宫外孕患者80位.采用随机分组法将患者平均分

期刊

宫外孕开腹手术腹腔镜手术生育功能

卡前列素氨丁三醇治疗产妇产后出血的临床疗效观察

目的:分析卡前列素氨丁三醇治疗产妇产后出血的临床疗效.方法:回顾分析我院2019年5月至2020年4月期间收治产妇产后出血患者80例,按治疗方式分组,其中40例接受缩宫素治疗(对照

期刊

[关键字]产后出血产妇卡前列素氨丁三醇缩宫素

观察循证药学在临床用药干预中的应用效果

目的:分析循证药学在临床用药干预中的应用效果.方法:随机选取本院2019年2月-2020年1月期间收治的79例循证药学干预的患者进行此次研究,总结循证药学干预的过程,分析其应用效

期刊

循证药学临床用药应用效果

Stream Weight Training Based on MCE for Audio-Visual LVCSR

与本文相关的学术论文