,Binary neural networks for speech recognition?#

来源 :信息与电子工程前沿(英文版) | 被引量 : 0次 | 上传用户:weilonglee
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Recently, deep neural networks (DNNs) significantly outperform Gaussian mixture models in acoustic modeling for speech recognition. However, the substantial increase in computational load during the inference stage makes deep models difficult to directly deploy on low-power embedded devices. To alleviate this issue, structure sparseness and low precision fixed-point quantization have been applied widely. In this work, binary neural networks for speech recognition are developed to reduce the computational cost during the inference stage. A fast implementation of binary matrix multiplication is introduced. On mode central processing unit (CPU) and graphics processing unit (GPU) architectures, a 5-7 times speedup compared with full precision floatingpoint matrix multiplication can be achieved in real applications. Several kinds of binary neural networks and related model optimization algorithms are developed for large vocabulary continuous speech recognition acoustic modeling. In addition, to improve the accuracy of binary models, knowledge distillation from the normal full precision floating-point model to the compressed binary model is explored. Experiments on the standard Switchboard speech recognition task show that the proposed binary neural networks can deliver 3-4 times speedup over the normal full precision deep models. With the knowledge distillation from the normal floating-point models, the binary DNNs or binary convolutional neural networks (CNNs) can restrict the word error rate (WER) degradation to within 15.0%, compared to the normal full precision floating-point DNNs or CNNs, respectively. Particularly for the binary CNN with binarization only on the convolutional layers, the WER degradation is very small and is almost negligible with the proposed approach.
其他文献
中国自上世纪80年代末期开展空间诱变育种以来,中国各地已先后选育出一大批优质、高产、抗病的作物新品种和新种质的材料,显示出空间诱变作为作物育种新技术、新途 径的重大
Educational innovation is a field that has been greatly enriched by using technology in its processes, resulting in a leaing model where information comes from
胡锦涛总书记在全国人才工作会议上的讲话(《人民日报》2003年12月21日)中提出了科学人才观.这一科学人才观反映了我们党对社会主义人才建设规律认识的升华,是对马克思主义人
EST-SSRs是基于ESTs数据库的一种新型分子标记。与传统基因组来源的SSRs不同,EST-SSRs具有基因功能,而且可在不同物种之间通用。因此,小麦EST-SSRs分子标记的开发与应用对于小麦功能基因组研究及小麦与其他作物的比较基因组研究有重要意义。本研究采用PCR扩增、6%聚丙烯酰胺凝胶电泳及硝酸银染色等技术,检测了:1)597对小麦EST-SSRs引物在水稻、玉米、大豆中的通用性;2)9
发展--企业培训的本质价值rn记者(以下简称为“记”):陈先生,以前您谈培训问题时,很少涉及企业的人力资源开发与管理方面的内容,自从您上次从“工作分析”角度来谈“企业培训
该文以棉花为试材,针对麦棉套作中棉花苗荫蔽,麦收后突然暴露在强光下的问题,研究了遮荫对棉生长发育的影响、遮荫棉花由弱光转到自然强光下的叶片生理功能和解剖结构的适应
机器人在不久的将来可能还无法对人类构成威胁,但不可否认的是,机器人正变得越来越“聪明”。球形机器人“ApriAlpha”可以利用先进的声音识别技术区别不同位置发出的声音。
胡耀邦同志在《关于党的新闻工作》一文中说:“新闻事业要能够当好党的喉舌,并不是一件容易的事”。凡是做过党的新闻工作的同志,都会深有同感。为什么当好党的喉舌不容易?
Coordinating multiple unmanned aerial vehicles (multi-UAVs) is a challenging technique in highly dynamic and sophisticated environments. Based on digital pherom
<正> 1986年4月27日是《中国青年报》创刊三十五周年纪念日。二百多位老青年报人从各条战线赶回“娘家”过“报节”,更增添了纪念活动的欢乐。三十五年来,先后有一千一百多人参加了《中国青年报》的工作,至今,报社已为各条战线输送了五百多人。因此,中国青年报社又有“人才摇篮”之美称。