Recently, deep neural networks (DNNs) have come to significantly outperform Gaussian mixture models in acoustic modeling for speech recognition. However, the substantial increase in computational load during the inference stage makes deep models difficult to deploy directly on low-power embedded devices. To alleviate this issue, structural sparsity and low-precision fixed-point quantization have been widely applied. In this work, binary neural networks for speech recognition are developed to reduce the computational cost during the inference stage. A fast implementation of binary matrix multiplication is introduced. On modern central processing unit (CPU) and graphics processing unit (GPU) architectures, a 5-7 times speedup over full precision floating-point matrix multiplication can be achieved in real applications. Several kinds of binary neural networks and related model optimization algorithms are developed for large vocabulary continuous speech recognition acoustic modeling. In addition, to improve the accuracy of binary models, knowledge distillation from the normal full precision floating-point model to the compressed binary model is explored. Experiments on the standard Switchboard speech recognition task show that the proposed binary neural networks can deliver a 3-4 times speedup over the normal full precision deep models. With knowledge distillation from the normal floating-point models, the binary DNNs or binary convolutional neural networks (CNNs) can restrict the word error rate (WER) degradation to within 15.0% relative to the normal full precision floating-point DNNs or CNNs, respectively. In particular, for the binary CNN with binarization applied only to the convolutional layers, the WER degradation under the proposed approach is almost negligible.
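
The abstract does not spell out the binary kernel, but the standard construction behind fast binary matrix multiplication is to binarize both operands to {-1, +1}, pack the sign bits into machine words, and replace multiply-accumulate with XOR plus population count: for n-dimensional {-1, +1} vectors packed so that bit 1 encodes +1, the dot product equals n - 2*popcount(a XOR b). Below is a minimal numpy sketch of that idea; the function names are illustrative, and a production kernel would run the XOR and popcount on 64-bit words with hardware popcnt instructions, which is where speedups of the reported magnitude come from.

```python
import numpy as np

# byte-wise popcount lookup table (a real kernel would use hardware popcnt)
POPCOUNT8 = np.array([bin(i).count("1") for i in range(256)], dtype=np.int32)

def pack_signs(x):
    """Binarize a float matrix to {-1, +1} by sign and pack the sign bits
    (1 encodes +1, 0 encodes -1) into bytes, 8 values per byte."""
    return np.packbits((x >= 0).astype(np.uint8), axis=1)

def binary_matmul(xa, xb):
    """Compute A @ B.T for sign-binarized A (m x n) and B (k x n).
    With d = popcount(a XOR b), the {-1, +1} dot product is n - 2*d."""
    n = xa.shape[1]
    a, b = pack_signs(xa), pack_signs(xb)
    out = np.empty((a.shape[0], b.shape[0]), dtype=np.int32)
    for i in range(a.shape[0]):
        d = POPCOUNT8[a[i] ^ b].sum(axis=1)  # Hamming distance to each row of B
        out[i] = n - 2 * d
    return out

# sanity check against the full precision product of the binarized inputs
A, B = np.random.randn(4, 100), np.random.randn(5, 100)
ref = np.where(A >= 0, 1, -1) @ np.where(B >= 0, 1, -1).T
assert np.array_equal(binary_matmul(A, B), ref)
```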
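
Likewise, the distillation setup is only named in the abstract. A minimal sketch of the standard teacher-student objective is given below: soft cross-entropy against the full precision teacher's temperature-softened posteriors, interpolated with the usual hard-label loss, assuming frame-level senone targets. The temperature T, interpolation weight alpha, and all names here are illustrative assumptions, not values from the paper.

```python
import numpy as np

def softmax(z, T=1.0):
    """Numerically stable softmax with temperature T."""
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Interpolate hard-label cross-entropy with the cross-entropy against
    the teacher's softened posteriors; T and alpha are illustrative."""
    p_teacher = softmax(teacher_logits, T)
    log_p_student_T = np.log(softmax(student_logits, T) + 1e-12)
    # T*T rescaling keeps soft-target gradients comparable across temperatures
    soft_loss = -(p_teacher * log_p_student_T).sum(axis=-1).mean() * T * T
    log_p_student = np.log(softmax(student_logits) + 1e-12)
    hard_loss = -log_p_student[np.arange(len(labels)), labels].mean()
    return alpha * soft_loss + (1 - alpha) * hard_loss

# e.g. per-frame senone logits for a minibatch; sizes are illustrative
teacher = np.random.randn(8, 9000)
student = np.random.randn(8, 9000)
labels = np.random.randint(0, 9000, size=8)
print(distillation_loss(student, teacher, labels))
```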