Negative effects of sufficiently small initial weights on back-propagation neural networks

Source: Journal of Zhejiang University-Science C (Computers & Electro
In the training of feedforward neural networks, it is usually suggested that the initial weights should be small in magnitude in order to prevent premature saturation. The aim of this paper is to point out the other side of the story: in some cases, the gradient of the error function is zero not only for infinitely large weights but also for zero weights. Slow convergence at the beginning of the training procedure is often the result of sufficiently small initial weights. Therefore, we suggest that, in these cases, the initial values of the weights should be neither too large nor too small. For instance, a typical range of choices for the initial weights might be something like (−0.4, −0.1) ∪ (0.1, 0.4), rather than (−0.1, 0.1) as suggested by the usual strategy. Our theory that medium-size weights should be used has also been extended to a few commonly used transfer functions and error functions. Numerical experiments are carried out to support our theoretical findings.
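The claim that the gradient vanishes at zero weights can be checked on a toy case. The following is a minimal sketch, not the paper's experimental setup: it assumes a bias-free network with one tanh hidden layer and a linear output, y = v . tanh(W x), trained on a single sample with squared error E = 0.5 (y - t)^2. The function name grad_norm, the layer sizes, the target t = 1 and the uniform sampling are all illustrative choices made here. Under these assumptions, each gradient component carries a factor of either tanh(W x) or v, so the gradient is exactly zero when all weights are zero and stays small when the weights are small.

```python
import math
import random


def grad_norm(scale, n_in=3, n_hidden=4, seed=0):
    """Euclidean norm of the gradient of E = 0.5 * (y - t)**2,
    where y = v . tanh(W x) and all weights are drawn from U(-scale, scale).

    Illustrative assumptions: no biases, one sample, target t = 1.
    """
    rng = random.Random(seed)
    x = [rng.uniform(-1.0, 1.0) for _ in range(n_in)]  # fixed input sample
    t = 1.0                                            # fixed target
    W = [[rng.uniform(-scale, scale) for _ in range(n_in)]
         for _ in range(n_hidden)]                     # input-to-hidden weights
    v = [rng.uniform(-scale, scale) for _ in range(n_hidden)]  # hidden-to-output

    # Forward pass.
    h = [math.tanh(sum(w * xj for w, xj in zip(row, x))) for row in W]
    y = sum(vi * hi for vi, hi in zip(v, h))
    err = y - t

    # Backward pass: dE/dv_i = err * h_i, dE/dW_ij = err * v_i * (1 - h_i^2) * x_j.
    grad_v = [err * hi for hi in h]
    grad_W = [[err * vi * (1.0 - hi * hi) * xj for xj in x]
              for vi, hi in zip(v, h)]

    total = sum(g * g for g in grad_v) + sum(g * g for row in grad_W for g in row)
    return math.sqrt(total)


if __name__ == "__main__":
    for s in (0.0, 0.001, 0.01, 0.1, 0.4):
        print(f"scale={s:<6} gradient norm={grad_norm(s):.6f}")
```

Running the script prints the gradient norm for several weight scales. In this toy setting the norm is zero at scale 0 and grows with the scale over the small-weight range, which is consistent with the slow early convergence the abstract attributes to sufficiently small initial weights.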