Efficient Handling of Lock Hand-off in DSM Multiprocessors with Buffering Coherence Controllers

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:Seanecn
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Synchronization in parallel programs is a major performance bottleneck in multiprocessor systems.Shared data is protected by locks and a lot of time is spent on the competition arising at the lock hand-off.In order to be serialized,requests to the same cache line can either be bounced (NACKed) or buffered in the coherence controller.In this paper,we focus mainly on systems whose coherence controllers buffer requests.In a lock hand-off,a burst of requests to the same line arrive at the coherence controller.During lock hand-off only the requests from the winning processor contribute to progress of the computation,since the winning processor is the only one that will advance the work.This key observation leads us to propose a hardware mechanism we call request bypassing,which allows requests from the winning processor to bypass the requests buffered in the coherence controller keeping the lock line.We present an inexpensive implementation of request bypassing that reduces the time spent on all the execution phases of a critical section (acquiring the lock,accessing shared data,and releasing the lock) and which,as a consequence,speeds up the whole parallel computation.This mechanism requires neither compiler or programmer support nor ISA or coherence protocol changes.By simulating a 32-processor system,we show that using request bypassing does not degrade but rather improves performance in three applications with low synchronization rates,while in those having a large amount of synchronization activity (the remaining four),we see reductions in execution time and in lock stall time ranging from 14% to 39% and from 52% to 71%,respectively.We compare request bypassing with a previously proposed technique called read combining and with a system that bounces requests,observing a significantly lower execution time with the bypassing scheme.Finally,we analyze the sensitivity of our results to some key hardware and software parameters.
其他文献
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
近日,广东省期刊协会公布了第四届广东省期刊优秀作品评选结果,《实用医学杂志》在本届期刊优秀作品评选中收获颇丰,共有10件作品获奖,其中一等奖2件、二等奖3件、三等奖5件
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
当地时间2月6日,加拿大多伦多各界华人盛装聚会,迎接鸡年春节。 Local time on February 6, Toronto, Canada, Chinese from all walks of life party to greet the Chinese
期刊
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥