Wide Area Analytics for Geographically Distributed Datacenters

来源 :Tsinghua Science and Technology | 被引量 : 0次 | 上传用户:cangzhe
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Big data analytics,the process of organizing and analyzing data to get useful information,is one of the primary uses of cloud services today.Traditionally,collections of data are stored and processed in a single datacenter.As the volume of data grows at a tremendous rate,it is less efficient for only one datacenter to handle such large volumes of data from a performance point of view.Large cloud service providers are deploying datacenters geographically around the world for better performance and availability.A widely used approach for analytics of geo-distributed data is the centralized approach,which aggregates all the raw data from local datacenters to a central datacenter.However,it has been observed that this approach consumes a significant amount of bandwidth,leading to worse performance.A number of mechanisms have been proposed to achieve optimal performance when data analytics are performed over geo-distributed datacenters.In this paper,we present a survey on the representative mechanisms proposed in the literature for wide area analytics.We discuss basic ideas,present proposed architectures and mechanisms,and discuss several examples to illustrate existing work.We point out the limitations of these mechanisms,give comparisons,and conclude with our thoughts on future research directions. Big data analytics, the process of organizing and analyzing data to get useful information, one of the primary uses of cloud services today. Traditionally, collections of data are stored and processed in a single datacenter. As the volume of data grows at a tremendous rate, it is less efficient for only one datacenter to handle such large volumes of data from a performance point of view. Large cloud service providers are deploying datacenters geographically around the world for better performance and availability. A widely used approach for analytics of geo- distributed data is the centralized approach, which aggregates all the raw data from local datacenters to a central datacenter.However, it has been observed that this approach consumes a significant amount of bandwidth, leading to worse performance. A number of mechanisms have been proposed to achieve optimal performance when data analytics are performed over geo-distributed datacenters. In this paper, we present a survey on the representative mechanisms proposed in the literature for wide area analytics.We discuss basic ideas, present proposed architectures and mechanisms, and discuss several examples to illustrate existing work .We point out the limitations of these mechanisms, give comparisons, and conclude with our thoughts on future research directions.
其他文献
本文通过对荣华二采区10
为探究吕家坨井田地质构造格局,根据钻孔勘探资料,采用分形理论和趋势面分析方法,研究了井田7
文章对体育教学中的听评、观议进行了详细的论述,为更好地进行体育教学提供了参考。 The article discusses the listening comprehension and opinion deliberation in phy
本文通过对荣华二采区10
现在,连健身也可以在网上完成了!如果你还不知道,那你可要小心被人说“OUT”(过时)了!可是,当健身与网络联系在一起时,究竟是怎么样的呢?今天,我们就来了解一下网络健身。室
Flash技术被广泛地应用到了社会各个领域,当一个个优秀的网络动画在网络上传播的时候,越来越多的教师开始把Flash技术用在了多媒体课件的制作上,现在Flash软件逐渐成为当前最
4月18—21日,第十九届全国结构风工程学术会议暨第五届全国风工程研究生论坛在厦门杏林湾大酒店举行.本次会议由中国土木工程学会桥梁及结构工程分会和中国空气动力学会风工
期刊
为探究吕家坨井田地质构造格局,根据钻孔勘探资料,采用分形理论和趋势面分析方法,研究了井田7
为探究吕家坨井田地质构造格局,根据钻孔勘探资料,采用分形理论和趋势面分析方法,研究了井田7
为探究吕家坨井田地质构造格局,根据钻孔勘探资料,采用分形理论和趋势面分析方法,研究了井田7