Approaches for Scaling DBSCAN Algorithm to Large Spatial Databases

Source: Journal of Computer Science and Technology (English edition)
The huge amount of information stored in databases owned by corporations (e.g., retail, financial, telecom) has spurred tremendous interest in knowledge discovery and data mining. Clustering is a useful data mining technique for discovering interesting data distributions and patterns in the underlying data, and it has many application fields, such as statistical data analysis, pattern recognition, image processing, and other business applications. Although researchers have been working on clustering for decades and have developed many clustering algorithms, there is still no efficient algorithm for clustering very large databases and high-dimensional data. As an outstanding representative of clustering algorithms, the DBSCAN algorithm shows good performance in spatial data clustering. However, for large spatial databases, DBSCAN requires a large amount of memory and could incur substantial I/O costs because it operates directly on the entire database. In this paper, several approaches are proposed to scale the DBSCAN algorithm to large spatial databases. To begin with, a fast DBSCAN algorithm is developed, which considerably speeds up the original DBSCAN algorithm. Then a sampling-based DBSCAN algorithm, a partitioning-based DBSCAN algorithm, and a parallel DBSCAN algorithm are introduced in turn. Following that, a synthetic algorithm based on the above-proposed algorithms is also given. Finally, some experimental results are given to demonstrate the effectiveness and efficiency of these algorithms.
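As background for the abstract above, the following is a minimal sketch of the original (unscaled) DBSCAN procedure, not of any of the scaled variants proposed in the paper, whose details the abstract does not give. The names `dbscan`, `region_query`, `eps`, and `min_pts` are illustrative choices, and the brute-force neighbor search is an assumption made to keep the example self-contained.

```python
from math import dist

NOISE = -1        # label for points that belong to no cluster
UNVISITED = None  # label for points not yet processed


def region_query(points, i, eps):
    """Return indices of all points within eps of points[i], including i.

    Brute-force scan over the whole data set (an assumption of this sketch).
    """
    return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or NOISE."""
    labels = [UNVISITED] * len(points)
    cluster_id = 0
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            # Not a core point; may later be relabeled as a border point.
            labels[i] = NOISE
            continue
        # points[i] is a core point: start a new cluster and expand it.
        labels[i] = cluster_id
        seeds = list(neighbors)
        k = 0
        while k < len(seeds):
            j = seeds[k]
            k += 1
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point reached from a core point
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster_id
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                # points[j] is also a core point, so its neighbors join the cluster.
                seeds.extend(j_neighbors)
        cluster_id += 1
    return labels
```

In this sketch every region query scans all points, so the whole data set must be accessible for each query. That is one way to see why running DBSCAN directly on an entire large database is costly in memory and I/O, which is the problem the abstract sets out to address.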