A fast SVM training algorithm based on the set segmentation and k-means clustering

来源 :Progress in Natural Science | 被引量 : 0次 | 上传用户:ted_yu
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
At present, studies on training algorithms for support vector machines (SVM) are important issues in the field of machine learning. It is a challenging task to improve the efficiency of the algorithm without reducing the generalization performance of SVM. To face this challenge, a new SVM training algorithm based on the set segmentation and k means clustering is presented in this paper. The new idea is to divide all the original training data into many subsets, followed by clustering each subset using k means clustering and finally train SVM using the new data set obtained from clustering centroids. Considering that the decomposition algorithm such as SVM light is one of the major methods for solving support vector machines, the SVM light is used in our experiments. Simulations on different types of problems show that the proposed method can solve efficiently not only large linear classification problems but also large nonlinear ones. At present, studies on training algorithms for support vector machines (SVM) are important issues in the field of machine learning. It is a challenging task to improve the efficiency of the algorithm without reducing the generalization performance of SVM. To face this challenge, a new SVM training algorithm based on the set segmentation and k means clustering is presented in this paper. The new idea is to divide all the original training data into many subsets, followed by clustering each subset using k means clustering and finally train SVM using the new data set obtained from clustering centroids. Considering that the decomposition algorithm such as SVM light is one of the major methods for solving support vector machines, the SVM light is used in our experiments. efficiently not only large linear classification problems but also large nonlinear ones.
其他文献
目的探讨食管癌术后早期肠内、外配合营养的临床价值。方法选取2005年1月~2013年1月收治的食管癌根治术患者400例为研究对象,分为早期肠内、外混合营养组(观察组)和早期全肠
采访日志时间:6月20日18:00—19:00天气:晴地点:云南昆明到达终点昆明时,回味40天的骑行历程让人感触良多。从大海之上到彩云之南,沿途朋友们的热情、不同地区的遭遇犹若影像
探讨细胞松弛素D对稳态层流诱导的actin与VASP分布改变的影响.试验采用人脐静脉内皮细胞(HUVECs)暴露于稳定层流.荧光标记VASP和actin,Western blot检测VASP表达及磷酸化水平
会议
本文将2000年~2004年住院患者中,符合MDR-TB的条件的96例进行了治疗.旨在评价含左氧氟沙星方案治疗耐多药肺结核及远期疗效分析.
本文分三部分介绍了我国结核病防治规划的进展.一、1991—2003年全国结核病控制工作进展;二、1991—2003年全国结核病控制三大目标进展情况;三、2004年采取的最新行动.
家电是我国加入WTO后最具有竞争力的行业,我国已经成为世界主要的家电生产基地的特征明显,但文章也指出中国离家电制造强国还有一定距离.
采用表面张力法研究了改性明胶与十二烷基苯磺酸钠(SDBS)的相互作用,考察了改性明胶浓度、温度以及盐对二者相互作用的影响.研究结果表明,在降低表面张力方面,改性明胶/SDBS
本文主要论述了我国电声行业面对加入WTO带来的商机及国际市场的挑战,应采取的措施与对策,使我国电声行业在市场竞争中取得更大的市场份额.
支撑跳跃是体操课中最需要勇气的项目之一 ,而跳箱分腿腾越又是支撑跳跃项目中的基础。通过学习跳箱分腿腾越基本知识和基本技能 ,能锻炼学生的身体 ,增强体质 ,并且能促进学
国民经济运行环境在第2季度发生明显改变,但并未改变我国经济增长的态势,只是造成经济增长速度的下调一、上半年投资形势的基本特征(一)固定资产投资以低于上年的增速 The o