ConfDTree:A Statistical Method for Improving Decision Trees

来源 :Journal of Computer Science & Technology | 被引量 : 0次 | 上传用户:luyan135
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Decision trees have three main disadvantages: reduced performance when the training set is small; rigid decision criteria; and the fact that a single “uncharacteristic” attribute might “derail” the classification process. In this paper we present ConfDTree(Confidence-Based Decision Tree) — a post-processing method that enables decision trees to better classify outlier instances. This method, which can be applied to any decision tree algorithm, uses easy-to-implement statistical methods(confidence intervals and two-proportion tests) in order to identify hard-to-classify instances and to propose alternative routes. The experimental study indicates that the proposed post-processing method consistently and significantly improves the predictive performance of decision trees, particularly for small, imbalanced or multi-class datasets in which an average improvement of 5%~9% in the AUC performance is reported. Decision trees have three main disadvantages: reduced performance when the training set is small; rigid decision criteria; and the fact that a single “uncharacteristic” attribute might “” derail "the classification process. In this paper we present ConfDTree (Confidence -Based Decision Tree) - a post-processing method that enables decisions trees to better classify outlier instances. This method, which can be applied to any decision tree algorithm, uses easy-to-implement statistical methods (confidence intervals and two-proportion tests ) in order to identify hard-to-classify instances and to propose alternative routes. The experimental study that that proposed post-processing method consistently and significantly improves the predictive performance of decision trees, particularly for small, imbalanced or multi-class datasets in which an average improvement of 5% ~ 9% in the AUC performance is reported.
其他文献
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
舌骨呈U型,位于甲状软骨上方,主要有舌骨上下肌附着于其上.这些肌肉与舌骨一起在下颌运动、舌的运动、舌咽运动、气道的形成及维持头颅的姿势等方面有着重要的功能,而这些功
今天,我们怀着喜悦的心情,在这里隆重表彰全省交通行业争创所有公路基本无“三乱”省份有功单位、有功集体和有功个人.
2010年2月8日,二道区地税局连续第四年在长春市联社和长春农商行个人所得税申报服务站进行现场办公并召开新闻发布会.
监理工作的规范性是开展监理工作的基础必要条件.通过检查、指导、总结、改进等措施,加强监理工作的规范性,以提升监理服务的质量.
目的 :探讨甲状腺瘤及甲状腺癌的 CT表现特点和诊断价值。方法 :收集 1 986~ 1 999年经手术病理证实的 2 6例甲状腺瘤和 1 3例甲状腺癌进行 CT回顾性分析。结果 :经统计学处理
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
目的 :总结球形肺炎的 CT特征 ,提高 CT诊断水平。方法 :收集经临床证实的 2 0例球形肺炎的 CT资料进行回顾分析。结果 :主要 CT表现为贴近胸膜呈方形或三角形 :边缘不规则 ,
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
期刊
基底动脉延长扩张症较少见 ,其 CT表现文献报道亦较少 ,现将我院遇到的 2例报告如下。例 1 女性 ,64岁 ,因头晕、呕吐伴流涎 1 d就诊。既往高血压病史 1 0余年。体检除右侧