,Clustering feature decision trees for semi-supervised classification from high-speed data streams

来源 :浙江大学学报(英文版)(C辑：计算机与电子) | 被引量 : 0次 | 上传用户：zlq5626

【摘要】

：

Most stream data classification algorithms apply the supervised leaing strategy which requires massive labeled data.Such approaches are impractical since labele

【作者】

：

Wen-hua XU Zheng QIN Yang CHANG

【机构】

：

Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China“,”School o

【出处】

：

浙江大学学报(英文版)(C辑：计算机与电子)

【发表日期】

：

2011年8期

【关键词】

：

Clustering feature vector Decision tree Semi-supervised learning Stream data cla

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

Most stream data classification algorithms apply the supervised leaing strategy which requires massive labeled data.Such approaches are impractical since labeled data are usually hard to obtain in reality.In this paper,we build a clustering feature　decision tree model,CFDT,from data streams having both unlabeled and a small number of labeled examples.CFDT applies a　micro-clustering algorithm that scans the data only once to provide the statistical summaries of the data for incremental decision　tree induction.Micro-clusters also serve as classifiers in tree leaves to improve classification accuracy and reinforce the any-time　property.Our experiments on synthetic and real-world datasets show that CFDT is highly scalable for data streams while generating high classification accuracy with high speed.

其他文献

橡胶树遗传转化的研究

巴西橡胶树是重要的经济树种,其独特的乳管结构和生产期长的特性,有可能发展成为一个高效、低成本的“植物生物反应器”,潜藏着巨大的商业开发价值。建立稳定、高效的橡胶树

学位

巴西橡胶树遗传转化胚胎发育关键基因

,A virtual network mapping algorithm based on integer programming

The virtual network (VN) embedding/mapping problem is recognized as an essential question of network virtualiza-tion. The VN embedding problem is a major challe

期刊

Virtual network embeddingInteger programmingTopology-awarenessNetwork virtual

作业为起点的品德与社会评价

我们认为,课程评价对课程的实施起着重要的导向和质量监控作用,新一轮课程改革倡导“立足过程,促进发展”的评价理念。品德与社会教学的评价包括教师的课堂教学评价、社会实

期刊

作业品德与社会学业评价学生全面发展课堂教学改革课堂教学评价课程评价践活动社会实践评价理念课堂改革课堂表现课程改革教师监控作用活动评

浅谈如何让学生快乐学习

新课标强调教师要放弃传统的教学观念,把学习的主动权还给学生,为学生提供充分的活动空间,让学生成为学习的主人,获得学习的快乐。教师可从构建和谐的师生关系、创设良好的教

期刊

快乐学习主人翁赞美游戏

,Verification of workflow nets with transition conditions

Workflow management is conceed with automated support for business processes.Workflow management systems are driven by process models specifying the tasks that

期刊

Workflow netsTransition conditionVerificationProcess model

,Micro-angle tilt detection for the rotor of a novel rotational gyroscope with a 0.47″resolution

Differential capacitive detection has been widely used in the displacement measurement of the proof mass of vibratory gyroscopes, but it did not achieve high re

期刊

Micro-angle detectionDifferential capacitive structureRotational gyroscopeStr

小麦miRNAs表达特征及其部分成员和作用靶基因遗传转化

微小分子RNA （miRNAs）通过序列反向互补方式与其作用靶基因结合，在转录后和翻译水平上对其作用靶基因进行调控。本项研究以38个小麦miRNAs为基础，较系统地研究了供试小麦miRNAs（TaMIRs）在丰、缺氮和干旱条件下的表达特征，鉴定了对低氮和干旱逆境应答的TaMIRs及其可能作用的靶基因。采用DNA重组和基因遗传转化技术，建立了应答低氮TaMIR1129的正、反义表达转基因烟草植株，对

学位

小麦（TriticumaestivumL.）微小分子RNA靶基因低氮胁迫干旱表达特性遗传转化功能鉴定钙调素基因

,Caching resource sharing in radio access networks: a game theoretic approach

期刊

Video cachingOligopoly marketGame theoryNash equilibriumStability analysis

启发心智提高能力--浅谈提高复习课有效性的策略

随着新课程改革的推进,对课堂有效性改革的研究也在不断深入。要想提高复习课的有效性应该做到以下几点:回归课本,强化“双基”训练;构建网络,巩固知识;选典型题目进行强化训

期刊

中考高效复习启发能力提高

水稻BT型细胞质雄性不育和恢复基因的克隆及其互作机理研究

水稻细胞质雄性不育及其恢复系统对水稻杂种优势利用发挥了巨大的作用.植物细胞质雄性不育及恢复的机理研究是遗传育种学和分子生物学的重要研究内容.该论文对水稻BT型雄性不

学位

细胞质雄性不育不育基因恢复基因PPR结构蛋白水稻

,Clustering feature decision trees for semi-supervised classification from high-speed data streams

与本文相关的学术论文