HUITWU：An E?cient Algorithm for High-Utility Itemset Mining in Transaction Databases

来源 :计算机科学技术学报（英文版） | 被引量 : 0次 | 上传用户：tianshi6868

【摘要】

：

Mining high-utility itemsets (HUIs) from a transaction database refers to the discovery of itemsets with high utilities like profits. Most of existing studies d

【作者】

：

Shi-Ming Guo Hong Gao

【机构】

：

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

【出处】

：

计算机科学技术学报（英文版）

【发表日期】

：

2016年4期

【关键词】

：

data mining high-utility itemset pattern growth

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

Mining high-utility itemsets (HUIs) from a transaction database refers to the discovery of itemsets with high utilities like profits. Most of existing studies discover HUIs from a transaction database in two phases. In phase 1, different overestimation methods are applied to calculate the upper bounds of the utilities of itemsets. Since the overestimated utilities of itemsets are adopted, the itemsets whose overestimated utilities are no less than a user-specified threshold are selected as candidate HUIs, and they are verified by scanning the database one more time in phase 2. However, a large number of candidate HUIs incur two problems: 1) it requires excessive memory to store these candidates;2) it needs a large amount of running time to calculate their exact utilities. Vertical data format has been applied to mine HUIs recently. However this kind of method cannot deal with transactions with the same items effectively so that the size of database cannot be reduced su?ciently. The overall performance of algorithms is degraded consequently. Thus an algorithm HUITWU is proposed in this paper for mining HUIs. A novel data structure HUITWU-Tree is adopted to e?ciently calculate the utilities of itemsets in a database. Extensive studies with both sparse and dense datasets have demonstrated that our proposed algorithm is more than an order of magnitude faster and consumes less memory than the state-of-the-art algorithms.

其他文献

门冬胰岛素70/30与人胰岛素70/30治疗糖尿病的疗效比较

门冬胰岛素70/30是由30%门冬胰岛素和70%中速人胰岛素类似物(NPL)组成的新型胰岛素预混制剂。国外研究表明2次/d应用门冬胰岛素70/30比人胰岛素70/30更能显著降低餐后血糖水

期刊

门冬胰岛素人胰岛素治疗糖尿病胰岛素类似物血糖水平国外研究组成中速制剂应用

组合式安瓿折断器的研制及应用

该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥

期刊

原发性肝癌介入治疗的疗效观察

通过对 8 6例中、晚期肝癌病人进行了 35 4次肝动脉内化疗(ＨＡＩ)和栓塞 (ＨＡＥ)报告如下。1　材料与方法1.1　 86例中 ,男 72例 ,女 14例 ,年龄 2 5～ 71岁 ,平均 5 3岁。均经临床检查 ,

期刊

原发性肝癌介入治疗疗效观察Interventional TherapyHepatic Cell肝动脉内化疗丝裂霉素动脉造影肠系膜上动脉肝固有动

小剂量尿激酶联合藻酸双酯钠治疗进展性脑梗死的临床观察

对于到达医院已失去早期溶栓时间窗的急性进展性脑梗死,防止血栓进展形成是重要的治疗措施,有效的抗凝治疗是减轻致残,改善预后的有效方法。1资料和方法1.1一般资料:98例患者

期刊

小剂量酶联藻酸双酯钠治疗措施急性进展性脑梗死溶栓时间窗抗凝治疗改善预后致残医院血栓方法

腹腔镜下单孔前列腺癌根治术的手术配合体会

期刊

等密度硬膜下血肿的CT诊断（附33例分析）

等密度硬膜下血肿 (ｉｓｏｄｅｎｓｅｃｈｒｏｎｉｃｓｕｂｄｕｒａｌｈｅｍａｔｏｍａ ,ＩＳＤＨ)系硬膜下血肿中ＣＴ表现较为特珠的一种常见类型 ,外伤史易忽略或不详 ,加之ＣＴ医师经验不足 ,常误诊或漏诊。本文总结我院 4年来发现的ＩＳＤＨ 33例 ,分析结果如下

期刊

等密度硬膜下血肿诊断外伤史延迟扫描局灶症状复方泛影葡胺颅内压增高轴位扫描增强扫描神经系统强化扫描精神症状常见类型造影剂注射智力阴

南方路机全环保商混智慧工厂 ── 应用于广东汕头创业混凝土有限公司

2017年9月南方路机在广东汕头推出了一座生产商品混凝土的全面环保型智慧工厂.rn广东汕头创业混凝土有限公司(简称:汕头创业)在积极响应发展低碳经济的同时,从改善自己的工作

期刊

抖音上线第一款下游戏——《音跃球球》

抖音也可以玩小游戏了! 2019年2月18日,“抖音游戏”官方账号发布了一款名为《音跃球球》的小游戏.这是2018年头条小程序上线以来,抖音发布的首款小游戏,游戏的体量和玩法与

期刊

国产氯普鲁卡因用于人工流产的临床观察

为了在人工流产术时提高镇痛技术,避免孕妇的痛苦和恐惧,减少并发症的发生,我院对部分人工流产手术患者采用国产氯普鲁卡因宫颈注射,效果显著,现介绍如下。1资料与方法1.1临

期刊

氯普鲁卡因人工流产镇痛技术手术患者宫颈注射流产术并发症孕妇

肺腺癌空洞误诊结核的病例分析

肺癌如能早期诊断与治疗 ,其效果满意 ,但由于肺癌的细胞类型不同 ,其征象也表现的多种多样 ,因此容易产生误诊 ,延误了治疗时机。笔者收集了我院经临床与手术证实的肺腺癌88

HUITWU：An E?cient Algorithm for High-Utility Itemset Mining in Transaction Databases

与本文相关的学术论文