首页 | 本学科首页   官方微博 | 高级检索  
     

数据流频繁闭集的在线挖掘
引用本文:刘春,郑征,蔡开元,张师超. 数据流频繁闭集的在线挖掘[J]. 北京航空航天大学学报, 2008, 34(8): 969-972
作者姓名:刘春  郑征  蔡开元  张师超
作者单位:北京航空航天大学 自动化科学与电气工程学院, 北京 100191
基金项目:国家自然科学基金,中国博士后科学基金
摘    要:
基于算法LossyCounting,提出了数据流频繁闭集的在线挖掘算法LC_Closed(LossyCounting_Closed).设计了基于前缀树的频繁闭集压缩存储结构CI-forest(ClosedItemsets-forest),利用该数据结构可以快速的插入和查询闭集模式,且在处理新的事务数据时能够快速定位相关的历史闭集模式.该算法采用在线的处理方式,提高了算法的实时性.实验的结果证明该算法是有效的. 

关 键 词:数据挖掘   数据流   频繁闭集   在线
收稿时间:2007-07-20

Online mining frequent closed itemsets over data stream
Liu Chun,Zheng Zheng,Cai Kaiyuan,Zhang Shichao. Online mining frequent closed itemsets over data stream[J]. Journal of Beijing University of Aeronautics and Astronautics, 2008, 34(8): 969-972
Authors:Liu Chun  Zheng Zheng  Cai Kaiyuan  Zhang Shichao
Affiliation:School of Automation Science and Electrical Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China
Abstract:
Based on the algorithm LossCounting,a novel approach called LossyCounting_Closed(LC_Closed) for mining closed frequent itemsets over data stream was proposed.A new summary data structure called Closed-Itemsets-forest(CI-forest) was developed for maintaining only closed frequent itemsets.The insertion and query of closed itemsets can be rapidly made based on the data structure CI-forest,and the location of the associated historical closed itemsets in the stage of dealing with the new transaction is also faci...
Keywords:data mining  data stream  frequent closed itemsets  online  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《北京航空航天大学学报》浏览原始摘要信息
点击此处可从《北京航空航天大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号