您当前所在位置: 首页 > 学者

周傲英

  • 121浏览

  • 0点赞

  • 0收藏

  • 0分享

  • 59下载

  • 0评论

  • 引用

期刊论文

False Positive or False Negative: Mining Frequent Itemsets from High Speed Transactional Data Streams

周傲英Jeffrey Xu Yui Zhihong Chong Hongjun Lu Aoying Zhou

,-0001,():

URL:

摘要/描述

The problem of finding frequent items has been recently studied over high speed datastreams. However, mining frequent iteinsetsfroIn transactional data streams has not beenwell addressed yet in terms of its bounds ofmemory consumption. The main difficulty isdue to the nature of the exponential explo-sion of itemsets. Given a domain of uniqueitems, the possible number of itemsets can beup to 2i-i. When the length of data stremsapproaches to a very large number N, thepossibility of an itemset to be frequent be-comes larger and difficult to track with lim-ited memory. However. the real killer of ef-fective frequent itemset mining is that mostof existing algorithms are false-positive ori-nted. That is, they control memory con-sumption in the counting processes by an er-ror arameter e, aud allow items with sup-port below the specified minimum support s but above s-e counted as frequent ones. Such false-positive items increase the num- ber of false-positive frequent itemsets expo-nentially, which rn, make the problem com- putationally intractable with bounded mem-ory consumption. In this paper, we developed algorithms that can effectively mine fl'equent item(set)s from high speed transactional datastreams with a bound of memory consump-tion. While our algorithms are false-negative oriented, that is, certain frequent itemsets may not appear in the zesults, the number of false-negative itemsets can be controlled by a predefined parameter so that desired recall rate of frequent itemsets can be guaranteed. We developed algorithms based on Chernoff bound. Our extensive experimental studies

关键词:

【免责声明】以下全部内容由[周傲英]上传于[2011年01月14日 17时49分35秒],版权归原创者所有。本文仅代表作者本人观点,与本网站无关。本网站对文中陈述、观点判断保持中立,不对所包含内容的准确性、可靠性或完整性提供任何明示或暗示的保证。请读者仅作参考,并请自行承担全部责任。

我要评论

全部评论 0

本学者其他成果

    同领域成果