論文

査読有り
2008年

LCM over ZBDDs: Fast generation of very large-scale frequent itemsets using a compact graph-based representation

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS
  • Shin-ichi Minato
  • ,
  • Takeaki Uno
  • ,
  • Hiroki Arimura

5012
開始ページ
234
終了ページ
+
記述言語
英語
掲載種別
研究論文(国際会議プロシーディングス)
DOI
10.1007/978-3-540-68125-0_22
出版者・発行元
SPRINGER-VERLAG BERLIN

Frequent itemset mining is one of the fundamental techniques for data mining and knowledge discovery. In the last decade, a number of efficient algorithms have been presented for frequent itemset mining, but most of them focused on only enumerating the itemsets that satisfy the given conditions, and how to store and index the mining result in order to ensure an efficient data analysis is a different matter.
In this paper, we propose a fast algorithm for generating very large-scale all/closed/maximal frequent itemsets using Zero-suppressed BDDs (ZBDDs), a compact graph-based data structure. Our method, "LCM over ZBDDs," is based on one of the most efficient state-of-the-art algorithms proposed thus far. Not only does it enumerate/list the itemsets, but it also generates a compact output data structure on the main memory. The result can be efficiently postprocessed by using algebraic ZBDD operations. The original LCM is known as an output linear time algorithm, but our new method requires a sub-linear time for the number of frequent patterns when the ZBDD-based data compression works well. Our method will greatly accelerate the data mining process and this will leads to a new style of on-memory processing for dealing with knowledge discovery problems.

リンク情報
DOI
https://doi.org/10.1007/978-3-540-68125-0_22
J-GLOBAL
https://jglobal.jst.go.jp/detail?JGLOBAL_ID=200902222664755810
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000256127100021&DestApp=WOS_CPL
ID情報
  • DOI : 10.1007/978-3-540-68125-0_22
  • ISSN : 0302-9743
  • J-Global ID : 200902222664755810
  • Web of Science ID : WOS:000256127100021

エクスポート
BibTeX RIS