- [AIS93]
-
R. Agrawal, T. Imielinski, and A. Swami. Mining
association rules between sets of items in large databases. In
Proceedings of the ACM SIGMOD International Conference on
Management of Data, pages 207-216, Washington D.C., May 1993.
[ bib |
find paper in Google Scholar |
find paper in Google ]
Introduces association rules and the
SUPPORT-CONFIDENCE framework and an algorithm to mine large
itemsets. The algorithm is sometimes called AIS after the authors
initials.
- [MTV94]
-
Heikki Mannila, Hannu Toivonen, and A. Inkeri Verkamo.
Efficient algorithms for discovering association rules. In
Usama M. Fayyad and Ramasamy Uthurusamy, editors, AAAI
Workshop on Knowledge Discovery in Databases (KDD-94), pages
181-192, Seattle, Washington, 1994. AAAI Press. [ bib |
find paper in Google Scholar |
find paper in Google ]
Develop similar improvements to the candidate
generation as APRIORI. Itemsets with support are called covering
sets. The paper also introduces sampling from the database and
gives bounds for the resulting estimate of support.
- [AS94]
-
Rakesh Agrawal and Ramakrishnan Srikant. Fast algorithms for mining
association rules in large databases. In Jorge B. Bocca,
Matthias Jarke, and Carlo Zaniolo, editors, Proceedings of the
20th International Conference on Very Large Data Bases, VLDB,
pages 487-499, Santiago, Chile, September 1994. [ bib |
find paper in Google Scholar |
find paper in Google ]
Introduction of the APRIORI algorithm (the
best-known algorithm; it uses a breadth-first search strategy to
counting the support of itemsets). The algorithm uses an improved
candidate generation function which exploits the downward closure
property of support and makes it more efficient than AIS. Also an
algorithm to generate synthetic transaction data is presented. Such
synthetic transaction data are widely used for the evaluation and
comparison of new algorithms.
- [SON95]
-
Ashok Savasere, Edward Omiecinski, and Shamkant Navathe. An
efficient algorithm for mining association rules in large
databases. In Proceedings of the 21st VLDB Conference,
pages 432-443, Zurich, Switzerland, 1995. [ bib |
find paper in Google Scholar |
find paper in Google ]
Introduction of the PARTITION algorithm. The
database is scanned only twice. For the first scan the DB is
partitioned and in each partition support is counted. Then the
counts are merged to generate potential large itemsets. In the
second scan the potential large itemsets are counted to find the
actual large itemsets.
- [Toi96]
-
Hannu Toivonen. Sampling large databases for association rules. In
VLDB '96: Proceedings of the 22th International Conference on
Very Large Data Bases, pages 134-145, San Francisco, CA, USA,
1996. Morgan Kaufmann Publishers Inc. [ bib |
find paper in Google Scholar |
find paper in Google ]
Find frequent itemsets in a random sample of a
database (that fits into main memory) and then verify the found
frequent itemsets in the database.
- [HGN00]
-
Jochen Hipp, Ulrich Güntzer, and Gholamreza Nakhaeizadeh.
Algorithms for association rule mining - A general survey and
comparison. SIGKDD Explorations, 2(2):1-58, 2000.
[ bib |
find paper in Google Scholar |
find paper in Google ]
Describes the fundamentals of association rule
mining and presents an systematization of existing
algorithms.
- [Zak00]
-
Mohammed J. Zaki. Scalable algorithms for association mining.
IEEE Transactions on Knowledge and Data Engineering,
12(3):372-390, May/June 2000. [ bib |
find paper in Google Scholar |
find paper in Google ]
Introduces six new algorithms combining several
features (database format, the decomposition technique, and the
search procedure). Includes Eclat (Equivalence CLAss
Transformation), MaxEclat, Clique, MaxClique, TopDown, and
AprClique. ECLAT is a well known depth-first search algorithm using
set intersection.
- [OLP+03]
-
Salvatore Orlando, Claudio Lucchese, Paolo Palmerini, Raffaele
Perego, and Fabrizio Silvestri. kdci: a multi-strategy algorithm
for mining frequent sets. In Bart Goethals and Mohammed J.
Zaki, editors, FIMI'03: Proceedings of the IEEE ICDM Workshop
on Frequent Itemset Mining Implementations, November 2003.
[ bib |
find paper in Google Scholar |
find paper in Google ]
Introduces the kDCI algorithm.
- [HPYM04]
-
Jiawei Han, Jian Pei, Yiwen Yin, and Runying Mao. Mining frequent
patterns without candidate generation. Data Mining and
Knowledge Discovery, 8:53-87, 2004. [ bib |
find paper in Google Scholar |
find paper in Google ]
Describes the data mining method FP-growth
(frequent pattern growth) which uses an extended prefix-tree
(FP-tree) structure to store the database in a compressed form.
FP-growth adopts a divide-and-conquer approach to decompose both
the mining tasks and the databases. It uses a pattern fragment
growth method to avoid the costly process of candidate generation
and testing.
- [CGL04]
-
Frans Coenen, Graham Goulbourne, and Paul Leng. Tree structures for
mining association rules. Data Mining and Knowledge
Discovery, 8:25-51, 2004. [ bib |
find paper in Google Scholar |
find paper in Google ]
Describes how to compute PARTIAL SUPPORT COUNTS in
one DB-pass and how to store them in an enumeration tree
(P-Tree).
- [HCXY07]
-
J. Han, H. Cheng, D. Xin, and X. Yan. Frequent
pattern mining: Current status and future directions. Data
Mining and Knowledge Discovery, 14(1), 2007. [ bib |
find paper in Google Scholar |
find paper in Google ]
Complete overview of the state-of-the art in
frequent patten mining and identifies future research
directions.
This file was generated by bibtex2html
1.95.