This second course in data mining focuses on
understanding the research methods used in the field of data mining.
The course targets students who want to gain in-depth knowledge
about a particular data mining topic
(e.g., PhD students who plan to use a data mining component in their research).
Due Date |
Paper |
Additional Material (code, papers, etc.) |
2/6 |
Clustering Using a Similarity Measure Based on Shared Near Neighbors, IEEE Transactions on Computers, 22(11), 1025-1034, 1973. (download from CiteSeer) |
Jarvis-Patrick clustering in R, SNN clustering paper |
2/13 |
ABACUS: Mining Arbitrary Shaped Clusters from Large Datasets based on
Backbone Identification, SIAM International Conference on Data Mining, 2011 |
Partial ABACUS implementation,
clustering images,
DBSCAN paper,
clustering with dbscan in R,
more dbscan examples,
a unified view
|
2/20 |
Dissimilarity plots: A visual exploration tool for partitional clustering, Journal of Computational and Graphical Statistics, 2011 |
R package seriation,
Dissimilarity plot in R |
2/27 |
Clustering of Time Series Subsequences is Meaningless, IEEE International Conference on Data Mining, 2003 |
Other papers: An Alternate Measure for Comparing Time Series Subsequence Clusters,
Making Subsequence Time Series Clustering Meaningful.
Code:
Using R for time series analysis,
time series example code,
Cylinder-Bell-Funnel example,
subsequence clustering example with real data
|
3/6 |
Temporal structure learning for clustering massive data streams in real-time, SIAM International Conference on Data Mining, 2011 |
rEMM package,
code example from paper
|
3/27 |
Clustering data streams based on shared density between micro-clusters, IEEE Transactions on Knowledge and Data Engineering, 2016 |
Implementation in the R package stream
|
4/3 |
Integrating Classification and Association Rule Mining, KDD'98 |
LUCS-KDD Implementations,
R package arulesCBA,
arulesCBA example.
Classification Using Association Rules: Weaknesses and Enhancements.
|
4/10 |
Interpretable regularized class association rules algorithm for classification in a categorical data space, Information Sciences 2019 |
R package arulesCBA,
about rulefit,
some R code examples.
|
4/17 |
Approximate Ranking from Pairwise Comparisons, International Conference on Artificial Intelligence and Statistics 2018 |
R code for H-LUBC, R package relations
|