Overview
In recent years, rapid developments in data collection and storage technologies have produced data sets that are "big" in many senses of the word. Data mining is the automatic discovery of interesting patterns and relationships in such "big data". This undergraduate course introduces data mining and some of the statistical principles underlying its key methods. Topics include data preprocessing, regression, classification, clustering, dimensionality reduction, and association analysis.