
Data Mining and Data Analysis

Syllabus
http://www.stanford.edu/class/stats202/

Stats 202 Susan Holmes Fall 2010

Week 1 9/20 - 9/24 What is Data Mining?
(Chap 1) EDA, Machine Learning.
Examples, buzzwords, algorithms
Implementations: Using R
Week 2 9/27 - 10/1 Attributes and Variables
(Chap 2, Appendix B) Preprocessing, dimension reduction.
Distances (examples in R).
Week 3 10/4 - 10/8 Exploratory Data Analysis and Visualization
(Chap 3) EDA, Multivariate visualizations.
Using R (lattice graphics)
Week 4 10/11 - 10/15 Classification and Prediction
(Chap 4) Decision trees, discrimination.
Criteria, Model selection.
Using R (tree)
Week 5 10/18 - 10/22 Instance Based Learning
(Chap 5) Nearest neighbor, kernel methods.
Bayesian classifiers.
Ensemble methods.
Week 6 10/25 - 10/29 Midterm Review and Perspective
10/27 Midterm Exam
Graph Data: Examples (sna)
Week 7 11/1 - 11/5 Association Analysis Contingency and Correlation
(Chap 6-7) Graphs and Networks
Ordination
Week 8 11/8 - 11/12 Clustering
(Chap 8-9) k-means, spectral clustering
hierarchical clustering
Using R (heatmap, hclust).
Week 9 11/15 - 11/19 Probabilistic/density-based Clustering
(Chap 9-10) Model-based clustering
Outliers and Anomaly detection
Week 10 11/29 - 12/3 Validation Techniques Resampling
MC simulation
Bootstrap Methods
Final Exam 12/10 8:30-11:30
