Vous êtes sur la page 1sur 5

Introduction to Data Mining

Pang-Ning Tan, Michigan State University,


Michael Steinbach, University of Minnesota
Vipin Kumar, University of Minnesota
Table of Contents
Sample Chapters
Resources for Instructors and Students
Instructor Solution Manual
Errata (March 25, 2006)
Addison-Wesley Companion Book Site

Contact info: dmbook@cs.umn.edu


Highlights:

Provides both theoretical and practical coverage of all data mining topics.
Includes extensive number of integrated examples and figures.
Offers instructor resources including solutions for exercises and complete set of lecture
slides.
Assumes only a modest statistics or mathematics background, and no database
knowledge is needed.
Topics covered include; predictive modeling, association analysis, clustering, anomaly
detection, visualization.

Sample Chapters:
These sample chapters are also available at the publisher's Web site.

Chapter 4. Classification: Basic Concepts, Decision Trees, and Model


Evaluation (444KB)
Chapter 6. Association Analysis: Basic Concepts and
Algorithms (612KB)
Chapter 8. Cluster Analysis: Basic Concepts and Algorithms (1.3MB)

All files are in Adobe's PDF format and require Acrobat Reader.

Resources for Instructors and Students:


Link to PowerPoint Slides
Link to Figures as PowerPoint Slides
Links to Data Mining Software and Data Sets
Suggestions for Term Papers and Projects
Tutorials
Errata

PowerPoint Slides:
1. Introduction (lecture slides: [PPT] [PDF])
2. Data (lecture slides: [PPT][PDF])
3. Exploring Data (lecture slides: [PPT][PDF])
4. Classication: Basic Concepts, Decision Trees, and Model
Evaluation (lecture slides: [ PPT][PDF])
5. Classication: Alternative Techniques (lecture slides: [PPT][PDF])
6. Association Analysis: Basic Concepts and Algorithms (lecture slides:
[PPT][PDF])
7. Association Analysis: Advanced Concepts (lecture slides: [PPT][PDF])
8. Cluster Analysis: Basic Concepts and Algorithms (lecture slides:
[PPT][PDF])
9. Cluster Analysis: Additional Issues and Algorithms (lecture slides:
[PPT][PDF])

10. Anomaly Detection (lecture slides: [PPT][PDF])

Book Figures in PowerPoint Slide Format:


1. Introduction (figure slides: [PPT])
2. Data (figure slides: [PPT])
3. Exploring Data (figure slides: [PPT])
4. Classication: Basic Concepts, Decision Trees, and Model
Evaluation (figure slides: [ PPT])
5. Classication: Alternative Techniques (figure slides: [PPT])
6. Association Analysis: Basic Concepts and Algorithms (figure slides:
[PPT])
7. Association Analysis: Advanced Concepts (figure slides: [PPT])
8. Cluster Analysis: Basic Concepts and Algorithms (figure slides: [PPT])
9. Cluster Analysis: Additional Issues and Algorithms (figure slides:
[PPT])
10. Anomaly Detection (figure slides: [PPT])

The views and opinions expressed in this page are strictly those of the page author.
The contents of this page have not been reviewed or approved by the University of Minnesota.

RESOURCES
A. SOFTWARE

General Purpose Data


Mining

WEKA (Source: Java)


MLC++ (Source: C++)
SIPINA
List from KDNuggets (Various)
List from Data Management Center
(Various)

Classification

C4.5 (Decision tree)


OC1 (Oblique decision tree)
Ripper (Rule-based)
CBA (association-rule based)
bayes (Naive Bayes)
Evidential distance-based (nearestneighbor)
PEBLS (nearest-neighbor)
mlp (Neural Network)
tiberius (Neural Network)
svmlight (Support Vector Machine)

FIMI Repository of Algorithms


Apriori, Eclat, and FP Growth

ARTool
ARMADA (Association rule mining in
Matlab)
Tree Mining, Closed Itemsets,
Sequential Pattern Mining
Tree Mining, Closed Itemsets,
Sequential Pattern Mining
PAFI

CLUTO

Open Source Clustering Software


Model-based Clustering
Online software for Clustering

Anomaly Detection

ORCA (distance based)

Regression

Regression routines

Data Preprocessing

Feature Selection
Isomap (Dimensionality Reduction - in

Association Analysis

Cluster Analysis

Matlab)

B. Data Sets

IDS data sets


Data Sets for Data Mining
Competition Data Set
UCI Machine learning repository
Quest data repository
KDNuggets

The views and opinions expressed in this page are strictly those of the page author.
The contents of this page have not been reviewed or approved by the University of Minnesota.