Bienvenue sur Scribd !

Decision Tree Classification

Transféré par

0% ont trouvé ce document utile (0 vote)

110 vues6 pages

Data Mining CIS 467 Yarmouk University Department of Computer Information Systems. Basic algorithm for Decision Tree Induction - Tree is constructed in a top-down recursive divide-and-conquer manner. Attributes are categorical (if continuous-valued, they are discretized in advance) - examples are partitioned recursively based on selected attributes. Test attributes are selected on the basis of a heuristic or statistical measure.

Description originale:

Copyright

Formats disponibles

PDF, TXT ou lisez en ligne sur Scribd

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Signaler ce document

Droits d'auteur :

Attribution Non-Commercial (BY-NC)

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

0% ont trouvé ce document utile (0 vote)

110 vues6 pages

Decision Tree Classification

Transféré par

nobeen666

Droits d'auteur :

Attribution Non-Commercial (BY-NC)

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

Passer à la page

Vous êtes sur la page 1sur 6

Rechercher à l'intérieur du document

Data Mining

CIS 467

Dr. Qasem Al-Radaideh

qradaideh@yahoo.com

Yarmouk University
Department of Computer Information Systems

Data Classification using

Decision Tree

١
Algorithm for Decision Tree Induction

z Basic algorithm (a greedy algorithm)

– Tree is constructed in a top-down recursive divide-and-conquer manner
– At start, all the training examples are at the root
– Attributes are categorical (if continuous-valued, they are discretized in
advance)
– Examples are partitioned recursively based on selected attributes
– Test attributes are selected on the basis of a heuristic or statistical
measure (e.g., information gain)
z Conditions for stopping partitioning
– All samples for a given node belong to the same class
– There are no remaining attributes for further partitioning – majority voting
is employed for classifying the leaf
– There are no samples left

Attribute Selection Measure

z Information gain (ID3/C4.5)

– All attributes are assumed to be categorical
– Can be modified for continuous-valued attributes
z Gini index (IBM IntelligentMiner)
– All attributes are assumed continuous-valued
– Assume there exist several possible split values for each attribute
– May need other tools, such as clustering, to get the possible split
values
– Can be modified for categorical attributes

٢
Information Gain (ID3/C4.5)

z Select the attribute with the highest information gain

z Assume there are two classes, P and N
– Let the set of examples S contain p elements of class P and n
elements of class N
– The amount of information, needed to decide if an arbitrary example
in S belongs to P or N is defined as

p p n n
I ( p, n ) = − log 2 − log 2
p+n p+n p+n p+n

Information Gain in Decision Tree Induction

z Assume that using attribute A a set S will be partitioned

into sets {S1, S2 , …, Sv}
– If Si contains pi examples of P and ni examples of N, the
entropy, or the expected information needed to classify objects
in all subtrees Si is

pi + ni
ν
E ( A) = ∑ I ( pi , ni )
i =1 p + n
z The encoding information that would be gained by
branching on A
Gain( A) = I ( p, n) − E ( A)

٣
Example 1 : Training Dataset

This
follows an
example
from
Quinlan’s
ID3

Youth : <=30, Middle_aged : 30..40, Sinior: >40

Attribute Selection by Information Gain Computation

g Class P: buys_computer = “yes”

g Class N: buys_computer = “no” 5 4
E ( age ) = I ( 2 ,3 ) + I ( 4 ,0 )
14 14
g I(p, n) = I(9, 5) =0.940
5
+ I ( 3 , 2 ) = 0 . 69
g Compute the entropy for age: 14

Hence
Gain(age) = I ( p, n) − E (age)
Similarly
age pi ni I(pi, ni) Gain(income) = 0.029
Youth 2 3 0.971 Gain( student ) = 0.151
Middle_Aged 4 0 0 Gain(credit _ rating ) = 0.048
Sinior 3 2 0.971

٤
Output: A Decision Tree for “buys_computer”

age?

Youth
overcast
Middle-aged Sinior

student? yes credit rating?

no yes excellent fair

no yes no yes

٥
Extracting Classification Rules from Trees

z Represent the knowledge in the form of IF-THEN rules

z One rule is created for each path from the root to a leaf
z Each attribute-value pair along a path forms a conjunction
z The leaf node holds the class prediction
z Rules are easier for humans to understand
z Example
IF age = Youth AND student = “no” THEN buys_computer = “no”
IF age = Youth AND student = “yes” THEN buys_computer = “yes”
IF age = Middle-aged THEN buys_computer = “yes”
IF age = Senior AND credit_rating = “excellent” THEN buys_computer = “yes”
IF age = Senior AND credit_rating = “fair” THEN buys_computer = “no”

Vous aimerez peut-être aussi

The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
D'Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Évaluation : 4 sur 5 étoiles
4/5 (5795)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
D'Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Évaluation : 4 sur 5 étoiles
4/5 (1090)
Never Split the Difference: Negotiating As If Your Life Depended On It
D'Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Évaluation : 4.5 sur 5 étoiles
4.5/5 (838)
Principles: Life and Work
D'Everand
Principles: Life and Work
Ray Dalio
Évaluation : 4 sur 5 étoiles
4/5 (599)
The Glass Castle: A Memoir
D'Everand
The Glass Castle: A Memoir
Jeannette Walls
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1713)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
D'Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Évaluation : 4 sur 5 étoiles
4/5 (895)
Sing, Unburied, Sing: A Novel
D'Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Évaluation : 4 sur 5 étoiles
4/5 (1103)
Grit: The Power of Passion and Perseverance
D'Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Évaluation : 4 sur 5 étoiles
4/5 (588)
Shoe Dog: A Memoir by the Creator of Nike
D'Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Évaluation : 4.5 sur 5 étoiles
4.5/5 (537)
The Perks of Being a Wallflower
D'Everand
The Perks of Being a Wallflower
Stephen Chbosky
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2104)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
D'Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Évaluation : 4.5 sur 5 étoiles
4.5/5 (345)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
D'Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Évaluation : 4.5 sur 5 étoiles
4.5/5 (474)
Bad Feminist: Essays
D'Everand
Bad Feminist: Essays
Roxane Gay
Évaluation : 4 sur 5 étoiles
4/5 (1016)
Her Body and Other Parties: Stories
D'Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Évaluation : 4 sur 5 étoiles
4/5 (821)
The Outsider: A Novel
D'Everand
The Outsider: A Novel
Stephen King
Évaluation : 4 sur 5 étoiles
4/5 (1839)
The Emperor of All Maladies: A Biography of Cancer
D'Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Évaluation : 4.5 sur 5 étoiles
4.5/5 (271)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
D'Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Évaluation : 4.5 sur 5 étoiles
4.5/5 (121)
Angela's Ashes: A Memoir
D'Everand
Angela's Ashes: A Memoir
Frank McCourt
Évaluation : 4.5 sur 5 étoiles
4.5/5 (440)
Brooklyn: A Novel
D'Everand
Brooklyn: A Novel
Colm Tóibín
Évaluation : 3.5 sur 5 étoiles
3.5/5 (1937)
The Little Book of Hygge: Danish Secrets to Happy Living
D'Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Évaluation : 3.5 sur 5 étoiles
3.5/5 (400)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
D'Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2259)
A Man Called Ove: A Novel
D'Everand
A Man Called Ove: A Novel
Fredrik Backman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (4610)
The Art of Racing in the Rain: A Novel
D'Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Évaluation : 4 sur 5 étoiles
4/5 (4200)
A Tree Grows in Brooklyn
D'Everand
A Tree Grows in Brooklyn
Betty Smith
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1929)
The Yellow House: A Memoir (2019 National Book Award Winner)
D'Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Évaluation : 4 sur 5 étoiles
4/5 (98)
Steve Jobs
D'Everand
Steve Jobs
Walter Isaacson
Évaluation : 4.5 sur 5 étoiles
4.5/5 (806)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
D'Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Évaluation : 4.5 sur 5 étoiles
4.5/5 (266)
Yes Please
D'Everand
Yes Please
Amy Poehler
Évaluation : 4 sur 5 étoiles
4/5 (1891)
The Woman in Cabin 10
D'Everand
The Woman in Cabin 10
Ruth Ware
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2322)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
D'Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Évaluation : 3.5 sur 5 étoiles
3.5/5 (231)
Team of Rivals: The Political Genius of Abraham Lincoln
D'Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Évaluation : 4.5 sur 5 étoiles
4.5/5 (234)
Fear: Trump in the White House
D'Everand
Fear: Trump in the White House
Bob Woodward
Évaluation : 3.5 sur 5 étoiles
3.5/5 (738)
Wolf Hall: A Novel
D'Everand
Wolf Hall: A Novel
Hilary Mantel
Évaluation : 4 sur 5 étoiles
4/5 (3811)
John Adams
D'Everand
John Adams
David McCullough
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2409)
On Fire: The (Burning) Case for a Green New Deal
D'Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Évaluation : 4 sur 5 étoiles
4/5 (74)
The Light Between Oceans: A Novel
D'Everand
The Light Between Oceans: A Novel
M.L. Stedman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (789)
The Unwinding: An Inner History of the New America
D'Everand
The Unwinding: An Inner History of the New America
George Packer
Évaluation : 4 sur 5 étoiles
4/5 (45)
Manhattan Beach: A Novel
D'Everand
Manhattan Beach: A Novel
Jennifer Egan
Évaluation : 3.5 sur 5 étoiles
3.5/5 (792)
The Constant Gardener: A Novel
D'Everand
The Constant Gardener: A Novel
John le Carré
Évaluation : 3.5 sur 5 étoiles
3.5/5 (104)
Button Data Setup
Document46 pages
Button Data Setup
Miguel Sánchez G
Pas encore d'évaluation
Rise of ISIS: A Threat We Can't Ignore
D'Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Évaluation : 3.5 sur 5 étoiles
3.5/5 (137)
Little Women
D'Everand
Little Women
Louisa May Alcott
Évaluation : 4 sur 5 étoiles
4/5 (104)
Peoplesoft Certification Questions & Answers
Document9 pages
Peoplesoft Certification Questions & Answers
Ganesh.am
92% (12)
InDesign CC 2015 Scripting Read Me
Document10 pages
InDesign CC 2015 Scripting Read Me
Buster Strandberg
Pas encore d'évaluation
Bit Sequential (BSQ) Data Model and Peano Count Trees (P-Trees)
Document20 pages
Bit Sequential (BSQ) Data Model and Peano Count Trees (P-Trees)
nobeen666
Pas encore d'évaluation
Exercise 4
Document3 pages
Exercise 4
nobeen666
Pas encore d'évaluation
044 Kernelizing The Outp
Document8 pages
044 Kernelizing The Outp
nobeen666
Pas encore d'évaluation
Chapter 5. Paper 1: Fast Rule-Based Classification Using P-Trees 5.1. Abstract
Document22 pages
Chapter 5. Paper 1: Fast Rule-Based Classification Using P-Trees 5.1. Abstract
nobeen666
Pas encore d'évaluation
K-Nearest Neighbor: Classification On Spatial Data Streams Using P-Trees
Document12 pages
K-Nearest Neighbor: Classification On Spatial Data Streams Using P-Trees
nobeen666
Pas encore d'évaluation
Periodicities in Trees Associated With The Classification of P - Groups by Width, Rank and Obliquity
Document15 pages
Periodicities in Trees Associated With The Classification of P - Groups by Width, Rank and Obliquity
nobeen666
Pas encore d'évaluation
Face Iccit 04
Document5 pages
Face Iccit 04
nobeen666
Pas encore d'évaluation
O StA 08
Document6 pages
O StA 08
nobeen666
Pas encore d'évaluation
MDM/KDD2002: Multimedia Data Mining Between Promises and Problems
Document4 pages
MDM/KDD2002: Multimedia Data Mining Between Promises and Problems
nobeen666
Pas encore d'évaluation
Iccit 2005 Ppmtree Paper
Document5 pages
Iccit 2005 Ppmtree Paper
nobeen666
Pas encore d'évaluation
Decision Tree
Document7 pages
Decision Tree
Alvaro Navarro Azaola
Pas encore d'évaluation
Hidden Tree Markov Models For Document Image Classification: Michelangelo Diligenti Paolo Frasconi Marco Gori
Document15 pages
Hidden Tree Markov Models For Document Image Classification: Michelangelo Diligenti Paolo Frasconi Marco Gori
nobeen666
Pas encore d'évaluation
Lazy Classifiers Using P-Trees
Document4 pages
Lazy Classifiers Using P-Trees
nobeen666
Pas encore d'évaluation
Decision Tree
Document7 pages
Decision Tree
Alvaro Navarro Azaola
Pas encore d'évaluation
Image Classification For Content-Based Indexing
Document14 pages
Image Classification For Content-Based Indexing
nobeen666
Pas encore d'évaluation
Ptree
Document2 pages
Ptree
nobeen666
Pas encore d'évaluation
Decision Tree Classification of Spatial Data Streams Using Peano Count Trees
Document5 pages
Decision Tree Classification of Spatial Data Streams Using Peano Count Trees
nobeen666
Pas encore d'évaluation
Efficient Hierarchical Clustering of Large Data Sets Using P-Trees
Document4 pages
Efficient Hierarchical Clustering of Large Data Sets Using P-Trees
nobeen666
Pas encore d'évaluation
Reverse Password Synchronization With IBM Tivoli Identity Manager Redp4299
Document22 pages
Reverse Password Synchronization With IBM Tivoli Identity Manager Redp4299
bupbechanh
Pas encore d'évaluation
Teaching Plan 1718 PDF
Document6 pages
Teaching Plan 1718 PDF
Vithdyaa Rajan
Pas encore d'évaluation
A Greedy Algorithm
Document8 pages
A Greedy Algorithm
billpetrrie
Pas encore d'évaluation
C Programming Khmer Language
Document42 pages
C Programming Khmer Language
Socheat Tieng
100% (7)
Thrashing & Working Set
Document19 pages
Thrashing & Working Set
rahupi
Pas encore d'évaluation
QoMoD: Effective Query Optimization in Mobile Database Systems
Document9 pages
QoMoD: Effective Query Optimization in Mobile Database Systems
JournalofICT
Pas encore d'évaluation
WebSphereMessageAdminAndTopologies ShareAnaheim
Document44 pages
WebSphereMessageAdminAndTopologies ShareAnaheim
Naseeruddin Shaik
Pas encore d'évaluation
Google File System Basics: Google World Wide Web Computers
Document5 pages
Google File System Basics: Google World Wide Web Computers
Abhay Chaturvedi
Pas encore d'évaluation
About Intel Virtualization Supported CPUs
Document11 pages
About Intel Virtualization Supported CPUs
asimalamp
Pas encore d'évaluation
CH 4
Document38 pages
CH 4
Siimple Opinion Final
Pas encore d'évaluation
Data Base
Document3 pages
Data Base
krishnamohan pokala
Pas encore d'évaluation
Vehicle Traffic Management
Document5 pages
Vehicle Traffic Management
Ajit Karhale
Pas encore d'évaluation
Simotion New Features V531 EN
Document13 pages
Simotion New Features V531 EN
Gener Tzab
Pas encore d'évaluation
(IN) SECURE Magazine Issue 14
Document111 pages
(IN) SECURE Magazine Issue 14
insecuremag
100% (3)
Jolly Roger Internet Security Guide
Document71 pages
Jolly Roger Internet Security Guide
The_Phreak
Pas encore d'évaluation
Very Short Intro To Assembly Language Programming
Document26 pages
Very Short Intro To Assembly Language Programming
ashis_biswas
Pas encore d'évaluation
RACF7 Ichza8c0
Document228 pages
RACF7 Ichza8c0
Siranjeevi Mohanaraja
Pas encore d'évaluation
TCL TK Tutorial
Document30 pages
TCL TK Tutorial
Ratna Raju Ayalapogu
Pas encore d'évaluation
Authshield-Web Application Security Solutions
Document11 pages
Authshield-Web Application Security Solutions
AuthShield Lab
Pas encore d'évaluation
2019 R2 Structures ANSYS Mechanical Highlights-Páginas-1-17
Document17 pages
2019 R2 Structures ANSYS Mechanical Highlights-Páginas-1-17
Israel Castillo Altamirano
Pas encore d'évaluation
Liferay Custom SQL Example - Pro Liferay
Document8 pages
Liferay Custom SQL Example - Pro Liferay
SiAgam Devinfo
Pas encore d'évaluation
Traffic Light Controller
Document16 pages
Traffic Light Controller
Joshua Jebaraj Joseph
Pas encore d'évaluation
Seleniumbook Prathap
Document42 pages
Seleniumbook Prathap
vundavilliravindra
Pas encore d'évaluation
NETMF+for+STM32+-+Technical+Notes+Release+4 2
Document5 pages
NETMF+for+STM32+-+Technical+Notes+Release+4 2
vigneshwaranj
Pas encore d'évaluation
UTL File Package
Document6 pages
UTL File Package
chowdary
Pas encore d'évaluation
Intro To Operating System
Document38 pages
Intro To Operating System
marc johansen
Pas encore d'évaluation
OSINT
Document49 pages
OSINT
MARCUS VINICIUS
Pas encore d'évaluation