
Ensemble learning

Lecture 13

David Sontag
New York University

Slides adapted from Navneet Goyal, Tan, Steinbach, Kumar, Vibhav Gogate
Ensemble methods
Machine learning competition with a $1 million prize (the Netflix Prize)
Bias/Variance Tradeoff

[Figure: Hastie, Tibshirani, Friedman, "The Elements of Statistical Learning", 2001]
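For reference, the standard bias-variance decomposition that this figure illustrates (a standard identity, added here for context rather than reproduced from the slides):

\[
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(f(x) - \mathbb{E}[\hat{f}(x)]\big)^2}_{\text{Bias}^2}
  + \underbrace{\mathbb{E}\big[\big(\hat{f}(x) - \mathbb{E}[\hat{f}(x)]\big)^2\big]}_{\text{Variance}}
  + \underbrace{\sigma^2}_{\text{irreducible noise}}
\]

where $y = f(x) + \varepsilon$ with $\mathbb{E}[\varepsilon] = 0$, $\mathrm{Var}(\varepsilon) = \sigma^2$, and the expectation is taken over training sets and noise.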



Reduce Variance Without Increasing Bias
•  Averaging reduces variance:
   Var(X̄) = Var(X) / N   (when predictions are independent)

Average models to reduce model variance


One problem: we only have one training set, so where do the multiple models come from?
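To make the variance-reduction formula concrete, here is a minimal Python sketch (an added illustration, not part of the original slides) that estimates the variance of a single noisy predictor versus the average of N independent ones; the synthetic setup and names are assumptions:

    import numpy as np

    rng = np.random.default_rng(0)
    true_value = 1.0      # the quantity every model is trying to predict
    noise_std = 0.5       # per-model prediction noise
    N = 10                # number of independent models being averaged
    trials = 100_000      # Monte Carlo repetitions

    # Each row holds N independent noisy predictions of the same true value.
    preds = true_value + noise_std * rng.standard_normal((trials, N))

    var_single = preds[:, 0].var()          # ~ noise_std**2     = 0.25
    var_average = preds.mean(axis=1).var()  # ~ noise_std**2 / N = 0.025
    print(var_single, var_average)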
Bagging: Bootstrap Aggregation
•  Leo Breiman (1994)
•  Take repeated bootstrap samples from the training set D
•  Bootstrap sampling: given a set D containing N training examples, create D' by drawing N examples at random with replacement from D

•  Bagging (see the sketch below):
–  Create k bootstrap samples D1 … Dk
–  Train a distinct classifier on each Di
–  Classify a new instance by majority vote / average
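A minimal Python sketch of this procedure (an added illustration, not the implementation from the slides), using scikit-learn decision trees as the base classifier; the synthetic dataset and the choice k = 25 are assumptions:

    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.tree import DecisionTreeClassifier

    rng = np.random.default_rng(0)
    X, y = make_classification(n_samples=200, n_features=10, random_state=0)
    N, k = len(X), 25

    # Create k bootstrap samples and train one classifier per sample.
    classifiers = []
    for _ in range(k):
        idx = rng.integers(0, N, size=N)   # N draws with replacement from D
        classifiers.append(DecisionTreeClassifier(random_state=0).fit(X[idx], y[idx]))

    def bagged_predict(X_new):
        # Majority vote over the k classifiers (use the average for regression).
        votes = np.stack([clf.predict(X_new) for clf in classifiers])
        return (votes.mean(axis=0) > 0.5).astype(int)   # binary 0/1 labels

    print(bagged_predict(X[:5]), y[:5])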
General Idea
[Figure: the original training data is resampled into multiple training sets, a classifier is built on each, and their predictions are combined into a single ensemble prediction]

Bagging
•  Sampling with replacement
   [Figure: table of bootstrap samples drawn from the training data, indexed by data ID]
•  Build a classifier on each bootstrap sample
•  Each data point has probability (1 − 1/n)^n of never being selected, i.e., of ending up only in the held-out (out-of-bag) test data
•  Each bootstrap training sample therefore contains about 1 − (1 − 1/n)^n of the distinct original data points
The 0.632 bootstrap
•  This method is also called the 0.632 bootstrap
–  A particular training instance has probability 1 − 1/n of not being picked in any single draw
–  Thus its probability of ending up in the test (out-of-bag) data, i.e., of never being selected in n draws, is:
   (1 − 1/n)^n ≈ e^(−1) ≈ 0.368
–  This means each bootstrap training sample will contain approximately 63.2% of the distinct instances
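A quick numerical check of this limit (an added illustration, not from the slides):

    import math

    for n in (10, 100, 1000, 10000):
        p_out = (1 - 1 / n) ** n                # probability of never being drawn
        print(n, round(p_out, 4), round(1 - p_out, 4))
    print("limit:", round(math.exp(-1), 4))     # e^(-1) ≈ 0.3679, so ~63.2% are included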
[Figures: bagging demo in which the base classifier is a decision tree learning algorithm very similar to ID3; shades of blue/red indicate the strength of the vote for a particular classification]
Example of Bagging

Assume that the training data consists of points on the real line x, labeled:
   x ≤ 0.3: +1,   0.4 to 0.7: −1,   x ≥ 0.8: +1

Goal: find a collection of 10 simple thresholding classifiers that collectively can classify correctly.
-  Each simple (or weak) classifier is of the form:
   x ≤ K ⇒ class = +1 or −1 (depending on which value yields the lowest error; K is determined by entropy minimization)
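A compact Python sketch of this toy example (an added illustration, not from the slides); for brevity the stump threshold is chosen by error minimization rather than entropy minimization, and the grid x = 0.1, …, 1.0 is an assumption about the figure:

    import numpy as np

    x = np.arange(1, 11) / 10                      # 0.1, 0.2, ..., 1.0
    y = np.where((x >= 0.4) & (x <= 0.7), -1, +1)  # labels from the slide

    def fit_stump(xs, ys):
        """Return (K, sign) minimizing training error of 'x <= K -> sign, else -sign'."""
        best = (2.0, None, None)
        for K in np.unique(xs):
            for sign in (+1, -1):
                err = np.mean(np.where(xs <= K, sign, -sign) != ys)
                if err < best[0]:
                    best = (err, K, sign)
        return best[1], best[2]

    rng = np.random.default_rng(0)
    stumps = []
    for _ in range(10):                            # 10 bagging rounds
        idx = rng.integers(0, len(x), len(x))      # bootstrap sample
        stumps.append(fit_stump(x[idx], y[idx]))

    votes = sum(np.where(x <= K, s, -s) for K, s in stumps)
    print(np.sign(votes))                          # ensemble vote (a tie would print 0)
    print(y)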
Random Forests
•  Ensemble method specifically designed for decision tree classifiers
•  Introduces two sources of randomness: "bagging" and "random input vectors"
–  Bagging method: each tree is grown using a bootstrap sample of the training data
–  Random vector method: at each node, the best split is chosen from a random sample of m attributes instead of all attributes
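A minimal usage sketch with scikit-learn (an added illustration; the dataset and hyperparameter values are assumptions, not from the slides), where bootstrap=True gives the bagging component and max_features plays the role of m:

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import train_test_split

    X, y = make_classification(n_samples=500, n_features=20, random_state=0)
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

    # Each tree is grown on a bootstrap sample; at each node only
    # m ≈ sqrt(M) randomly chosen attributes are considered for the split.
    rf = RandomForestClassifier(n_estimators=100, max_features="sqrt",
                                bootstrap=True, random_state=0)
    rf.fit(X_tr, y_tr)
    print("test accuracy:", rf.score(X_te, y_te))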
Random Forests
Methods for Growing the Trees
•  Fix m ≤ M (the number of attributes). At each node:
–  Method 1 (see the sketch below):
•  Choose m attributes randomly, compute their information gains, and choose the attribute with the largest gain to split
–  Method 2:
•  (When M is not very large): select L of the attributes randomly. Compute a linear combination of the L attributes using weights generated at random from [−1, +1]. That is, new attribute A = Σ Wi·Ai, i = 1..L
–  Method 3:
•  Compute the information gain of all M attributes. Select the top m attributes by information gain. Randomly select one of the m attributes as the splitting node
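A Python sketch of the split selection in Method 1 (an added illustration under simple assumptions the slides do not specify: numeric attributes and single-threshold binary splits):

    import numpy as np

    def entropy(labels):
        """Empirical entropy (base 2) of a label vector."""
        _, counts = np.unique(labels, return_counts=True)
        p = counts / counts.sum()
        return -np.sum(p * np.log2(p))

    def best_split_method1(X, y, m, rng):
        """Sample m of the M attributes and return the (attribute, threshold)
        pair with the largest information gain among the sampled attributes."""
        candidates = rng.choice(X.shape[1], size=m, replace=False)
        best_gain, best = -1.0, None
        for j in candidates:
            for t in np.unique(X[:, j])[:-1]:       # candidate thresholds
                left, right = y[X[:, j] <= t], y[X[:, j] > t]
                gain = entropy(y) - (len(left) * entropy(left)
                                     + len(right) * entropy(right)) / len(y)
                if gain > best_gain:
                    best_gain, best = gain, (j, t)
        return best

Growing a full tree would call best_split_method1 recursively at every node, drawing a fresh random sample of m attributes each time.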
Random Forest Algorithm
[Figure: pseudocode for the random forest algorithm, using method 1 from the previous slide]
Reduce Bias² and Decrease Variance?
•  Bagging reduces variance by averaging
•  Bagging has little effect on bias
•  Can we average and reduce bias?
•  Yes: Boosting
