LDA KNN Logistic

Transféré par

Govind Yadav

0% ont trouvé ce document utile (0 vote)

40 vues4 pages

LDA KNN Logistic Analysis

Copyright

Formats disponibles

DOCX, PDF, TXT ou lisez en ligne sur Scribd

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Signaler ce document

LDA KNN Logistic Analysis

Droits d'auteur :

Formats disponibles

Téléchargez comme DOCX, PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

0% ont trouvé ce document utile (0 vote)

40 vues4 pages

LDA KNN Logistic

Transféré par

Govind Yadav

LDA KNN Logistic Analysis

Droits d'auteur :

Formats disponibles

Téléchargez comme DOCX, PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

Passer à la page

Vous êtes sur la page 1sur 4

Rechercher à l'intérieur du document

ASSIGNMENT-2

LDA, LR, KNN

LDA,KNN and LR
STEP-1:

Collinearity Analysis:
First we performed collinearity analysis to find out, if any, high collinearity exist among dependent
variables. The cut off value for high collinearity is considered to be 0.7
Using the collinearity analysis we dropped following 6 variables from our dataset:
 PriceCH
 SalePriceCH
 PriceDiff
 DiscMM
 SalePriceMM
 PctDiscCH
Collinearity output:
Post variable reduction using collinearity method the data set has following remaining non-collinear
variables:
STEP-2:
To perform further analysis dataset was partitioned in training and validation sets in the proportion of
70:30 percentage, where 70%of original dataset was assigned as training data set and rest 30% as
validation data set.
LOGISTIC REGRESIION ANALYSIS:
The cutoff probability for prediction under this regression analysis was set to 0.5.
Training Confusion Matrix:
Post training and prediction on the logistic model using training dataset following confusion matrix was
obtained:

Accuracy: (425+307)/ (425+67+77+307) = 83.56%

Misclassification: (77+67)/( 425+67+77+307) = 16.43%
Validation Confusion Matrix:
The confusion matrix obtained from validation 30% data over logistic model is:

Accuracy: (225+127)/(30+46+225+127) = 82.24%

Misclassification: (30+46)/ (30+46+225+127) = 17.75%
Benchmark Accuracy:
The actual number of CH observations in original dataset: 653
The actual number of MM observations in original dataset: 417
If prediction was made exactly as the CH and MM observation in dataset, then:
Accuracy: 653/ (653+417) = 61.02% <- Can be considered as benchmark accuracy to evaluate the
performance of logistic model.
The accuracy of validation model is close to the training model and greater than benchmark accuracy.
Therefore the model can be considered to be a good fit and acceptable.
ROC Curve:
The ROC curve represents greater area under curve above the straight benchmark line. This shows that
TPR is significantly higher than FPR for the logistic model. Therefore the model Is good fit in the
prediction accuracy.
STEP-3:
LINEAR DISCRIMINANT MODEL:
Training Confusion Matrix:
The confusion matrix for training data under this model is:

Accuracy: (398+221)/(398+221+73+57) = 82.64%

Misclassification: (57+73)/ (398+221+73+57) = 17.35

Validation Confusion Matrix:

Accuracy: (220+130)/(220+130+43+35) = 81.75%

Misclassification: (35+43)/( 220+130+43+35) = 18.22%

Prediction Histogram:
The above histogram for CH and MM purchase groups shows significant overlap in the central area.
Therefore from above observation we can conclude that prediction accuracy for LDA model is lower
than Logistic model and also the significant overlap in the histogram among two categories of Purchase
for LDA prediction model shows that this model is not good fit in predictive power for the given dataset.
STEP-4:
KNN ANALYSIS:
Confusion Matrix:

Accuracy: (183+95)/(183+28+15+95) = 86.6%

Misclassification: (15+28)/( 183+28+15+95) =13.4%
The accuracy is good in this model but KNN doesn’t give any powerful insights of the data as it is given in
LDA and LR model. It is major generalized scenario of data classification whose accuracy depends on
value of K. Therefore this model cannot be considered as precise and powerful as Logistic Regression.

CONCLUSION: Among all the three models Logistic Regression model is most powerful and accurate and
best fit for the given type of dataset. It has good accuracy, lesser misclassification and ROC curve hihly
supports this model.

Vous aimerez peut-être aussi

The Yellow House: A Memoir (2019 National Book Award Winner)
D'Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Évaluation : 4 sur 5 étoiles
4/5 (98)
10 Normalization
Document81 pages
10 Normalization
Sammy
Pas encore d'évaluation
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
D'Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Évaluation : 4 sur 5 étoiles
4/5 (895)
Unit-Ii: Chapter-4 Object Oriented Methodologies Objectives
Document64 pages
Unit-Ii: Chapter-4 Object Oriented Methodologies Objectives
corry amellia
Pas encore d'évaluation
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
D'Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Évaluation : 4 sur 5 étoiles
4/5 (5794)
Cs 2 PDF
Document4 pages
Cs 2 PDF
Harsimran Kapoor
Pas encore d'évaluation
The Little Book of Hygge: Danish Secrets to Happy Living
D'Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Évaluation : 3.5 sur 5 étoiles
3.5/5 (399)
20.information Modeling of Online Air Tickets Reservation System PDF
Document4 pages
20.information Modeling of Online Air Tickets Reservation System PDF
Hoàng Hải
Pas encore d'évaluation
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
D'Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Évaluation : 4.5 sur 5 étoiles
4.5/5 (266)
1 DDL
Document2 pages
1 DDL
Kpsmurugesan Kpsm
Pas encore d'évaluation
Shoe Dog: A Memoir by the Creator of Nike
D'Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Évaluation : 4.5 sur 5 étoiles
4.5/5 (537)
Whats New in Autodesk Revit Building 9 9.1
Document21 pages
Whats New in Autodesk Revit Building 9 9.1
api-3770976
Pas encore d'évaluation
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
D'Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Évaluation : 4.5 sur 5 étoiles
4.5/5 (474)
Database Management Systems Unit-I: Text Book
Document1 page
Database Management Systems Unit-I: Text Book
Haraprasad Naik
Pas encore d'évaluation
Never Split the Difference: Negotiating As If Your Life Depended On It
D'Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Évaluation : 4.5 sur 5 étoiles
4.5/5 (838)
04-Random-Variate Generation
Document18 pages
04-Random-Variate Generation
Jesse Sanders
Pas encore d'évaluation
Grit: The Power of Passion and Perseverance
D'Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Évaluation : 4 sur 5 étoiles
4/5 (588)
Chap4 ERM Revised
Document61 pages
Chap4 ERM Revised
Ay Ar Ey Valencia
Pas encore d'évaluation
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
D'Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Évaluation : 3.5 sur 5 étoiles
3.5/5 (231)
Scalar Wars The Brave New World of Scalar Electromagnetics
Document76 pages
Scalar Wars The Brave New World of Scalar Electromagnetics
PerfectKey21
Pas encore d'évaluation
Principles: Life and Work
D'Everand
Principles: Life and Work
Ray Dalio
Évaluation : 4 sur 5 étoiles
4/5 (599)
Test Score, X: Regression Line
Document5 pages
Test Score, X: Regression Line
JeffersonTalan
Pas encore d'évaluation
The Emperor of All Maladies: A Biography of Cancer
D'Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Évaluation : 4.5 sur 5 étoiles
4.5/5 (271)
Lab 1 Introduction To UML
Document8 pages
Lab 1 Introduction To UML
sayed omran
Pas encore d'évaluation
Yes Please
D'Everand
Yes Please
Amy Poehler
Évaluation : 4 sur 5 étoiles
4/5 (1891)
Chapter 3 - ER
Document73 pages
Chapter 3 - ER
sejal
Pas encore d'évaluation
The World Is Flat 3.0: A Brief History of the Twenty-first Century
D'Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2259)
IDL Programming & Data Visualization: Shou-Lien Chen Department of Physics, NCUE
Document67 pages
IDL Programming & Data Visualization: Shou-Lien Chen Department of Physics, NCUE
rajawishes
Pas encore d'évaluation
On Fire: The (Burning) Case for a Green New Deal
D'Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Évaluation : 4 sur 5 étoiles
4/5 (73)
CHAPTER3 Continuous Probability Distribution
Document56 pages
CHAPTER3 Continuous Probability Distribution
Julrey Garcia
Pas encore d'évaluation
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
D'Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Évaluation : 4.5 sur 5 étoiles
4.5/5 (344)
Lab Manual
Document17 pages
Lab Manual
Hamza Malik
50% (2)
Rise of ISIS: A Threat We Can't Ignore
D'Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Évaluation : 3.5 sur 5 étoiles
3.5/5 (137)
Q11 Difference Between C.P.M. & P.E.R.T.: Project
Document1 page
Q11 Difference Between C.P.M. & P.E.R.T.: Project
Atharva Joshi
Pas encore d'évaluation
Team of Rivals: The Political Genius of Abraham Lincoln
D'Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Évaluation : 4.5 sur 5 étoiles
4.5/5 (234)
Computer Animation
Document2 pages
Computer Animation
ssa_joe
Pas encore d'évaluation
Fear: Trump in the White House
D'Everand
Fear: Trump in the White House
Bob Woodward
Évaluation : 3.5 sur 5 étoiles
3.5/5 (738)
Lec 2 Data Modeling and Database Design
Document10 pages
Lec 2 Data Modeling and Database Design
Jeffrey Fernandez Papa
Pas encore d'évaluation
John Adams
D'Everand
John Adams
David McCullough
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2409)
Chapter # 11 (The Basic Seven-B7-Tools of Quality)
Document54 pages
Chapter # 11 (The Basic Seven-B7-Tools of Quality)
Ali Ahmed
100% (1)
The Unwinding: An Inner History of the New America
D'Everand
The Unwinding: An Inner History of the New America
George Packer
Évaluation : 4 sur 5 étoiles
4/5 (45)
Ty Bcom Sem 5 DBMS Full
Document21 pages
Ty Bcom Sem 5 DBMS Full
Laxmikant Yadav
Pas encore d'évaluation
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
D'Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
Évaluation : 4 sur 5 étoiles
4/5 (1090)
C++ Cse 1st Yr
Document34 pages
C++ Cse 1st Yr
Prangya Pradhan
Pas encore d'évaluation
Steve Jobs
D'Everand
Steve Jobs
Walter Isaacson
Évaluation : 4.5 sur 5 étoiles
4.5/5 (806)
Multiple Linear Regression - Six Sigma Study Guide
Document9 pages
Multiple Linear Regression - Six Sigma Study Guide
Sunil
Pas encore d'évaluation
Angela's Ashes: A Memoir
D'Everand
Angela's Ashes: A Memoir
Frank McCourt
Évaluation : 4.5 sur 5 étoiles
4.5/5 (440)
General Hos Lem
Document9 pages
General Hos Lem
SeyChell Norinha
Pas encore d'évaluation
Bad Feminist: Essays
D'Everand
Bad Feminist: Essays
Roxane Gay
Évaluation : 4 sur 5 étoiles
4/5 (1015)
Bayesian Modeling Using The MCMC Procedure
Document22 pages
Bayesian Modeling Using The MCMC Procedure
Kian Jahromi
Pas encore d'évaluation
The Glass Castle: A Memoir
D'Everand
The Glass Castle: A Memoir
Jeannette Walls
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1712)
Combinepdf
Document190 pages
Combinepdf
Umair Rajpoot
Pas encore d'évaluation
The Light Between Oceans: A Novel
D'Everand
The Light Between Oceans: A Novel
M.L. Stedman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (789)
Dbms
Document74 pages
Dbms
Guneet Garg
Pas encore d'évaluation
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
D'Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Évaluation : 4.5 sur 5 étoiles
4.5/5 (121)
List of Deep Learning and NLP Resources
Document69 pages
List of Deep Learning and NLP Resources
BartoszSowul
100% (1)
The Outsider: A Novel
D'Everand
The Outsider: A Novel
Stephen King
Évaluation : 4 sur 5 étoiles
4/5 (1839)
SAP IBP Production Source Header - Sample Data
Document12 pages
SAP IBP Production Source Header - Sample Data
praty888
100% (1)
Brooklyn: A Novel
D'Everand
Brooklyn: A Novel
Colm Tóibín
Évaluation : 3.5 sur 5 étoiles
3.5/5 (1937)
WHERE Clause: DCL Command
Document8 pages
WHERE Clause: DCL Command
Babita Yadav
Pas encore d'évaluation
A Man Called Ove: A Novel
D'Everand
A Man Called Ove: A Novel
Fredrik Backman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (4609)
The Woman in Cabin 10
D'Everand
The Woman in Cabin 10
Ruth Ware
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2322)
Wolf Hall: A Novel
D'Everand
Wolf Hall: A Novel
Hilary Mantel
Évaluation : 4 sur 5 étoiles
4/5 (3811)
Manhattan Beach: A Novel
D'Everand
Manhattan Beach: A Novel
Jennifer Egan
Évaluation : 3.5 sur 5 étoiles
3.5/5 (792)
The Perks of Being a Wallflower
D'Everand
The Perks of Being a Wallflower
Stephen Chbosky
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2102)
Little Women
D'Everand
Little Women
Louisa May Alcott
Évaluation : 4 sur 5 étoiles
4/5 (104)
The Art of Racing in the Rain: A Novel
D'Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Évaluation : 4 sur 5 étoiles
4/5 (4200)
Sing, Unburied, Sing: A Novel
D'Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Évaluation : 4 sur 5 étoiles
4/5 (1103)
Her Body and Other Parties: Stories
D'Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Évaluation : 4 sur 5 étoiles
4/5 (821)
A Tree Grows in Brooklyn
D'Everand
A Tree Grows in Brooklyn
Betty Smith
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1929)
The Constant Gardener: A Novel
D'Everand
The Constant Gardener: A Novel
John le Carré
Évaluation : 3.5 sur 5 étoiles
3.5/5 (104)