Bienvenue sur Scribd !

Ignorer le carrousel

3-Clustering 3 KMeans

Transféré par

Rajkishor gupta

0% ont trouvé ce document utile (0 vote)

17 vues13 pages

K means best book

Copyright

Formats disponibles

PDF, TXT ou lisez en ligne sur Scribd

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Signaler ce document

K means best book

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

0% ont trouvé ce document utile (0 vote)

17 vues13 pages

3-Clustering 3 KMeans

Transféré par

Rajkishor gupta

K means best book

Droits d'auteur :

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

Passer à la page

Vous êtes sur la page 1sur 13

Rechercher à l'intérieur du document

Mining

of Massive Datasets

Leskovec, Rajaraman, and Ullman
Stanford University
¡  Assumes Euclidean space/distance

¡  Start by picking k, the number of clusters

¡  Ini<alize clusters by picking one point per

cluster
§  For the moment, assume we pick the k points at
random

2
¡  1) For each point, place it in the cluster whose
current centroid it is nearest
¡  2) AAer all points are assigned, update the
loca<ons of centroids of the k clusters
¡  3) Reassign all points to their closest centroid
§  Some<mes moves points between clusters

¡  Repeat 2 and 3 un.l convergence

§  Convergence: Points don’t move between clusters
and centroids stabilize
3
x
x
x
x
x

x x x x x x

x … data point
… centroid
Round 1
4
x
x
x
x
x

x x x x x x

x … data point
… centroid
Round 2
5
x
x
x
x
x

x x x x x x

x … data point
… centroid
Round 3
6
How to select k?
¡  Try diﬀerent k, looking at the change in the
average distance to centroid, as k increases.

7
Too few; x
many long x
xx x
distances
x x
to centroid. x x x x x
x x x x x
x xx x xx x
x x x x
x x

x x x
x x x x
x x x
x

8
x
Just right; x
distances xx x
rather short. x x
x x x x x
x x x x x
x xx x xx x
x x x x
x x

x x x
x x x x
x x x
x

9
Too many; x
little improvement x
in average xx x
distance. x x
x x x x x
x x x x x
x xx x xx x
x x x x
x x

x x x
x x x x
x x x
x

10
Average falls rapidly un<l right k, then falls
much more slowly

Best value
of k
Average
distance to
centroid k

11
¡  Approach 1: Sampling
§  Cluster a sample of the data using hierarchical
clustering, to obtain k clusters
§  Pick a point from each cluster (e.g. point closest to
centroid)
§  Sample ﬁts in main memory

¡  Approach 2: Pick “dispersed” set of points

§  Pick ﬁrst point at random
§  Pick the next point to be the one whose minimum
distance from the selected points is as large as
possible
§  Repeat un<l we have k points
12
¡  In each round, we have to examine each input
point exactly once to ﬁnd closest centroid

¡  Each round is O(kN) for N points, k clusters

¡  But the number of rounds to convergence can

be very large!

¡  Can we cluster in a single pass over the data?

Vous aimerez peut-être aussi

Shoe Dog: A Memoir by the Creator of Nike
D'Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
Évaluation : 4.5 sur 5 étoiles
4.5/5 (537)
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
D'Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
Évaluation : 4 sur 5 étoiles
4/5 (5794)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
D'Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
Évaluation : 4 sur 5 étoiles
4/5 (890)
The Yellow House: A Memoir (2019 National Book Award Winner)
D'Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
Évaluation : 4 sur 5 étoiles
4/5 (98)
The Little Book of Hygge: Danish Secrets to Happy Living
D'Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
Évaluation : 3.5 sur 5 étoiles
3.5/5 (399)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
D'Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
Évaluation : 3.5 sur 5 étoiles
3.5/5 (231)
Never Split the Difference: Negotiating As If Your Life Depended On It
D'Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
Évaluation : 4.5 sur 5 étoiles
4.5/5 (838)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
D'Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
Évaluation : 4.5 sur 5 étoiles
4.5/5 (474)
Rise of ISIS: A Threat We Can't Ignore
D'Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
Évaluation : 3.5 sur 5 étoiles
3.5/5 (137)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
D'Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
Évaluation : 4.5 sur 5 étoiles
4.5/5 (344)
Grit: The Power of Passion and Perseverance
D'Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
Évaluation : 4 sur 5 étoiles
4/5 (587)
Yes Please
D'Everand
Yes Please
Amy Poehler
Évaluation : 4 sur 5 étoiles
4/5 (1891)
On Fire: The (Burning) Case for a Green New Deal
D'Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
Évaluation : 4 sur 5 étoiles
4/5 (73)
The Emperor of All Maladies: A Biography of Cancer
D'Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
Évaluation : 4.5 sur 5 étoiles
4.5/5 (271)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
D'Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
Évaluation : 4.5 sur 5 étoiles
4.5/5 (265)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
D'Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brene Brown
Évaluation : 4 sur 5 étoiles
4/5 (1090)
Team of Rivals: The Political Genius of Abraham Lincoln
D'Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
Évaluation : 4.5 sur 5 étoiles
4.5/5 (234)
Angela's Ashes: A Memoir
D'Everand
Angela's Ashes: A Memoir
Frank McCourt
Évaluation : 4.5 sur 5 étoiles
4.5/5 (440)
Principles: Life and Work
D'Everand
Principles: Life and Work
Ray Dalio
Évaluation : 4 sur 5 étoiles
4/5 (599)
Steve Jobs
D'Everand
Steve Jobs
Walter Isaacson
Évaluation : 4.5 sur 5 étoiles
4.5/5 (806)
Fear: Trump in the White House
D'Everand
Fear: Trump in the White House
Bob Woodward
Évaluation : 3.5 sur 5 étoiles
3.5/5 (738)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
D'Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2219)
The Unwinding: An Inner History of the New America
D'Everand
The Unwinding: An Inner History of the New America
George Packer
Évaluation : 4 sur 5 étoiles
4/5 (45)
John Adams
D'Everand
John Adams
David McCullough
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2409)
Bad Feminist: Essays
D'Everand
Bad Feminist: Essays
Roxane Gay
Évaluation : 4 sur 5 étoiles
4/5 (1015)
The Glass Castle: A Memoir
D'Everand
The Glass Castle: A Memoir
Jeannette Walls
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1711)
The Outsider: A Novel
D'Everand
The Outsider: A Novel
Stephen King
Évaluation : 4 sur 5 étoiles
4/5 (1839)
The Light Between Oceans: A Novel
D'Everand
The Light Between Oceans: A Novel
M.L. Stedman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (789)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
D'Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
Évaluation : 4.5 sur 5 étoiles
4.5/5 (119)
The Perks of Being a Wallflower
D'Everand
The Perks of Being a Wallflower
Stephen Chbosky
Évaluation : 4.5 sur 5 étoiles
4.5/5 (2099)
Brooklyn: A Novel
D'Everand
Brooklyn: A Novel
Colm Tóibín
Évaluation : 3.5 sur 5 étoiles
3.5/5 (1937)
Wolf Hall: A Novel
D'Everand
Wolf Hall: A Novel
Hilary Mantel
Évaluation : 4 sur 5 étoiles
4/5 (3811)
The Woman in Cabin 10
D'Everand
The Woman in Cabin 10
Ruth Ware
Évaluation : 3.5 sur 5 étoiles
3.5/5 (2322)
A Man Called Ove: A Novel
D'Everand
A Man Called Ove: A Novel
Fredrik Backman
Évaluation : 4.5 sur 5 étoiles
4.5/5 (4609)
The Art of Racing in the Rain: A Novel
D'Everand
The Art of Racing in the Rain: A Novel
Garth Stein
Évaluation : 4 sur 5 étoiles
4/5 (4200)
Manhattan Beach: A Novel
D'Everand
Manhattan Beach: A Novel
Jennifer Egan
Évaluation : 3.5 sur 5 étoiles
3.5/5 (792)
Sing, Unburied, Sing: A Novel
D'Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
Évaluation : 4 sur 5 étoiles
4/5 (1103)
Little Women
D'Everand
Little Women
Louisa May Alcott
Évaluation : 4 sur 5 étoiles
4/5 (104)
Her Body and Other Parties: Stories
D'Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
Évaluation : 4 sur 5 étoiles
4/5 (821)
A Tree Grows in Brooklyn
D'Everand
A Tree Grows in Brooklyn
Betty Smith
Évaluation : 4.5 sur 5 étoiles
4.5/5 (1929)
The Constant Gardener: A Novel
D'Everand
The Constant Gardener: A Novel
John le Carre
Évaluation : 3.5 sur 5 étoiles
3.5/5 (104)
Evaluating Websites
Document2 pages
Evaluating Websites
api-322361374
Pas encore d'évaluation
Muhammad Zahrandhika Bastian-3
Document2 pages
Muhammad Zahrandhika Bastian-3
dhika zahrandhika
Pas encore d'évaluation
Week 5 - Creativity in Research
Document4 pages
Week 5 - Creativity in Research
D Barik
100% (1)
CS4000 Data Sheet
Document6 pages
CS4000 Data Sheet
Juan Luis Espinosa
Pas encore d'évaluation
Investment Banking Interview Strengths and Weaknesses PDF
Document15 pages
Investment Banking Interview Strengths and Weaknesses PDF
kamrul
Pas encore d'évaluation
Summary of C: How To Program Sixth Edition by Deitel: Introduction To Computers, The Internet and The Web
Document15 pages
Summary of C: How To Program Sixth Edition by Deitel: Introduction To Computers, The Internet and The Web
Frieda Ngaharjo
Pas encore d'évaluation
TCXD 46-1984 / Lightning Protection For Buildings - Standard For Design and Construction
Document30 pages
TCXD 46-1984 / Lightning Protection For Buildings - Standard For Design and Construction
trungjindo
Pas encore d'évaluation
Business Conclave - Concept Design
Document3 pages
Business Conclave - Concept Design
Sajal Gupta
Pas encore d'évaluation
Free Study PDF Download from pebexam Blog
Document22 pages
Free Study PDF Download from pebexam Blog
k_jaiswal
Pas encore d'évaluation
Suffolk Bus S92 Schedule Effective 5-2712
Document2 pages
Suffolk Bus S92 Schedule Effective 5-2712
RiverheadLOCAL
67% (6)
Cooperating Sequential Processes (Dijkstra) - Paper
Document74 pages
Cooperating Sequential Processes (Dijkstra) - Paper
Cole Arora
Pas encore d'évaluation
The Jambeli Culture of South Coastal Ecuador Estrada Meggers y Evans
Document93 pages
The Jambeli Culture of South Coastal Ecuador Estrada Meggers y Evans
Marco Vargas
Pas encore d'évaluation
CE ENG HL660,660L, HL665,665L AUG2018 Rev.0 Web
Document4 pages
CE ENG HL660,660L, HL665,665L AUG2018 Rev.0 Web
John Leonne
Pas encore d'évaluation
Stakeholder Register
Document7 pages
Stakeholder Register
rouzbehk6515
Pas encore d'évaluation
Frankfurt School taxes and ideology critique
Document5 pages
Frankfurt School taxes and ideology critique
Ernesto Bulnes
Pas encore d'évaluation
ISSAQ: An Integrated Sensing Systems For Real-Time Indoor Air Quality Monitoring
Document15 pages
ISSAQ: An Integrated Sensing Systems For Real-Time Indoor Air Quality Monitoring
KemHuyền
Pas encore d'évaluation
5100 Series Gas Analyzer: Product Data Sheet
Document2 pages
5100 Series Gas Analyzer: Product Data Sheet
Sai Kamala
Pas encore d'évaluation
Upto 62rd BPSC Mechanical Question Bank
Document140 pages
Upto 62rd BPSC Mechanical Question Bank
ASHISH KUMAR SINGH
Pas encore d'évaluation
Mass Transfer in Industrial Applications
Document1 page
Mass Transfer in Industrial Applications
MPD19I001 VITHISHA M
Pas encore d'évaluation
QUANTUM TELEPORTATION
Document23 pages
QUANTUM TELEPORTATION
alkagaba
Pas encore d'évaluation
Introduction to SONET and DWDM Digital Transmission Standards
Document35 pages
Introduction to SONET and DWDM Digital Transmission Standards
Omar Ayoub
100% (1)
Scheme of Examination For Master of Computer APPLICATIONS (M.C.A.) W.E.F. Academic Session 2014-15
Document11 pages
Scheme of Examination For Master of Computer APPLICATIONS (M.C.A.) W.E.F. Academic Session 2014-15
Siddharth Jain
Pas encore d'évaluation
Measure RPA ROI with KPIs
Document4 pages
Measure RPA ROI with KPIs
Adnan Farooq
Pas encore d'évaluation
On Teacher's Philosophy of Education: SPARK Your Interest
Document10 pages
On Teacher's Philosophy of Education: SPARK Your Interest
Chuck Garrido
Pas encore d'évaluation
List of Syonoyms: Word Synonym Synonym
Document6 pages
List of Syonoyms: Word Synonym Synonym
Praveen Kumar
Pas encore d'évaluation
Grade 7 holiday assignment anagrams numbers analogies
Document4 pages
Grade 7 holiday assignment anagrams numbers analogies
360MaRko oo.
Pas encore d'évaluation
Swain 2006 Patternsofmaculinity
Document21 pages
Swain 2006 Patternsofmaculinity
Samuel Lim
Pas encore d'évaluation
ECO Report 03
Document96 pages
ECO Report 03
ahmedshah512
Pas encore d'évaluation
Medical Waiver
Document1 page
Medical Waiver
CheerBU
Pas encore d'évaluation
3M Water Filtration Products - High Flow Series - HF40 and HF40 - S Performance Data Sheet
Document2 pages
3M Water Filtration Products - High Flow Series - HF40 and HF40 - S Performance Data Sheet
Sergio
Pas encore d'évaluation