
Kruskal’s Algorithm and Clustering

(following Kleinberg and Tardos, Algorithm Design, pp. 158–161)

Recall that Kruskal’s algorithm, applied to a graph with weighted edges, produces a minimal spanning tree, i.e., a spanning tree of minimum total weight. This solves, for example, the problem of constructing the lowest-cost network connecting a set of sites, where the weight on each edge represents the cost of the corresponding link.
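
For concreteness, here is a minimal Python sketch of Kruskal’s algorithm, using a union-find forest to detect cycles; the function name kruskal_mst and the (weight, u, v) edge representation are illustrative choices, not prescribed by the text:

    # A minimal sketch of Kruskal's algorithm; edges are (weight, u, v)
    # triples over vertices 0 .. n-1, and ties are broken arbitrarily.
    def kruskal_mst(n, edges):
        parent = list(range(n))              # union-find forest: one tree per vertex

        def find(x):                         # root of x's component, with path halving
            while parent[x] != x:
                parent[x] = parent[parent[x]]
                x = parent[x]
            return x

        mst = []
        for w, u, v in sorted(edges):        # consider edges in order of increasing weight
            ru, rv = find(u), find(v)
            if ru != rv:                     # adding this edge merges two components;
                parent[ru] = rv              # an edge inside one component would form a cycle
                mst.append((w, u, v))
        return mst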
The minimal spanning tree is also relevant to the clustering problem. Given some notion of similarity between objects, it is frequently useful to group objects into clusters, where clusters contain objects that are in some sense most similar. The objects could be photographs, documents, micro-organisms, etc. Given a set of objects p1, …, pn, a distance function d(pi, pj) specifies their similarity (or lack thereof). This function is symmetric, d(pi, pj) = d(pj, pi), and satisfies d(pi, pj) ≥ 0, with equality iff i = j.
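
As one concrete choice of distance function (an illustrative assumption; any function with these properties will do), take the Euclidean distance between points in the plane:

    import math

    # One admissible distance function: Euclidean distance in the plane.
    # It is symmetric and nonnegative, and zero exactly when p = q.
    def d(p, q):
        return math.hypot(p[0] - q[0], p[1] - q[1])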
Suppose the n objects are to be separated into k clusters C1, …, Ck. The “spacing” of any particular clustering is defined as the minimum distance between objects in any pair of different clusters. One reasonable criterion for a “good” clustering is to find k clusters with maximum spacing. Since the number of possible clusterings grows exponentially with the number of objects, we need an efficient algorithm to find one with maximum spacing.
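
The spacing of a given clustering can be computed directly from this definition; a short sketch, with illustrative names:

    # Spacing of a clustering: the minimum distance between any two objects
    # lying in different clusters; `clusters` is a list of lists of objects.
    def spacing(clusters, dist):
        return min(dist(p, q)
                   for a in range(len(clusters))
                   for b in range(a + 1, len(clusters))
                   for p in clusters[a]
                   for q in clusters[b])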
Consider growing a graph on the objects pi, viewed as vertices. Start by drawing an edge between the closest pair of points, then the next closest, etc.; at any given configuration the sets of connected vertices represent the clusters. (This procedure is known as single-link agglomerative clustering.) Note that since only the sets of clusters are of interest, it is not necessary to add any edges that connect vertices already in the same connected component. Hence the graph has no cycles: it is a union of trees.
This graph-growing procedure, though motivated by the idea of merging clusters, is precisely Kruskal’s algorithm. To produce a k-clustering of the objects, Kruskal’s algorithm is simply halted when there are k connected components, so that the last k − 1 edges are never added. This iterative merging procedure is also equivalent to computing the full minimal spanning tree, then deleting the k − 1 most expensive edges and taking the resulting k connected components to define a clustering C = {C1, …, Ck}.
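
A sketch of this halted version of Kruskal’s algorithm, reusing the union-find idea from the earlier sketch (the name k_clusters and the input conventions are illustrative):

    from itertools import combinations

    # Single-link k-clustering: run Kruskal on the complete graph of pairwise
    # distances, but halt once only k connected components remain.
    def k_clusters(points, dist, k):
        n = len(points)
        parent = list(range(n))

        def find(x):                         # union-find root, with path halving
            while parent[x] != x:
                parent[x] = parent[parent[x]]
                x = parent[x]
            return x

        edges = sorted((dist(points[i], points[j]), i, j)
                       for i, j in combinations(range(n), 2))
        components = n
        for _, i, j in edges:
            if components == k:              # halted: the last k-1 MST edges are never added
                break
            ri, rj = find(i), find(j)
            if ri != rj:                     # this edge merges two clusters
                parent[ri] = rj
                components -= 1

        groups = {}                          # collect the objects by component root
        for i in range(n):
            groups.setdefault(find(i), []).append(points[i])
        return list(groups.values())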
To see that this indeed produces k clusters with maximum spacing, note that the spacing of C = {C1, …, Ck} is the weight d∗ of the (k − 1)st most expensive edge in the minimal spanning tree (i.e., the next edge that would have been added): any pair of objects at distance less than d∗ was considered before the algorithm halted, and so ended up in the same cluster. If C′ = {C1′, …, Ck′} is some other clustering, then some cluster Cr of C is not contained within a single cluster of C′ (otherwise the two k-clusterings would coincide), so there exist pi, pj ∈ Cr with pi ∈ Cs′ and pj ∈ Ct′ ≠ Cs′. Each edge on the path from pi to pj within Cr has weight ≤ d∗, since Kruskal’s algorithm adds edges in order of increasing weight and halted before reaching the edge of weight d∗. Let p′ be the first vertex along this path no longer in Cs′ and let p be the vertex just before it (i.e., still in Cs′). Then p and p′ lie in different clusters of C′, but d(p, p′) ≤ d∗, so the spacing of C′ is no greater than that of C. The clustering C defined by the above procedure thus identifies a k-clustering with maximum spacing.
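
As a toy sanity check (illustrative data, using the k_clusters and spacing sketches above): four points on a line, split into k = 2 clusters whose spacing equals the weight d∗ of the single deleted spanning-tree edge:

    pts = [0.0, 1.0, 1.5, 6.0]
    d_line = lambda a, b: abs(a - b)                     # distance on the real line
    print(k_clusters(pts, d_line, 2))                    # [[0.0, 1.0, 1.5], [6.0]]
    print(spacing(k_clusters(pts, d_line, 2), d_line))   # 4.5 = d*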

INFO 295, 5 Oct 06
