DIP Lab 13 DBSCAN Clustering
DIP Lab 13 DBSCAN Clustering
DIP Lab 13 DBSCAN Clustering
Semester:
Lab#13: DBSCAN
Objectives
This laboratory exercise is focused DBSCAN clustering which is a widely used
unsupervised learning technique. Clustering is used on unlabeled data to look
for interesting groups and patterns.
Lab Instructions
This lab activity comprises of following parts: Lab Exercises, and Post-Lab
Viva/Quiz session.
The lab report shall be uploaded on LMS.
Only those tasks that are completed during the allocated lab time will be credited
to the students. Students are however encouraged to practice on their own in spare
time for enhancing their skills.
Lab Report Instructions
All questions should be answered precisely to get maximum credit. Lab report must
ensure following items:
Lab objectives
Python codes
Results (graphs/tables) duly commented and discussed
Conclusion
DBSCAN (Density-Based Spatial Clustering of Applications with Noise)
2. Algorithm:
For each data point p in the dataset D:
If p is not visited:
Mark p as visited.
Find all points in the ε-neighborhood of p (including p).
If the number of points in the neighborhood is less than
MinPts, mark p as noise.
Otherwise, create a new cluster and add p to the cluster.
Expand the cluster by adding all reachable points in the ε-
neighborhood to the cluster.
3. Output:
The algorithm identifies clusters of data points and marks some points
as noise if they don't belong to any cluster.
In the algorithm, a point q is considered to be in the ε-neighborhood of p if the
distance between p and q is less than or equal to ε. The algorithm classifies points
into three categories:
Core points: Points with at least MinPts points in their ε-neighborhood.
Border points: Points with fewer than MinPts points in their ε-neighborhood
but are reachable from a core point.
Noise points: Points that are neither core nor border points.
Figure 1:Credit https://www.theaidream.com/post/dbscan-clustering-algorithm-in-machine-learning