M.L. Units 3, 5, 6
Unit 3
1. Bias:
Bias refers to the error introduced by approximating a real-world problem with a simplified
model. High bias can lead to underfitting, where the model fails to capture the underlying
patterns in the data.
2. Variance:
Variance refers to the sensitivity of a model to fluctuations in the training data. High variance can
lead to overfitting, where the model becomes too complex and fits the noise in the data instead
of the underlying patterns.
3. Generalization:
Generalization refers to the ability of a model to perform well on unseen data. A model with
good generalization is able to accurately predict outcomes for new, unseen instances.
4. Underfitting:
Underfitting occurs when a model is too simple to capture the underlying patterns in the data. It
leads to high bias and poor performance on both the training and test data.
5. Overfitting:
Overfitting occurs when a model becomes too complex and fits the noise or random variations
in the training data. It leads to low bias but high variance, causing poor performance on the test
data.
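As a rough illustration (a sketch only, assuming scikit-learn and a synthetic noisy sine dataset, neither of which is part of these notes), fitting polynomial models of increasing degree shows underfitting at low degree and overfitting at high degree:

import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

# Synthetic data: a noisy sine curve (illustrative only).
rng = np.random.RandomState(0)
X = np.sort(rng.uniform(0, 6, 80)).reshape(-1, 1)
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=80)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

for degree in (1, 4, 15):  # too simple, reasonable, very flexible
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_err = mean_squared_error(y_train, model.predict(X_train))
    test_err = mean_squared_error(y_test, model.predict(X_test))
    # Low degrees tend to underfit (both errors high); very high degrees
    # tend to overfit (train error low, test error noticeably higher).
    print(f"degree={degree:2d}  train MSE={train_err:.3f}  test MSE={test_err:.3f}")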
6. Linear Regression:
Linear regression is a popular regression algorithm used to model the relationship between a
dependent variable and one or more independent variables. It assumes a linear relationship and
aims to find the best-fit line that minimizes the sum of squared differences between the
predicted and actual values (ordinary least squares).
8. Ridge Regression:
Ridge Regression is a linear regression technique that incorporates L2 regularization. It adds a
penalty term to the loss function, which discourages large weights in the model. Ridge
regression helps prevent overfitting by reducing the magnitude of the coefficients.
Regression Evaluation Metrics:
- Mean Absolute Error (MAE): It calculates the average absolute difference between the
predicted and actual values. MAE represents the average magnitude of the errors.
- Root Mean Squared Error (RMSE): It calculates the square root of the average squared
difference between the predicted and actual values. RMSE penalizes larger errors more than
MAE and provides a measure of the standard deviation of the errors.
- R2 (R-squared): R2 represents the proportion of the variance in the dependent variable that
can be explained by the independent variables. It typically ranges from 0 to 1 (and can be
negative when a model fits worse than simply predicting the mean), where a higher value
indicates a better fit of the model to the data.
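A minimal sketch tying these ideas together, assuming scikit-learn and a small synthetic dataset (not specified in the notes): it fits plain linear regression and ridge regression and reports MAE, RMSE, and R2 on held-out data.

import numpy as np
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

# Synthetic data with a known linear relationship plus noise.
rng = np.random.RandomState(1)
X = rng.normal(size=(200, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + rng.normal(scale=0.5, size=200)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

for name, model in [("Linear", LinearRegression()), ("Ridge", Ridge(alpha=1.0))]:
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    mae = mean_absolute_error(y_test, pred)
    rmse = np.sqrt(mean_squared_error(y_test, pred))  # RMSE = square root of MSE
    r2 = r2_score(y_test, pred)
    print(f"{name}: MAE={mae:.3f}  RMSE={rmse:.3f}  R2={r2:.3f}")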
Unit 5
1. K-Means Clustering:
K-Means is a partition-based clustering algorithm. It aims to divide a dataset into K clusters,
where K is a predefined number. The algorithm iteratively assigns data points to the nearest
cluster centroid and recalculates the centroids until convergence. Each data point belongs to the
cluster with the nearest centroid.
Example: Consider a dataset of customer information, where each data point represents a
customer. K-Means can be used to cluster customers into segments based on their purchasing
patterns, such as high-value, medium-value, and low-value customers.
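A short sketch of this customer-segmentation idea, assuming scikit-learn and made-up spend/frequency features (illustrative values only):

import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Hypothetical customer features: [annual spend, purchase frequency].
X = np.array([[5200, 42], [4800, 39], [900, 6], [1100, 8],
              [2500, 20], [2700, 22], [300, 2], [5600, 45]])
X_scaled = StandardScaler().fit_transform(X)  # scale features before clustering

kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X_scaled)
print(kmeans.labels_)           # cluster index for each customer
print(kmeans.cluster_centers_)  # centroids in scaled feature space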
2. K-medoids Clustering:
K-medoids is similar to K-Means but uses medoids as representatives of clusters instead of
centroids. A medoid is the most centrally located point within a cluster, minimizing the
dissimilarity to other points. The algorithm iteratively selects data points as medoids and
updates the cluster assignments until convergence.
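One way to run K-medoids in Python is the KMedoids estimator from the optional scikit-learn-extra package; this is an assumed library choice, since the notes do not name one.

import numpy as np
from sklearn_extra.cluster import KMedoids  # requires the scikit-learn-extra package

X = np.array([[1.0, 2.0], [1.2, 1.8], [0.8, 2.1],
              [8.0, 8.0], [8.3, 7.7], [7.9, 8.2]])

# Medoids are actual data points, which makes the method more robust to
# outliers and usable with arbitrary distance metrics.
kmedoids = KMedoids(n_clusters=2, metric="euclidean", random_state=0).fit(X)
print(kmedoids.labels_)           # cluster assignment per point
print(kmedoids.cluster_centers_)  # the chosen medoids (rows of X)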
3. Hierarchical Clustering:
Hierarchical clustering creates a hierarchy of clusters using either an agglomerative (bottom-up)
or divisive (top-down) approach. It starts with each data point as an individual cluster and
merges or splits clusters based on similarity metrics. The result is a tree-like structure called a
dendrogram, which can be cut at different levels to obtain different numbers of clusters.
Example: Hierarchical clustering can be used in genetics to classify patients into different
subgroups based on gene expression levels, aiding in the identification of distinct disease
profiles.
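A brief sketch using SciPy's agglomerative linkage (an assumed library choice); each row of X stands in for one observation, e.g. a patient's expression levels in the genetics example.

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Toy 2-D data; two well-separated groups.
X = np.array([[1.0, 1.1], [1.2, 0.9], [0.9, 1.0],
              [5.0, 5.2], [5.1, 4.9], [4.8, 5.1]])

Z = linkage(X, method="ward")                    # bottom-up (agglomerative) merging
labels = fcluster(Z, t=2, criterion="maxclust")  # cut the dendrogram into 2 clusters
print(labels)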
4. Density-based Clustering:
Density-based clustering, such as DBSCAN (Density-Based Spatial Clustering of Applications
with Noise), groups together data points based on their density in the feature space. It defines
clusters as regions of high density separated by regions of low density. Data points that do not
belong to any cluster are considered outliers or noise.
Example: Density-based clustering can be applied to identify traffic congestion patterns in a city
based on GPS data, where clusters represent congested areas.
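A small DBSCAN sketch, assuming scikit-learn and toy coordinates rather than real GPS data:

import numpy as np
from sklearn.cluster import DBSCAN

# Two dense groups plus one isolated point (treated as noise).
X = np.array([[0.0, 0.0], [0.1, 0.1], [0.2, 0.0],
              [4.0, 4.0], [4.1, 3.9], [3.9, 4.1],
              [10.0, 10.0]])

db = DBSCAN(eps=0.5, min_samples=2).fit(X)
print(db.labels_)  # label -1 marks noise/outlier points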
5. Spectral Clustering:
Spectral clustering is a technique that utilizes the eigenvectors of a similarity matrix to perform
dimensionality reduction and clustering. It treats data points as nodes in a graph and performs
clustering based on the graph's Laplacian matrix. Spectral clustering can handle complex
cluster shapes and is effective for non-linearly separable data.
Example: Spectral clustering can be used to group documents into topics based on their
content, where each document is represented as a vector in a high-dimensional space.
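A small sketch, assuming scikit-learn and the classic two-moons toy dataset (not from the notes) to show the non-linearly separable case:

import numpy as np
from sklearn.cluster import SpectralClustering
from sklearn.datasets import make_moons

# Two interleaving half-moons: not linearly separable, so K-Means struggles
# while spectral clustering can recover the two shapes.
X, _ = make_moons(n_samples=200, noise=0.05, random_state=0)

sc = SpectralClustering(n_clusters=2, affinity="nearest_neighbors",
                        n_neighbors=10, random_state=0)
labels = sc.fit_predict(X)
print(np.bincount(labels))  # roughly 100 points per cluster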
Outlier Analysis:
Outliers are data points that significantly deviate from the norm or expected patterns. Outlier
analysis helps identify and understand these unusual observations. Two commonly used
methods are:
1. Isolation Forest: The isolation forest algorithm isolates outliers by randomly selecting a
feature and then randomly selecting a split value between the minimum and maximum values of
that feature. Outliers are expected to require fewer splits to be isolated compared to normal data
points.
2. Local Outlier Factor (LOF): LOF measures the local deviation of a data point with respect to
its neighbors. It calculates the ratio of the average local density of its neighbors to
its own local density. Points with a significantly lower density compared to their neighbors are
considered outliers.
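Both detectors are available in scikit-learn (an assumed library choice); a minimal sketch on synthetic data with two planted outliers:

import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.neighbors import LocalOutlierFactor

# A tight cluster plus two obvious outliers (synthetic, illustrative).
rng = np.random.RandomState(0)
X = np.vstack([rng.normal(0, 0.5, size=(100, 2)),
               [[6.0, 6.0], [-7.0, 5.0]]])

iso = IsolationForest(random_state=0).fit(X)
print(iso.predict(X)[-2:])      # -1 flags anomalies, +1 normal points

lof = LocalOutlierFactor(n_neighbors=20)
print(lof.fit_predict(X)[-2:])  # same convention: -1 = outlier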
Cluster Evaluation:
1. Elbow Method: The elbow method helps determine the optimal number of clusters (K) for
algorithms like K-Means. It plots the within-cluster sum of squares (WCSS) against the number
of clusters and suggests selecting the number of clusters at the "elbow" point where the
improvement in WCSS starts to diminish significantly.
3. Intrinsic Evaluation: Intrinsic evaluation assesses the quality of clustering based on internal
criteria without using external labels. Metrics such as the silhouette coefficient, Calinski-Harabasz
index, and Davies-Bouldin index quantify how compact and well separated the clusters are.
These evaluation metrics help quantify the performance of clustering algorithms and guide the
selection of appropriate parameters and techniques.
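A compact sketch of the elbow method together with an intrinsic metric (the silhouette coefficient), assuming scikit-learn and synthetic blob data:

import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=300, centers=4, cluster_std=0.8, random_state=0)

# Elbow method: WCSS (KMeans "inertia_") versus K; look for the bend where
# further increases in K give only small improvements.
for k in range(2, 8):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    sil = silhouette_score(X, km.labels_)  # intrinsic metric in [-1, 1]
    print(f"K={k}  WCSS={km.inertia_:.1f}  silhouette={sil:.3f}")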
Unit 6
Artificial Neural Networks (ANNs) are a class of machine learning models inspired by the
structure and function of the human brain. ANNs consist of interconnected artificial neurons or
nodes that process and transmit information. Key architectures include Single Layer Neural
Networks, the Multilayer Perceptron, Back Propagation Learning, the Functional Link Artificial
Neural Network, and the Radial Basis Function Network, along with activation functions,
Recurrent Neural Networks (RNNs), and Convolutional Neural Networks (CNNs).
Activation Functions:
Activation functions introduce nonlinearity into ANNs, enabling them to learn complex
relationships. Common activation functions include the sigmoid, tanh, and ReLU functions,
illustrated in the sketch below.
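A quick NumPy sketch of the three functions named above (library choice assumed):

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))  # squashes inputs into (0, 1)

def tanh(x):
    return np.tanh(x)                # squashes inputs into (-1, 1)

def relu(x):
    return np.maximum(0.0, x)        # zero for negatives, identity otherwise

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(sigmoid(x))
print(tanh(x))
print(relu(x))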
Convolutional Neural Networks (CNNs) apply learned convolutional filters and pooling
operations to the input to extract the most relevant information. CNNs have achieved significant
success in computer vision tasks, including image classification, object detection, and image
segmentation.
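A minimal CNN sketch for image classification, assuming TensorFlow/Keras (not named in the notes) and 28x28 grayscale inputs with 10 classes:

import tensorflow as tf
from tensorflow.keras import layers

# Tiny CNN for 28x28 grayscale images with 10 classes (illustrative sizes).
model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    layers.Conv2D(16, kernel_size=3, activation="relu"),
    layers.MaxPooling2D(pool_size=2),        # downsample, keeping strong activations
    layers.Conv2D(32, kernel_size=3, activation="relu"),
    layers.MaxPooling2D(pool_size=2),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(10, activation="softmax"),  # class probabilities
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()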