Unit 4-Regression and Learning

221601404
Data Science
Unit – 4 Introduction to Correlation , Regression & Learning Correlation Analysis
Introduction
Significance
Types
Positive and Negative
Linear and Non-linear
Sample , Partial and Multiple
Measurement Of Correlation
Karl Pearson’s Coefficient Of Co-relation
Regression analysis
Introduction
Types Of Regression
Simple and Multiple
Linear and Non – linear
Supervised & Unsupervised Learning
Supervised learning
Classification of supervised learning
Advantages and Disadvantages of Supervised
learning
Unsupervised learning
Classification of Unsupervised learning
Advantages and Disadvantages of UnSupervised
learning
Difference between Supervised & Unsupervised learning
Regression
Regression analysis is a statistical method that
examines the relationship between one or more
independent variables and a dependent variable. It's
commonly used for prediction and understanding the
strength and nature of relationships between variables.
Regression analysis
Types of Regression
Simple Regression: This involves one independent
variable predicting a dependent variable. It's like
predicting someone's height based solely on their age.
Multiple Regression: Here, we have multiple
independent variables predicting a dependent variable.
Imagine predicting someone's weight based on age,
height, and maybe even their daily pizza intake.
Types of Regression
Linear Regression: This type assumes a linear
relationship between variables, meaning the change in
the dependent variable is proportional to the change in
the independent variable(s). Think of a straight line on
a graph.
Non-Linear Regression: In contrast, this type
acknowledges a non-linear relationship. The
relationship might be curved, like a sine wave or a
parabola, making it a bit trickier to model.
Supervised & Unsupervised Learning
• Supervised learning
• Classification of supervised learning
• Advantages and Disadvantages of Supervised
• learning
• Unsupervised learning
• Classification of Unsupervised learning
• Advantages and Disadvantages of UnSupervised
• learning
• Difference between Supervised & Unsupervised
learning
Supervised Machine Learning
●Supervised learning is the types of machine learning in which machines are trained
using well "labelled" training data, and on basis of that data, machines predict the output.
The labelled data means some input data is already tagged with the correct output.
●In supervised learning, the training data provided to the machines work as the supervisor
that teaches the machines to predict the output correctly. It applies the same concept as a
student learns in the supervision of the teacher.
●Supervised learning is a process of providing input data as well as correct output data to
the machine learning model. The aim of a supervised learning algorithm is to find a
mapping function to map the input variable(x) with the output variable(y).
●In the real-world, supervised learning can be used for Risk Assessment, Image
classification, Fraud Detection, spam filtering, etc.
Supervised Machine Learning
1. Regression
Regression algorithms are used if there is a relationship between the input
variable and the output variable.
It is used for the prediction of continuous variables, such as Weather
forecasting, Market Trends, etc.
Below are some popular Regression algorithms which come under supervised
learning:
Linear Regression
Regression Trees
Non-Linear Regression
Bayesian Linear Regression
Polynomial Regression
2. Classification
Classification algorithms are used when the output variable is categorical,
which means there are two classes
such as Yes-No, Male-Female, True-false, etc.
Spam Filtering,
Random Forest
Decision Trees
Logistic Regression
Support vector Machines

Advantages of Supervised learning:
With the help of supervised learning, the model can predict the output on the
basis of prior experiences.
In supervised learning, we can have an exact idea about the classes of objects.
Supervised learning model helps us to solve various real-world problems
such as fraud detection, spam filtering, etc.
Disadvantages of supervised learning:
Supervised learning models are not suitable for handling the complex tasks.
Supervised learning cannot predict the correct output if the test data is
different from the training dataset.
Training required lots of computation times.
In supervised learning, we need enough knowledge about the classes of object.

Unsupervised Machine Learning
“Unsupervised learning is a type of machine learning in which
models are trained using unlabeled dataset and are allowed to
act on that data without any supervision”
Unsupervised learning is helpful for finding useful insights from the data.
●
Unsupervised learning is much similar as a human learns to think by their own

●
experiences, which makes it closer to the real AI.
Unsupervised learning works on unlabeled and uncategorized data which

●
make unsupervised learning more important.
In real-world, we do not always have input data with the corresponding output so
to solve such cases, we need unsupervised learning.

Classification of unsupervised learning
Classification of unsupervised learning
Clustering:
Clustering is a method of grouping the objects into clusters such that objects with
most similarities remains into a group and has less or no similarities with the objects of another
group. Cluster analysis finds the commonalities between the data objects and categorizes them
as per the presence and absence of those commonalities.
Association:
An association rule is an unsupervised learning method which is used for finding the relationships
between variables in the large database. It determines the set of items that occurs together in the
dataset.
Association rule makes marketing strategy more effective.
Such as people who buy X item (suppose a bread) are also tend to purchase Y (Butter/Jam) item.
A typical example of Association rule is Market Basket Analysis.

Advantages of Unsupervised Learning
Unsupervised learning is used for more complex tasks as compared to supervised

●
learning because, in unsupervised learning, we don't have labeled input data.
Unsupervised learning is preferable as it is easy to get unlabeled data in comparison to

●
labeled data.
Disadvantages of Unsupervised Learning
Unsupervised learning is intrinsically more difficult than supervised learning as

●
it does not have corresponding output.
The result of the unsupervised learning algorithm might be less accurate as input data is
●
not labeled, and algorithms do not know the exact output in advance.
Difference between Supervised & Unsupervised learning
Supervised Vs Unsupervised learning
Supervised Vs Unsupervised learning

Unit 4-Regression and Learning

Uploaded by

Copyright:

Available Formats

Unit 4-Regression and Learning

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Unit 4-Regression and Learning

Uploaded by

Copyright:

Available Formats

221601404

variable and the output variable.

It is used for the prediction of continuous variables, such as Weather

forecasting, Market Trends, etc.

Bayesian Linear Regression

Classification algorithms are used when the output variable is categorical,

which means there are two classes

such as Yes-No, Male-Female, True-false, etc.

Support vector Machines

basis of prior experiences.

Supervised learning model helps us to solve various real-world problems

such as fraud detection, spam filtering, etc.

Disadvantages of supervised learning:

different from the training dataset.

Training required lots of computation times.

In supervised learning, we need enough knowledge about the classes of object.

Unsupervised learning is much similar as a human learns to think by their own

experiences, which makes it closer to the real AI.

Unsupervised learning works on unlabeled and uncategorized data which

make unsupervised learning more important.

to solve such cases, we need unsupervised learning.

as per the presence and absence of those commonalities.

Association rule makes marketing strategy more effective.

A typical example of Association rule is Market Basket Analysis.

Unsupervised learning is used for more complex tasks as compared to supervised

learning because, in unsupervised learning, we don't have labeled input data.

Unsupervised learning is preferable as it is easy to get unlabeled data in comparison to

Disadvantages of Unsupervised Learning

Unsupervised learning is intrinsically more difficult than supervised learning as

it does not have corresponding output.

You might also like