Unit 4-Regression and Learning
Unit 4-Regression and Learning
Unit 4-Regression and Learning
Data Science
Unit – 4 Introduction to Correlation , Regression & Learning Correlation Analysis
Introduction
Significance
Types
Positive and Negative
Linear and Non-linear
Sample , Partial and Multiple
Measurement Of Correlation
Karl Pearson’s Coefficient Of Co-relation
Regression analysis
Introduction
Types Of Regression
Simple and Multiple
Linear and Non – linear
Supervised & Unsupervised Learning
Supervised learning
Classification of supervised learning
Advantages and Disadvantages of Supervised
learning
Unsupervised learning
Classification of Unsupervised learning
Advantages and Disadvantages of UnSupervised
learning
Difference between Supervised & Unsupervised learning
Regression
Regression analysis is a statistical method that
examines the relationship between one or more
independent variables and a dependent variable. It's
commonly used for prediction and understanding the
strength and nature of relationships between variables.
Regression analysis
Types of Regression
Simple Regression: This involves one independent
variable predicting a dependent variable. It's like
predicting someone's height based solely on their age.
Multiple Regression: Here, we have multiple
independent variables predicting a dependent variable.
Imagine predicting someone's weight based on age,
height, and maybe even their daily pizza intake.
Types of Regression
Linear Regression: This type assumes a linear
relationship between variables, meaning the change in
the dependent variable is proportional to the change in
the independent variable(s). Think of a straight line on
a graph.
Non-Linear Regression: In contrast, this type
acknowledges a non-linear relationship. The
relationship might be curved, like a sine wave or a
parabola, making it a bit trickier to model.
Supervised & Unsupervised Learning
• Supervised learning
• Classification of supervised learning
• Advantages and Disadvantages of Supervised
• learning
• Unsupervised learning
• Classification of Unsupervised learning
• Advantages and Disadvantages of UnSupervised
• learning
• Difference between Supervised & Unsupervised
learning
Supervised Machine Learning
●Supervised learning is the types of machine learning in which machines are trained
using well "labelled" training data, and on basis of that data, machines predict the output.
The labelled data means some input data is already tagged with the correct output.
●In supervised learning, the training data provided to the machines work as the supervisor
that teaches the machines to predict the output correctly. It applies the same concept as a
student learns in the supervision of the teacher.
●Supervised learning is a process of providing input data as well as correct output data to
the machine learning model. The aim of a supervised learning algorithm is to find a
mapping function to map the input variable(x) with the output variable(y).
●In the real-world, supervised learning can be used for Risk Assessment, Image
classification, Fraud Detection, spam filtering, etc.
Supervised Machine Learning
Classification of supervised learning
Classification of supervised learning
1. Regression
Regression algorithms are used if there is a relationship between the input
Below are some popular Regression algorithms which come under supervised
learning:
Linear Regression
Regression Trees
Non-Linear Regression
Polynomial Regression
Classification of supervised learning
2. Classification
Spam Filtering,
Random Forest
Decision Trees
Logistic Regression
With the help of supervised learning, the model can predict the output on the
In supervised learning, we can have an exact idea about the classes of objects.
Supervised learning models are not suitable for handling the complex tasks.
Supervised learning cannot predict the correct output if the test data is
In real-world, we do not always have input data with the corresponding output so
Clustering is a method of grouping the objects into clusters such that objects with
most similarities remains into a group and has less or no similarities with the objects of another
group. Cluster analysis finds the commonalities between the data objects and categorizes them
Association:
An association rule is an unsupervised learning method which is used for finding the relationships
between variables in the large database. It determines the set of items that occurs together in the
dataset.
Such as people who buy X item (suppose a bread) are also tend to purchase Y (Butter/Jam) item.
labeled data.
The result of the unsupervised learning algorithm might be less accurate as input data is
●
not labeled, and algorithms do not know the exact output in advance.
Difference between Supervised & Unsupervised learning
Supervised Vs Unsupervised learning
Supervised Vs Unsupervised learning