Correlation & Regression
Correlation & Regression
Correlation & Regression
• Types.
• Correlation Coefficient.
Defining Correlation
Positive Correlation:
• Both variables moves in the same direction.
• Association between variables such that high scores on one variable
tend to have high scores on the other variable.
Ex: supply and price.
Negative Correlation:
• Both variables moves in the opposite direction.
• Association between variables such that high scores on one variable
tend to have low scores on the other variable
Ex: demand and price.
Scatter Diagram:
• A scatter diagram is a graphical tool for analyzing
correlation between two variables.
Positive
Positive
linear Correlation
Curvilinear Correlation
Y Y
X X
Negative Curvilinear
correlation
Y Negative Y
linear correlation
X X
Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
Scatter Plots
No relationship
Zero Correlation
X
X
Correlation Coefficient
Need for Correlation Coefficient
• Scatter diagram give only a rough idea of relationship between
two variables.
• It can not give a precise quantitative measure of correlation
between two variables.
• We use a statistic called correlation coefficient which is a pure
number that quantifies the relationship between two variables.
The Carl Pearson Correlation Coefficient:
• A statistic that quantifies a linear relation between two continuous
variables.
• Most widely used coefficient in the literature
𝐶𝑜𝑣 𝑋𝑌
rxy = σ𝑋 σ𝑌
shortcut method
Assumptions of Pearson’s Correlation Coefficient:
1 7 12
2 6 8
3 8 12
4 5 10
5 6 11
6 9 13
Age Weight
Serial
(years) (Kg) XY X2 Y2
no.
(x) (Y)
1 7 12 84 49 144
2 6 8 48 36 64
3 8 12 96 64 144
4 5 10 50 25 100
5 6 11 66 36 121
6 9 13 117 81 169
Total ∑x= ∑Y= ∑ xY= ∑x2= ∑y2=
41 66 461 291 742
calculation
r = 0.759
2. Hypotheses:
Ho : ρ = 0
H1 : ρ ≠ 0 (two-tailed),
: ρ > 0 or ρ < 0 (one-tailed)
where ‘ρ’ is the population correlation coefficient.
𝑟
3. Test statistic: t = ( 𝑛−2 ) with (n – 2) d.f
1−𝑟2
where ‘ r ’ = sample correlation coefficient.
Example:1
Suppose twenty observations on prices and
quantity sold yielded a correlation coefficient of
0.62. Can we infer that the price and quantity
sold are correlated? Test at 5% level.
Spearman Rank Correlation Coefficient
It is a non-parametric measure of correlation.
This procedure makes use of the two sets of ranks that may
be assigned to the sample values of x and Y.
i i
∑ di2=64
Problem.II
The following are the ranks given by two judges X & Y for 12
contestants(A, B,…..) in a singing competition. Find out whether
the judges are in agreement.
S.NO A B C D E F G H I J K L
X 1 9 2 10 3 11 8 4 12 7 5 6
Y 2 9 1 7 4 10 8 3 12 6 5 11
Some of the many
Types of Correlation Coefficients