BUAD 284 Linear Regression Primer
The Basics
At a very basic level, regression analysis is simply a statistical modeling approach
to ascertain the relationship among certain variables of interest. This is best
understood by example.
Suppose we have a simple linear demand curve that takes the form of P = 20 – 4Q.
Note this is just a form of the equation Y = mX + b, which is the equation of a line.
Graphically, this equation represents the demand curve from our supply and
demand analysis. For our analysis, we are going to rewrite this equation by solving
for Q (economists call P = 20 – 4Q the inverse demand curve; solving for Q gives the
demand function): Q = 5 – 0.25P, which is graphed accordingly.
The interpretation is that a 1 dollar increase in the price causes quantity demanded
to decrease by 0.25 units, or a 4 dollar increase in price causes the quantity
demanded to decrease by 1 unit. This interpretation is nothing more than
describing the slope coefficient (the m in Y = mX + b), which represents
rise/run, or ΔY/ΔX. In this example, ΔY/ΔX is actually ΔQ/ΔP. This is just the
application and understanding of the equation for a line.
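The slope interpretation can be checked numerically. Here is a quick Python sketch (the function name is ours, purely for illustration):

```python
def quantity_demanded(price):
    """Demand function Q = 5 - 0.25P from the example above."""
    return 5 - 0.25 * price

# A $1 price increase lowers quantity demanded by 0.25 units...
assert quantity_demanded(9) - quantity_demanded(8) == -0.25

# ...so a $4 price increase lowers it by a full unit.
assert quantity_demanded(12) - quantity_demanded(8) == -1.0
```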
So what does this have to do with regression analysis? While the above theoretical
example of a demand curve is nice and clean, how does this notion apply in the real
world with actual data? We don’t have perfect linear demand curves in the real
world. In fact, all we have, usually, is a bunch of data that represents the
combination of price and quantity points for a particular good/service over time.
For example, let’s use the “Where’s the Beef?” data set to plot out the monthly data
for the price ($/lb.) and quantity (index 2001=100) of ground chuck roast from
January 2001 through July 2005, with quantity on the vertical axis and price on the
horizontal. In the real world, this quantity and price relationship is represented by
this collection of data points. How do we reconcile this data with the nice, clean
theoretical graph from the previous page? If we want to estimate a demand curve
for ground chuck roast, we need to find a line that best represents this data. In fact,
we want to find a “line of best fit”. The goal of regression analysis (at a very basic
level) is to estimate that line!
So, how do we fit this line? We want to minimize the distance between the
observations (points) and the estimated regression line across the entire sample.
The distance between an observation and the regression line is the error or the
residual. The “line of best fit” is the line that minimizes the sum of these squared
errors.1
At the end of the day, all we are trying to do is estimate a line to represent this
data. In this case, the “line of best fit” is represented by the following equation:
𝑄̂ = 248 − 56.74𝑃
We can interpret the slope to say something about the relationship between P and
Q. Specifically, on average, a $1 increase in the price of ground chuck roast (per
pound) leads to a 56.74 index point decrease in the quantity demanded of ground
chuck roast. This interpretation corresponds directly to the theoretical
interpretation from the previous example, only now we have derived the relationship
from actual data. That is regression analysis.

1 Why squared? We want to treat positive and negative errors the same when summing, and squaring makes every error positive.
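As a sketch of how such a fit is produced in practice, here is a short Python example using numpy. The price and quantity numbers below are made up for illustration; they are not the actual "Where's the Beef?" data, so the estimates will not match the equation above exactly.

```python
import numpy as np

# Hypothetical monthly observations: price ($/lb.) and quantity (index).
# Invented numbers with a similar negative relationship, NOT the real data.
price    = np.array([1.50, 1.75, 2.00, 2.25, 2.50, 2.75])
quantity = np.array([160.0, 150.0, 135.0, 122.0, 108.0, 95.0])

# np.polyfit with degree 1 fits a line by ordinary least squares,
# returning (slope, intercept).
slope, intercept = np.polyfit(price, quantity, 1)

print(f"Q-hat = {intercept:.1f} - {abs(slope):.1f} * P")
```

Each additional dollar per pound is associated with a drop of roughly |slope| index points, mirroring the interpretation of the fitted equation above.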
Diving Deeper
Let’s formalize the concepts from above. Managerial decisions are typically based on
the relationship between two or more variables. Regression analysis can be used to
derive an equation describing how said variables are related. The variable being
predicted is called the dependent variable.2 There is only one dependent variable in
the model. The variable(s) used to explain this predicted value is referred to as the
independent variable(s).3 A simple linear regression model that contains only one
independent variable can be expressed as:
𝑦𝑖 = 𝛽0 + 𝛽1 𝑥𝑖 + 𝜀𝑖
The subscript i denotes an individual observation from your sample of size n, where
i = 1, 2, 3, … n. Note that 𝑦 represents the dependent variable, 𝑥 represents the
independent variable, 𝛽0 and 𝛽1 are parameters, 𝛽0 being the intercept and 𝛽1 the
slope, and 𝜀 is an error term that accounts for variability in y that cannot be
explained by x. In general, we can have any number of independent variables, I’ll
represent that with k:
𝑦𝑖 = 𝛽0 + 𝛽1 𝑥𝑖1 + 𝛽2 𝑥𝑖2 + ⋯ + 𝛽𝑘 𝑥𝑖𝑘 + 𝜀𝑖
We estimate the above model using Ordinary Least Squares (OLS) and obtain:
𝑦̂𝑖 = 𝑏0 + 𝑏1 𝑥𝑖1 + 𝑏2 𝑥𝑖2 + ⋯ + 𝑏𝑘 𝑥𝑖𝑘
Note that 𝑦̂𝑖 represents the predicted value of 𝑦𝑖 based upon 𝑏0 , 𝑏1 , 𝑏2 , 𝑒𝑡𝑐. which are
estimates of 𝛽0 , 𝛽1 , 𝛽2, and so on. In a nutshell, we fit a line (surface in the case of 2
or more independent variables) such that the squared error between actual 𝑦 and
predicted 𝑦 is as small as possible:
min ∑ᵢ₌₁ⁿ (𝑦ᵢ − 𝑦̂𝑖)²
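This minimization has a well-known closed-form solution. A minimal numpy sketch using simulated data (all numbers below are made up; in practice you would use a statistics package that also reports p-values and R2):

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate n observations of k = 2 independent variables, with known
# parameters beta0 = 3, beta1 = 2, beta2 = -1 plus random noise.
n = 200
X = rng.normal(size=(n, 2))
eps = rng.normal(scale=0.1, size=n)
y = 3.0 + 2.0 * X[:, 0] - 1.0 * X[:, 1] + eps

# Prepend a column of ones so the intercept b0 is estimated too.
X1 = np.column_stack([np.ones(n), X])

# Solve min sum (y_i - y_hat_i)^2; lstsq is numerically safer
# than inverting X'X by hand.
b, *_ = np.linalg.lstsq(X1, y, rcond=None)

print(b)  # b0, b1, b2 -- close to 3, 2, -1
```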
The above minimization problem leads to what is referred to as Ordinary Least
Squares (OLS) regression. This is the standard technique and true workhorse of
linear regression modeling. In general, we are interested in the following
information from the output of our regression:
1) Sign and magnitude of our estimated coefficients 𝑏0 , 𝑏1 , 𝑏2 , 𝑒𝑡𝑐.
2) Statistical significance of our estimated coefficients
2 The dependent variable is usually denoted using the letter y. It is also sometimes called the left-hand side variable.
3 The independent variable(s) is usually denoted using the letter x. It is also sometimes called the right-hand side variable(s)
or explanatory variable(s).
a. Are the estimated coefficients statistically different from zero? Look for
P-values ≤ 0.05, as those coefficients are statistically different from zero
with 95% confidence.
3) Goodness of fit of our model
a. Check adjusted R2. Like R2, it tells us the share of the variability in y
explained by the model (R2 ranges between 0 and 1), but adjusted R2 also
penalizes the model for adding more independent variables.
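For concreteness, R2 and adjusted R2 can be computed directly from the residuals. A small Python sketch (the function names are ours, not from any particular package):

```python
import numpy as np

def r_squared(y, y_hat):
    """Share of the variability in y explained by the model."""
    ss_res = np.sum((y - y_hat) ** 2)       # unexplained (residual) variation
    ss_tot = np.sum((y - np.mean(y)) ** 2)  # total variation in y
    return 1 - ss_res / ss_tot

def adjusted_r_squared(y, y_hat, k):
    """R^2 penalized for the number of independent variables k."""
    n = len(y)
    return 1 - (1 - r_squared(y, y_hat)) * (n - 1) / (n - k - 1)

y     = np.array([3.0, 5.0, 7.0, 9.0])
y_hat = np.array([3.1, 4.9, 7.2, 8.8])  # predictions from some fitted model

print(r_squared(y, y_hat))               # 0.995
print(adjusted_r_squared(y, y_hat, k=1))
```

A perfect fit gives R2 = 1; predicting nothing better than the mean of y gives R2 = 0.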