Lesson 2: Multiple Linear Regression Model (I) : E L F V A L U A T I O N X E R C I S E S
Lesson 2: Multiple Linear Regression Model (I) : E L F V A L U A T I O N X E R C I S E S
Lesson 2: Multiple Linear Regression Model (I) : E L F V A L U A T I O N X E R C I S E S
ESTIMATION
( M LE) .
SELF-EVALUATION EXERCISES
OBJECTS:
In this lesson we study the specification for the multiple linear
regression model (MLRM) that establishes a linear relation between
the endogenous variable and the exogenous variables.
Then, we will see the basic assumptions underlying the model and
we will estimate the parameters of the model. Finally, we will
present the steps to be followed in order to validate the estimated
model: the determination coefficient and its correction, and the
measures which allow evaluating the goodness of fit. We then
analyze the economic and statistical significance of the parameter
estimated.
KEYWORDS:
Endogenous and exogenous variables, parameters, error term, basic
assumptions, OLS estimation, ML estimation, statistical properties
of the estimators, residuals, estimated variance of the error term,
goodness of fit, R2 coefficient, economic significance, individual or
joint statistical significance.
REFERENCES:
Wooldridge, Jeffrey M. Introductory Econometrics: A Modern
Approach, 5th Edition Michigan State University. ISBN-10:
1111531048 ISBN-13: 9781111531041. 2012. Chapter 2, 3 (no
multicollinearity), 4 until 4.4, approximatively.
William H. Greene, Econometric Analysis, 7/E. Stern School of
Business, New York University. Prentice Hall. 2012. Chapter 2, 3
(3.1, 3.2, 3.5, 3.6), 4 (4.1-4.8), 5 (5.1-5.2), 17 (17.1-17.4).
y = f ( x1 , x 2 , x 3 ,..., x K )
the
causality
y = 1 + 2 x2 + 3x3 +...+ k x k
independent term or
constant
y
j =
x j
j = 2 ,3,..., k
Parameters o coefficients
It measures the amount by which y
changes when x increases by
one unit.
Deterministic Relationship
Stochastic Relationship
Ci = 1 + 2 I i
It implies a deterministic relationship. It supposes that once
you know the value of the Income is possible to know
exactly the amount of Consumption. Moreover, it supposes
that all families with the same Income have the same
Consumption level. However, the reality is different:
C
x
x
x
x
x
x
x
x
x
x
x
Ci = 1 + 2 I i
x
x
Stochastic (random)
part: not observable
y = 1 + 2 x2 + 3x3 +...+ k x k + u
y
Explained Variable
xj : Exogenous Variable or Independent variable (j=2,3,...,k)
or Explanatory Variable
u : Error term or disturbance
1 : Coefficient of the "constant" (independent term)
j : Other coefficients or parameters
(j=2,3,...,k)
Cross sectional
y i = 1 + 2 x 2 i + 3 x 3i +...+ k x ki + u i
Two types of
data
i=1,2,...,N
Time series
y t = 1 + 2 x 2 t + 3 x 3t +...+ k x k t + u t
t=1,2,...,T
y 1 = 1 + 2 x 21 + 3 x 31 +...+ k x k1 + u 1
y 2 = 1 + 2 x 22 + 3 x 32 +...+ k x k 2 + u 2
y 3 = 1 + 2 x 23 + 3 x 33 +...+ k x k 3 + u 3
...
y N = 1 + 2 x 2 N + 3 x 3N +...+ k x kN + u N
y1 1 x 21
y 1 x
22
2 =
... ... ...
yN 1 x2 N
x 31
x 32
...
x 3N
... x k 1 1 u1
... x k 2 2 u 2
+
... ... ... ...
... x kN k u N
Yi = ALi 1 Ki 2
LnYi = LnA + 1 LnL i + 2 LnK i
Inherently nonlinear models: they cannot be made
linear. They are nonlinear with respect to the
parameters.
3) The size of the data sample. The minimum requirement
is that the number of observations is higher or equal than the
numbers of parameters to be estimated. This is the minimum
requirement, it is advisable however to have a big sample
size to guarantee a good and feasible estimation.
8
Nk
Nk 0
Degrees of freedom
E ( U) = 0 Nx1
E ( u1 ) 0
E( u ) 0
2
E ( U) =
=
... ...
E( u N ) 0
VAR ( u i ) = 2u
Homoskedasticity assumption
VAR ( u i ) = 2ui
VAR ( u j ) = 2uj
)] [
COV( u i , u j ) = E ( u i E ( u i )) u j E ( u j ) = E u i u j = 0
i, j = 1,2, ... , N i j
10
u1
2
= VAR(U ) = E [UU '] = E (u1 u 2 ... u N ) =
...
u N
u1u1 u1u 2
u 2 u1 u 2u 2
=E
...
...
u N u1 u N u 2
u2 0
0 u2
=
... ...
0
0
0
1
0
0
2
=
u
...
... ...
0
... u2
...
...
( )
2
... u1u N due E ui = VAR ( ui ),
... u 2u N E ( ui u j ) = COV ( ui u j ) i
=
...
... dueBasic Hypotheses :
0 ... 0
1 ... 0
2
=
I NxN
u
... ... ...
0 ... 1
= VAR ( U) = 2u I N
is a square matrix, symmetric and positive definite.
11
( )
E ui u j =
2u
i= j
i j
of
2
U ~ N(0, u IN)
E x ji u i = 0
i = 1,2 ,..., N
j = 1,2 ,..., k
12
(X)=k
E ( Y ) = E ( X + U ) = X + E ( U ) = X
VAR ( Y ) = E [ ( X + U ) E ( Y )] 2 =
( )
= E [ ( X + U ) X ] 2 = E U 2 = u2
Therefore Y follows a normal distribution with expected
value X and variance u .
2
13
Econometrics II
Heteroskedasticity
Autocorrelation
No normality of the error term U
Econometrics III
Perfect Multicollinearity
Measurement Errors
Specification Error of the X
Structural Change
Econometrics I - Lesson 5
Econometrics I - Lesson 6
14