Lecture1 Introduction
Lecture1 Introduction
Zhuzhu Zhou
Xiamen University
Introduction and Review
1
Economic Questions and Data
Econometrics is the science and art of using economic theory
and statistical techniques to analyze economic data.
2
Economic Questions
4
Economic Questions: Causal Effect
6
Data Types
7
Data Types
8
Review of Probability
Review of Probability
9
1. Random Variables and Probability Distributions
10
Probabilities and Outcomes
11
Random Variables
13
Probability Distribution of a Continuous Random
Variable
14
2. Expected Values, Mean, and Variance
15
Expected Value
16
Expected Value
17
Variance and Standard Deviation
18
Variance and Standard Deviation
19
Mean and Variance of a Linear Function of a Random
Variable
Y = a + bX
• Mean: µY = a + bµX
• Variance: σY2 = b 2 σX2
20
Other Measures of the Shape of a Distribution
E [(Y − µY )3 ]
Skewness =
σY3
21
Other Measures of the Shape of a Distribution
E [(Y − µY )4 ]
Kurtosis =
σY4
22
3. Two Random Vairables
23
Joint Distributions
Discrete case : Pr (X = x, Y = y )
Continuous case : fXY (x, y )
24
Marginal Distributions
25
Joint and Marginal Distributions
26
Conditional Distributions
27
Conditional Distributions
28
Conditional Distributions
E (Y ) = E (E (Y |X ))
Z Z
= yfY |X dy fX dx
E (Y |Z ) = E (E (Y |X , Z )|Z )
30
Independence, Covariance and Correlation
• Mean independent.
E (Y |X ) = E (Y ) then cov (X , Y ) = 0
Proof : E (XY ) = E (E (XY |X )) = E (XE (Y |X )) =
E (XE (Y )) = E (X )E (Y )
• Relationship between Independent, Mean Independent
and Uncorrelated.
31
Means, Variances, and Covariances of Sums of Ran-
dom Variables
32
4. The Normal, Chi-Squared, Student t, and F
Distributions
33
The following distributions are used when testing certain types
of hypothesis in Econometrics.
• Normal Distribution
• Chi-Squared
• Student t
• F Distribution
34
Normal Distribution
1 1 x − µ 2
fX = √ exp − ( )
2πσ 2 2 σ
35
Normal Distribution
36
Normal Distribution
37
Chi-Squared Distribution
38
Student t Distribution
39
F Distribution
40
5. Random Sampling and Distribution of Sample
Average
41
Almost all the statistical and econometric procedures used in
this book involve averages or weighted averages of a sample
of data. Therefore understanding the distributions of sample
averages is important.
42
• Simple random sampling: n objects are selected at
random from a population. Each member of the
population is equally likely to be included in the sample.
• Y1 , ..., Yn can be treated as random variables. (The
randomness comes from sampling. If we are able to
sample again, the values of these variables will be
different.) Before being sampled, they can take many
values; after being sampled, a specific value is taken.
• When Y1 , ..., Yn are drawn from the same distribution and
are independently distributed, they are said to be
independently and identically distributed (i.i.d.).
43
Sample Average
44
Sample Average
• Mean of Ȳ , E (Ȳ )
n
1X
E (Ȳ ) = E (Yi ) = µY = E (Y )
n 1
• Variance of Ȳ
n
1X
var(Ȳ ) =var( Yi )
n i=1
n n n
1 X 1 X X
= 2 var(Yi ) + 2 cov(Yi , Yj )
n i=1 n i=1 j=1,j̸=i
var(Y )
=
n
45
6. Large-Sample Approximations to Sampling
Distributions
46
Large-Sample Approximations to Sampling Distribu-
tion of Ȳ
47
Law of Large Numbers
48
Central Limit Theorem
49
Review of Statistics
Review of Statistics
50
The key insight of statistics is that one can learn about a
population distribution by selecting a random sample from
that population.
51
Three types of statistical methods frequently used in
Econometrics.
• Estimation
• Hypothesis Testing
• Confidence Intervals
52
1. Estimation of Population Mean
53
Estimators
54
Estimators
55
Properties of Estimators
• E(Ȳ ) = µY , Ȳ is unbiased.
p
• µ
bY → µY , Ȳ is consistent.
• Ȳ is the most efficient among linear unbiased estimators.
best linear unbiased estimator (BLUE).
Compare with Y1 and
Ye = n1 ( 12 Y1 + 23 Y2 + ... + 21 Yn−1 + 32 Yn ).
var(Ye ) = 1.25σY2 /n
57
2. Hypothesis Tests Concerning Population Mean
58
Null and Alternative Hypotheses (two-sided)
H0 : E(Y ) = µY ,0
H1 : E(Y ) ̸= µY ,0
59
Null and Alternative Hypotheses (two-sided)
60
p-value
61
p-value
62
p-value
63
Calculating p-Value
• When σY is known:
h i
p-value =PrH0 |Ȳ − µY ,0 | > |Ȳ act − µY ,0 |
h Ȳ − µ Ȳ act − µ i
Y ,0 Y ,0
=PrH0 >
σȲ σȲ
Ȳ act − µ
Y ,0
=2Φ −
σȲ
64
t-Statistic
Ȳ − µY ,0
t=
SE(Ȳ )
65
Hypothesis Testing with a Prespecified Significance
Level
68
3. Confidence Intervals for the Population Mean
69
Confidence Intervals
70