Unit 3 Standard Sampling Distributions-I: Structure
Unit 3 Standard Sampling Distributions-I: Structure
DISTRIBUTIONS-I
Structure
3.1 Introduction
Objectives
3.2 Introduction to 2-distribution
3.3 Properties of 2-distribution
Probability Curve of 2-dsistribution
Mean and Variance of 2-distribution
3.4 Applications of 2-distribution
3.5 Introduction to t-distribution
3.6 Properties of t-distribution
Probability Curve of t-distribution
Mean and Variance of t-distribution
3.7 Applications of t-distribution
3.8 Summary
3.9 Solutions /Answers
3.1 INTRODUCTION
In Unit 1 of this block, we have discussed the fundamentals of sampling
distributions and some basic definitions related to it. In Unit 2, we have
discussed the sampling distributions of various sample statistics such as
sample mean, sample proportion, sample variance, etc. Sometimes statistics
such as sample mean, sample proportion, sample variance, etc. may follow a
particular sampling distribution. Based on this fact Prof. R.A. Fisher, Prof.
G. Snedecor and some other statisticians worked in this area and found some
exact sampling distributions which are often followed by some of the
statistics. In present and next unit of this block, we shall discuss some
important sampling distributions such as 2(read as chi-square), t and F.
Generally, these sampling distributions are named on the name of the
originator, that is, Fisher’s F-distribution is named on Prof. R.A. Fisher.
This unit is divided in 9 sections. Section 3.1 is introductive in nature. In
Section 3.2 the brief introduction of 2-distribution is given. The different
properties and probability curves for various degrees of freedom n = 1, 4, 10
and 22 are described along with the mean and variance in Section 3.3. The
distribution of 2 is used in testing the goodness of fit, independence of
attributes, etc. Therefore, in Section 3.4 the important applications of 2-
distribution are listed. In Section 3.5, the brief introduction of t-distribution
is described. Properties of t-distribution, probability curve, the general
formula for central moment, mean and variance are described in Section 3.6.
Different applications of t-distribution are listed in Section 3.7. Unit ends by
providing summary of what we have discussed in this unit in Section 3.8 and
solution of exercises in Section 3.9.
Objectives
After studying this unit, you should be able to:
explain the 2-distribution;
53
Sampling Distributions describe the properties of 2-distribution;
explain the probability curve of χ2-distribution;
find the mean and variance of 2-distribution;
explore the applications of 2-distribution;
explain the t-distribution;
describe the properties of t-distribution;
describe the probability curve of t-distribution;
find the mean and variance of t-distribution; and
explore the applications of t-distribution.
The restriction (i) reveals sum of observations equal to 2000 (ii) reveals sum
of square of observations equal to 10000 and restriction (iii) implies fifth
observation double to the fourth observation.
With these three restrictions (i), (ii) and (iii), there shall be only 47
observations independently chosen and rest three could be obtained by
solving equations or by adjustments. The degrees of freedom of this set of
observations of size 50 is now (50 –3) = 47.
Chi-square Distribution
The chi-square distribution is first discovered by Helmert in 1876 and later
independently explained by Karl- Pearson in 1900. The chi-square
distribution was discovered mainly as a measure of goodness of fit in case of
frequency.
54
If a random sample X1, X2, …, Xn of size n is drawn from a normal Standard Sampling
population having mean and variance σ2 then the sample variance can be Distributions -I
defined as
1 n 2
S2
n 1 i 1
Xi X
or
n
2
X
i 1
i X n 1 S2 S2
n n 2
X i i
Therefore, 2 Z2i ~ 2( n )
i 1 i 1 i
55
Sampling Distributions 1 2 n / 2 1
f 2 e /2
2
; 0 2 … (2)
n/2 n
2
2
Try the following exercise to make sure that you have understood the chi-
square distribution.
E1) Write down the pdf of chi-square distribution in each of the following
cases:
(i) 6 degrees of freedom
(ii) 10 degrees of freedom
E2) Below, in each case, the pdf of chi-square distribution is given.
Obtain the degrees of freedom of each chi-square distribution:
1 2 / 2 2 3
(i) f 2 e ; 0 2
96
1 2 / 2
(ii) f 2 e ; 0 2
2
2. Chi-square distribution has only one parameter n, that is, the degrees of
freedom.
3. Chi-square probability curve is highly positive skewed.
4. Chi-square-distribution is a uni-modal distribution, that is, it has single
mode.
5. The mean and variance of chi-square distribution with n df are n and 2n
respectively.
After deliberating the properties of 2-distribution above, some of these are
discussed in detailed in the subsequent sub-sections.
56
Standard Sampling
Distributions -I
n n n
1
1 2 2 2 n 2 n 1 n 1
2
n 1 n 1 n n
2
2
n n
2 2 1 n n 2
2 2
Now, we obtain the value of variance by putting the value of first order and
second order moments about origin in the formula given below
2
Variance 2 1
2
n n 2 n n 2 2n n2 2n
So the mean and variance of the chi-square distribution with n df are given
by
Mean = n and Variance = 2n
Hence, we have observed that the variance is twice of the mean of chi-square
distribution.
Similarly, on putting r = 3, 4,… in the formula given in equation (3) one may
get higher order moments about origin such as 3 , 4 ,...
Example 1: What are the mean and variance of chi-square distribution with
5 degrees of freedom?
Solution: We know that the mean and variance of chi-square distribution
with n degrees of freedom are
Mean = n and Variance = 2n
In our case, n = 5, therefore,
Mean = 5 and Variance = 10
To evaluate your understanding try to answer the following exercises.
58
E3) What are the mean and variance of chi-square distribution with 10 Standard Sampling
Distributions -I
degrees of freedom?
E4) What are the mean and variance of chi-square distribution with pdf
given below
1 2 / 2 2 3
f 2 e ; 0 2
96
X
Z
/ n
is distributed normally with mean 0 and variance 1, i.e. Z ~ N 0, 1 .
In general, the standard deviation σ is not known and in such a situation the
only alternative left is to estimate the unknown σ2. The value of sample
variance (S2) is used to estimate the unknown σ2 where,
1 n1 2
S2
n 1 i 1
Xi X
X
Thus, in this case the variate is not normally distributed whereas it
S/ n
59
Sampling Distributions follows t-distribution with (n−1) df i.e.
X
t ~ t n1 … (4)
S/ n
n
1 2
where, S2 Xi X
n 1 i1
The t-variate is a widely used variable and its distribution is called student’s
t-distribution on the pseudonym name ‘Student’ of W.S. Gosset. The
probability density function of variable t with (n-1) = ν degrees of freedom
is given by
1
f t 1 / 2
; t … (5)
2
1 t
B , 1
2 2
1
where, B , is known as beta function and
2 2
1
1 a b
B , 2 2 B a, b
2 2 1 a b
2
Later on Prof. R.A. Fisher found that t-distribution can also be applied to test
the regression coefficient and other practical problems. He proposed t-variate
as the ratio of standard normal variate to the square root of an independent
chi-square variate divided by its degrees of freedom. Therefore, if Z is a
standard normal variate with mean 0 and variance 1 and χ2 is an independent
chi-square variate with n df i.e.
Z ~ N 0, 1 and χ 2 ~ χ 2n
60
Solution: Here, we are given that Standard Sampling
Distributions -I
300, n 25, X 290 and S 50
The value of t-variate can be calculated by the formula given below
X
t
S/ n
Therefore, we have
290 300 10
t 1
50/ 25 10
Now, try the following exercises for your practice.
E6) Write down the pdf of t-distribution in each of the following cases:
(i) 3 degrees of freedom
(ii) 9 degrees of freedom
E7) Obtain the degrees of freedom of t-distribution whose pdf is given
below:
1
f t ; t
2 3
1 5 t
5 B , 1
2 2 5
E8) If the scores on an IQ test of the students of a class are assumed to be
normally distributed with a mean of 60. From the class, 15 students
are randomly selected and an IQ test of the similar level is conducted.
The average test score and standard deviation of test scores in the
sample group are found to be 65 and 12 respectively. Calculate the
value of t-variate.
Fig. 3.2: Probability curves for t-distribution at n = 4, 12 along with standard normal
curve
In the figure given above, we have drawn the probability curves of
t-distribution at two different values of degrees of freedom with probability
curve of standard normal distribution. By looking at the figure, one can
easily understand that the probability curve of t-distribution is similar in
shape to that of normal distribution and asymptotic to the horizontal-axis
whereas it is flatter than standard normal curve. The probability curve of the
t-distribution is tending to the normal curve as the value of n increases.
Therefore, for sufficiently large value of sample size n(> 30), the t-
distribution tends to the normal distribution.
3.6.2 Mean and Variance of t-distribution
In previous sub-section, we have discussed the probability curve of
t-distribution. After discussing the probability curve it came to our
knowledge that t-distribution is symmetrical about t = 0 line. Since f (t) is
symmetrical about t = 0 line, therefore, all moments of odd order about
origin vanish or become zero, i.e.
2r 1 0 for all r 0, 1, 2, 3... … (8)
In particular, for r = 0, we get
1 0 Mean
That is, the mean of the t-distribution is zero. Similarly, one can show that
other odd moments about origin are zero by putting r = 1, 2,….
The moments of even order are given by
2 r t 2 r f t dt
Now, we can also write If f(x
abou
2r 2 t 2r f t dt
0
1 t 2r
2 n 1 / 2
dt
1 n 0
t2
n B , 1
2 2 n
2 r Standard Sampling
2 t … (9) Distributions -I
1 n 0
t2
n 1 / 2 dt
n B , 1
2 2 n
t2 1 1
Putting 1 t 2 n 1
n y y
n n n
Therefore, 2t dt 2
dy 2dt 2 dy dy
y ty 1
n 1 y 2
y
Also t 0 y 1 and t y 0
By putting these values in equation (9), we get
r
1
n y 1
1 0
n
2r n 1 / 2
. dy
1 n 1
1 1 2
n B , n 1 y
2 2 y
y 0 f x dx a f x dx
a 0
1
r
n 1
n 1
1 2
2
2
n 1 y dy
1 n 0 y
B ,
2 2
1
r n 1
nr 1 y
1 2
2
2
0 y y dy
1 n
B ,
2 2
n 1 1
nr 1 r
1 2 r
1 n 0 1 y 2 y 2 2
dy
B ,
2 2
n
nr 1 r 1 1
r 1
y2 1 y 2 dy
1 n 0
B ,
2 2
For a 0 & b 0
nr n 1 1 x a 1 1 x b 1 dx
0
B r, r
1 n 2 2
B , B a, b
2 2
1 n n 1
r r
nr 2 2 2 2 ; n r 0 B a, b a b
1 n n 1 2 ab
rr
2 2 2 2
n 1
r r
2r n r 2 2; n 2r …(10)
1 n
2 2
63
Sampling Distributions Now, if we put r = 1 in the above expression given in equation (10), we get
the value of second order moment about origin i.e.
n 1
1 1
2 n 2 2; n2
1 n
2 2
n 1 1
1
n 2 2 2 n 1 n n
1 n n
1 1
2 2 2
n
; n2
n 2
Now, we obtain the value of variance by putting the value of first order and
second order moments about origin in the formula given below
2
Variance 2 1
n 2
0
n 2
n
; n2
n 2
So the mean and variance of the t-distribution with n df are given by
n
Mean = 0 and Variance ; n2
n 2
Now, you can try an exercise to see how much you learn about the properties
of t-distribution.
E9) What are the mean and variance of t-distribution with 20 degrees of
freedom?
64
Now, it is time to write down the main applications of t-distribution by Standard Sampling
answering the following exercise. Distributions -I
3.8 SUMMARY
In this unit, we have covered the following points:
1. The chi-square distribution.
2. The properties of χ2-distribution.
3. The probability curve of χ2-distribution.
4. Mean and variance of χ2-distribution.
5. Applications of χ2-distribution.
6. Student’s-t and Fisher’s t-distributions.
7. Properties of t-distribution.
8. The probability curve of t-distribution.
9. Mean and variance of t-distribution.
10. Applications of t-distribution.
1 2 6 / 2 1
f 2 e /2
2
; 0 2
6
26 / 2
2
1 2 / 2 2 2
8 3
e
1 2 / 2 2 2
16
e ; 0 2 3 2 2
1 2 10 / 2 1
f 2 e /2
2
; 0 2
10
210 / 2
2
1 2 4
e / 2 2
32 5
65
Sampling Distributions 1 2 / 2 2 4
768
e ; 0 2 5 4 24
66
1 Standard Sampling
f t n 1 / 2
; t Distributions -I
2
1 n t
nB , 1
2 2 n
(i) In our case, n = 3, therefore, for n = 3 df the pdf of t-distribution
is given by
1
f t 31 / 2
; t
2
1 3 t
3 B , 1
2 2 3
1 3
2 2
2
1 3 t2
3 1
2 2 3
2 n n n
2 1 1
1 1 1 t2 2 2 2
3 1
2 2 2 3
2 1
2
; t and 2 1
t2 2
3 1
3
(ii) Similarly, for n = 9 df the pdf of t-distribution is given by
1
f t 91 / 2
; t
2
1 9 t
9 B , 1
2 2 9
1 9
2 2
5
1 9 t2
3 1
2 2 9
5 n n n
5 1 1
1 7 5 3 1 1 t2 2 2 2
3 1
2 2 2 2 2 2 9
24 1
5 and 5 4 24
2
2
105 t
3 1
16 9
128
5
; t
t2
105 1
9
1
51 / 2
; t
2
1 5 t
5 B , 1
2 2 5
Comparing with equation (12), we get n = 5. So degrees of freedom
of t-distribution are 5.
E8) Here, we are given that
60, n 15, X 65 and S 12
We know that the t-variate is
X
t
S/ n
Therefore, we have
65 60 5
t 1.14
12 / 15 4.39
E9) We know that the mean and variance of t-distribution with n degrees
of freedom are
n
Mean = 0 and Variance ; n2
n 2
In our case, n = 20, therefore,
20 10
Mean = 0 and Variance
20 2 11
E10) Refer Section 3.7.
68