Unit 1 Introduction To Sampling Distribution: Structure
Unit 1 Introduction To Sampling Distribution: Structure
DISTRIBUTION
Structure
1.1 Introduction
Objectives
1.2 Basic Terminology
1.3 Introduction to Sampling Distribution
1.4 Standard Error
1.5 Central Limit Theorem
1.6 Law of Large Numbers
1.7 Summary
1.8 Solutions /Answers
1.1 INTRODUCTION
In many situations, if we extract some information from all the elements or
items of a large population in general, it may be time consuming and
expensive. Also, if the elements or items of population are destroyed under
investigation then the information gathering from all the elements is not
making a sense because all the elements will be destroyed. Therefore, in most
of the cases in daily life, business and industry the information is gathered by
means of sampling. The results of a properly taken sample enable the
investigator to arrive at generalisations that are valid for entire population. The
process of generalising sample results to the population is called Statistical
Inference.
For drawing inference about the population parameters, we draw all possible
samples of same size and determine a function of sample values, which is
called statistic, for each sample. The values of statistic are generally varied
from one sample to another sample. Therefore, the sample statistic is a random
variable and follows a distribution. The distribution of the statistic is called
sampling distribution of the statistic.
This unit is divided into 8 sections. Section 1.1 is introductive in nature. In
Section 1.2, we defined the basic terminology used in statistical inference. The
introduction and needs of sampling distributions are described in the Section
1.3. In Section 1.4, we explored the concept of standard error. The most
important theorem of Statistics “Central Limit Theorem” is described in
Section 1.5. In Section 1.6, we explored the concept of law of large numbers.
Unit ends by providing summary of what we have discussed in this unit in
Section 1.7 and solution of exercises in Section 1.8.
Objectives
After studying this unit, you should be able to:
define statistical inference;
define the basic terms as population, sample, parameter, statistic, estimator,
estimate, etc. used in statistical inference;
explain the concept of sampling distribution;
explore the importance and needs of sampling distribution;
7
Sampling Distributions explain the concept of standard error;
describe the most important theorem of Statistics “Central Limit Theorem”;
explain the concept of law of large numbers; and
determine adequate sample size required for sample survey.
10
Sample Mean and Sample Variance Introduction to Sampling
Distribution
If X1 ,X2 , ..., Xn is a random sample of size n taken from a population whose
probability density(mass) function f(x, θ) then sample mean is defined as
n
X 1 X 2 ... X n 1
X
n
n
X
i 1
i
n
2
Here, we divide X
i 1
i X by (n – 1) rather than n as our definition of the
n n
2
(iii) Xi /
i 1
(iv) X
i 1
i
E3) The weights (in kg.) of 5 workers in a factory are 56, 62, 74, 45 and 50.
How many samples of size of 2 are possible with replacement? Also
write all possible samples of size 2.
From the above table, the average (mean) life of these selected bulbs is 1040
hours. Now, if we are using this sample to make inference about the population
average life then we say that, the sample mean is 1040 hours give an estimate
of the average life of electric bulbs for the whole population. But if we take
another sample of same size 10 from the population and calculate the sample
mean as shown in the above table. The average life of bulbs selected in sample-
II is 860 hours. This is different from the mean of sample-I. Thus, if we are
using this sample mean to estimate the population mean then we gets a
different estimate of the average life of bulbs for the whole population.
Similarly, we can take many samples of same size 10 and for each sample we
get sample mean which may or may not be distinct. From all of the sample
means, we try to estimate the average life of bulbs of the whole population.
13
Sampling Distributions For better understanding of the process of generalising these sample results to
the whole population, we consider the example in which size of the population
is very small.
Consider a population comprising four typists who type the sample page of a
manuscript. The number of errors made by each typist is shown in Table 1.2
given below:
Table 1.2: Number of Error per Typist
Typist Number of Errors
A 4
B 2
C 3
D 1
From the above table, we can see that the value of the sample statistic (sample
mean) is varying from sample to sample and out of 16 samples only 4
samples(4,7,10 and 13) have their means equal to population mean whereas
other 12 samples result in some error in the estimation process. The error is the
difference between the population mean and the sample mean used for
estimate.
14
Now, consider all possible values of sample mean and calculate their Introduction to Sampling
probabilities (using relative frequency approach of probability described in Distribution
Unit 2 of MST-003) of occurrence. Then we arrange every possible value of
sample mean with their respective probabilities in the following Table 1.4
given below:
1
1.0 1 1.5 2 ... 4.0 1 2.5
16
The mean of sample means can also be calculated as
k
1 2 1
E(X) X X i pi 1.0 1.5 ... 4.0 2.5
i 1 16 16 16
Thus, we have seen for this population that mean of sample means is equal to
the population mean, that is, X 2.5. The fact that these two means are
equal is not a chance as it can be shown that X equals to population mean for
any population and any given sample size.
But if the population is large say having 1,000 elements and we want to draw
all samples of size 2 then there are (1000)2 1000000 possible simple random
samples with replacement. Then the listing of all such samples and finding the
sampling distribution would be a difficult task empirically. Therefore, we may
consider a theoretical experiment in which we draw all possible sample of a
fixed size and obtain a sampling distribution.
Although in practice only a random sample is actually selected and the concept
of sampling distribution is used to draw the inference about the population
parameters.
15
Sampling Distributions It is now time for you to try the following exercise to make sure that you
understand about sampling distribution.
E4) If lives of 3 Televisions of certain company are 8, 6 and 10 years then
construct the sampling distribution of average life of Televisions by
taking all samples of size 2.
In the previous example, we can calculate the standard error of sample mean as
1 2 2 2
SE X 1 1.0 2.5 2 1.5 2.5 ... 1 4.0 2.5
16
1 10
2.25 2 ... 2.25 0.791
16 16
The computation of the standard error is a tedious process. There is an
alternative method to compute standard error of the mean from a single sample
as:
If X1 ,X2 , ..., Xn is a random sample of size n taken from a population with
mean µ and variance σ2 then the standard errors of sample mean ( X ) is given
by
SE X
n
The standard error is used to express the accuracy or precision of the estimate
of population parameter because the reciprocal of the standard error is the
measure of reliability or precision of the statistic. Standard error also
determines the probable limits or confidence limits (described in Units 7 & 8 of
this course) within which the population parameter may be expected to lie with
certain level of confidence. Standard error is also applicable in testing of
hypothesis (described in Block 3 of this course).
16
The standard errors of some other well known statistics are given below: Introduction to Sampling
Distribution
1. The standard error of sample proportion (p) is given by
PQ
SE(p) =
n
where, P is population proportion and Q = 1-P.
) is given by
2. The standard error of sample median ( X
)= π σ
SE( X = 1.25331 SE ( X )
2 n
3. The standard error of difference of two sample means is given by
σ12 σ 22
SE X Y =
n1 n 2
where, σ12 and σ 22 are the population variances and n1 and n2 are the sample
sizes of two independent samples.
4. The standard error of difference between two sample proportions is given
by
P1Q1 PQ
SE( p1 p 2 ) = 2 2
n1 n2
E6) A machine produces a large number of items of which 15% are found to
be defective. If a random sample of 200 items is taken from the
population, then find the standard error of sampling distribution of
proportion.
18
Introduction to Sampling
Distribution
Fig. 1.1: Sampling distribution of sample means for various populations when n = 2,
n = 5 and n = 30
Sampling Distributions In this figure, we are trying to understand the sampling distribution of sample
mean X for different populations and for varying sample sizes. We divide this
figure into four parts A, B, C and D. The part ‘A’ of this figure shows four
different populations as normal, uniform, binomial and exponential. The rest
parts B, C and D represent the shape of sampling distribution of mean of sizes
n = 2, n = 5 and n = 30 respectively drawn from the populations shown in first
row (Part-A). From the first column of this figure, we observed that when the
parent population is normal then all the sampling distributions for varying
sample sizes are also normal, having same mean but their variances decrease as
n increases.
The second column of this figure represents the uniform population. Here, we
observe that the sampling distribution of X is symmetrical when n = 2 and
tends to normal when n = 30.
However, the third column of this figure represents the binomial population
(discrete). Again when n = 2, the sampling distribution of X is symmetrical
and for n = 5 it is quite bell shaped and tends to normal when n = 30. The last
column of this figure represents the exponential population which is highly
skewed. Here, we observe that for n =30, the distribution of X is symmetrical
bell shaped normal.
Here, we conclude that if we draw a random sample of large size n > 30 from
the population then the sampling distribution of X can be approximated by a
normal probability distribution, whatever the form of parent population.
Note 3: The central limit theorem can also be applicable in the same way for
the sampling distribution of sample proportion, sample standard deviation,
difference of two sample means, difference of two sample proportions, etc. that
is, if we take a random sample of large size n 36 30 from the population
then the sampling distribution of sample proportion, sample standard deviation,
difference of two sample means, difference of two sample proportions, etc.
approximated by a normal probability distribution, whatever the form of parent
population. The mean and variance of these sampling distributions shall be
discussed in the next unit.
Let us do one example based on central limit theorem.
Example 4: Suppose in IGNOU, the average (mean) weight of all male
students is 60 kg and standard deviation is 25 kg. If a sample of 36 male
students is selected at random, find the probability that the male students
having average weight
(i) more than 70 kg
(ii) less than 55 kg
(iii) between 50 kg and 65 kg
Solution: Here, we are given that
= 60 kg, σ = 25 kg, n = 36
Since, we draw a large sample (n > 30), therefore by central limit theorem the
sampling distribution of the sample means will also follow a normal
distribution with mean
E X 60
and variance
20
2 (25)2 Introduction to Sampling
Var X 17.36 Distribution
n 36
(i) The probability that the male students having average weight more than
70 kg is given by
P X 70 SeeFig. 1.2
To get the value of this probability, we convert the variate X into a
standard normal variate Z by the transformation
X E(X) X 60 X 60
Z
Var(X) 17.36 4.17
Therefore, subtract 60 from each term and then divide each term by 4.17
in the above inequality. Thus, the probability becomes
X 60 70 60
P P Z 2.40
4.17 4.17
0.5 P 0 Z 2.40
X 60 X 70
Z0 Z 2.40
Fig. 1.2
(ii) The probability that the male students having average weight less than 55
kg is given by
X 60 55 60
P X 55 P
4.17 4.17
X 55 X 60
Z 1.20 Z0
Fig. 1.3
(iii) The probability that the male students having average weight between 50
kg and 65 kg is given by
50 60 X 60 65 60
P 50 X 65 P
4.17 4.17 4.17
X 50 X 60 X 65
Z 2.40 Z0 Z 1.20
Fig. 1.4
Now, you can try the following exercise which will make you more user
friendly with the use of central limit theorem.
E7) Average height of the students of science group in a college is 65 inches
with a standard deviation of 2.2 inches. If a sample of 40 students is
selected at random, what is the probability that the average height of
these 40 students lies between 64 and 65.5 inches?
Introduction to Sampling
1.6 LAW OF LARGE NUMBERS Distribution
We have already discussed in Section 1.2 of this unit that the population
parameters are generally unknown and for estimating parameters, we draw all
possible random samples of same size from the population and calculate the
values of sample statistic such as sample mean, sample proportion, sample
variance, etc. for all samples and with the help of these values we form
sampling distribution of that statistic. Then we draw inference about the
population parameters.
But in real-world the sampling distributions are never really observed. The
process of finding sampling distribution would be very tedious because it
would involve very large number of samples. So in real-world problems, we
draw a random sample from the population to draw inference about the
population parameters. A very crucial question then arises: “Using a random
sample of finite size say n, can we make a reliable inference about population
parameter?” The answer is “yes”, reliable inference about population parameter
can be made by using only a finite sample and we shall demonstrate this by
“law of large numbers”. This law of large numbers can be stated in words as:
“A positive integer can be determined such that if a random sample of size n or
lager is taken from a population with parameter, say, population mean (), the
probability that the sample mean X will deviate from population mean µ can
be made to be as close to 1 as desired.”
This law states that for any two arbitrary small numbers ε and η where ε 0
and 0 η 1 , there exists an integer n such that if a random sample of size n or
larger is drawn from the population and calculate the sample mean X for this
sample, then the probability that X deviates from by less than is greater
than 1 η (as close to 1 as desired) , that is, X arbitrarily close to .
Symbolically this can be written as
2
For any ε 0 and 0 η 1 there exists an integer n such that n , where
2
σ2 is the finite variance of population then
P X 1
or P X 1 X a a X a
The proof of “law of large numbers” is beyond the scope of this course.
Here, with an example we have to try to understand the law of large numbers in
action, as we take more observations the sample mean approaches to the
population mean.
Suppose that the distribution of the weight of all the young men living in a city
is close to normal distribution with mean 65 kg and standard deviation 25 kg.
To understand the law of large numbers, we calculate the sample mean weight
( X ) for varying sample size n = 1, 2, 3, …. Fig. 1.5 shows the behaviour of the
sample mean weight X of men chosen at random from this city. The graph
plots the values of X (along vertical axis and sample size along horizontal axis)
as sample size varying from n =1, 2, 3…
23
Sampling Distributions First we start a sample of size n = 1, that is, we select a man randomly from the
city. Suppose selected man had weight 70 kg, therefore, the line of the graph
starts from this point. Now, select second man randomly and suppose his
weight is 55 kg. So for n = 2 the sample mean is
70 55
X 62.5 kg
2
This is the second point on the graph. Now, select third man randomly from the
city and suppose his weight is 83 kg. Therefore, for n = 3 the sample mean is
70 55 83
X 69.3 kg
3
This is the third point on the graph.
Fig. 1.5
This process will continue. From the graph we can see that the mean of the
sample changes as we make more observations, eventually, however, the mean
of the observations get close and close to the population mean 65 kg as the
sample size increases.
Note 4: The law of large numbers can also be applied for the sample
proportion, sample standard deviation, difference of two sample means,
difference of two sample proportions, etc.
Example 5: A pathologist wants to estimate the mean time required to
complete a certain analysis on the basis of sample study so that he may be 99%
confident that the mean time may remain with 2 days of the mean. As per the
available records, the population variance is 5 days2. What must be the size of
the sample for this study?
Solution: Here, we are given
1 0.99 0.01 , 2 , σ 2 5
By the law of large numbers, we have
σ2
n
ε 2η
5 Introduction to Sampling
2
125 Distribution
2 0.01
That is, n 125
Hence, at least 125 units must be drawn in a sample.
Example 6: An investigator wishes to estimate the mean of a population using
a sample large enough that the probability will be 0.95 that the sample mean
will not differ from the population mean by more than 25 percent of the
standard deviation. How large a sample should be taken?
Solution: We have
1 0.95 0.05, 0.25
By the law of large numbers, we have
2
n
2
2
2 320
0.25 0.05
That is, n 320
Therefore, at least 320 units must be taken in the sample so that the required
probability will be 0.95.
Now, try the following exercise for your practice.
1.7 SUMMARY
In this unit, we have covered the following points:
1. The statistical procedure which is used for drawing conclusions about the
population parameter on the basis of the sample data is called “statistical
inference”.
2. The group of units or items under study is known as “Population”
whereas a part or a fraction of population is known as “sample”.
3. A “parameter” is a function of population values which is used to
represent the certain characteristic of the population and any quantity
calculated from sample values and does not contain any unknown
population parameter is known as “statistic”.
4. Any statistic used to estimate an unknown population parameter is known
as “estimator” and the particular value of the estimator is known as
“estimate” of parameter.
5. The probability distribution of all possible values of a sample statistic that
would be obtained by drawing all possible samples of the same size from
the population is called “sampling distribution” of that statistic.
25
Sampling Distributions 6. The standard deviation of the sampling distribution of a statistic is known
as “standard error”.
7. The most important theorem of Statistics is “central limit theorem”
which state that the sampling distribution of the sample means tends to
normal distribution as sample size tends to large (n > 30).
8. According to law of large numbers, a positive integer can be determined
such that if a random sample of size n or larger is taken from a population
with parameter, say, population mean (µ), the probability that the sample
mean will be very closed to population mean, can be made as close to 1 as
desired.
26
Table 1.6: Calculation of Sample Mean Introduction to Sampling
Distribution
Sample Sample Sample
Number Observation Mean ( X )
1 8, 8 8
2 8, 6 7
3 8,10 9
4 6, 8 7
5 6, 6 6
6 6, 10 8
7 10, 8 9
8 10, 6 8
9 10, 10 10
Since the arrangement of all possible values of sample mean with their
corresponding probabilities is called the sampling distribution of mean
thus, we arrange every possible value of sample mean with their
respective probabilities in the following Table 1.7 given below:
Table 1.7: Sampling distribution of sample means
S.No. Sample Mean Frequenc Probability
(X ) y
1 6 1 1/9
2 7 2 2/9
3 8 3 3/9
4 9 2 2/9
5 10 1 1/9
PQ 0.15 1 0.15
SE p 0.025
n 200
E7) Here, we are given that
= 65, σ = 2.2, n = 40
Since we draw a large sample n = 40 > 30, therefore, by central limit
theorem, the sampling distribution of the sample mean will follow a
normal distribution with mean
E X 65
27
Sampling Distributions and variance
2
2 2.2
Var X 0.12
n 40
Therefore, the probability that the average height of these 40 students
will lie between 64 and 65.5 inches is given by
P 64 X 65.5 See Fig.1.6
To get the value of this probability, we convert the normal variate X
into a standard normal variate Z by the transformation
X E X X 65 X 65
Z
Var X 0.12 0.35
Therefore, subtract 65 from each term and then divide each term by
0.35 in the above inequality. Thus, the probability becomes
64 65 X 65 65.5 65
P P 2.86 Z 1.43
0.35 0.35 0.35
P 2.86 Z 0 P0 Z 1.43
P0 Z 2.86 P0 Z 1.43
= 0.4979 + 0.4236 = 0.9215
X 64 X 65 X 65.5
Z 2.86 Z0 Z 1.43
Fig. 1.6
E8) Here, we are given that
2 2, 0.5, 1 0.95 0.05
By the law of large numbers, we have
2
n
2
2
2
160
0.5 0.05
That is, n 160
Therefore, at least 160 units must be taken in the sample so that the
probability will be at least 0.95 that the sample mean will lie within 0.5
of the population mean.