Chap1 Sampling Distribution
Chap1 Sampling Distribution
Chap1 Sampling Distribution
1
Sampling
Distribution
1.1 Introduction
1
Chapter 1: Sampling Distribution
characteristics. That is, we do not expect the values of the sample statistics to be
exactly equal to the values of the population parameters. Similarly, we do not
expect the values of the statistics computed from one sample to be equal to the
values computed from another sample selected from the same population. This
is because a second sample will not contain the same elements as the first.
Suppose we want to estimate . Since many random samples are possible from
the same population, we would expect x to vary from sample to sample. If we
consider the process of selecting random samples as an experiment, the sample
mean x is a numerical description of the outcome of the experiment. Thus, the
sample mean x is a random variable, and just like any other random variables, x
has a mean, a variance and a probability distribution.
Definition
Because the possible values of x are the result of different samples, the
probability distribution of X is called the sampling distribution of X .
Knowledge of the sampling distribution and its properties will enable us to make
probability statements about how close a sample statistic is to the population
parameter. The sampling distribution of a statistic depends on the size of the
population, the size of the samples, and the method of choosing the samples. The
sampling distributions discussed in this chapter are the sampling distributions of
the mean, difference between two means, proportion and difference between
two proportions.`
2
Chapter 1: Sampling Distribution
n
Xi 1 n 1 n
X E ( X ) E i 1 E X i E X i
n n i 1 n i 1
1 1
E ( X 1 ) E ( X 2 ) ... E ( X n ) ...
n n
1
n
n
and variance
2
n
Xi 1 n
X2 Var ( X ) Var
n Var Xi
i 1 n i 1
1
Var ( X 1 ) Var ( X 2 ) ... Var ( X n )
n2
1 n 2 2
2 ... 2
2 2 2
n n n
3
Chapter 1: Sampling Distribution
2
X ~ N ,
n
Note that this is the distribution for X when a sample is taken from an infinite
N n 2
population. If the population is finite, 2X
N 1 n
Example 1.1
Solution
4
Chapter 1: Sampling Distribution
2
normal with mean and variance provided that the sample size is large.
n
Then the limiting form of the distribution of
X
Z
/ n
as n , is the standard normal distribution N (0,1) .
Note:
The normal approximation for X will generally be good if the sample
size, n 30.
Continuity correction factor is not needed for the random variable X .
Example 1.2
In an examination taken by a large number of students the mean mark was 64.5
and the variance was 64. If a random sample of 100 scripts is selected, what is
the probability that the mean mark
a) is at least 75?
b) is between 63.8 and 64.5?
c) is at most 70?
5
Chapter 1: Sampling Distribution
Solution
Example 1.3
Solution
6
Chapter 1: Sampling Distribution
12 22
X1 ~ N 1 , and X2 ~ N 2 ,
n1 n2
Now we are interested in finding out what is the sampling distribution of the
The mean of X 1 X 2 is
X X E X 1 X 2 E X 1 E X 2 1 2
1 2
12 22
X1 X 2 ~
N 1 2 ,
n1 n2
Hence,
7
Chapter 1: Sampling Distribution
( X 1 X 2 ) ( 1 2 )
Z ~ N (0,1)
12 22
n n
1 2
Example 1.4
A factory produces two types of tennis balls, type A and type B. It is known that
the mean height of bounce is 140 cm for type A with a standard deviation of 2
cm while the mean height of bounce for type B is 138 cm with a standard
deviation of 3 cm. Assume that the heights of bounce are approximately normal.
A random sample of 49 tennis balls of type A and 36 tennis balls of type B are
taken. Find the probability that the
a) mean height of bounce of type A tennis balls is larger than the mean height
of bounce of type B tennis balls.
b) difference between the mean height of bounce of type A tennis balls and
type B tennis balls is at most 3 cm.
c) mean height of bounce of both types of tennis balls differ by at most 3 cm.
Solution
8
Chapter 1: Sampling Distribution
Example 1.5
An engineering firm sets an aptitude test when applicants first apply for training.
For the first batch of applicants, the mean time taken to complete the test is 40.5
minutes with standard deviation 7.5 minutes. For the second batch of applicants,
the mean time taken to complete the test is 42.5 minutes with standard
deviation 6.5 minutes. Assume that the times taken to complete the aptitude test
are approximately normal. If the times taken by a random sample of 35
applicants in the first batch and the times taken by a random sample of 40
applicants in the second batch are selected, find the probability that
a) mean time taken by the first batch of applicants is less than the time taken
by the second batch of applicants.
b) the mean time taken by the two batches of applicants will differ by more
than 3 minutes.
Solution
9
Chapter 1: Sampling Distribution
n
P( X x) x (1 ) n x , for x = 0, 1, 2, …, n
x
If all possible samples of size n is taken from this distribution, and the number of
successes is determined from each sample, then the proportion of successes for
X
each sample can be characterized by p . Since each sample will give a
n
different value of p, then the proportion is a random variable and symbolized as
P. Therefore the resulting sampling distribution of P has mean and variance
(1 )
. By the Central Limit Theorem, the distribution of P is approximately
n
normal where
X
1 1
P E P E E X n
n
n n
X 1 n (1 ) (1 )
P2 Var P Var 2 Var X
n n n2 n
(1 )
P~ N , when n is large ( n 30 ).
n
Note:
When approximating binomial distribution with normal distribution, a
1 X
continuous correction factor of is used. Since P , then a continuous
2 n
1
correction factor of is used.
2n
10
Chapter 1: Sampling Distribution
Example 1.6
It is known that 3% of frozen curry puffs delivered to a canteen are broken.
What is the probability that, on a morning when 500 curry puffs are delivered,
a) 5% or more are broken?
b) between 2% and 4% are broken?
c) at least 3% are broken?
Solution
11
Chapter 1: Sampling Distribution
Example 1.7
Three quarters of the households in Taman Uda subscribe to internet service
provider. In a random sample of 100 households, find the probability that
a) at least 73 households subscribe to internet service provider.
b) at most 90 households subscribe to internet service provider.
c) between 85 and 90 households subscribe to internet service provider.
Solution
12
Chapter 1: Sampling Distribution
and 2 respectively. Samples of size n1 are taken from population 1 and samples
of size n2 are drawn from population 2. Then P1 and P2 are the proportions from
those samples.
( 1 1 ) (1 2 )
P1 ~ N 1 , 1 and P2 ~ N 2 , 2
n1 n2
provided n1 and n2 are large.
P P E P1 P2 E ( P1 ) E ( P2 ) 1 2
1 2
1 (1 1 ) 2 (1 2 )
P2 P Var ( P1 P2 ) Var ( P1 ) Var ( P2 )
n1 n2
1 2
(1 1 ) 2 (1 2 )
P1 P2 ~ N 1 2 , 1
n1 n2
13
Chapter 1: Sampling Distribution
Example 1.8
Even though breakfast is an important meal of the day, it was found that 10% of
male adults while 25% of female adults skip breakfast. If a random sample of 50
male adults and a random sample of 50 female adults were selected, what is the
probability that
a) the proportion of the female adults who skips breakfast exceeds the
proportion of the male adults who skips breakfast?
b) the proportion of the female adults who skips breakfast exceeds the
proportion of the male adults who skips breakfast by at least 0.1?
c) the proportion of adults who skip breakfast from both groups differ by at
most 0.3?
Solution
14
Chapter 1: Sampling Distribution
Example 1.9
The proportions of single and married male policyholders who made an
insurance claim over the preceding 3-year period are 0.3 and 0.2 respectively. If
a large automobile company selected 50 random samples of single and 40
random samples of married male policyholders who made an insurance claim
over the preceding 3-year period, what is the probability that
a) the difference between the sample proportion of single male policyholders
and the sample proportion of married male policyholders who made an
insurance claim over the preceding 3-year period does not exceed 0.1?
b) the difference between the sample proportion of single policyholders and the
sample proportion of married male policyholders who made an insurance
claim over the preceding 3-year period is at least 0.15?
Solution
15
Chapter 1: Sampling Distribution
Exercises : Chapter 1
5) The average CPA for faculty of Science students is 2.9 with standard
deviation 1.0. A random sample of 100 students is drawn from this
population. What is the probability that the sample mean is smaller than
2.6?
16
Chapter 1: Sampling Distribution
7) A builder claims that water heaters are installed in 70% of all homes in
Taman Perling. A random sample of 50 homes is drawn from Taman
Perling, what is the probability that
a) at most 60% of the houses has water heaters installed in their homes.
b) at least 55% of the houses has water heaters installed in their homes.
c) Between 65% and 75% of the houses has water heaters installed in
their homes.
17
Chapter 1: Sampling Distribution
10) The most popular soccer player is David Beckham. It is known that 60% of
the female soccer fans and 55% of the male soccer fans favor David
Beckham over the other soccer players. A sample of 100 female soccer
fans and a sample of 100 male soccer fans were interviewed at random.
What is the probability that the
a) proportion of female supporters of David Beckham is larger than the
proportion of male supporters?
b) difference between proportion of female supporters of Beckham and
that of male supporters is at most 0.2
11) It is known that 30% and 35% of the residents in Taman Sutera and Bandar
Baru UDA subscribe to “New Straits Times” newspaper respectively. If a
random sample of 50 newspaper readers from Taman Sutera and 50
readers from Bandar Baru UDA were taken randomly, what is the
probability that the proportion of “New Straits Times” subscribers in
Taman Sutera is larger than Bandar Baru UDA?
13) The mean waiting time for drive-through customers before they get their
orders from a fast food restaurant is 20 minutes with standard deviation 5
minutes. If a random sample of 64 customers is observed, what is the
probability that their mean waiting time
18
Chapter 1: Sampling Distribution
15. The mean final examination scores for students taking a statistics test is 30
marks with a standard deviation of 6 marks. Assume that the test scores are
approximately normal. Two random samples were taken randomly
consisting of 32 and 50 students, respectively. What is the probability that
a) the mean test scores will differ by more than 3 marks?
b) the mean test scores from group 1 is larger than group 2?
Chapter 1: Answers
19
Chapter 1: Sampling Distribution
4. a) 0.0793 b) 0.8761
5. 0.0013
6. a) 0.9976 b) 1.0000
7. a) 0.0838 (0.0823) b) 0.9931 c) 0.4648
8. 0.8159 (0.8133)
9. 200(0.3085) 62 boxes
10. a) 0.7611 (0.7642) b) 0.9838 (0.9842)
11. 0.2981
12. a) 0.1998 b) 0.3446 c) 0.0968 d) 80.13
13. a) 0.0541 b) 0.0007 c) 0.0548
14. a) 0 b) 0.0008
15. a) 0.0272 b) 0.5
16. a) 1 b) 0 c) 0
20