Section 1: Practice Questions (Students Are To Attempt All These Questions) Concept of Random and Non-Random Samples 1 (2007/NJC/P2/Q6i)
Section 1: Practice Questions (Students Are To Attempt All These Questions) Concept of Random and Non-Random Samples 1 (2007/NJC/P2/Q6i)
Section 1: Practice Questions (Students Are To Attempt All These Questions) Concept of Random and Non-Random Samples 1 (2007/NJC/P2/Q6i)
2 The head of student welfare at EJC wants to find out what students think of the food
provided by the canteen. He asks the civics tutor of each class to randomly pick five
students from his or her class to take part in a survey.
[Solution]
Since the class size is (typically) different for each class, students from classes with a small
enrolment will have a higher chance of being selected compared to those from bigger classes.
Thus the sample is not a random sample.
Page 1 of 15
4 “Question Removed”
P X 23.5 0.961
X 1 ... X 50 Y1 ... Y30 50 24 30 15 50 22 30 1.52
(ii) Let T ~ N ,
80 80 802
165 107
i.e. T ~ N ,
8 2560
P T 20.5 0.270
6 The heights of a new variety of sunflower can be modelled by a normal distribution with
mean 2 m and standard deviation of 40 cm.
(i) A random sample containing 50 sunflowers is taken and the mean height calculated.
What is the probability that the sample mean lies between 195 cm and 205 cm?
(ii) A hundred such samples, each of 50 observations, are taken. In how many of these
would you expect the sample mean to lie between 195 cm and 205 cm?
[Ans: (i) 0.623 (ii) 62.3]
[Solution]
Let X be the height of a sunflower (in cm). X ~ N 2, 0.42
0.42
X ~ N 2,
When n = 50, 50
(i)
P 1.95 X 2.05 0.623
(ii) Let Y be the number of samples out of 100 which have the sample mean between
195cm and 205cm. Y ~ B 100,0.623
E Y np 62.3
Page 2 of 15
7 Components made by a machine have mean weight 0.50 g and standard deviation 0.02 g.
If two samples are taken, both of 1000 components each, what is the probability that their
means will differ by more than 0.002 g? State one assumption necessary for your
calculation.
Explain why it is not necessary to assume that the weight of the components is normally
distributed.
[Ans: 0.0253]
[Solution]
Let X be the weight of a component.
E(X) = 0.5, Var(X) = 0.022
Let X and Y be the mean weight of the 2 samples.
Since n = 1000 is large, by Central Limit Theorem,
0.022 0.022
X N 0.5, approximately and Y N 0.5, approximately
1000 1000
0.022 0.022
X Y N 0,
1000 1000 i.e. X Y N 0,8 107
P X Y 0.002 1 P 0.002 X Y 0.002 0.0253
Assumptions:
the two samples are random samples;
the observations of the weight of components are independent of each other (i.e. mutually
independent).
Since sample size n=1000 is large, by Central Limit Theorem, the sample means X and Y
are approximately normally distributed EVEN if the weight of the components is not.
8 A large number of random samples of size n are taken from B(20, 0.2). Approximately
90% of the sample means are less than 4.354. Estimate n.
[Ans: 42]
[Solution]
Let X B(20,0.2) .
Then E(X ) = 20(0.2) = 4, Var(X ) = 20(0.2)(0.8) = 3.2
3.2
Assume n is sufficiently large, by Central Limit Theorem, X N 4, approximately
n
P X 4.354 0.90
Using GC, when n 41, P X 4.354 0.89745
Page 3 of 15
4.354 4
P Z 0.90
3.2
n
0.354
1.281552
3.2
n
n 41.9 42
Note: n 42 is sufficiently large so the assumption is valid.
9 The mass of an abalone of a certain grade follows a normal distribution with mean 180g
and standard deviation 14.2 g.
(i) Find the probability that the mean mass of a sample of sixty abalones chosen at
random differs from the population mean mass by more than 2g.
(ii) This grade of abalones is priced at 450 dollars per kilogram. A customer orders
five abalones. Find the probability that the customer ends up paying an average of
more than 84 dollars per abalone.
[Ans: (i) 0.275 (ii) 0.147]
[Solution]
Let X be mass of an abalone in grams.
X ~ N 180,14.22
14.22
(i) For sample size n 60, X ~ N 180, [Note: CLT not used here]
60
P X 180 2 P X 180 2 P ( X 180 2)
2 P X 180 2 or 2P( X 180 2)
2 P X 182 or 2P( X 178)
0.27528
0.275 (3 s.f.)
C1 C2 C3 C4 C5 6.392
For sample size n 5, C ~ N 81,
5 5
P C 84 0.14690 0.147 (3 s.f.)
Page 4 of 15
Alternatively,
X1 X 2 X 3 X 4 X 5 14.22
For sample size n 5, X ~ N 180,
5 5
[Solution]
X ~ N 1, 20
(i) P( X a) 2P( X a)
P( X a) 2 1 P( X a)
2
P( X a)
3
a 2.9263 2.93 (3s.f.)
20
(ii) X ~ N 1,
n
Given P( X 1.5) 0.01
1.5 1
P Z 0.01 0.01
20
n
0.5
From GC, 2.3263
20
n
2.3263 20
n
0.5
n 20.807
n 20.807 2 ( both sides +ve)
432.93
n 433 (n )
Page 5 of 15
4
11 The continuous random variable X has E(X) = 0 and Var(X) = . The random variable Y
5
is defined by Y = aX + b, where a and b are positive constants. It is given that E(Y) = 50
and Var(Y) = 80. Find a and b.
A random sample consists of 160 independent observations of Y. Find an approximate
value for the probability that the sample sum lies between 7840 and 8080.
[modified N97/II/9]
[Ans: 10, 50; 0.682 ]
[Solution]
Y aX b
Using E(Y ) 50
E(aX b) 50
aE( X ) b 50
b 50
Using Var(Y ) 80
Var(aX b) 80
a 2 Var( X ) 0 80
4
a 2 80
5
a 2 100 a 10 ( a 0)
a 10, b 50
Page 6 of 15
12 The ‘reading age’ of children about to start a secondary school is a measure of how good
they are at reading and understanding printed text. A child’s reading age, measured in
years, is denoted by the random variable X. The distribution of X is assumed to be
X ~ N , 2 .
(a) The reading age of a random sample of 20 children were measured and the data
obtained is summarised by
x 232.6, x2 2756.22 .
Calculate unbiased estimates of and 2, giving your answers correct to 2 decimal
places.
(b) In order to obtain a more accurate estimate of , it is proposed that a larger sample
be taken. Estimate the sample size needed so that we can be 95% certain that the
sample mean reading age will differ from the true mean reading age by less than 6
months. Assume that it is known from previous studies that 2 2.25 .
[Ans: (a) 11.63, 2.69 (b) 35]
[Solution]
(a)
Page 7 of 15
13 A sample of the weights of 150 students gives x 9000 and ( x 60)2 200. Find the
unbiased estimates of the population mean and variance.
200
[Ans: 60, ]
149
[Solution]
200
or 1.34 (3 s.f.)
149
14 The power consumption of a certain brand of light-bulb is nominally 100 watts, and may
be assumed to follow a normal distribution with mean 100 watts and standard deviation 2
watts. Calculate the probability that the mean power consumption of 50 randomly selected
bulbs is less than 99.5 watts.
Following a change in the manufacturing process, a sample of 100 bulbs was tested.
Denoting the power consumption in watts of a bulb by w, it was found that
(w 100) 52 and (w 100)2 93. Calculate unbiased estimates of the new mean
and variance of the power consumption of a light-bulb.
(ii) Find the expectation and variance of X in terms of E(W) and Var(W) where
X W 100 . Comment on the relationship between the average of W and X, and
the relationship between the variance of W and X.
[Ans: 0.0385; 100.52, 0.666]
[Solution]
Let X be the power consumption of 1 light bulb. X ~ N 100, 4
4
Then for a sample of 50 bulbs, X ~ N 100,
50
Thus, P X 99.5 0.0385
Page 8 of 15
Unbiased estimate of population variance 2 is
1 52
2
1 ( w 100) 2
s
2
( w 100)
2
93
n 1 n 99 100
0.66626 0.666 (3 s.f.)
Another random sample of 40 melons is weighed and the results are as follows:
Page 9 of 15
[Solution]
Unbiased estimate of population mean = x 1.1021
16 From past trends, the National Health Board (NHB) of a certain country reported a mean
cholesterol level of 199 mg/dL (milligrams per decilitre) for the country’s population. A
NHB employee was tasked to investigate if the cholesterol level of people has changed.
She collected a random sample of 30 that yields a sample mean of x mg/dL and a sample
variance of 281.5 (mg/dL)2 .
(i) By finding the unbiased estimate of the population variance, state the approximate
distribution of the sample mean.
(ii) Another random sample of 30 is chosen. Given that the probability of the sample
mean being less than c mg/dL is not more than 0.05, find the largest value of c.
[Ans: (i) 291.207 (ii) 193]
[Solution]
From the sample, n 30, an unbiased estimate of the population variance is
n 30
s2 (sample variance) (281.5) 291.207
n 1 30 1
291.207
Since n 30 is large, X ~ N 199, approximately by CLT
30
Given P( X c) 0.05
Page 10 of 15
Section 2: Supplementary Questions (For students to practice after going through tutorial for
extra practice)
17 (In this question you should state clearly the values of the parameters of any normal
distribution you use.)
The mass, in grams, of a randomly chosen jar of Tasty brand jam is a random variable with
the distribution N(300, 42). The mass, in grams, of a randomly chosen Yummy brand jam
is a random variable with the distribution N(350, 52).
(i) Find the probability that the masses of 2 randomly selected jars of Tasty brand jam
differ by more than 10g. [3]
(ii) Find the probability that out of four randomly chosen jars of Yummy brand jam,
exactly one weighs more than 355g and the other three weigh not more than 345g each.
[3]
(iii) A crate contains ten jars of Tasty brand jam and five jars of Yummy brand jam. Find
the probability that the average mass of fifteen jars of jam in a randomly chosen crate
lies between 317g and 322g. [4]
[IJC 2011 Prelim/II/10]
[Ans: (i) 0.0771 (ii) 0.00253 (iii) 0.384]
[Solution]
Let T be the r.v. denoting “the mass of a randomly chosen jar of Tasty jam”, and
Y be the r.v. denoting “the mass of a randomly chosen jar of Yummy jam”.
T ~ N(300, 42), Y ~ N(350, 52).
(i) T1 – T2 ~ N( 0, 32)
P( | T1 – T2 | > 10 ) = 2P( T1 – T2 > 10 )
= 0.0771
(ii) 4!
P Y 355 P Y 345 3 0.00253
3!
(iii) T T10 Y1 Y5
Let A = 1
15
1 4750 950
E(A) = 300 10 350 5
15 15 3
Var(A) =
1
152
42 10 52 5
19
15
950 19
A ~ N ,
3 15
P(317 < A < 322 ) = 0.384
Page 11 of 15
18 The weight, x kg, of each student in a random sample of 120 students from a secondary
school is measured, and the results are summarized by
(i) [Solution]
Unbiased estimate of population mean
= ( x 50) 50 100 50
n 120
1
49 or 49.167 (5 s.f ) 49.2 (3 s.f )
6
Unbiased estimate of population variance
1 ( x 50)
2
n 1
( x 50)
2
n
1 (100) 2
(1158 ) 9.0308 9.03 (3 s.f )
119 120
Since n = 120 is large, by Central Limit Theorem,
9.0308
X N (49.167, ) approximately.
n
Page 12 of 15
Method 2 : Using table of values
Least value of n = 91
19 (In this question, state clearly the mean and variance of any normal distribution you use in
your calculation.)
In an office building in Shenton Way, there are 640 male employees and 560 female
employees. The weights, in kg, of male employees and female employees are modelled as
having independent normal distributions with mean and standard deviations as shown in the
table.
[3]
(i) Calculate the probability that the total weight of 4 female employees is less than three
times the weight of a male employee. [3]
(ii) Calculate the probability that the mean weight of a random sample of 80 male
employees differs from their mean by at most 0.5 kg. [3]
(iii) 25 employees are randomly chosen of which k of them are male. If the probability
that the total weight of these 25 employees exceeding 1500 kg is approximately 0.987,
find the value of k. [5]
[TPJC 2011 Prelim/II/9(modified)]
[Ans: (i) 0.710 (ii) 0.975 (iii) 15]
[Solution]
(i) Let Y be the weight of female employees in kg
Then Y ~ N(50, 22 )
Let X be the weight of male employees in kg
Then X N 68, 22
Y1 Y2 Y3 Y4 N 50 4, 22 4 i.e. Y1 Y2 Y3 Y4 N 200,16
3X ~ N 68 3, 22 32 i.e. 3X ~ N 204, 36
Let T Y1 Y2 Y3 Y4 3 X
Page 13 of 15
Then T ~ N 200 204,16 36 i.e. T ~ N 4, 52
P T 0 0.71045 0.710
20 A fruit seller grades apples according to their mass. It is given that the mass of a randomly
chosen apple follows a normal distribution with mean g and standard deviation 30 g.
Apples with a mass exceeding 150 g are graded as ‘large’ while apples with a mass less
than 70 g are graded as ‘small’. The proportion of ‘large’ apples is the same as the
proportion of ‘small apples’.
(i) Explain why is 110 g. [1]
(ii) Find the probability that the total mass of two randomly chosen apples exceeds
230g. [2]
(iii) Find the probability that a buyer who picks 50 apples randomly will have at least
three apples which are graded as ‘large’. [4]
The fruit seller also grades oranges according to their mass. It is given that the mass of a
randomly chosen orange has an independent normal distribution with mean 190 g and
standard deviation 24 g. The fruit seller sells the apples at $0.20 per 100 g and the oranges
at $0.15 per 100 g.
(iv) Find the probability that the average cost of an apple and two oranges exceeds
$0.25. [3]
[YJC 2011 Prelim/II/8]
[Ans: (ii) 0.407 (iii) 0.846 (iv) 0.694 ]
Page 14 of 15
[Solution]
(i) Let X be the random variable of the mass of an apple.
X ~ N ( ,302 )
x
70
70 150
2
110 g (by symmetry)
(ii) X1 X 2 ~ N (220,1800)
(iii) Let W be the random variable of the number of apples (out of 50) which are graded as
‘large’.
W ~ B(50, P( X 150))
W ~ B(50,0.09121121)
P W 3 1 P W 2 0.846
Y ~ N (190, 242 )
Page 15 of 15