HKDSE M1 Probability Distributions

CHAPTER 4 PROBABILITY DISTRIBUTIONS
進佳數學團隊 – Dr. Herbert Lam 林康榮博士

HKDSE Mathematics M1
II. The Binomial Distribution
1. Bernoulli distribution
A Bernoulli experiment results in one of two possible outcomes, which
are often classified as “success” and “failure”. For example, the inspection
of a lot of items is associated with the sample space
$$S = \{D, ND\},$$
in which D denotes the event of getting a defective item and
ND denotes the event of getting a nondefective item.
If one of them is termed “success”, then the other is termed “failure”.

Some experiments look like Bernoulli experiments but are not. For instance,
John is playing a game of chess with his friend. One might associate “success”
with “winning the game” and “failure” with “losing the game”. However,
there is a third possible outcome: a tie!
Let us quantify the event “success” by 1 and “failure” by 0.

Then Y, which takes the values 0 and 1, is called a Bernoulli random
variable. Suppose the probability of a “success” is $p$. Then the
probability function of Y is given by
$$P(Y = 1) = p, \quad P(Y = 0) = 1 - p.$$
2. Basic features of a binomial experiment
Let $Y_1, Y_2, \ldots, Y_n$ be n independent Bernoulli random variables
with the same probability of success p, and let
$$X = Y_1 + Y_2 + \cdots + Y_n.$$
Then X is called a binomial random variable, which takes on the values
$0, 1, 2, \ldots, n$.
This approach is rather mathematical. In fact, we can commence our
study of the binomial distribution intuitively as follows.
A binomial experiment possesses the following properties:

1. There are n identical observations or trials.
2. Each trial has two possible outcomes, one called success and the other
failure. The outcomes are mutually exclusive and collectively
exhaustive.
3. The probabilities of success $p$ and of failure $1 - p$ remain the same for
all trials.
4. The outcomes of trials are independent of each other.
3. The probability function

In a binomial experiment with a constant probability p of success in each
trial, the probability distribution of the binomial random variable X, the
number of successes in n independent trials, is called the binomial
distribution.
Since the n trials result in x successes and $n - x$ failures, the probability
of any one particular sequence of outcomes is $p^x (1 - p)^{n - x}$. Moreover,
there are $C_x^n$ such sequences of successes and failures. Therefore, the
probability function is given by
$$P(X = x) = C_x^n \, p^x (1 - p)^{n - x}, \quad x = 0, 1, 2, \ldots, n.$$
The notation $X \sim b(n, p)$ is used to describe a binomial random variable X
with parameters n and p. Furthermore, the mean and variance of the
above binomial distribution are $np$ and $np(1 - p)$ respectively.
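As a quick numerical check of the formulas above, the following minimal Python sketch evaluates the binomial probability function and compares the mean and variance computed directly from it with np and np(1 - p). The helper name `binom_pmf` and the values n = 4, p = 1/3 are illustrative choices, not part of the text.

```python
from math import comb

def binom_pmf(x: int, n: int, p: float) -> float:
    """P(X = x) for X ~ b(n, p), using the formula C(n, x) p^x (1 - p)^(n - x)."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

# Illustrative parameters (an assumption made for this sketch).
n, p = 4, 1 / 3
pmf = [binom_pmf(x, n, p) for x in range(n + 1)]

total = sum(pmf)                                               # should be 1
mean = sum(x * q for x, q in enumerate(pmf))                   # should equal n p
variance = sum((x - mean) ** 2 * q for x, q in enumerate(pmf)) # should equal n p (1 - p)

print(total)
print(mean, n * p)
print(variance, n * p * (1 - p))
```

The three printed pairs should agree up to floating-point rounding.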
Example 1
A trading company has four telephone lines. Suppose the probability that
any one of the lines is busy at an instant is $\frac{1}{3}$.
(a) Calculate the probability that

(i) two of the four lines are busy.
(ii) at least one of the four lines is busy.
(iii) an incoming phone call cannot be answered immediately.
(b) Suppose at an instant one staff member is on the telephone. What is the
probability that another two lines are busy?
[Solution]
(a) Let X denote the number of telephone lines that are busy.
Then $X \sim b(4, \frac{1}{3})$.
(i) $$P(X = 2) = C_2^4 \left(\frac{1}{3}\right)^2 \left(\frac{2}{3}\right)^2
= \frac{4 \cdot 3}{2} \cdot \frac{1}{9} \cdot \frac{4}{9} = \frac{8}{27}.$$
(ii) $$P(X \ge 1) = 1 - P(X = 0)
= 1 - C_0^4 \left(\frac{1}{3}\right)^0 \left(\frac{2}{3}\right)^4
= 1 - \frac{16}{81} = \frac{65}{81}.$$
(iii) An incoming call not answered immediately implies that all
four lines are busy.
$$P(X = 4) = \left(\frac{1}{3}\right)^4 = \frac{1}{81}.$$
(b) Since it is given that one telephone line is engaged, the outcome
X = 0 is removed. We have the conditional probability
P(3 lines are busy / at least 1 line is busy)
$$= \frac{P(X = 3 \text{ and } X \ge 1)}{P(X \ge 1)} = \frac{P(X = 3)}{P(X \ge 1)}
= \frac{C_3^4 \left(\frac{1}{3}\right)^3 \left(\frac{2}{3}\right)}{\frac{65}{81}}
= \frac{4 \cdot 2}{65} = \frac{8}{65}.$$
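The answers to Example 1 can be reproduced with exact fractions. This is only a sketch: `binom_pmf` is a hypothetical helper name, and part (b) follows the solution's reading of the condition as "at least one line is busy".

```python
from fractions import Fraction
from math import comb

def binom_pmf(x, n, p):
    """P(X = x) for X ~ b(n, p); exact when p is a Fraction."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

n, p = 4, Fraction(1, 3)

p_two_busy = binom_pmf(2, n, p)                 # (a)(i)
p_at_least_one = 1 - binom_pmf(0, n, p)         # (a)(ii)
p_all_busy = binom_pmf(4, n, p)                 # (a)(iii)
p_three_given_one = binom_pmf(3, n, p) / p_at_least_one   # (b)

print(p_two_busy, p_at_least_one, p_all_busy, p_three_given_one)
# prints: 8/27 65/81 1/81 8/65
```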
Example 2
Machines A and B turn out, respectively, 10% and 90% of the total production
of a certain article. Suppose the probability that machine A turns out a
defective article is 0.01 and that machine B turns out a defective one is 0.05.
(a) What is the probability that an article taken at random from a day’s
production was nondefective?
(b) In a quality control process 10 articles from a batch are randomly
selected for inspection. Acceptance of the batch allows no more than
one defective. Find the probability that the batch is rejected.
(c) A batch was rejected yesterday because two defective articles were found.
What is the probability that both items were manufactured by
(i) the same machine?
(ii) different machines?
(d) State the relevant assumptions required for (b) and (c).
[Solution]
(a) For a randomly selected article, define
A: the article is manufactured by machine A
B: the article is manufactured by machine B
ND: the article is nondefective
D: the article is defective.
Then we have
$$P(A) = 0.1, \quad P(B) = 0.9,$$
$$P(ND / A) = 0.99, \quad P(ND / B) = 0.95,$$
$$P(D / A) = 0.01, \quad P(D / B) = 0.05.$$
By the theorem of total probability on P. 68,
$$P(ND) = P(A)P(ND / A) + P(B)P(ND / B) = (.1)(.99) + (.9)(.95) = .954,$$
$$P(D) = 1 - P(ND) = 1 - .954 = .046.$$
(b) Let X be the number of defective articles found in the inspection
process. Then $X \sim b(10, 0.046)$.
P(the batch is rejected)
$$= 1 - P(X \le 1) = 1 - P(X = 0) - P(X = 1)$$
$$= 1 - C_0^{10} (.046)^0 (.954)^{10} - C_1^{10} (.046)(.954)^9 = .0745.$$
(c) By Bayes’ theorem,
$$P(A / D) = \frac{P(A)P(D / A)}{P(A)P(D / A) + P(B)P(D / B)}
= \frac{(.1)(.01)}{(.1)(.01) + (.9)(.05)} = \frac{1}{1 + 45} = \frac{1}{46}.$$
As an article is manufactured by either machine A or machine B, it follows
that
$$P(B / D) = 1 - P(A / D) = \frac{45}{46}.$$
(i) P(both articles manufactured by the same machine)
= P(both manufactured by A) + P(both manufactured by B)
$$= P(A / D)^2 + P(B / D)^2
= \left(\frac{1}{46}\right)^2 + \left(\frac{45}{46}\right)^2 = \frac{2026}{2116} = .9575.$$
(ii) P(both articles manufactured by different machines)
= 1 - P(both articles manufactured by the same machine)
= 1 - .9575 = .0425.
Alternatively, this probability can also be calculated by
$$P(A / D) \cdot P(B / D) + P(B / D) \cdot P(A / D) = 2 \cdot \frac{1}{46} \cdot \frac{45}{46},$$
or
P(1 defective article manufactured by machine A)
$$= C_1^2 \left(\frac{1}{46}\right) \left(1 - \frac{1}{46}\right).$$
(d) The binomial assumptions are required, namely,

(i) independent trials and
(ii) the probability of drawing a defective article is constant.
In fact, sampling without replacement does not meet these two
requirements. However, the batch size is very large and therefore
(i) & (ii) are considered satisfied.
See Example 3 for details.
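The figures in Example 2 can be checked numerically. The sketch below uses exact fractions for the Bayes step and a floating-point value for the rejection probability; all variable names are illustrative assumptions.

```python
from fractions import Fraction
from math import comb

# Given data: machine shares and defect rates.
p_a, p_b = Fraction(1, 10), Fraction(9, 10)
d_a, d_b = Fraction(1, 100), Fraction(5, 100)

# (a) Theorem of total probability.
p_d = p_a * d_a + p_b * d_b      # P(D)
p_nd = 1 - p_d                   # P(ND)

# (b) Reject the batch if more than one of 10 inspected articles is defective.
n = 10
p_accept = sum(comb(n, x) * p_d**x * p_nd**(n - x) for x in (0, 1))
p_reject = 1 - p_accept

# (c) Bayes' theorem, then two defective articles treated as independent.
a_given_d = p_a * d_a / p_d
b_given_d = 1 - a_given_d
same_machine = a_given_d**2 + b_given_d**2
different_machines = 2 * a_given_d * b_given_d

print(float(p_nd), float(p_d))
print(round(float(p_reject), 4))
print(a_given_d, b_given_d)
print(round(float(same_machine), 4), round(float(different_machines), 4))
```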
Example 3
A machine produces, on the average, 5% defective parts. If 10 parts are
selected at random from a lot of size 1000 for inspection, what is the
probability that exactly three will be defective?

[Solution]
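Of the 1000 parts in the lot, there are 50 defective ones.
P(exactly 3 will be defective)
$$= \frac{C_3^{50} \, C_7^{950}}{C_{10}^{1000}}
= \frac{\dfrac{50!}{3!\,47!} \cdot \dfrac{950!}{7!\,943!}}{\dfrac{1000!}{10!\,990!}}$$
$$= \frac{\dfrac{50 \cdot 49 \cdot 48 \cdot 47!}{3!\,47!} \cdot \dfrac{950 \cdots 944 \cdot 943!}{7!\,943!}}{\dfrac{1000 \cdot 999 \cdots 991 \cdot 990!}{10!\,990!}}$$
$$= \frac{10!}{3!\,7!} \left(\frac{950}{1000} \cdot \frac{949}{999} \cdots \frac{944}{994}\right)
\left(\frac{50}{993} \cdot \frac{49}{992} \cdot \frac{48}{991}\right)$$
$$\approx C_3^{10} (.95)^7 (.05)^3 = C_3^{10} (.05)^3 (.95)^7.$$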
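To see how close the binomial approximation is for this lot, the following sketch compares the exact probability for sampling without replacement with the binomial value. The variable names are illustrative assumptions; the numbers (lot of 1000, 50 defective, sample of 10, exactly 3 defective) are those of Example 3.

```python
from math import comb

lot_size, defective, sample, k = 1000, 50, 10, 3

# Exact probability when sampling without replacement.
exact = comb(defective, k) * comb(lot_size - defective, sample - k) / comb(lot_size, sample)

# Binomial approximation with p = 50 / 1000 = 0.05.
p = defective / lot_size
approx = comb(sample, k) * p**k * (1 - p)**(sample - k)

print(exact, approx)
```

The two values should differ only slightly, which is the sense in which the binomial assumptions are "considered satisfied" for a large lot.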

Example 4
A machine which is powered by three similar electrical devices will function
properly if at least two of these devices are serviceable. Experience
indicates that the probability of any device failing in less than 50 hours is 0.2,
while that of failing in less than 100 hours is 0.6.
Find the probability that the machine will function properly

(a) for more than 50 hours,
(b) between 50 and 100 hours.

[Solution]
Let X denote the number of devices failing in less than 50 hours, and
Y denote the number of devices failing in less than 100 hours.
We are given that
$X \sim b(3, 0.2)$, i.e. $P(X = x) = C_x^3 (.2)^x (.8)^{3 - x}$,
$Y \sim b(3, 0.6)$, i.e. $P(Y = y) = C_y^3 (.6)^y (.4)^{3 - y}$.
Define the events
A: the machine functions for more than 50 hours
B: the machine functions for more than 100 hours.
(a) P(A) = P(at most one device fails in 50 hours)
$$= P(X \le 1) = P(X = 0) + P(X = 1)
= C_0^3 (.2)^0 (.8)^3 + C_1^3 (.2)(.8)^2 = .512 + .384 = .896.$$
(b) P(B) = P(at most one device fails in 100 hours)
$$= P(Y \le 1) = P(Y = 0) + P(Y = 1)
= C_0^3 (.6)^0 (.4)^3 + C_1^3 (.6)(.4)^2 = .064 + .288 = .352.$$
Note that $B \subset A$. Then $A \setminus B$, or $A \cap \bar{B}$, is the event
“the machine functions between 50 and 100 hours”
and we have
$$P(A \setminus B) = P(A) - P(B) = .896 - .352 = .544.$$
Note:
As time is a continuous variable, the terms “more than 50 hours” and “at
least 50 hours” have the same meaning. Also, “between 50 and 100 hours”
is well defined: it does not matter whether the end points 50 and 100 are included.
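The two binomial sums in Example 4 and their difference can be verified with a short sketch. The helper `binom_cdf` is a hypothetical name introduced here for illustration.

```python
from math import comb

def binom_cdf(k: int, n: int, p: float) -> float:
    """P(X <= k) for X ~ b(n, p)."""
    return sum(comb(n, x) * p**x * (1 - p)**(n - x) for x in range(k + 1))

# The machine works as long as at most one of the three devices has failed.
p_more_than_50 = binom_cdf(1, 3, 0.2)     # P(A)
p_more_than_100 = binom_cdf(1, 3, 0.6)    # P(B)
p_between = p_more_than_50 - p_more_than_100   # valid because B is a subset of A

print(round(p_more_than_50, 3), round(p_more_than_100, 3), round(p_between, 3))
```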
III. The Poisson Distribution
1. Basic features
Experiments yielding numerical values of a random variable X, the number
of successes (observations) occurring during a given time interval (or in a
specified region) are often called Poisson experiments.
A Poisson experiment has the following properties:
1. The number of successes in any interval is independent of the number
of successes in any other interval.
2. The probability of a single success occurring during a short interval is
proportional to the length of the time interval and does not depend on
the number of successes occurring outside this time interval.
3. The probability of more than one success in a very small interval is
negligible.
Some well-known examples of Poisson experiments are

1. The number of customers who arrive during a time period of length t,
2. The number of telephone calls per hour received by an office,
3. The number of typing errors per page,
4. The number of accidents per day at a junction.
2. The probability function

The probability distribution of the Poisson random variable X is called the
Poisson distribution. The probability function is
$$P(X = x) = \frac{e^{-\lambda} \lambda^x}{x!}, \quad x = 0, 1, 2, \ldots.$$
The mean and variance of the above Poisson distribution are both $\lambda$,
which is also the average number of “successes” occurring in the given
time interval. Note that $\lambda$ is the only parameter that appears in the
probability function. In other words, a Poisson distribution is completely
determined if $\lambda$ is known.
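The statement that the mean and variance both equal the parameter can be checked numerically. The sketch below is a minimal illustration; the helper name `poisson_pmf`, the parameter value 2.5 and the truncation of the infinite sums at 60 terms are assumptions made for this example.

```python
from math import exp, factorial

def poisson_pmf(x: int, lam: float) -> float:
    """P(X = x) for a Poisson random variable with mean lam."""
    return exp(-lam) * lam**x / factorial(x)

lam = 2.5          # illustrative parameter
xs = range(60)     # truncation of the infinite sum; the tail is negligible here

total = sum(poisson_pmf(x, lam) for x in xs)
mean = sum(x * poisson_pmf(x, lam) for x in xs)
variance = sum((x - mean) ** 2 * poisson_pmf(x, lam) for x in xs)

print(total, mean, variance)   # expected to be close to 1, lam, lam
```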
Example 1
Albert is a life insurance agent. Assume he makes, on the average, one sale
per week, and the number of sales behaves close to a Poisson distribution.
(a) What is the probability that Albert makes

(i) exactly 3 sales in a two-week period?
(ii) at least 3 sales in a three-week period?
(b) What is the probability that he will make only one sale in the coming
December?
(c) Albert has received e-mail confirmation from two of his friends that each
of them will purchase a life insurance policy from him next month.
What is the probability that he will make at least 4 sales next month?
(Take 1 week = 7 days and 1 month = 30 days)

[Solution]
(a) (i) $\lambda = 2$ per 2-week period.
$$P(X = 3) = \frac{e^{-2} 2^3}{3!} = .1804.$$
(ii) $\lambda = 3$ per 3-week period.
$$P(X \ge 3) = 1 - P(X < 3) = 1 - \sum_{x=0}^{2} \frac{e^{-3} 3^x}{x!}
= 1 - e^{-3} \left(1 + 3 + \frac{3^2}{2!}\right) = .5768.$$
(b) $\lambda = \frac{30}{7}$ per month.
$$P(X = 1) = e^{-30/7} \left(\frac{30}{7}\right) = .0590.$$
(c) Albert is going to have at least 2 sales next month. Then we
calculate the conditional probability
$$P(X \ge 4 / X \ge 2) = \frac{P(X \ge 4 \text{ and } X \ge 2)}{P(X \ge 2)}
= \frac{P(X \ge 4)}{P(X \ge 2)}
= \frac{1 - \sum_{x=0}^{3} P(X = x)}{1 - \sum_{x=0}^{1} P(X = x)}$$
$$= \frac{1 - e^{-30/7} \left[1 + \frac{30}{7} + \left(\frac{30}{7}\right)^2 \frac{1}{2!}
+ \left(\frac{30}{7}\right)^3 \frac{1}{3!}\right]}{1 - e^{-30/7} \left(1 + \frac{30}{7}\right)} = .6689.$$
Example 2
Weak spots occur in a certain kind of woven cloth on the average of one per
100 m. Assuming a Poisson distribution of the number of weak spots in any
given length of cloth, what is the probability that
(a) a 240-m roll will have at most two defects?
(b) a 120-m roll will have no defects?
(c) of five 120-m rolls, at least three will have no defects?
(d) Suppose a new weaving process can reduce weak spots so that any five
120-m rolls being free of defects has probability at least 0.1. Find the
percentage reduction of weak spots.

[Solution]
(a) Let $X_1$ be the number of defects per 240 m. Then its mean is 2.4.
$$P(X_1 \le 2) = P(X_1 = 0, 1, 2) = \sum_{x=0}^{2} \frac{e^{-2.4} 2.4^x}{x!}
= e^{-2.4} \left(1 + 2.4 + \frac{2.4^2}{2!}\right) = .5697.$$
(b) Let $X_2$ be the number of defects per 120 m. Then its mean is 1.2.
$$P(X_2 = 0) = \frac{e^{-1.2} 1.2^0}{0!} = .3012.$$
(c) Let Y be the number of rolls with no defects out of five 120-m rolls.
Then $Y \sim b(5, 0.3012)$, i.e.
$$P(Y = y) = C_y^5 (.3012)^y (1 - .3012)^{5 - y}, \quad y = 0, 1, 2, \ldots, 5.$$
P(at least three of them have no defects)
$$= P(Y \ge 3) = P(Y = 3) + P(Y = 4) + P(Y = 5)$$
$$= C_3^5 (.3012)^3 (.6988)^2 + C_4^5 (.3012)^4 (.6988) + C_5^5 (.3012)^5 (.6988)^0$$
$$= .1334 + .0288 + .0025 = .1647.$$
(d) The mean becomes $\lambda = 1.2(1 - d)$ if the weak spots are reduced by 100d%.
Then $P(X_2 = 0) = e^{-1.2(1 - d)}$ by (b).
$$P(\text{5 rolls free of defects}) = \left[e^{-1.2(1 - d)}\right]^5 \ge .1,$$
i.e. $e^{-6(1 - d)} \ge .1$.
On solving, $-6(1 - d) \ge \ln(.1)$, so
$$1 - d \le .3838, \quad d \ge .6162.$$
This means that the weak spots must be reduced by at least 61.62%.
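A numerical check of parts (a) to (d) is sketched below. Variable names are illustrative assumptions. Note that the code keeps the unrounded value of exp(-1.2) in part (c), so the result may differ from the text in the last decimal place.

```python
from math import comb, exp, log

# (a) Mean 2.4 defects per 240 m: P(at most two defects).
p_a = exp(-2.4) * (1 + 2.4 + 2.4**2 / 2)

# (b) Mean 1.2 defects per 120 m: P(no defects).
p_clean = exp(-1.2)

# (c) Number of clean rolls out of five is binomial with success probability p_clean.
p_c = sum(comb(5, y) * p_clean**y * (1 - p_clean)**(5 - y) for y in range(3, 6))

# (d) Solve exp(-6 (1 - d)) >= 0.1 for the smallest reduction d.
d_min = 1 + log(0.1) / 6

print(round(p_a, 4), round(p_clean, 4), round(p_c, 4), round(d_min, 4))
```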
Example 3
An English teacher Miss Lee has been marking students’ compositions for
years. She finds that only 5% of the pages are free of errors (for example,
grammatical and spelling errors, etc.). Suppose the number of errors per
page follows a Poisson distribution.
(a) Show that on the average there are about 3 errors per page.
(b) What is the probability that a composition of 2 pages contains more than
4 errors?
(c) Another English teacher, Miss Chan, requires her students to hand in
typed compositions. However, typing errors occur. It is found
that only 1% of the pages are free of errors. Assume that the number of
typing errors per page follows a Poisson distribution.
Use relevant assumptions to calculate the mean number of typing errors
per page.

[Solution]
(a) Let $X_1$ be the number of errors per page and $\lambda$ be its mean.
$$P(X_1 = x) = \frac{e^{-\lambda} \lambda^x}{x!}, \quad x = 0, 1, 2, \ldots$$
$$P(X_1 = 0) = e^{-\lambda} = .05$$
$$\lambda = -\ln(.05) = 2.9957 \approx 3.$$
(b) Let $X_2$ be the number of errors per 2 pages. Its mean is 6.
$$P(X_2 > 4) = 1 - \sum_{x=0}^{4} P(X_2 = x) = 1 - \sum_{x=0}^{4} \frac{e^{-6} 6^x}{x!}
= 1 - e^{-6} \left(1 + 6 + \frac{6^2}{2!} + \frac{6^3}{3!} + \frac{6^4}{4!}\right) = .7149.$$
(c) (i) An error is either due to incorrect language usage (i.e.
grammar, spelling, etc.) or mistyping. The case of both is not
counted.
(ii) The content of one written page equals that of one typed page.
Let Y be the number of errors per typed page and $\mu$ be its mean.
$$P(Y = y) = \frac{e^{-\mu} \mu^y}{y!}, \quad P(Y = 0) = e^{-\mu} = .01 \implies \mu = 4.61.$$
The mean number of typing errors per page is $4.61 - 3 = 1.61$.
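The logarithms and the tail probability in this example are easy to verify. The sketch below uses unrounded means in part (c), so its result is approximately 1.609 rather than the rounded 1.61; variable names are illustrative assumptions.

```python
from math import exp, factorial, log

# (a) P(no error on a page) = exp(-lam) = 0.05.
lam_written = -log(0.05)

# (b) Two pages: mean 6 (using the rounded mean of 3 errors per page, as in the text).
p_more_than_4 = 1 - sum(exp(-6) * 6**x / factorial(x) for x in range(5))

# (c) Typed pages: exp(-mu) = 0.01; typing errors are the excess over language errors.
mu_typed = -log(0.01)
mean_typing_errors = mu_typed - lam_written

print(round(lam_written, 4), round(p_more_than_4, 4))
print(round(mu_typed, 2), round(mean_typing_errors, 2))
```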
Example 4
(a) Between the hours of 2:30 p.m. and 5:00 p.m., the average number of
calls per minute coming into a switchboard is 2.5. Assume the number
of calls follows a Poisson distribution. Find the probabilities that during
one particular minute there are
(i) 4 or fewer calls, (ii) more than 6 calls.
(b) During the period 3:00 p.m. to 3:02 p.m., show that the probability of
receiving an odd number of telephone calls is approximately $\frac{1}{2}$.
(c) Does it follow from (b) that the probability of receiving an even number
of calls is also $\frac{1}{2}$?
(d) Suppose the operators answer, on the average, $\frac{1}{5}$ of the incoming
calls in English. What is the probability that they answer exactly two telephone
calls in English in a given minute?
[Solution]
(a) Let X be the number of calls per minute.
$$P(X = x) = \frac{e^{-2.5} 2.5^x}{x!}, \quad x = 0, 1, 2, \ldots.$$
(i) $$P(X \le 4) = \sum_{x=0}^{4} P(X = x) = \sum_{x=0}^{4} \frac{e^{-2.5} 2.5^x}{x!}
= e^{-2.5} \left(1 + 2.5 + \frac{2.5^2}{2!} + \frac{2.5^3}{3!} + \frac{2.5^4}{4!}\right) = .8912.$$
(ii) $$P(X > 6) = 1 - P(X \le 6) = 1 - \sum_{x=0}^{6} \frac{e^{-2.5} 2.5^x}{x!}
= 1 - e^{-2.5} \left(1 + 2.5 + \frac{2.5^2}{2!} + \cdots + \frac{2.5^6}{6!}\right) = .0142.$$
(b) During the 2-minute period, let Y be the number of telephone calls.
It is a Poisson random variable with mean $2.5 \times 2 = 5$ calls.
Probability of receiving an odd number of calls
= P(Y takes on an odd number)
$$= P(Y = 1, 3, 5, \ldots) = P(Y = 1) + P(Y = 3) + P(Y = 5) + \cdots
= e^{-5} \left(5 + \frac{5^3}{3!} + \frac{5^5}{5!} + \cdots\right).$$
Recall that
$$e^{5} = 1 + 5 + \frac{5^2}{2!} + \frac{5^3}{3!} + \cdots \qquad (1)$$
$$e^{-5} = 1 - 5 + \frac{5^2}{2!} - \frac{5^3}{3!} + \cdots \qquad (2)$$
(1) - (2) gives
$$e^{5} - e^{-5} = 2 \left(5 + \frac{5^3}{3!} + \frac{5^5}{5!} + \cdots\right).$$
Multiplying both sides by $\frac{e^{-5}}{2}$, we get
$$\frac{1 - e^{-10}}{2} = e^{-5} \left(5 + \frac{5^3}{3!} + \frac{5^5}{5!} + \cdots\right).$$
Note that LHS $\approx \frac{1}{2}$ as $e^{-10} \approx 0$. (LHS = .4999773)
Therefore RHS $\approx \frac{1}{2}$, which is the probability of receiving an odd
number of calls.
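(c) (1) + (2) gives
$$e^{5} + e^{-5} = 2 \left(1 + \frac{5^2}{2!} + \frac{5^4}{4!} + \cdots\right).$$
Similarly to (b), we get
$$\frac{1 + e^{-10}}{2} = e^{-5} \left(1 + \frac{5^2}{2!} + \frac{5^4}{4!} + \cdots\right).$$
It follows that RHS $\approx \frac{1}{2}$, i.e.
$$P(Y = 0) + P(Y = 2) + P(Y = 4) + \cdots \approx \frac{1}{2}.$$
Although 0 is an even number, Y = 0 means that no call is received at all.
If this outcome is not counted as receiving an even number of calls, then
$$P(Y = 2) + P(Y = 4) + \cdots \approx \frac{1}{2} - P(Y = 0) = .5 - .0067 = .4933.$$
In that case the probability of receiving an even number of calls is slightly
less than $\frac{1}{2}$.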
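The series arguments in (b) and (c) can be checked by summing the Poisson probabilities directly. In this sketch the helper name `poisson_pmf` and the truncation of the infinite sums at 100 terms are assumptions; the remaining tail is negligible for a mean of 5.

```python
from math import exp, factorial

def poisson_pmf(y: int, lam: float) -> float:
    """P(Y = y) for a Poisson random variable with mean lam."""
    return exp(-lam) * lam**y / factorial(y)

lam = 5        # mean number of calls in the 2-minute period
terms = 100    # truncation point for the infinite sums (an assumption)

p_odd = sum(poisson_pmf(y, lam) for y in range(1, terms, 2))
p_even_with_zero = sum(poisson_pmf(y, lam) for y in range(0, terms, 2))
p_even_without_zero = p_even_with_zero - poisson_pmf(0, lam)

print(p_odd, (1 - exp(-10)) / 2)              # direct sum vs closed form
print(p_even_with_zero, (1 + exp(-10)) / 2)   # direct sum vs closed form
print(round(p_even_without_zero, 4))          # value with the outcome 0 excluded
```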
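(d) Let Y be the number of calls per minute “in English”.

If Y = 2, then $X \ge 2$. By the theorem of total probability on P. 68,
$$P(Y = 2) = \sum_{x=2}^{\infty} P(X = x) P(Y = 2 / X = x)$$
$$= \sum_{x=2}^{\infty} \frac{e^{-2.5} 2.5^x}{x!} \, C_2^x \left(\frac{1}{5}\right)^2 \left(\frac{4}{5}\right)^{x - 2}
\qquad \left[\, Y \sim b\left(x, \tfrac{1}{5}\right) \text{ when } X = x \,\right]$$
$$= \sum_{x=2}^{\infty} \frac{e^{-2.5} 2.5^x}{x!} \cdot \frac{x!}{2!\,(x - 2)!}
\left(\frac{1}{5}\right)^2 \left(\frac{4}{5}\right)^{x - 2}$$
$$= \sum_{x=2}^{\infty} \frac{e^{-2.5} 2.5^2}{2!\,(x - 2)!} \left(\frac{1}{5}\right)^2
\left(2.5 \cdot \frac{4}{5}\right)^{x - 2}$$
$$= \frac{e^{-2.5}}{2} \left(\frac{2.5}{5}\right)^2 \sum_{x=2}^{\infty} \frac{2^{x - 2}}{(x - 2)!}$$
$$= \frac{e^{-2.5}}{2 \cdot 4} \sum_{x=2}^{\infty} \frac{2^{x - 2}}{(x - 2)!}$$
$$= \frac{e^{-2.5}}{2 \cdot 4} \left(1 + 2 + \frac{2^2}{2!} + \frac{2^3}{3!} + \cdots\right)$$
$$= \frac{e^{-2.5}}{2 \cdot 4} \, e^{2} = \frac{e^{-.5}}{8} = .0758.$$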
Note:
We can go further to prove that
$$P(Y = y) = \sum_{x=y}^{\infty} P(X = x) P(Y = y / X = x) = \frac{e^{-.5} (.5)^y}{y!}, \quad y = 0, 1, 2, \ldots,$$
i.e. Y has a Poisson distribution with mean 0.5.
The steps of the proof parallel those above.
$$\sum_{x=y}^{\infty} P(X = x) P(Y = y / X = x)
= \sum_{x=y}^{\infty} \frac{e^{-2.5} 2.5^x}{x!} \, C_y^x \left(\frac{1}{5}\right)^y \left(\frac{4}{5}\right)^{x - y}$$
$$= \sum_{x=y}^{\infty} \frac{e^{-2.5} 2.5^x}{x!} \cdot \frac{x!}{y!\,(x - y)!}
\left(\frac{1}{5}\right)^y \left(\frac{4}{5}\right)^{x - y} = \cdots$$
Further reduction is left to the student.
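Before carrying out the algebra, the claim in the note can be tested numerically: summing over the total number of calls should reproduce the Poisson probabilities with mean 0.5. The function names and the truncation at 80 terms are assumptions made for this sketch.

```python
from math import comb, exp, factorial

def poisson_pmf(x: int, lam: float) -> float:
    """P(X = x) for a Poisson random variable with mean lam."""
    return exp(-lam) * lam**x / factorial(x)

def english_pmf(y: int, lam: float = 2.5, q: float = 0.2, terms: int = 80) -> float:
    """P(Y = y) by total probability: sum over x of P(X = x) P(Y = y / X = x),
    where Y given X = x is b(x, q). The infinite sum is truncated after `terms` terms."""
    return sum(
        poisson_pmf(x, lam) * comb(x, y) * q**y * (1 - q)**(x - y)
        for x in range(y, y + terms)
    )

print(round(english_pmf(2), 4))          # value found in part (d)
for y in range(5):
    # The two columns should agree: total-probability sum vs Poisson with mean 0.5.
    print(y, english_pmf(y), poisson_pmf(y, 0.5))
```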
IV. The Geometric Distribution
Suppose independent Bernoulli trials with probability of success p are
conducted. Let X be the number of such Bernoulli trials until we get a
success. Then X is said to have a geometric distribution with parameter p.
Observe that there are $X - 1$ failures preceding a success. Thus the
probability function of X is
$$P(X = x) = (1 - p)^{x - 1} p, \quad x = 1, 2, \ldots.$$
Its mean is $\frac{1}{p}$ and variance is $\frac{1}{p} \left(\frac{1}{p} - 1\right)$.
Example 1
An American roulette wheel commonly has 38 spots on it, of which 18 are
black, 18 are red and 2 are green. Let X and Y be the number of spins
necessary to observe the first red and the first green number respectively.
Find the probability functions, means and variances of X and Y.

[Solution]
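The probability of getting a red number is $\frac{18}{38}$, while that of getting a
green number is $\frac{2}{38}$.
Then we have
(i) $$P(X = k) = \left(\frac{20}{38}\right)^{k - 1} \left(\frac{18}{38}\right)
= \left(\frac{10}{19}\right)^{k - 1} \left(\frac{9}{19}\right), \quad k = 1, 2, \ldots$$
$$\mu_X = \frac{38}{18} = \frac{19}{9}, \quad
\sigma_X^2 = \frac{38}{18} \left(\frac{38}{18} - 1\right) = \frac{38}{18} \cdot \frac{20}{18} = \frac{190}{81}.$$
(ii) $$P(Y = k) = \left(\frac{36}{38}\right)^{k - 1} \left(\frac{2}{38}\right)
= \left(\frac{18}{19}\right)^{k - 1} \left(\frac{1}{19}\right), \quad k = 1, 2, \ldots$$
$$\mu_Y = \frac{38}{2} = 19, \quad \sigma_Y^2 = 19(19 - 1) = 342.$$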
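The means and variances for the roulette example follow directly from the formulas for the geometric distribution. The sketch below evaluates them with exact fractions; the function names are hypothetical and introduced only for illustration.

```python
from fractions import Fraction

def geometric_pmf(k, p):
    """P(first success occurs on trial k)."""
    return (1 - p) ** (k - 1) * p

def geometric_mean_var(p):
    """Mean 1/p and variance (1/p)(1/p - 1) of the geometric distribution."""
    mean = 1 / p
    return mean, mean * (mean - 1)

p_red = Fraction(18, 38)
p_green = Fraction(2, 38)

red_stats = geometric_mean_var(p_red)
green_stats = geometric_mean_var(p_green)

print(red_stats)      # mean and variance of X (first red)
print(green_stats)    # mean and variance of Y (first green)
print(geometric_pmf(1, p_red), geometric_pmf(1, p_green))
```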
Example 2
A fair die is rolled until a 6 occurs. Compute the probability that

(a) 10 rolls are needed.
(b) fewer than 4 rolls are needed.
(c) an odd number of rolls is needed.

[Solution]
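Let X be the number of rolls needed.
$$P(X = x) = \left(\frac{5}{6}\right)^{x - 1} \frac{1}{6}, \quad x = 1, 2, \ldots.$$
(a) $$P(X = 10) = \left(\frac{5}{6}\right)^{9} \frac{1}{6} = .0323.$$
(b) $$P(X < 4) = P(X = 1) + P(X = 2) + P(X = 3)
= \frac{1}{6} + \frac{5}{6} \cdot \frac{1}{6} + \left(\frac{5}{6}\right)^2 \frac{1}{6} = .4213.$$
(c) $$P(X \text{ is an odd number}) = P(X = 1) + P(X = 3) + P(X = 5) + \cdots$$
$$= \frac{1}{6} + \left(\frac{5}{6}\right)^2 \frac{1}{6} + \left(\frac{5}{6}\right)^4 \frac{1}{6} + \cdots$$
$$= \frac{1}{6} \left[1 + \left(\frac{5}{6}\right)^2 + \left(\frac{5}{6}\right)^4 + \cdots\right]$$
$$= \frac{1}{6} \cdot \frac{1}{1 - \left(\frac{5}{6}\right)^2} = \frac{1}{6} \cdot \frac{36}{36 - 25} = \frac{6}{11}.$$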
To be continued
