Unit 5
Unit 5
Unit 5
Random Variable
A random variable provides a means of assigning numerical values to experimental outcomes.
The definition of a random variable is as follows:
Notation: Random variables are usually denoted by capital letters like X, Y, Z, etc.
The sample space is S= {H, T} where H denotes the outcome ‘Head’ and T denotes the
outcome ‘Tail’. So, there are two possible outcomes H or T. Now, let the random variable
X represents the outcome `Head’, then X can take the value 0 or 1.
The sample space of this experiment constitutes six possible outcomes, S = {1, 2, 3, 4, 5, and 6}
Let the random variable Y denotes the outcome ‘A number greater than 2 occurs’. Then
the random variable can assume the values 3, 4, 5 or 6.
Examples 3: Consider the experiment of rolling two fair dice once simultaneously.
If the random variable T indicates the outcome `the sum of the numbers on the two dice is greater
than 10, then T can take the pairs (5, 6), (6, 5) or (6, 6) since in each of these cases the sum of
the numbers is greater than 10.
Types of Random Variables: As stated above, a random variable provides a means of
associating a numerical value with each possible experimental outcome. Depending upon the
numerical values it can assume, a random variable can be classified into two major divisions.
A. Discrete Random Variable: data related to numerical responses that arise from a counting
process. It is a random variable that may assume either a finite number of values or an infinite
sequence (e.g. 1, 2, 3…) of values. In general, a discrete random variable takes whole number
values, which can be counted or enumerated. Example: The number of students who are enrolled
Page |1
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |2
for a degree program in Wollega University, the number of defective batteries observed in
assessing its quality, the numbers of customers who visit a shop during one day of operation are
all examples of discrete random variables.
B. Continuous Random Variable: data related to numerical responses that arise from measuring
process. It is a random variable which may take on all values in a certain interval or collection
of intervals. A Continuous random variable, as the name implies, assumes all possible values
between any two values. Example: Weight, time, temperature, etc are example of continuous
random variable.
Example 1: Construct a probability distribution for the number of heads in tossing two fair coins
simultaneously.
Let the random variable X denotes the ‘number of heads’. We then use the probability function
P(X) to assign probability to each outcome consequently; the probability distribution is given
below:
Outcome, X 0 1 2
The probability distribution shows that the probability that the random variable can assume the
value 0 is ¼, the value 1 is ½ and the value 2 is ¼. Note that the sum of these probabilities is 1.
Page |2
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |3
Example 2: The number of mistakes a typist made in ten days of assessment is shown in the
following table.
No of mistakes 2 3 4 5
No of days 1 4 3 2
No of mistakes, X 2 3 4 5
For instance, in the above example, these two conditions are satisfied since
P(X) = P (2) + P (3) + P (4) + P (5) = 0.1+ 0.4 + 0.3 + 0.2 = 1 and each of these probabilities
is greater than or equal to 0 and less than or equal to 1.
For some discrete random variables, the probability distribution can be given as a formula that
yields (x) for every possible value of x.
Example 3: Suppose a probability distribution is given by the formula:
Page |3
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |4
Construct the probability distribution correspondence.
Outcome, x 0 2 3
Probability, (x) 0/5 2/5 3/5
The above formula shows that in order to compute the expected value of a discrete random
variable, we must multiply each value of the random variable by the corresponding probability
P(x) and then add the resulting products.
Variance: While the expected value provides the mean value for the random variable, we
often need a measure of dispersion, or variability, for the random variable. The mathematical
expression for the variance of a discrete random variable is as follows:
i 1 i 1
2
and the standard deviation is ó ó
Example 1: If three fair coins are tossed, find the expected number of heads that will occur and
obtain the variance.
Solution: Begin by constructing the probability distribution for the number of heads in tossing
the three coins.
Page |4
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |5
The probability distribution is constructed below:
No of heads, x 0 1 2 3
Probability, P(x) 1/8 3/8 3/8 1/8
Then,
4
E(x) = i 1
xi. P (xi) = xi P(x1) + x2. P(x2) + x3.P(x3) + x4.P(x4)
The theoretical mean = 1.5 implies that if the experiment is done as many times as possible,
then on the average a head occurs 1.5 of the time.
4
2 =
i 1
[(xi-) 2·P (xi)]
= (x1 - ) 2 · P(x1) + (x2 - )2· P(x2) + (x3- )2 · P(x3) + (x4- )2 ·P(x4)
2 = 0.5
Page |5
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |6
In a Binomial experiment, the probability of exactly x successes in n trials is given by:
P x
n!
. p x . q n x
n x ! x!
Example 1: Consider the experiment of tossing a coin three times. Show that it is a binomial
experiment and find the probability of getting exactly two heads.
Solution: This is a binomial experiment since
i) There are only two outcomes, head and tail.
ii) The number of trials is fixed (three)
iii) The probability of success, getting a head, does not change from trial to trial.
iv) The trials or tosses are independent, since the outcome of any trial is not affected by
the outcome of any other trial
Now, to find the probability of getting two heads, let p denotes the probability of getting a head
on a single toss.
n = 3, x = 2
P x
n!
. p x . q n x
n x ! x!
2
3! 1 1 32 3! 1 1
P (2) . . . .
3 2!2! 2 2 1!2! 4 2
3
= = 0.375
8
Page |6
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |7
Example 2: A new drug is effective 60% of the time. What is the probability that in a random
sample of 4 patients, it will be effective on two of them?
Solution: This is a Binomial experiment as the points of the experiment are satisfied. Define
‘effective’ as ‘successes and ‘non effective’ as ‘failure’. Then,
p = 0.6, q = 1 - 0.6 = 0.4, n = 4, x=2
Required p (2) =?
4 2 !2!
Hence, the drug will be effective on two of a random sample of 4 patients with a probability of
0.3456 (or 34.56%).
The mean, variance and standard deviation of a variable that has the Binomial
distribution is found as:
Mean =n·p
Variance 2 = n·p·q
Standard deviation = npq
Example1: A coin is tossed four times. Find the mean, variance and SD of the number of heads
that will be obtained.
Solution:
Here n = 4, p = 1/2, and q = 1/2
= n *p = 4 * ½ = 2
2 = n * p *q = 4 * 1/2 *1/2 = 1
= 2 = 1= 1
Example 2: A die is rolled 240 times. Find the mean, variance and standard deviation for the
number of 3’s that will be rolled.
Solution:
n = 240, P=1/6
Page |7
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |8
= n * p = 240(1/6) = 40
2 = n * p * q = (24)(1/6)(5/6) 33.33
= 33.33 5.77
( kCx ) . ( N kCn x )
Px , Where k is the number of successes in the population.
NC n
Example: suppose a researcher goes to a small college of 200 faculties, 12 of which have blood
type O-negative. She obtains a simple random sample of n=20 of the faculty. Let the random
variable x represent the number of faculty in the sample of size n= 20 that have blood type o-
negative?
A) What is the probability that 3 of the faculty have blood type O-negative?
B) What is the probability that at least one of the faculties has blood type O-negative?
Solution: A). we are looking for the probability of obtaining 3 successes, so x = 3.
If we conducted this experiment 100 times, we would expect to select 3 faculties that have
blood type O-negative about 8 times.
B). the phrase at least means greater than or equal to. The values of the random variable that
are greater than or equal to 1 are 1, 2, 3… 12. P(x≥ 1)= 1-P (0).
Page |8
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
Page |9
(12C 0) . (200 12C 20 0) (12C 0) . (188C 20)
P(x≥ 1) = 1-P (0) = 1- = 1- = 0.7282
200C 20 200C 20
There is a 0.7282 probability that, in a random sample of 20 faculties, at least 1 has blood type
O-negative. If we conducted this experiment 100 times, we would expect to select at least one
of the faculties that have blood type O-negative about 73 times.
Exercise: suppose that a machine shop orders 500 bolts from a supplier. To determine whether
to accept the shipment of bolts, the manager of the facility randomly selects 12 bolts. If none of
the 12 randomly selected bolts is found to be defective, he concludes that the shipment is
acceptable.
a). if 10% of the bolts in the population are defective, what is the probability that none of the
selected bolts are defective?
b). if 20% of the bolts in the population are defective, what is the probability that none of the
selected bolts are defective?
A discrete probability distribution that is useful when n is large and p is small and when the
independent variables occur over a period of time is called the Poisson probability distribution.
It mainly concerns about the probability distribution of the number of events occurring within a
time interval or space.
Examples:
i) The probability of an occurrence is the same for any two intervals of equal length.
ii) The occurrence or non-occurrence in any interval is independent of the occurrence or
non-occurrence in any other interval.
Page |9
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
P a g e | 10
The Poisson probability function
e . x
Px ;
x!
Where P(x, λ) is the probability of x occurrences in an interval of time, volume, area
etc for a variable, λ denotes the mean number of occurrences and e 2.7183
Example1: Past police records indicate a mean of five accidents per month while investigating
the safety of a dangerous intersection. The number of accidents is distributed according to the
probability in any month of
a) Exactly 3 accidents.
b) Fewer than 2 accidents.
Solution: By assumption the given distribution is a Poisson probability distribution.
Given that =5
x . e
a) P x
x!
x=3
P3
5 . 2.7183
3 5
125 0.00674
3! 6
= 0.1404
b) Fewer than 2 accidents comprise 0 and 1 accident during any month.
P0 P1
5 . 2.7183
0 5
5 2.7183
1 5
0! 1!
0.0674 + 0.3370
0.4044
Example 2: If there are 200 typographical errors randomly distributed in a 500-page manuscript,
find the probability that a given page contains exactly 3 errors.
Solution: First of all, find the mean number of errors
200
0 .4
500
P a g e | 10
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
P a g e | 11
Since x = 3,
Thus, there is less than a 1% probability that a give page contains less than 3 errors.
The shape and position of the normal distribution curve depends on two parameters, the mean
and the standard deviation. Each normally distributed variable will have its own normal
distribution curve.
P a g e | 11
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
P a g e | 12
Properties of the Normal Probability Distribution
The normal distribution curve is bell-shaped.
The mean, median and mode are equal and are located at the center of the distribution.
The curve is Uni-modal
The curve is symmetrical about the mean.
The curve is continuous and never touches the x-axis
The total area under the normal distribution curve is equal to 1, or 100%
The area under the normal curve that lies within one SD of the mean is approximately
0.68 or 68%, within two SD’s about 0.95 or 95% and within three SD’s about 0.997 or
99.7%.
The mathematical equation of the normal probability distribution is defined by the probability
density function.
x 2
f x
1
e 2 2 3.14159
2
P a g e | 12
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
P a g e | 13
Area Under the Normal Curve
As with other continuous random variables, probability calculations with any normal probability
distribution are made by computing areas under the graph of the probability density function.
Thus, to find the probability that a normal random variable lies within any specific interval, we
must, compute the area under the normal curve over that interval. For the standard normal
probability distribution, areas under the normal curve have been computed and are available in
tables that can be used in computing probabilities.
For the solution of problems using the normal distribution, the following steps are used.
1. Draw a picture
2. Transform the given value to z-value
3. Shade the area desired
4. Read the area from the standard normal distribution table.
Example 1: Find the area under the normal curve between z=0 and z=2.34
Solution:
0 2.34
The standard normal curve Representation is shown: From the table the intersection
of z = 2.3 with 0.04 give 0.4904 or 49.04% which is the required area.
Example 2 : Find the area under the normal distribution curve between z = -1.93 and z = 2.35
Solution:
-1.93 0 2.35
For easy look, draw the normal curve and locate the two z-scores.
The total area (the shaded region) is the area between –1.93 and 0 plus the
area between 0 and 2.35;
P a g e | 13
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
P a g e | 14
Hence, from the normal distribution table
Area = 0.4732 + 0.4906 = 0.9638 or 96.38%. Note that it is equivalent to say that the probability
of the z-value lying between z = -1.93 and z = 2.35 is 96.38%. This can also be written as:
P (-1.93 < z > 2.35) = 0.9638
Example 3: Find the probability that the z-value of a normally distributed variable lies to the
left of 1.65
Solution
0 1.65
normal curve, which is to the left of 1.65 Hence, total area = area to the left of 0
plus area between 0 and 1.65 = P(z < 1.65)
= 0.5000 + 0.4505 = 0.9505 or 95.05% which is required probability.
Solution
i.e 0 1.91
P (Z > 1.91) = P(z > 0) - P(0 < z < 1.91)
P a g e | 14
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
P a g e | 15
Example 1: The scores for an IQ test are normally distributed with a mean of 100 and a standard
deviation of 15. Find the percentage of IQ scores that will fall below 112.
Solution
100 112
0 0.8
Step 2: Find the z-value
Corresponding to an IQ Score 112.
Z = x - = 112 – 100 = 0.8
115
Step3: From the table,
P (z < 0.8) = P (z < 0) + P (0 < z < 0.8) = 0.5000 + 0.2881 = 0.7881
Hence, 78.81% of the IQ scores fall below 112.
Example2: The monthly salaries of 2000 workers are normally distributed with a mean of birr
550 and of workers whose monthly salaries are
600 550
Z 0.625
80
P a g e | 15
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU
P a g e | 16
Example 3 A colleges’ desires to accept only the top 10% of all graduating seniors on the basis
of the results of a national placement test. The test has a mean of 500 and a standard deviation of
100. Find the cut-off score for the exam.
Solution:
The area is shown.
500 x
0 z
We need to determine the point on the axis that cuts the upper 10% of the area. Let it be
denoted by x.
From the table, the z – value that corresponds to the area 0.4000 is approximately 1.28.
x 500
Then, 1.28 x 628
100
Hence the score 628 should be used as a cut –off score. Any student scoring below 628 should
not be admitted.
P a g e | 16
Compiled by Temesgen Desta (Assist. Professor), Dep’t of Management at WU