Foundations of Quantitative Methods
Foundations of Quantitative Methods
Foundations of Quantitative Methods
Quantitative Methods
FedUni
#52, Nagarjuna Hills, Hyderabad – 500 082
© FedUni, March 2005. All rights reserved.
No part of this publication may be reproduced, stored in a retrieval system, used in
a spreadsheet, or transmitted in any form or by any means – electronic,
mechanical, photocopying or otherwise – without prior permission in writing from
The Federation of Universities (FedUni)
For any clarification regarding this book, please write to FedUni giving the reference number of the book, and the
page number.
While every possible care has been taken in preparing the book, FedUni welcomes suggestions from students for
improvement in future editions.
Contents
Chapter 1 Quantitative Methods: An Overview 1
Chapter 3 Calculus 35
Appendix 156
Introduction to Quantitative Methods
Detailed Contents
Chapter 1: Quantitative Methods - An Overview: Definition of Quantitative
Methods, Evolution of Quantitative Methods, Classification of Quantitative Methods,
Methodology of Quantitative Methods, Advantages of Quantitative Methods,
Limitations of Quantitative Methods
Chapter 2: Basics of Mathematics: Number Systems, Permutations and
Combinations, Logarithms, Progressions, Functions, Solving Simultaneous Equations
Chapter 3: Calculus: Limits, Derivatives: First order and Higher Order Derivatives,
Rules of Differentiation, Partial Differentiation, Optimization, Integral Calculus:
Indefinite integral
Chapter 4: Arranging Data: Meaning of Data, Types of Data, Data Collection,
Tables and Graphs as Data Presentation Devices, Frequency Distribution
Chapter 5: Measures of Central Tendency: Objectives of Averaging, Requisites of
a Good Average, Types of Averages, Mathematical Averages, The Median, The Mode
Chapter 6: Measures of Dispersion: Range, Inter Quartile Range and Quartile
Deviation, Mean Deviation, Variance, Standard Deviation, Bienayme Chebyshev’s
Rule
Chapter 7: Index Numbers: Definition of Index Number, Characteristics of Index
Numbers, Uses of Index Numbers, Types of Index Numbers, Problems Related to
Index Numbers, Methods of Constructing Index Numbers, Quantity and Value Indices
Chapter 8: Introduction to Probability and Probability Distribution: Basic
probability Concepts, Types of Probability, Probability Rules, Random Variables,
Types of Probability Distributions, The Normal Distribution
Chapter 9: Matrices: Definition of Matrix, Types of Matrices, Operations on
Matrices, Determinant of a Matrix , Properties of Determinants, Adjoint of a Matrix ,
Inverse of a Matrix , Application of Matrix Methods to Solve Linear Equations
Chapter 10: Interpolation and Extrapolation: Assumptions of Interpolation and
Extrapolation, Applications of Interpolation and Extrapolation, Methods of
Interpolation and Extrapolation
Chapter 1
Quantitative Methods: An
Overview
In this chapter we will discuss:
Evolution of Quantitative Methods
Classification of Quantitative Methods
Advantages of Quantitative Methods
Limitations of Quantitative Methods
Foundations of Quantitative Methods
2
Quantitative Methods: An Overview
Apart from the above advantages, quantitative methods like probability, variance
analysis, time series analysis, and index numbers are other efficient tools used by
managers to solve their organizational problems. These methods are discussed in
greater detail in the later chapters.
SUMMARY
4
Chapter 2
Basics of Mathematics
In this chapter we will discuss:
• Number Systems
• Permutations and Combinations
• Logarithms
• Progressions
• Functions
• Simultaneous Equations
Foundations of Quantitative Methods
Before introducing new concepts of quantitative methods, let us revisit a few basic
concepts in Mathematics. The decimal, binary and octal number systems along with the
conversions are discussed in the chapter. The concepts of permutations and
combinations are used while calculating probability of a random event. The concept of
progression is used to understand the trend of a data series. The three types of
progression: arithmetic, geometric and harmonic progression is discussed in detail.
The relationship between the dependent and independent variable can be linear,
exponential, logarithmic or inverse. The concept of functions discusses these
relationships in detail.
NUMBER SYSTEMS
The two number systems that we generally come across are decimal system (0, 1, 2, 3,
4, 5, 6, 7, 8, 9) and Roman system (I, II, III, IV, V...). But there are several other
number systems such as binary, octal, hexadecimal etc. The binary number system,
which contains only two digits 0 and 1, is widely used in electronic devices. The two
states of an electronic device ‘ON’ or ‘OFF’ are denoted by 1 and 0 respectively. John
Von Neumann suggested in 1946 that a computer should also use this kind of two way
binary system. Such a system is known as John Von Neumann architecture and most of
the computers use this architecture. Similar to the case of an electronic device when the
switch is said to be ‘ON’ or ‘OFF’, a computer uses two signals - pulse and no pulse.
Pulse is denoted by the digit ‘1’ and no pulse is denoted by the digit ‘0’. The other
number systems are octal number system based on the digits 0 to 7, the hexadecimal
number system based on the digits 0 to 9 and the letters A to F. The decimal system is
the most commonly used number system.
Decimal System
The decimal system comprises of ten basic symbols or digits (0, 1, 2, 3, 4, 5, 6, 7, 8
and 9) and is based on ‘tens’ i.e., each of the digit is a number and has a value or
importance ten times greater than the digit just to its right.
For example, 9875 means reading from the right
5 lots of 1(units) = 05
7 lots of 10 (tens) = 70
8 lots of 100 (hundreds) = 800
9 lots of 1000 (thousands) = 9000
Total = 9875
Each symbol in decimal system has its own absolute value and place value. In the
above example, 9 have got two values, i.e., one its absolute value which is 9 and its
place value which is 9000. Similarly the numbers 8, 7 and 5 have two values; 8, 7 and
5 as absolute values and 800, 70 and 5 as place values respectively. Number 10, which
is called the base or decimal system, is not a basic digit in this system. It is a result of
two digits 1 and 0 placed in order after the last digit of number system i.e. 9.
Thus, 10 = 1 ten + 0 unit
Similarly,
086 = 0 hundreds + 8 tens + 6 units
864 = 8 hundreds + 6 tens + 4 units
As the decimal system has number 10 as its base or radix, the position of the digit in
this system (base to system) determines the magnitude of the numbers. For example the
numbers 86 and 864 can be expressed in the base to system as
6
Basics of Mathematics
7
Foundations of Quantitative Methods
Fractional Numbers
The discussion in this section would be on fractional numbers both in binary number
system and decimal number system.
A fraction such as 0.7 in the decimal number system is written as 7/10. This can also
be written as 7×10-1. Similarly 0.72 can be written as 72/100 = 72 × 10-2 = 7 × 10-1 + 2
×10-2.
The (.) dot is read as ‘decimal point’ in the decimal system and ‘binary point’ in binary
system.
Conversion of fractional binary to decimal number
Convert the number (0.110101)2 to its decimal equivalent.
(0.110101)2 = 1×2-1 + 1×2-2 + 0×2-3 + 1×2-4 + 0×2-5 +1×2-6
= 0.5 + 0.25 + 0 + 0.0625 + 0 + 0.015625
= (0.828125)10
Convert the number (101.11)2 to its decimal equivalent.
(101.11)2 = (1×22 + 0×21 +1×20) . (1×2-1+1×2-2)
= (4 + 0 + 1) . (0.5 + 0.25)
= (5.75)10
9
Foundations of Quantitative Methods
Solution:
0.887651 × 2
1 0.775302 × 2
1 0.550604 × 2
1 0.101208 × 2
0 0.202416 × 2
0 0.404832 × 2
0 0.809664
The nearest equivalent is (0.111000)2
Let us verify and see
(0.111000)2 = 1×2-1 + 1×2-2 + 1×2-3 + 0×2-4 + 0×2-5 + 0×2-6
= 0.5 + 0.25 + 0.125 + 0 + 0 + 0
= 0.875 ≅ 0.887
The number (0.111000)2 is not exactly (0.887651)10 but is approaching it. If the
division is further carried out successively the number obtained may be approximately
the same as the original number. The number obtained is not exact because fractions
other than 1/2, 1/4, 1/8 ...1/2n cannot become zero after the binary point as in the above
example.
Example 2.6
Add the two binary numbers (111)2 and (101)2.
Solution:
Carry over (1) (1)
1 1 1
+ 1 0 1
11 0 0
(111)2 + (101)2 = (1100)2
10
Basics of Mathematics
Solution:
Quotient Remainder
8 879 7
8 109 5
8 13 5
8 1 1
0
Thus the integral part of (879)10 = (1557)8
Conversion of octal into decimal
The decimal value of an octal number is equal to the sum of the decimal values of the
octal digits.
Example 2.8
Convert (357)8 to its decimal equivalent.
Solution:
3578 = 3 × 82 + 5 × 81 + 7 × 80
= 3 ×64 + 5×8 + 7×1
= 192 + 40 + 7
= (239)10
Each of the arrangements that can be made by taking some or all of a number of
things/objects is called a permutation. Suppose there are four distinctive things w, x, y
and z. The permutations that can be made by taking two things at a time from the given
four things are each of these presenting a different arrangement of two letters.
wx wy wz xy xz xw
yz yw yx zw zx zy
Similarly, each of the groups or selections that can be made by taking some or all of a
number of things/objects is called a combination. Consider the same example as above.
The combinations that can be made by taking two things at a time from the given four
things are as below.
wx, wy, wz, xy, xz and yz
It is quite important to understand the difference between a permutation and a
combination. To form a combination we are concerned with the number and kind of
things/objects each group or selection contains; whereas to form a permutation we also
have to consider the order of the things that make up each arrangement.
Suppose there are four distinctive things w, x, y, z and a selection of three things out of
the given four at a time are
wxy, wyz, wxz and xyz
12
Basics of Mathematics
This is a combination. But for each combination above, we have six different
arrangements. The arrangements for the first combination are as follows:
wxy, wyx, xyw, xwy, ywx and yxw
Thus, permutation takes the order of things into consideration other than the number
and kind of things /objects.
Notation
The number of permutations of ‘n’ dissimilar thing taken ‘r’ at a time is denoted by nPr.
The number of combinations of ‘n’ dissimilar thing taken ‘r’ at a time is denoted by
n
Cr. Before going into the details of permutations and combinations, a function
principle is to be explained.
If one operation can be performed in ‘m’ different ways, (when it has been performed
in any one of these ways) and if another operation can then be performed in ‘n’
different ways then the two operations in succession can be performed in (m×n) ways.
Example 2.12
Find the numerical value of 10 P5 .
Solution:
n
Pr = n!
( n − r )!
10 10!
P5 =
(10 − 5)!
10 × 9 × 8 × 7 × 6 × 5!
=
5!
= 30240
Example 2.13
10
Find the numerical value of C5 .
Solution:
n n!
Cr =
r! (n − r )!
10 10!
C5 =
5!(10 − 5)!
10 × 9 × 8 × 7 × 6 × 5!
=
5!×5!
10 × 9 × 8 × 7 × 6
=
5 × 4 × 3 × 2 ×1
= 252
13
Foundations of Quantitative Methods
Example 2.14
Consider three persons (say A, B and C) coming to attend an interview in which there
are four panels (say 1, 2, 3 and 4). In how many ways can they be assigned to a panel
each at a different one?
Solution:
The first candidate, ‘A’ can be assigned to any of the four panels - 1, 2, 3, or 4. When
A has been assigned to any one of these panels, the second candidate, ‘B’ can be
assigned to any of the remaining three panels. The third candidate, ‘C’ can be assigned
to any of the remaining two panels. Thus, once A has been assigned in 4 ways, B has
been assigned in 3 ways and C in 2 ways, the total number of ways in which these
candidates can be assigned in 4×3×2 = 24.
Example 2.15
Consider 5 taxis running between Delhi and Agra. In how many ways can a person go
from Delhi to Agra and return by a different taxi?
Solution:
A person can go from Delhi to Agra in any of the 5 taxis (i.e. in five ways); and with
each of these there is a choice of four ways of returning (since the person is not to
come back by the same taxi); hence the number of ways of making the two journeys is
5×4 = 20 ways.
The number of permutations of n dissimilar things taken r at a time is given by,
n
Pr = n!
( n − r )!
The number of combinations on ‘n’ dissimilar things taken ‘r’ at a time is given by
n n!
Cr =
r! (n − r )!
14
Basics of Mathematics
Example 2.17
In how many ways can 6 balls be selected from a box containing 10 balls?
Solution:
The number of balls in the box=n=10.
We have to select 6 balls out of the 10. Hence r=6.
10! 10!
Number of ways of selecting 6 balls out of 10 =10C6= = =210.
(10 − 6)!×6! 4!×6!
Example 2.18
How many different numbers can be formed by using four out of the seven digits 1, 2,
3, 4, 5, 6, and 7?
Solution:
The number of dissimilar elements = n = 7 and we need to find out the number of
permutations if four of them are selected at a time = r = 4.
nP = 7 P4 = 7! 7! 7 × 6 × 5 × 4 × 3! = 840
r = =
(7 − 4)! 3! 3!
Example 2.19
n
P n +1 P7
If 6 : = 1:8, then find n.
Solution:
n n!
P6 =
(n − 6)!
n +1 (n + 1)!
P7 =
(n − 6)!
n
P6 n! (n − 6)!
n +1
= ×
P7 (n − 6)! (n + 1)!
n! 1
= =
(n + 1)n! (n + 1)
n
P6 1 1
n +1
= =
P7 8 (n + 1)
n +1 = 8
n=7
LOGARITHMS
The ‘logarithm’ of any number to a given base is the index or the power to which the
base must be raised in order to equal the given number. Thus if N, a (≠1) being any two
real numbers and for some real x if N= ax, then x is said to be the logarithm of N to the
base a, and is written as log aN= x. For example 34 = 81, the logarithm of 81 to base 3
is 4.
Functions defined by such equations are called logarithmic functions. The domain of
logarithmic functions is the set of positive real numbers and the range is the set of real
numbers.
If N = ax
log aN= log aax
15
Foundations of Quantitative Methods
loga N= x log aa
⇒ loga N = x (Since, loga a =1)
Example 2.20
(4 2 )x = 645 4
1 2
∴ (2 2
× 2 2 )x = 2 6
×2 5
5x 32
∴2 2 = 25
As bases are equal, indices can be equated
5x 32
∴ =
2 5
64
⇒ x= = 2.56
25
Example 2.21
4
Find the logarithm of 81 3 to the base 9 3 .
Solution:
Let x be the required logarithm; then by definition.
( 9 3 )x = 814 3
5 17
∴ x= 1 x = 1.7
2 4
Any number can be taken as the base of logarithms, and corresponding to each base the
logarithm of a number can be found. But once it is understood that a particular base of
logarithms is in use, the base is omitted. For example, arithmetical calculations are
written as log 6, log 5 instead of writing it as log10 6and log10 5.
The logarithm of 1 to any base is zero, since any number to the power of zero equals 1
so log 1 to any base is zero.
The logarithm of any number (≠ 1) to itself as a base is always unity, since any number
to the power of one gives the number itself i.e. say a1 = a, so logaa =1.
16
Basics of Mathematics
3 9 81 2
log − log + log + 3 log
4 4 8 3
2 4
3 3 3 2
= log − log + log + 3 log
2 2 3 3
2 2 2
PROGRESSIONS
Progressions are of three types
• Arithmetic Progression
• Geometric Progression
• Harmonic Progression
Arithmetic Progression
Quantities are said to be in arithmetic progression (A.P.) when they increase or
decrease by a common difference. The common difference is calculated by subtracting
any term of the series from its immediate succeeding term. The following are two
examples for an arithmetic progression:
4, 8,12,16,20...
-5,-10,-15,-20,-25...
In the above example, the first series is increasing by 4 i.e., a common difference of +4
and the second series is decreasing by 5 i.e., a common difference of -5.
The general form of an arithmetic progression is
a, (a+d), (a+2d), (a+3d), (a+4d), ........ [a+(n-1)d]+......
Where the first term is ‘a’ and the common difference is‘d’. Common difference is the
difference between any two consecutive terms.
It can be noticed that in any term, the coefficient of‘d’ is always less by one than the
number of term in the series. For example,
2nd term = a + d
3rd term = a + 2d
10th term = a + 9d
18
Basics of Mathematics
Example 2.26
The sum of 20 numbers in arithmetic progression is 650. If the first term of the series is
4, what is the common difference between the terms?
Solution:
Sum of ‘n’ terms of an arithmetic progression is given by,
n[2a + (n − 1)d ]
Sn =
2
20[(2 × 4) + 19 × d ]
650 =
2
1300 = 20(8 + 19d)
1300
= 8 + 19d
20
(65-8)/19 = d
∴d = 3
Example 2.27
The sum of first 10 numbers in an arithmetic progression is 255. If the common
difference is 5, what is the first term of the series?
Solution:
Sum of ‘n’ terms of an arithmetic progression is given by,
n[2a + (n − 1)d ]
Sn =
2
10[2a + 9 × 5]
255 =
2
510 = 20a + 450
510 − 450
∴a =
20
a=3
Example 2.28
The 5th term of the arithmetic progression is 18 and the 10th term is 33. Find the 16th
term of the series.
Solution:
The nth term in an A.P. is given by
Tn= a + (n-1) d
Thus the 5th term is given by
18 = a + 4d --(A)
and the 10th term is given by
33 = a + 9d --(B)
Subtracting equation (B) from equation (A) gives
15 = -5d
∴d = 3
21
Foundations of Quantitative Methods
Geometric Progression
Quantities are said to be in geometric progression (G.P.) when they increase or
decrease by a constant factor. The constant factor is called the common ratio and is
calculated by dividing any term by the term that immediately precedes it.
For example,
9, 18, 36, 72...
½, -1, 2, -4, 8...
In the above example, the first series is multiplying by 2, a common ratio of +2 and the
second series is increasing and decreasing alternatively by -2, i.e., a common ratio of
-2.
The general form of a geometric progression is
a, ar, ar2, ar3 , ar4 ..........
where ‘a’ is the first term of the series and ‘r’ the common ratio (ar2/ar = r)
It can be noticed from the general series that the index of r is one less than the number
of the term in series:
For example, 2nd term = ar
3rd term = ar2
10th term = ar9
23
Foundations of Quantitative Methods
a (r n − 1)
S=
(r − 1)
a (55 − 1)
3905 =
(5 − 1)
15620
a=
3125 − 1
∴a = 5
24
Basics of Mathematics
Example 2.32
The third term of a geometric progression is 9 and the seventh term is 729. Find the
sum of the first 10 numbers.
Solution:
The third term of the geometric progression is given by ar2 and the seventh term is
given by ar6 .
ar 2 9
=
ar 6 729
1 1
=
r 4 81
∴r = 3
ar2 = 9
9
∴a = =1
32
The sum of terms in the geometric progression is given by
a (r n − 1)
Sn =
(r − 1)
1(310 − 1)
=
(3 − 1)
= 29524
Geometric Mean
Suppose three quantities are in G.P, then the middle one is said to be the geometric
mean (G.M.) between the other two.
∴ If a, ar, ar2 are in G.P. Then ‘ar’ is the G.M.
Suppose a, G, b are in G.P. Then
G/a = b/G (Common ratios are equal)
⇒ G2 = ab ⇒ G = ab
∴ The Geometric mean, G = ab
If ‘n’ geometric means are inserted between ‘a’ and ‘b’ such that the whole series then
formed will be in G.P, then there are( n+2) terms in the series including the first term
‘a’ and last term ‘b’.
⇒ b is the (n + 2)th term
⇒ b = arn+1
⇒ r = (b/a)1/(n+1)
Therefore the required means between a and b are
ar, ar2 ...arn where r = (b/a)1/(n+1)
Kth G.M. = a (b/a)K/(n+1)
Product of n GMs = (ab)n/2
25
Foundations of Quantitative Methods
Example 2.33
Insert four geometric means between 1/5 and 1 / 15625.
Solution:
Given that there are six terms in the series and the 1st term is 1/5 and last term is
1/15625. Let r denote the common ratio.
∴ 6th term = ar5
⇒ 1/15625 = ar5
⇒ 1/15625 = 1/5 (r)5
⇒ r5 = 1/3125
⇒ r = 1/5
∴ The means are 1/25, 1/125, 1/625, 1/3125.
If ‘r’<1, then greater the value of ‘n’, smaller will be the value of rn and hence arn/(1-r)
tends to 0. Therefore for a series where n tends to infinity, the value arn/1-r is
considered to be negligible and the sum of the series is given by
a
Sn =
(1 − r )
Harmonic Progression
Quantities are said to be in harmonic progression (abbreviated H.P.) if the series
obtained by taking reciprocals of the corresponding terms of the given series is an
arithmetic progression.
For example, the series 1, 1/4, 1/7, 1/10,... is an H.P. since the series obtained by taking
reciprocals of its corresponding terms i.e. 1, 4, 7, 10, ..... is an A.P.
A general H.P. is 1 / a , 1 / ( a + d ) , 1 ( a + 2d ) ,......
nth term of an H.P. = 1 / [ a + ( n –1 ) d ]
Example 2.34
The 7th term of an H.P. is 1/10 and 12th term is 1/25. Find the 20th term, and the nth
term.
Solution
Let the H.P. be 1 / a , 1 / ( a + d ) , 1 ( a + 2d ) , ...
The 7th term = 1 / (a + 6d) = 1 / 10 => a + 6 d = 10
The 12th term = 1 / (a + 11 d) = 1 / 25 => a + 11 d = 25
Solving these two equations, a = -8, d = 3
Hence 20th term = 1 / ( a + 19d ) = 1 / [ -8 + 19 ( 3 ) ] = 1 / 49
and nth term = 1 / [ a + ( n - 1) d ] = 1 / [ -8 + ( n – 1 ) 3] = 1 / [ 3n – 11 ]
Example 2.35
Prove that three quantities a, b, c are in A.P., G.P., or H.P. if
(a-b) / (b-c) = 1, a / b or a / c respectively.
Solution
a, b, c are in A.P. if b -a = c -b i.e.
if (a-b) / (b-c) = 1 = a/a
Also a, b, c are in G.P.
26
Basics of Mathematics
The harmonic mean is never larger than the geometric mean or the arithmetic mean. If
A.M., G.M. and H.M. are the arithmetic mean, geometric mean and harmonic mean of
a set of numbers respectively,
A.M. ≥ G.M. ≥ H.M.
In certain situations, the harmonic mean provides the correct notion of "average". For
instance, if for half the distance of a trip you travel at 40 miles per hour and for the
other half of the distance you travel at 60 miles per hour (i.e. less time), then your
average speed for the trip is given by the harmonic mean of 40 and 60, which is 48. it
is discussed in detail in Chapter 5.
Example 2.36
The Harmonic mean of a series 1/8, 1/12 and 1/16 is
Solution:
n
Harmonic mean is given by H =
1 1 1 1
+ + ............. +
a1 a 2 a 3 an
3 48 144
= = 3× =
1 1 1 13 13
+ +
8 12 16
FUNCTIONS
Many questions in management are studied by constructing mathematical models of
the underlying phenomenon. We may regard a model as a collection of relationships
among economic variables. Functions are a convenient way of stating such
relationships. The purpose of this section is to introduce some elementary
mathematical concepts and techniques that are employed in business analysis.
27
Foundations of Quantitative Methods
Linear Functions
Linear functions are of the form:
y = a0+a1x1+a2x2+........+anxn
where a0, a1, a2... an are constants and x1, x2 ...xn are variables.
In two dimensional space, a linear function is a straight line and is usually written as y
= a + mx.
The Y-intercept is ‘a’, and the slope of the line is ‘m’.
28
Basics of Mathematics
If m > 0: As x increases, y increases. The line rises from left to right. This is illustrated
in the Figure 2.1
If m < 0: As x increases, y decreases. The line falls from left to right.
If m = 0: y = a, is a line parallel to the X-axis.
If m = ]: x = a, a line parallel to y-axis.
y y
m negative
m positive
x x
negative slope
positive slope
y
y
m undefined
m=0
x x
undefined slope
zero slope
29
Foundations of Quantitative Methods
Solution:
If we assume that the 4 points (20,160), (23,172), (26,184) and (29,196) will fall on a
straight line, then we may compute ‘a’ and ‘m’ using any pair of points.
m = (172-160)/ (23-20) = 4, and
a = 160 - (4 × 20) = 80, and
y = 80 + 4x
One may verify that all 4 points fall on the line. For example, consider the point
(29,196).
80 + (4×29) = 196.
Inverse Functions
We have seen that if y is a function of x, then for each given value of x, we can
determine uniquely the value of y as per the functional relationship. For some
functions, it is possible to express x in terms of y so that, given the value of y, the value
of x can be uniquely determined. The function that expresses the variable x in terms of
the variable y is called the inverse function and is denoted by x = f-1(y).
If y = f (x) = 4x + 5
Solving x in terms of y
y−5
x= = f −1 ( y )
4
Exponential Functions
The exponential functions are useful for describing topics like compound interest and
growth. The exponential function is defined as:
y = m.ax
where ‘m’ and ‘a’ are constants with ‘a’ (called the base) which takes only a positive
value and ‘x’ is an independent variable.
The exponential curve rises to the right for a > 1 and m > 0 and rises to left for a < 1
and m > 0. If x takes only positive integral values (1, 2, 3...), y = max is the xth term in a
Geometric Progression.
Logarithmic Functions
If x = by where both b > 0 and x > 0, then we define y = logbx, which is read as “y is
the log of x to the base b”. So a log is simply the inverse function of an exponential
function. Therefore, the domain of any logarithmic function is the set of positive
numbers since the range of any exponential function is the set of positive numbers.
The exponential function and logarithmic function are inverse functions of each other,
since
y = logbx if and only if x = by.
If y = log10x = f(x), then x = 10y and vice-versa.
If x = ey then y = lnx and vice-versa.
SIMULTANEOUS EQUATIONS
Many models are formulated as a set of equations involving a number of variables. The
simplest of these are simultaneous linear equations. The idea is to find one or more set
of values for the variables which will satisfy the system of equations.
30
Basics of Mathematics
Example
Find the equilibrium point if the supply and demand functions are:
S(x) = 2x + 1 and D (x) = 14/(x + 2).
Here x denotes the price of the good, S(x) describes the quantity that the producers are
willing to supply at the given price and D(x) describes the consumer demand at the
given price. The equilibrium point is the price at which supply equals demand.
That is, 2x + 1 = 14/ (x +2)
or, 2x2 + 5x – 12 = 0
− b ± b 2 − 4ac
x=
2a
− 5 ± 5 2 − 4(2)(−12)
=x =
2(2)
− 5 ± 25 + 96
=x =
4
x=6/4 or – 16/4.
Since the price cannot be negative, x = 1.5. One may verify that this value of x satisfies
both the demand and the supply functions.
31
Foundations of Quantitative Methods
TABLE 0
EQUATION X Y Z RHS
1 2 -1 -1 2
2 1 1 1 1
3 1 1 2 1
The idea is to progressively introduce 1 in the diagonal cells (Row 1 and Column 1,
Row 2 and Column 2, Row 3 and Column 3) and 0s elsewhere using only the
following operations
• Multiply a row by any constant
• Add a multiple of any row to any other row
We introduce 1 in Cell (1,1) and 0s in the remaining cells of column 1 by
Multiplying Row 1 by 0.5 and then subtracting the modified Row 1 from Rows 2 and
3. That is,
Row 1: Row 1 * 0.5
Row 2: Row 2 + (-1 * Row 1)
Row 3: Row 3 + (-1 * Row 1)
TABLE 1
EQUATION X Y Z RHS
1 1.0 -0.5 -0.5 1.0
2 0.0 1.5 1.5 0.0
3 0.0 1.5 2.5 0.0
Now we want to introduce 1 in Cell (2,2) and 0s in the other cells of Column 2. To
achieve this, we need to multiply Row 2 by (1/1.5). Follow this by adding 0.5 times
Row 2 to Row 1, and adding -1.5 times Row 2 to Row 3.
TABLE 2
EQUATION X Y Z RHS
1 1.0 0.0 0.0 1.0
2 0.0 1.0 1.0 0.0
3 0.0 0.0 1.0 0.0
We now introduce 1 in Cell (3, 3) and 0s elsewhere in Column 3. But note that Cell (3,
3) is 1 and Cell (1, 3) is 0 already. To achieve our objective, we need Cell (2, 3) = 0.
This is done by adding -1 * Row 3 to Row 2.
TABLE 3
EQUATION X Y Z RHS
1 1.0 0.0 0.0 1.0
2 0.0 1.0 0.0 0.0
3 0.0 0.0 1.0 0.0
Since we cannot go any further in the diagonalization process, let us construct the
equations implied by Table 3.
32
Basics of Mathematics
1x + 0y + 0z = 1.0
0x + 1y + 0z = 0.0
0x + 0y + 1z = 0.0
One solution is immediate: x = 1, y = 0 and z = 0
NOTE
A system of solutions may have
• Exactly one solution
• More than one solution
Our method will generate one of the solutions or imply that there is no solution.
Example
Solve
x+y=2
-2x – 2y = 3
TABLE 0
ROW X Y RHS
1 1 1 2
2 -2 -2 3
33
Foundations of Quantitative Methods
TABLE 1
EQUATION X Y RHS
1 2.0 3.0 7.0
2 3.0 7.0 13.0
3 3.0 2.0 8.0
TABLE 2
EQUATION X Y RHS
1 1.0 1.5 3.5
2 0.0 2.5 2.5
3 0.0 -2.5 -2.5
TABLE 3
EQUATION X Y RHS
1 1.0 0.0 2.0
2 0.0 1.0 1.0
3 0.0 0.0 0.0
The solution can now be read off table 3.
x = 2 and y = 1
That is, produce 2 batches of Minis and 1 batch of Jumbos.
SUMMARY
34
Chapter 3
Calculus
In this chapter we will discuss:
• Limits
• Derivatives
• Optimization
• Integral Calculus
Foundations of Quantitative Methods
LIMITS
The concept of a limit is fundamental in calculus. Often, we are interested in
knowing the behavior of f(x) as the independent variable x approaches some
particular point ‘a’. The question arises, that if we give values to x which are nearer
and nearer to ‘a’, will the values of f(x) come nearer and nearer to any particular
value? Suppose we define a function f(x) as:
f(x) = 2x
It can be seen that as we give values to x which are nearer and nearer to 0, then the
value of f(x) also comes nearer and nearer to 0.
If as x approaches a value ‘a’, f(x) approaches some number L, then we say that the
limit of f(x) as x approaches a is L. This is symbolically written as
lim f(x) = L
x→a
x → a is to be read as ‘x approaches a’ or ‘x tends to a’
Sometimes we may allow x to take values which are larger and larger, without any
limit. This is symbolically written as x → ∞ (read as ‘x approaches infinity’). If f(x)
approaches a limit L as x → ∞ , then we write
lim f(x) = L
x→∞
In some cases, it may so happen that as x approaches a value, then the value of the
function f(x) may become larger and larger without any limit. This is symbolically
written as:
lim f (x ) = ∞
x→L
Example
Suppose f(x) = 2x2 – 1
As x approaches value 1, f(x) approaches the value 1,
lim f(x) = 1
x →1
This is graphically represented below.
1
x
-1
36
Calculus
Example
Let f(x) = (x+1)/x.
From the table and graph given below, we see that f(x) approaches 1 x → ∞ as but
never equals 1.
x y = f(x) = (x+1)/x
1 2
10 1.1
100 1.01
1000 1.001
10,00,000 1.000001
1
x
x +1
Thus lim =1
x
x→∞
Example
1
Suppose f(x) = , what will happen to f(x) as x approaches 1? The table and
(x − 1)2
graph for this function are given below.
y
x y=f(x)=1/(x-1)2
0 1
1/2 4
2/3 9
4/5 25
9/10 100
999/1000 10,00,000
x
1
37
Foundations of Quantitative Methods
DERIVATIVES
The rate of change in the value of a function is useful in studying the behavior of a
function. The change in y for a unit change in x is referred to as the derivative of y
with respect to x. In finance and economics, the rate of change is termed marginal or
incremental. For example, the marginal cost of capital is the rate of change of the
total cost of capital per unit change in new capital raised.
The idea of the derivative as the rate of change of the function at a fixed point has a
geometrical foundation. The slope of the tangent to the function at the point equals
the derivative at the point.
d df
The derivative is usually denoted by of f(x) or or f′(x) It may be noted that
dx dx
the derivative itself is a function, and the value of the derivative depends on where it
is evaluated.
The derivative of a function f(x) at point ‘a’ is defined as:
f ( x ) − f (a )
lim
x−a
x→a
The process of getting the derivatives is called ‘differentiating’ a function. There are
some rules of differentiation which are given below:
Rules of Differentiation
a. If f(x) = c, where c = a constant, then f ′( x ) = 0 . Slope of a horizontal line is
zero, since by definition the value of y does not change when the value of x
changes.
n −1
b. If f ( x ) = x n then f ′( x ) = n. x
c. Let f(x) be a product of two functions, say, g(x) and h(x) so that
f(x)=g(x).h(x), then
f ′( x ) = g ′( x ).h ( x ) + g ( x )h ′( x )
d. If f(x) is a quotient of two functions, say, g(x) and h(x) so that
g(x)
f(x) = , where h(x) ≠ 0
h(x)
g ′(x)h(x) − g(x)h ′(x)
Then f ′(x ) =
[
h (x )
2
]
e. If f(x) = c.g (x) where c is a constant, then,
f ′(x ) = c.g ′(x )
f. If f (x ) = g (x ) + h (x ), then
f ′(x ) = g ′( x ) + h ′(x )
⎛1⎞
g. If f (x ) = In x, then f ′(x ) = ⎜ ⎟
⎝x⎠
g (x ) g (x )
h. If f(x) =e , then f ′(x) = g ′(x).e
g ′(x )
i. If f(x) =In g(x), then f ′(x ) =
g(x )
38
Calculus
Examples
f ( x ) = 4x 4 − 2x 2 + 7 x + 5
a f ′(x ) = 4
d x4 ( )
− 2.
d x2
+ 7.
( )
d(x ) d(5)
+
dx dx dx dx
( )
= 4 4 x − 2(2x ) + 7(1) + 0
3
= 16 x 3 − 4 x + 7
b (
f(x) = x 3 − 4 x 2 + 1 )( )
f ′(x) =
(
d x3 − 4 2 )(
x + 1 + x3 − 4 )(
d x2 +1
)( )
dx dx
= 3x 2 (x 2 + 1) + x 3 − 4 ( 2
d x +1
)( )
dx
= 3x (x + 1) + x − 4 (2x )
2 2 3 ( )
= 5x 4 + 3x 2 − 8x
f ( x ) = x −3
c
3
f ′( x ) = −3x − 4 = −
x4
2x 3 + x + 3
d f (x ) =
x2 +1
(x 2
+1) dxd (2x 3
) (
+ x + 3 − 2x 3 + x + 3 ) dxd (x 2
)
+1
f ′(x ) =
( x 2 + 1) 2
2x + 1
y =
x+2
2x + 1
Let y = μ where μ =
x+2
d
dμ
(x + 2) (2x + 1) − (2x + 1) d (x + 2)
= dx dx
dx (x + 2) 2
=
(x + 2)(2) − (2x + 1)(1)
(x + 2)2
40
Calculus
2x + 4 − 2x − 1 3
= =
(x + 2) 2
(x + 2)2
1
− 1
dy / dμ = 1 / 2.μ 2 = 1 / 2.
μ
3 1 1
dy / dx =
2
(x + 2) 2 μ
3 1 x+2
dy /dx =
2 2 2x + 1
(x + 2)
3 x+2 1
=
2 2 x + 1 ( x + 2) 2
Higher-Order Derivatives
It can be seen that the derivative of a function is also a function.
Considering f′(x) as a function of x, we can take the derivative of f′(x). which will
yield another function; say f”(x). This is called the second derivative of f (x). The
third derivative of f(x) is the derivative of the second derivative f′(x). We can take
derivatives of higher orders by repeating the process. The second, third, fourth, etc.
derivatives are symbolically written as;
2
d f
f ″ (x) or
2
dx
3
d f
f ″′ (x) or
3
dx
4
(4) d f
f (x) or
4
dx
n
(n) d f
f (x) or n
dx
Example
Let f(x) = 3x 3 + 2x 2 − 4x + 6
f ′(x) = 9x 2 + 4x − 4
f ″(x) = 18x + 4
f ″ (x) = 18
(4)
f (x) =0
(5)
f (x) =0
...........................
(n)
f (x) =0
41
Foundations of Quantitative Methods
Example
Let y = x 3 − 9x 2 + 9x
What is the value of d 2 y/dx 2 for x = 1?
y = x 3 − 9x 2 + 9x
dy/dx = 3x 2 − 18x + 9
d 2 y / dx 2 = 6x − 18
d 2 y/dx 2 (at x = 1) = 6 ×1 − 18 = −12
Remark
Of all the higher order derivatives, the second order derivative is of special interest
especially in optimization.
The first derivative of a function, f′(x), is the slope of the function f(x), or the rate of
change in the value of f(x) per unit change in x. Similarly the second derivative, f″(x)
is the slope of the function f′(x), or the rate of change in the value of f′(x) per unit
change in the value of x, i.e., it is the rate of change of the original function f(x).
Definition
A function f(x) is said to be monotonically increasing if its first derivative, f′(x) is
positive for all values of the independent variable x. It is said to be monotonically
decreasing if f′(x) is negative for all values of x.
Examples
a. Consider the linear demand function
p = 100-10q, where
p = price per unit sold (average revenue),
q = quantity sold
The total revenue function, T (q), is obtained as p*q.
Thus T(q)=(100-10q)q=100q-10q2
The first derivative gives the relationship of the change in total revenue per unit
change in quantity sold. This is called the marginal revenue, MR.
dT(q)
MR = = 100 − 20q
dq
The changes in marginal revenue for changes in quantity sold can be obtained from
the slope of the marginal revenue curve. This is the second derivative of T(q).
d
(MR ) = d 2T(q ) = −20
dq dq 2
The marginal revenue declines at a constant rate of 20, per unit increase in quantity
sold. The relationships are shown below.
b. In developing the theory of investor choice under uncertainty, cardinal utility
functions, U(R) are used. These utility functions should have the property,
U′(R)>0 and U″ (R) < 0. Check this property for the following utility functions:
42
Calculus
T (q) MR or P
0 5 10
0 5 10 q
f (x)
f (c)
f (a)
+ + d
a - x
-
f (b)
f (d)
43
Foundations of Quantitative Methods
x = a, if the value of the function at a, f(a) is greater than or equal to f(x) for values of
x sufficiently close to value a. Look at the graph of a function f(x) given below.
This function has a relative maxima at point x = a and at x = c, and a relative minima
at points x = b and x = d. If we consider values of x in the range 0 to d, then global
maxima of the function is f(c) attained at point x = c.
The following characteristic features of the relative maxima and minima could be
observed:
i. For values of x which are sufficiently close to a (or c), the value of the function
first increases and then decreases. The slope of the function is positive up to a
(or c) and then it becomes negative. For points b and d, the slope changes from
negative to positive.
ii. The tangent lines at points a, b, c and d are horizontal, indicating that the slopes
at those points are zero.
From the above observations, the following results could be obtained:
Result 1: if f(x) has a relative maxima or minima at x = a, then the value of the
derivative of f(x) at a, f′(a) is 0.
Note that the converse may not be true. If at a point a, f′(a) = 0, it does not
imply that f (x) has a relative maxima or minima.
Result 2: If, at point a, we have f′(a) = 0 and f″ (a) <0 then f(x) has a relative
maxima at x = a.
If f′(a) = 0 and f″ (a) > 0 then f(x) has a relative minima at x = a.
This is because at relative maxima (or minima) the slope changes from
positive (or negative) to negative (or positive) for points x close to a.
If f′(a) = 0 and f″(a) = 0 then we cannot say anything definitely about the
occurrence of maxima or minima at point a.
Example
A manufacturer fixed the following demand curve:
P(x) = 200-3x
Where x = quantity and p(x) = price at that quantity.
Let the fixed cost of production be Rs. 25 and the variable cost Rs. 2 per unit. The
manufacturer wants to fix the level of output so as to maximize his profit.
The profit function f(x) is:
f(x) = total revenue – total cost
= (200-3x) x – (25 +2x)
= -3x2 +198x – 25
The first order derivative is
f′(x) = - 6x +198
198
This is 0 if – 6x + 198 = 0 or x = = 33
6
At this level of output, i.e. x = 33, we would like to examine whether we have a
minimum or maximum. The second order derivative is:
f″(x) = -6 which is negative for all values of x
Therefore f″(a) < 0.
44
Calculus
Hence, we conclude that at the level of production of 33 units, the manufacturer will
make a maximum profit.
The maximum profit is:
2
- 3 × (33) + 198 × 33 − 25 = Rs. 3,242.
Suppose, the quantity that could be produced is limited to the range, say, 10 to 30
units, then we have to check the value of the profit at the boundary points 10 and 30,
in addition to any other points in the range (10,30) and then find the optimal level of
production.
Partial Derivatives
So far we have considered differentiation of functions of one independent variable. In
many situations, we come across functions with more than one independent variable.
Since the value of the function is influenced by each independent variable, the rate of
change in the value of the function relative to the change in one independent variable
can be studied by holding the other independent variables constant. Let z = f(x, y).
The change in z for changes in x can be obtained by holding y constant. This is the
basic idea behind partial differentiation. The rules for partial differentiation and
ordinary differentiation are exactly the same except that when the partial derivative
of one independent variable is taken, the other independent variables are treated as
constant. The partial derivatives of a function f(x, y) are symbolically represented by
∂f ∂f
, to indicate the partial derivative with respect to x and the partial derivative
∂x ∂y
with respect to y.
Example
Let w(x, y, z) = xy + yz + zx
∂w
then, =y + 0+ z = y + z
∂x
∂w
= x + z + 0= x+ z
∂y
∂w
=0+ y + x= y+ x
∂z
NOTE
If a function has two independent variables x and y so that
Z = f(x, y)
Then, the condition for relative maximum or minimum is:
∂f ∂f
= 0, = 0,
∂x ∂y
2
ii ∂ 2f ∂ 2f ⎡ ∂ 2f ⎤
<⎢ ⎥ for maximum
∂x 2 ∂y 2 ⎢⎣ ∂x∂y ⎦⎥
2
∂ 2f ∂ 2f ⎡ ∂ 2f ⎤
>⎢ ⎥ for minimum
∂x 2 ∂y 2 ⎢⎣ ∂x∂y ⎥⎦
45
Foundations of Quantitative Methods
INTEGRAL CALCULUS
This branch of mathematics concerns itself with an operator called the integral
∫ f(x) dx is called the indefinite integral of f(x)
b
∫ f (x)dx is called the definite integral of f(x).
a
Indefinite Integral
We may regard the indefinite integral as an inverse operator of differentiation.
d
Therefore ∫ f(x)dx = F(x) if F(x) = f(x)
dx
Note that
d d
F(x) = f(x) implies [F(x) + C] = f(x) for any constant C.
dx dx
Example
2
x
F(x) =
1 2
2
x
F′(x) = x ⇒ ∫ xdx =
2
2x
2 F(x) = e
2x 2x 2x
F′(x) = 2e ⇒ ∫ 2e dx = e
F(x) = In (x)
1 1
3 F′(x) = ⇒∫ dx = In x
x x
4 x2
F(x) = + 10
2
x2
F′(x) = x ⇒ ∫ xdx = + 10
2
5 F(x) = e 2x + 3
F′(x) = 2e 2x ⇒ ∫ 2e 2x dx = e 2x + 3
6 F(x) = In (3x) = In 3 + In x
1 1
F′(x) = ⇒∫ dx = In x + In 3
x x
∫ f(x) dx = F(x) + C
where, C is any constant,
and,
d
F(x) = f(x)
dx
46
Chapter 4
Arranging Data
In this chapter we will discuss:
• Meaning of Data
• Types of Data
• Data Collection
• Tables and Graphs as Data Presentation Devices
• Frequency Distribution
Foundations of Quantitative Methods
In organizations managerial decisions are based on vast primary and secondary data.
The raw data available to the managers is mostly in an unorganized form. The first task
of the manager in decision making process is to get the data (figures) properly
organized, classified, and presented in a meaningful manner in order to facilitate
decision making. Presenting raw data in a meaningful way requires two steps to be
carried out: classification of data and tabulation of data. In this chapter we will discuss
the objectives of classification and various devices of data presentation like tables,
graphs, frequency distribution, histograms, frequency polygons, and cumulative
frequency curves.
MEANING OF DATA
TYPES OF DATA
Managers rely on data for decision making. This data can be broadly grouped into two
categories:
• Published data (data that is already collected and published) and,
• Unpublished data (data that is yet to be collected or printed).
Published data can be obtained in the form of bulletins, reports etc. from various
Government agencies. For example, data relating to monetary and banking activities
(industrial production index, price index, etc.) can be obtained from Reserve Bank
bulletins and finance reports published by Reserve Bank of India every month. Apart
from government agencies, many research organizations and private sources provide
useful data to managers.
In some cases, published data may not be available to managers to make decisions. In
such cases, managers have to go in for first hand data collection by the way of sample
survey or a census. The information may be obtained by using various methods like
observation, personal interview or questionnaires. The data can also be grouped as
primary or secondary data. The first hand data that is collected by the researcher
personally is primary data and that collected from other available sources (collected by
others) is known as secondary data. Large volumes of data are available within every
organization in the form of financial statements, sales reports, cash flow data,
production schedules, budgets, etc. Managers can make decisions based on this data in
the light of data relating to the industry and economy as a whole.
48
Arranging Data
Table 4.1 shows the output levels for 30 production runs. The information a manager
gets from the table is an example of raw data. Managers can derive useful inferences
from this raw data by applying various statistical methods. For instance managers can
use this above raw data to find a pattern in production output and also the average out
put per production run.
DATA COLLECTION
Managers select the samples in such a way that all related cross sections of the
population are represented in the data. Population is the entire collection of entities that
a manager is trying to study and sample is a fraction of the population that represents
the entire population in its characteristics proportionately.
For example, when a magazine conducts an opinion poll among 5000 individuals from
all over India, with a view to know the general opinion of Indians towards politicians,
then all Indians is the target population and 5000 individuals represent the population
and it is referred to as a sample.
To estimate the potential market for a new innovation, for example, managers in
research department may study 500 consumers in a particular territory. The managers
make it sure that this sample contains the consumers belonging to all cross sections
(income, religion, education, and locality) of the society.
Managers collect relevant data from either first hand observations or from already
existing records. For example, a welfare manager gets the details about the families of
the employees either from the records or from a fresh survey. This information is then
arranged to produce data that enables easy decision making. When data is arranged in a
crisp, usable form; managers can take reliable information from the available internal
data and the external environment to make intelligent decisions.
Before using the data, managers have to test the data for its adequacy and reliability, as
it influences the quality of the final decision. Managers can pose the following
questions to test the validity of the data:
1. Where does the data originate from?
2. Is the source reliable?
3. Does the data support or contradict the previous decisions?
4. Are the conclusions derived from the data?
5. What is the size of the sample? Does it represent the entire population under
consideration for decision making?
When the manager gets the answers to all the above questions, he can easily improve
the quality of his decisions. The quality of the data collected depends on the sample
selected from the entire population.
49
Foundations of Quantitative Methods
every person living in India. In the same way organizations undertake census of the
employees’ families to collect the information that helps designing of welfare schemes.
Sample method is a method in which enumeration of a part of the population or
universe is taken up and information is gathered regarding this selected part. This is
commonly adopted in organizations to check the quality of the finished products of a
manufacturing unit.
Managers choose one of the above data collection methods depending on the factors
like purpose of the enquiry, time available for making a decision, budget allocation,
and the accuracy of data required for decision making.
Classification of Data
Classification is the process of arranging data into sequences and groups according to
their common characteristics. For example data collected in a consumer survey can be
classified along characteristics like age, gender, education, income, etc.
Some common methods of classification are
• Geographical, i.e. area-wise or region-wise
• Chronological,
• Qualitative, i.e., depending on characteristics
• By magnitude.
For example, the data relating to profits of a XYZ company for the past 10 years can be
classified chronologically as shown in Table 4.2.
Table 4.2: Net Profits of XYZ Company (in $ millions)
Year 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000
Profit 191 154 90 -10 80 200 246 310 360 390
Data collected by various methods can be presented in the form of tables and graphs. In
tables the data is classified based on time of observation, magnitude or some other
characteristics of the variable. On the other hand graphs are pictorial representation of
the data.
Table 4.3: Sales data of Alpha Electronics, for the Month of May and June
2000 (in lakhs Rs.)
Territory (Caption)
T.V. 11 14 8 10 20 16 16 14
Audio 8 6 14 20 12 8 10 12
51
Foundations of Quantitative Methods
Line charts
Line chart is one of the effective graphical methods to depict the trend in a data. For
example, a manager can easily notice the trend in the sales data when presented in a
graph than in a tabular form. Figure 4.1 shows the line chart for the data depicted in
Table 4.4.
Table 4.4: Sales of Mars Corp. Ltd. 1990-1999 (in lakhs Rs.)
Year Sales
1990 300
1991 360
1992 330
1993 390
1994 400
1995 390
1996 420
1997 460
1998 460
1999 480
550
500
450
400
350
Sales
300
250
200
150
100
50
0
1990 1991 1992 1993 1994 1995 1996 1997 1998 1999
Year
Bar charts
Business and statistical data can be presented through bar charts. Bar charts use
rectangles to present the data. These diagrams are one-dimensional because the
magnitude of the data is represented by length of the bar. Bar charts consist of a group
of equidistant rectangular bars, each representing one class interval of a given data in
the table. The thickness or width of the bar has no relevance, only the length of the bar
52
Arranging Data
is important. Bar charts are the easiest and most flexible general purpose charts used
for data representation.
A bar chart can be drawn in different forms: vertical or horizontal. In horizontal charts
the bars originate from the Y-axis but for vertical bar charts they originate from the X-
axis
Guidelines for constructing a bar chart are as follows:
1. Length of the bars should be proportionate to the data they represent.
2. All bars should rise from the same base line.
3. Uniformity in the width of the bars need not be maintained as width is irrelevant in
bar charts.
4. The scale should be selected depending on the highest value in the table.
5. The gaps between the bars should be uniform.
6. The bars should be arranged from left to right.
7. It is desirable to mention the (magnitude) figure on the bar to enable the manager
to get the magnitude at a glance.
Now consider the data relating to sales of Mars Corp. is given in Table 4.4. When we
represent the sales values of last five years (1995 to 99) in a bar chart it appears as in
Figure 4.2.
600
460 460 480
420
Sales
400 390
200
0
1995 1996 1997 1998 1999
Year
Multiple bar diagram or compound bar diagrams are used to compare two or more sets
of related data. This diagram is similar to the simple bar diagram, but bars in each set
are placed together and gap is left between each set of bars. Table 4.5 shows the export
and import value of Bharat Petroleum Corporation Limited (BPCL) for 4 consecutive
years between 1995 and 1998. Figure 4.3 is compound bar chart showing the export,
import values of BPCL.
53
Foundations of Quantitative Methods
Figure 4.3: Compound Bar Chart Showing the Export and Import
Values of BPCL
5000
Export/Import
4000
3000 Export
2000 Import
1000
0
1995 1996 1997 1998
Year
Pie diagrams
Pie diagram is a circle divided into various segments and each segment represents the
percentage contribution of various components to the total. For drawing a pie diagram
it is necessary to express the value of each category as a percentage of the total. 3600 in
a circle represent the whole (i.e., 100%) and 3.60 constitute 1% of the total. Managers
use pie diagrams to compare many components simultaneously.
Table 4.6 depicts the data on share holding pattern in BPCL. Figure 4.4 presents the
same in the form of a pie diagram.
PIE CHART
4% Indian Promoters
10%
IND'I /MF's
MFs
20% FIIS
FIIs
Public
66%
Pictogram
Pictograms represent the data in the form of pictures. The data is presented using
appropriate pictures and their sizes indicate the magnitude of the data.
54
Arranging Data
Scatter diagram
Scatter diagram is used to study the correlation between two dependent variables.
Production manager can study the relation between the cost and the level of production
to find the optimal production level. Figure 4.5 shows the scatter diagram showing
relationship between cost and volume. The cost of production is plotted on the graph at
various levels of production, the obtained points are scattered and hence it is known as
scattered diagram. When the points on the graph follow a pattern, it indicates high
correlation and irregular pattern or behavior indicates low correlation.
4.00
3.75
3.50
3.25
Cost
3.00
2.75
2.50
2.25
2.00 X
50 55 60 65 70 75 80 85 90 95
Volume of Production
FREQUENCY DISTRIBUTION
The table in which raw data is tabulated by dividing it into classes of convenient size
and computing the number of data elements (or their fraction out of the total) falling
within each pair of class boundary is called a frequency distribution table. Frequency
distribution has two parts, the left of the table has sizes or magnitude of values and on
its right the number of times a value or a group of values are repeated.
The raw data relating to the ages of 50 employees of a department are as given in Table
4.7
Table 4.7: Ages of 50 Employees
56 30 36 28 43 52 25 29 38 40
48 57 33 29 32 48 46 49 32 31
39 28 52 21 47 28 39 44 54 30
22 33 48 39 56 27 54 28 36 32
40 46 26 35 45 55 28 31 43 46
As the ages in the table are in a arbitrary manner it is very difficult for the human
resources manager to grasp any trend from the raw data. Logical arrangement of data is
necessary to compress the data and help the manager know to which age groups an
employee belongs and what is the frequency of each age group.
55
Foundations of Quantitative Methods
Histograms
A histogram is a series of rectangles, the width of each being proportional to the range
of values within a class and height being proportional to the number of items falling in
the class. The widths of the bars are uniform when the widths of classes in a frequency
distribution are equal. And the length of the bar is proportionate to the number of data
elements in the class it represents.
56
Arranging Data
10
Frequency
4
2 10 9 7 5 9 4 4
2
57
Foundations of Quantitative Methods
When the cumulative frequencies are plotted on a graph we get an Ogive. A less-than
Ogive of the distribution of ages of 50 employees is shown in Figure 4.7.
The points on the graph represent the number of employees having less age than the
number of years shown on the X-axis.
When an Ogive is drawn for a cumulative (more than) frequency distribution the graph
would slope down and to the right, instead of up as shown in Figure 4.7.
60
50
40
Cumulative
30
Frequency
20
10
0
Age < 30 35 40 45 50 55 60
25
58
Arranging Data
A curve is said to be symmetrical when a vertical line drawn from the center of the
curve to the X-axis divides the area under the curve into equal parts. A symmetric
curve is shown in Figure 4.8.
A curve is said to be skewed when the values in the frequency distribution are
concentrated more towards the left or right side of the curve i.e. the values are not
equally distributed from the center of the curve. A curve is said to be positively skewed
when the tail of the curve is more stretched towards the right side. It is said to be
negatively skewed when the tail is more stretched towards the left side. The skewed
curves are shown in Figure 4.9.
Kurtosis is the degree of peakness of a distribution of points i.e. Kurtosis measures the
peakedness of a distribution. Two curves with same central location and dispersion
may have different degrees of kurtosis (See Figure 4.10).
Curve A Curve B:
59
Foundations of Quantitative Methods
SUMMARY
Data is a collection of related observations, facts or figures. This data can be broadly
grouped into two categories: Published data and unpublished data. Managers use
sampling methods to draw inferences about the data. Population is the entire collection
of entities that a manager is trying to study and sample is a fraction of the population
that represents the entire population in its characteristics proportionately. Data
collected by various methods can be presented in the form of tables and graphs. In
tables, the data is classified based on time of observation, magnitude or some other
characteristics of the variable. On the other hand graphs are pictorial representation of
the data. Several types of graphs or charts are used to present data. Some of them are
line charts, bar charts, pictograms, pie charts and scatter diagrams. This chapter also
discussed how frequency tables can be used to represent data.
60
Chapter 5
Measures of Central
Tendency
In this chapter we will discuss:
• Objectives of Averaging
• Requisites of a Good Average
• Types of Averages
• Mathematical Averages
• The Median
• The Mode
Foundations of Quantitative Methods
The most important objective of a statistical analysis is to calculate a single value that
represents the characteristics of the entire available raw data. This single value
representing the entire data is called the ‘central value’ or an ‘average’. This value is
the point around which all the other values of the data cluster. Therefore it is known as
measure of location and since this value is located at a central point nearest to other
values of the data it is also known as measure of central tendency. This chapter
discusses various measures of central tendency like mean, median and mode and their
use in day to day management activities. For example, the mean sales of a territory
give a rough idea to the sales manager about the performance of the sales personnel in
that territory.
OBJECTIVES OF AVERAGING
TYPES OF AVERAGES
Averages
Geometric Harmonic
Arithmetic Median Mode
mean mean
mean (A.M.) (Md) (Mo)
(G.M.) (H.M.)
63
Foundations of Quantitative Methods
MATHEMATICAL AVERAGES
Arithmetic Mean
The arithmetic mean or mean is the most simple and frequently used average.
Arithmetic mean is represented by notation x (read x - bar).
Solution:
When a manager wants to know the average number of days a driver is on leave in 90
days, he can calculate the mean of the ungrouped data as follows:
x = ∑ x/n = 8 + 6 + 6 + 7 + 4 + 5 + 6 + 2 + 4 + 7
10
= 55/10
= 5.5 days per driver out of 90 days
In the above example, the mean is calculated by adding every observation separately,
in no set order. This is an ungrouped data. One can calculate the mean using the above
method for limited values. But the task becomes difficult while calculating average for
a vast data, say for 5000 employees. In such cases a frequency distribution of the data
will be helpful to a manager, and mean should be calculated using a different method.
universe. Say a Finance manager wants to find out the average monthly pay of 600
employees in an organization, and he is having a frequency distribution (shown in
Table 5.2). He can easily calculate an estimate of the value of the mean of this grouped
data. It is referred to as estimate as he need not use all the 600 observations. The
estimate of the value of mean will not be as accurate as the value obtained by
computing from all the 600 observations (i.e. by adding all the values)
65
Foundations of Quantitative Methods
= 8+6+ 6+ 7+ 4+5+ 6+ 2+ 4+ 7
10
= 55/10 = 5.5 days
Now let us calculate the appropriate mean for the same data presented in a frequency
distribution as follows:
The values of actual mean calculated from ungrouped data and appropriate mean
calculated from grouped data are equal in case of above example Table 5.1. But these
two values may not be equal all the time. There can be minor difference between the
means when the number of observations increases.
Short-cut method of computing arithmetic average for a grouped data (using
codes):
The Shortcut method for computing mean is very useful to managers when the number
of classes is more and the classes are of equal size.
Steps for calculating mean using shortcut method:
1. Locate an assumed mean A. Though any point in the grouped data can be selected
as mean A, for simplicity, it is selected from one of the centrally located classes.
Assign a ‘code value’ zero to the selected class.
2. Assign negative integers as codes to the values smaller than the class mark and
positive integers to larger values as follows:
Class 1-10 11-20 21-30 31-40 41-50 51-60 61-70
Code -3 -2 -1 0 1 2 3
↑
x
66
Measures of Central Tendency
3. Assign ‘x’ to represent the class mark that is assigned the code “zero”. The
following formula is used to determine the sample mean using codes:
x = x0 + w ∑ (u × f)/n
Where,
x = mean
x0 = value of the class mark assigned the code 0
w = numerical width of the class interval
u = code assigned to each class
f = frequency of the class (number of observations)
n = total number of observations in the sample.
Table 5.5 shows how to code the class marks and find the sample mean for the average
monthly income of employees.
Table 5.5: Average Monthly Income of an Employee
Class
Class (Rs) Code (u) Frequency (f) u×f
Marks (X)
(1) (3) (4) (3) × (4)
(2)
1000 – 2999 2000 -4 50 -200
3000 – 4999 4000 -3 110 -330
5000 – 6999 6000 -2 162 -324
7000 – 8999 8000 -1 100 -100
9000 – 10999 10000 0 83 0
11000 – 12999 12000 1 45 45
13000 – 14999 14000 2 25 50
15000 – 16999 16000 3 15 45
17000 – 18999 18000 4 8 32
19000 – 20999 20000 5 2 10
n = 600 ∑f = n = 600 ∑ u×f = -772
x = x0 + w ∑ (u × f)/n
=10,000 + 2,000 x (-772)/600
= 10,000 – 2573.33
= Rs.7426.66 ≈ 7427 (average monthly salary)
67
Foundations of Quantitative Methods
fluctuating values that are not far from other values of the group. Observe that if the
units produced in a day by 5 workers of a batch as in Table 5.6 the mean units
produced per day is
Table 5.6: Number of Units Produced by Workers in a Day
Worker 1 2 3 4 5
Units 23 22 24 21 5
µ = ∑ x/n = (23 +22 +24+21+5) / 5 = 19 units
When the mean units are calculated leaving the fifth worker (i.e. 5) the mean is 22.5
units. Thus, one extreme value ‘5’ has affected the mean. Hence, it is more appropriate
to calculate the mean excluding the extreme value in order to make it more
representative.
The second disadvantage is that it is very difficult to find the actual mean (using µ =
∑x/n). Third disadvantage is that we cannot calculate the mean for a data set with
open-ended classes at either end of the scale. A class that allows either the upper or
lower end of a quantitative classification scheme to be limitless is called as open-ended
class.
x =∑x/n
= (10 + 15 + 20)/3 = Rs. 15 / hour
When the above average wage per hour is taken to calculate the labor cost of one unit
of Product 1, the value would be
15 (2 + 3 + 5) = Rs. 150
And, labor cost for one unit of Product 2 is
15(6 + 2 + 1) = Rs. 135
But these values calculated using simple arithmetic average are incorrect as it does not
take into consideration the fact that different amounts of each class of labor are used.
The correct value (cost per one unit) can be determined in the following manner.
For Product 1, the total labor cost per unit
= (10×2) + (15×3) + (20×5)
= Rs. 165
and, cost per hour
= Rs. 165/(2 + 3 + 5)
=Rs. 16.5/hour
68
Measures of Central Tendency
69
Foundations of Quantitative Methods
Similarly for the second, third, fourth and fifth month is 200, 160 120 and 100
respectively.
70
Measures of Central Tendency
Geometric Mean
Managers often come across quantities that change over a period of time, and may
need to know the average rate of change over a period of time. Arithmetic mean is
inaccurate in tracing such a change. Hence a new measure of central tendency is
needed to calculate the change rate-the “Geometric Mean”.
= n x1 × x 2 × .......x n
Where,
x1, x2,… xn are termed as the growth factor and is equal to 1+ (rate/100)
GM = 5
1 . 07 × 1 . 08 × 1 . 10 × 1 . 12 × 1 . 18 = 1.1093
1.1093 is the average growth factor. The growth rate is calculated as
1.1093 – 1 = 0.1093
Then, 0.1093 x 100 = 10.93
So the growth rate is 10.93 percent per year.
Harmonic Mean
Harmonic mean is based on the reciprocals of numbers averaged. It is defined as the
reciprocal of the arithmetic mean of the reciprocal of the given individual observations.
Thus, by definition:
HM = N
1 1 1
+ + .... +
X1 X 2 Xn
Where, X1, X2, X3, etc. refer to various items of the variable and N refers to the total
number of items.
Example 5.6
Gopi walks from his house to the bank at the speed of 2 kmph, while returning from
the bank to the house, his speed was 3 kmph. Calculate the average speed for the whole
walk.
71
Foundations of Quantitative Methods
N
Average speed =
1 1
+
x1 x 2
2 12
= = = 2.4
1 1 5
+
2 3
Example 5.7
Matel Plastics Ltd. got a raw material delivery order from Blowplast Inc. However, the
condition was that the delivery had to be made within four hours, failing which the
order would be considered cancelled. Robert, the salesman at Matel, was assigned the
responsibility to make the delivery. Robert had to be careful not to exceed the 80 kmph
speed limit, otherwise he would be flouting the traffic rules. The marketing manager
asked him not to go below 60 kmph as there was a risk of the order being cancelled.
Robert divided his journey into four equal parts. He traveled the first quarter of the
distance at the speed of 50 kmph, the second quarter of the distance at 65 kmph and the
last quarter of the distance he covered at the speed at 55kmph.
He was successful in delivering the product on time. If his average speed was 60.5
kmph, what was his speed when he covered the third quarter of the distance?
Solution:
Assume that Robert covers the first, second, third and fourth quarter of the distance at
the speed of X1, X2, X3 and X4 respectively.
Let the average speed of Robert’s whole journey from Matel to Blowplast be Ho =
60.5.
From the given information in the problem, we have
X1 = 50
X2 = 65
X4 =55
Ho = 60.5 and N = 4.
After inserting the values in the formula for calculating the harmonic mean we get:
H= N
⎡ 1 1 1 ⎤
⎢ + + .... + ⎥
⎢⎣ X 1 X2 Xn ⎥⎦
60.5 = 4
⎡ 1 1 1 1 ⎤
⎢ + + + ⎥
⎣⎢ 50 65 X 3 55 ⎥⎦
⇒ 1 ⎡ 1
⎢ +
1
+
1
+
1 ⎤
⎥ = 1 ⎡⎢ 0.0535664 + 1 ⎤⎥ = ⎡ 1 ⎤
4 ⎢ X ⎥ ⎢ 60 . 5 ⎥
4 ⎢⎣ 50 65 X3 55 ⎥⎦ ⎣ 3⎦ ⎣ ⎦
⎡ 1 ⎤
⇒ 1 [0 .0535664 ] + 1
⎢ ⎥ = 0.0165289
4 4 ⎣⎢ X 3 ⎥⎦
⇒ 0.0133916 + 1 = 0.0165289
4X 3
72
Measures of Central Tendency
⇒ 1 = 0.0031373
4X 3
⇒ 1 = 0.0125492
X3
⇒ X3 = 79.68 ≈ 80
Thus, in the third hour Robert traveled at the speed of 80 kmph.
THE MEDIAN
The median, as the name suggests, is the middle value of a series arranged in any order
of magnitude.
As it is distinct from the arithmetic mean which is calculated from the value of every
item in the series, it is called a positional average. The term ‘position’ refers to the
place of a value in the series. The median is just the 50th percentile value below which
50% of the values in the sample fall. The object of median is therefore not merely to
fix a value that shall be representative of a set, but also to establish a dividing line
separating the higher from the lower values.
73
Foundations of Quantitative Methods
Solution:
The median for the above data can be obtained as follows:
The series should first be arranged in an order. In the present case, it has been arranged
in descending order.
Table 5.10: Sales Figures of Companies Arranged in Descending Order
Company Sales Rank
ACC 1520 1
Voltas 980 2
Tata Power 734 3
Andhra Valley 436 4
Tata Tea 412 5
Tomco 312 6
Tata Hydro 292 7
Tinplate Co. 256 8
Indian Hotels 239 9
Excel Inds 228 10
As there are 10 elements, the median will be the mean of the 5th and the 6th items, i.e.
(412 + 312)/2= 362
Thus, the median sales value of the ten companies is 362.
⎡ ((N + 1)/2) − (F + 1) ⎤
Median = ⎢ ⎥ W + Lm
⎣ f m ⎦
where,
Lm = lower limit of the median class
fm = frequency of the median class
F = cumulative frequency up to the lower limit of the median class
W = width of the class interval
N = total frequency.
Example 5.9
Let us find median for the following data.
Table 5.11: Gross Profit as a Percentage of Sales
Gross Profit as a
0-10 10-20 20-30 30-40 40-50
percentage of sales
No. of companies 21 32 43 34 23
74
Measures of Central Tendency
⎡ ((N + 1)/2) − (F + 1) ⎤
Median = ⎢ ⎥ W + Lm
⎣ fm ⎦
⎡ (153 + 1) / 2 − (53 + 1) ⎤
= ⎢ ⎥ 10 + 20 = 25.3488 ≈ 25.35%
⎣ 43 ⎦
Thus 25.35% is the median gross profit of the companies.
Example 5.10
A dataset with its class interval and frequency is given below.
Class interval Frequency
0 – 10 15
10 – 20 25
20 – 30 20
30 – 40 X
40 – 50 40
50 – 60 Y
60 – 70 45
It has been given that total number of observations is 210 and the median is 42.5. Find
out the values of missing frequencies X and Y.
Solution:
As the median is 42.5, it lies in 40 – 50 class interval.
(210 / 2) − (60 + X)
42.5 = 40 + × 10
40
⇒ 4 × 2.5 = 105 - 60 - X
⇒ X = 35
The total number of frequency is 210.
∴15 + 25 + 20 + X + 40 + Y + 45 = 210
Putting the value of X,
Y = 30
75
Foundations of Quantitative Methods
THE MODE
Mode is defined as the value of the variable which occurs most frequently in the data
set.
76
Measures of Central Tendency
9000-11000 83 505
11000-13000 45 550
13000-15000 25 575
15000-17000 15 590
17000-19000 8 598
19000-21000 2 600
Solution:
Lmo = Rs. 5000, d1 = 162-100 = 62, d2 = 162-110 = 52 and w = Rs. 2000
⎛ d1 ⎞
⎜ ⎟
Mo = Lmo + ⎜ d + d ⎟ w
⎝ 1 2 ⎠
⎛ 62 ⎞
= 5000 + ⎜ 62 + 52 ⎟ × 2000
⎝ ⎠
= 5000 + 1087.72
= Rs. 6087.72
We know that,
= 3 × 26 – 2 × 27.5
= 23
77
Foundations of Quantitative Methods
SUMMARY
Central tendency is a statistical measure that takes one number as the representative of
a group. It is an economical estimate of the general characteristics of a group. There
are three main measures: mean, median, & mode. Mean is the average of all the
observations. The median is the middle of a distribution: half the observations are
above the median and half are below the median. The median is less sensitive to
extreme scores than the mean. This makes it a better measure than the mean for a data
set with extreme values. The mode is the most frequently occurring observation in a
distribution.
78
Chapter 6
Measures of Dispersion
In this chapter we will discuss:
• Range
• Interquartile Range and Quartile Deviation
• Mean Deviation
• Variance
• Standard Deviation
• Bienayme Chebyshev’s Rule
Foundations of Quantitative Methods
The previous chapter has discussed how one can calculate a single value that represents
the characteristics of the entire raw data using three main measures: mean, median, &
mode. Another important characteristic of a data set is how it is distributed, or how far
each element is from some measure of central tendency (average). There are several
ways to measure the variability of the data. Although the most common and most
important is the standard deviation, which provides an average distance for each
element from the mean, several others are also important, and are hence discussed here.
This chapter will discuss various methods to study dispersion like: range, inter quartile
range and quartile deviations, mean deviation, variance and standard deviation.
RANGE
Range is the simplest method of studying dispersion. Range is defined as the
difference between the value of the smallest observation and the value of the largest
observation present in the distribution.
Range = L - S
Where, L = Largest value
S = Smallest value
For a grouped frequency distribution or a continuous frequency distribution, range is
defined as the difference between the upper limit of the highest class and the lower
limit of the smallest class.
Range = Upper limit of the highest class - Lower limit of the lowest class.
Coefficient of Range
The range calculated above is not useful to a manager for comparison if the
observations are in different units. For example, a physical trainer cannot compare the
range of the weights of employees with range of their heights as the range of weights
would be in kilograms and that of heights in centimeters. Therefore, for the purpose of
comparison, a relative measure of range is required, which is called coefficient of
range.
largest value − smallest value
Coefficient of Range =
largest value + smallest value
L −S
=
L+S
Example 6.1
Calculate the range and coefficient of range for the following dataset.
78, 55, 76, 45, 89, 43, 65, 67, 48, 58
Range = L - S
Where, L = Largest value = 89
S = Smallest value = 43
Range = 89 – 43
= 46
largest value − smallest value
Coefficient of Range =
largest value + smallest value
89 − 43
=
89 + 43
= 0.348
80
Measures of Dispersion
Example 6.2
Table 6.1 provides the information about the number of employees in an organization
in various age groups. Compute range and coefficient of range.
Table 6.1: Ages of 100 Employees
Class Number of employees
(Yrs) (f)
56-60 7
61-65 16
66-70 39
71-75 28
76-80 10
Total 100
Range = (Upper limit of the highest class) - (Lower limit of the lowest class)
= 80-56 = 24
L −S
And, coefficient of Range = = 24/136 = 0.1765
L+S
Uses of Range
In spite of the above limitations and short-comings, range, as a measure of dispersion
has many uses.
1 Range is used in industry for the quality control of products without 100%
inspection. Range plays an important role in construction of charts used for quality
control. For example, when the weight of a spare exceeds a particular range the
entire production line is checked to correct the deviation in weight of the spare
produced using range, thus, assuring quality in production process.
2 Range is also useful in studying the fluctuations in financial and share markets.
81
Foundations of Quantitative Methods
Interquartile range
¼ of items ¼ of items
Q1 Q2 Q3
82
Measures of Dispersion
Coefficient of Q.D = Q 3 − Q 1
Q 3 + Q1
Q1 = L 1 ( 14 N − C)
+ ×h
f
Q3 = L 3 ( 34 N − C)
+ ×h
f
Where,
L1 = the lower boundary of the first quartile class (Q1)
L3 = the lower boundary of the third quartile class (Q3)
N = Total cumulative frequency
f = Frequency of the quartile class
h = Class interval (width)
C = Cumulative frequency of the class just above the quartile class
Example 6.3
The following example explains the computation of Quartile Deviation for the data
given in Table 6.2.
Table 6.2 Weights of Employees
Weight (Kg) 55-60 60-65 65-70 70-75 75-80
Number of employees 10 18 14 16 12
55-60 10 10
60-65 18 28
83
Foundations of Quantitative Methods
65-70 14 42
70-75 16 58
75-80 12 70
= 60 + 7 . 5 × 5
18
= 60 + 2.083 = 62.083
Q3 = ¾× 70 = 52.5th observation
It falls in class (70-75)
Q3 = 70 + 52 . 5 − 42 × 5
16
= 70 + 3.28
= 73.28
MEAN DEVIATION
84
Measures of Dispersion
∑ x−x
Absolute Mean Deviation = (for a sample)
n
Where,
x = value of observation
µ= the mean of population
N = number of observations in the population
x = sample mean
n = number of observations in the sample
Example 6.4
Calculate the absolute mean deviation of the leave patterns of a sample of 10 drivers in
one year for the values given in Table 6.4.
1 10 -11 11
2 15 -6 6
3 18 -3 3
4 20 -1 1
5 20 -1 1
6 22 1 1
7 23 2 2
8 25 4 4
9 27 6 6
10 30 9 9
N=10 ∑x= 210 ∑ x − x = 44
Mean ( x ) = ∑x = 210/10 = 21
N
x −x
∴ Absolute Mean deviation = ∑ = 44/10 = 4.4 days
N
85
Foundations of Quantitative Methods
Example 6.5
Compute the Mean Deviation for the data given in Table 6.5.
Frequency 4 2 1 3
N= ∑ f ∑ f × X = 72 ∑ f X − X = 46.4
=10
∑f ×X 72
X= = = 7.2
N 10
1 46.4
Absolute Mean Deviation = ∑f X − X = = 4.64
N 10
86
Measures of Dispersion
Merits:
1. Absolute mean deviation is simple and easy to understand.
2. Absolute mean deviation is a more comprehensive measure of dispersion as it is
dependent on all observations of a distribution.
3. As it is obtained by taking the average of the deviations of every observation from
the mean, it is a true measure of dispersion.
Limitations:
1. Absolute Mean deviation is less reliable as it is the arithmetic mean of the absolute
values (ignoring the positive and negative signs).
2. Absolute Mean deviation is not conducive to further algebraic treatment.
3. Absolute Mean deviation cannot be computed for distributions with open-end
classes.
VARIANCE
Variance is similar to mean deviation expect for that, it is calculated using the sum of
the squared distances between the mean and each observation divided by the total
number of elements in the distribution (population). While calculating variance the
differences (deviations) are squared to make them positive.
For Grouped Data
The formula for calculating variance ( σ 2 ) is as follows.
σ2 = ∑ f i (X i − x ) 2
N
Where,
σ 2 = Variance
Xi = the value of observation
x = Mean
N = Total cumulative frequency
fi = Frequency of a class
STANDARD DEVIATION
Standard deviation, or σ , is the square root of the average of the squared distances of
the observations from the mean (i.e. square root the variance). The standard deviation
is expressed in same units as those used in the data set.
87
Foundations of Quantitative Methods
Properties
1. The value of standard deviation remains the same, if in a series each of the
observation is increased or decreased by a constant quantity. In statistical language
we say, standard deviation is independent of change of origin.
For example, for the observations 3, 10 and 12
x = 8.33, σ = 3.859
If we increase the value of each observation by 4.5 we get the observations 7.5, 14.5
and 16.5.
Now x = 12.833
Now σ = 3.859
Hence although x has increased by 4.5, σ remains the same.
2. For a given series, if each observation is multiplied or divided by a constant
quantity standard deviation will also be similarly affected.
Consider the observations 3, 10 and 12. σ = 3.859 as shown in the above
calculation.
Suppose we multiply each observation by 6, the observations become 18, 60 and 72.
x = 50
σ= (18 − 50 ) 2 + ( 60 − 50 ) 2 + ( 72 − 50 ) 2
3
= 23.152.
It is shown that the standard deviation has also been multiplied by 6 i.e.
3.859 × 6=23.152.
Similarly if the observations 3, 10 and 12 with σ = 3.859, are divided by a constant
value, say 2. The observations now become 1.5, 5 and 6.
x = 4.16.
σ= (1 . 5 − 4 . 16 ) 2 + ( 5 − 4 . 16 ) 2 + ( 6 − 4 . 16 ) 2
3
=1.92
Thus the standard deviation is also divided by 2 i.e. 3.859/2=1.92.
88
Measures of Dispersion
n 1σ 12 + n 2 σ 22 + n 1d 12 + n 2 d 22
= σ 12 =
n1 + n 2
Where,
σ1 = standard deviation of first group
σ 2 = standard deviation of second group
d1 =x 1- x
d2 =x 2- x
x = (n1 x 1 + n2 x 2 ) / (n1 + n2)
Coefficient of Variation
The coefficient of variation is a measure of relative dispersion and is given by
89
Foundations of Quantitative Methods
6.26
Coefficient of variation= Standard deviation = =0.3598=35.98%
Mean 17.4
Example 6.7
A security analyst studied hundred companies and obtained the following Return on
Investment (ROI) data for the year 1992.Calculate the standard deviation in ROI of the
companies.
Table 6.9: ROI of 100 Companies
Returns % 0-10 10-20 20-30 30-40
No. of Companies 19 32 41 8
Solution:
We can find the variability in the ROI of the companies by calculating the standard
deviation for the above data.
The steps involved are:
• Find mean for grouped data.
• Find deviations from mean for grouped data.
• Find square of above deviations.
• Sum up the squared deviations taking frequency into account.
• Take square root.
90
Measures of Dispersion
Return on No. of
Mid-point Deviation
investment companies
% X f fX X-x f(X- x )2
Mean µ = ∑ fX 1880
= = 18 . 8 %
∑ f 100
2
Standard Deviation = ∑ f (X − x ) = 7756 = 8.81%
∑ f 100
This rule was developed by Russian mathematicians named Bienayme and P.L.
Chebyshev. It says that what ever may be the shape of a distribution, at least 75 percent
of the values in the population will fall within ± 2 standard deviations of the mean and
at least 89 percent will fall within ± 3 standard deviations from the mean.
The rule states that the percentage of data observations lying within ± k standard
deviations of the mean is atleast ⎛⎜ 1 − 12 ⎞⎟ × 100
⎝ k ⎠
This formula applies to differences greater than one standard deviation about the mean,
and k must be greater than 1.
In case of a symmetrical bell-shaped curve, we can say that:
1. Approximately 68 percent of the observations in the population fall within ±1
standard deviation from the mean
2. Approximately 95 percent of the observations in the population fall within ±2
standard deviations from the mean.
3. Approximately 99 percent of the observations in the population fall within ±3
standard deviation from the mean.
The diagrammatic representation of the location of observations around the mean of a
bell-shaped frequency distribution is given in Figure 6.2
91
Foundations of Quantitative Methods
99%
95%
68%
SUMMARY
Measures of dispersion indicate how much the data in a given set of numerical data are
spread out. The simplest of these measures is the range. The range is the distance
between the largest and the smallest value in the data set. Of all measures of
dispersion, the range is most sensitive to extreme values. Similar in spirit to the range,
but not affected by outliers, is the interquartile range, that is, the difference between the
third and first quartiles. In a sense, the interquartile range is the range of the middle
half of the data. The mean deviation and the standard deviation indicate how much the
average value in the data set differs from the mean. Whereas the mean deviation is just
that - the mean distance of the measurements from the mean, the standard deviation is
usually calculated by first finding the variance and then extracting the square root of
the variance. Although more difficult to compute than the other measures of dispersion,
the standard deviation and the variance are the most useful and accurate methods.
92
Chapter 7
Index Numbers
In this chapter we will discuss:
• Definition of Index Number
• Characteristics of Index Numbers
• Uses of Index Numbers
• Types of Index Numbers
• Problems Related to Index Numbers
• Methods of Constructing Index Numbers
• Quantity and Value Indices
Foundations of Quantitative Methods
Index numbers are defined by different people in different ways. They are referred to
as measures of change, a device to measure change or a series representing the process
of change. Depending on these definitions, the following are some of the
characteristics of index numbers.
Comparative Nature
Index numbers by nature act as devices to measure change and hence they can be used
to compare two variables. They compare changes taking place over time among
variables in a category.
94
Index Numbers
In the beginning, index numbers were constructed to study the price levels over a
period of time. But today these index numbers are extensively used in fields of
economics and business management to compare data relating to production, sales,
revenues and other financial matters. Some of the important uses of index numbers are
as follows.
95
Foundations of Quantitative Methods
Price Index
Price index compares the changes in prices of a basket of commodities from one period
to another. General price index is used to measure the value of money. Price index
numbers are the most popularly used devices employed in various business and
economic decisions. This price index is further classified into wholesale price index
numbers and retail price index numbers.
Wholesale price index numbers
These index numbers give the change in the general price levels in a country. Official
wholesale price index in India was first complied by the Economic Advisor (Ministry
of Commerce) in 1947 (base year August 1939).
Retail price index numbers
Retail price index numbers give the changes in the retail prices of various commodities
like consumables, stocks, shares, bank deposits, government bonds, etc. Labor Ministry
prepares these retail price index numbers separately for urban and rural areas.
Cost of living index is a type of retail price index, which helps in measuring the impact
of changes in price of selected basket of goods on the purchasing power of people of
various income groups.
Quantity Index
Quantity index number measures how much the number or quantity of a variable
changes over time. For example, it measures the changes in the volume of goods
produced, purchased or consumed, the indices of agricultural production, industrial
production, imports and exports, etc. To be precise, quantity index numbers help in
comparing the level of physical output in an economy over a period of time.
Value Index
Value index measures the change in total monetary value over a period of time. Value
index combines price and quantity changes; here both the total value of the product and
value added is taken. Hence, it is useful in cases like sales, inventory, foreign trade,
etc.
Though index numbers offer several advantages, there are many problems which will
influence the construction of index numbers. Some of the problems are as follows:
Inadequate Data
Sometimes, managers find it difficult to obtain suitable data for computing an index.
For example, suppose the sales manager of a fertilizer company want to develop an
index that would describe the seasonal variations in the fertilizer sales. If the sales
figures are available only on yearly basis, he cannot design an index to measure the
seasonal sales pattern.
Incomparability of Indices
This problem arises when one tries to compare one index with another when there is a
change in the basis product or service that is being measured. For example, when we
compare price indices of automobiles from 1990 to 2000, we find that prices have
96
Index Numbers
increased substantially. But this comparison fails to take into consideration the
technological advances in the design and the quality of automobiles manufactured over
a time period (1990-2000).
System of Weights
In developing a composite index like consumer price index all commodities included
are not of equal importance. Hence, the system of weights and the allocation of weights
to different commodities is very important. Inappropriate weighting of commodities
can distort the index. For example, a rupee per liter increase in price of diesel cannot be
compensated by a rupee decrease in road tax as the increase in price of diesel has a
much greater impact on the consumer than that of decrease in the road tax.
Suppose, we are calculating price index using unweighted aggregate method, we use
price as a variable and quantity as weight. The following example (Table 7.1) shows
the computation of price index using unweighted aggregative method.
Table 7.1: Calculation of Price Index Using Unweighted Aggregates
Method
PRICE
Commodities 1990 (in Rs) 2000(in Rs)
A 30.00 40.00
B 20.00 25.00
C 6.00 12.00
D 1.00 1.50
∑ P0 = 57.00 ∑ P1= 78.50
Then unweighted aggregate price index is
= ∑ 1 × 100 = 137.7
P
∑ P0
So the unweighted aggregate price index is 137.7, it implies the prices of commodities
has increased by 37.7 percent from 1990 to 2000.
The major disadvantage of this method is that it fails to assign greater weight to high-
consumption good over low-use goods. Hence it fails to be a real measure of change in
prices over a period of time.
98
Index Numbers
99
Foundations of Quantitative Methods
Thus, Laspeyres price index = 133.2. This implies that the prices have increased by
33.2 percent from that of the base year to the present year. The major advantage of
using this Laspeyres index is that one can directly compare the value of any two years
computed against the same base year. The basic disadvantage Laspeyres method is that
it does not consider the change in quantities (consumption) patterns between the base
year and the current year. This will distort the price index.
Paasche method
Paasche method uses the quantity measures for the current period rather than for the
base period (as used in Laspeyres method). Paasche index is calculated similar to the
Laspeyres price index. The formula for calculating Paasche price index is as follows.
Paasche price index = ∑ Pi Q i × 100
∑ P0 Q i
Where
Pi = Prices of commodities in the current period
P0 = Prices of commodities in the base period
Qi= Quantities in the current period
The most important advantage of Passche method is it uses the changes in price and
consumption patterns. Thus, it is a better measure of general changes in the economy
than the Laspeyres’ method of calculating weighted aggregates index. The following
example (Table 7.4) shows how to calculate a Paasche index.
Table 7.4: Calculating Paasche Index
Current Base price Current
Commodity PiQi P0Qi
price Pi P0 Quantity Qi
A 40 30 120 4,800 3,600
B 25 20 100 2,500 2,000
C 12 6 20 240 120
Total 77 56 240 7,540 5,720
∑ Pi Q i
Paasche price index = × 100 = (7540/5720) x 100 = 131.81
∑ P0 Q i
Hence, Paasche price index is 131.81. This implies that the prices have increased by
31.81 percent from the basic year to the current year. Thus each value used in
developing the Paasche price index is the combination of price and quantity changes
from the base year. The primary disadvantage of this method is that it is impossible to
compare the indices from different periods as the quantity measure used for one period
is different from that used in other period. Although Paasche method is a better
indicator of general changes in economy, it cannot be used to directly compare indices
(as possible in Laspeyres method).
Fisher method
Fisher price index number is given by the geometric mean of Laspeyres and Paasche
formula. The formula for computing Fisher’s index is as follows.
∑ Pi Q 0 ∑ Pi Q i
= × 100 × × 100
∑ P0 Q 0 ∑ P0 Q i
100
Index Numbers
B 70 20 25 1400 1750
C 20 6 12 120 240
∑ Pi Q 2
Fixed weight aggregates price index = × 100 = (6790/5120) × 100 = 132.62
∑ P0 Q 2
Hence fixed weight aggregates price index is 132.6. The basic advantage using this
method is that it gives the manager the freedom to select the base price and the fixed
quantities (capital). This allows the manager to select the base price and quantity from
different periods to develop more accurate indices.
101
Foundations of Quantitative Methods
⎛P ⎞
∑ ⎜⎜ i × 100 ⎟⎟
P
Unweighted average of relatives index = ⎝ 0 ⎠ = 608/4 = 152
n
From this calculation it is clear that the general price index for year 2000 is 152 using
unweighted average of relatives index. But, for the same problem when the index was
calculated by using unweighted aggregates it was 137.7. The difference in indices
calculated by these two methods arises due to the fact that in unweighted average of
relatives method the average of the ratios of the prices for each commodity is
computed separately, and with the unweighted aggregates method the ratio of sums of
the prices of each product is calculated. Instead of assigning weights, the unweighted
average of relatives’ method converts each commodity into a relative scale where each
of them is shown as a percentage.
102
Index Numbers
The value of index computed using the above equation would produce the same results
for any problem as that obtained by using the Laspeyres method. The following
example shows the computation of price index using weighted average of relatives’
method.
Weighted average of relatives index
⎡⎛ P ⎞ ⎤
∑ ⎢ ⎜⎜ i × 100 ⎟⎟ ( Pn Q n ) ⎥
P
= ⎢⎣ ⎝ 0 ⎠ ⎥⎦ = ⎡ 549000 ⎤ = 133.25
⎢ 4120 ⎥
∑ n n
P Q ⎣ ⎦
Weighted average of relatives index is 133.25.
103
Foundations of Quantitative Methods
Where
Qi= Quantities of commodities used in the current period
Q0=Quantities of commodities used in the base period
Qn and Pn =Quantity and price values used as weights
The major advantage of using the quantity index is that it gives a better measure of the
changes in the economy in times of inflation because it measures the actual output of
raw materials and finished goods rather than taking the price of goods.
Value index measures general changes in the total value of some variable. It measures
the combined effects of price and quantity changes. The index numbers can also be
used to describe the changes in quantity and value.
Example 7.1
The consumption of material by a bakery is given below.
Price (in) Quantities used
Inputs Units Po P1 Qo Q1
1993 1996 1993 1996
Flour Kilo 15 30 500 700
Eggs Dozen 8 14 100 70
Milk Litre 6 17 200 120
Sugar Kilo 10 16 50 70
Calculate Laspeyre’s price index, Laspeyre’s quantity index, Paasche’s quantity index
and unweighted average of relative index.
a. Laspeyre’s price Index for the given data is
P1Q0 P0Q0 P0Q1 P1Q1 ⎛ P1 ⎞
⎜⎜ ⎟⎟ × 100
⎝ P0 ⎠
20600
= × 100
10000
= 206
b. Laspeyre’s quantity index for the given data is
Laspeyre’s quantity index = ∑ Q 1 P0 × 100
∑ Q 0 P0
12480
= × 100
10000
= 124.8
104
Index Numbers
SUMMARY
An index number is a statistical measure designed to show changes in variables or a
group of related variables with respect to time, geographic locations or other
characteristics of the variable under study. It is referred to as a measure of change, a
device to measure change or a series representing the process of change. Index
numbers are used as a barometer to indicate the changes in economic activity. They
also provide framework for decision making and to forecast future events. There are
three types of index numbers which are generally used. They are price index, quantity
index and value index. These index numbers can be developed either by aggregate
method or by average of relatives method. This chapter discussed various methods of
weighting an index like the Laspeyres method, Paasche method, Fisher method and the
fixed weight aggregates method. Although there are many problems related to
developing an index it is a handy device to measure or compare the changes in
economic variables over a period of time.
105
Chapter 8
Introduction to Probability
and Probability
Distributions
In this chapter we will discuss:
• Basic Probability Concepts
• Types of Probability
• Probability Rules
• Random Variables
• Types of Probability Distributions
• Normal Distribution
Intro to Prob. and Prob. Distributions
The concept of probability originated in the seventeenth century and has become one
of the most fascinating and debatable subjects in the recent years. Probability has
gained a lot of importance and the mathematical theory of probability has become the
basis for statistical applications in the areas of management, space technology and the
like. In fact, most of the people use probability in their day-to-day lives without being
aware of it. Statements like “It may rain today”, “Probably I will continue with the
same job”, “India might win the cricket series against Australia,” etc. are examples of
the usage of probability in day-to-day life.
Various business decisions in real life are made under situations when a decision
maker is very uncertain as to what will happen after the decisions are made. The theory
of probability is of great help in all such areas. In particular, it enables a person to
make ‘educated guesses’ on matters where either full facts are not known or there is
uncertainty about the outcome. The probability formulae and techniques were
developed by Jacob Bernoulli, De Moiure, Thomas Bayes, and Joseph Lagrange. Later
on Pierre Simon and Laplace unified all these early ideas and compiled the first general
theory of probability. Even though, volumes have been written on probability, the
controversies concerned with the concepts of probability theory continue.
The concept of probability was used by gamblers during the early days in games of
chance such as throwing a die, drawing a card from the deck or tossing a coin. In these
games of chance, there is an uncertainty regarding the face of the die that will appear in
a throw or the card that will appear in a draw or the face of a coin that will appear
when it is tossed. Although there is an uncertainty concerning the outcome of any
particular throw or any particular drawing, there is a predictable long-term outcome.
For instance, if a die is thrown many times, experimental studies have shown that the
probability of a number to appear is one sixth (as the die has 6 faces).
Before defining the term ‘probability’, it is necessary to familiarize with certain terms
that are used in this context.
Experiment
Any operation / process that results in two or more outcomes is called an experiment.
Examples of an experiment:
• Rolling an unbiased die is an experiment, where the number that is to appear on
the face of the die is unpredictable and subject to change.
• Tossing a fair coin is an experiment, where the outcome head or tail is
unpredictable.
Random Experiment
Any well-defined process of observing a given chance phenomena through a series of
trials that are finite or infinite and each of which leads to a single outcome is known as
a random experiment.
Examples of random experiment:
• Drawing a card from a pack of 52 cards. This is also a chance phenomenon with
only one outcome.
• Drawing a ball from a bag containing a given number of red, blue and white balls.
This is also a chance phenomenon with only one outcome.
107
Foundations of Quantitative Methods
Possible Outcome
The result of a random experiment is called an outcome. For example, picking a card
from a pack of 52 cards and getting an ace or a Jack or a Queen or a King is an
outcome.
Event
An event is one or more possible outcomes of an experiment or a result of a trial or an
observation. In other words, an event is used to denote a phenomenon that occurs with
every realization of a set of conditions.
Elementary Event
A simple or elementary event is a single possible outcome of an experiment. A simple
event cannot be further subdivided into a combination of other events.
Example
Throwing a die and the event of getting a six (6) is a simple event.
Compound Event
When two or more events occur in connection with each other, then their simultaneous
occurrence is called a compound event. The compound event is an aggregate of simple
events.
Example
When we roll two dice, then the event of getting a six on either the first or second die is
a compound event.
Favorable Event
The number of cases favorable to an event in a trial is the number of outcomes that
result in the happening of a particular event.
Examples
• In drawing a card from a pack of 52 cards, the number of favorable cases for
drawing an ace is 4, for drawing a spade are 13 and for drawing a black card are
26.
• In throwing of three die, the number of cases favorable to getting the sum of 4 is:
(1, 1, 2), (1, 2, 1), (2, 1, 1), i.e. totally three favorable outcomes.
108
Intro to Prob. and Prob. Distributions
Exhaustive Events
The total number of possible outcomes in any trial is known as exhaustive events or
exhaustive cases.
Examples
• In tossing a fair coin, there are two possible outcomes, head and tail. The list of
these outcomes is exhaustive since the result of any toss must be either head or
tail, if the possibility of the coin standing on an edge is ignored.
• For throwing two dice, the exhaustive number of cases is 62 = 36. In general, for
throwing ‘n’ dice, the exhaustive number of events is 6n. This is because any of the
six numbers from 1 to 6 of the first die may be associated with any of the six
numbers of the other dice.
The sum of the probabilities for a mutually exclusive event or an exhaustive event
should be equal to one.
Complementary Events
A complementary event is the number of unfavorable cases in an experiment. Suppose
‘E’ is an event of the number of favorable cases in the experiment, then a
complementary event denoted by E is the number of unfavorable cases in that
experiment. The events E and E are mutually exclusive and exhaustive.
109
Foundations of Quantitative Methods
Examples
• In drawing a card from a pack of 52 cards, the event of getting an ace of diamond
is only one and that of getting the complementary i.e., unfavorable event is 51.
• In throwing a die, the favorable event of getting a face with number 1 is 1 and the
unfavorable event of getting it is 5.
The sum of the probabilities of an event and its complementary event is one.
TYPES OF PROBABILITY
There are three basic ways of classifying probability based on the conceptual
approaches to the study of probability theory. There is disagreement among the experts
regarding the appropriate approach of probability. The basic approaches are:
• Classical approach
• Relative frequency of occurrence approach
• Subjective approach
Classical Approach
The classical approach is based on the assumption that each event is equally likely to
occur. This is an apriori assumption (the term apriori refers to something that is
known by reason alone) and the probability based on this assumption is known as
apriori probability. This approach employs abstract mathematical logic and hence is
also called as ‘abstract’ or ‘mathematical’ probability. This is the reason for
considerable use of familiar objects like cards, coins, dice, etc., where the answer can
be stated in advance before picking a card, tossing a coin or throwing a die,
respectively.
Definition
If a random experiment results in ‘N’ exhaustive, mutually exclusive, and equally
likely outcomes, out of which ‘f’ are favorable to the happening of an event ‘E’, then
the probability of occurrence of E, usually denoted by P(E) is given by
P = P (E) = f / N
This is rather a complex way of defining something that may seem intuitively obvious,
but it can be used to tossing a coin and throwing a die example.
(or)
110
Intro to Prob. and Prob. Distributions
Hence, the probability of happening of an event plus the probability of not happening
of the event is always equal to unity. Obviously, p as well as q are non-negative and
cannot exceed unity i.e., 0 ≤ p ≤ 1 and 0 ≤ q ≤ 1.
If the probability of an event, P (E) is zero then, the event is called an impossible event
and if P (E) = 1 then the event is called a certain event.
Classical approach can be illustrated for tossing of a coin or a die. Suppose that the
probability of getting a head on a single toss is to be calculated, then using formal
terms,
P (Head) = 1 = 0.5
1+1
The probability of getting ‘3’ on a single throw of a die is to be calculated, then using
formal terms,
P (3) = 1 = 1/6 = 0.167
1+1+1+1+1+1
Example 8.1
What is the probability of drawing a face card (King, Queen or Jack) from a pack of
well shuffled pack of 52 cards?
Solution:
The total number of face cards in a pack of 52 = 12 cards
111
Foundations of Quantitative Methods
such as an employee resigning from a job before his/her retirement age or the delay in
delivery of a product to a nearby customer cannot be predicted using this approach.
Subjective Approach
Subjective probabilities are those assigned to events by the manager or the researcher
based on the past experiences or occurrences or on the evidences available. It may be
an educated guess or intuition. At higher levels of managerial decisions, when the
decision making becomes very important, specific and is demanded to be unique,
managers use subjective probability. The approach was introduced by Frank Ramsey in
1926.
PROBABILITY RULES
112
Intro to Prob. and Prob. Distributions
i.e., 0 ≤ P (A) ≤ 1. This rule says that probabilities cannot be negative and as the
probability of the sample space is 1, the probability of an event contained in the sample
space should be less than or equal to 1.
Rule 3
If A and B are mutually exclusive events, then the probability of (A or B) is equal to
the sum of the probabilities of A and B.
P (A or B) = P (A) + P (B) because P (A and B) = 0 as A and B are mutually exclusive.
Mutually exclusive events are those which do not overlap when represented in Venn
diagrams (See Figure 8.1).
A B C
Two events A and B are mutually exclusive if the occurrence of one implies the non-
occurrence of the other. Hence obtaining a head on tossing a coin and obtaining a tail
are mutually exclusive events.
113
Foundations of Quantitative Methods
As there are six possible equally likely outcomes on throwing the dice,
P (A or B or C) = 3 = 1 + 1 + 1 = P (A) + P (B) + P(C)
6 6 6 6
Nonmutually exclusive events
If two events are not mutually exclusive the probability of one of them occurring is the
sum of the marginal probabilities of the events minus the joint probability of the
occurrence of the events.
C D
114
Intro to Prob. and Prob. Distributions
Multiplication Rule
Independent events
Two events, A and B are said to be independent, if the probability of happening or not
happening of an event is not affected by the happening or not happening of the other.
115
Foundations of Quantitative Methods
Thus, in the case of independent events the probability of any event is not conditional,
and does not depend on the happening or not happening of the other event either
preceding, following or at the same time.
The multiplication rule of probability states that the probability of happening of an
independent event or unconditional event is given by the product or the probabilities of
each event.
Symbolically,
P (A ∩ B) = P (A). P (B)
or P (A and B) = P (A). P (B)
Thus, in the case P (A/B) = P (A) and P (B/A) =P (B)
Because the happening of A or B does not affect the happening of the other as both are
independent.
Example 8.6
In a survey involving 500 children, it was found that 200 children liked strawberry ice-
cream, 125 children liked chocolate ice-cream and 75 children liked both the ice-
creams. What is the probability of finding a child in the survey who neither likes
strawberry ice-cream nor chocolate ice-cream?
Solution:
Let S represent the event that a child likes strawberry ice-cream
200
Thus P (S) = = 0.4
500
Let C represent the event that a child likes chocolate ice-cream
125
Thus P (C) = = 0.25
500
75
P (S and C) = = 0.15
500
P (S or C) = P(S) + P(C) - P (S and C)
= 0.4 + 0.25 – 0.15
= 0.5
P (neither S nor C) = 1 - P (S or C)
= 1- 0.5
= 0.5
Example 8.7
An assembly line consists of drilling, welding and painting machine. The assembly line
works fine if all the three machines work properly. The probability of drilling machine
failing in an year is 0.20. The probability of welding machine failing in an year is 0.25.
The probability of painting machine failing in an year is 0.15. What is the probability
that the assembly line will fail before the end of the year?
Solution:
Let A, B and C represents the event of drilling, welding and painting machines failing
in a year respectively.
116
Intro to Prob. and Prob. Distributions
= 0.92
117
Foundations of Quantitative Methods
Example 8.9
A company wants to decide its strategy for the launch of a new product. It is trying to
estimate the level of competition that might exist in the market after the launch of the
new product. The top management of the company feels that three competitors A, B
and C might enter the market. There are 60% chances that competitor A will enter the
market, 30% chances that competitor B will enter, and 10% chances that competitor C
will enter.
a. What is the probability that no competitor will enter the market?
b. What is the probability that A and C will enter the market as competitors but not
B?
Solution:
Let A represents the event of competitor A entering the market
Α represents the event of competitor A not entering the market
B represents the event of competitor B entering the market
Β represent the event of competitor B not entering the market
C represents the event of competitor C entering the market
C represents the event of competitor C not entering the market
Given, P(A) = 0.6
P (B) = 0.3
P (C) = 0.1
P ( Α ) = 1- 0.6 = 0.4
P ( Β ) = 1-0.3 = 0.7
P ( C ) = 1- 0.1 = 0.9
a. P (A, B and C will not enter the market) = P ( Α ) × P ( Β ) × P ( C )
= 0.4 × 0.7 × 0.9
= 0.252
b. P (A and C will enter the market but not B) = P (A) × P (C) × P ( Β )
= 0.6 × 0.1 × 0.7
= 0.042
RANDOM VARIABLES
Before proceeding further, let us first understand the concept of random variables,
which enables us to understand the concept of probability distributions better.
Random variable is a variable that takes different values as a result of the outcomes of
a random experiment. A random variable is said to be continuous if it is allowed to
assume any value within a specified range and is said to be discrete if it is allowed to
take only a limited number of values, which can be listed. This can be further
explained through the following example. Suppose an unbiased pair of dice is tossed.
The possible outcome of the sum of the upper faces of the two dice can take any
integer value between 2 and 12. The outcome is said to be discrete because it can take
only a finite (or countable) number of values.
On the other hand, if the task is to determine the mean age of a sample of 1000 voters,
the possible outcome (X) can take any value in an interval(s) of numbers and is hence
118
Intro to Prob. and Prob. Distributions
continuous. It is a general practice to use capital letters for random variables and lower
case letters to indicate the actual value it takes. That is X = x.
119
Foundations of Quantitative Methods
because the expected value is a weighted average of the outcomes that can be expected
in the future. The dentist should recompute the expected value and update his
information on a regular basis.
Table 8.3: Calculation of Expected Value
120
Intro to Prob. and Prob. Distributions
The values given in the relative frequency column of the Table 8.4 are nothing but the
probabilities associated with the values of X. So, the above findings can be slightly
modified as shown in Table 8.5.
Table 8.4: Results of Tossing a Balanced Coin
X Frequency Relative Frequency
0 1 1/8
1 3 3/8
2 3 3/8
3 1 1/8
Table 8.5 is a typical example of a discrete probability distribution. One may view this
distribution as eight numbers (for instance, eight students taking a 3-subject exam in
which one failed in all, 3 got through one subject, and so on). The mean of these
numbers is calculated as below.
Mean (µ) = 0 + 1 + 1 + 1 + 2 + 2 + 2 + 3
8
µ = 1.5
The above equation can also be written as
µ = 0 × 1 + 1× 3 + 2 × 3 + 3 × 1
8 8 8 8
This can also be generalized to any discrete probability distribution as:
µ = x1P(X = x1) + x2 P(X = x2 ) + ..... + xn P(X = xn )
n
µ = ∑ x i P( x = x i )
i =1
121
Foundations of Quantitative Methods
downstream. We can see therefore that the parameter, ppm can take on a very large
number of values. This distribution would therefore be called a continuous probability
distribution as all values are so close to each other that we cannot single out a group of
them.
If σ is the standard deviation of the normal distribution, 68% of the observation will be
in the interval µ -1σ to µ +1σ.
95.5% of the observations will lie in the interval µ -2σ to µ + 2σ
99.7% of the observations will lie in the interval µ -3σ to µ + 3σ
122
Intro to Prob. and Prob. Distributions
123
Foundations of Quantitative Methods
Solution:
X −μ − 61 = -5.08
Z= =
σ 12
Example 8.15
A normal variable has a mean of 10 and a standard deviation of 5. What is the
probability that the normal variable will take a value in the interval 0.2 to 19.8?
Solution:
Probability (0.2 < X < 19.8)
= Probability ⎛⎜ 0.2 − 10 ≤ Z ≤ 19.8 − 10 ⎞
⎟⎟
⎜ 5 5
⎝ ⎠
Z = 1.6
Z=1
Example 8.17
In a class of 60 students, 10% of the students received A grade in Business Strategy
and 20% received E grade. Students who scored more than 75 marks received A grade
124
Intro to Prob. and Prob. Distributions
and those who scored less than 35 marks received E grade. If student marks are
normally distributed, what is the mean and standard deviation of marks?
Solution:
The value of standardized normal variable (z) for the students who scored between
mean and 75 = 1.282
75 − μ
Therefore 1.282 =
σ
1 μ + 1.282σ = 75 (i)
30% 40%
20% 10%
35 ⇐ 75
Similarly for the students who score less than 35 marks, the value of standardized
normal variable = -0.842
35 − μ
Therefore − 0.842 =
σ
1 μ − 0.842σ = 35 (ii)
SUMMARY
The concepts of probability were first developed to assist gamblers in games of chance
such as throwing dice or tossing a coin. However, the concepts were later applied in
several areas of management and space technology.
Probability concepts can be broadly classified into three majority types based on the
approaches to the study of probability theory: classical approach, relative frequency
approach and subjective approach. This chapter discussed the rules of probability.
Probability distributions are considered as theoretical frequency distributions. As these
distributions deal with expectations, they are used in making inferences and decisions
under uncertainty conditions.
Random variable is a variable that takes different values as a result of the outcomes of
a random experiment. The expected value of a random variable is obtained by
multiplying each value that the variable can assume with the probability of occurrence
of that value and then summing up these products. This chapter discusses the concepts
of random variables and their expected values.
Probability distributions can be broadly classified into two types: discrete and
continuous. While, binomial and Poisson distributions are discrete in nature, normal
distribution is continuous in nature.
125
Chapter 9
Matrices
In this chapter we will discuss:
• Definition of Matrix
• Types of Matrices
• Operations on Matrices
• Determinant of a Matrix
• Properties of Determinants
• Adjoint of a Matrix
• Inverse of a Matrix
• Application of Matrix Methods to Solve Linear Equations
Matrices
DEFINITION OF MATRIX
and
⎡0 1 2 4 ⎤
Example for a 3 × 4 matrix: ⎢ 0 7 8 9 ⎥
⎢ ⎥
⎢⎣ 5 7 1 2 ⎥⎦ 3× 4
TYPES OF MATRICES
Row Matrix
A matrix having only a single row is called a row-matrix. The following are the
examples
A = [4 6] (Row matrix of order 1 × 2)
B = [3 -4 2] (Row matrix of order 1 × 3)
Note: A row matrix can have any number of columns but it can have only one row.
127
Foundations of Quantitative Methods
Column Matrix
A matrix having only a single column is called a column matrix. The following are the
examples of column matrix:
⎡1 ⎤
X = ⎢3 ⎥ (Column matrix of order 3×1)
⎢ ⎥
⎢⎣ 6 ⎥⎦
3×1
⎡8 ⎤
⎢ ⎥
Y = ⎢6 ⎥ (Column matrix of order 4×1)
⎢4 ⎥
⎢ ⎥
⎣1 ⎦ 4×1
Note: A column matrix can have any number of rows but it can have only one column.
Square Matrix
An m × n matrix for which m = n, i.e., (the number of rows = number of columns) is
called a square matrix of order n (since m = n) or m rowed matrix. The following is a
square matrix of order 3×3.
⎡2 3 4⎤
A = ⎢5 6 7 ⎥ is a square matrix of order 3
⎢ ⎥
⎢⎣1 0 2 ⎥⎦
3× 3
If A = [aij]m x n is a square matrix of order m, then the elements aij for which i = j, are
called the diagonal elements of A (i.e. a11, a22, a33.......ann are diagonal elements). In the
above square matrix A, 2, 6, 2 are the diagonal elements.
The line along which these diagonal elements lie is called the principal diagonal or
the main-diagonal of the matrix.
Diagonal Matrix
A square matrix, in which all the elements except the diagonal elements are zero, is
called a diagonal matrix.
If A = [aij] is a diagonal matrix of order m, where aij = 0 for i ≠ j. It can be expressed
as
A = diag. [a11, a22, a33 .........amm]
⎡7 0 0⎤
Example: A = ⎢ 0 6 0 ⎥ is a diagonal matrix as all the elements other than the
⎢ ⎥
⎣⎢ 0 0 9 ⎥⎦ 3× 3
diagonal elements are zeros. It is denoted as A = diag. [7, 6, 9]
128
Matrices
Scalar Matrix
A diagonal matrix whose diagonal elements are all equal is called as scalar matrix.
⎡2 0 0⎤
A = ⎢0 2 0⎥
⎢ ⎥
⎢⎣ 0 0 2 ⎥⎦
3× 3
I2 = ⎡⎢1 0 ⎤⎥
⎣ 0 1 ⎦ 2× 2
⎡1 0 0 0 ⎤
⎢ ⎥
I4 = ⎢ 0 1 0 0 ⎥
⎢0 0 1 0 ⎥
⎢ ⎥
⎣ 0 0 0 1 ⎦ 4× 4
3 4
and B = ⎡3 4 ⎤ is non-singular as B = = -5 ≠ 0
⎢2 1 ⎥ 21
⎣ ⎦
OPERATIONS ON MATRICES
To use matrices, one should be conversant with mathematical operations like addition,
subtraction and multiplication on matrices. The following are some of the
mathematical operations on matrices and their properties.
Addition of Matrices
Let A and B be two matrices of the same order m×n then their sum (A+B) is defined
to be the matrix of the order m × n obtained by adding the corresponding elements of
A and B.
If A = [aij]m×n and B = [bij]m×n
Then A + B = [aij + bij]m×n
Example 9.1
If A = ⎡⎢ 2 1 ⎤⎥ and B = ⎡⎢ 3 4 ⎤⎥ Find A + B
0 1
⎣ ⎦ 1 2 ⎣ ⎦
Solution:
A + B = ⎡⎢ 2 + 3 1 + 4 ⎤⎥ = ⎡⎢ 5 5 ⎤⎥
⎣0 + 1 1 + 2 ⎦ ⎣1 3 ⎦
If A = ⎡⎢1 3 ⎤⎥ and B = ⎡⎢ 3 1 0 ⎤⎥
⎣2 4⎦ ⎣6 2 5⎦
then A + B is not defined, since A and B are not of the same order.
Properties of matrix addition
Some of the properties of matrix addition are:
Commutative property
If A and B are two matrices of same order m×n then,
A+B=B+A
Example 9.2
If A = ⎡⎢ 2 1 ⎤⎥ and B = ⎡⎢ 3 4 ⎤⎥
⎣ 0 1⎦ ⎣1 2 ⎦
Solution:
Then A + B = ⎡ 2 1⎤ + ⎡3 4 ⎤ = ⎡ 2 + 3 1 + 4 ⎤ = ⎡5 5 ⎤
⎢0 1 ⎥⎦ ⎢1 2 ⎥ ⎢0 + 1 1 + 2 ⎥ ⎢1 3 ⎥
⎣ ⎦
⎣ ⎣ ⎦ ⎣ ⎦
130
Matrices
B + A = ⎡⎢ 3 4 ⎤⎥ + ⎡⎢ 2 1 ⎤⎥ = ⎡⎢ 3 + 2 4 + 1 ⎤⎥ = ⎡⎢ 5 5 ⎤⎥
⎣1 2 ⎦ ⎣ 0 1 ⎦ ⎣1 + 0 2 + 1 ⎦ ⎣1 3 ⎦
∴A + B = B + A
Associative property
If A, B, C are three matrices of order m × n then,
(A+B) + C = A + (B + C)
Example 9.3
If A = ⎡ 2 1 ⎤ B = ⎡3 4 ⎤ C = ⎡1 2 ⎤
⎢0 1⎥ ⎢1 2 ⎥ ⎢3 4 ⎥
⎣ ⎦ ⎣ ⎦ ⎣ ⎦
Solution:
A + B = ⎡ 2 1 ⎤ + ⎡ 3 4 ⎤ = ⎡⎢ 5 5 ⎤⎥
⎢ 0 1 ⎥ ⎢1 2 ⎥ ⎣1 3 ⎦
⎣ ⎦ ⎣ ⎦
Then (A + B) + C = ⎡⎢ 5 5 ⎤⎥ + ⎡ 1 2 ⎤ = ⎡ 6 7 ⎤
⎣1 3 ⎦ ⎢3 4 ⎥ ⎢4 7⎥
⎣ ⎦ ⎣ ⎦
B + C = ⎡ 3 4 ⎤ + ⎡1 2 ⎤ = ⎡ 4 6 ⎤
⎢1 2 ⎥ ⎢3 4 ⎥ ⎢4 6⎥
⎣ ⎦ ⎣ ⎦ ⎣ ⎦
A + (B + C) = ⎡ 2 1 ⎤ + ⎡ 4 6 ⎤ = ⎡ 6 7 ⎤
⎢0 1⎥ ⎢4 6⎥ ⎢4 7⎥
⎣ ⎦ ⎣ ⎦ ⎣ ⎦
Hence (A + B) + C = A + (B + C)
Existence of additive inverse
For a matrix A = [aij]m×n its additive inverse is defined as the matrix [-aij]m×n and is
denoted by -A. Therefore, we have A + (-A) = 0 = (-A) + A
Example 9.4
⎡1 4 7 ⎤
Given the matrix M= ⎢⎢2 5 8⎥⎥ . Find the additive inverse of matrix M.
⎢⎣3 6 9⎥⎦
Solution:
For a matrix A = [aij]m×n its additive inverse is defined as the matrix [-aij]m×n and is
denoted by -A.
⎡ − 1 − 4 − 7⎤
⎢− 2 − 5 − 8 ⎥
⎢ ⎥
⎢⎣ − 3 − 6 − 9 ⎥⎦
Multiplication of Matrices
Let A = [aij]m×n and B = [bjk]n×p
be two matrices such that the number of columns in A is equal to the number of rows
in B. Then the product of the two matrices is given by
C = [cik ]m× p
m n p
where cik = ∑ ∑ ∑ a ij b jk (product of A and B in that order)
i =1 j=1k =1
and is denoted as C = AB
Thus, for two matrices A and B, if A is an m × n matrix and B is an n × p matrix, then
AB will be an m × p matrix.
The (ik)th element of AB is obtained by multiplying each element of ith row of A by
the corresponding element in the kth column of B and then adding the products.
Example 9.6
⎡b11 b12 ⎤
⎡a 11 a 12 a 13 ⎤ ⎢ ⎥
If A = ⎢ ⎥ and B = ⎢b 21 b 22 ⎥
⎣a 21 a 22 a 23 ⎦ ⎢⎣b 31 b 32 ⎥⎦
⎡b11 b12 ⎤
⎡a 11 a 12 a 13 ⎤ ⎢ ⎥
Then AB = ⎢ ⎥ ⎢b 21 b 22 ⎥
⎣a 21 a 22 a 23 ⎦ ⎢⎣b 31 b 32 ⎥⎦
⎡c11 c12 ⎤
Let, AB = ⎢ ⎥
⎣c 21 c 22 ⎦
where c11 = (1st row of A) × (1st column of B)
= a11b11 +a12 b21 +a13b31
c12 = (1st row of A) × (2nd column of B)
132
Matrices
⎡(2 × 4) + (3 × 2) (2 × 5) + (3 × 3)⎤
= ⎢ ⎥
⎣ (5 × 4) + (1 × 2) (5 × 5) + (1 × 3) ⎦
⎡ 8 + 6 10 + 9 ⎤
= ⎢ ⎥
⎣20 + 2 25 + 3⎦
⎡14 19 ⎤
= ⎢ ⎥
⎣22 28⎦
Properties of matrix multiplication
• Matrix multiplication is associative i.e., when A,B,C are three matrices of order
m × n, n × p, p × q respectively, A.(B.C) = (A.B).C
• Matrix multiplication is distributive with respect to operation “+”, i.e. when A, B,
C are three matrices of the order m × n, n × p, n × q then,
A. (B+C) = A.B+A.C
• The matrix multiplication is not always commutative. i.e. AB ≠ BA.
DETERMINANT OF A MATRIX
Determinant is a value of a matrix and is denoted by
det A or A , and is written as
a11,a12,.....a 1n
a 21,a 22,.....a 2 n
det A = ............
............
a n1,a n 2,.....a nn
133
Foundations of Quantitative Methods
i.e. A = 10
Minors of an Element
For a determinant, the minor Mij of an element aij can be defined as the value of the
determinant obtained by leaving the row and the column passing through the element
aij (i.e. deleting the ith row and jth column of the determinant). The second order
determinant thus obtained (in case of 3×3 matrix) is called the minor of that element
aij and it is denoted by Mij.
a 1 b1 c1
For example, consider Δ = a 2 b 2 c 2
a 3 b3 c3
Then the minor of the element a1 is calculated by deleting the row and column in
which it is present (i.e. first row and first column of the above determinant). Then the
minor M11 of a1 is given by
b2 c2
M11 = = (b2 c3 - b3 c2)
b3 c3
Solution:
The minor of an element is calculated by deleting the row and column in which it is
present.
1 2
∴ M23 = = 8-14 = -6
7 8
134
Matrices
Co-factor of an Element
The minor Mij multiplied by (-1)i+j is called the co-factor of the element aij.
It is denoted as
Co-factor of aij = Aij = (-1)i+j . Mij
a11 a12 a13
If Δ = a 21 a 22 a 23
a 31 a 32 a 33
The co-factor of
a 22 a 23
a11 = A11 = (-1)1+1 = M11
a 32 a 33
a 21 a 23
a12 = A12 = (-1)1+2 = –M12
a 31 a 33
Solution:
56
Co-factor of element 1 = (-1)1+1 = 45 - 48 = -3
89
46
Co - factor of element 2 = (-1)1+2 = -1(36 - 42) = 6
79
45
Co - factor of element 3 = (-1)1+3 = 32 - 35 = -3
78
NOTE: For a given determinant the sum of the products of the elements of any row or
column with their corresponding co-factors gives the Value of the determinant.
135
Foundations of Quantitative Methods
Determinant of Order n:
A determinant of order ‘n’ has n rows and n columns i.e. n×n elements. A determinant
of order n is a square array of n×n numbers enclosed between vertical bars.
a 11 , a 12 ,.....a 1n
a 21 , a 22 ,.....a 2 n
Δ= ............
............
a n1 , a n 2 ,.....a nn
PROPERTIES OF DETERMINANTS
• When any 2 rows or columns of determinant are interchanged, the value of the
determinant is multiplied by (-1).
C1 ⇔ C3
a1 b1 c1 c1 b1 a1
i.e., a 2 b 2 c 2 = (-1) c 2 b 2 a 2
a 3 b3 c3 c3 b 3 a 3
• When all the elements of one row or one column of a determinant are multiplied
by the same number ‘k’ the value of the new determinant is k times the value of
the given determinant.
ka1 b1 c1 a1 b1 c1
i.e., ka 2 b 2 c 2 = k a 2 b 2 c 2
ka 3 b 3 c 3 a 3 b3 c3
In the above determinant all the elements of the 1st column are multiplied by k.
• If two rows or two columns of a determinant are identical then the value of the
determinant is zero.
136
Matrices
a1 b1 a1
i.e. a 2 b 2 a 2 = 0
a 3 b3 a 3
Since column one is identical to column three.
• In a determinant the sum of the products of the elements of any row or column
with the co-factors of any other row or column elements is zero.
a 1 b1 c1
Let Δ = a 2 b2 c2
a 3 b3 c3
Let A1, B1, C1, A2, B2, C2, A3, B3 and C3, are the co-factors of the elements a1, b1, c1, a2
in Δ .
...c3
a 1 + mb1 b1 c1 a 1 b1 c1
Then a 2 + mb 2 b 2 c 2 = a 2 b 2 c 2
a 3 + mb 3 b 3 c 3 a 3 b3 c3
ADJOINT OF A MATRIX
When [aij]n×n is a square matrix of order n and Aij is the co-factor of aij. Then, the
adjoint of A, denoted by adj A is defined as adj A= [Aji]n×n, i.e., adj A is the transpose
of the matrix of corresponding co-factors of elements of det.A
137
Foundations of Quantitative Methods
T
⎡A11 A12 A13 ⎤ ⎡A11 A 21 A 31 ⎤
⎢ ⎥ ⎢ ⎥
Then adj A = ⎢A 21 A 22 A 23 ⎥ = ⎢A12 A 22 A 32 ⎥
⎢⎣A 31 A 32 A 33 ⎥⎦ ⎢⎣A13 A 23 A 33 ⎥⎦
A11, A12...........A33 are the co-factors of the corresponding elements a11, a12...........a33
Example 9.11
⎡a 11 a 12 a 13 ⎤ ⎡1 1 1 ⎤
⎢ ⎥
If A = ⎢a 21 a 22 a 23 ⎥ = ⎢⎢1 2 − 3⎥⎥ , find the adjoint of the matrix A.
⎢⎣a 31 a 32 a 33 ⎥⎦ ⎢⎣2 − 1 3 ⎥⎦
Solution:
In the above example
a11 = 1, a12 = 1, a13 = 1, a21 = 1, a22 = 2...........a33 = 3
And co-factors are A11 = 3, A12 = 9, A13 = –5...A33 = 1
T
⎡ 3 − 9 − 5⎤ ⎡ 3 − 4 − 5⎤
Hence, adj A = ⎢⎢− 4 1 3 ⎥⎥ = ⎢⎢− 9 1 − 4⎥⎥
⎢⎣ − 5 − 4 1 ⎥⎦ ⎢⎣ − 5 3 1 ⎥⎦
INVERSE OF A MATRIX
When A is any n-rowed, non-zero square matrix, then a matrix B exists such that
A.B = B.A = In
Then, B is called the inverse of matrix A
Formula for finding A-1
1
A-1 = (adj A)
A
Example 9.12
⎡0 1 2 ⎤
For A = ⎢⎢1 2 3⎥⎥ Find A-1
⎢⎣3 1 1 ⎥⎦
A = 0 – 1(1–9) + 2 (1–6) = 8 – 10 = –2
∴ A ≠0
Since A ≠ 0, A inverse exists (A-1 exists)
Finding adj.A,
138
Matrices
T
⎡A11 A12 A13 ⎤
⎢ ⎥
Then Adj A = ⎢A 21 A 22 A 23 ⎥
⎢ ⎥
⎣⎢A 31 A 32 A 33 ⎦⎥
T
⎡ − 1 8 − 5⎤ ⎡ − 1 1 − 1⎤
= ⎢⎢ 1 − 6 3 ⎥⎥ = ⎢⎢ 8 − 6 2 ⎥⎥
⎢⎣ − 1 2 − 1 ⎥⎦ ⎢⎣ − 5 3 − 1⎥⎦
1
A-1 = (adj A)
A
⎡ − 1 1 − 1⎤
−1 ⎢
= 8 − 6 2 ⎥⎥
2 ⎢
⎢⎣ − 5 3 − 1⎥⎦
⎡1 1 1 ⎤
⎢2 − 2 2 ⎥
⎢ ⎥
= ⎢− 4 3 − 1 ⎥
⎢5 3 1⎥
⎢ − ⎥
⎣2 2 2⎦
Rank of a Matrix
The number ‘r’ is said to be a rank of a matrix A if it possesses the following two
properties.
1. There is at least one sub matrix of A of order ‘r’ whose determinant is not equal
to zero.
2. If the matrix A contains any square sub matrix of order r+1 then the determinant
of every square sub matrix of order r+1 should be zero (i.e. the rank of a matrix is
the order of any highest order non vanishing minor of the matrix.)
139
Foundations of Quantitative Methods
Linear equations can be solved by using matrix methods like: matrix inversion
method, Cramer’s Rule and Gauss - Jordan Elimination methods. However, before
looking into these methods, one should be conversant with how to present the linear
equations in matrix notation.
x1
x2
X = ..... and,
.....
xn
b1
b2
B = ...
...
bn
140
Matrices
Cramer’s Rule
As described earlier, determinants are useful in finding the inverse of a non-singular
matrix. Now, let us see its use in solving linear systems whose matrix coefficient (A)
is non-singular (or invertible).
For example, consider a linear system in matrix form,
A X = B (with two unknowns)
Where, A is the matrix coefficient
B is the non-homogeneous term and
X, the unknown column-matrix
Consider the system,
a1x +b1y = c1............. Equation (i)
a2x +b2y = c2.............. Equation (ii)
Multiplying (i) by b2 and (ii) by b1and subtracting,
We get (a1b2 - a2b1) x = (b2c1 – b1c2)
(b 2 c1 − b1c 2 )
∴x =
(a 1b 2 − a 2 b1 )
c1 b1
c 2 b2
=
a 1 b1
a 2 b2
Now multiply equation (i) by a2 and equation (ii) by a1 and subtracting thus obtained
equations we get (a2b1-a1b2)y = (a2c1 - a1c2)
c1 a 1
(a 2 c1 − a 1c 2 ) c2 a 2
∴y = = −
(a 2 b1 − a1b 2 ) a 1 b1
a 2 b2
a1 b1 c1 b1 c1 a1
Let D = ; D1 = ; D2 =
a 2 b2 c 2 b2 c2 a2
D1 D2
Then, x = and y= −
D D
This is called as Cramer’s rule.
The values of D, D1, and D2 can also be obtained as follows:
a1 b1
D = det.A = i.e. det of coefficients of x and y
a 2 b2
⎛ a1 ⎞
D1 is obtained from A by replacing the first column ⎜ ⎟
⎜a ⎟
⎝ 2⎠
141
Foundations of Quantitative Methods
⎛ c1 ⎞
by column B i.e., ⎜ ⎟
⎜c ⎟
⎝ 2⎠
c1 b1
we get D1 =
c 2 b2
⎛ b1 ⎞ ⎛ c1 ⎞
i.e. ⎜ ⎟ ⎜ ⎟ and taking –ve of it.
⎜ b ⎟ by column B i.e. ⎜c ⎟
⎝ 2⎠ ⎝ 2⎠
a 1 c1
we get D2 =
a2 c2
Example 9.13
Solve the linear system using Cramer’s rule,
7x -5y = 20
2x – y = 7
Solution:
Writing the above system in matrix form A X = B
⎡7 − 5⎤ ⎡ x ⎤ ⎡20⎤
⎢ 2 − 1 ⎥ ⎢ ⎥ = ⎢7 ⎥
⎣ ⎦ ⎣ y⎦ ⎣ ⎦
Using Cramer’s rule, we have.
⎡ 7 − 5⎤
D= ⎢ ⎥ = - 7 - (-10) = 3
⎣2 − 1 ⎦
⎡20 − 5⎤
D1 = ⎢ ⎥ = - 20 – (- 35) = 15
⎣7 − 1 ⎦
⎡20 7 ⎤
D2 = ⎢ ⎥ = 40 – 49 = -9
⎣7 2 ⎦
D1 15
x= = =5
D 3
D 9
y= − 2 = =3
D 3
Cramer’s Rule: (Three unknowns or 3×3 matrix)
Let the system of equations be
a1X +b1Y+c1Z =d1
a2X +b2Y+c2Z =d2
a3X +b3Y+c3Z =d3
⎡ a1 b1 c1 ⎤ ⎡ d1 ⎤ ⎡X ⎤
⎢ ⎥ ⎢ ⎥
A = ⎢ a 2 b2 c 2 ⎥ B = ⎢ d 2 ⎥ X= ⎢⎢Y ⎥
⎥
⎢a b c ⎥ ⎢d ⎥ ⎢⎣ Z ⎥⎦
⎣ 3 3 3⎦ ⎣ 3 ⎦
142
Matrices
a1 b1 c1
Where D = a 2 b2 c 2 ≠0
a 3 b3 c 3
d1 b1 c1
Then D1 = d 2 b 2 c 2
d 3 b3 c 3
1 −2 3 −2 3 1
=5 – (–2) +3
−4 5 2 5 2 −4
= 5(5 – 8) + 2 (15– (– 4)) + 3(-12 – 2)
= 5(– 3) + 2(19) + 3 (–14)
= –15 + 38 – 42 = –19
D = –19
−1 −2 3
D1 =
25 1 −2
− 29 − 4 5
−2 3
= –1 1 −2 – 25 – 29 − 2 3
−4 5 −4 5 1 −2
143
Foundations of Quantitative Methods
3 −2 5 3 5 3
= – (–1) + 25 –(–29)
2 5 2 5 3 −2
3 1 5 −2 5 −2
= –1 – 25 – 29
2 −4 2 −4 3 1
144
Matrices
Solution:
The given system of equations can be expressed as,
AX = B
⎡1 1 1 ⎤ ⎡x⎤ ⎡6 ⎤
Where A = ⎢1 − 1 1 ⎥ , X = ⎢ y ⎥ , B = ⎢2⎥
⎢ ⎥ ⎢ ⎥ ⎢ ⎥
⎢⎣ 2 − 1 3 ⎥⎦ ⎢⎣ z ⎥⎦ ⎢⎣ 9 ⎥⎦
Now, A = – 2 ≠ 0
∴ A-1 exists and
A-1 = 1 (adj A)
A
⎡− 2 − 4 2 ⎤
= −1 ⎢ −1 1 0 ⎥⎥
2 ⎢
⎢⎣ 1 3 − 2 ⎥⎦
Then AX = B
⎡− 2 − 4 2 ⎤ ⎡6⎤
⇒ X = A B = − 1 ⎢⎢ − 1
-1
1 0 ⎥⎥ ⎢2⎥
⎢ ⎥
2 ⎢
⎣ 1 3 − 2 ⎥⎦ ⎢⎣ 9 ⎥⎦
⎡x ⎤
⎡− 2⎤ ⎡1 ⎤
⇒ ⎢⎢ y ⎥⎥ = − 1 ⎢− 4 ⎥⎥ = ⎢⎢ 2 ⎥⎥
2 ⎢
⎢⎣ z ⎥⎦ ⎢⎣ − 6 ⎥⎦ ⎢⎣ 3 ⎥⎦
⇒ x = 1, y = 2, and z = 3
Gauss - Jordan Elimination Method
The procedure for solving linear equations using Gauss- Jordan Elimination method is
as follows:
1. Write the system of equations as an augmented matrix.
2. Identify the left most column that is not all zeros.
3. If the top element in the left most column is zero interchange the top row to bring
a non-zero element in that column.
4. If that non-zero element is ‘a’, then multiply top row by 1/a to get a leading 1 in
that row.
5. Add multiples of this row to other rows so that all other rows have a 0 in this
column.
6. Cover (ignoring) the top row and go back to the second step, considering the rows
below this one (until step 5).
Continue this process until the matrix is reduced to a Unit matrix.
The following example illustrates the procedure to solve system of equations using
Gauss - Jordan Elimination Method.
Example 9.16
Find the values of x, y, and z by solving the following system of equations.
145
Foundations of Quantitative Methods
2y - 3z = 2
2x + z = 3
x - y + 3z = 1
Solution:
Step 1: Write the system of equations as an augmented matrix
⎡0 2 − 3 2⎤
⎢2 0 1 3⎥⎥
⎢
⎢⎣1 − 1 3 1 ⎥⎦
Step 2: Interchange the first and third row to make the top left element non- zero.
⎡1 − 1 3 1 ⎤
⎢2 0 1 3⎥⎥
⎢
⎢⎣0 2 − 3 2⎥⎦
Step 3: Add -2 times of first row to second row to get 0 at the second element of the
first column.
⎡1 − 1 3 1⎤
⎢0 2 − 5 1 ⎥
⎢ ⎥
⎢⎣0 2 − 3 2⎥⎦
Step 5: Add second row to first; and add -2 times second row to third to get 0’s in the
second column
⎡1 0 1 / 2 3 / 2⎤
⎢0 1 − 5 / 2 1 / 2 ⎥
⎢ ⎥
⎢⎣0 0 2 1 ⎥⎦
Step 6: Divide row 3 by 2 to get a leading 1
⎡1 0 1 / 2 3 / 2⎤
⎢0 1 − 5 / 2 1 / 2 ⎥
⎢ ⎥
⎢⎣0 0 1 1 / 2 ⎥⎦
Step 7: Add -1/2 times of row 3 to row 1 and add 5/2 times row 3 to row 2 to get 0’s
in the third column.
⎡1 0 0 5 / 4 ⎤
⎢0 1 0 7 / 4 ⎥
⎢ ⎥
⎢⎣0 0 1 1 / 2 ⎥⎦
146
Matrices
y = 7/4
z = 1/2
Now let us check whether these values satisfy the initial equations.
2y - 3z = 2(7/4) - 3(1/2) = (7/2) – (3/2) = 4/2 = 2
2x + z = 2(5/4) + (1/2) = (5/2) + (1/2) = 6/2 = 3
x - y + 3z = (5/4) - (7/4) + 3(1/2) = -(2/4) + (3/2) = 2/2 = 1
The values of x, y, z calculated using Gauss - Jordan elimination method satisfy the
equations in the example.
Thus, a manager can use Cramer’s rule, matrix inversion method or Gauss - Jordan
elimination method to solve a system of equations.
SUMMARY
147
Chapter 10
Interpolation and
Extrapolation
In this chapter we will discuss:
• Assumptions of Interpolation and Extrapolation
• Applications of Interpolation and Extrapolation
• Methods of Interpolation and Extrapolation
• Applications in Financial Analysis
Interpolation and Extrapolation
Interpolation is a method of statistical estimation and the word literally means `making
insertions’. Let us consider a well-known situation which requires the use of
interpolation. Assume the census is conducted once in ten years. So, typically the exact
head count of the population of the country is available only for years 1921, 1931,
1941, 1951, 1961, 1971, 1981 and 1991. If we need to know population of the country
for any intermediary year, say, 1985, one logical approach would be to work forwards
from the population of 1981, by adding births and inflow of people into the country
and deducting deaths and outflow of people from the country during the period 1981 to
1985. Though, this approach would give us an accurate population figure for 1985, the
required data on births, deaths, inflow and outflow of people may be difficult to gather
and the cost of compiling such data may exceed the benefit that may accrue from the
exercise. The population figure for the year 1985, may be required to correlate the
demand of a product with the population and the inferences to be drawn may not be
highly sensitive to minor variations in the population figure. That is, while the data on
population in 1985 is required, a one hundred percent accurate figure is really not
required.
Interpolation is a statistical technique, which through a study of the time series of
known figures of population, say from 1921 to 1991, allows us to make an insertion of
the population figure for the year 1985.
This is usually accomplished with a few steps of calculation and the figure so obtained
is fairly correct as long as the assumptions underlying interpolation hold good.
Suppose, the same firm by studying the past correlation of demand for its products
with the population wants to predict the demand for a future year, say, 1997, through a
forecast of the population figure. This is possible through the use of the technique
called extrapolation.
Extrapolation allows us to forecast or anticipate a value for some future date.
What is the basic requirement for both interpolation and extrapolation to work? There
must exist a functional relationship between an independent variable and a dependent
variable and it must be required to estimate the value of the dependent variable
corresponding to a certain value of independent variable. This functional relationship
is usually developed on the basis of past data using a branch of statistics called
‘Regression Analysis.’
In the situations considered above, population is taken to be a function of time or years.
While time is the independent variable, population is the dependent variable. One more
point that requires emphasis is that both interpolation and extrapolation provides us
with only estimates of the dependent variable.
Graphical Method
As the name suggests, this method arrives at the solution after plotting a graph using
the given data. While drawing the graph on a natural scale, the independent variables
are marked along the horizontal line and corresponding dependent variables are
marked along vertical line. The points so plotted are joined by a smooth curve. For
interpolation, the value of the independent variable is marked at the appropriate place
on the X-axis and a line perpendicular to the X-axis is to intersect the curve. Again by
drawing a line perpendicular to the Y-axis from the point of intersection on the curve
to meet the Y-axis, the value of the dependent variable is ascertained by reading off the
value at the point of intersection with the Y-axis. Similarly, graphical method can be
used for extrapolation. The illustration given below demonstrates the use of this
method.
Example 10.1
The population of a town is as follows:
Year 1941 1951 1961 1971 1981 1991
Population (in 000’s) 24 30 40 54 78 104
Find the population for the years 1976 and 1992.
100
Population in ‘000s
80
60
40
20
1941 51 61 71 81 91 Year
76 92
150
Interpolation and Extrapolation
Solution:
Marking points on the X-axis, for the years 1976 and 1992, and dropping
perpendiculars, the population for the years can be interpolated as 65,000 and 125,000
respectively.
Example 10.2
The sales in units of a consumer durable are ascertained as follows:
The records of sales for the year 1989 were accidentally lost in a fire. Find the sales for
the year 1989.
Solution:
The sales in this year could be interpolated by the following procedure:
1. The year immediately preceding 1989, is 1988. Hence 24,000 units sold in 1988 is
the base value.
2. In the illustration, since demand for the units is steadily increasing, we may
conclude that it is an ascending series.
3. Value corresponding to the immediately succeeding year (1990), 40,000.
Value corresponding to the immediately preceding year (1988), 24,000
Upper limit – Lower limit = 40,000 – 24,000= 16,000
4. In the above illustration, the time interval between 1988 and 1990 is 2 years.
5. Time interval between 1988 and 1989 is one year.
151
Foundations of Quantitative Methods
6. For the illustration, the sales for the year 1989 will be,
24000 + 40,000 - 24,000 ×1
2
= 24,000 + 8,000 = 32,000
Example 10.3
Solution:
152
Interpolation and Extrapolation
Example 10.5
The market price of Rs.1,000 face value debenture is Rs.800. The coupon interest rate
is 9% and the debenture matures at the end of 8 years at which time it will be redeemed
at the face value. What is the yield to maturity (YTM)?
(The concept of yield to maturity is very similar to the concept of internal rate of
return. YTM is the rate at which the inflows from holding the debenture when
discounted will exactly equal the current market price of the debenture.)
Solution:
The inflows from holding the debenture are,
Years 1 to 8 Interest Rs.90 per year.
End of 8th year Principal Rs.1000
153
Foundations of Quantitative Methods
When the descent is (807.91 – 768.51), the increase in rate is 1. For a descent of
(807.91 – 800), the increase in rate should be,
807 . 91 − 800
×1
807 . 91 − 768 . 51
The YTM of the bond = 13 + 807 . 91 − 800
× 1 = 13 + 0.20 = 13.2%
807 . 91 − 768 . 51
Example 10.6
A finance company offers to pay a lump sum of Rs.16, 000 at the end of 6 years to
investors who deposit Rs.2, 000 annually for 6 years. What is the implicit interest rate?
Solution:
We know that 16,000 = 2000 × FVIFA (k, 6)
Where FVIFA is the future value interest factor of an annuity, and k is the implicit rate.
FVIFA (k, 6) = 16,000/2000 = 8
If we refer to the FVIFA tables,
at k = 11%, FVIFA(11,6) = 7.913 and
at k = 12%, FVIFA(12,6) = 8.115
So, k must be greater than 11% but lower than 12%.
For an ascent of (8.115 – 7.913), the ascent in rate is 1. For a required ascent of (8.000
– 7.913), the ascent in rate is
8 . 000 − 7 . 913
×1
8 . 115 − 7 . 913
The implicit rate of interest = 11 + 8 . 000 − 7 . 913 × 1 = 11 + 0.43 = 11.43%
8 . 115 − 7 . 913
154
Interpolation and Extrapolation
SUMMARY
Interpolation and extrapolation are statistical methods of estimation and forecasting.
Interpolation is a statistical technique, which through a study of the time series of
known figures of population allows us to make a data insertion between a given data
set. On the other hand, extrapolation allows us to forecast or anticipate a value for
some future date.
While interpolation helps in completing the incomplete, lost or destroyed records,
extrapolation is helpful in those circumstances where forecasting and prediction are
required. Businessmen, administrators, sociologists, economists and financial analysts
widely use interpolation and extrapolation. The two methods of interpolation are -
graphical method and the linear approximation method.
155
Appendix
Table 1: Areas under the Standard Normal Distribution between the Mean and
Positive Values of z
a 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09
0.0 0.0000 0.0040 0.0080 0.0120 0.0160 0.0199 0.0239 0.0279 0.0319 0.0359
0.1 0.0398 0.0438 0.0478 0.0517 0.0557 0.0596 0.0636 0.0675 0.0714 0.0753
0.2 0.0793 0.0832 0.0871 0.0910 0.0948 0.0987 0.1026 0.1064 0.1103 0.1141
0.3 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443 0.1480 0.1517
0.4 0.1554 0.1591 0.1628 0.1664 0.1700 0.1736 0.1772 0.1808 0.1844 0.1879
0.5 0.1915 0.1950 0.1985 0.2019 0.2054 0.2088 0.2123 0.2157 0.2190 0.2224
0.6 0.2257 0.2291 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486 0.2517 0.2549
0.7 0.2580 0.2611 0.2642 0.2673 0.2704 0.2734 0.2764 0.2794 0.2823 0.2852
0.8 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078 0.3106 0.3133
0.9 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.3340 0.3365 0.3389
1.0 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577 0.3599 0.3621
1.1 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790 0.3810 0.3830
1.2 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.3980 0.3997 0.4015
1.3 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147 0.4162 0.4177
1.4 0.4192 0.4207 0.4222 0.4236 0.4251 0.4265 0.4279 0.4292 0.4306 0.4319
1.5 0.4332 0.4345 0.4357 0.4370 0.4382 0.4394 0.4406 0.4418 0.4429 0.4441
1.6 0.4452 0.4463 0.4474 0.4484 0.4495 0.4505 0.4515 0.4525 0.4535 0.4545
1.7 0.4554 0.4564 0.4573 0.4582 0.4591 0.4599 0.4608 0.4616 0.4625 0.4633
1.8 0.4641 0.4649 0.4656 0.4664 0.4671 0.4678 0.4686 0.4693 0.4699 0.4706
1.9 0.4713 0.4719 0.4726 0.4732 0.4738 0.4744 0.4750 0.4756 0.4761 0.4767
2.0 0.4772 0.4778 0.4783 0.4788 0.4793 0.4798 0.4803 0.4808 0.4812 0.4817
2.1 0.4821 0.4826 0.4830 0.4834 0.4838 0.4842 0.4846 0.4850 0.4854 0.4857
2.2 0.4861 0.4864 0.4868 0.4871 0.4875 0.4878 0.4881 0.4884 0.4887 0.4890
2.3 0.4893 0.4896 0.4898 0.4901 0.4904 0.4906 0.4909 0.4911 0.4913 0.4916
2.4 0.4918 0.4920 0.4922 0.4925 0.4927 0.4929 0.4931 0.4932 0.4934 0.4936
2.5 0.4938 0.4940 0.4941 0.4943 0.4945 0.4946 0.4948 0.4949 0.4951 0.4952
2.6 0.4953 0.4955 0.4956 0.4957 0.4959 0.4960 0.4961 0.4962 0.4963 0.4964
2.7 0.4965 0.4966 0.4967 0.4968 0.4969 0.4970 0.4971 0.4972 0.4973 0.4974
2.8 0.4974 0.4975 0.4976 0.4977 0.4977 0.4978 0.4979 0.4979 0.4980 0.4981
2.9 0.4981 0.4982 0.4982 0.4983 0.4984 0.4984 0.4985 0.4985 0.4986 0.4986
3.0 0.4987 0.4987 0.4987 0.4988 0.4988 0.4989 0.4989 0.4989 0.4990 0.4990