Quanti Management
Author: P N Mishra & S Jaisankar Copyright 2007, Bharathiar University All Rights Reserved Edited by: Dr. Subodh Kesharwani Produced and Printed by EXCEL BOOKS PRIVATE LIMITED A-45, Naraina, Phase-I, New Delhi-110028 for SCHOOL OF DISTANCE EDUCATION Bharathiar University Coimbatore-641046
CONTENTS

Unit-I
Lesson 1 Quantitative Techniques Introduction
Lesson 2 Measures of Central Tendency
Lesson 3 Mathematical Model
Lesson 4 Linear Programming: Graphical Method
Lesson 5 Linear Programming: Simplex Method

Unit-II
Lesson 6 Transportation Model
Lesson 7 Assignment Model

Unit-III
Lesson 8 Network Model
Lesson 9 Waiting Model (Queuing Theory)

Unit-IV
Lesson 10 Probability
Lesson 11 Theoretical Probability Distributions
Lesson 12 Probability Distribution of a Random Variable

Unit-V
Lesson 13 Inventory Model
Lesson 14 Game Theory
Lesson 15 Simulation
Subject Description: This course presents the various mathematical models, networking, probability, inventory models and simulations for managerial decisions.

Goals: To enable the students to learn the techniques of operations research and resource management and their application in managerial decision-making.

Objectives: On successful completion of the course the students should have:
1. Understood the basics of the quantitative techniques.
2. Learnt the feasible solution and optimum solution for resource management.
3. Learnt the time estimation and critical path for a project.
4. Learnt about the application of probability techniques in decision-making.
5. Learnt the various inventory models and simulations in resource planning and management.

UNIT I: QT Introduction; Measures of Central Tendency: Mean, Median, Mode. Mathematical Models: deterministic and probabilistic; simple business examples; OR and optimization models. Linear Programming: formulation, graphical solution, simplex solution.
UNIT II: Transportation model: Initial Basic Feasible solutions; optimum solution for non-degeneracy and degeneracy models; Trans-shipment Model; Assignment Model; Travelling Salesman problem.
UNIT III: Network Model: networking; CPM; critical path; time estimates; critical path crashing; resource levelling; resources planning. Waiting Line Model: structure of model; M/M/1 for infinite population.
UNIT IV: Probability: definitions; addition and multiplication rules (only statements); simple business application problems; probability distribution; expected value concept; theoretical probability distributions: Binomial, Poisson and Normal; simple problems applied to business.
UNIT V: Inventory Models: Deterministic EOQ; EOQ with Price Breaks; Probabilistic Inventory Models; Probabilistic EOQ model. Game theory: zero sum games; Arithmetic and Graphical Method. Simulation: types of simulation; Monte Carlo simulation; simulation problems. Decision Theory: pay off tables; decision criteria; decision trees.
Unit-I
LESSON 1
QUANTITATIVE TECHNIQUES INTRODUCTION
CONTENTS
1.0 Aims and Objectives
1.1 Introduction
1.2 Historical Development
1.3 About Quantitative Technique
1.4 Methodology of Quantitative Techniques
    1.4.1 Formulating the Problem
    1.4.2 Defining the Decision Variables and Constraints
    1.4.3 Developing a Suitable Model
    1.4.4 Acquiring the Input Data
    1.4.5 Solving the Model
    1.4.6 Validating the Model
    1.4.7 Implementing the Results
1.5 Advantages of Mathematical Modelling
1.6 Scope of Quantitative Technique
1.7 Statistics : An Introduction
    1.7.1 Origin and Growth of Statistics
    1.7.2 Meaning and Definition of Statistics
    1.7.3 Statistics as Data
    1.7.4 Statistics as a Science
    1.7.5 Statistics as a Science different from Natural Sciences
    1.7.6 Statistics as a Scientific Method
    1.7.7 Statistics as a Science or an Art
1.8 Let us Sum Up
1.9 Lesson-end Activities
1.10 Keywords
1.11 Questions for Discussion
1.12 Terminal Questions
1.13 Model Answers to Questions for Discussion
1.14 Suggested Readings
1.1 INTRODUCTION
Scientific methods have been man's outstanding asset in pursuing a wide range of activities. Whenever a national crisis emerges due to the impact of political, social, economic or cultural factors, talents from all walks of life come together to overcome the situation and rectify the problem. In this chapter we will see how quantitative techniques have helped organizations solve complex problems on time and with greater accuracy. The historical development shows how these techniques came to support managerial decision-making and resource allocation. The methodology helps us in studying the scientific methods with respect to phenomena connected with human behaviour: formulating the problem, defining the decision variables and constraints, developing a suitable model, acquiring the input data, solving the model, validating the model and implementing the results. The major advantage of a mathematical model is that it facilitates taking decisions faster and more accurately.

Managerial activities have become complex and it is necessary to make the right decisions to avoid heavy losses. Whether it is a manufacturing unit or a service organization, the resources have to be utilized to their maximum in an efficient manner. The future is clouded with uncertainty and is fast changing, and decision-making, a crucial activity, cannot be done on a trial-and-error basis or by using a rule-of-thumb approach. In such situations, there is a greater need for applying scientific methods to decision-making to increase the probability of coming up with good decisions. Quantitative Technique is a scientific approach to managerial decision-making. The successful use of Quantitative Technique for management would help the organization in solving complex problems on time, with greater accuracy and in the most economical way.

Today, several scientific management techniques are available to solve managerial problems. The use of these techniques helps managers become explicit about their objectives and provides additional information for selecting an optimal decision. This study material presents a variety of these techniques together with real-life problem areas.
Explain, with the help of examples, some of the important Quantitative Techniques used in modern business and in industrial units.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Do you think the day will come when all decisions in a business unit are made with the assistance of quantitative techniques? Give reasons for your answer.
Figure 1.1
Problem : How many units of x1, x2 and x3 are to be manufactured
Decision variables : x1, x2 and x3
Objective : To maximize profit
Constraint : Machine hours
Linear Programming Model
Integer Programming
Sensitivity Analysis
Goal Programming
Dynamic Programming
Non Linear Programming
Queuing Theory
Inventory Management Techniques
PERT/CPM (Network Analysis)
Decision Theory
Game Theory
Transportation and Assignment Models
Quantitative Technique is a very powerful tool and analytical process that offers the presentation of an optimum solution in spite of its limitations. Discuss.
From the various definitions of Quantitative Technique it is clear that scientific management has got wide scope. In general, whenever there is any problem, simple or complicated, the scientific management technique can be applied to find the best solution. Under this head we shall try to find the scope of M.S. (management science) by seeing its application in various fields of everyday life; this includes defence operations too.

Finance and Accounting: Cash flow analysis, Capital budgeting, Dividend and Portfolio management, Financial planning.
Marketing Management: Selection of product mix, Sales resources allocation and Assignments.
Production Management: Facilities planning, Manufacturing, Aggregate planning, Inventory control, Quality control, Work scheduling, Job sequencing, Maintenance and Project planning and scheduling.
Personnel Management: Manpower planning, Resource allocation, Staffing, Scheduling of training programmes.
General Management: Decision Support Systems and Management Information Systems (MIS), Organizational design and control, Software Process Management and Knowledge Management.
Check Your Progress 1.4
Discuss the significance and scope of Quantitative Techniques in modern business management.
Among the noteworthy Indian scholars who contributed to statistics are P.C. Mahalanobis, V.K.R.V. Rao, R.C. Desai, P.V. Sukhatme, etc.
(iii) Numerical facts should be capable of being arranged in relation to each other.

On the basis of the above features we can say that data are those numerical facts which have been expressed as a set of numerical figures related to each other and to some area of enquiry or research. We may, however, note here that all the characteristics of data are not covered by the above definition.

2. "By statistics we mean quantitative data affected to a marked extent by multiplicity of causes." - Yule & Kendall
This definition covers two aspects, i.e., the data are quantitative and affected by a large number of causes.

3. "Statistics are classified facts respecting the conditions of the people in a state, especially those facts which can be stated in numbers or in tables of numbers or in any other tabular or classified arrangement." - Webster

4. "A collection of noteworthy facts concerning state, both historical and descriptive." - Achenwall
Definitions 3 and 4, given above, are not comprehensive because they confine the scope of statistics only to facts and figures related to the conditions of the people in a state. However, as data are now collected on almost all aspects of human and natural activities, statistics cannot be regarded as state-craft only.

5. "Statistics are measurements, enumerations or estimates of natural or social phenomena, systematically arranged, so as to exhibit their interrelations." - L.R. Connor
This definition also covers only some but not all characteristics of data.

6. "By statistics we mean aggregate of facts affected to a marked extent by a multiplicity of causes, numerically expressed, enumerated or estimated according to a reasonable standard of accuracy, collected in a systematic manner for a predetermined purpose and placed in relation to each other." - H. Secrist
This definition can be taken as a comprehensive definition of statistics since most of the characteristics of statistics are covered by it.
We may note here that if the area of investigation is large or the cost of measurement is high, the statistics may also be collected by examining only a fraction of the total area of investigation.

When statistics are being obtained by measurement of units, it is necessary to maintain a reasonable degree or standard of accuracy in measurements. The degree of accuracy needed in an investigation depends upon its nature and objectivity on the one hand and upon time and resources on the other. For example, in weighing gold, even milligrams may be significant whereas, for weighing wheat, a few grams may not make much difference. Sometimes, a higher degree of accuracy is needed in order that the problem to be investigated gets highlighted by the data. Suppose the diameters of bolts produced by a machine are measured as 1.546 cms, 1.549 cms, 1.548 cms, etc. If, instead, we obtain measurements only up to two places after the decimal, all the measurements would be equal and as such nothing could be inferred about the working of the machine. In addition to this, the degree of accuracy also depends upon the availability of time and resources. For any investigation, a greater degree of accuracy can be achieved by devoting more time or resources or both. As will be discussed later, in statistics, generalisations about a large group (known as population) are often made on the basis of a small group (known as sample). It is possible to achieve this by maintaining a reasonable degree of accuracy of measurements. Therefore, it is not necessary to always have a high degree of accuracy, but whatever degree of accuracy is once decided must be uniformly maintained throughout the investigation.

5. Statistics are collected in a systematic manner and for a predetermined purpose: In order that the results obtained from statistics are free from errors, it is necessary that these should be collected in a systematic manner.
Haphazardly collected figures are not desirable as they may lead to wrong conclusions. Moreover, statistics should be collected for a well defined and specific objective, otherwise it might happen that the unnecessary statistics are collected while the necessary statistics are left out. Hence, a given set of numerical figures cannot be termed as statistics if it has been collected in a haphazard manner and without proper specification of the objective.

6. Statistics should be capable of being placed in relation to each other: This characteristic requires that the collected statistics should be comparable with reference to time or place or any other condition. In order that statistics are comparable it is essential that they are homogeneous and pertain to the same investigation. This can be achieved by collecting data in identical manner for different periods or for different places or for different conditions.

Hence, any set of numerical facts possessing the above mentioned characteristics can be termed as statistics or data.

Example 1: Would you regard the following information as statistics? Explain by giving reasons.
(i) The height of a person is 160 cms.
(ii) The height of Ram is 165 cms and of Shyam is 155 cms.
(iii) Ram is taller than Shyam.
(iv) Ram is taller than Shyam by 10 cms.
(v) The height of Ram is 165 cms and weight of Shyam is 55 kgs.

Solution: Each of the above statements should be examined with reference to the following conditions:
(a) Whether information is presented as aggregate of numerical figures
(b) Whether numerical figures are homogeneous or comparable
(c) Whether numerical figures are affected by a multiplicity of factors
On examination of the given information in the light of these conditions we find that only the information given by statement (ii) can be regarded as statistics. It should be noted that condition (c) will be satisfied, almost invariably. In order to illustrate the circumstances in which this condition is not satisfied, we assume that a relation between quantity demanded and price of a commodity is given by the mathematical equation q = 100 - 10p and the quantity demanded at various prices, using this equation, is shown in the following table,
p :   1   2   3   4   5   6   7   8   9  10
q :  90  80  70  60  50  40  30  20  10   0
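The table can be regenerated directly from the equation q = 100 - 10p. The following Python sketch does this; the names `demand` and `schedule` are illustrative choices and do not come from the text.

```python
def demand(p):
    """Quantity demanded at price p, using the relation q = 100 - 10p."""
    return 100 - 10 * p

# Prices 1 to 10, as in the table.
prices = list(range(1, 11))
schedule = [(p, demand(p)) for p in prices]

for p, q in schedule:
    print(f"p = {p:2d}   q = {q:2d}")
```

Each value of q here is fixed entirely by p, so the printed figures show no variation arising from any other cause.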
The above information cannot be regarded as statistics because here quantity demanded is affected by only one factor, i.e., price and not by a multiplicity of factors. Contrary to this, the figures of quantity demanded obtained from a market at these very prices are to be regarded as statistics.

3. "Statistics is the science of measurement of social organism regarded as a whole in all its manifestations." - A.L. Bowley

4. "Statistics is the science of estimates and probabilities." - Boddington

All of the above definitions are incomplete in one sense or the other because each considers only one aspect of statistics. According to the first definition, statistics is the science of counting. However, we know that if the population or group under investigation is large, we do not count but obtain estimates. The second definition, viz. statistics is the science of averages, covers only one aspect, i.e., measures of average but, besides this, there are other measures used to describe a given set of data. The third definition limits the scope of statistics to social sciences only. Bowley himself realised this limitation and admitted that the scope of statistics is not confined to this area only. The fourth definition considers yet another aspect of statistics. Although the use of estimates and probabilities has become very popular in modern statistics, there are other techniques as well which are also very important.

The following definitions cover some more but not all aspects of statistics.

5. "The science of statistics is the method of judging collective, natural or social phenomena from the results obtained by the analysis or enumeration or collection of estimates." - W.I. King

6. "Statistics or statistical method may be defined as collection, presentation, analysis and interpretation of numerical data." - Croxton and Cowden
This is a simple and comprehensive definition of statistics which implies that statistics is a scientific method.

7. "Statistics is a science which deals with collection, classification and tabulation of numerical facts as the basis for the explanation, description and comparison of phenomena." - Lovitt

8. "Statistics is the science which deals with the methods of collecting, classifying, presenting, comparing and interpreting numerical data collected to throw some light on any sphere of enquiry." - Seligman
The definitions given by Lovitt and Seligman are similar to the definition of Croxton and Cowden except that they regard statistics as a science while Croxton and Cowden have termed it a scientific method.

With the development of the subject of statistics, the definitions given above have become outdated. In the last few decades the discipline of drawing conclusions and making decisions under uncertainty has grown, and it is proving to be very helpful to decision-makers, particularly in the field of business. Although various definitions have been given which include this aspect of statistics also, we shall now give a definition of statistics, given by Spiegel, to reflect this new dimension of statistics.

9. "Statistics is concerned with scientific method for collecting, organising, summarising, presenting and analysing data as well as drawing valid conclusions and making reasonable decisions on the basis of such analysis."
On the basis of the above definitions we can say that statistics, in singular sense, is a science which consists of various statistical methods that can be used for collection, classification, presentation and analysis of data relating to social, political, natural, economical, business or any other phenomena. The results of the analysis can be used further to draw valid conclusions and to make reasonable decisions in the face of uncertainty.
In view of the uses of statistics in almost all the disciplines of natural as well as social sciences, it will be more appropriate to regard it as a scientific method rather than a science. Statistics as a scientific method can be divided into the following two categories: (a) Theoretical Statistics and (b) Applied Statistics.

(a) Theoretical Statistics: Theoretical statistics can be further sub-divided into the following three categories:
(i) Descriptive Statistics: All those methods which are used for the collection, classification, tabulation, diagrammatic presentation of data and the methods of calculating average, dispersion, correlation and regression, index numbers, etc., are included in descriptive statistics.
(ii) Inductive Statistics: It includes all those methods which are used to make generalisations about a population on the basis of a sample. The techniques of forecasting are also included in inductive statistics.
(iii) Inferential Statistics: It includes all those methods which are used to test certain hypotheses regarding characteristics of a population.

(b) Applied Statistics: It consists of the application of statistical methods to practical problems. Design of sample surveys, techniques of quality control, decision-making in business, etc., are included in applied statistics.
2. Why is there a need for statistics? Indicate one instance of the application of statistics in your daily routine. How has the application of statistics brought about a paradigm shift?
1.10 KEYWORDS
Management science
Model
Analysis
Decision-making
Mathematical model
Algorithm
Problem
3. Explain the methodology adopted in solving problems with the help of a flow chart diagram.
4. What is a model? Explain with a suitable example.
5. What is meant by validation of a model?
6. Explain the advantages of modelling with the help of a short example.
7. Discuss the advantages and limitations of using results from a mathematical model to make decisions about operations.
8. What are the different types of models used in management science?
9. What are some of the opportunities in management science?
10. What is implementation and why is it important?
11. What are some of the sources of input data?
12. Briefly trace the history of management science.
13. What is the Quantitative Techniques process? Give several examples of this process.
14. Give a brief account of the origin and development of statistics.
15. Define statistics and discuss its relationship with natural and other sciences.
16. Distinguish between statistical methods and statistics. Discuss the scope and significance of the study of statistics.
17. Who gave the following definitions of statistics?
    (i) "Statistics is the science of counting." (Bowley, Boddington, King, Seligman)
    (ii) "Statistics is the science of estimates and probabilities." (Webster, Secrist, Boddington, Yule & Kendall)
    (iii) "The science of statistics is the method of judging collective, natural or social phenomena from the results obtained by the analysis or enumeration or collection of estimates." (Achenwall, Marshall, W.I. King, Croxton & Cowden)
18. "Statistics are numerical statements of facts, but all facts stated numerically are not statistics." Clarify this statement and point out briefly which numerical statements of facts are statistics.
19. Discuss briefly the utility of statistics in economic analysis and business.
20. Which of the following statements are true?
    (a) Statistics is helpful in administration.
    (b) Statistics is helpful in business.
    (c) Statistics is helpful in economic analysis.
    (d) Statistics is helpful in all of the above.
21. "Statistics are the straws out of which I, like other economists, have to make bricks." Discuss.
22. "Science without statistics bear no fruit, statistics without science have no roots." Explain the above statement.
23. It is usually said that statistics is both a science and an art. Do you agree with this statement? Discuss the scope of statistics.
24. Define Statistics and explain briefly the divisions of the science of statistics.
25. "Statistics is not a science, it is a scientific method." Discuss it critically and explain the scope of statistics.
26. Explain clearly the three meanings of the word 'Statistics' contained in the following statement: "You compute statistics from statistics by statistics." [Hint: Mean, standard deviation, etc., computed from a sample are also known as statistics.]
27. "Economics and statistics are twin sisters." Discuss.
28. Discuss the nature and scope of statistics. What are the fields of investigation and research where statistical methods and techniques can be usefully employed?
29. Explain the following statements:
    (a) Statistics is the science of counting.
    (b) Statistics is the science of estimates and probabilities.
    (c) Statistics is the science of averages.
30. Explain by giving reasons whether the following are data or not:
    (i) Arun is more intelligent than Avinash.
    (ii) Arun got 75% marks in B.Sc. and Avinash got 70% marks in B.Com.
    (iii) Arun was born on August 25, 1974.
    (iv) The consumption function of a community is C = 1,000 + 0.8Y, therefore, the levels of consumption for different levels of income are:

    Y :     0   1000   2000   4000   6000   8000
    C :  1000   1800   2600   4200   5800   7400

31. "Statistics are aggregates of facts, affected to a marked extent by a multiplicity of causes." Discuss the above statement and explain the main characteristics of statistics.
32. "Statistics are not merely heap of numbers." Explain.
33. Elucidate the following statement: "Not a datum, but data are the subject-matter of statistics."
1.13 MODEL ANSWERS TO QUESTIONS FOR DISCUSSION
(c) True
(d) True
(e) False
(b) False
(b) decision-making
LESSON 2
MEASURES OF CENTRAL TENDENCY
CONTENTS
2.0 Aims and Objectives
2.1 Introduction
2.2 Definition of Average
2.3 Functions and Characteristics of an Average
2.4 Various Measures of Average
2.5 Arithmetic Mean
2.6 Median
2.7 Other Partition or Positional Measures
2.8 Mode
2.9 Relation between Mean, Median and Mode
2.10 Geometric Mean
2.11 Harmonic Mean
2.12 Let us Sum Up
2.13 Lesson-end Activity
2.14 Keywords
2.15 Questions for Discussion
2.16 Terminal Questions
2.17 Model Answers to Questions for Discussion
2.18 Suggested Readings
2.1 INTRODUCTION
Summarisation of the data is a necessary function of any statistical analysis. As a first step in this direction, the huge mass of unwieldy data is summarised in the form of tables and frequency distributions. In order to bring the characteristics of the data into sharp focus, these tables and frequency distributions need to be summarised further. A measure of central tendency, or an average, is an essential and important summary measure in any statistical analysis. An average is a single value which can be taken as representative of the whole distribution.
Characteristics of a Good Average
A good measure of average must possess the following characteristics:
1. It should be rigidly defined, preferably by an algebraic formula, so that different persons obtain the same value for a given set of data.
2. It should be easy to compute.
3. It should be easy to understand.
4. It should be based on all the observations.
5. It should be capable of further algebraic treatment.
6. It should not be unduly affected by extreme observations.
7. It should not be much affected by the fluctuations of sampling.
(iii) Composite Average

The above measures of central tendency will be discussed in the order of their popularity. Out of these, the Arithmetic Mean, Median and Mode, being most popular, are discussed in that order.
abbreviated form as $\sum_{i=1}^{n} X_i$.

The subscript of X, i.e., 'i' is a positive integer, which indicates the serial number of the observation. Since there are n observations, variation in i will be from 1 to n. This is indicated by writing it below and above $\Sigma$, as written earlier. When there is no ambiguity in the range of summation, this indication can be skipped and we may simply write $X_1 + X_2 + \dots + X_n = \sum X_i$.

Arithmetic Mean is defined as the sum of observations divided by the number of observations. It can be computed in two ways: (i) simple arithmetic mean and (ii) weighted arithmetic mean. In case of simple arithmetic mean, equal importance is given to all the observations while in weighted arithmetic mean, the importance given to various observations is not the same.

Calculation of Simple Arithmetic Mean

(a) When Individual Observations are given.

Let there be n observations $X_1, X_2, \dots, X_n$. Their arithmetic mean can be calculated either by direct method or by short-cut method. The arithmetic mean of these observations will be denoted by $\bar{X}$.

Direct Method: Under this method, $\bar{X}$ is obtained by dividing the sum of observations by the number of observations, i.e.,

$$\bar{X} = \frac{\sum_{i=1}^{n} X_i}{n}$$

Short-cut Method: This method is used when the magnitude of individual observations is large. The use of the short-cut method is helpful in the simplification of calculation work. Let A be any assumed mean. We subtract A from every observation. The difference between an observation and A, i.e., $X_i - A$, is called the deviation of the i th observation from A and is denoted by $d_i$. Thus, we can write $d_1 = X_1 - A$, $d_2 = X_2 - A$, ....., $d_n = X_n - A$. On adding these deviations and dividing by n we get

$$\frac{\sum d_i}{n} = \frac{\sum (X_i - A)}{n} = \frac{\sum X_i - nA}{n} = \frac{\sum X_i}{n} - A$$

or $\bar{d} = \bar{X} - A$ (where $\bar{d} = \frac{\sum d_i}{n}$).

On rearranging, we get

$$\bar{X} = A + \bar{d} = A + \frac{\sum d_i}{n}$$

This result can be used for the calculation of $\bar{X}$.

Remarks: Theoretically we can select any value as assumed mean. However, for the purpose of simplification of calculation work, the selected value should be as near to the value of $\bar{X}$ as possible.
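Example 1: The following figures relate to monthly output of cloth of a factory in a given year:

Months                  : Jan  Feb  Mar  Apr  May  Jun  Jul  Aug  Sep  Oct  Nov  Dec
Output (in '000 metres) :  80   88   92   84   96   92   96  100   92   94   98   86

Calculate the average monthly output.

Solution:
(i) Using Direct Method

$$\bar{X} = \frac{80 + 88 + 92 + 84 + 96 + 92 + 96 + 100 + 92 + 94 + 98 + 86}{12} = \frac{1098}{12} = 91.5 \text{ thousand metres}$$

(ii) Using Short-cut Method
Let A = 90.

X_i           :  80   88   92   84   96   92   96  100   92   94   98   86
d_i = X_i - A : -10   -2    2   -6    6    2    6   10    2    4    8   -4     Total $\sum d_i$ = 18

$$\therefore \bar{X} = 90 + \frac{18}{12} = 90 + 1.5 = 91.5 \text{ thousand metres}$$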
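The two methods can be checked numerically on the monthly output figures of Example 1. The Python sketch below is only an illustration; the variable names are assumptions made for this example, and A = 90 is the assumed mean used in the solution.

```python
# Monthly output of cloth (in '000 metres), January to December (Example 1).
output = [80, 88, 92, 84, 96, 92, 96, 100, 92, 94, 98, 86]
n = len(output)

# Direct method: sum of observations divided by the number of observations.
mean_direct = sum(output) / n

# Short-cut method: add the average deviation from an assumed mean A.
A = 90
deviations = [x - A for x in output]
mean_shortcut = A + sum(deviations) / n

print("Sum of observations:", sum(output))
print("Sum of deviations  :", sum(deviations))
print("Mean (direct)      :", mean_direct)
print("Mean (short-cut)   :", mean_shortcut)
```

Both methods give the same mean whatever value of A is chosen; a value of A close to the mean simply keeps the deviations small.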
Let there be n values $X_1, X_2, \dots, X_n$ out of which $X_1$ has occurred $f_1$ times, $X_2$ has occurred $f_2$ times, ....., $X_n$ has occurred $f_n$ times. Let N be the total frequency, i.e., $N = \sum_{i=1}^{n} f_i$.

Values    : X_1  X_2  .....  X_n
Frequency : f_1  f_2  .....  f_n    Total = N

Direct Method: The arithmetic mean of these observations using direct method is given by

$$\bar{X} = \frac{\underbrace{X_1 + X_1 + \dots + X_1}_{f_1 \text{ times}} + \underbrace{X_2 + \dots + X_2}_{f_2 \text{ times}} + \dots + \underbrace{X_n + \dots + X_n}_{f_n \text{ times}}}{f_1 + f_2 + \dots + f_n}$$

Since $X_1 + X_1 + \dots + X_1$ added $f_1$ times can also be written as $f_1 X_1$, and similarly for the other observations, we have

$$\bar{X} = \frac{f_1 X_1 + f_2 X_2 + \dots + f_n X_n}{f_1 + f_2 + \dots + f_n} = \frac{\sum_{i=1}^{n} f_i X_i}{\sum_{i=1}^{n} f_i} = \frac{\sum_{i=1}^{n} f_i X_i}{N} \qquad .... (3)$$

Short-Cut Method: As before, we take the deviations of observations from an arbitrary value A. The deviation of the i th observation from A is $d_i = X_i - A$. Multiplying both sides by $f_i$ we have $f_i d_i = f_i (X_i - A)$. Taking the sum over all the observations,

$$\sum f_i d_i = \sum f_i (X_i - A) = \sum f_i X_i - A \sum f_i = \sum f_i X_i - A \cdot N$$

Dividing both sides by N, we get $\frac{\sum f_i d_i}{N} = \bar{X} - A$, or

$$\bar{X} = A + \frac{\sum f_i d_i}{N}$$
Example 2: The following is the frequency distribution of age of 670 students of a school. Compute the arithmetic mean of the data.
X (in years) :  5   6   7    8    9   10  11  12  13  14
Frequency    : 25  45  90  165  112   96  81  26  18  12

Solution:
Direct method: The computations are shown in the following table:

   X       f      fX
   5      25     125
   6      45     270
   7      90     630
   8     165    1320
   9     112    1008
  10      96     960
  11      81     891
  12      26     312
  13      18     234
  14      12     168
Total    670    5918

$$\bar{X} = \frac{\sum fX}{\sum f} = \frac{5918}{670} = 8.83 \text{ years}$$

Short-Cut Method: The computations are shown in the following table:

   X       f    d = X - 8     fd
   5      25       -3        -75
   6      45       -2        -90
   7      90       -1        -90
   8     165        0          0
   9     112        1        112
  10      96        2        192
  11      81        3        243
  12      26        4        104
  13      18        5         90
  14      12        6         72
Total    670                 558

$$\bar{X} = A + \frac{\sum fd}{N} = 8 + \frac{558}{670} = 8.83 \text{ years}$$
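The frequency-weighted calculation of Example 2 can be sketched in Python as follows. The function names are hypothetical, chosen only for this illustration, and A = 8 is the assumed mean used in the short-cut table.

```python
# Age distribution of 670 students (Example 2).
ages = [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
freqs = [25, 45, 90, 165, 112, 96, 81, 26, 18, 12]

def mean_direct(values, frequencies):
    """Direct method: sum of f*X divided by the total frequency N."""
    total_fx = sum(f * x for x, f in zip(values, frequencies))
    return total_fx / sum(frequencies)

def mean_shortcut(values, frequencies, assumed_mean):
    """Short-cut method: A plus (sum of f*d) / N, where d = X - A."""
    total_fd = sum(f * (x - assumed_mean) for x, f in zip(values, frequencies))
    return assumed_mean + total_fd / sum(frequencies)

direct = mean_direct(ages, freqs)
shortcut = mean_shortcut(ages, freqs, assumed_mean=8)

print("N               :", sum(freqs))
print("Mean (direct)   :", round(direct, 2), "years")
print("Mean (short-cut):", round(shortcut, 2), "years")
```

The short-cut version works with the small deviations d rather than the products fX, which is what makes hand computation easier.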
In a grouped frequency distribution, there are classes along with their respective frequencies. Let $l_i$ be the lower limit and $u_i$ be the upper limit of the i th class. Further, let the number of classes be n, so that i = 1, 2, ....., n. Also let $f_i$ be the frequency of the i th class. This distribution can be written in tabular form, as shown.

Class Intervals     Frequency (f)
l_1 - u_1           f_1
l_2 - u_2           f_2
  ...                ...
l_n - u_n           f_n
Total Frequency     $\sum f_i = N$

Note: Here $u_1$ may or may not be equal to $l_2$, i.e., the upper limit of a class may or may not be equal to the lower limit of its following class.

It may be recalled here that, in a grouped frequency distribution, we only know the number of observations in a particular class interval and not their individual magnitudes. Therefore, to calculate the mean, we have to make a fundamental assumption that the observations in a class are uniformly distributed. Under this assumption, the mid-value of a class will be equal to the mean of observations in that class and hence can be taken as their representative. Therefore, if $X_i$ is the mid-value of the i th class with frequency $f_i$, the above assumption implies that there are $f_i$ observations each with magnitude $X_i$ (i = 1 to n). Thus, the arithmetic mean of a grouped frequency distribution can also be calculated by the use of the formula given in 9.5.1(b).

Remarks: The accuracy of the arithmetic mean calculated for a grouped frequency distribution depends upon the validity of the fundamental assumption. This assumption is rarely met in practice. Therefore, we can only get an approximate value of the arithmetic mean of a grouped frequency distribution.
Solution: Here only the short-cut method will be used to calculate the arithmetic mean, but it can also be calculated by the direct method.

Class Intervals   Mid Values (X)   Frequency (f)   d = X - 35   fd
0-10              5                3               -30          -90
10-20             15               8               -20          -160
20-30             25               12              -10          -120
30-40             35               15              0            0
40-50             45               18              10           180
50-60             55               16              20           320
60-70             65               11              30           330
70-80             75               5               40           200
Total                              88                           660

∴ $\bar{X} = A + \frac{\sum fd}{N} = 35 + \frac{660}{88} = 42.5$
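The role of the mid-values can be made explicit in code. The sketch below, with the illustrative name `grouped_mean`, takes the class limits of the table above, replaces every class by its mid-value (the fundamental assumption stated earlier) and applies the direct method.

```python
# Class limits and frequencies from the table above.
classes = [(0, 10), (10, 20), (20, 30), (30, 40),
           (40, 50), (50, 60), (60, 70), (70, 80)]
freqs = [3, 8, 12, 15, 18, 16, 11, 5]


def grouped_mean(class_limits, fs):
    """Mean of a grouped distribution, each class represented by its mid-value."""
    mids = [(lower + upper) / 2 for lower, upper in class_limits]
    return sum(f * x for x, f in zip(mids, fs)) / sum(fs)


print(grouped_mean(classes, freqs))  # 42.5
```

The result agrees with the short-cut computation, as it must, since both methods rest on the same mid-values.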
Example 4: The following table gives the distribution of weekly wages of workers in a factory. Calculate the arithmetic mean of the distribution.
Weekly Wages    : 240-269  270-299  300-329  330-359  360-389  390-419  420-449
No. of Workers  : 7        19       27       15       12       12       8
Solution: It may be noted here that the given class intervals are inclusive. However, for the computation of mean, they need not be converted into exclusive class intervals.
Class Intervals   Mid Values (X)   Frequency (f)   d = X - 344.5   fd
240-269           254.5            7               -90             -630
270-299           284.5            19              -60             -1140
300-329           314.5            27              -30             -810
330-359           344.5            15              0               0
360-389           374.5            12              30              360
390-419           404.5            12              60              720
420-449           434.5            8               90              720
Total                              100                             -780

∴ $\bar{X} = A + \frac{\sum fd}{N} = 344.5 - \frac{780}{100} = 336.7$
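In a grouped frequency distribution, if all the classes are of equal width, say 'h', the successive mid-values of various classes will differ from each other by this width. This fact can be utilised for reducing the work of computations.
Let us define $u_i = \frac{X_i - A}{h}$. Multiplying both sides by $f_i$ and taking sum over all the observations we have,

$\sum_{i=1}^{n} f_i u_i = \frac{1}{h} \sum_{i=1}^{n} f_i (X_i - A)$

or $h \sum_{i=1}^{n} f_i u_i = \sum_{i=1}^{n} f_i X_i - A \sum_{i=1}^{n} f_i = \sum_{i=1}^{n} f_i X_i - A \cdot N$

Dividing both sides by N, we have

$h \times \frac{\sum_{i=1}^{n} f_i u_i}{N} = \frac{\sum_{i=1}^{n} f_i X_i}{N} - A = \bar{X} - A$

∴ $\bar{X} = A + h \times \frac{\sum_{i=1}^{n} f_i u_i}{N}$ .... (5)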
Using this relation we can simplify the computations of Example 4, as shown below.
u = (X - 344.5)/30    f      fu
-3                    7      -21
-2                    19     -38
-1                    27     -27
0                     15     0
1                     12     12
2                     12     24
3                     8      24
Total                 100    -26

∴ $\bar{X} = 344.5 - \frac{30 \times 26}{100} = 336.7$
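Relation (5) translates directly into a short routine. The following is a minimal sketch, assuming equal class widths; the name `mean_step_deviation` is illustrative, and the data are the mid-values and frequencies of Example 4.

```python
def mean_step_deviation(mids, fs, assumed_mean, width):
    """Step-deviation method: A + h * sum(f * u) / N, with u = (X - A) / h."""
    n = sum(fs)
    sum_fu = sum(f * (x - assumed_mean) / width for x, f in zip(mids, fs))
    return assumed_mean + width * sum_fu / n


# Mid-values and frequencies of the weekly wage distribution (Example 4).
mids = [254.5, 284.5, 314.5, 344.5, 374.5, 404.5, 434.5]
freqs = [7, 19, 27, 15, 12, 12, 8]

print(round(mean_step_deviation(mids, freqs, 344.5, 30), 1))  # 336.7
```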
Example 5: Following table gives the distribution of companies according to size of capital. Find the mean size of the capital of a company.
Capital (Lacs Rs)   : < 5   < 10   < 15   < 20   < 25   < 30
No. of Companies    : 20    27     29     38     48     53
Solution: This is a 'less than' cumulative frequency distribution. This will first be converted into class intervals.
Class Intervals   Frequency (f)   Mid-values (X)   u = (X - 12.5)/5   fu
0-5               20              2.5              -2                 -40
5-10              7               7.5              -1                 -7
10-15             2               12.5             0                  0
15-20             9               17.5             1                  9
20-25             10              22.5             2                  20
25-30             5               27.5             3                  15
Total             53                                                  -3

∴ $\bar{X} = 12.5 - \frac{5 \times 3}{53}$ = Rs 12.22 Lacs
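The conversion of a 'less than' cumulative distribution into class frequencies is a matter of taking successive differences. A minimal sketch follows; the helper name `class_frequencies` is illustrative, and, as in the solution above, the first class is assumed to start at 0 with a common width of 5.

```python
# 'Less than' cumulative distribution of Example 5.
upper_limits = [5, 10, 15, 20, 25, 30]
cum_freqs = [20, 27, 29, 38, 48, 53]


def class_frequencies(cumulative):
    """Successive differences of 'less than' cumulative frequencies."""
    return [cumulative[0]] + [b - a for a, b in zip(cumulative, cumulative[1:])]


freqs = class_frequencies(cum_freqs)
width = 5  # assumed common class width, first class taken as 0-5
mids = [upper - width / 2 for upper in upper_limits]
mean = sum(f * x for x, f in zip(mids, freqs)) / sum(freqs)

print(freqs)           # [20, 7, 2, 9, 10, 5]
print(round(mean, 2))  # 12.22
```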
Example 6: A charitable organisation decided to give old age pension to people over sixty years of age. The scales of pension were fixed as follows :

Age Group            : 60-65  65-70  70-75  75-80  80-85  85-90  90-95
Pension/Month (Rs)   : 100    120    140    160    180    200    220

If the total pension paid per month in various age groups is :

Age Group            : 60-65  65-70  70-75  75-80  80-85  85-90  90-95
Total Pension/Month  : 700    600    840    800    720    600    440
Calculate the average amount of pension paid per month per head and the average age of the group of old persons.
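Solution: The computations of pension per head and the average age are shown in the following table.

Age Group   Rate of Pension      Total Pension paid     No. of Persons   Mid-value of   u = (X - 77.5)/5   fu
            per month (Y)        per month (T)          f = T ÷ Y        Age Group (X)
            (in Rs)              (in Rs)
60-65       100                  700                    7                62.5           -3                 -21
65-70       120                  600                    5                67.5           -2                 -10
70-75       140                  840                    6                72.5           -1                 -6
75-80       160                  800                    5                77.5           0                  0
80-85       180                  720                    4                82.5           1                  4
85-90       200                  600                    3                87.5           2                  6
90-95       220                  440                    2                92.5           3                  6
Total                            4700                   32                                                 -21

Average pension per head = $\frac{\text{Total pension paid}}{\text{No. of persons}} = \frac{4700}{32}$ = Rs 146.88

Average age = $77.5 + \frac{5 \times (-21)}{32}$ = 77.5 - 3.28 = 74.22 years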
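The arithmetic of Example 6 can be reproduced as follows. This is a sketch under the same assumption as the solution, namely that each age group is represented by its mid-value; the variable names are illustrative.

```python
# Data of Example 6.
rates = [100, 120, 140, 160, 180, 200, 220]    # pension per head (Y)
totals = [700, 600, 840, 800, 720, 600, 440]   # total pension paid (T)
age_mids = [62.5, 67.5, 72.5, 77.5, 82.5, 87.5, 92.5]  # mid-values of age groups

# Number of pensioners in each group: f = T / Y (exact in this example).
persons = [t // y for t, y in zip(totals, rates)]

avg_pension = sum(totals) / sum(persons)
avg_age = sum(f * x for x, f in zip(age_mids, persons)) / sum(persons)

print(persons)                # [7, 5, 6, 5, 4, 3, 2]
print(round(avg_pension, 2))  # 146.88
print(round(avg_age, 2))      # 74.22
```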
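When the arithmetic mean of a frequency distribution is calculated by short-cut or step-deviation method, the accuracy of the calculations can be checked by using the following formulae, given by Charlier.

For short-cut method

$\sum f_i (d_i + 1) = \sum f_i d_i + \sum f_i$

or $\sum f_i d_i = \sum f_i (d_i + 1) - \sum f_i = \sum f_i (d_i + 1) - N$

Similarly, for step-deviation method

$\sum f_i (u_i + 1) = \sum f_i u_i + \sum f_i$

or $\sum f_i u_i = \sum f_i (u_i + 1) - \sum f_i = \sum f_i (u_i + 1) - N$

Checking the accuracy of calculations of Example 5, we have

$\sum f(u + 1) = 20 \times (-1) + (7 \times 0) + (2 \times 1) + (9 \times 2) + (10 \times 3) + (5 \times 4) = 50$

Since $\sum f(u + 1) - N = 50 - 53 = -3 = \sum fu$, the calculations are correct.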
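Charlier's check is meant for totals obtained by hand, so a sketch of it in code takes the hand-computed total as an input. The function name `charlier_check` and its parameter `reported_sum` are illustrative, not from the text; the data are those of Example 5.

```python
def charlier_check(devs, fs, reported_sum):
    """Return True if a hand-computed sum(f*u) satisfies Charlier's identity
    sum(f * (u + 1)) - N == sum(f * u)."""
    n = sum(fs)
    sum_f_u_plus_1 = sum(f * (u + 1) for u, f in zip(devs, fs))
    return sum_f_u_plus_1 - n == reported_sum


u = [-2, -1, 0, 1, 2, 3]
f = [20, 7, 2, 9, 10, 5]

print(charlier_check(u, f, -3))  # True: the total -3 found in Example 5 passes
print(charlier_check(u, f, 3))   # False: a sign slip would be caught
```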
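Let $X_1, X_2, ....., X_n$ be n values with their respective weights $w_1, w_2, ....., w_n$. Their weighted arithmetic mean, denoted as $\bar{X}_w$, is given by,

(i) $\bar{X}_w = \frac{\sum w_i X_i}{\sum w_i}$

(ii) $\bar{X}_w = A + \frac{\sum w_i d_i}{\sum w_i}$ (where $d_i = X_i - A$)

(iii) $\bar{X}_w = A + h \times \frac{\sum w_i u_i}{\sum w_i}$ (where $u_i = \frac{X_i - A}{h}$)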
Example 7: Ram purchased equity shares of a company in 4 successive months, as given below. Find the average price per share.

Month     No. of Shares   Price per share (in Rs.)
Dec-91    200             100
Jan-92    250             150
Feb-92    280             200
Mar-92    300             125

Solution: The average price is given by the weighted average of prices, taking the number of shares purchased as weights.

Month     Price of share (X)   No. of shares (w)   d = X - 150   dw
          (in Rs)
Dec-91    100                  200                 -50           -10000
Jan-92    150                  250                 0             0
Feb-92    200                  280                 50            14000
Mar-92    125                  300                 -25           -7500
Total                          1030                              -3500

$\bar{X}_w = 150 - \frac{3500}{1030}$ = Rs 146.60
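Formula (i) for the weighted mean is a one-line computation. The sketch below uses the prices and share counts of Example 7; the function name `weighted_mean` is illustrative.

```python
def weighted_mean(xs, ws):
    """Weighted arithmetic mean: sum(w * X) / sum(w)."""
    return sum(w * x for x, w in zip(xs, ws)) / sum(ws)


prices = [100, 150, 200, 125]   # price per share (X)
shares = [200, 250, 280, 300]   # number of shares bought (w)

print(round(weighted_mean(prices, shares), 2))  # 146.6
```

The simple mean of the four prices is Rs 143.75; the weighted mean is higher here because more shares were bought in the months with higher prices than in December.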
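Example 8: From the following results of two colleges A and B, find out which of the two is better :

Examination   College A               College B
              Appeared   Passed       Appeared   Passed
M.Sc.         60         40           200        160
M.A.          100        60           240        200
B.Sc.         200        150          200        140
B.A.          120        75           160        100

Solution: Performance of the two colleges can be compared by taking weighted arithmetic mean of the pass percentage in various classes, with the number of students appeared as weights. The calculation of weighted arithmetic mean is shown in the following table.

Class    College A                                        College B
         Appeared (w_A)   Pass % (X_A)   w_A X_A          Appeared (w_B)   Pass % (X_B)   w_B X_B
M.Sc.    60               66.67          4000             200              80.00          16000
M.A.     100              60.00          6000             240              83.33          20000
B.Sc.    200              75.00          15000            200              70.00          14000
B.A.     120              62.50          7500             160              62.50          10000
Total    480                             32500            800                             60000

$\bar{X}_w$ for College A = $\frac{\sum w_A X_A}{\sum w_A} = \frac{32500}{480}$ = 67.71%

$\bar{X}_w$ for College B = $\frac{\sum w_B X_B}{\sum w_B} = \frac{60000}{800}$ = 75%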
Since the weighted average of pass percentage is higher for college B, college B is better.
Remarks: If $\bar{X}$ denotes simple mean and $\bar{X}_w$ denotes the weighted mean of the same data, then
(i) $\bar{X} = \bar{X}_w$, when equal weights are assigned to all the items.
(ii) $\bar{X} > \bar{X}_w$, when items of small magnitude are assigned greater weights and items of large magnitude are assigned lesser weights.
(iii) $\bar{X} < \bar{X}_w$, when items of small magnitude are assigned lesser weights and items of large magnitude are assigned greater weights.
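The difference between the simple and the weighted mean can be seen on the data of Example 8. The following sketch computes both for each college; the helper name `simple_and_weighted` is illustrative.

```python
def simple_and_weighted(appeared, passed):
    """Return (simple mean, weighted mean) of the pass percentages,
    using the number of students appeared as weights."""
    pct = [100 * p / a for a, p in zip(appeared, passed)]
    simple = sum(pct) / len(pct)
    weighted = sum(a * x for a, x in zip(appeared, pct)) / sum(appeared)
    return simple, weighted


college_a = simple_and_weighted([60, 100, 200, 120], [40, 60, 150, 75])
college_b = simple_and_weighted([200, 240, 200, 160], [160, 200, 140, 100])

print([round(v, 2) for v in college_a])  # [66.04, 67.71]
print([round(v, 2) for v in college_b])  # [73.96, 75.0]
```

In both colleges the weighted mean exceeds the simple mean, which corresponds to case (iii) of the remarks. Note also that the weighted mean of pass percentages equals total passed divided by total appeared, expressed as a percentage.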
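Proof: We know that $\bar{X} = \frac{\sum f_i X_i}{N}$, i.e., $\sum f_i X_i = N\bar{X}$.
Let $d_i = X_i - \bar{X}$, where i = 1, 2 ..... n. Multiplying both sides by $f_i$ and taking sum over all the observations, we have
$\sum f_i d_i = \sum f_i (X_i - \bar{X}) = \sum f_i X_i - \bar{X} \sum f_i = N\bar{X} - N\bar{X} = 0$
Thus, the sum of deviations of observations from their arithmetic mean is zero.
2.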
The sum of squares of deviations of observations is minimum when taken from their arithmetic mean. Because of this, the mean is sometimes termed as 'least square' measure of central tendency.
Proof: The sum of squares of deviations of observations from arithmetic mean = $\sum f_i (X_i - \bar{X})^2$
Similarly, we can define sum of squares of deviations of observations from any arbitrary value A as

$S = \sum f_i (X_i - A)^2$ .... (1)

We have to show that S will be minimum when $A = \bar{X}$. To prove this, we try to find that value of A for which S is minimum. The necessary and sufficient conditions for minimum of S are :

$\frac{dS}{dA} = 0$ and $\frac{d^2S}{dA^2} > 0$

Differentiating (1) w.r.t. A we have

$\frac{dS}{dA} = -2 \sum f_i (X_i - A) = 0$ .... (2)

i.e., $\sum f_i (X_i - A) = 0$ or $\sum f_i X_i - NA = 0$

or $A = \frac{\sum f_i X_i}{N} = \bar{X}$

Thus, $\frac{dS}{dA} = 0$ when $A = \bar{X}$.
Further, to show that S is minimum, it will be shown that $\frac{d^2S}{dA^2} > 0$ at $A = \bar{X}$. Differentiating (2) further w.r.t. A, we have

$\frac{d^2S}{dA^2} = 2 \sum f_i = 2N$, which is always positive.

Hence, S is minimum when $A = \bar{X}$.
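The least-square property can also be observed numerically. The sketch below evaluates S for several trial values of A; the function name `sum_sq_dev` is illustrative, and the data are the mid-values and frequencies of the grouped distribution whose mean was found earlier to be 42.5.

```python
def sum_sq_dev(xs, fs, a):
    """S = sum of f * (X - A)^2, the sum of squared deviations about A."""
    return sum(f * (x - a) ** 2 for x, f in zip(xs, fs))


xs = [5, 15, 25, 35, 45, 55, 65, 75]
fs = [3, 8, 12, 15, 18, 16, 11, 5]
n = sum(fs)
mean = sum(f * x for x, f in zip(xs, fs)) / n  # 42.5

for a in (35, 40, mean, 45, 50):
    print(a, sum_sq_dev(xs, fs, a))
```

The printed value of S is smallest at A = 42.5. In fact S(A) exceeds its value at the mean by exactly $N(\bar{X} - A)^2$, which follows on expanding the square in (1).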
3. The relation $\bar{X} = \frac{\sum f_i X_i}{N}$ involves three values: $\bar{X}$, $\sum f_i X_i$ and N. According to this property, if any two of the three values are known, the third can be easily computed. This property is obvious and requires no proof.
4. If $\bar{X}_1$ and $N_1$ are the mean and number of observations of a series and $\bar{X}_2$ and $N_2$ are the corresponding magnitudes of another series, then the mean $\bar{X}$ of the combined series of $N_1 + N_2$ observations is given by

$\bar{X} = \frac{N_1 \bar{X}_1 + N_2 \bar{X}_2}{N_1 + N_2}$
Proof : To find mean of the combined series, we have to find sum of its observations. Now, the sum of observations of the first series, i.e., $\sum f_1 X_1 = N_1 \bar{X}_1$ and the sum of observations of the second series, i.e., $\sum f_2 X_2 = N_2 \bar{X}_2$.
∴ The sum of observations of the combined series, i.e., $N_1 \bar{X}_1 + N_2 \bar{X}_2$.
Thus, the combined mean $\bar{X} = \frac{N_1 \bar{X}_1 + N_2 \bar{X}_2}{N_1 + N_2}$
This result can be generalised: If there are k series each with mean $\bar{X}_i$ and number of observations equal to $N_i$, where i = 1, 2 ..... k, the mean of the combined series of $N_1 + N_2 + ..... + N_k$ observations is given by

$\bar{X} = \frac{N_1 \bar{X}_1 + N_2 \bar{X}_2 + ... + N_k \bar{X}_k}{N_1 + N_2 + ... + N_k} = \frac{\sum_{i=1}^{k} N_i \bar{X}_i}{\sum_{i=1}^{k} N_i}$
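The generalised result is simply a weighted mean of the group means, with group sizes as weights. A minimal sketch, with the illustrative name `combined_mean`, is given below; the figures used are those of the rainfall problem in Example 10 (six days averaging 10 cms and one day of 45 cms).

```python
def combined_mean(sizes, means):
    """Mean of the combined series: sum(N_i * mean_i) / sum(N_i)."""
    return sum(n * m for n, m in zip(sizes, means)) / sum(sizes)


print(combined_mean([6, 1], [10, 45]))  # 15.0
```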
5. If a constant B is added to (subtracted from) every observation, the mean of these observations is also increased (decreased) by B.
Proof : Let $\bar{X}$ be the mean of the observations $X_1, X_2 ..... X_n$ with respective frequencies as $f_1, f_2 ..... f_n$. When B is added to every observation, let $u_i = X_i + B$. Multiplying both sides by $f_i$ and taking sum over all the observations, we get
$\sum f_i u_i = \sum f_i (X_i + B) = \sum f_i X_i + NB$
Dividing both sides by N we get

$\frac{\sum f_i u_i}{N} = \frac{\sum f_i X_i}{N} + B$ or $\bar{u} = \bar{X} + B$.
i.e., the mean of $u_i = X_i + B$ is obtained by adding B to the mean of $X_i$ values. Similarly, it can be shown that if $v_i = X_i - B$, then $\bar{v} = \bar{X} - B$.
6. If every observation is multiplied (divided) by a constant b, the mean of these observations also gets multiplied (divided) by it.
Proof: Let us define $w_i = bX_i$. Multiplying both sides by $f_i$ and taking sum over all the observations, we get $\sum f_i w_i = b \sum f_i X_i$. Dividing both sides by N, we get

$\frac{\sum f_i w_i}{N} = b \frac{\sum f_i X_i}{N}$ or $\bar{w} = b\bar{X}$

Similarly, it can be shown that if $D_i = \frac{X_i}{b}$, then $\bar{D} = \frac{\bar{X}}{b}$
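Properties 5 and 6 are easy to confirm on actual figures. The sketch below uses the seven daily sales values that appear in Example 15 of this lesson; the constants 10 and 5 are arbitrary choices made here for illustration.

```python
def mean(xs):
    """Simple arithmetic mean of individual observations."""
    return sum(xs) / len(xs)


sales = [100, 150, 125, 140, 160, 200, 250]

shifted = [x + 10 for x in sales]  # property 5: add a constant B = 10
scaled = [x / 5 for x in sales]    # property 6: divide by a constant b = 5

print(round(mean(sales), 2))    # 160.71
print(round(mean(shifted), 2))  # 170.71
print(round(mean(scaled), 2))   # 32.14
```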
Using properties 5 and 6, we can derive the following results :
If $Y_i = a + bX_i$, then $\sum f_i Y_i = \sum f_i (a + bX_i)$ or $\sum f_i Y_i = a \sum f_i + b \sum f_i X_i$.
Dividing both sides by N (= $\sum f_i$), we have

$\frac{\sum f_i Y_i}{N} = a + b \frac{\sum f_i X_i}{N}$ or $\bar{Y} = a + b\bar{X}$
This shows that the relationship between the means of two variables is the same as the relationship between the variables themselves.
7. If some observations of a series are replaced by some other observations, then the mean of original observations will change by the average change in magnitude of the changed observations.
Proof: Let mean of n observations be $\bar{X} = \frac{X_1 + X_2 + \cdots + X_n}{n}$. Further, let $X_1, X_2, X_3$ be replaced by the respective observations $Y_1, Y_2, Y_3$. Therefore, the change in magnitude of the changed observations = $(Y_1 + Y_2 + Y_3) - (X_1 + X_2 + X_3)$.
Hence average change in magnitude = $\frac{(Y_1 + Y_2 + Y_3) - (X_1 + X_2 + X_3)}{n}$.
Thus, new $\bar{X}$ = old $\bar{X}$ + average change in magnitude.
Example 9: There are 130 teachers and 100 non-teaching employees in a college. The respective distributions of their monthly salaries are given in the following table :
From the above data find :
(i) Average monthly salary of a teacher.
(ii) Average monthly salary of a non-teaching employee.
(iii) Average monthly salary of a college employee (teaching and non-teaching).
Solution: (i) Average monthly salary of a teacher
Example 10: The average rainfall for a week, excluding Sunday, was 10 cms. Due to heavy rainfall on Sunday, the average for the week rose to 15 cms. How much rainfall was on Sunday?
Solution: A week can be treated as composed of two groups: First group consisting of 6 days excluding Sunday for which $N_1 = 6$ and $\bar{X}_1 = 10$; the second group consisting of only Sunday for which $N_2 = 1$. Also, mean of this group will be equal to the observation itself. Let this be X. We have to determine the value of X.
Using the formula for the combined mean, we have $15 = \frac{6 \times 10 + 1 \times X}{7}$ or 60 + X = 105.
∴ X = 105 - 60 = 45 cms. Thus, the rainfall on Sunday was 45 cms.
Example 11: The mean age of the combined group of men and women is 30.5 years. If the mean age of the sub-group of men is 35 years and that of the sub-group of women is 25 years, find out percentage of men and women in the group.
Solution: Let x be the percentage of men in the combined group. Therefore, percentage of women = 100 - x.
We are given that $\bar{X}_1$ (men) = 35 years and $\bar{X}_2$ (women) = 25 years. Also $\bar{X}$ (combined) = 30.5

∴ $30.5 = \frac{35x + 25(100 - x)}{x + 100 - x}$ or 3050 = 35x + 2500 - 25x

or $x = \frac{550}{10} = 55\%$. Thus, there are 55% men and 45% women in the group.
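Both examples amount to solving the combined-mean formula for one unknown. The sketch below does this in general form; the function names `missing_group_mean` and `share_of_first_group` are illustrative and not part of the text.

```python
def missing_group_mean(n1, mean1, n2, combined):
    """Solve combined = (n1*mean1 + n2*mean2) / (n1 + n2) for mean2."""
    return (combined * (n1 + n2) - n1 * mean1) / n2


def share_of_first_group(mean1, mean2, combined):
    """Fraction p of the first group, from combined = p*mean1 + (1 - p)*mean2."""
    return (combined - mean2) / (mean1 - mean2)


print(missing_group_mean(6, 10, 1, 15))    # 45.0  (Example 10)
print(share_of_first_group(35, 25, 30.5))  # 0.55  (Example 11)
```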
Example 12: The following is the distribution of weights (in lbs.) of 60 students of a class:

Weights           : 93-97  98-102  103-107  108-112  113-117  118-122  123-127  128-132  Total
No. of Students   : 2      5       12       ?        14       ?        3        1        60

If the mean weight of the students is 110.917, find the missing frequencies.
Solution: Let $f_1$ be the frequency of the class 108-112. Then, the frequency of the class 118-122 is given by $60 - (2 + 5 + 12 + 14 + 3 + 1 + f_1) = 23 - f_1$
Writing this information in tabular form we have :

Weights (in lbs.)   No. of Students (f)   Mid-Values (X)   u = (X - 110)/5   fu
93-97               2                     95               -3                -6
98-102              5                     100              -2                -10
103-107             12                    105              -1                -12
108-112             f1                    110              0                 0
113-117             14                    115              1                 14
118-122             23 - f1               120              2                 46 - 2f1
123-127             3                     125              3                 9
128-132             1                     130              4                 4
Total               60                                                       45 - 2f1

Using the formula for the arithmetic mean, we have $110.917 = 110 + \frac{(45 - 2f_1) \times 5}{60}$
or $11.004 = 45 - 2f_1$ or $2f_1 = 33.996 = 34$ (approximately)
Thus, $f_1 = 17$ is the frequency of the class 108-112 and 23 - 17 = 6 is the frequency of the class 118-122.
Example 13: Find out the missing item (x) of the following frequency distribution whose arithmetic mean is 11.37.
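X : 5   7   x    11   13   16   20
f : 2   4   29   54   11   8    4

Solution: $\bar{X} = \frac{\sum fX}{\sum f} = \frac{10 + 28 + 29x + 594 + 143 + 128 + 80}{112} = \frac{983 + 29x}{112}$

$11.37 = \frac{983 + 29x}{112}$ or $29x = 1273.44 - 983 = 290.44$

∴ $x = \frac{290.44}{29} = 10.015 = 10$ (approximately)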
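The same rearrangement of the mean formula can be written as a small routine. This is a sketch only; the name `missing_value` and its parameters are illustrative.

```python
def missing_value(known_xs, known_fs, missing_f, mean):
    """Solve mean = (sum(f*X) + missing_f * x) / N for the unknown item x."""
    n = sum(known_fs) + missing_f
    known_total = sum(f * x for x, f in zip(known_xs, known_fs))
    return (mean * n - known_total) / missing_f


# Example 13: the item with frequency 29 is unknown.
known_xs = [5, 7, 11, 13, 16, 20]
known_fs = [2, 4, 54, 11, 8, 4]

print(round(missing_value(known_xs, known_fs, 29, 11.37), 2))  # 10.02
```

The small excess over 10 arises because the given mean, 11.37, is itself a rounded figure.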
Example 14: The arithmetic mean of 50 items of a series was calculated by a student as 20. However, it was later discovered that an item 25 was misread as 35. Find the correct value of mean.
Solution: N = 50 and $\bar{X}$ = 20 ∴ $\sum X_i$ = 50 × 20 = 1000
Thus $\sum X_i$ (corrected) = 1000 + 25 - 35 = 990 and $\bar{X}$ (corrected) = $\frac{990}{50}$ = 19.8
Alternatively, using property 7 :
$\bar{X}_{new} = \bar{X}_{old}$ + average change in magnitude = $20 - \frac{10}{50}$ = 20 - 0.2 = 19.8
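The correction of a misread item follows the same steps in code. A minimal sketch, with the illustrative name `corrected_mean`:

```python
def corrected_mean(n, wrong_mean, wrong_value, right_value):
    """Remove the misread item from the total, add the true one, divide by n."""
    corrected_total = n * wrong_mean - wrong_value + right_value
    return corrected_total / n


print(corrected_mean(50, 20, 35, 25))  # 19.8
```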
Example 15: The sales of a balloon seller on seven days of a week are as given below:

Days            : Mon  Tue  Wed  Thu  Fri  Sat  Sun
Sales (in Rs)   : 100  150  125  140  160  200  250

If the profit is 20% of sales, find his average profit per day.
Solution: Let P denote profit and S denote sales, ∴ $P = \frac{20}{100} \times S$
Using property 6, we can write $\bar{P} = \frac{20}{100} \times \bar{S}$ or $\bar{P} = \frac{1}{5} \times \bar{S}$

Now $\bar{S} = \frac{100 + 150 + 125 + 140 + 160 + 200 + 250}{7} = 160.71$

∴ $\bar{P} = \frac{160.71}{5}$ = Rs 32.14

Hence, the average profit of the balloon seller is Rs 32.14 per day.
Alternatively, we can find profit of each day and take mean of these values.

Days             : Mon  Tue  Wed  Thu  Fri  Sat  Sun
Profit (in Rs)   : 20   30   25   28   32   40   50

$\bar{P} = \frac{20 + 30 + 25 + 28 + 32 + 40 + 50}{7}$ = Rs 32.14
Out of all averages, arithmetic mean is the most popular average in statistics because of its merits given below:
1. Arithmetic mean is rigidly defined by an algebraic formula.
2. Calculation of arithmetic mean requires simple knowledge of addition, multiplication and division of numbers and hence, is easy to calculate. It is also simple to understand the meaning of arithmetic mean, e.g., the value per item or per unit, etc.
3. Calculation of arithmetic mean is based on all the observations and hence, it can be regarded as representative of the given data.
4. It is capable of being treated mathematically and hence, is widely used in statistical analysis.
5. Arithmetic mean can be computed even if the detailed distribution is not known but sum of observations and number of observations are known.
6. It is least affected by the fluctuations of sampling.
7. It represents the centre of gravity of the distribution because it balances the magnitudes of observations which are greater and less than it.
8. It provides a good basis for the comparison of two or more distributions.
Demerits
Although arithmetic mean satisfies most of the properties of an ideal average, it has certain drawbacks and should be used with care. Some demerits of arithmetic mean are:
1. It can neither be determined by inspection nor by graphical location.
2. Arithmetic mean cannot be computed for qualitative data, like data on intelligence, honesty, smoking habit, etc.
3. It is too much affected by extreme observations and hence, it does not adequately represent data consisting of some extreme observations.
4. The value of mean obtained for a data may not be an observation of the data and as such it is called a fictitious average.
5. Arithmetic mean cannot be computed when class intervals have open ends. To compute mean, some assumption regarding the width of class intervals is to be made.
6. In the absence of a complete distribution of observations the arithmetic mean may lead to fallacious conclusions. For example, there may be two entirely different distributions with same value of arithmetic mean.
7. Simple arithmetic mean gives greater importance to larger values and lesser importance to smaller values.
Hint : Take the mid-value of a class as the mean of its limits and find arithmetic mean by the step-deviation method.
2. The following table gives the monthly income (in rupees) of families in a certain locality. By stating the necessary assumptions, calculate arithmetic mean of the distribution.

Income            : Below 1000  1000-2000  2000-3000  3000-4000  4000-5000  5000 and above
No. of Families   : 100         1200       1450       250        70         30

Hint : This distribution is with open end classes. To calculate mean, it is to be assumed that the width of first class is same as the width of second class. On this assumption the lower limit of the first class will be 0. Similarly, it is assumed that the width of last class is equal to the width of last but one class. Therefore, the upper limit of the last class can be taken as 6,000.
3. Compute arithmetic mean of the following distribution of marks in Economics of 50 students.

Marks more than   : 0   10  20  30  40  50  60  70  80
No. of Students   : 50  46  40  33  25  15  8   3   0

Hint: First convert the distribution into class intervals and then calculate $\bar{X}$.
4. The monthly profits, in '000 rupees, of 100 shops are distributed as follows:

Profit per Shop   : 0-100  0-200  0-300  0-400  0-500  0-600
No. of Shops      : 12     30     57     77     94     100
Find average profit per shop.
Hint: This is a less than type cumulative frequency distribution.
5. Typist A can type a letter in five minutes, typist B in ten minutes and typist C in fifteen minutes. What is the average number of letters typed per hour per typist?
Hint: In one hour, A will type 12 letters, B will type 6 letters and C will type 4 letters.
6. A taxi ride in Delhi costs Rs 5 for the first kilometre and Rs 3 for every additional kilometre travelled. The cost of each kilometre is incurred at the beginning of the kilometre so that the rider pays for the whole kilometre. What is the average cost of travelling $2\frac{3}{4}$ kilometres?
Hint: Total cost of travelling $2\frac{3}{4}$ kilometres = Rs 5 + 3 + 3 = Rs 11.
7. A company gave bonus to its employees. The rates of bonus in various salary groups are :

Monthly Salary (in Rs)   : 1000-2000  2000-3000  3000-4000  4000-5000
Rate of Bonus (in Rs)    : 2000       2500       3000       3500

The actual salaries of staff members are as given below : 1120, 1200, 1500, 4500, 4250, 3900, 3700, 3950, 3750, 2900, 2500, 1650, 1350, 4800, 3300, 3500, 1100, 1800, 2450, 2700, 3550, 2400, 2900, 2600, 2750, 2900, 2100, 2600, 2350, 2450, 2500, 2700, 3200, 3800, 3100. Determine (i) Total amount of bonus paid and (ii) Average bonus paid per employee.
Hint: Find the frequencies of the classes from the given information.
8. Calculate arithmetic mean from the following distribution of weights of 100 students of a college. It is given that there is no student having weight below 90 lbs. and the total weight of persons in the highest class interval is 350 lbs.

Weights     : < 100  < 110  < 120  < 130  < 140  < 150  < 160  < 170  170 and above
Frequency   : 3      5      23     45     66     85     95     98     2

Hint: Rearrange this in the form of frequency distribution by taking class intervals as 90 - 100, 100 - 110, etc.
9. By arranging the following information in the form of a frequency distribution, find arithmetic mean. "In a group of companies 15%, 25%, 40% and 75% of them get profits less than Rs 6 lakhs, 10 lakhs, 14 lakhs and 20 lakhs respectively and 10% get Rs 30 lakhs or more but less than 40 lakhs."
Hint: Take class intervals as 0 - 6, 6 - 10, 10 - 14, 14 - 20, etc.
10. Find class intervals if the arithmetic mean of the following distribution is 38.2 and the assumed mean is equal to 40.

Step deviations   : -3  -2  -1  0   1   2   3
Frequency         : 8   14  18  28  17  10  5

Hint: Use the formula $\bar{X} = A + h \times \frac{\sum fu}{N}$ to find the class width h.
11. From the following data, calculate the mean rate of dividend obtainable to an investor holding shares of various companies as shown :

Percentage Dividend                                          : 30-40  20-30  10-20  0-10
No. of Companies                                             : 4      25     15     6
Average no. of shares of each company held by the investor   : 250    150    200    300

Hint: The no. of shares of each type = no. of companies × average no. of shares.
12. The mean weight of 150 students in a certain class is 60 kgs. The mean weight of boys in the class is 70 kgs and that of girls is 55 kgs. Find the number of girls and boys in the class.
Hint: Take $n_1$ as the no. of boys and $150 - n_1$ as the no. of girls.
13. The mean wage of 100 labourers working in a factory, running two shifts of 60 and 40 workers respectively, is Rs 38. The mean wage of 60 labourers working in the morning shift is Rs 40. Find the mean wage of 40 labourers working in the evening shift.
Hint: See example 10.
14. The mean of 25 items was calculated by a student as 20. If an item 13 is replaced by 30, find the changed value of mean.
Hint: See example 14.
15. The average daily price of share of a company from Monday to Friday was Rs 130. If the highest and lowest price during the week were Rs 200 and Rs 100 respectively, find average daily price when the highest and lowest price are not included.
Hint: See example 10.
16. The mean salary paid to 1000 employees of an establishment was found to be Rs 180.40. Later on, after disbursement of the salary, it was discovered that the salaries of two employees were wrongly recorded as Rs 297 and Rs 165 instead of Rs 197 and Rs 185. Find the correct arithmetic mean.
Hint: See example 14.
17. Find the missing frequencies of the following frequency distribution :
Hint: See example 12.
18. Marks obtained by students who passed a given examination are given below :

Marks obtained (in percent)   : 40-50  50-60  60-70  70-80  80-90  90-100
No. of Students               : 10     12     20     9      5      4

If 100 students took the examination and their mean marks were 51, calculate the mean marks of students who failed.
Hint: See example 9.
19. A appeared in three tests of the value of 20, 50 and 30 marks respectively. He obtained 75% marks in the first and 60% marks in the second test. What should be his percentage of marks in the third test in order that his aggregate is 60%?
Hint: Let x be the percentage of marks in third test. Then the weighted average of 75, 60 and x should be 60, where weights are 20, 50 and 30 respectively.
20. Price of a banana is 80 paise and the price of an orange is Rs 1.20. If a person purchases two dozens of bananas and one dozen of oranges, show by stating reasons that the average price per piece of fruit is 93 paise and not one rupee.
Hint: Correct average is weighted arithmetic average.
21. The average marks of 39 students of a class is 50. The marks obtained by 40th student are 39 more than the average marks of all the 40 students. Find mean marks of all the 40 students.
Hint: $\bar{X} + 39 + 39 \times 50 = 40\bar{X}$.
22. The means calculated for frequency distributions I and II were 36 and 32 respectively. Find the missing frequencies of the two distributions.

Class Intervals   Frequency of Distribution I   Frequency of Distribution II
5-15              4                             10
15-25             10                            14
25-35             14                            3y
35-45             16                            13
45-55             2x                            10
55-65             y                             x

Hint: $36 = \frac{40 + 200 + 420 + 640 + 100x + 60y}{44 + 2x + y}$ and $32 = \frac{100 + 280 + 90y + 520 + 500 + 60x}{47 + 3y + x}$
Solve these equations simultaneously for the values of x and y.
23. The following table gives the number of workers and total wages paid in three departments of a manufacturing unit :

Department   No. of Workers   Total wages (in Rs)
A            105              1,68,000
B            304              4,25,600
C            424              5,08,800

If a bonus of Rs 200 is given to each worker, what is the average percentage increase in wages of the workers of each department and of the total workers?
Hint: (i) Average wage in each department = Total wages ÷ No. of workers. The percentage increase is the bonus of Rs 200 expressed as a percentage of this average wage.
(ii) Average wage for total workers = $\frac{1,68,000 + 4,25,600 + 5,08,800}{105 + 304 + 424}$. Then, find percentage increase as before.
24. The following table gives the distribution of the number of kilometres travelled per salesman, of a pharmaceutical company, per day and their rates of conveyance allowance:

No. of kilometres travelled   No. of     Rate of conveyance allowance
per salesman                  salesmen   per kilometre (in Rs)
10-20                         3          2.50
20-30                         8          2.60
30-40                         15         2.70
40-50                         4          2.80

Calculate the average rate of conveyance allowance given to each salesman per kilometre by the company.
Hint: Obtain total number of kilometres travelled for each rate of conveyance allowance by multiplying mid-values of column 1 with column 2. Treat this as frequency 'f' and third column as 'X' and find $\bar{X}$.
25. The details of monthly income and expenditure of a group of five families are given in the following table:

Family   Income (in Rs)   Expenditure per member (in Rs)   No. of members in the family
A        1100             220                              4
B        1200             190                              5
C        1300             230                              4
D        1400             260                              3
E        1500             250                              4

Find: (i) Average income per member for the entire group of families.
(ii) Average expenditure per family.
(iii) The difference between actual and average expenditure for each family.
Hint: (i) Average income per member = $\frac{\text{Total income of the group of families}}{\text{Total no. of members in the group}}$
(ii) Average expenditure per family = $\frac{\text{Total expenditure of the group}}{\text{No. of families}}$
26. The following table gives distribution of monthly incomes of 200 employees of a firm:

Income (in Rs '00)   : 10-15  15-20  20-25  25-30  30-35  35-40
No. of employees     : 30     50     55     32     20     13

Estimate:
(i) Mean income of an employee per month.
(ii) Monthly contribution to welfare fund if every employee belonging to the top 80% of the earners is supposed to contribute 2% of his income to this fund.
Hint: The distribution of top 80% of the wage earners can be written as :
By taking mid-values of class intervals find $\sum fx$, i.e., total salary and take 2% of this.
27. The number of patients visiting diabetic clinic and protein urea clinic in a hospital during April 1991, are given below :

No. of days of attending   No. of Patients
                           Diabetic Clinic   Protein Urea Clinic
0-10                       2                 4
10-20                      8                 6
20-30                      7                 5
30-40                      7                 8
40-50                      4                 3
50-60                      2                 4

Which of these two diseases has more incidence in April 1991? Justify your conclusion.
Hint: The more incidence of disease is given by higher average number of patients.
28. A company has three categories of workers A, B and C. During 1994, the number of workers in respective category were 40, 240 and 120 with monthly wages Rs 1,000, Rs 1,300 and Rs 1,500. During the following year, the monthly wages of all the workers were increased by 15% and their number, in each category, were 130, 150 and 20, respectively.
(a) Compute the average monthly wages of workers for the two years.
(b) Compute the percentage change of average wage in 1995 as compared with 1994. Is it equal to 15%? Explain.
Hint: Since the weight of the largest wage is less in 1995, the increase in average wage will be less than 15%.
29. (a) The average cost of producing 10 units is Rs 6 and the average cost of producing 11 units is Rs 6.5. Find the marginal cost of the 11th unit.
(b) A salesman is entitled to bonus in a year if his average quarterly sales are at least Rs 40,000. If his average sales of the first three quarters is Rs 35,000, find his minimum level of sales in the fourth quarter so that he becomes eligible for bonus.
30. (a) The monthly salaries of five persons were Rs 5,000, Rs 5,500, Rs 6,000, Rs 7,000 and Rs 20,000. Compute their mean salary. Would you regard this mean as typical of the salaries? Explain.
(b) There are 100 workers in a company out of which 70 are males and 30 females. If a male worker earns Rs 100 per day and a female worker earns Rs 70 per day, find average wage. Would you regard this as a typical wage? Explain.
Hint: An average that is representative of most of the observations is said to be a typical average.
2.6 MEDIAN
Median of a distribution is that value of the variate which divides it into two equal parts. In terms of frequency curve, the ordinate drawn at median divides the area under the curve into two equal parts. Median is a positional average because its value depends upon the position of an item and not on its magnitude.
Determination of Median
(a) When individual observations are given
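The following steps are involved in the determination of median :
(i) The given observations are arranged in either ascending or descending order of magnitude.
(ii) Given that there are n observations, the median is given by:
1. The size of the $\left(\frac{n+1}{2}\right)$th observation, when n is odd.
2. The mean of the sizes of the $\frac{n}{2}$th and $\left(\frac{n}{2} + 1\right)$th observations, when n is even.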
Example 16: Find median of the following observations : 20, 15, 25, 28, 18, 16, 30.
Solution: Writing the observations in ascending order, we get 15, 16, 18, 20, 25, 28, 30.
Since n = 7, i.e., odd, the median is the size of the $\left(\frac{7+1}{2}\right)$th, i.e., 4th observation.
Hence, median, denoted by $M_d$ = 20.
Note: The same value of $M_d$ will be obtained by arranging the observations in descending order of magnitude.
Example 17: Find median of the data : 245, 230, 265, 236, 220, 250.
Solution: Arranging these observations in ascending order of magnitude, we get 220, 230, 236, 245, 250, 265. Here n = 6, i.e., even.
∴ Median will be arithmetic mean of the size of the $\frac{6}{2}$th, i.e., 3rd and $\left(\frac{6}{2} + 1\right)$th, i.e., 4th observations. Hence $M_d = \frac{236 + 245}{2} = 240.5$.
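The two-case rule for individual observations can be sketched as follows. The function name `median` is chosen here for illustration; Python's standard library offers `statistics.median`, which returns the same values for these data.

```python
def median(observations):
    """Median of individual observations: middle value for odd n,
    mean of the two middle values for even n."""
    xs = sorted(observations)
    n = len(xs)
    mid = n // 2
    if n % 2 == 1:
        return xs[mid]                 # the ((n + 1) / 2)th observation
    return (xs[mid - 1] + xs[mid]) / 2  # mean of (n/2)th and (n/2 + 1)th


print(median([20, 15, 25, 28, 18, 16, 30]))      # 20
print(median([245, 230, 265, 236, 220, 250]))    # 240.5
```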
Remarks: Consider the observations: 13, 16, 16, 17, 17, 18, 19, 21, 23. On the basis of the method given above, their median is 17.
According to the above definition of median, "half (i.e., 50%) of the observations should be below 17 and half of the observations should be above 17". Here we may note that only 3 observations are below 17 and 4 observations are above it and hence, the definition of median given above is somewhat ambiguous. In order to avoid this ambiguity, the median of a distribution may also be defined in the following way :
Median of a distribution is that value of the variate such that at least half of the observations are less than or equal to it and at least half of the observations are greater than or equal to it.
Based on this definition, we find that there are 5 observations which are less than or equal to 17 and there are 6 observations which are greater than or equal to 17. Since n = 9, the numbers 5 and 6 are both more than half, i.e., 4.5. Thus, median of the distribution is 17.
Further, if the number of observations is even and the two middle most observations are not equal, e.g., if the observations are 2, 2, 5, 6, 7, 8, then there are 3 observations (i.e., $\frac{n}{2} = 3$) which are less than or equal to 5 and there are 4 (i.e., more than half) observations which are greater than or equal to 5. Further, there are 4 observations which are less than or equal to 6 and there are 3 observations which are greater than or equal to 6. Hence, both 5 and 6 satisfy the conditions of the new definition of median. In such a case, any value lying in the closed interval [5, 6] can be taken as median. By convention we take the middle value of the interval as median. Thus, median is $\frac{5 + 6}{2} = 5.5$

(b) When ungrouped frequency distribution is given
In this case, the data are already arranged in the order of magnitude. Here, cumulative frequency is computed and the median is determined in a manner similar to that of individual observations. Example 18: Locate median of the following frequency distribution :
Variable (X)    : 10  11  12  13  14  15  16
Frequency (f)   : 8   15  25  20  12  10  5

Solution:

X      : 10  11  12  13  14  15  16
f      : 8   15  25  20  12  10  5
c.f.   : 8   23  48  68  80  90  95

Here N = 95, which is odd. Thus, median is size of the $\left(\frac{95 + 1}{2}\right)$th, i.e., 48th observation. From the table 48th observation is 12, ∴ $M_d$ = 12.
Alternative Method: $\frac{N}{2} = \frac{95}{2} = 47.5$. Looking at the frequency distribution we note that there are 48 observations which are less than or equal to 12 and there are 72 (i.e., 95 - 23) observations which are greater than or equal to 12. Hence, median is 12.
Example 19: Locate median of the following frequency distribution :
X   : 0   1   2   3   4   5   6   7
f   : 7   14  18  36  51  54  52  20

Solution:

X      : 0   1   2   3   4    5    6    7
f      : 7   14  18  36  51   54   52   20
c.f.   : 7   21  39  75  126  180  232  252

Here N = 252, i.e., even.
∴ Median is the mean of the size of 126th and 127th observation. From the table we note that 126th observation is 4 and 127th observation is 5.
∴ $M_d = \frac{4 + 5}{2} = 4.5$
Alternative Method: Looking at the frequency distribution we note that there are 126 observations which are less than or equal to 4 and there are 252 - 75 = 177 observations which are greater than or equal to 4. Similarly, observation 5 also satisfies this criterion.
Therefore, median = $\frac{4 + 5}{2} = 4.5$.
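Locating the median through cumulative frequencies can be sketched in Python as below. The helper names `value_at_position` and `median_from_frequencies` are illustrative; the values of X are assumed to be listed in ascending order, as in Examples 18 and 19.

```python
from itertools import accumulate


def value_at_position(xs, fs, k):
    """Return the k-th observation (1-based) by scanning cumulative frequencies."""
    for x, cf in zip(xs, accumulate(fs)):
        if k <= cf:
            return x
    raise ValueError("position exceeds total frequency")


def median_from_frequencies(xs, fs):
    """Median of an ungrouped frequency distribution (xs in ascending order)."""
    n = sum(fs)
    if n % 2 == 1:
        return value_at_position(xs, fs, (n + 1) // 2)
    lower = value_at_position(xs, fs, n // 2)
    upper = value_at_position(xs, fs, n // 2 + 1)
    return (lower + upper) / 2


print(median_from_frequencies([10, 11, 12, 13, 14, 15, 16],
                              [8, 15, 25, 20, 12, 10, 5]))         # 12
print(median_from_frequencies([0, 1, 2, 3, 4, 5, 6, 7],
                              [7, 14, 18, 36, 51, 54, 52, 20]))    # 4.5
```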
(c) When grouped frequency distribution is given
The determination of median, in this case, will be explained with the help of the following example. Example 20: Suppose we wish to find the median of the following frequency distribution.
Classes     : 0-10  10-20  20-30  30-40  40-50  50-60
Frequency   : 5     12     14     18     13     8

Solution: The median of a distribution is that value of the variate which divides the distribution into two equal parts. In case of a grouped frequency distribution, this implies that the ordinate drawn at the median divides the area under the histogram into two equal parts. Writing the given data in a tabular form, we have :

Classes   Frequency (f)   'Less than' type c.f.   Frequency Density
(1)       (2)             (3)                     (4)
0-10      5               5                       0.5
10-20     12              17                      1.2
20-30     14              31                      1.4
30-40     18              49                      1.8
40-50     13              62                      1.3
50-60     8               70                      0.8

Note : frequency density in a class = $\frac{\text{frequency of the class}}{\text{width of the class}} = \frac{f}{h}$
For the location of median, we make a histogram with heights of different rectangles equal to frequency density of the corresponding class. Such a histogram is shown below:
[Figure 2.1: Histogram]
Since the ordinate at median divides the total area under the histogram into two equal parts, we have to find a point (like Md as shown in the figure) on the X-axis such that an ordinate (AMd) drawn at it divides the total area under the histogram into two equal parts. We may note here that the area under each rectangle is equal to the frequency of the corresponding class, since area = length × breadth = frequency density × width of class = $\frac{f}{h} \times h = f$.
Thus, the total area under the histogram is equal to the total frequency N. In the given example N = 70, therefore $\frac{N}{2} = 35$. We note that the area of the first three rectangles is 5 + 12 + 14 = 31 and the area of the first four rectangles is 5 + 12 + 14 + 18 = 49. Thus, median lies in the fourth class interval, which is also termed the median class. Let the point, in the median class, at which median lies be denoted by Md. The position of this point should be such that the ordinate AMd (in the above histogram) divides the area of the median rectangle so that there are only 35 - 31 = 4 observations to its left. From the histogram, we can also say that the position of Md should be such that
$$\frac{M_d - 30}{40 - 30} = \frac{4}{18} \qquad \text{.... (1)}$$
Thus, $M_d = \frac{40}{18} + 30 = 32.2$.
Writing equation (1) in general notation, we have
$$\frac{M_d - L_m}{h} = \frac{\frac{N}{2} - C}{f_m} \quad \text{or} \quad M_d = L_m + \frac{\frac{N}{2} - C}{f_m} \times h \qquad \text{...(2)}$$
Where, Lm is the lower limit, h is the width and fm is the frequency of the median class, and C is the cumulative frequency of classes preceding the median class. Equation (2) gives the required formula for the computation of median.
Remarks:
1. Since the variable, in a grouped frequency distribution, is assumed to be continuous, we always take the exact value of $\frac{N}{2}$, including figures after decimals, when N is odd.
2. The above formula is also applicable when classes are of unequal width.
3. Median can be computed even if there are open end classes because here we need to know only the frequencies of classes preceding or following the median class.
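The interpolation formula for the median of a grouped frequency distribution can be sketched in Python as follows. The function name `grouped_median` and its argument layout are assumptions made for illustration. Each class carries its own width, so unequal widths are handled as noted in the remarks.

```python
def grouped_median(lower_limits, widths, freqs):
    """Md = Lm + (N/2 - C) / fm * h, located with 'less than' cumulative frequencies."""
    half = sum(freqs) / 2              # exact N/2, decimals retained when N is odd
    c = 0                              # cumulative frequency of classes preceding the current one
    for lm, h, fm in zip(lower_limits, widths, freqs):
        if fm and c + fm >= half:      # first class whose c.f. reaches N/2 is the median class
            return lm + (half - c) / fm * h
        c += fm

# Data of Example 20
md = grouped_median([0, 10, 20, 30, 40, 50], [10] * 6, [5, 12, 14, 18, 13, 8])
print(round(md, 1))
```

For the data of Example 20 the median class is 30 - 40 with C = 31, and the sketch gives a median of about 32.2, in agreement with the hand calculation.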
Determination of Median When 'greater than' type cumulative frequencies are given
By looking at the histogram, we note that one has to find a point denoted by Md such that the area to the right of the ordinate at Md is 35. The area of the last two rectangles is 13 + 8 = 21. Therefore, we have to get 35 - 21 = 14 units of area from the median rectangle towards the right of the ordinate. Let Um be the upper limit of the median class. Then the formula for median in this case can be written as
$$\frac{U_m - M_d}{h} = \frac{\frac{N}{2} - C}{f_m} \quad \text{or} \quad M_d = U_m - \frac{\frac{N}{2} - C}{f_m} \times h \qquad \text{.... (3)}$$
Note that C denotes the 'greater than type' cumulative frequency of classes following the median class. Applying this formula to the above example, we get
$$M_d = 40 - \frac{(35 - 21)}{18} \times 10 = 32.2$$
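The same value can be checked by working from the last class backwards. The following is a minimal sketch of formula (3); the function name `median_from_greater_than` is an assumption for this illustration.

```python
def median_from_greater_than(upper_limits, widths, freqs):
    """Md = Um - (N/2 - C) / fm * h, where C is the 'greater than' c.f. of the classes that follow."""
    half = sum(freqs) / 2
    c = 0                              # frequencies accumulated from the last class backwards
    for um, h, fm in zip(reversed(upper_limits), reversed(widths), reversed(freqs)):
        if fm and c + fm >= half:      # median class reached
            return um - (half - c) / fm * h
        c += fm

# Data of Example 20
md_gt = median_from_greater_than([10, 20, 30, 40, 50, 60], [10] * 6, [5, 12, 14, 18, 13, 8])
print(round(md_gt, 1))
```

Both routes lead to the same point on the X-axis because the areas to the left and to the right of the median ordinate are each equal to N/2.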
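Solution:
Calculation of Median
Since $\frac{N}{2} = 50$, the median class has lower limit Lm = 7, width h = 1, frequency fm = 22 and preceding cumulative frequency C = 38.
Thus, Md = $7 + \frac{50 - 38}{22} \times 1 = 7.55$ inches.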
Example 22: The following table gives the distribution of marks by 500 students in an examination. Obtain median of the given data.
Marks           :  0 - 9  10 - 19  20 - 29  30 - 39  40 - 49  50 - 59  60 - 69  70 - 79
No. of Students :    30      40       50       48       24      162      132       14
Solution: Since the class intervals are inclusive, therefore, it is necessary to convert them into class boundaries.
Class Intervals   Class Boundaries   Frequency   'Less than' type c.f.
   0 - 9            -0.5 -  9.5          30               30
  10 - 19            9.5 - 19.5          40               70
  20 - 29           19.5 - 29.5          50              120
  30 - 39           29.5 - 39.5          48              168
  40 - 49           39.5 - 49.5          24              192
  50 - 59           49.5 - 59.5         162              354
  60 - 69           59.5 - 69.5         132              486
  70 - 79           69.5 - 79.5          14              500
Since $\frac{N}{2} = 250$, the median class is 49.5 - 59.5 and, therefore, Lm = 49.5, h = 10, fm = 162, C = 192.
Thus, Md = $49.5 + \frac{250 - 192}{162} \times 10 = 53.08$ marks.
Example 23: The weekly wages of 1,000 workers of a factory are shown in the following table. Calculate median.
Weekly Wages (less than) :  425  475  525  575  625  675  725  775  825   875
No. of Workers           :    2   10   43  123  293  506  719  864  955  1000
Solution: The above is a 'less than' type frequency distribution. This will first be converted into class intervals.
Class Intervals   Frequency   Less than c.f.
less than 425          2             2
425 - 475              8            10
475 - 525             33            43
525 - 575             80           123
575 - 625            170           293
625 - 675            213           506
675 - 725            213           719
725 - 775            145           864
775 - 825             91           955
825 - 875             45          1000
Since $\frac{N}{2} = 500$, the median class is 625 - 675.
Thus, Md = $625 + \frac{500 - 293}{213} \times 50$ = Rs 673.59.
Example 25: The following table gives the daily profits (in Rs) of 195 shops of a town. Calculate mean and median.
Profits      :  50 - 60  60 - 70  70 - 80  80 - 90  90 - 100  100 - 110  110 - 120  120 - 130  130 - 140
No. of shops :    15       20       32       35       33        22         20         10          8
$\bar{X} = A + \ldots$
$\frac{N}{2} = \frac{195}{2} = 97.5$, so the median class is 80 - 90.
∴ Md = $80 + \frac{97.5 - 67}{35} \times 10$ = Rs 88.71
Example 26: Find median of the following distribution :
Mid-Values :  1500  2500  3500  4500  5500  6500  7500
Frequency  :    27    32    65    78    58    32     8
Solution: Since the mid-values are equally spaced, the difference between their two successive values will be the width of each class interval. This width is 1,000. On subtracting and adding half of this, i.e., 500 to each of the mid-values, we get the lower and the upper limits of the respective class intervals. After this, the calculation of median can be done in the usual way.
Mid-Values   Class Intervals   Frequency   c.f. (less than)
   1500        1000 - 2000         27             27
   2500        2000 - 3000         32             59
   3500        3000 - 4000         65            124
   4500        4000 - 5000         78            202
   5500        5000 - 6000         58            260
   6500        6000 - 7000         32            292
   7500        7000 - 8000          8            300
Since $\frac{N}{2} = 150$, the median class is 4,000 - 5,000.
Hence Md = $4{,}000 + \frac{150 - 124}{78} \times 1{,}000 = 4{,}333.33$.
If the frequencies of some classes are missing, however, the median of the distribution is known, then these frequencies can be determined by the use of median formula. Example 27: The following table gives the distribution of daily wages of 900 workers. However, the frequencies of the classes 40 - 50 and 60 - 70 are missing. If the median of the distribution is Rs 59.25, find the missing frequencies.
Wages (Rs)     :  30 - 40  40 - 50  50 - 60  60 - 70  70 - 80
No. of Workers :    120       ?       200       ?       185
Solution: Let f1 and f2 be the frequencies of the classes 40 - 50 and 60 - 70 respectively. Since median is given as 59.25, the median class is 50 - 60. Therefore, we can write
$$59.25 = 50 + \frac{450 - (120 + f_1)}{200} \times 10 = 50 + \frac{330 - f_1}{20}$$
or 9.25 × 20 = 330 - f1 or f1 = 330 - 185 = 145. Further, f2 = 900 - (120 + 145 + 200 + 185) = 250.
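The algebra of Example 27 amounts to solving the median formula for the unknown frequency. A minimal sketch is given below; the function name `missing_frequencies` and its parameter names are assumptions made for this illustration.

```python
def missing_frequencies(median, lm, h, fm, n, c_known, known_total):
    """Solve median = lm + (n/2 - (c_known + f1)) / fm * h for f1, then obtain f2 from the total.

    c_known     : sum of the known frequencies that precede the median class
    known_total : sum of all known frequencies in the table
    """
    f1 = n / 2 - c_known - (median - lm) * fm / h
    f2 = n - known_total - f1
    return f1, f2

# Example 27: median 59.25, median class 50 - 60, N = 900
f1, f2 = missing_frequencies(59.25, 50, 10, 200, 900, 120, 120 + 200 + 185)
print(f1, f2)
```

The sketch assumes, as in the example, that exactly one unknown frequency lies before the median class and one after it.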
So far we have calculated median by the use of a formula. Alternatively, it can be determined graphically, as illustrated in the following example. Example 28: The following table shows the daily sales of 230 footpath sellers of Chandni Chowk :
Sales (in Rs)    No. of Sellers
    0 -  500          12
  500 - 1000          18
 1000 - 1500          35
 1500 - 2000          42
 2000 - 2500          50
 2500 - 3000          45
 3000 - 3500          20
 3500 - 4000           8
Locate the median of the above data using
(i) only the less than type ogive, and
(ii) both, the less than and the greater than type ogives.
Solution:
Sales (in Rs)    Less than c.f. (upper limit)   More than c.f. (lower limit)
    0 -  500              12                            230
  500 - 1000              30                            218
 1000 - 1500              65                            200
 1500 - 2000             107                            165
 2000 - 2500             157                            123
 2500 - 3000             202                             73
 3000 - 3500             222                             28
 3500 - 4000             230                              8
(i) Using the less than type ogive
[Figure 2.2]
The value $\frac{N}{2} = 115$ is marked on the vertical axis and a horizontal line is drawn from this point to meet the ogive at point S. Drop a perpendicular from S. The point at which this meets the X-axis is the median.
(ii) Using both types of ogives
Figure 2.3
A perpendicular is dropped from the point of intersection of the two ogives. The point at which it intersects the X-axis gives median. It is obvious from Fig. 2.2 and 2.3 that median = 2080.
Properties of Median
1. It is a positional average.
2. It can be shown that the sum of absolute deviations is minimum when taken from median. This property implies that median is centrally located.
Merits and Demerits of Median
(a) Merits
1. It is easy to understand and easy to calculate, especially in series of individual observations and ungrouped frequency distributions. In such cases it can even be located by inspection.
2. Median can be determined even when class intervals have open ends or are not of equal width.
3. It is not much affected by extreme observations. It is also independent of range or dispersion of the data.
4. Median can also be located graphically.
5. It is a centrally located measure of average since the sum of absolute deviations is minimum when taken from median.
6. It is the only suitable average when data are qualitative and it is possible to rank various items according to qualitative characteristics.
7. Median conveys the idea of a typical observation.
(b) Demerits
1. In case of individual observations, the process of location of median requires their arrangement in the order of magnitude, which may be a cumbersome task, particularly when the number of observations is very large.
2. It, being a positional average, is not capable of being treated algebraically.
3. In case of individual observations, when the number of observations is even, the median is estimated by taking the mean of the two middle-most observations, which is not an actual observation of the given data.
4. It is not based on the magnitudes of all the observations. There may be a situation where different sets of observations give the same value of median. For example, the following two different sets of observations have median equal to 30. Set I : 10, 20, 30, 40, 50 and Set II : 15, 25, 30, 60, 90.
5. In comparison to arithmetic mean, it is much affected by the fluctuations of sampling.
6. The formula for the computation of median, in case of grouped frequency distribution, is based on the assumption that the observations in the median class are uniformly distributed. This assumption is rarely met in practice.
7. Since it is not possible to define weighted median like weighted arithmetic mean, this average is not suitable when different items are of unequal importance.
Uses
1. It is an appropriate measure of central tendency when the characteristics are not measurable but different items are capable of being ranked.
2. Median is used to convey the idea of a typical observation of the given data.
3. Median is the most suitable measure of central tendency when the frequency distribution is skewed. For example, income distribution of the people is generally positively skewed and median is the most suitable measure of average in this case.
4. Median is often computed when quick estimates of average are desired.
5. When the given data has class intervals with open ends, median is preferred as a measure of central tendency since it is not possible to calculate mean in this case.
1. What are the merits and demerits of Mean and Median?
2. Find Arithmetic mean of first ten prime numbers.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
$$Q_1 = L_{Q_1} + \frac{\frac{N}{4} - C}{f_{Q_1}} \times h$$
Here, LQ1 is lower limit of the first quartile class, h is its width, fQ1 is its frequency and C is cumulative frequency of classes preceding the first quartile class. By definition, the second quartile is median of the distribution. The third quartile (Q3) of a distribution can also be defined in a similar manner. For a discrete distribution, Q3 is that value of the variate such that at least 75% of the observations are less than or equal to it and at least 25% of the observations are greater than or equal to it. For a grouped frequency distribution, Q3 is that value of the variate such that area under the histogram to the left of the ordinate at Q3 is 75% and the area to its right is 25%. The formula for computation of Q3 can be written as
$$Q_3 = L_{Q_3} + \frac{\frac{3N}{4} - C}{f_{Q_3}} \times h$$
Deciles Deciles divide a distribution into 10 equal parts and there are, in all, 9 deciles denoted as D1, D2, ...... D9 respectively. For a discrete distribution, the i th decile Di is that value of the variate such that at least (10i)% of the observation are less than or equal to it and at least (100 - 10i)% of the observations are greater than or equal to it (i = 1, 2, ...... 9). For a continuous or grouped frequency distribution, D i is that value of the variate such that the area under the histogram to the left of the ordinate at Di is (10i)% and the area to its right is (100 - 10i)%. The formula for the i th decile can be written as
$$D_i = L_{D_i} + \frac{\frac{iN}{10} - C}{f_{D_i}} \times h \qquad (i = 1, 2, \ldots, 9)$$
Percentiles Percentiles divide a distribution into 100 equal parts and there are, in all, 99 percentiles denoted as P1, P2, ...... P25, ...... P40, ...... P60, ...... P99 respectively. For a discrete distribution, the kth percentile Pk is that value of the variate such that at least k% of the observations are less than or equal to it and at least (100 - k)% of the observations are greater than or equal to it. For a grouped frequency distribution, Pk is that value of the variate such that the area under the histogram to the left of the ordinate at Pk is k% and the area to its right is (100 - k)% . The formula for the kth percentile can be written as
$$P_k = L_{P_k} + \frac{\frac{kN}{100} - C}{f_{P_k}} \times h \qquad (k = 1, 2, \ldots, 99)$$
Remarks : (i) We may note here that P25 = Q1, P50 = D5 = Q2 = Md, P75 = Q3, P10 = D1, P20 = D2, etc.
(ii) In continuation of the above, the partition values are known as Quintiles (Octiles) if a distribution is divided in to 5 (8) equal parts. (iii) The formulae for various partition values of a grouped frequency distribution, given so far, are based on 'less than' type cumulative frequencies. The corresponding formulae based on 'greater than' type cumulative frequencies can be written in a similar manner, as given below:
$$Q_1 = U_{Q_1} - \frac{\frac{3N}{4} - C}{f_{Q_1}} \times h, \qquad Q_3 = U_{Q_3} - \frac{\frac{N}{4} - C}{f_{Q_3}} \times h$$
$$D_i = U_{D_i} - \frac{N - \frac{iN}{10} - C}{f_{D_i}} \times h, \qquad P_k = U_{P_k} - \frac{N - \frac{kN}{100} - C}{f_{P_k}} \times h$$
Here $U_{Q_1}, U_{Q_3}, U_{D_i}, U_{P_k}$ are the upper limits of the corresponding classes and C denotes the greater than type cumulative frequencies.
Example 29: Locate Median, Q1, Q3, D4, D7, P15, P60 and P90 from the following data :
Daily Profit (in Rs) :  75  76  77  78  79  80  81  82  83  84  85
No. of Shops         :  15  20  32  35  33  22  20  10   8   3   2
Solution: First we calculate the cumulative frequencies, as in the following table :
Daily Profit (X)  :  75  76  77   78   79   80   81   82   83   84   85
No. of Shops (f)  :  15  20  32   35   33   22   20   10    8    3    2
Less than c.f.    :  15  35  67  102  135  157  177  187  195  198  200
1. Determination of Median: Here $\frac{N}{2} = 100$. From the cumulative frequency column, we note that there are 102 (greater than 50% of the total) observations that are less than or equal to 78 and there are 133 observations that are greater than or equal to 78. Therefore, Md = Rs 78.
2. Determination of Q1 and Q3: First we determine $\frac{N}{4}$, which is equal to 50. From the cumulative frequency column, we note that there are 67 (which is greater than 25% of the total) observations that are less than or equal to 77 and there are 165 (which is greater than 75% of the total) observations that are greater than or equal to 77. Therefore, Q1 = Rs 77. Similarly, Q3 = Rs 80.
3. Determination of D4 and D7: From the cumulative frequency column, we note that there are 102 (greater than 40% of the total) observations that are less than or equal to 78 and there are 133 (greater than 60% of the total) observations that are greater than or equal to 78. Therefore, D4 = Rs 78. Similarly, D7 = Rs 80.
4. Determination of P15, P60 and P90: From the cumulative frequency column, we note that there are 35 (greater than 15% of the total) observations that are less than or equal to 76 and there are 185 (greater than 85% of the total) observations that are greater than or equal to 76. Therefore, P15 = Rs 76. Similarly, P60 = Rs 79 and P90 = Rs 82.
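The definition of a partition value for a discrete distribution (at least the required proportion of observations on either side) can be checked mechanically. The sketch below is illustrative only; the function name `partition_value` is an assumption, and the rule of averaging when two values qualify follows the usual mid-value convention.

```python
from fractions import Fraction
from itertools import accumulate

def partition_value(values, freqs, num, den):
    """Value with at least num/den of the observations <= it and at least (1 - num/den) >= it."""
    cum = list(accumulate(freqs))
    n = cum[-1]
    target = Fraction(num * n, den)            # exact, avoids floating point comparisons
    hits = []
    for i, x in enumerate(values):
        at_most = cum[i]                       # observations less than or equal to x
        at_least = n - (cum[i - 1] if i else 0)  # observations greater than or equal to x
        if at_most >= target and at_least >= n - target:
            hits.append(x)
    return (hits[0] + hits[-1]) / 2            # mid-value if two values satisfy the definition

profits = list(range(75, 86))
shops = [15, 20, 32, 35, 33, 22, 20, 10, 8, 3, 2]
results = {
    "Md": partition_value(profits, shops, 1, 2),
    "Q1": partition_value(profits, shops, 1, 4),
    "Q3": partition_value(profits, shops, 3, 4),
    "D4": partition_value(profits, shops, 4, 10),
    "D7": partition_value(profits, shops, 7, 10),
    "P15": partition_value(profits, shops, 15, 100),
    "P60": partition_value(profits, shops, 60, 100),
    "P90": partition_value(profits, shops, 90, 100),
}
print(results)
```

For the data of Example 29 the sketch returns the same values as the solution above (Md = 78, Q1 = 77, Q3 = 80, D4 = 78, D7 = 80, P15 = 76, P60 = 79, P90 = 82).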
Example 30: Calculate median, quartiles, 3rd and 6th deciles and 40th and 70th percentiles, from the following data:
Wages per Week (in Rs) :  50 - 100  100 - 150  150 - 200  200 - 250  250 - 300
No. of Workers         :     15        40         35         60        125
Wages per Week (in Rs) :  300 - 350  350 - 400  400 - 450  450 - 500
No. of Workers         :     100        70         40         15
Also determine (i) The percentage of workers getting weekly wages between Rs 125 and Rs 260 and (ii) percentage of worker getting wages greater than Rs 340. Solution: First we make a cumulative frequency distribution table :
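Class Intervals   Frequency (f)   c.f. (less than)
  50 - 100             15               15
 100 - 150             40               55
 150 - 200             35               90
 200 - 250             60              150
 250 - 300            125              275
 300 - 350            100              375
 350 - 400             70              445
 400 - 450             40              485
 450 - 500             15              500
(i) Calculation of median: Here N = 500 so that $\frac{N}{2} = 250$. Thus, the median class is 250 - 300 and hence Lm = 250, fm = 125, h = 50 and C = 150. Substituting these values in the formula for median, we get Md = $250 + \frac{250 - 150}{125} \times 50$ = Rs 290.
(ii) Calculation of quartiles:
(a) For Q1, we first find $\frac{N}{4}$, which is equal to 125. The first quartile class is 200 - 250 and hence LQ1 = 200, fQ1 = 60, h = 50 and C = 90.
∴ Q1 = $200 + \frac{125 - 90}{60} \times 50$ = Rs 229.17
(b) For Q3, we first find $\frac{3N}{4}$, which is equal to 375. The third quartile class is 300 - 350 and hence LQ3 = 300, fQ3 = 100, h = 50 and C = 275.
∴ Q3 = $300 + \frac{375 - 275}{100} \times 50$ = Rs 350
(iii) Calculation of deciles:
(a) For D3, we first find $\frac{3N}{10}$, which is equal to 150. The third decile class is 200 - 250 and hence LD3 = 200, fD3 = 60, h = 50 and C = 90.
∴ D3 = $200 + \frac{150 - 90}{60} \times 50$ = Rs 250
(b) For D6, we first find $\frac{6N}{10}$, which is equal to 300. The sixth decile class is 300 - 350 and hence LD6 = 300, fD6 = 100, h = 50 and C = 275.
∴ D6 = $300 + \frac{300 - 275}{100} \times 50$ = Rs 312.50
(iv) Calculation of percentiles:
(a) For P40, we first find $\frac{40N}{100}$, which is equal to 200. The 40th percentile class is 250 - 300 and hence LP40 = 250, fP40 = 125, h = 50 and C = 150.
∴ P40 = $250 + \frac{200 - 150}{125} \times 50$ = Rs 270
(b) For P70, we first find $\frac{70N}{100}$, which is equal to 350. The 70th percentile class is 300 - 350 and hence LP70 = 300, fP70 = 100, h = 50 and C = 275.
∴ P70 = $300 + \frac{350 - 275}{100} \times 50$ = Rs 337.50
(v) Determination of percentage of workers getting wages between Rs 125 and Rs 260: Let x be the percentage of workers getting wages less than 125. Since 125 lies in the class 100 - 150, this is the xth percentile class. Using the formula for the xth percentile we have
$$125 = 100 + \frac{\frac{x \times 500}{100} - 15}{40} \times 50, \quad \text{or } 5x - 15 = 20, \text{ which gives } x = 7.$$
Similarly, let y be the percentage of workers getting wages less than 260. Since 260 lies in the class 250 - 300, this is the yth percentile class, and
$$260 = 250 + \frac{\frac{y \times 500}{100} - 150}{125} \times 50, \quad \text{or } 5y - 150 = 25, \text{ which gives } y = 35.$$
Hence percentage of workers getting wages between Rs 125 and Rs 260 is given by 35 - 7 = 28%.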
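Alternative Method
The number of workers getting wages between 125 and 260 can be written directly as
$$\frac{150 - 125}{50} \times 40 + 35 + 60 + \frac{260 - 250}{50} \times 125 = 20 + 35 + 60 + 25 = 140.$$
∴ Percentage of workers = $\frac{140}{500} \times 100 = 28\%$.
(vi) Determination of percentage of workers getting wages greater than Rs 340: Since we have already computed 'less than' type cumulative frequencies, in the above table, we shall first find the percentage of workers getting wages less than 340. Let x be this percentage. Also, the xth percentile class is 300 - 350.
$$\therefore \; 340 = 300 + \frac{\frac{x \times 500}{100} - 275}{100} \times 50, \quad \text{or } 5x - 275 = 80, \text{ which gives } x = 71.$$
Hence, percentage of workers getting wages greater than Rs 340 is (100 - 71) = 29%.
Alternative Method
This percentage can also be obtained directly as shown below. The number of workers getting wages greater than Rs 340 =
$$\frac{350 - 340}{50} \times 100 + 70 + 40 + 15 = 145.$$
∴ Percentage = $\frac{145}{500} \times 100 = 29\%$.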
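The calculations of Example 30 all use one interpolation rule, so they can be sketched with two small functions. The names `partition_value` and `percent_below` are assumptions for this illustration, and the sketch assumes equal class widths and a uniform spread of observations within each class.

```python
def partition_value(lowers, h, freqs, num, den):
    """Value below which num/den of the observations lie (grouped data, class width h)."""
    target = num * sum(freqs) / den
    c = 0                                  # cumulative frequency of the preceding classes
    for low, f in zip(lowers, freqs):
        if f and c + f >= target:          # class containing the partition value
            return low + (target - c) / f * h
        c += f

def percent_below(lowers, h, freqs, x):
    """Percentage of observations below x, interpolating inside the class containing x."""
    count = 0.0
    for low, f in zip(lowers, freqs):
        if x >= low + h:                   # whole class lies below x
            count += f
        elif x > low:                      # x falls inside this class
            count += (x - low) / h * f
    return 100 * count / sum(freqs)

lowers = list(range(50, 500, 50))          # 50 - 100, 100 - 150, ..., 450 - 500
freqs = [15, 40, 35, 60, 125, 100, 70, 40, 15]
md = partition_value(lowers, 50, freqs, 1, 2)
q1 = partition_value(lowers, 50, freqs, 1, 4)
q3 = partition_value(lowers, 50, freqs, 3, 4)
d3 = partition_value(lowers, 50, freqs, 3, 10)
p70 = partition_value(lowers, 50, freqs, 70, 100)
between = percent_below(lowers, 50, freqs, 260) - percent_below(lowers, 50, freqs, 125)
above_340 = 100 - percent_below(lowers, 50, freqs, 340)
print(md, round(q1, 2), q3, d3, p70, round(between, 2), round(above_340, 2))
```

The function `percent_below` is simply the percentile formula read in reverse: instead of asking which wage corresponds to a given percentage, it asks which percentage corresponds to a given wage.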
Example 31: From the following table, showing the wage distribution of workers, find
(i) the range of incomes earned by middle 50% of the workers,
(ii) the range of incomes earned by middle 80% of the workers,
(iii) the percentage of workers earning between Rs 550 and Rs 880.
Monthly Income (Rs) :  0 - 200  0 - 400  0 - 600  0 - 800  0 - 1000
No. of Workers      :    150      250      330      380       400
Solution: The above table gives a 'less than' type cumulative frequency distribution. Therefore, we can rewrite the above table as :
Monthly Income (Rs) :  0 - 200  200 - 400  400 - 600  600 - 800  800 - 1000
c.f. (less than)    :    150       250        330        380        400
Frequency (f)       :    150       100         80         50         20
(i) The range of incomes earned by middle 50% of the workers is given by Q3 - Q1.
Now Q1 = $0 + \frac{100 - 0}{150} \times 200$ = Rs 133.33
and Q3 = $400 + \frac{300 - 250}{80} \times 200$ = Rs 525.
Thus, Q3 - Q1 = 525 - 133.33 = Rs 391.67.
(ii) The range of incomes of middle 80% of the workers is given by P90 - P10.
Now P10 = $0 + \frac{\frac{10 \times 400}{100} - 0}{150} \times 200$ = Rs 53.33
and P90 = $600 + \frac{\frac{90 \times 400}{100} - 330}{50} \times 200$ = Rs 720.
Thus, P90 - P10 = 720 - 53.33 = Rs 666.67.
(iii) The No. of workers earning between Rs 550 and Rs 880 is given by
$$\frac{600 - 550}{200} \times 80 + 50 + \frac{880 - 800}{200} \times 20 = 78.$$
∴ Percentage of workers = $\frac{78}{400} \times 100 = 19.5\%$
Example 32: The following incomplete table gives the number of students in different age groups of a town. If the median of the distribution is 11 years, find out the missing frequencies.
Age Group       :  0 - 5  5 - 10  10 - 15  15 - 20  20 - 25  25 - 30  Total
No. of Students :   15     125       ?       66        ?        4      300
Solution: Let x be the frequency of age group 10 - 15. Then the frequency of the age group 20 - 25 will be 300 - (15 + 125 + x + 66 + 4) = 90 - x. Making a cumulative frequency table we have
Age Groups   No. of Students   c.f. (less than)
  0 - 5            15                 15
  5 - 10          125                140
 10 - 15           x                 140 + x
 15 - 20           66                206 + x
 20 - 25         90 - x              296
 25 - 30            4                300
Here $\frac{N}{2} = \frac{300}{2} = 150$. Since median is given as 11, the median class is 10 - 15.
Hence, $11 = 10 + \frac{150 - 140}{x} \times 5$ or x = 50, so that the frequency of the age group 20 - 25 is 90 - 50 = 40.
Calculate Md, Q1, Q3, D4, D7, P20, P45, and P95.
Hint: Arrange the data in ascending or descending order of magnitude and then calculate various values. For calculation of Q1 there are two values satisfying the definition. These two values are 82 and 84. Thus, Q1 can be any value in the closed interval [82, 84]. By convention, the mid-value of the interval is taken as Q1. 2. Calculate the value of Md, Q1, Q3, D2, D8, P35, P48, and P68, from the following data:
Classes : below 10 10 -15 15- 20 20 - 25 25- 30 30 - 35 35- 40 40 - 45 45- 50 Frequency : 1 2 5 7 10 7 5 2 1
Hint: See example 30. 3. Find median from the following data:
Wages/Week ( Rs ) : 50 - 59 60 - 69 70 - 79 80 - 89 90 - 99 100 - 109 110 - 119 No . of Workers : 15 40 50 60 45 40 15
Hint: This is a distribution with inclusive class intervals. To compute median, these are to be converted into exclusive intervals like 49.5 - 59.5, 59.5 - 69.5, etc. 4. The following table gives the distribution of wages of 65 employees in a factory :
Wages ( )
No. of employees : 65 57 47 31 17
Draw a 'less than type' ogive from the above data and estimate the number of employees earning at least Rs 63 but less than Rs 75. Hint: To draw a 'less than' type ogive, the distribution is to be converted into 'less than' type cumulative frequencies. 5. The following table shows the age distribution of persons in a particular region:
Age ( years) Below 10 Below 20 Below 30 Below 40 Below 50 Below 60 Below 70 70 and above No. of Persons (' 000) 2 5 9 12 14 15 15. 5 15. 6
(i)
(ii) Why is the median a more suitable measure of central tendency than mean in this case?
Hint: Median is suitable here because the upper limit of the last class is not known and therefore, mean cannot be satisfactorily calculated.
6. A number of particular articles have been classified according to their weights. After drying for two weeks the same articles have again been weighed and similarly classified. It is known that median weight in the first weighing was 20.83 oz. while in second weighing it was 17.35 oz. Some frequencies a and b in the first weighing and x and y in the second weighing, are missing. It is known that $a = \frac{1}{3}x$ and $b = \frac{1}{2}y$. Find the missing frequencies.
Classes                  :  0 - 5  5 - 10  10 - 15  15 - 20  20 - 25  25 - 30
Frequency (1st weighing) :    a       b       11       52       75       22
Frequency (2nd weighing) :    x       y       40       50       30       28
Hint:
$$20.83 = 20 + \frac{\frac{160 + a + b}{2} - (63 + a + b)}{75} \times 5 \qquad \text{.... (1)}$$
$$\text{and} \quad 17.35 = 15 + \frac{\frac{148 + x + y}{2} - (40 + x + y)}{50} \times 5 \qquad \text{.... (2)}$$
Put x = 3a and y = 2b in equation (2) and solve (1) and (2) simultaneously.
7. The percentage distribution of regularly employed workers who commute between home and work place by foot and those who use cycles, according to the distance is given below. How will you find the mean distance and the median distance of the walkers and cyclists? State your assumptions carefully.
Distance in kms   Walkers   Cyclists
less than 1/4       45.3       1.1
1/4 - 1/2           21.1       6.0
1/2 - 1             15.2       9.6
1 - 2                9.8      17.9
2 - 3                5.3      20.5
3 - 4                2.2      19.2
4 - 5                0.6      15.2
above 5              0.5      10.5
Hint: The given percentage of walkers and cyclists can be taken as frequencies. For calculation of mean, the necessary assumption is that the width of the first class is equal to the width of the following class, i.e., $\frac{1}{4}$. On this assumption, the lower limit of the first class can be taken as 0. Similarly, on the assumption that width of the last class is equal to the width of last but one class, the upper limit of last class can be taken as 6. No assumption is needed for the calculation of median.
8. In a factory employing 3,000 persons, 5 percent earn less than Rs 3 per hour, 580 earn Rs 3.01 to 4.50 per hour, 30 percent earn from Rs 4.51 to Rs 6.00 per hour, 500 earn from 6.01 to Rs 7.5 per hour, 20 percent earn from Rs 7.51 to Rs 9.00 per hour and the rest earn Rs 9.01 or more per hour. What is the median wage?
Hint: Write down the above information in the form of a frequency distribution. The class intervals given above are inclusive type. These should be converted into exclusive type for the calculation of median. 9. The distribution of 2,000 houses of a new locality according to their distance from a milk booth is given in the following table :
Distance Distance No . of No . of ( in metres ) Houses ( in metres ) Houses 0 - 50 20 350 - 400 275 50 - 100 30 400 - 450 400 100 - 150 35 450 - 500 325 150 - 200 46 500 - 550 205 200 - 250 50 550 - 600 184 250 - 300 105 600 - 650 75 300 - 350 200 650 - 700 50
(i)
(ii) In second phase of the construction of the locality, 500 additional houses were constructed out of which the distances of 200, 150 and 150 houses from the milk booth were in the intervals 450 - 500, 550 - 600 and 650 - 700 meters respectively. Calculate the median distance, taking all the 2500 houses into account.
Hint:
Add 200, 150 and 150 to the respective frequencies of the class intervals 450 - 500, 550 - 600 and 650 - 700.
10. The monthly salary distribution of 250 families in a certain locality of Agra is given below.
Monthly Salary more than 0 more than 500 more than 1000 more than 1500 No. of Families 250 200 120 80 Monthly Salary more than 2000 more than 2500 more than 3000 more than 3500 No. of Families 55 30 15 5
Draw a less than ogive for the data given above and hence find out : (i) The limits of the income of the middle 50% of the families. (ii) If income tax is to be levied on families whose income exceeds Rs 1800 p.m., calculate the percentage of families which will be paying income tax. Hint: 11. See example 23. The following table gives the frequency distribution of marks of 800 candidates in an examination :
Marks No. of candidates Marks No. of candidates : : : : 0 - 10 10 50 - 60 130 10 - 20 40 60 - 70 100 20 - 30 80 70 - 80 70 30 - 40 140 80 - 90 40 40 - 50 170 90 - 100 20
Draw 'less than' and 'more than' type ogives for the above data and answer the following from the graph : (i) (ii) If the minimum marks required for passing are 35, what percentage of candidates pass the examination? It is decided to allow 80% of the candidate to pass, what should be the minimum marks for passing?
(iii) Find the median of the distribution. Hint: See example 28. 12. Following are the marks obtained by a batch of 10 students in a certain class test in statistics (X) and accountancy (Y).
Roll No. X Y : : : 1 63 68 2 64 66 3 62 35 4 32 42 5 30 26 6 60 85 7 47 44 8 46 80 9 35 33 10 28 72
In which subject the level of knowledge of student is higher? Hint: Compare median of the two series. 13. The mean and median marks of the students of a class are 50% and 60% respectively. Is it correct to say that majority of the students have secured more than 50% marks? Explain. Hint: It is given that at least 50% of the students have got 60% or more marks. 14. The monthly wages of 7 workers of a factory are : Rs 1,000, Rs 1,500, Rs 1,700, Rs 1,800, Rs 1,900, Rs 2,000 and Rs 3,000. Compute mean and median. Which measure is more appropriate? Which measure would you use to describe the situation if you were (i) a trade union leader, (ii) an employer? Hint: (i) median, (ii) mean. 15. A boy saves Re. 1 on the first day, Rs 2 on the second day, ...... Rs 31 on the 31st day of a particular month. Compute the mean and median of his savings per day. If his father contributes Rs 10 and Rs 100 on the 32nd and 33rd day respectively, compute mean and median of his savings per day. Comment upon the results.
2.8 MODE
Mode is that value of the variate which occurs maximum number of times in a distribution and around which other items are densely distributed. In the words of Croxton and Cowden, The mode of a distribution is the value at the point around which the items tend to be most heavily concentrated. It may be regarded the most typical of a series of values. Further, according to A.M. Tuttle, Mode is the value which has the greatest frequency density in its immediate neighbourhood. If the frequency distribution is regular, then mode is determined by the value corresponding to maximum frequency. There may be a situation where concentration of observations around a value having maximum frequency is less than the concentration of observations around some other value. In such a situation, mode cannot be determined by the use of maximum frequency criterion. Further, there may be concentration of observations around more than one value of the variable and, accordingly, the distribution is said to be bimodal or multi-modal depending upon whether it is around two or more than two values. The concept of mode, as a measure of central tendency, is preferable to mean and median when it is desired to know the most typical value, e.g., the most common size of shoes, the most common size of a ready-made garment, the most common size of income, the most common size of pocket expenditure of a college student, the most common size of a family in a locality, the most common duration of cure of viral-fever, the most popular candidate in an election, etc.
Determination of Mode
(a) When data are either in the form of individual observations or in the form of ungrouped frequency distribution
Given individual observations, these are first transformed into an ungrouped frequency distribution. The mode of an ungrouped frequency distribution can be determined in two ways, as given below :
(i) By inspection, or
(ii) By method of grouping.
(i) By inspection: When a frequency distribution is fairly regular, then mode is often determined by inspection. It is that value of the variate for which frequency is maximum. By a fairly regular frequency distribution we mean that as the values of the variable increase, the corresponding frequencies of these values first increase in a gradual manner, reach a peak at a certain value and, finally, start declining gradually in approximately the same manner as in the case of increase.
Example 33: Find the mode of the following observations :
3, 4, 5, 10, 15, 3, 6, 7, 9, 12, 10, 16, 18, 20, 10, 9, 8, 19, 11, 14, 10, 13, 17, 9, 11
Solution: Writing this in the form of a frequency distribution, we get
Values    :  3  4  5  6  7  8  9  10  11  12  13  14  15  16  17  18  19  20
Frequency :  2  1  1  1  1  1  3   4   2   1   1   1   1   1   1   1   1   1
∴ Mode = 10
Remarks :
(i) If the frequency of each possible value of the variable is same, there is no mode.
(ii) If there are two values having maximum frequency, the distribution is said to be bimodal.
Solution: The given distribution is fairly regular. Therefore, the mode can be determined just by inspection. Since for X = 25 the frequency is maximum, mode = 25. (ii) By method of Grouping: This method is used when the frequency distribution is not regular. Let us consider the following example to illustrate this method.
Solution: This distribution is not regular because there is sudden increase in frequency from 20 to 100. Therefore, mode cannot be located by inspection and hence the method of grouping is used. Various steps involved in this method are as follows :
(i) Prepare a table consisting of 6 columns in addition to a column for various values of X.
(ii) In the first column, write the frequencies against various values of X as given in the question.
(iii) In the second column, the sums of frequencies, starting from the top and grouped in twos, are written.
(iv) In the third column, the sums of frequencies, starting from the second and grouped in twos, are written.
(v) In the fourth column, the sums of frequencies, starting from the top and grouped in threes, are written.
(vi) In the fifth column, the sums of frequencies, starting from the second and grouped in threes, are written.
(vii) In the sixth column, the sums of frequencies, starting from the third and grouped in threes, are written.
The highest frequency total in each of the six columns is identified and analysed to determine mode. We apply this method for determining mode of the above example.
Analysis Table
                    V A R I A B L E
Columns   10  11  12  13  14  15  16  17  18  19
   1                   1
   2                       1   1
   3                   1   1
   4                   1   1   1
   5                       1   1   1
   6                           1   1   1
 Total     0   0   0   3   4   4   2   1   0   0
Since the value 14 and 15 are both repeated maximum number of times in the analysis table, therefore, mode is ill defined. Mode in this case can be approximately located by the use of the following formula, which will be discussed later, in this chapter. Mode = 3 Median - 2 mean
Calculation of Median and Mean
X     :  10   11   12    13    14    15    16    17   18   19   Total
f     :   8   15   20   100    98    95    90    75   50   30     581
c.f.  :   8   23   43   143   241   336   426   501  551  581
fX    :  80  165  240  1300  1372  1425  1440  1275  900  570    8767
Median = Size of the $\left[\frac{581 + 1}{2}\right]$th, i.e., 291st observation = 15. Mean = $\frac{8767}{581} = 15.09$
∴ Mode = 3 × 15 - 2 × 15.09 = 45 - 30.18 = 14.82
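The arithmetic above can be reproduced with a short sketch. The function name `empirical_mode` is an assumption for this illustration; the totals N = 581 and ΣfX = 8767 are the ones used in the calculation, and rounding the mean to two decimals mirrors the hand working.

```python
def empirical_mode(mean, median):
    """Approximate mode from the relation Mode = 3 Median - 2 Mean."""
    return 3 * median - 2 * mean

total_fx, n = 8767, 581
mean = round(total_fx / n, 2)      # 15.09, rounded as in the hand calculation
median = 15                        # size of the 291st observation
mode = empirical_mode(mean, median)
print(mean, round(mode, 2))
```

The relation gives only an approximate location of the mode; it is used here because the analysis table does not single out one value.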
Remarks: If the most repeated values, in the above analysis table, were not adjacent, the distribution would have been bi-modal, i.e., having two modes Example 36: From the following data regarding weights of 60 students of a class, find modal weight :
Weight           :  50  51  52  53  54  55  56  57  58  59  60
No. of Students  :   2   4   5   6   8   5   4   7  11   5   3
Solution: Since the distribution is not regular, method of grouping will be used for determination of mode.
Grouping Table
Analysis Table

Weights   Columns:  1  2  3  4  5  6   Total
50                  -  -  -  -  -  -     0
51                  -  -  -  -  -  -     0
52                  -  -  -  -  -  1     1
53                  -  -  -  -  -  1     1
54                  -  -  -  -  -  1     1
55                  -  -  -  -  -  -     0
56                  -  -  -  1  -  -     1
57                  -  -  1  1  1  -     3
58                  1  1  1  1  1  1     6
59                  -  1  -  -  1  1     3
60                  -  -  -  -  -  1     1
Since the value 58 has occurred maximum number of times, therefore, mode of the distribution is 58 kgs.
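The grouping and analysis tables can also be built mechanically. The sketch below is an illustrative Python version of the method of grouping, applied to the weights of Example 36. The function name `grouping_mode` is hypothetical, and the sketch assumes that only complete groups are summed and that, on a tie within a column, every group with the highest total is marked.

```python
def grouping_mode(values, freqs):
    """Tally how often each value falls in a highest-total group (columns 1 to 6)."""
    n = len(freqs)
    tally = {v: 0 for v in values}
    # (group size, starting index) for the six columns of the grouping table
    columns = [(1, 0), (2, 0), (2, 1), (3, 0), (3, 1), (3, 2)]
    for size, start in columns:
        # Only complete groups are summed; leftover values are ignored.
        groups = [(sum(freqs[i:i + size]), i)
                  for i in range(start, n - size + 1, size)]
        best = max(total for total, _ in groups)
        for total, i in groups:
            if total == best:  # on a tie, every highest group is marked
                for v in values[i:i + size]:
                    tally[v] += 1
    top = max(tally.values())
    return tally, [v for v in values if tally[v] == top]


weights = list(range(50, 61))
students = [2, 4, 5, 6, 8, 5, 4, 7, 11, 5, 3]
tally, modes = grouping_mode(weights, students)
print(tally)   # totals column of the analysis table
print(modes)   # value(s) marked the maximum number of times
```

If more than one value is returned, the mode is ill-defined by this method and an approximate formula has to be used instead.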
(b) When data are in the form of a grouped frequency distribution
The following steps are involved in the computation of mode from a grouped frequency distribution. (i) Determination of modal class: It is the class in which mode of the distribution lies. If the distribution is regular, the modal class can be determined by inspection, otherwise, by method of grouping.
(ii) Exact location of mode in a modal class (interpolation formula): The exact location of mode, in a modal class, will depend upon the frequencies of the classes immediately preceding and following it. If these frequencies are equal, the mode would lie at the middle of the modal class interval. However, the position of mode would be to the left or to the right of the middle point depending upon whether the frequency of the preceding class is greater or less than the frequency of the class following it. The exact location of mode can be found by the use of the interpolation formula, developed below :

Let the modal class be denoted by $L_m - U_m$, where $L_m$ and $U_m$ denote its lower and upper limits respectively. Further, let $f_m$ be its frequency and $h$ its width. Also let $f_1$ and $f_2$ be the respective frequencies of the immediately preceding and following classes.
Figure 2.4
We assume that the widths of all the class intervals of the distribution are equal. If these are not equal, make them so by regrouping under the assumption that frequencies in a class are uniformly distributed. Make a histogram of the frequency distribution with height of each rectangle equal to the frequency of the corresponding class. Only three rectangles, out of the complete histogram, that are necessary for the purpose are shown in the above figure.

Let $\Delta_1 = f_m - f_1$ and $\Delta_2 = f_m - f_2$. Then the mode, denoted by $M_o$, will divide the modal class interval in the ratio $\frac{\Delta_1}{\Delta_2}$. The graphical location of mode is shown in Fig. 2.4. To derive a formula for mode, the point $M_o$ in the figure should be such that

$$\frac{M_o - L_m}{U_m - M_o} = \frac{\Delta_1}{\Delta_2}$$

Since $U_m = L_m + h$, solving this for $M_o$ gives

$$M_o = L_m + \frac{\Delta_1}{\Delta_1 + \Delta_2} \times h \qquad \ldots (1)$$
By slight adjustment, the above formula can also be written in terms of the upper limit (Um) of the modal class.
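$$M_o = U_m - h + \frac{\Delta_1}{\Delta_1 + \Delta_2} \times h = U_m - \left[1 - \frac{\Delta_1}{\Delta_1 + \Delta_2}\right] h = U_m - \frac{\Delta_2}{\Delta_1 + \Delta_2} \times h \qquad \ldots (2)$$

Replacing $\Delta_1$ by $f_m - f_1$ and $\Delta_2$ by $f_m - f_2$, the above formulae can also be written as

$$M_o = L_m + \frac{f_m - f_1}{2f_m - f_1 - f_2} \times h \qquad \ldots (3)$$

and

$$M_o = U_m - \frac{f_m - f_2}{2f_m - f_1 - f_2} \times h \qquad \ldots (4)$$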
Note: The above formulae are applicable only to a unimodal frequency distribution. Example 37: The monthly profits (in Rs) of 100 shops are distributed as follows :
Profit per Shop  :  0 - 100   100 - 200   200 - 300   300 - 400   400 - 500   500 - 600
No. of Shops     :    12         18          27          20          17           6
Determine the 'modal value' of the distribution graphically and verify the result by calculation. Solution: Since the distribution is regular, the modal class would be a class having the highest frequency. The modal class, of the given distribution, is 200 - 300.
Graphical Location of Mode

To locate mode we draw a histogram of the given frequency distribution. The mode is located as shown in Fig. 2.5. From the figure, mode = Rs 256.

Figure 2.5

Determination of Mode by interpolation formula

Since the modal class is 200 - 300, $L_m = 200$, $\Delta_1 = 27 - 18 = 9$, $\Delta_2 = 27 - 20 = 7$ and $h = 100$.

$$\therefore M_o = 200 + \frac{9}{9 + 7} \times 100 = 200 + 56.25 = \text{Rs } 256.25$$
Example 38: The frequency distribution of marks obtained by 60 students of a class in a college is given below :
Marks      :  30 - 34   35 - 39   40 - 44   45 - 49   50 - 54   55 - 59   60 - 64
Frequency  :     3         5        12        18        14         6         2
Find mode of the distribution. Solution: The given class intervals are first converted into class boundaries, as given in the following table :
Marks           Frequency
29.5 - 34.5         3
34.5 - 39.5         5
39.5 - 44.5        12
44.5 - 49.5        18
49.5 - 54.5        14
54.5 - 59.5         6
59.5 - 64.5         2

We note that the distribution is regular. Thus, the modal class, by inspection, is 44.5 - 49.5. Further, $L_m = 44.5$, $\Delta_1 = 18 - 12 = 6$, $\Delta_2 = 18 - 14 = 4$ and $h = 5$.

$$\therefore \text{Mode} = 44.5 + \frac{6}{6 + 4} \times 5 = 44.5 + 3 = 47.5 \text{ marks}$$
Solution: Since the frequency distribution is not regular, the modal class will be determined by the method of grouping.
Grouping Table
Analysis Table
Class        Total
300 - 350      1
350 - 400      3
400 - 450      6
450 - 500      3
500 - 550      1
The modal class, from the analysis table, is 400 - 450. Thus, $L_m = 400$, $\Delta_1 = 33 - 12 = 21$, $\Delta_2 = 33 - 17 = 16$ and $h = 50$.

$$\text{Hence, mode} = 400 + \frac{21}{37} \times 50 = \text{Rs } 428.38$$
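The interpolation formula is easy to check numerically. The following Python sketch is illustrative only: the function name `mode_interpolation` and its parameter names are hypothetical, and it assumes equal class widths and a modal class whose frequency exceeds that of both neighbouring classes.

```python
def mode_interpolation(lower, width, f_prev, f_modal, f_next):
    """Mode = Lm + d1 / (d1 + d2) * h, with d1 = fm - f1 and d2 = fm - f2."""
    d1 = f_modal - f_prev
    d2 = f_modal - f_next
    if d1 <= 0 or d2 <= 0:
        # The formula is only meant for a modal class that dominates both neighbours.
        raise ValueError("both differences must be positive")
    return lower + d1 / (d1 + d2) * width


# Modal class 200 - 300 with neighbouring frequencies 18 and 20 (profits of shops)
print(mode_interpolation(200, 100, 18, 27, 20))
# Modal class starting at 400 with h = 50 and frequencies 12, 33, 17
print(round(mode_interpolation(400, 50, 12, 33, 17), 2))
```

Writing the formula in terms of the lower limit keeps the function to a single expression; the upper-limit form gives the same value.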
Weights (lbs.)    Cumulative frequency
below 100                 4
below 110                 6
below 120                24
below 130                46
below 140                67
below 150                86
below 160                96
below 170                99
below 180               100
Solution: Rewriting the above distribution in the form of a frequency distribution with class limits, we get
Weights (lbs.)     Frequency
Less than 100          4
100 - 110              2
110 - 120             18
120 - 130             22
130 - 140             21
140 - 150             19
150 - 160             10
160 - 170              3
170 - 180              1
We note that there is a concentration of observations in classes 120 - 130 and 130 - 140, therefore, modal class can be determined by the method of grouping.
Grouping Table
Analysis Table

Class        Columns:  1  2  3  4  5  6   Total
110 - 120              -  1  -  -  -  1     2
120 - 130              1  1  1  1  -  1     5
130 - 140              -  1  1  1  1  1     5
140 - 150              -  1  -  1  1  -     3
150 - 160              -  -  -  -  1  -     1
Since the two classes, 120 - 130 and 130 - 140, are repeated the maximum number of times in the above table, it is not possible to locate the modal class even by the method of grouping. However, an approximate value of mode is given by the empirical formula:

Mode = 3 Median - 2 Mean (See 2.9)

Looking at the cumulative frequency column, given in the question, the median class is 130 - 140. Thus, $L_m = 130$, $C = 46$, $f_m = 21$, $h = 10$.

$$\therefore M_d = 130 + \frac{50 - 46}{21} \times 10 = 131.9 \text{ lbs.}$$
Assuming that the width of the first class is equal to the width of second, we can write
Mid-Values (X)     f     u = (X - 135)/10     fu
95                 4          -4              -16
105                2          -3               -6
115               18          -2              -36
125               22          -1              -22
135               21           0                0
145               19           1               19
155               10           2               20
165                3           3                9
175                1           4                4
Total            100                          -28

$$\text{Thus, } \bar{X} = 135 - \frac{28}{100} \times 10 = 132.2 \text{ lbs.}$$
Using the values of mean and median, we get $M_o = 3 \times 131.9 - 2 \times 132.2 = 131.3$ lbs.

Remarks: Another situation, in which we can use the empirical formula, rather than the interpolation formula, is when there is maximum frequency either in the first or in the last class.

Calculation of Mode when either $\Delta_1$ or $\Delta_2$ is negative

The interpolation formula, for the calculation of mode, is applicable only if both $\Delta_1$ and $\Delta_2$ are positive. If either $\Delta_1$ or $\Delta_2$ is negative, we use an alternative formula that gives only an approximate value of the mode.

We recall that the position of mode, in a modal class, depends upon the frequencies of its preceding and following classes, denoted by $f_1$ and $f_2$ respectively. If $f_1 = f_2$, the mode will be at the middle point, which can be obtained by adding $\frac{f_2}{f_1 + f_2} \times h$ to the lower limit of the modal class or, equivalently, by subtracting $\frac{f_1}{f_1 + f_2} \times h$ from its upper limit. We may note that $\frac{f_1}{f_1 + f_2} = \frac{f_2}{f_1 + f_2} = \frac{1}{2}$ when $f_1 = f_2$.

Further, if $f_2 > f_1$, the mode will lie to the right of the mid-value of the modal class and, therefore, the ratio $\frac{f_2}{f_1 + f_2}$ will be greater than $\frac{1}{2}$. Similarly, if $f_2 < f_1$, the mode will lie to the left of the mid-value of the modal class and, therefore, the ratio $\frac{f_2}{f_1 + f_2}$ will be less than $\frac{1}{2}$. Thus, we can write an alternative formula for mode as :

$$\text{Mode} = L_m + \frac{f_2}{f_1 + f_2} \times h$$

or equivalently,

$$\text{Mode} = U_m - \frac{f_1}{f_1 + f_2} \times h$$
Remarks: The above formula gives only an approximate estimate of mode vis-a-vis the interpolation formula. Example 41: Calculate mode of the following distribution.
Mid-Values  :   5   15   25   35   45   55   65   75
Frequency   :   7   15   18   30   31    4    3    1
Solution: The mid-values with equal gaps are given, therefore, the corresponding class intervals would be 0 - 10, 10 - 20, 20 - 30, etc. Since the given frequency distribution is not regular, the modal class will be determined by the method of grouping.
Grouping Table
Analysis Table

Class      Columns:  1  2  3  4  5  6   Total
10 - 20              -  -  -  -  1  -     1
20 - 30              -  1  -  -  1  1     3
30 - 40              -  1  1  1  1  1     5
40 - 50              1  -  1  1  -  1     4
50 - 60              -  -  -  1  -  -     1
From the analysis table, the modal class is 30 - 40. Therefore, $L_m = 30$, $\Delta_1 = 30 - 18 = 12$, $\Delta_2 = 30 - 31 = -1$ (negative) and $h = 10$. We note that the interpolation formula is not applicable.

$$\text{Mode} = L_m + \frac{f_2}{f_1 + f_2} \times h = 30 + \frac{31}{18 + 31} \times 10 = 36.33$$
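As a quick numerical check of the alternative formula, the sketch below evaluates both of its forms for the figures of this example. It is an illustration only; the function names `approximate_mode` and `approximate_mode_upper` are hypothetical.

```python
def approximate_mode(lower, width, f_prev, f_next):
    """Approximate mode = Lm + f2 / (f1 + f2) * h."""
    return lower + f_next / (f_prev + f_next) * width


def approximate_mode_upper(upper, width, f_prev, f_next):
    """Equivalent form measured from the upper limit: Um - f1 / (f1 + f2) * h."""
    return upper - f_prev / (f_prev + f_next) * width


# Modal class 30 - 40 with f1 = 18 and f2 = 31
from_lower = approximate_mode(30, 10, 18, 31)
from_upper = approximate_mode_upper(40, 10, 18, 31)
print(round(from_lower, 2), round(from_upper, 2))
```

Both forms give the same value, because the two fractions add up to one.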
Example 42: The rate of sales tax as a percentage of sales, paid by 400 shopkeepers of a market during an assessment year ranged from 0 to 25%. The sales tax paid by 18% of them was not greater than 5%. The median rate of sales tax was 10% and 75th percentile rate of sales tax was 15%. If only 8% of the shopkeepers paid sales tax at a rate greater than 20% but not greater than 25%, summarise the information in the form of a frequency distribution taking intervals of 5%. Also find the modal rate of sales tax.
Solution: The above information can be written in the form of the following distribution :
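Rate of Sales Tax (%)     No. of Shopkeepers
0 - 5                     (18/100) × 400 = 72
5 - 10                    200 - 72 = 128
10 - 15                   300 - 200 = 100
15 - 20                   400 - 72 - 128 - 100 - 32 = 68
20 - 25                   (8/100) × 400 = 32

By inspection, the modal class is 5 - 10. Here $\Delta_1 = 128 - 72 = 56$, $\Delta_2 = 128 - 100 = 28$ and $h = 5$.

$$\therefore M_o = 5 + \frac{56}{56 + 28} \times 5 = 5 + 3.33 = 8.33\%$$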
Example 43: The following table gives the incomplete income distribution of 300 workers of a firm, where the frequencies of the classes 3000 - 4000 and 5000 - 6000 are missing. If the mode of the distribution is Rs 4428.57, find the missing frequencies.
Monthly Income (Rs)     No. of Workers
1000 - 2000                  30
2000 - 3000                  35
3000 - 4000                   ?
4000 - 5000                  75
5000 - 6000                   ?
6000 - 7000                  30
7000 - 8000                  15
Solution: Let the frequency of the class 3000 - 4000 be $f_1$. Then the frequency of the class 5000 - 6000 will be equal to $300 - 30 - 35 - f_1 - 75 - 30 - 15 = 115 - f_1$. It is given that mode = 4428.57, therefore, the modal class is 4000 - 5000. Thus, $L_m = 4000$, $\Delta_1 = 75 - f_1$, $\Delta_2 = 75 - (115 - f_1) = f_1 - 40$ and $h = 1000$. Using the interpolation formula, we have

$$4428.57 = 4000 + \frac{75 - f_1}{75 - f_1 + f_1 - 40} \times 1000$$

or

$$428.57 = \frac{75 - f_1}{35} \times 1000 \quad \text{or} \quad 14.999 = 75 - f_1$$

Thus, $f_1 = 60$ (to the nearest whole number) and the frequency of the class 5000 - 6000 is $115 - 60 = 55$.
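The same rearrangement can be scripted. The sketch below is a minimal illustration, assuming the interpolation formula and a known total for the two missing frequencies; the function name `missing_frequencies` and its parameters are hypothetical.

```python
def missing_frequencies(mode, lower, width, f_modal, neighbours_total):
    """Solve mode = Lm + (fm - f1) / (2 fm - f1 - f2) * h for f1,
    given that f1 + f2 = neighbours_total is known."""
    fraction = (mode - lower) / width
    f1 = f_modal - fraction * (2 * f_modal - neighbours_total)
    return f1, neighbours_total - f1


# 300 workers in all, so the two missing classes share 300 - 185 = 115 workers.
f1, f2 = missing_frequencies(4428.57, 4000, 1000, 75, 115)
print(round(f1), round(f2))
```

Because the stated mode is rounded to two decimals, the raw results differ slightly from whole numbers and are rounded at the end.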
Merits

1. It is easy to understand and easy to calculate. In many cases it can be located just by inspection.
2. It can be located in situations where the variable is not measurable but categorisation or ranking of observations is possible.
3. Unlike the mean, it is not affected by extreme observations. It can be calculated even if these extreme observations are not known.
4. It can be determined even if the distribution has open end classes.
5. It can be located even when the class intervals are of unequal width, provided that the widths of the modal class and of its preceding and following classes are equal.
6. It is a value around which there is more concentration of observations and hence the best representative of the data.
Demerits
1. It is not based on all the observations.
2. It is not capable of further mathematical treatment.
3. In certain cases mode is not rigidly defined and hence, the important requisite of a good measure of central tendency is not satisfied.
4. It is much affected by the fluctuations of sampling.
5. It is not easy to calculate unless the number of observations is sufficiently large and reveals a marked tendency of concentration around a particular value.
6. It is not suitable when different items of the data are of unequal importance.
7. It is an unstable average because the mode of a distribution depends upon the choice of width of class intervals.
Imagine a situation in which the symmetrical distribution is made asymmetrical or positively (or negatively) skewed by adding some observations of very high (or very low) magnitudes, so that the right hand (or the left hand) tail of the frequency curve gets elongated. Consequently, the three measures will depart from each other. Since mean takes into account the magnitudes of observations, it would be highly affected. Further, since the total number of observations will also increase, the median would also be affected but to a lesser extent than mean. Finally, there would be no change in the position of mode. More specifically, we shall have $M_o < M_d < \bar{X}$ when skewness is positive and $\bar{X} < M_d < M_o$ when skewness is negative, as shown in Fig. 2.8.
Fig. 2.8
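Solution:
(a) The empirical relation is $\bar{X} - M_o = 3(\bar{X} - M_d)$ or $M_o = 3M_d - 2\bar{X}$.
It is given that $\bar{X} = 42.2$ and $M_d = 41.9$.
$\therefore M_o = 3 \times 41.9 - 2 \times 42.2 = 125.7 - 84.4 = 41.3$

(b) Using the empirical relation, we can write $\bar{X} = \frac{3M_d - M_o}{2}$.
It is given that $M_d$ = Rs 380 and $M_o$ = Rs 350.
$\therefore \bar{X} = \frac{3 \times 380 - 350}{2} = \text{Rs } 395$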
Solution: Since the highest frequency occurs in the first class interval, the interpolation formula is not applicable. Thus, mode will be calculated by the use of empirical formula.
Calculation of Mean and Median

Class Intervals   Frequency (f)   c.f.   Mid-Values (X)   u = (X - 25)/10    fu
0 - 10                 45          45          5               -2            -90
10 - 20                20          65         15               -1            -20
20 - 30                14          79         25                0              0
30 - 40                 7          86         35                1              7
40 - 50                 3          89         45                2              6
Total                  89                                                    -97

Since $\frac{N}{2} = \frac{89}{2} = 44.5$, the median class is 0 - 10.

$$\therefore M_d = 0 + \frac{44.5 - 0}{45} \times 10 = 9.89$$

$$\text{Also } \bar{X} = 25 - \frac{97}{89} \times 10 = 14.10$$

Thus, $M_o = 3M_d - 2\bar{X} = 3 \times 9.89 - 2 \times 14.10 = 1.47$

Example 46: Estimate mode of the following distribution :
Weekly Wages of Workers (Rs)  :  105 - 115   115 - 125   125 - 135   135 - 145   145 - 155
No. of Workers                :      8          15          25          40          62
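Class Intervals   Frequency (f)   c.f.   u = (X - 130)/10    fu
105 - 115               8           8         -2            -16
115 - 125              15          23         -1            -15
125 - 135              25          48          0              0
135 - 145              40          88          1             40
145 - 155              62         150          2            124
Total                 150                                   133

Since $\frac{N}{2} = \frac{150}{2} = 75$, the median class is 135 - 145.

$$\therefore M_d = 135 + \frac{75 - 48}{40} \times 10 = 141.75$$

$$\text{Also } \bar{X} = 130 + \frac{133}{150} \times 10 = 138.87$$

Thus, $M_o = 3M_d - 2\bar{X} = 3 \times 141.75 - 2 \times 138.87 = 147.51$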
Name of the Candidates  :     A        B        C        D        E
No. of votes polled     :  10,000    5,000   15,000   50,000   17,000
Since the above characteristic, i.e., name of the candidate, is neither measurable nor can be arranged in the order of its intensity, it is not possible to calculate the mean and median. However, the mode of the distribution is D and hence, it can be taken as the representative of the above distribution.
2. If the characteristic is not measurable but various items of the distribution can be arranged in order of intensity of the characteristics, it is possible to locate median in addition to mode. For example, students of a class can be classified into four categories as poor, intelligent, very intelligent and most intelligent. Here the characteristic, intelligence, is not measurable. However, the data can be arranged in ascending or descending order of intelligence. It is not possible to calculate mean in this case.
3. If the characteristic is measurable but class intervals are open at one or both ends of the distribution, it is possible to calculate median and mode but not a satisfactory value of mean. However, an approximate value of mean can also be computed by making certain assumptions about the width of class(es) having open ends.
4. If the distribution is skewed, the median may represent the data more appropriately than mean and mode.
5. If various class intervals are of unequal width, mean and median can be satisfactorily calculated. However, an approximate value of mode can be calculated by making class intervals of equal width under the assumption that observations in a class are uniformly distributed. The accuracy of the computed mode will depend upon the validity of this assumption.

(b) The choice of an appropriate measure of central tendency also depends upon the purpose of investigation.
1. If the collected data are the figures of income of the people of a particular region and our purpose is to estimate the average income of the people of that region, computation of mean will be most appropriate. On the other hand, if it is desired to study the pattern of income distribution, the computation of median, quartiles or percentiles, etc., might be more appropriate. For example, the median will give a figure such that 50% of the people have income less than or equal to it. Similarly, by calculating quartiles or percentiles, it is possible to know the percentage of people having at least a given level of income or the percentage of people having income between any two limits, etc.
2. If the purpose of investigation is to determine the most common or modal size of the distribution, mode is to be computed, e.g., modal family size, modal size of garments, modal size of shoes, etc. The computation of mean and median will provide no useful interpretation of the above situations.
(c) Considerations based on various merits of an average: The presence or absence of various characteristics of an average may also affect its selection in a given situation.
1. If the requirement is that an average should be rigidly defined, mean or median can be chosen in preference to mode because mode is not rigidly defined in all the situations.
2. An average should be easy to understand and easy to interpret. This characteristic is satisfied by all the three averages.
3. It should be easy to compute. We know that all the three averages are easy to compute. It is to be noted here that, for the location of median, the data must be arranged in order of magnitude. Similarly, for the location of mode, the data should be converted into a frequency distribution. This type of exercise is not necessary for the computation of mean.
4. It should be based on all the observations. This characteristic is met only by mean and not by median or mode.
5. It should be least affected by the fluctuations of sampling. If a number of independent random samples of same size are taken from a population, the variations among means of these samples are less than the variations among their medians or modes. These variations are often termed as sampling variations. Therefore, preference should be given to mean when the requirement of least sampling variations is to be fulfilled. It should be noted here that if the population is highly skewed, the sampling variations in mean may be larger than the sampling variations in median.
6. It should not be unduly affected by the extreme observations. The mode is the most suitable average from this point of view. Median is only slightly affected while mean is very much affected by the presence of extreme observations.
7. It should be capable of further mathematical treatment. This characteristic is satisfied only by mean and, consequently, most of the statistical theories use mean as a measure of central tendency.
8. It should not be affected by the method of grouping of observations. Very often the data are summarised by grouping observations into class intervals. The chosen average should not be much affected by the changes in size of class intervals. It can be shown that if the same data are grouped in various ways by taking class intervals of different size, the effect of grouping on mean and median will be very small particularly when the number of observations is very large. Mode is very sensitive to the method of grouping.
9. It should represent the central tendency of the data. The main purpose of computing an average is to represent the central tendency of the given distribution and, therefore, it is desirable that it should fall in the middle of distribution. Both mean and median satisfy this requirement but in certain cases mode may be at (or near) either end of the distribution.
Hint: Convert the class intervals into class boundaries.

2. Calculate mode of the following distribution of weekly income of workers of a factory :

Weekly Income   :  0 - 75   75 - 100   100 - 150   150 - 175   175 - 300   300 - 500
No. of Workers  :     9        44         192         116         435         304

Hint: Make class intervals of equal width on the assumption that observations in a class are uniformly distributed. On this basis, the class of 0 - 75 can be written as 0 - 25, 25 - 50 and 50 - 75 each with frequency 3. The class 100 - 150 will be split as 100 - 125 and 125 - 150 each with frequency 96, etc.

3. Calculate the modal marks from the following distribution of marks of 100 students of a class :

Marks (More than)  :  90   80   70   60   50   40   30   20   10
No. of Students    :   0    4   15   33   53   76   92   98  100

4. The following table gives the number of geysers of different sizes (in litres) sold by a company during winter season of last year. Compute a suitable average of the distribution:

Capacity   :  less than 5   5 - 10   10 - 15   15 - 20   20 - 25   25 - 30   above 30
Frequency  :      1500       3000      2325      1750      1400      1225       800
Hint: Mode is the most suitable average.

5. Locate a suitable measure of tendency for the following distribution :

Colour of the hair  :  Brown   Black   Grey
No. of Persons      :   200     250    150

Hint: Since the characteristic is neither measurable nor can be arranged in order of magnitude, mode is most suitable.

6. The following table gives the classification of students of a class into various categories according to their level of intelligence. Compute a suitable measure of central tendency.

Characteristics  :  Poor   Intelligent   Very Intelligent   Most Intelligent
No. of Students  :    8        21               25                  6

Hint: Median as well as mode.

7. The following table gives the distribution of 200 families according to the number of children :

No. of Children  :   0    1    2    3    4    5    6    7
No. of families  :  12   18   49   62   36   13    7    3

Find $\bar{X}$, $M_d$ and $M_o$ and interpret these averages.

Hint: $\bar{X}$ represents mean number of children per family. Similarly interpret $M_d$ and $M_o$.

8. Given below is the income distribution of 500 families of a certain locality :

Monthly Income   :  500 - 1000   1000 - 1500   1500 - 2000   2000 - 2500   2500 - 3000
No. of Families  :      50           210           150            60            30

Find the most suitable average if
(i) it is desired to estimate average income per family,
(ii) it is to be representative of the distribution,
(iii) it is desired to study the pattern of the distribution.

Hint: (i) $\bar{X}$, (ii) $M_o$, (iii) $M_d$, quartiles, percentiles, etc.

9. A distribution of wages paid to foremen would show that, although a few reach very high levels, most foremen are at lower levels of the distribution. The same applies, of course, to most income distributions. If you were an employer, resisting a foreman's claim for an increase of wages, which average would suit your case? Give reasons for supporting your argument. Do you think your argument will be different in case you are a trade union leader?

Hint: An employer should use arithmetic mean because this is the highest average when distribution is positively skewed. Mode will be used by a trade union leader.

10. Atul gets a pocket money allowance of Rs 12 per month. Thinking that this was rather less, he asked his friends about their allowances and obtained the following data which includes his allowance (in Rs) also.
12, 18, 10, 5, 25, 20, 20, 22, 15, 10, 10, 15, 13, 20, 18, 10, 15, 10, 18, 15, 12, 15, 10, 15, 10, 12, 18, 20, 5, 8.
He presented this data to his father and asked for an increase in his allowance as he was getting less than the average amount. His father, a statistician, countered pointing out that Atul's allowance was actually more than the average amount. Reconcile these statements.

Hint: Atul's demand for more pocket money is based on the calculation of arithmetic mean while his father countered his argument on the basis of mode.
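The hint to the last exercise can be verified directly from the data. The following sketch uses Python's standard `statistics` module; it is an illustration of the comparison, not a prescribed method, and the variable names are arbitrary.

```python
from statistics import mean, median, mode

allowances = [12, 18, 10, 5, 25, 20, 20, 22, 15, 10, 10, 15, 13, 20, 18,
              10, 15, 10, 18, 15, 12, 15, 10, 15, 10, 12, 18, 20, 5, 8]
atul = 12

average = mean(allowances)       # arithmetic mean, the basis of Atul's claim
typical = mode(allowances)       # most common allowance, the basis of his father's reply
middle = median(allowances)

print(average, middle, typical)
print(atul < average, atul > typical)
```

Atul's allowance lies below the arithmetic mean but above the mode, so both statements are correct for the average each of them has in mind.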
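If there are n observations, $X_1, X_2, \ldots, X_n$, such that $X_i > 0$ for each i, their geometric mean (GM) is defined as

$$GM = \left(X_1 \cdot X_2 \cdots X_n\right)^{\frac{1}{n}} = \left(\prod_{i=1}^{n} X_i\right)^{\frac{1}{n}}$$

where the symbol $\prod$ denotes the product of observations. To evaluate GM, we have to use logarithms. Taking log of both sides we have

$$\log(GM) = \frac{1}{n} \log\left(X_1 \cdot X_2 \cdots X_n\right) = \frac{1}{n}\left[\log X_1 + \log X_2 + \cdots + \log X_n\right] = \frac{\sum \log X_i}{n}$$

$$\therefore GM = \text{antilog}\left[\frac{\sum \log X_i}{n}\right]$$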
This result shows that the GM of a set of observations is the antilog of the arithmetic mean of their logarithms. Example 47: Calculate geometric mean of the following data : 1, 7, 29, 92, 115 and 375 Solution:
Calculation of Geometric Mean

X         log X
1         0.0000
7         0.8451
29        1.4624
92        1.9638
115       2.0607
375       2.5740
Total     8.9060

$$GM = \text{antilog}\left[\frac{8.9060}{6}\right] = \text{antilog } 1.4843 = 30.50$$
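The log-based definition translates directly into code. Below is a minimal Python sketch, assuming base-10 logarithms as in the tables; the function name `geometric_mean_by_logs` is hypothetical.

```python
import math


def geometric_mean_by_logs(observations):
    """GM = antilog of the arithmetic mean of the logarithms (base 10)."""
    if any(x <= 0 for x in observations):
        raise ValueError("all observations must be positive")
    mean_log = sum(math.log10(x) for x in observations) / len(observations)
    return 10 ** mean_log


data = [1, 7, 29, 92, 115, 375]
log_total = sum(math.log10(x) for x in data)
gm = geometric_mean_by_logs(data)
print(round(log_total, 4), round(gm, 2))
```

Small differences in the last decimal place compared with hand calculation can arise because four-figure log tables are rounded.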
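If the data consists of observations $X_1, X_2, \ldots, X_n$ with respective frequencies $f_1, f_2, \ldots, f_n$, where $\sum_{i=1}^{n} f_i = N$, the geometric mean is given by

$$GM = \left[X_1^{f_1} \cdot X_2^{f_2} \cdots X_n^{f_n}\right]^{\frac{1}{N}}$$

Taking log of both sides, we have

$$\log(GM) = \frac{1}{N} \sum_{i=1}^{n} f_i \log X_i$$

or

$$GM = \text{antilog}\left[\frac{1}{N} \sum_{i=1}^{n} f_i \log X_i\right]$$

which is again equal to the antilog of the arithmetic mean of the logarithm of observations.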
Example 48: Calculate geometric mean of the following distribution :
X  :   5   10   15   20   25   30
f  :  13   18   50   40   10    6
Solution:
Calculation of GM

X          f        f log X
5         13         9.0870
10        18        18.0000
15        50        58.8050
20        40        52.0400
25        10        13.9790
30         6         8.8626
Total    137       160.7736

$$\therefore GM = \text{antilog}\left[\frac{160.7736}{137}\right] = \text{antilog } 1.1735 = 14.91$$
In case of a continuous frequency distribution, the class intervals are given. Let $X_1, X_2, \ldots, X_n$ denote the mid-values of the first, second, ......, nth class interval respectively with corresponding frequencies $f_1, f_2, \ldots, f_n$, such that $\sum f_i = N$. The formula for calculation of GM is the same as the formula used for an ungrouped frequency distribution.
Solution:
Calculation of GM

Class       f     Mid-Value (X)     f log X
5 - 15     10          10           10.0000
15 - 25    22          20           28.6227
25 - 35    25          30           36.9280
35 - 45    20          40           32.0412
45 - 55     8          50           13.5918
Total      85                      121.1837

$$GM = \text{antilog}\left[\frac{121.1837}{85}\right] = \text{antilog } 1.4257 = 26.65$$
$$GM = \text{antilog}\left[\frac{\sum w_i \log X_i}{\sum w_i}\right]$$

i.e., the weighted geometric mean of observations is equal to the antilog of the weighted arithmetic mean of their logarithms.
Example 50: Calculate weighted geometric mean of the following data :
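Variable (X)  :   5    8   44   160   500
Weights (w)   :  10    9    3     2     1

Here $\sum w = 25$, $\sum w \log X = 27.1554$ and $\sum \log X = 8.1487$.

$$\text{Weighted GM} = \text{antilog}\left[\frac{27.1554}{25}\right] = \text{antilog } 1.0862 = 12.20$$

$$\text{Simple GM} = \text{antilog}\left[\frac{8.1487}{5}\right] = \text{antilog } 1.6297 = 42.63 \quad (n = 5)$$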
Note that the simple GM is greater than the weighted GM because the given system of weights assigns more importance to values having smaller magnitude.
Example 51: If the geometric means of two groups consisting of 10 and 25 observations are 90.4 and 125.5 respectively, find the geometric mean of all the 35 observations combined into a single group. Solution:
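$$\text{Combined GM} = \text{antilog}\left[\frac{n_1 \log G_1 + n_2 \log G_2}{n_1 + n_2}\right]$$

Here $n_1 = 10$, $G_1 = 90.4$ and $n_2 = 25$, $G_2 = 125.5$.

$$\therefore GM = \text{antilog}\left[\frac{10 \log 90.4 + 25 \log 125.5}{35}\right] = \text{antilog}\left[\frac{10 \times 1.9562 + 25 \times 2.0986}{35}\right] = \text{antilog } 2.0579 = 114.27$$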
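The weighted and the combined geometric mean share one formula, with the weights or the group sizes playing the same role. The sketch below illustrates this in Python; the function name `weighted_geometric_mean` is hypothetical and base-10 logarithms are assumed.

```python
import math


def weighted_geometric_mean(values, weights):
    """Antilog of the weighted arithmetic mean of the base-10 logarithms."""
    if any(x <= 0 for x in values):
        raise ValueError("all values must be positive")
    total_weight = sum(weights)
    weighted_logs = sum(w * math.log10(x) for x, w in zip(values, weights))
    return 10 ** (weighted_logs / total_weight)


x = [5, 8, 44, 160, 500]
weighted = weighted_geometric_mean(x, [10, 9, 3, 2, 1])
simple = weighted_geometric_mean(x, [1, 1, 1, 1, 1])   # equal weights give the simple GM
# Combined GM of two groups: group GMs as values, group sizes as weights
combined = weighted_geometric_mean([90.4, 125.5], [10, 25])
print(round(weighted, 2), round(simple, 2), round(combined, 2))
```

Treating the group sizes as weights is what makes the combined GM formula a special case of the weighted GM.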
To determine the average rate of change of price for the entire period when the rate of change of prices for different periods are given
Let P0 be the price of a commodity in the beginning of the first year. If it increases by k1 % in the first year, the price at the end of 1st year (or beginning of second year) is given by
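$$P_1 = P_0 + \frac{k_1}{100} P_0 = P_0\left(1 + \frac{k_1}{100}\right) = P_0 (1 + r_1)$$

where $r_1 = \frac{k_1}{100}$ denotes the rate of increase per rupee in the first year. Similarly, if $r_2, \ldots, r_n$ denote the per unit rates of change in the subsequent years, the price at the end of the nth year is

$$P_n = P_0 (1 + r_1)(1 + r_2) \cdots (1 + r_n)$$

Further, if r denotes the average rate of change per year for the entire period, then $P_n = P_0 (1 + r)^n$. Equating the two expressions for $P_n$, we get

$$(1 + r) = \left[(1 + r_1)(1 + r_2) \cdots (1 + r_n)\right]^{\frac{1}{n}} \qquad \ldots (3)$$

This shows that (1 + r) is geometric mean of $(1 + r_1), (1 + r_2), \ldots$ and $(1 + r_n)$. From (3), we get

$$r = \left[(1 + r_1)(1 + r_2) \cdots (1 + r_n)\right]^{\frac{1}{n}} - 1 \qquad \ldots (4)$$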
Note: Here r denotes the per unit rate of change. This rate is termed as the rate of increase or the rate of growth if positive and the rate of decrease or the rate of decay if negative.

Example 52: The price of a commodity went up by 5%, 8% and 77% respectively in the last three years. The annual average rise of price is 26% and not 30%. Comment.

Solution: The correct average in this case is given by equation (4), given above. Let $r_1$, $r_2$ and $r_3$ be the increase in price per rupee in the respective years.

$$\therefore r = \left[(1 + r_1)(1 + r_2)(1 + r_3)\right]^{\frac{1}{3}} - 1 = \left[(1 + 0.05)(1 + 0.08)(1 + 0.77)\right]^{\frac{1}{3}} - 1 = (1.05 \times 1.08 \times 1.77)^{\frac{1}{3}} - 1 = 1.26 - 1 = 0.26$$

Also, the percentage rise of price is 100r% = 26%.

Note: 30% is the arithmetic mean of 5%, 8% and 77%, which is not a correct average. This can be verified as below :

If we take the average rise of price as 30% per year, then the price at the end of first year, taking it to be 100 in the beginning of the year, becomes 130.

Price at the end of 2nd year $= 130 \times \frac{130}{100} = 169$ and price at the end of 3rd year $= 169 \times \frac{130}{100} = 219.7$.

Similarly, taking the average as 26%, the price at the end of 3rd year $= 100 \times \frac{126}{100} \times \frac{126}{100} \times \frac{126}{100} = 200.04$.

Also, the actual price at the end of 3rd year $= 100 \times \frac{105}{100} \times \frac{108}{100} \times \frac{177}{100} = 200.7$. This price is correctly given by the geometric average and hence, it is the most suitable average in this case.
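The comparison between the arithmetic and the geometric average of the yearly rates can be reproduced in a few lines. This Python sketch is illustrative; the function name `average_rate` is hypothetical and rates are expressed per unit (0.05 for 5%).

```python
def average_rate(rates):
    """Average per-unit rate of change: geometric mean of (1 + r_i), minus 1."""
    product = 1.0
    for r in rates:
        product *= 1 + r
    return product ** (1 / len(rates)) - 1


rates = [0.05, 0.08, 0.77]
geometric = average_rate(rates)
arithmetic = sum(rates) / len(rates)

actual_price = 100 * 1.05 * 1.08 * 1.77          # price after three years, starting from 100
via_geometric = 100 * (1 + geometric) ** 3       # reproduces the actual price
via_arithmetic = 100 * (1 + arithmetic) ** 3     # overstates the price
print(round(geometric * 100, 1), round(arithmetic * 100, 1))
print(round(actual_price, 1), round(via_geometric, 1), round(via_arithmetic, 1))
```

Compounding at the geometric average recovers the final price, whereas compounding at the arithmetic average of 30% overstates it.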
$$r = \left(\frac{P_n}{P_0}\right)^{\frac{1}{n}} - 1.$$
Similarly, Equation (4), given above, can be used to find the average rate of growth of population when its rates of growth in various years are given. Remarks: The formulae of price and population changes, considered above, can also be extended to various other situations like growth of money, capital, output, etc. Example 53: The population of a country increased from 2,00,000 to 2,40,000 within a period of 10 years. Find the average rate of growth of population per year. Solution: Let r be the average rate of growth of population per year for the period of 10 years. Let P0 be initial and P10 be the final population for this period. We are given P0 = 2,00,000 and P10 = 2,40,000.
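$$\therefore r = \left(\frac{P_{10}}{P_0}\right)^{\frac{1}{10}} - 1 = \left(\frac{2,40,000}{2,00,000}\right)^{\frac{1}{10}} - 1$$

$$\text{Now } \left(\frac{24}{20}\right)^{\frac{1}{10}} = \text{antilog}\left[\frac{1}{10}(\log 24 - \log 20)\right] = \text{antilog}\left[\frac{1}{10}(1.3802 - 1.3010)\right] = \text{antilog}(0.0079) = 1.018$$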
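Thus, r = 1.018 - 1 = 0.018. Hence, the percentage rate of growth = 0.018 × 100 = 1.8% p.a.

Example 54: The gross national product of a country was Rs 20,000 crores 5 years ago. If it is Rs 30,000 crores now, find the annual rate of growth of G.N.P.

Solution: Here $P_5 = 30,000$, $P_0 = 20,000$ and n = 5.

$$\therefore r = \left(\frac{30,000}{20,000}\right)^{\frac{1}{5}} - 1$$

$$\text{Now } \left(\frac{3}{2}\right)^{\frac{1}{5}} = \text{antilog}\left[\frac{1}{5}(\log 3 - \log 2)\right] = \text{antilog}\left[\frac{1}{5}(0.4771 - 0.3010)\right] = \text{antilog}(0.0352) = 1.084$$

Hence r = 1.084 - 1 = 0.084. Thus, the percentage rate of growth of G.N.P. is 8.4% p.a.

Example 55: Find the average rate of increase of population per decade, which increased by 20% in first, 30% in second and 40% in the third decade.

Solution: Let r denote the average rate of growth of population per decade, then

$$r = \left(\frac{120}{100} \times \frac{130}{100} \times \frac{140}{100}\right)^{\frac{1}{3}} - 1 = (1.2 \times 1.3 \times 1.4)^{\frac{1}{3}} - 1$$

$$\text{Now } (1.2 \times 1.3 \times 1.4)^{\frac{1}{3}} = \text{antilog}\left[\frac{1}{3}(\log 1.2 + \log 1.3 + \log 1.4)\right] = \text{antilog}\left[\frac{1}{3}(0.0792 + 0.1139 + 0.1461)\right] = \text{antilog}(0.1131) = 1.297$$

$\therefore$ r = 1.297 - 1 = 0.297. Hence, the percentage rate of growth of population is 29.7% per decade.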
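When only the initial and final values are known, the average rate follows from a single root. The sketch below is a hedged Python illustration using the population and G.N.P. figures of the examples above; the function name `growth_rate` is hypothetical.

```python
def growth_rate(initial, final, periods):
    """Average per-period rate of growth: (final / initial) ** (1 / periods) - 1."""
    if initial <= 0 or final <= 0 or periods <= 0:
        raise ValueError("initial, final and periods must be positive")
    return (final / initial) ** (1 / periods) - 1


population_rate = growth_rate(200_000, 240_000, 10)   # population over 10 years
gnp_rate = growth_rate(20_000, 30_000, 5)             # G.N.P. over 5 years
print(round(population_rate * 100, 1), round(gnp_rate * 100, 1))
```

Direct exponentiation replaces the table of logarithms; the results agree with the hand calculation to the accuracy of four-figure logs.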
          Ratio x/y     Ratio y/x
             2/3           3/2
             1/4            4
AM          11/24         11/4
GM          1/√6           √6

We note that the product of their arithmetic means is not equal to unity. However, the product of their respective geometric means, i.e., $\frac{1}{\sqrt{6}}$ and $\sqrt{6}$, is equal to unity. Since it is desirable that a method of average should be independent of the way in which a ratio is expressed, it seems reasonable to regard geometric mean as more appropriate than arithmetic mean while averaging ratios.
Merits

1. It is a rigidly defined average.
2. It is based on all the observations.
3. It is capable of mathematical treatment. If any two out of the three values, i.e., (i) product of observations, (ii) GM of observations and (iii) number of observations, are known, the third can be calculated.
4. In contrast to AM, it is less affected by extreme observations.
5. It gives more weights to smaller observations and vice-versa.

Demerits

1. It is not very easy to calculate and hence is not very popular.
2. Like AM, it may be a value which does not exist in the set of given observations.
3. It cannot be calculated if any observation is zero or negative.

Uses

1. It is most suitable for averaging ratios and exponential rates of changes.
2. It is used in the construction of index numbers.
3. It is often used to study certain social or economic phenomena.
84
104 105 106 108 4 - 1 , \ average rate of interest = 100r %. Hint: r = 100 100 100 100
2.
The number of bacteria in a certain culture was found to be 4 $ 106 at noon of one day. At noon of the next day, the number was 9 $ 106. If the number increased at a constant rate per hour, how many bacteria were there at the intervening midnight? If the price of a commodity doubles in a period of 4 years, what is the average percentage increase per year?
1 1
P n 2 4 Hint: r = n - 1 = - 1 . 1 P
0
4. A machine is assumed to depreciate by 40% in value in the first year, by 25% in second year and by 10% p.a. for the next three years, each percentage being calculated on the diminishing value. Find the percentage depreciation p.a. for the entire period.
Hint: \( 1 - r = \left[ (1 - r_1)(1 - r_2)(1 - r_3)^3 \right]^{1/5} \).
5. A certain store made profits of Rs 5,000, Rs 10,000 and Rs 80,000 in 1965, 1966 and 1967 respectively. Determine the average rate of growth of its profits.
6. An economy grows at the rate of 2% in the first year, 2.5% in the second, 3% in the third, 4% in the fourth ...... and 10% in the tenth year. What is the average rate of growth of the economy?
Hint: \( r = (1.02 \times 1.025 \times 1.03 \times 1.04 \times 1.05 \times 1.06 \times 1.07 \times 1.08 \times 1.09 \times 1.10)^{1/10} - 1 \).
7. The export of a commodity increased by 30% in 1988, decreased by 22% in 1989 and then increased by 45% in the following year. The increase/decrease, in each year, being measured in comparison to its previous year. Calculate the average rate of change of the exports per annum.
Hint: \( r = (1.30 \times 0.78 \times 1.45)^{1/3} - 1 \).
8. Show that the arithmetic mean of two positive numbers a and b is at least as large as their geometric mean.
Hint: We know that the square of the difference of two numbers is always non-negative, i.e., \( (a - b)^2 \geq 0 \). Make adjustments to get the inequality \( (a + b)^2 \geq 4ab \) and then get the desired result, i.e., AM ≥ GM.
9. If population has doubled itself in 20 years, is it correct to say that the rate of growth has been 5% per annum?
Hint: The annual rate of growth is given by \( 100r = 100 \left[ (2)^{1/20} - 1 \right] = 3.53\% \), which is not equal to 5%.
10. The weighted geometric mean of 5 numbers 10, 15, 25, 12 and 20 is 17.15. If the weights of the first four numbers are 2, 3, 5, and 2 respectively, find weight of the fifth number.
Hint: Let x be the weight of the 5th number, then \( \left( 10^2 \cdot 15^3 \cdot 25^5 \cdot 12^2 \cdot 20^x \right)^{\frac{1}{12 + x}} = 17.15 \).
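The hints above all use the same idea: the average rate of change is the geometric mean of the yearly growth factors, minus one. A minimal Python sketch of that calculation follows; the function names `average_growth_rate` and `rate_from_endpoints` are illustrative and do not come from the text.

```python
import math


def average_growth_rate(yearly_rates):
    """Average rate r such that (1 + r)^n equals the product of (1 + r_i)."""
    product = math.prod(1 + r for r in yearly_rates)
    if product <= 0:
        raise ValueError("growth factors must multiply to a positive number")
    return product ** (1 / len(yearly_rates)) - 1


def rate_from_endpoints(initial, final, periods):
    """Average rate when only the first and last values are known."""
    return (final / initial) ** (1 / periods) - 1


# Population doubling in 20 years (exercise 9).
doubling_rate = rate_from_endpoints(1, 2, 20)
print(round(100 * doubling_rate, 2))  # 3.53, not 5

# Exports rising 30%, falling 22%, rising 45% (exercise 7).
exports_rate = average_growth_rate([0.30, -0.22, 0.45])
```

A decrease is entered as a negative rate, so a 22% fall becomes the factor 0.78, exactly as in the hint to exercise 7.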
If there are n observations X1, X2, ...... Xn, their harmonic mean is defined as
\[ HM = \frac{n}{\frac{1}{X_1} + \frac{1}{X_2} + \cdots + \frac{1}{X_n}} = \frac{n}{\sum_{i=1}^{n} \frac{1}{X_i}} \]
Example 56: Obtain harmonic mean of 15, 18, 23, 25 and 30. Solution:
\[ HM = \frac{5}{\frac{1}{15} + \frac{1}{18} + \frac{1}{23} + \frac{1}{25} + \frac{1}{30}} = \frac{5}{0.239} = 20.92 \text{ Ans.} \]
For ungrouped data, i.e., when X1, X2, ...... Xn occur with respective frequencies f1, f2 ...... fn, where \( \sum f_i = N \) is the total frequency, the arithmetic mean of the reciprocals of observations is given by \( \frac{1}{N} \sum_{i=1}^{n} \frac{f_i}{X_i} \).
Thus,
\[ HM = \frac{N}{\sum \frac{f_i}{X_i}} \]
Solution:
Calculation of Harmonic Mean

| X | Frequency (f) | f/X |
|---|---|---|
| 10 | 5 | 0.5000 |
| 11 | 8 | 0.7273 |
| 12 | 10 | 0.8333 |
| 13 | 9 | 0.6923 |
| 14 | 6 | 0.4286 |
| Total | 38 | 3.1815 |

\[ \therefore HM = \frac{38}{3.1815} = 11.94 \]
(c) Continuous Frequency Distribution
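The two harmonic mean formulas used so far can be checked with a short Python sketch. The function name `harmonic_mean` and its optional `freqs` argument are illustrative choices, not notation from the text; the data are those of Example 56 and of the frequency table above.

```python
def harmonic_mean(values, freqs=None):
    """Harmonic mean N / sum(f_i / X_i); every frequency defaults to 1."""
    if freqs is None:
        freqs = [1] * len(values)
    if any(x == 0 for x in values):
        # The text notes that HM cannot be computed when an observation is zero.
        raise ValueError("harmonic mean is undefined for a zero observation")
    total_freq = sum(freqs)
    reciprocal_sum = sum(f / x for x, f in zip(values, freqs))
    return total_freq / reciprocal_sum


hm_56 = harmonic_mean([15, 18, 23, 25, 30])
hm_57 = harmonic_mean([10, 11, 12, 13, 14], [5, 8, 10, 9, 6])
print(round(hm_56, 2), round(hm_57, 2))  # 20.92 11.94
```

For a continuous frequency distribution the same function applies once the class mid-values are passed as `values`.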
In case of a continuous frequency distribution, the class intervals are given. The midvalues of the first, second ...... nth classes are denoted by X1, X2, ...... Xn. The formula for the harmonic mean is same, as given in (b) above. Example 58: Find the harmonic mean of the following distribution :
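Solution:
Calculation of Harmonic Mean

| Class Intervals | Frequency (f) | Mid-Values (X) | f/X |
|---|---|---|---|
| 0 - 10 | 5 | 5 | 1.0000 |
| 10 - 20 | 8 | 15 | 0.5333 |
| 20 - 30 | 11 | 25 | 0.4400 |
| 30 - 40 | 21 | 35 | 0.6000 |
| 40 - 50 | 35 | 45 | 0.7778 |
| 50 - 60 | 30 | 55 | 0.5455 |
| 60 - 70 | 22 | 65 | 0.3385 |
| 70 - 80 | 18 | 75 | 0.2400 |
| Total | 150 | | 4.4751 |

\[ \therefore HM = \frac{150}{4.4751} = 33.52 \]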
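When the observations X1, X2, ...... Xn carry weights w1, w2, ...... wn, the weighted harmonic mean is
\[ HM = \frac{\sum w_i}{\sum \frac{w_i}{X_i}} \]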
Example 59: A train travels 50 kms at a speed of 40 kms/hour, 60 kms at a speed of 50 kms/hour and 40 kms at a speed of 60 kms/hour. Calculate the weighted harmonic mean of the speed of the train taking distances travelled as weights. Verify that this harmonic mean represents an appropriate average of the speed of train.
Solution:
\[ HM = \frac{\sum w_i}{\sum \frac{w_i}{X_i}} = \frac{50 + 60 + 40}{\frac{50}{40} + \frac{60}{50} + \frac{40}{60}} = \frac{150}{3.1167} = 48.13 \text{ kms/hour} \qquad .... (1) \]
We note that the numerator of Equation (1) gives the total distance travelled by train. Further, its denominator represents total time taken by the train in travelling 150 kms, since \( \frac{50}{40} \) is time taken by the train in travelling 50 kms at a speed of 40 kms/hour. Similarly \( \frac{60}{50} \) and \( \frac{40}{60} \) are time taken by the train in travelling 60 kms and 40 kms at the speeds of 50 kms/hour and 60 kms/hour respectively. Hence, weighted harmonic mean is most appropriate average in this case.
Example 60: Ram goes from his house to office on a cycle at a speed of 12 kms/hour and returns at a speed of 14 kms/hour. Find his average speed.
Solution: Since the distances of travel at various speeds are equal, the average speed of Ram will be given by the simple harmonic mean of the given speeds.
\[ \text{Average speed} = \frac{2}{\frac{1}{12} + \frac{1}{14}} = \frac{2}{0.1547} = 12.92 \text{ kms/hour} \]
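The verification asked for in Example 59 can also be done numerically. The sketch below is only an illustration (the helper name `weighted_harmonic_mean` is not from the text): it computes the weighted harmonic mean with distances as weights and compares it with total distance divided by total time.

```python
def weighted_harmonic_mean(rates, weights):
    """Weighted HM: sum of weights divided by the sum of weight/rate."""
    return sum(weights) / sum(w / x for x, w in zip(rates, weights))


# Example 59: distances (kms) act as weights for the speeds (kms/hour).
speeds = [40, 50, 60]
distances = [50, 60, 40]
train_speed = weighted_harmonic_mean(speeds, distances)

# Direct check: total distance over total travelling time.
total_time = sum(d / s for d, s in zip(distances, speeds))
direct_speed = sum(distances) / total_time

# Example 60: equal distances, so equal weights give the simple HM.
ram_speed = weighted_harmonic_mean([12, 14], [1, 1])

print(round(train_speed, 2), round(ram_speed, 2))  # 48.13 12.92
```

Both routes give the same figure because the denominator of the weighted harmonic mean is exactly the total travelling time.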
Choice between Harmonic Mean and Arithmetic Mean The harmonic mean, like arithmetic mean, is also used in averaging of rates like price per unit, kms per hour, work done per hour, etc., under certain conditions. To explain the method of choosing an appropriate average, consider the following illustration.
Let the price of a commodity be Rs 3, 4 and 5 per unit in three successive years. If we take A.M. of these prices, i.e., \( \frac{3 + 4 + 5}{3} = 4 \), then it will denote average price when equal quantities of the commodity are purchased in each year. To verify this, let us assume that 10 units of commodity are purchased in each year.
∴ Total expenditure on the commodity in 3 years = 10 × 3 + 10 × 4 + 10 × 5.
Also,
\[ \text{Average price} = \frac{\text{Total expenditure}}{\text{Total quantity purchased}} = \frac{10 \times 3 + 10 \times 4 + 10 \times 5}{10 + 10 + 10} = \frac{3 + 4 + 5}{3}, \]
which is arithmetic mean of the prices in three years.
Further, if we take harmonic mean of the given prices, i.e., \( \frac{3}{\frac{1}{3} + \frac{1}{4} + \frac{1}{5}} \), it will denote the average price when equal amounts of money are spent on the commodity in three years. To verify this let us assume that Rs 100 is spent in each year on the purchase of the commodity.
\[ \therefore \text{Average price} = \frac{\text{Total expenditure}}{\text{Total quantity purchased}} = \frac{300}{\frac{100}{3} + \frac{100}{4} + \frac{100}{5}} = \frac{3}{\frac{1}{3} + \frac{1}{4} + \frac{1}{5}} \]
Next, we consider a situation where different quantities are purchased in the three years. Let us assume that 10, 15 and 20 units of the commodity are purchased at prices of Rs 3, 4 and 5 respectively.
\[ \text{Average price} = \frac{\text{Total expenditure}}{\text{Total quantity purchased}} = \frac{3 \times 10 + 4 \times 15 + 5 \times 20}{10 + 15 + 20}, \]
which is weighted arithmetic mean of the prices taking respective quantities as weights.
Further, if Rs 150, 200 and 250 are spent on the purchase of the commodity at prices of Rs 3, 4 and 5 respectively, then
\[ \text{Average price} = \frac{150 + 200 + 250}{\frac{150}{3} + \frac{200}{4} + \frac{250}{5}}, \]
where \( \frac{150}{3} \), \( \frac{200}{4} \) and \( \frac{250}{5} \) are the quantities purchased in respective situations.
The above average price is equal to the weighted harmonic mean of prices taking money spent as weights. Therefore, to decide about the type of average to be used in a given situation, the first step is to examine the rate to be averaged. It may be noted here that a rate represents a ratio, e.g., \( \text{price} = \frac{\text{money}}{\text{quantity}} \), \( \text{speed} = \frac{\text{distance}}{\text{time}} \), \( \text{work done per hour} = \frac{\text{work done}}{\text{time taken}} \), etc.
We have seen above that arithmetic mean is appropriate average of prices \( \left( \frac{\text{money}}{\text{quantity}} \right) \) when quantities, that appear in the denominator of the rate to be averaged, purchased in different situations are given. Similarly, harmonic mean will be appropriate when sums of money, that appear in the numerator of the rate to be averaged, spent in different situations are given.
To conclude, we can say that the average of a rate, defined by the ratio p/q, is given by the arithmetic mean of its values in different situations if the conditions are given in terms of q and by the harmonic mean if the conditions are given in terms of p. Further, if the conditions are same in different situations, use simple AM or HM and otherwise use weighted AM or HM.
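The rule stated above can be written as a small decision function. This is a sketch only; the function `average_rate` and its `given` argument are hypothetical names introduced for illustration. It uses the price illustration from this section, where a rate p/q is price = money/quantity.

```python
def average_rate(rates, amounts, given):
    """Average of a rate p/q.

    given="denominator": amounts are values of q, so use the weighted AM.
    given="numerator":   amounts are values of p, so use the weighted HM.
    """
    if given == "denominator":
        return sum(r * a for r, a in zip(rates, amounts)) / sum(amounts)
    if given == "numerator":
        return sum(amounts) / sum(a / r for r, a in zip(rates, amounts))
    raise ValueError("given must be 'denominator' or 'numerator'")


prices = [3, 4, 5]

# Quantities purchased are known (10, 15 and 20 units): weighted AM.
price_by_quantity = average_rate(prices, [10, 15, 20], given="denominator")

# Money spent is known (Rs 150, 200 and 250): weighted HM.
price_by_money = average_rate(prices, [150, 200, 250], given="numerator")

print(round(price_by_quantity, 2), price_by_money)  # 4.22 4.0
```

Passing equal amounts reduces each branch to the simple AM or simple HM, which matches the last sentence of the rule.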
Example 61: An individual purchases three qualities of pencils. The relevant data are given below :
| Quality | Price per pencil (Rs) | Money Spent (Rs) |
|---|---|---|
| A | 1.00 | 50 |
| B | 1.50 | 30 |
| C | 2.00 | 20 |

Calculate average price per pencil.
Solution: Since different sums of money spent in various situations are given, we shall calculate weighted harmonic mean to calculate average price.
\[ \text{Weighted HM} = \frac{50 + 30 + 20}{\frac{50}{1.00} + \frac{30}{1.50} + \frac{20}{2.00}} = \frac{100}{50 + 20 + 10} = \text{Rs } 1.25 \]
Example 62: In a 400 metre athlete competition, a participant covers the distance as given below. Find his average speed.
| Distance | Speed (Metres per second) |
|---|---|
| First 80 metres | 10 |
| Next 240 metres | 7.5 |
| Last 80 metres | 10 |

Solution: Since \( \text{Speed} = \frac{\text{distance}}{\text{time}} \) and the conditions are given in terms of distance travelled at various speeds, HM will be the appropriate average.
\[ \text{Average speed} = \frac{80 + 240 + 80}{\frac{80}{10} + \frac{240}{7.5} + \frac{80}{10}} = \frac{400}{8 + 32 + 8} = 8.33 \text{ metres/second} \]
Example 63: Peter travelled by a car for four days. He drove 10 hours each day. He drove first day at the rate of 45 kms/hour, second day at the rate of 40 kms/hour, third day at the rate of 38 kms/hour and fourth day at the rate of 37 kms/hour. What was his average speed.
Solution: Since the rate to be averaged is \( \text{speed} = \frac{\text{distance}}{\text{time}} \) and the conditions are given in terms of time, therefore AM will be appropriate. Further, since Peter travelled for equal number of hours on each of the four days, simple AM will be calculated.
\[ \therefore \text{Average speed} = \frac{45 + 40 + 38 + 37}{4} = 40 \text{ kms/hour} \]
Example 64: In a certain factory, a unit of work is completed by A in 4 minutes, by B in 5 minutes, by C in 6 minutes, by D in 10 minutes and by E in 12 minutes. What is their average rate of working? What is the average number of units of work completed per minute? At this rate, how many units of work each of them, on the average, will complete in a six hour day? Also find the total units of work completed.
Solution: Here the rate to be averaged is time taken to complete a unit of work, i.e., \( \frac{\text{time}}{\text{units of work done}} \). Since we have to determine the average with reference to a (six hours) day, therefore, HM of the rates will give us appropriate average.
\[ HM = \frac{5}{\frac{1}{4} + \frac{1}{5} + \frac{1}{6} + \frac{1}{10} + \frac{1}{12}} = \frac{5}{0.8} = 6.25 \text{ minutes per unit} \]
The average number of units of work completed per minute = \( \frac{1}{6.25} \) = 0.16.
The average number of units of work completed by each person = 0.16 × 360 = 57.6.
Total units of work completed by all the five persons = 57.6 × 5 = 288.0.
Example 65: A scooterist purchased petrol at the rate of Rs 14, 15.50 and 16 per litre during three successive years. Calculate the average price of petrol (i) if he purchased 150, 160 and 170 litres of petrol in the respective years and (ii) if he spent Rs 2,200, 2,500 and 2,600 in the three years.
Solution: The rate to be averaged is expressed as \( \frac{\text{money}}{\text{litre}} \).
(i) Since the condition is given in terms of different litres of petrol in three years, therefore, weighted AM will be appropriate.
\[ \therefore \text{Average price} = \frac{14 \times 150 + 15.50 \times 160 + 16 \times 170}{150 + 160 + 170} = \frac{7300}{480} = \text{Rs } 15.21\text{/litre} \]
(ii) Since the condition is given in terms of different sums of money spent in three years, weighted HM will be appropriate.
\[ \therefore \text{Average price} = \frac{2200 + 2500 + 2600}{\frac{2200}{14} + \frac{2500}{15.50} + \frac{2600}{16}} = \frac{7300}{480.93} = \text{Rs } 15.18\text{/litre} \]
Merits and Demerits of Harmonic Mean
Merits
1. It is a rigidly defined average.
2. It is based on all the observations.
3. It gives less weight to large items and vice-versa.
4. It is capable of further mathematical treatment.
5. It is suitable in computing average rate under certain conditions.
Demerits
1. It is not easy to compute and is difficult to understand.
2. It may not be an actual item of the given observations.
3. It cannot be calculated if one or more observations are equal to zero.
4. It may not be representative of the data if small observations are given correspondingly small weights.
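If a and b are two positive numbers, then
\[ AM = \frac{a + b}{2}, \quad GM = \sqrt{ab} \quad \text{and} \quad HM = \frac{2}{\frac{1}{a} + \frac{1}{b}} = \frac{2ab}{a + b}. \]
Since the square of the difference between a and b is always a non-negative number, we can write \( (a - b)^2 \geq 0 \) or \( a^2 + b^2 - 2ab \geq 0 \) or \( a^2 + b^2 \geq 2ab \). Adding 2ab to both sides, we have \( a^2 + b^2 + 2ab \geq 4ab \) or \( (a + b)^2 \geq 4ab \) or
\[ \frac{(a + b)^2}{4} \geq ab \quad \text{or} \quad \frac{a + b}{2} \geq \sqrt{ab} \qquad .... (1) \]
\[ \Rightarrow AM \geq GM \qquad .... (2) \]
Divide both sides of inequality (1) by \( \frac{a + b}{2} \), to get \( 1 \geq \frac{2\sqrt{ab}}{a + b} \). Multiply both sides by \( \sqrt{ab} \), to get
\[ \sqrt{ab} \geq \frac{2ab}{a + b} \quad \Rightarrow \quad GM \geq HM \qquad .... (3) \]
Combining (2) and (3), we can write AM ≥ GM ≥ HM.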
Note: The equality sign will hold when a = b.
Example 67: For any two positive numbers, show that \( GM = \sqrt{AM \times HM} \).
Solution: If a and b are two positive numbers, then
\[ AM = \frac{a + b}{2}, \quad GM = \sqrt{ab} \quad \text{and} \quad HM = \frac{2ab}{a + b} \]
Now
\[ AM \cdot HM = \frac{a + b}{2} \cdot \frac{2ab}{a + b} = ab = (GM)^2 \quad \text{or} \quad GM = \sqrt{AM \times HM}. \]
Example 68:
(a) If AM of two observations is 15 and their GM is 9, find their HM and the two observations.
(b) Comment on the following : The AM of 20 observations is 25, GM = 20 and HM = 21.
Solution:
(a) \( \sqrt{AM \times HM} = GM \)
∴ \( \sqrt{15 \times HM} = 9 \) or 15 × HM = 81. Thus, HM = 5.4.
Let the two observations be X1 and X2. We are given that \( \frac{X_1 + X_2}{2} = 15 \) or \( X_1 + X_2 = 30 \).
Also \( \sqrt{X_1 X_2} = 9 \) or \( X_1 X_2 = 81 \).
We can write \( (X_1 - X_2)^2 = (X_1 + X_2)^2 - 4 X_1 X_2 = 900 - 4 \times 81 = 576 \) or \( X_1 - X_2 = 24 \).
Adding this to \( X_1 + X_2 = 30 \), we get \( 2X_1 = 54 \), ∴ \( X_1 = 27 \). Also \( X_2 = 3 \).
(b) The statement is wrong because HM cannot be greater than GM.
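The relations AM ≥ GM ≥ HM and GM² = AM × HM for two positive numbers can be checked numerically. The sketch below is illustrative only; `two_number_means` and `observations_from_am_gm` are hypothetical helper names, and the second one simply retraces the algebra of Example 68(a).

```python
import math


def two_number_means(a, b):
    """Return (AM, GM, HM) of two positive numbers."""
    am = (a + b) / 2
    gm = math.sqrt(a * b)
    hm = 2 * a * b / (a + b)
    return am, gm, hm


def observations_from_am_gm(am, gm):
    """Recover the two observations from their AM and GM."""
    total = 2 * am                      # X1 + X2
    product = gm ** 2                   # X1 * X2
    difference = math.sqrt(total ** 2 - 4 * product)  # X1 - X2
    return (total + difference) / 2, (total - difference) / 2


am, gm, hm = two_number_means(27, 3)
print(am, gm, hm)  # 15.0 9.0 5.4

x1, x2 = observations_from_am_gm(15, 9)
print(x1, x2)  # 27.0 3.0
```

Trying `two_number_means(5, 5)` returns three equal values, which is the equality case a = b mentioned in the note.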
2. Prices per share of a company during first five days of a month were Rs 100, 120, 150, 140 and 50. (i) Find the average daily price per share. (ii) Find the average price paid by an investor who purchased Rs 20,000 worth of shares on each day. (iii) Find the average price paid by an investor who purchased 100, 110, 120, 130 and 150 shares on respective days.
Hint: Find simple HM in (ii) and weighted AM in (iii).
3. Typist A can type a letter in five minutes, B in ten minutes and C in fifteen minutes. What is the average number of letters typed per hour per typist?
Hint: Since we are given conditions in terms of per hour, therefore, simple HM of speed will give the average time taken to type one letter. From this we can obtain the average number of letters typed in one hour by each typist.
\[ \text{Simple HM} = \frac{3}{\frac{1}{5} + \frac{1}{10} + \frac{1}{15}} = 8.18 \text{ minutes per letter.} \]
∴ Average number of letters typed per hour per typist = \( \frac{60}{8.18} = 7.33 \).
4. Ram paid Rs 15 for two dozens of bananas in one shop, another Rs 15 for three dozens of bananas in second shop and Rs 15 for four dozens of bananas in third shop. Find the average price per dozen paid by him.
Hint: First find the prices per dozen in three situations and since equal money is spent, HM is the appropriate average.
5. A country accumulates Rs 100 crores of capital stock at the rate of Rs 10 crores/year, another Rs 100 crores at the rate of Rs 20 crores/year and Rs 100 crores at the rate of Rs 25 crores/year. What is the average rate of accumulation?
Hint: Since Rs 100 crores, each, is accumulated at the rates of Rs 10, 20 and 25 crores/year, simple HM of these rates would be most appropriate.
6. A motor car covered a distance of 50 miles 4 times. The first time at 50 m.p.h., the second at 20 m.p.h., the third at 40 m.p.h. and the fourth at 25 m.p.h. Calculate the average speed.
7. The interest paid on each of the three different sums of money yielding 10%, 12% and 15% simple interest p.a. is the same. What is the average yield percent on the sum invested?
Quadratic Mean
Quadratic mean is the square root of the arithmetic mean of squares of observations. If X1, X2 ...... Xn are n observations, their quadratic mean is given by
\[ QM = \sqrt{\frac{X_1^2 + X_2^2 + \cdots + X_n^2}{n}} = \sqrt{\frac{\sum X_i^2}{n}} \]
Similarly, the QM of observations X1, X2 ...... Xn with their respective frequencies as f1, f2 ...... fn is given by
\[ QM = \sqrt{\frac{\sum f_i X_i^2}{N}}, \quad \text{where } N = \sum f_i. \]
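A brief Python sketch of the quadratic mean follows. The function name `quadratic_mean` and the sample figures (1 and 7) are hypothetical, chosen only so that the result is a round number.

```python
import math


def quadratic_mean(values, freqs=None):
    """Square root of the (frequency-weighted) mean of squared observations."""
    if freqs is None:
        freqs = [1] * len(values)
    total_freq = sum(freqs)
    mean_of_squares = sum(f * x * x for x, f in zip(values, freqs)) / total_freq
    return math.sqrt(mean_of_squares)


qm_simple = quadratic_mean([1, 7])             # sqrt((1 + 49) / 2)
qm_grouped = quadratic_mean([1, 7], [2, 2])    # same data, each value twice
print(qm_simple, qm_grouped)  # 5.0 5.0
```

Note that the quadratic mean of 1 and 7 is 5, which is larger than their arithmetic mean of 4.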
Moving Average
This is a special type of average used to eliminate periodic fluctuations from the time series data.
Progressive Average
A progressive average is a cumulative average which is computed by taking all the available figures in each succeeding year. The averages for different periods are obtained as shown below :
\[ X_1, \quad \frac{X_1 + X_2}{2}, \quad \frac{X_1 + X_2 + X_3}{3}, \quad \ldots \text{ etc.} \]
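The sequence of progressive averages can be produced from running totals. The following sketch uses hypothetical yearly figures (10, 20, 30, 40) and an illustrative function name, `progressive_averages`.

```python
from itertools import accumulate


def progressive_averages(values):
    """Average of the first k figures, for k = 1, 2, ..., n."""
    running_totals = accumulate(values)
    return [total / k for k, total in enumerate(running_totals, start=1)]


yearly_figures = [10, 20, 30, 40]  # hypothetical data
averages = progressive_averages(yearly_figures)
print(averages)  # [10.0, 15.0, 20.0, 25.0]
```

The last entry of the list is the ordinary arithmetic mean of all the figures.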
Composite Average
A composite average is an average of various other averages. If for example, \( \bar{X}_1, \bar{X}_2, \ldots, \bar{X}_k \) are the arithmetic means of k series, their composite average
\[ = \frac{\bar{X}_1 + \bar{X}_2 + \cdots + \bar{X}_k}{k}. \]
Check Your Progress 2.2
1. Establish the relation between AM, GM and HM.
2. What is the empirical relation among mean, median and mode?
Notes:
(a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
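(i) \( \bar{X} = \frac{\sum f_i X_i}{N} \) (Simple AM)
(ii) \( \bar{X} = A + \frac{\sum f_i d_i}{N} \) (Short-cut method)
(iii) \( \bar{X} = A + h \cdot \frac{\sum f_i u_i}{N} \) (Step-deviation method)
(iv) \( \bar{X}_w = \frac{\sum w_i X_i}{\sum w_i} \) (Weighted AM)
(v) \( \bar{X} = \frac{N_1 \bar{X}_1 + N_2 \bar{X}_2 + \cdots + N_k \bar{X}_k}{N_1 + N_2 + \cdots + N_k} \) (Mean of the combined series)
(vi) \( M_d = L_m + \frac{\frac{N}{2} - C}{f_m} \times h \) (Median)
(vii) \( Q_i = L_{Q_i} + \frac{\frac{iN}{4} - C}{f_{Q_i}} \times h \), where i = 1, 3 (Quartiles)
(viii) \( P_k = L_{P_k} + \frac{\frac{kN}{100} - C}{f_{P_k}} \times h \) (k th Percentile)
(ix) \( GM = \text{antilog} \left[ \frac{\sum f_i \log X_i}{N} \right] \) (Simple GM)
(x) \( GM_w = \text{antilog} \left[ \frac{\sum w_i \log X_i}{\sum w_i} \right] \) (Weighted GM)
(xi) \( G = \text{antilog} \left[ \frac{n_1 \log G_1 + n_2 \log G_2 + \cdots + n_k \log G_k}{n_1 + n_2 + \cdots + n_k} \right] \) (GM of the combined series)
(xii) \( r = \left[ (1 + r_1)(1 + r_2)(1 + r_3) \cdots (1 + r_n) \right]^{1/n} - 1 \) is average annual rate of growth per unit, where r1, r2 ...... rn are the rates of growth in various years.
(xiii) \( HM = \frac{N}{\sum \frac{f_i}{X_i}} \) (Simple HM)
(xiv) \( HM_w = \frac{\sum w_i}{\sum \frac{w_i}{X_i}} \) (Weighted HM)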
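The median, quartile and percentile formulas in this summary share one interpolation step: locate the class containing the required cumulative position, then move proportionally into it. A hedged Python sketch is given below; the function name `partition_value`, its arguments, and the assumption of equal class widths are illustrative choices, and the frequencies are sample figures.

```python
import math


def partition_value(lower_limits, width, freqs, fraction):
    """Value below which the given fraction of observations lies.

    Implements L + ((fraction * N - C) / f) * h for equal-width classes,
    where C is the cumulative frequency before the located class.
    """
    target = fraction * sum(freqs)
    cumulative = 0
    for lower, f in zip(lower_limits, freqs):
        if f > 0 and cumulative + f >= target:
            return lower + (target - cumulative) / f * width
        cumulative += f
    raise ValueError("fraction must lie between 0 and 1")


# Sample distribution: classes 0-10, 10-20, ..., 40-50.
lower_limits = [0, 10, 20, 30, 40]
freqs = [10, 12, 20, 18, 10]

median = partition_value(lower_limits, 10, freqs, 0.5)
q1 = partition_value(lower_limits, 10, freqs, 0.25)
q3 = partition_value(lower_limits, 10, freqs, 0.75)
print(median, q1, round(q3, 2))  # 26.5 16.25 35.83
```

Using `fraction = k / 100` gives the k th percentile, and `fraction = k / 10` the k th decile, without any other change.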
2.14 KEYWORDS
Mean Median Mode Average Central Tendency
Distinguish between:
4. What do you mean by 'Central Tendency'? Describe the advantages and the disadvantages of arithmetic mean and mode.
5. What are the characteristics of an ideal average? How far are these satisfied by the mode and median?
6. Distinguish between a mathematical average and a positional average. Give advantages and disadvantages of each type of average.
7. What do you understand by partition values? Give the definitions of quartiles, deciles and percentiles.
8. "Each average has its own special features and it is difficult to say which one is the best". Explain this statement.
9. Discuss the considerations that determine the selection of a suitable average. Explain by giving one example of each case.
10. Explain the empirical relation between mean, median and mode. What are its uses? Under what circumstances is it expected to hold true?
11. Distinguish between a simple average and a weighted average. Explain with an example the circumstances in which the latter is more appropriate than the former.
12. "An average is a substitute for a complex group of variables but it is not always safe to depend on the substitute alone to the exclusion of individual measurements of groups". Discuss.
13. Show that if all observations of a series are added, subtracted, multiplied or divided by a constant b, the mean is also added, subtracted, multiplied or divided by the same constant.
14. Prove that the algebraic sum of deviations of a given set of observations from their mean is zero.
15. Prove that the sum of squared deviations is least when taken from the mean.
16. The heights of 15 students of a class were noted as shown below. Compute arithmetic mean by using (i) Direct Method and (ii) Short-Cut Method.

| S. No. | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ht (cms) | 160 | 167 | 174 | 158 | 155 | 171 | 162 | 152 | 156 | 175 | 178 | 167 | 177 | 162 | 153 |
21. The weights (in gms) of 30 articles are given below :
14 16 16 14 22 13 15 24 12 23 14 20 17 21 18 18 19 20 17 16 15 11 12 21 20 17 18 19 22 23.
Construct a grouped frequency distribution by taking equal class intervals in which the first interval should be 11 - 13 (exclusive). Also find the arithmetic mean.
22. The following information relates to wages of workers in a factory, their total working hours and the average working hours per worker. Calculate the wage per worker and the total wage.

| Wages (Rs) | 50-70 | 70-90 | 90-110 | 110-130 | 130-150 | 150-170 |
|---|---|---|---|---|---|---|
| Total hours worked | 72 | 200 | 255 | 154 | 78 | 38 |
| Average No. of hours worked per worker | 9 | 8 | 8.5 | 7 | 7.8 | 7.6 |
23. The monthly salaries of 30 employees of a firm are given below :
69 148 132 118 142 116 139 126 114 100 88 62 77 99 103 144 148 63 104 123 95 80 85 106 123 133 140 134 108 129
The firm gave bonus of Rs 10, 15, 20, 25, 30 and 35 for individuals in the respective salary group; exceeding Rs 60 but not exceeding Rs 75, exceeding Rs 75 but not exceeding Rs 90 and so on up to exceeding Rs 135 but not exceeding Rs 150. Find out the average bonus paid per employee.
24. Find out the missing frequency in the following distribution with mean equal to 30.

| Class | 0 - 10 | 10 - 20 | 20 - 30 | 30 - 40 | 40 - 50 |
|---|---|---|---|---|---|
| Frequency | 5 | 6 | 10 | ? | 13 |
25. (a) The following table gives the monthly salary of academic staff of a college. Calculate the simple and weighted arithmetic means of their monthly salary. Which of these averages is most appropriate and why?

| | Designation | Monthly Salary | No. of Teachers |
|---|---|---|---|
| (i) | Principal | 4500 | 1 |
| (ii) | Reader | 3700 | 5 |
| (iii) | Senior-Lecturer | 3000 | 15 |
| (iv) | Lecturer | 2200 | 25 |

(b) The sum of deviations of a certain number of observations from 12 is 166 and the sum of deviations of these observations from 16 is 54. Find the number of observations and their mean.
26. Twelve persons gambled on a certain night. Seven of them lost at an average rate of Rs 10.50 while remaining five gained at an average of Rs 13.00. Is the information given above correct? If not, why?
27. The incomes of employees in an industrial concern are given below. The total income of ten employees in the class over Rs 250 is Rs 3,000. Compute mean income. Every employee belonging to the top 25% of the earners is required to pay 1% of his income to workers' relief fund. Estimate the contribution to this fund.

| Income (Rs) | 0-50 | 50-100 | 100-150 | 150-200 | 200-250 | 250 and above |
|---|---|---|---|---|---|---|
| Frequency | 90 | 150 | 100 | 80 | 70 | 10 |
28. Comment on the performance of the students of three universities given below:
Calcutta University Madras University Bombay University Courses of Study Pass% No. of Students Pass% No. of Students Pass% No. of Students 82 200 81 200 M. A. 71 300 76 300 76 350 400 M. Com. 83 60 700 73 200 M. Sc. 66 300 73 73 600 74 450 500 B. A. 76 700 58 200 74 200 B. Com. 65 300 70 700 65 300 B. Sc.
29. (a) Compute the weighted arithmetic mean of the indices of various groups as given below:

| Group | Food | Clothing | Housing | Education of Children | Miscellaneous |
|---|---|---|---|---|---|
| Index | 120 | 130 | 150 | 100 | 160 |
| Weight | 4 | 2 | 2 | 1 | 1 |

(b) A cumulative frequency distribution has 65 as the mid-value of its last class interval. The cumulative frequencies of the first, second ...... seventh classes are 5, 21, 45, 72, 85, 94 and 100 respectively. If all the class intervals are of equal width of 10 units, write down the relevant frequency distribution. Also calculate its mean and median.
30. A distribution consists of three components each with total frequency of 200, 250 and 300 and with means of 25, 10 and 15 respectively. Find out the mean of the combined distribution.
31. Find the average number of children per family for the sub-groups separately as well as combined as a whole.

Sub-group I

| No. of Children | 0 | 1 | 2 | 3 |
|---|---|---|---|---|
| No. of families | 10 | 50 | 60 | 40 |

Sub-group II

| No. of Children | 4-5 | 6-7 | 8-9 | 10-11 |
|---|---|---|---|---|
| No. of families | 20 | 12 | 4 | 4 |

32. (a) The mean of a certain number of items is 20. If an observation 25 is added to the data, the mean becomes 21. Find the number of items in the original data.
(b) The mean age of a combined group of men and women is 30 years. If the mean age of the men's group is 32 years and that for the women's group is 27 years, find the percentage of men and women in the combined group.
33. The average age of 40 students entering B.A. (Honours) Economics first year in a college was 19 years. Out of this only 25 students passed the third year examination. If the average age of these 25 students is 22.5 years, find the average age of the remaining students.
34. Fifty students took a test. The result of those who passed the test is given below:

| Marks | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|
| No. of Students | 8 | 10 | 9 | 6 | 4 | 3 |

If the average marks for all the 50 students was 5.16, find the average marks of those who failed.
35. A person had 7 children. The average age of the children was 14 years when one of the children died at the age of 8 years. What will be the average age of the remaining children after five years of this death?
36. The mean marks of 100 students was calculated as 40. Later on it was discovered that a score 53 was misread as 83. Find the correct mean.
37. An examination was held to decide the award of a scholarship. The weights given to various subjects were different. Only three applicants for the scholarship obtained over 50% marks in aggregate. The marks were as follows :

| Subjects | Weights | % Marks of A | % Marks of B | % Marks of C |
|---|---|---|---|---|
| Cost Accounting | 5 | 70 | 65 | 90 |
| Statistics | 4 | 63 | 80 | 75 |
| Business Law | 2 | 50 | 40 | 65 |
| Economics | 3 | 55 | 50 | 40 |
| Insurance | 1 | 60 | 40 | 38 |
Of the candidates, the one getting the highest average marks is to be awarded the scholarship. Determine, who will get it?
38. The number of fully formed tomatoes on 100 plants were counted with the following results :

| No. of plants | 2 | 5 | 7 | 11 | 18 | 24 | 12 | 8 | 6 | 4 | 3 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| No. of tomatoes per plant | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |

(i) How many tomatoes were there in all?
(ii) What was the average number of tomatoes per plant?
39. (a) The average income of 300 employees of a company is Rs 1,800 p.m. Due to rise in prices the company owner decided to give ad-hoc increase of 25% of the average income to each of the 25% lowest paid employees, 10% of the average income to each of the 10% highest paid employees and 15% to each of the remaining employees. Find out the amount of money required for ad hoc increase and also the average income of an employee after this increase.
(b) The frequency distribution of the number of casual leave taken by the employees of a firm in a particular year is given below in which one entry marked as '?' is missing. Determine the missing value if the average number of casual leave taken by an employee is 8.5.

| No. of Casual leave taken | 0 | 4 | 5 | ? | 9 | 10 | 12 |
|---|---|---|---|---|---|---|---|
| No. of Employees | 8 | 35 | 40 | 65 | 79 | 91 | 82 |
40. The mean salary paid to 1,000 employees of an establishment was found to be Rs 180.40. Later on, after disbursement of salary, it was discovered that the salaries of two employees were wrongly entered as Rs 297 and Rs 165 instead of Rs 197 and 185 respectively. Find the correct mean salary.
41. The following variations were recorded in the measurements of parts by a machine:

| Variations from the Standard (mm.) | No. of parts |
|---|---|
| 10 to 15 | 1 |
| 5 to 10 | 3 |
| 0 to 5 | 20 |
| -5 to 0 | 25 |
| -10 to -5 | 22 |
| -15 to -10 | 17 |
| -20 to -15 | 13 |
| -25 to -20 | 10 |
| -30 to -25 | 7 |
| -35 to -30 | 2 |

(i) Find average variations.
(ii) What proportion fell within a range of 5 mm. either way of the standard?
(iii) If those which fall more than 10 mm. apart from the standard are classified as bad, what percentage of the parts are bad?
(iv) Which stretch of 15 mm. contains the greatest number of parts and what fraction of the total fall inside this stretch?
42. (a) The average monthly production of a certain factory for the first ten months of a year was 3,500 units. Due to workers' unrest in the last two months, the average monthly production for the whole year came down to 3,200 units. Find the average monthly production of the last two months.
(b) The average sales of a balloon seller on the first five days (i.e., Monday to Friday) of a particular week was Rs 50 and his average sales for the entire week was Rs 70. If his sales on Sunday were 40% higher than his sales on Saturday, find his sales on each of the last two days, i.e., on Saturday and Sunday.
43. Determine median from the following data : 30, 37, 54, 58, 61, 64, 31, 34, 52, 55, 62, 28, 47, 55, 60
44. Locate median of the following data: 65, 85, 55, 75, 96, 76, 65, 60, 40, 85, 80, 125, 115, 40
45. Locate Md, Q1, Q3, D3, D6, P20, P40, P85 and P90 from the following data :

| S. No. | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Marks | 17 | 32 | 35 | 33 | 15 | 21 | 41 | 32 | 10 | 18 | 20 | 22 | 11 | 15 | 35 | 23 | 38 | 12 |

46. In a class of 16 students, the following are the marks obtained by them in statistics. Find out the lower quartile, upper quartile, seventh decile and thirty-fifth percentile.

| S. No. | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Marks | 5 | 12 | 17 | 23 | 28 | 31 | 37 | 41 | 42 | 49 | 54 | 58 | 65 | 68 | 17 | 77 |

47. Locate Md, Q1, Q3, D4, D7, P26, P45, P66, P70 and P79 from the following data :

| Age of Children (in years) | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
|---|---|---|---|---|---|---|---|---|---|---|
| No. of Children | 32 | 33 | 39 | 43 | 58 | 59 | 52 | 38 | 33 | 13 |
54. With the help of the following figures, prepare a cumulative frequency curve and locate the median and quartiles:

| Marks Obtained | 0 - 10 | 10 - 20 | 20 - 30 | 30 - 40 | 40 - 50 |
|---|---|---|---|---|---|
| No. of Students | 10 | 12 | 20 | 18 | 10 |

55. Draw a cumulative frequency curve from the following data and find out the median and both quartiles:

| Class | 1 - 5 | 6 - 10 | 11 - 15 | 16 - 20 | 21 - 25 | 26 - 30 | 31 - 35 | 36 - 40 | 41 - 45 |
|---|---|---|---|---|---|---|---|---|---|
| Frequency | 7 | 10 | 16 | 32 | 24 | 18 | 10 | 5 | 1 |

56. Calculate median and both quartiles from the following data :

| Age | 20 - 24 | 25 - 29 | 30 - 34 | 35 - 39 | 40 - 44 | 45 - 49 | 50 - 54 | 55 - 59 |
|---|---|---|---|---|---|---|---|---|
| No. of Persons | 50 | 70 | 100 | 180 | 150 | 120 | 70 | 60 |

57. Calculate the quartiles, D7 and P85 from the following data :

| Class | Frequency |
|---|---|
| Less than 100 | 85 |
| 100 - 250 | 100 |
| 250 - 400 | 175 |
| 400 - 500 | 74 |
| 500 - 550 | 66 |
| 550 - 600 | 35 |
| 600 - 800 | 5 |
| 800 - 900 | 18 |
| 900 - 1000 | 2 |

58. Calculate arithmetic mean and median from the data given below :

| Income in Rs (less than) | 80 | 70 | 60 | 50 | 40 | 30 | 20 | 10 |
|---|---|---|---|---|---|---|---|---|
| No. of Workers | 100 | 90 | 80 | 60 | 32 | 20 | 13 | 5 |

63. Following is the distribution of marks obtained by 50 students in 'mercantile law'. Calculate median marks. If 60% of the students pass this test, find the minimum marks obtained by a passed candidate.

| Marks (more than) | 0 | 10 | 20 | 30 | 40 | 50 |
|---|---|---|---|---|---|---|
| No. of Students | 50 | 46 | 40 | 20 | 10 | 3 |
64. Estimate the number of first, second and third divisioners and the number of failures from the following data. First division is awarded at 60 or more marks, second division at 50 and above but less than 60, third division at 36 or more but less than 50 and those securing less than 36 are failures.
M arks ( out of 100 ) N o . of Students : : 0 - 20 18 20 - 40 30 40 - 60 66 65 60 - 80 25 80 and above 11 12
65. Following relate to the weekly wages (in Rs) of workers of a factory : 100, 75, 79, 80, 110, 93, 109, 84, 95, 77, 100, 89, 84, 81, 106, 96, 94, 83, 95, 78, 101, 99, 83, 89, 102, 97, 93, 82, 97, 80, 102, 96, 87, 99, 107, 99, 97, 80, 98, 93, 106, 94, 88, 104, 103, 100, 98, 84, 100, 96, 86, 93, 89, 100, 101, 106, 92, 86, 105, 97, 82, 92, 75, 103, 101, 103, 100, 88, 106, 98, 87, 90, 76, 104, 101, 107, 97, 91, 103, 98, 109, 86, 76, 107, 88, 107, 88, 93, 85, 98, 104, 78, 79, 110, 94, 108, 86, 95, 84, 87. Prepare a frequency distribution by taking class intervals as 75 - 80, 80 - 85, etc. and locate its median and the two quartiles. 66. Find an appropriate average for the following distribution :
| Weekly Income (in Rs) | Below 100 | 100 - 200 | 200 - 300 | 300 - 400 | 400 - 500 | 500 and above |
|---|---|---|---|---|---|---|
| No. of families | 50 | 500 | 555 | 100 | 3 | 2 |
67. In the frequency distribution of 100 families given below, the number of families corresponding to weekly expenditure groups 200 - 400 and 600 - 800 are missing. However, the median of the distribution is known to be Rs 500. Find the missing frequencies.
| Expenditure | 0 - 200 | 200 - 400 | 400 - 600 | 600 - 800 | 800 - 1000 |
|---|---|---|---|---|---|
| No. of families | 14 | ? | 27 | ? | 15 |
(a) Find the median wage.
(b) A fund is to be raised and it is decided that the workers getting less than Rs 120 should contribute 5% of their wages and those getting Rs 120 or more should contribute 10% of their wages. What sum should be collected?
70. Determine the mode of the following data : 58, 60, 31, 62, 48, 37, 78, 43, 65, 48 71. Locate mode of the following series :
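| S. No. | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 9 | 7 | 4 | 9 | 10 | 8 | 4 | 10 | 5 | 8 | 15 | 8 |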
73. The number of calls received in 240 successive one minute intervals at an exchange are shown in the following frequency distribution. Calculate mode:
| No. of calls | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|---|
| Frequency | 14 | 21 | 25 | 43 | 51 | 35 | 39 | 12 |
Midpoints Frequency
: 1
: 5 50 45 30 20 10 15 5
80. Find out mode of the following data graphically and check the result by calculation:
| Size | 0 - 1 | 1 - 2 | 2 - 3 | 3 - 4 | 4 - 5 | 5 - 6 | 6 - 7 | 7 - 8 | 8 - 9 | 9 - 10 | 10 - 11 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Frequency | 3 | 7 | 9 | 15 | 25 | 20 | 14 | 12 | 8 | 6 | 2 |
81. (a) Construct a frequency distribution of the marks obtained by 50 students in economics as given below : 42, 53, 65, 63, 61, 47, 58, 60, 64, 45, 55, 57, 82, 42, 39, 51, 65, 55, 33, 70, 50, 52, 53, 45, 45, 25, 36, 59, 63, 39, 65, 30, 45, 35, 49, 15, 54, 48, 64, 26, 75, 20, 42, 40, 41, 55, 52, 46, 35, 18. (Take the first class interval as 10 - 20).
(b)
82. The monthly profits (in Rs) of 100 shops are distributed as follows :
| Profits | 0 - 100 | 100 - 200 | 200 - 300 | 300 - 500 | 500 - 600 | 600 - 800 |
|---|---|---|---|---|---|---|
| No. of Shops | 19 | 21 | 30 | 40 | 10 | 12 |
Calculate mode of the distribution. 83. The mode of the following incomplete distribution of weights of 160 students is 56. Find the missing frequencies.
| Weights (kgs) | 30 - 40 | 40 - 50 | 50 - 60 | 60 - 70 | 70 - 80 | 80 - 90 |
|---|---|---|---|---|---|---|
| No. of Students | 20 | 36 | ? | ? | 15 | 5 |
84. Calculate mean, median and mode from the following table :
Wages ( Rs ) No. of Persons 5 Less than 8 12 Less than 16 8- 24 29 24 and above 31 32 - 40 8 40 and above 19 48- 56 5
In a moderately skewed distribution, the arithmetic mean is 10 and mode is 7. Find median. In a moderately asymmetrical distribution, the mean is 25 and the median is 23.5. Find mode.
86. Find geometric mean from the following daily income (in Rs) of 10 families: 85, 70, 15, 75, 500, 8, 45, 250, 40 and 36. 87. Calculate geometric mean of the following distribution :
| Marks (less than) | 10 | 20 | 30 | 40 | 50 |
|---|---|---|---|---|---|
| No. of Students | 12 | 27 | 72 | 93 | 100 |
88. The value of a machine depreciates at a constant rate from the cost price of Rs 1,000 to the scrap value of Rs 100 in ten years. Find the annual rate of depreciation and the value of the machine at the end of one, two, three years. 89. Calculate weighted GM from the following data :
| Items | Wheat | Milk | Sugar | Eggs |
|---|---|---|---|---|
| Weights | 10 | 5 | 2 | 6 |
| Price Index | 135 | 140 | 160 | 120 |
90. The price of a commodity increased by 12% in 1986, by 30% in 1987 and by 15% in 1988. Calculate the average increase of price per year. 91. The population of a city was 30 lakh in 1981 which increased to 45 lakh in 1991. Determine the rate of growth of population per annum. If the same growth continues, what will be the population of the city in 1995. 92. The value of a machine depreciated by 30% in 1st year, 13% in 2nd year and by 5% in each of the following three years. Determine the average rate of depreciation for the entire period. 93. The following table gives the diameters of screws obtained in a sample enquiry. Calculate mean diameter by using geometric average.
Diameter (mm) : 130 135 140 145 146 148 149 150 157
No. of Screws : 3 4 6 6 3 5 2 1 1
94. (a) The price of a commodity doubles in a period of 5 years. What will be the average rate of increase per annum? (b) If a sum of Rs 1,500 is invested at 15% rate of interest compounded annually, determine the amount after 5 years.
95. (a) Find the average rate of increase per decade in the population which increased by 10% in the first decade, by 20% in the second and by 40% in the third. (b) The price of a commodity increased by 10% in 1st year, by 15% in 2nd year and decreased by 10% in 3rd year. Determine the average change of price after 3 years.
96. The following table gives the marks obtained by 70 students in mathematics. Calculate arithmetic and geometric means:
Marks (more than) : 80 70 60 50 40 30 20
No. of Students : 0 7 18 40 40 63 70
Find the average growth per decade. 98. The geometric means of three groups consisting of 15, 20 and 23 observations are 14.5, 30.2 and 28.8 respectively. Find geometric mean of the combined group. 99. A sum of money was invested for 3 years. The rates of interest in the first, second and third year were 10%, 12% and 14% respectively. Determine the average rate of interest per annum. 100. The weighted geometric mean of four numbers 8, 25, 17 and 30 is 15.3. If the weights of first three numbers are 5, 3 and 4 respectively, find the weight of the fourth number. 101. The annual rates of growth of output of a factory in five years are 5.0, 6.5, 4.5, 8.5 and 7.5 percent respectively. What is the compound rate of growth of output per annum for the period? 102. (a) A man invested Rs 1,000, Rs 12,000 and Rs 15,000 at the respective rates of return of 5%, 14% and 13% p.a. respectively. Determine his average rate of return per annum. The arithmetic and the geometric means of two numbers are 20.5 and 20 respectively. Find the numbers. Calculate the harmonic mean of the following data : 9, 5, 2, 10, 15, 35, 20, 24, 21 (b) Calculate HM of the following items : 1.0, 1.5, 15.0, 250, 0.5, 0.05, 0.095, 1245, 0.009 104. Calculate X , GM and HM and verify that X > GM > HM.
Class Intervals : 5 - 15 15 - 25 25 - 35 35 - 45 45 - 55
Frequency : 6 9 15 8 4
105. Four typists take 15, 10, 8, 7 minutes respectively to type a letter. Determine the average time required to type a letter if
(a) Four letters are to be typed by each typist.
(b) Each typist works for two hours.
106. (a) A person spends Rs 60 for oranges costing Rs 10 per dozen and another Rs 70 for oranges costing Rs 14 per dozen. What is the average price per dozen paid by him?
(b) Three mechanics take 10, 8, and 6 hours respectively to assemble a machine. Determine the average number of hours required to assemble one machine.
107. At harvesting time, a farmer employed 10 men, 20 women and 16 boys to lift potatoes. A woman's work was three quarters as effective as that of a man, while a boy's work was only half. Find the daily wage bill if a man's rate was Rs 24 per day and the rates for the women and boys were in proportion to their effectiveness. Calculate the average daily rate for the 46 workers.
108. Saddam takes a trip which entails travelling 1,350 kms by train at a speed of 60 kms/hr, 630 kms by aeroplane at 350 kms/hr, 4,500 kms by ship at 25 kms/hr and 20 kms by car at 30 kms/hr. What is the average speed for the entire journey?
109. (a) A man travels from Lucknow to Kanpur, a distance of 80 kms, at a speed of 45 kms/hr. From Kanpur he goes to Etawah, a distance of 165 kms, at a speed of 65 kms/hr and from Etawah he comes back to Lucknow, along the same route, at a speed of 60 kms/hr. What is his average speed for the entire journey?
(b) If refills for 5 rupees are purchased at 40 paise each and for another 5 rupees are purchased at 60 paise each, the average price would be 48 paise and not 50 paise. Explain and verify.
110. (a) An aeroplane travels distances of 2,500, 1,200, and 500 kms at the speeds of 500, 400 and 250 kms/hour respectively. Find the average speed for the entire trip, commenting upon the choice of your average.
(b) A train goes from Delhi to Agra in four hours at speeds of 25, 60, 80 and 40 kms/hour in each successive hour respectively. Find the average speed of the train and verify your answer.
111. A can do a unit of work in 10 minutes, B in 18 minutes and C in 20 minutes. Find their average rate of working when :
(i) A works for 8 hours, B for 9 hours and C for 10 hours per day.
(ii) Each of them has to complete 40 units of work per day.
Also determine the total units of work done per day in each of the above situations and verify your answer. 112. Choose an appropriate average to find the average price per kg., for the following data:
Articles   Qty Purchased   Rate (in gms./rupee)
Wheat      5 kg.           250
Rice       3 kg.           150
Sugar      1 kg.           100
Pulses     2 kg.           90
Now change the weights as 12, 6, 8 and 2 respectively and recalculate the weighted harmonic mean. What do you conclude? 114. (a) The speeds of various buses of a company plying on the same route was found to be as given below :
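Speed (in miles/hour) : 12 15 18
No. of Buses : 3 5 2
(b) Find mean daily earnings from the following data :
50 men get at the rate of Rs 50 per man per day
35 men get at the rate of Rs 60 per man per day
25 men get at the rate of Rs 75 per man per day
10 men get at the rate of Rs 100 per man per day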
115. A college canteen sells tea for 75 paise per cup, coffee for Rs 1.50 per cup and bread pakora for Rs 2 per plate. If on a particular day, it sold tea worth Rs 150, coffee worth Rs 165 and bread pakora worth Rs 200, what is the average price per item sold?
116. A firm of readymade garments makes both men's and women's shirts. Its profits average 6% of sales; its profits on men's shirts average 8% of sales. If the share of women's shirts in total sales is 60%, find the average profit as a percentage of the sales of women's shirts.
117. Which of the averages will be most suitable in the following circumstances? (i) Average rate of growth of population in a given period. (ii) Average number of children in a family. (iii) Average size of oranges on a tree. (iv) Average speed of work. (v) Average marks of students in a class. (vi) Average intelligence of students in a class. (vii) Average size of collars. (viii) Average income of a lawyer. (ix) Average size of readymade garments. (x) Average size of agricultural holdings. (xi) Average change in prices. (xii) Average level of health.
118. Select the correct alternative.
(a) Relationship between mean (m), geometric mean (g) and harmonic mean (h) is: (i) g = (m + h)/2 (ii) g = m.h (iii) g = m.h/(m + h) (iv) None of these.
(b) (i) 3X̄ - 2Md, (ii) Mo = 3X̄ - 2Md 2, (iii) Mo = 3X̄ - 3Md, (iv) Mo = 3Md - 2X̄
(c) Which of the following would be an appropriate average for determining the average size of readymade garments : (i) Arithmetic mean (ii) Median (iii) Mode (iv) Geometric mean
(d) Most appropriate average to determine the size of oranges on a tree is : (i) Mode (ii) Median (iii) Mean (iv) None of these.
(e) Most appropriate measure for qualitative measurements is : (i) Mode (ii) Median (iii) Mean (iv) None of these.
(f) The most unstable measure of central tendency is : (i) Mean (ii) Median (iii) Mode (iv) None of these.
(g) The sum of deviations of observations is zero when measured from : (i) Median (ii) GM (iii) Mode (iv) Mean
(h) The average, most affected by the extreme observations, is : (i) Mode (ii) Mean (iii) GM (iv) Median
(i) The most stable average is : (i) Mode (ii) Mean (iii) Median (iv) GM
119. State whether the following statements are true or false : (i) X̄ can be calculated for a distribution with open ends. (ii) Md is not affected by the extreme observations. (iii) X̄ is based on all the observations. (iv) X̄ = Mo = Md, for a symmetrical distribution. (v) Mo can be calculated if class intervals are of unequal width. (vi) The class limits should be exclusive for the calculation of Md and Mo.
120. Fill in the blanks : (i) ...... is most suitable for measuring average rate of growth. (ii) ...... or ...... are used for averaging rates under certain conditions.
(iii) ...... or ...... are the averages which can be calculated for a distribution with open ends. (iv) ...... or ...... are the averages used to study the pattern of a distribution. (v) ...... or ...... are the averages which can be calculated when the characteristics are not measurable.
(vi) ...... or ...... or ...... averages depend upon all the observations. (vii) The sum of squares of deviations is ...... when taken from mean. (viii) The average which divides a distribution into two equal parts is ...... . (ix) Md of a distribution is also equal to its ...... quartile. (x) The point of intersection of the 'less than type' and 'more than type' ogives corresponds to ...... .
(xi) The algebraic sum of deviations of 30 observations from a value 14 is 3. The mean of these observations is ...... .
121. Examine the validity of the following statements giving necessary proofs and reasons for your answer : (i) For a set of 50 observations Xi, i = 1, 2 ...... 50, $\sum_{i=1}^{50} (X_i - 10) = 90$, when $\bar{X} = 10$.
(ii) Geometric mean of a given number of observations cannot be obtained if one of them is zero. (iii) The mean depth of water of a river is 130 cms, therefore, a man with a height of 165 cms can cross the river safely. (iv) For a wholesale manufacturer, interested in the type which is usually in demand, median is the most suitable average.
(v)
(vi) For a set of 8 observations AM, GM and HM are 5.2, 6.3 and 7.1 respectively. (vii) If 2y - 6x = 6 and mode of y is 66, then mode of x is 21.
ANSWERS TO QUESTIONS FOR
(c) False (d) True (e) True (d) Ten (b) True (c) Median
LESSON 3
MATHEMATICAL MODEL
CONTENTS
3.0 Aims and Objectives
3.1 Introduction
3.2 Mathematics: The Language of Modelling
3.3 Building a Mathematical Model
3.4 Verifying and Refining a Model
3.5 Variables and Parameters
3.6 Continuous-in-Time vs. Discrete-in-Time Models
3.7 Deterministic Model Example
3.8 Probabilistic Models
3.9 Let us Sum Up
3.10 Lesson-end Activity
3.11 Keywords
3.12 Questions for Discussion
3.13 Model Answers to Questions for Discussion
3.14 Suggested Readings
3.1 INTRODUCTION
Models in science come in different forms. A physical model that you probably are familiar with is an anatomically detailed model of the human body. Mathematical models are less commonly found in science classes, but they form the core of modern cosmology. Mathematical models are extremely powerful because they usually enable predictions to be made about a system. The predictions then provide a road map for further experimentation. Consequently, it is important for you to develop an appreciation for this type of model as you learn more about cosmology. Two sections of the activity develop mathematical models of direct relevance to cosmology and astronomy. The math skills required in the activity increase with each section, but nothing terribly advanced is required. A very common approach to the mathematical modeling of a physical system is to collect
a set of experimental data and then figure out a way to graph the data so that one gets a straight line. Once a straight line is obtained, it is possible to generalize the information contained in the straight line in terms of the powerful algebraic equation:
y = mx + b
You probably are familiar with this equation. In it y represents a value on the y-axis, x represents a value on the x-axis, m represents the slope of the straight line, and b represents the value of the intercept of the line on the y-axis. In all sections of this activity, your goal will be to analyze and then graph a set of data so that you obtain a straight line. Then you will derive the equation that describes the line, and use the equation to make predictions about the system. So relax and have fun with math!
Mathematical modeling is the process of creating a mathematical representation of some phenomenon in order to gain a better understanding of that phenomenon. It is a process that attempts to match observation with symbolic statement. During the process of building a mathematical model, the modeler will decide what factors are relevant to the problem and what factors can be de-emphasized. Once a model has been developed and used to answer questions, it should be critically examined and often modified to obtain a more accurate reflection of the observed reality of that phenomenon. In this way, mathematical modeling is an evolving process; as new insight is gained, the process begins again as additional factors are considered. Generally the success of a model depends on how easily it can be used and how accurate its predictions are. (Edwards & Hamson, 1994, p. 3)
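The straight-line approach described in this section (collect data, obtain a line y = mx + b, then predict) can be sketched in Python. This is only an illustration: the data points are hypothetical, and the least-squares rule used to choose m and b is an assumption, since the lesson does not say how the line should be drawn.

```python
def fit_line(xs, ys):
    """Return slope m and intercept b of the least-squares line y = m*x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    b = mean_y - m * mean_x
    return m, b


# Hypothetical measurements that lie close to a straight line.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.1, 4.9, 7.2, 9.0, 10.8]

m, b = fit_line(xs, ys)
prediction = m * 6.0 + b  # use the fitted equation to predict y at x = 6
print(f"slope m = {m:.2f}, intercept b = {b:.2f}, predicted y(6) = {prediction:.2f}")
```

Once m and b are known, any "what happens at x = ...?" question reduces to substituting into the fitted equation, which is the predictive use of a model that the text describes.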
2. Begin with a simple model, stating the assumptions that you make as you focus on particular aspects of the phenomenon.
3. Identify important variables and constants and determine how they relate to each other.
4. Develop the equation(s) that express the relationships between the variables and constants.
Check Your Progress 3.1
1. What is the difference between physical model and mathematical model?
2. What are the different steps of mathematical modelling process?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Is the information produced reasonable? Are the assumptions made while developing the model reasonable? Are there any factors that were not considered that could affect the outcome? How do the results compare with real data, if available?
In answering these questions, you may need to modify your model. This refining process should continue until you obtain a model that agrees as closely as possible with the real world observations of the phenomenon that you have set out to model.
One of the purposes of a model such as this is to make predictions and try "What If?" scenarios. You can change the inputs and recalculate the model and you'll get a new answer. You might even want to plot a graph of the future value (F) vs. years (Y). In some cases, you may have a fixed interest rate, but what do you do if the interest rate is allowed to change? For this simple equation, you might only care to know a worst/best case scenario, where you calculate the future value based upon the lowest and highest interest rates that you might expect.
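A worst/best case calculation of this kind can be sketched in a few lines of Python. The formula F = P(1 + i)^Y (annual compounding) is an assumption made for illustration, as are the principal and the two interest rates; substitute the equation and figures of the model actually being studied.

```python
def future_value(principal, rate, years):
    """Future value F under annual compounding: F = P * (1 + i) ** Y (assumed form)."""
    return principal * (1.0 + rate) ** years


# Hypothetical inputs for a "What If?" comparison.
principal = 1000.0
low_rate, high_rate = 0.04, 0.08  # lowest and highest interest rates expected

for years in (1, 5, 10):
    worst = future_value(principal, low_rate, years)
    best = future_value(principal, high_rate, years)
    print(f"Y = {years:2d}  worst-case F = {worst:8.2f}  best-case F = {best:8.2f}")
```

Changing `principal`, the rates or the list of years and re-running the loop is the recalculation step the paragraph refers to; the printed rows could equally be plotted as F against Y.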
Suppose that you have $10.00 and that you want to win an additional $10.00. We will consider two different strategies.
The Flamboyant Strategy: You stride purposefully up to the wheel with a devil-may-care smile on your face. You bet your entire fortune of $10.00 on one spin of the wheel. If the ball lands in a red slot then you win, pocket your winnings, and leave with $20.00 and a genuine happy smile on your face. If the ball lands in a slot of a different color then you
smile bravely at everyone as if $10.00 is mere chickenfeed and leave with empty pockets and feeling gloomy. With the flamboyant strategy your chances of winning are 18/38 or roughly 0.4737.
The Timid Strategy: With this strategy you approach the roulette table with obvious trepidation. After watching for a while and working up your courage, you bet $1.00. When the ball falls in a slot you either win or lose $1.00. Now you have either $9.00 or $11.00. You continue betting one dollar on each spin of the wheel until you either go broke or reach your goal of $20.00.
Before continuing, pause and think about these two strategies. Which of the two do you think gives you the best chance of winning? Or are your chances of winning the same whichever strategy you use?
One way to study the questions raised above is by trying the two strategies in real casinos, wagering your own real money. This approach has several advantages and several disadvantages. One advantage is that this approach is realistic. Real casinos are run by people who know how to make a profit. They are skilled at creating an atmosphere that is likely to encourage customers to bet and lose more than they might like. The lessons that you learn in a real casino are more likely to be real lessons than the ones you learn in a simulated casino like the one we use below. One disadvantage is that this approach can be very costly both in terms of money and time. We take a different approach using the CAS window to simulate playing with the second, or timid, strategy. We already know the chances of winning with the first, or flamboyant, strategy: 18/38, or roughly 0.4737.
Computer algebra systems like Maple, MathCad, Mathematica, or the CAS system in the TI-92 have a procedure that generates random numbers. For example, on the TI-92 the command rand() produces a random number between zero and one. The screen below shows the results of executing this command seven times. Notice that it produced seven different random numbers.
Using the random number generator in your CAS window, you can easily simulate one spin of a roulette with a procedure like the one shown below.
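The same idea can be sketched in Python instead of a CAS (an assumption for illustration; the lesson itself uses a CAS window). One spin is a random number compared with 18/38, and a fixed-bet game repeats spins until the player is broke or reaches the goal. The function names and the number of trials are hypothetical. The closed-form check uses the standard gambler's-ruin formula, which is not part of the lesson and holds only when the bet divides both the starting amount and the goal.

```python
import random

P_RED = 18 / 38  # chance that the ball lands in a red slot, as given in the text


def spin_wins(rng):
    """Simulate one spin of the wheel: True if the ball lands on red."""
    return rng.random() < P_RED


def play(start, goal, bet, rng):
    """Bet a fixed amount on every spin until broke or until the goal is reached.

    Returns (won, number_of_spins)."""
    money, spins = start, 0
    while 0 < money < goal:
        money += bet if spin_wins(rng) else -bet
        spins += 1
    return money >= goal, spins


def estimate(start, goal, bet, trials, seed=0):
    """Estimate the winning chance and the average number of spins by simulation."""
    rng = random.Random(seed)
    wins = 0
    total_spins = 0
    for _ in range(trials):
        won, spins = play(start, goal, bet, rng)
        wins += won
        total_spins += spins
    return wins / trials, total_spins / trials


def exact_win_probability(start, goal, bet):
    """Gambler's-ruin formula for fixed bets (assumes bet divides start and goal)."""
    i, n = start // bet, goal // bet
    r = (1 - P_RED) / P_RED
    return (1 - r ** i) / (1 - r ** n)


for bet in (1, 2, 5, 10):  # timid, two intermediate strategies, flamboyant
    p_hat, avg_spins = estimate(10, 20, bet, trials=2000)
    print(f"bet ${bet:2d}: simulated {p_hat:.3f}, "
          f"formula {exact_win_probability(10, 20, bet):.3f}, "
          f"average spins {avg_spins:.1f}")
```

With a bet of $10 the formula reduces to 18/38, the flamboyant figure quoted in the text; the smaller bets can be compared against it by running the loop.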
Your CAS window has a program that is built on this basic idea and will simulate playing roulette using the timid strategy. Use this program to answer the questions below.
Compare the timid strategy to the flamboyant strategy. Consider an intermediate strategy betting $2.00 on each spin of the wheel. Consider another, intermediate strategy betting $5.00 on each spin of the wheel. Some people enjoy gambling. If you play the flamboyant strategy then you spin the wheel just once. On the average how often would you spin the wheel with each of the strategies above. What conclusion can you draw from our work in this module regarding the advisability of diversifying your investments? Be careful. Your answer depends on your investment goals and your beliefs about whether stock prices are more likely to rise or to fall.
Check Your Progress 3.2
1. What is the difference between Variables and Parameters?
2. Give two applications of computer algebra system.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
phenomena. In this first chapter we have looked at the following applications. Everything we have discussed above (the content of the course, the tools and the technology) would be useless without you. Indeed, without you there would be no purpose. The purpose of mathematical modeling is to enable people like you and me to learn about our world, to form mental pictures of how it works and how we can make it a bit better. Mathematical modeling requires your active participation: thinking, working with your computer algebra system and with old-fashioned paper and pencil, exploring the world with the TI-CBL, rubber bands, and TinkerToys, and exchanging ideas with friends and colleagues.
3.11 KEYWORDS
Model
Time Models
Flamboyant Strategy
Timid Strategy
Parameters
Compound Interest Model
Probabilistic Models
Strategy
1. A good model should .................. the essential character of the model to be analysed.
2. Building Model can be .................. yet .................. task.
3. .................. model is based on a set of equations.
4. The best example of probabilistic model is .................. .
5. .................. is based other than flamboyant and timid strategy.
ANSWERS TO QUESTIONS FOR
LESSON 4
LINEAR PROGRAMMING: GRAPHICAL METHOD
CONTENTS
4.0 Aims and Objectives
4.1 Introduction
4.2 Essentials of Linear Programming Model
4.3 Properties of Linear Programming Model
4.4 Formulation of Linear Programming
4.5 General Linear Programming Model
4.6 Maximization & Minimization Models
4.7 Graphical Method
4.8 Solving Linear Programming Graphically Using Computer
4.9 Summary of Graphical Method
4.10 Unbounded LP Problem
4.11 Let us Sum Up
4.12 Lesson-end Activity
4.13 Keywords
4.14 Questions for Discussion
4.15 Terminal Questions
4.16 Model Answers to Questions for Discussion
4.17 Suggested Readings
4.1 INTRODUCTION
Linear programming is a widely used mathematical modeling technique to determine the optimum allocation of scarce resources among competing demands. Resources typically include raw materials, manpower, machinery, time, money and space. The technique is very powerful and found especially useful because of its application to many different types of real business problems in areas like finance, production, sales and distribution, personnel, marketing and many more areas of management. As its name implies, the
linear programming model consists of linear objectives and linear constraints, which means that the variables in a model have a proportionate relationship. For example, an increase in manpower resource will result in an increase in work output.
Objective function
The objective of the problem is identified and converted into a suitable objective function. The objective function represents the aim or goal of the system (i.e., decision variables) which has to be determined from the problem. Generally, the objective in most cases will be either to maximize resources or profits or, to minimize the cost or time. For example, assume that a furniture manufacturer produces tables and chairs. If the manufacturer wants to maximize his profits, he has to determine the optimal quantity of tables and chairs to be produced. Let
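x1 = number of tables to be produced
p1 = profit per table
x2 = number of chairs to be produced
p2 = profit per chair
Hence, the total profit to be maximized is Z = p1 x1 + p2 x2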
Constraints
When resources are available in surplus, there is no problem in making decisions. But in real life, organizations normally have scarce resources within which the job has to be performed in the most effective way. Therefore, problem situations are within confined limits in which the optimal solution to the problem must be found. Considering the previous example of the furniture manufacturer, let w be the amount of wood available to produce tables and chairs. Each unit of table consumes w1 units of wood and each unit of chair consumes w2 units of wood. For the constraint of raw material availability, the mathematical expression is,
w1 x1 + w2 x2 ≤ w
In addition to raw material, other resources such as labour, machinery and time are also considered as constraint equations.
Non-negativity constraint
Negative values of physical quantities are impossible, like producing a negative number of chairs, tables, etc., so it is necessary to include the element of non-negativity as a constraint, i.e., x1, x2 ≥ 0
wm1 x1 + wm2 x2 + ... + wmn xn ≤ or = or ≥ wm
Non-negativity constraint, xi ≥ 0 (where i = 1, 2, 3, ..., n)
Check Your Progress 4.1
1 2.
Write your answer in the space given below. Please go through the lesson sub-head thoroughly; you will get your answers in it. This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
(iii) Labour, of which daily availability is 40 man-hours. The resources used are shown in Table 4.1. If the unit profit of round and square biscuits is Rs 3.00 and Rs 2.00 respectively, how many round and square biscuits should be produced to maximize total profit?
Table 4.1: Resources Used
Resources       Requirement/Unit        Daily availability
                Round      Square
Raw Material    100        115          1500 grams
Machine         10         12           720 minutes
Manpower        3          2            240 minutes
Solution: Key Decision: To determine the number of round and square biscuits to be produced.
Decision Variables: Let x1 be the number of round biscuits to be produced daily, and x2 be the number of square biscuits to be produced daily.
Objective function: It is given that the profit on each unit of round biscuits is Rs 3.00 and of square biscuits is Rs 2.00. The objective is to maximize profits; therefore, the total profit is given by the equation,
Zmax = 3x1 + 2x2
Constraints: The manufacturing process is constrained by the limited availability of raw material. The production of round biscuits uses 100x1 grams of raw material daily and the production of square biscuits uses 115x2 grams daily. It is given that the total availability of raw material per day is 1500 grams. Therefore, the constraint for raw material is,
100x1 + 115x2 ≤ 1500
Similarly, the constraint for machine hours is,
10x1 + 12x2 ≤ 720
and for the manpower is,
3x1 + 2x2 ≤ 240
Since the resources are to be used within or below the daily available level, the less-than-or-equal sign (≤) is used. Further, we cannot produce a negative number of units of biscuits, which is a non-negativity constraint expressed as,
x1 ≥ 0 and x2 ≥ 0
Thus, the linear programming model for the given problem is,
Maximize Z = 3x1 + 2x2
Subject to constraints,
100x1 + 115x2 ≤ 1500
10x1 + 12x2 ≤ 720
3x1 + 2x2 ≤ 240
where x1 ≥ 0, x2 ≥ 0
Example 2: Rahul Ads, an advertising company, is planning a promotional campaign for the client's product, i.e., sunglasses. The client is willing to spend Rs. 5 lakhs. It was decided to limit the campaign media to a weekly magazine, a daily newspaper and TV advertisement. The product is targeted at middle-aged men and women, and the following data was collected (Table 4.2).
Table 4.2: Data Collected
Campaign media      Cost per advertisement (Rs.)    Expected viewers
Weekly Magazine     30,000                          1,15,000
Daily Newspaper     45,000                          2,05,000
TV Advertisement    1,25,000                        7,00,000
The client wishes to spend no more than Rs. 1 lakh on advertisements in the weekly magazine and expects a minimum viewership of 21 lakh people for the television advertising. Maximize the number of viewers of the advertisements.
Solution: Key Decision: To determine the number of advertisements in the weekly magazine, in the daily newspaper and on TV.
Let x1 be the number of weekly magazine advertisements.
x2 be the number of daily newspaper advertisements.
x3 be the number of TV advertisements.
Objective function: The objective is to maximize the number of viewers through all media. The total viewers will be given by the equation,
Zmax = 115000x1 + 205000x2 + 700000x3
Constraints: Firstly, the client is willing to spend Rs. 500000 on all media,
30000x1 + 45000x2 + 125000x3 ≤ 500000 or 30x1 + 45x2 + 125x3 ≤ 500 ..........................(i)
Secondly, a minimum of 2100000 people should view the television advertising,
700000x3 ≥ 2100000 or x3 ≥ 3 ..........................(ii)
Lastly, the client is interested to pay only Rs. 100000 in weekly magazine advertising,
30000x1 ≤ 100000 or 3x1 ≤ 10 ..........................(iii)
Summarizing the LP model for the given problem,
Maximize Z = 115000x1 + 205000x2 + 700000x3
Subject to constraints,
30x1 + 45x2 + 125x3 ≤ 500
x3 ≥ 3
3x1 ≤ 10
where x1, x2, x3 ≥ 0
Example 3: The data given in Table 4.3 represents the shipping cost (in Rs.) per unit for shipping from each warehouse to each distribution centre. The supply and demand data of each warehouse and distribution centre is given. Determine how many units should be shipped from each warehouse to each centre in order to minimize the overall transportation cost.
Table 4.3: Data Shows Shipping Cost from Warehouse to Distribution
Warehouse    Distribution Centre         Supply
             1        2        3
1            9        10       11        150
2            4        6        8         250
Demand       150      100      150       400
Solution: Decision Variables: Let xij be the number of units to be shipped from warehouse i to distribution centre j.
x11 be the number of units to be shipped from warehouse 1 to distribution centre 1.
x12 be the number of units to be shipped from warehouse 1 to distribution centre 2.
x13 be the number of units to be shipped from warehouse 1 to distribution centre 3.
x21 be the number of units to be shipped from warehouse 2 to distribution centre 1.
x22 be the number of units to be shipped from warehouse 2 to distribution centre 2.
x23 be the number of units to be shipped from warehouse 2 to distribution centre 3.
Objective Function: Table 4.3 shows the transportation cost from each warehouse to each distribution centre. Therefore 9x11 represents the total cost of shipping x11 units from warehouse 1 to distribution centre 1. The objective is to minimize the transportation cost. Therefore, the objective function is,
Minimize Z = 9x11 + 10x12 + 11x13 + 4x21 + 6x22 + 8x23
Constraints: The supply constraints require that each warehouse ships no more than its supply, and the demand constraints require that each distribution centre receives the units it demands. Since the given table is a 2 × 3 matrix, we have a total of 5 constraints apart from the non-negativity constraint. The constraints are as follows,
x11 + x12 + x13 ≤ 150
x21 + x22 + x23 ≤ 250
x11 + x21 = 150
x12 + x22 = 100
x13 + x23 = 150
where xij ≥ 0 (i = 1, 2 and j = 1, 2, 3)
Thus the LP model for the given transportation problem is summarized as,
Minimize Z = 9x11 + 10x12 + 11x13 + 4x21 + 6x22 + 8x23
Subject to constraints,
x11 + x12 + x13 ≤ 150 ..........................(i)
x21 + x22 + x23 ≤ 250 ..........................(ii)
x11 + x21 = 150 ..........................(iii)
x12 + x22 = 100 ..........................(iv)
x13 + x23 = 150 ..........................(v)
where xij ≥ 0 (i = 1, 2 and j = 1, 2, 3)
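To see how the model above is used, the following Python sketch evaluates a candidate shipping plan against constraints (i) to (v) and computes its cost Z. The cost, supply and demand figures come from Table 4.3; the two plans and the function names are hypothetical and chosen only for illustration. The sketch checks plans; it does not search for the optimum.

```python
# Data from Table 4.3: cost[i][j] is the cost of shipping one unit
# from warehouse i+1 to distribution centre j+1.
cost = [[9, 10, 11],
        [4, 6, 8]]
supply = [150, 250]
demand = [150, 100, 150]


def is_feasible(plan):
    """Check the supply (<=), demand (=) and non-negativity constraints."""
    supply_ok = all(sum(plan[i]) <= supply[i] for i in range(2))
    demand_ok = all(plan[0][j] + plan[1][j] == demand[j] for j in range(3))
    non_negative = all(x >= 0 for row in plan for x in row)
    return supply_ok and demand_ok and non_negative


def total_cost(plan):
    """Objective function Z: sum of cost times units over all routes."""
    return sum(cost[i][j] * plan[i][j] for i in range(2) for j in range(3))


# Two hypothetical plans: rows are warehouses, columns are distribution centres.
plan_a = [[150, 0, 0], [0, 100, 150]]
plan_b = [[0, 0, 150], [150, 100, 0]]

for name, plan in (("A", plan_a), ("B", plan_b)):
    print(f"plan {name}: feasible = {is_feasible(plan)}, Z = {total_cost(plan)}")
```

Both plans satisfy every constraint, yet their costs differ, which is why a systematic method is needed to pick the cheapest feasible plan.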
Example 4: Sivakumar & Co. manufactures two types of T-shirts, one with collar and another without collar. Each T-shirt with collar yields a profit of Rs. 20, while each T-shirt without collar yields Rs. 30. A shirt with collar requires 15 minutes of cutting and 25 minutes of stitching. A shirt without collar requires 10 minutes of cutting and 20 minutes of stitching. The full shift time is available for cutting in an 8 hour shift, but only 6 hours are available for stitching. Formulate the problem as an LP model to maximize the profit.
Solution: Key decision: To determine the number of T-shirts with collar and without collar to be manufactured.
Decision variables: Let x1 be the number of T-shirts with collar
x2 be the number of T-shirts without collar
Objective Function: Zmax = 20x1 + 30x2
Constraints:
15x1 + 10x2 ≤ 8 × 60 (Cutting) ..........................(i)
25x1 + 20x2 ≤ 6 × 60 (Stitching) ..........................(ii)
Non-negativity constraints: x1 ≥ 0, x2 ≥ 0
The linear programming model is,
Zmax = 20x1 + 30x2
Subject to constraints,
15x1 + 10x2 ≤ 480 ..........................(i)
25x1 + 20x2 ≤ 360 ..........................(ii)
where x1, x2 ≥ 0
Example 5: An agricultural urea company must daily produce 500 kg of a mixture consisting of ingredients x1, x2 and x3. Ingredient x1 costs Rs. 30 per kg, x2 Rs. 50 per kg and x3 Rs. 20 per kg. Due to raw material constraint, not more than 100 kg of x1, 70 kg of x2 and 45 kg of x3 must be used. Determine how much of each ingredient should be used if the company wants to minimize the cost.
Solution: Let x1 be the kg of ingredient x1 to be used
x2 be the kg of ingredient x2 to be used
x3 be the kg of ingredient x3 to be used
The objective is to minimize the cost,
Minimize Z = 30x1 + 50x2 + 20x3
Subject to constraints,
x1 + x2 + x3 = 500 (total production) .......................(i)
x1 ≤ 100 (max. use of x1) .......................(ii)
x2 ≤ 70 (max. use of x2) .......................(iii)
x3 ≤ 45 (max. use of x3) .......................(iv)
where x1, x2, x3 ≥ 0 (non-negativity)
Example 6: Chandru Bag Company produces two types of school bags: deluxe and ordinary. If the company is producing only ordinary bags, it can make a total of 200 ordinary bags a day. A deluxe bag requires twice as much labour and time as an ordinary bag. The demand for deluxe bags and ordinary bags is 75 and 100 bags per day respectively. The deluxe bag yields a profit of Rs 12.00 per bag and the ordinary bag yields a profit of Rs. 7.00 per bag. Formulate the problem as an LP model.
Solution: Let x1 be deluxe bags to be produced per day
x2 be ordinary bags to be produced per day
Objective function: The objective is to maximize the profit. The deluxe bag yields a profit of Rs. 12.00 per bag and the ordinary bag yields a profit of Rs. 7.00 per bag.
Maximize Z = 12x1 + 7x2
Constraints: There are two kinds of constraint in the problem, the "number of bags" constraint and the "demand" constraints. It is given that the deluxe bag takes twice as much time as the ordinary bag and that, if only ordinary bags are produced, the company can make 200 bags. The constraint is,
2x1 + x2 ≤ 200
The demand for the deluxe bag is 75 bags and for the ordinary bag is 100 bags. The constraints are,
x1 ≤ 75
x2 ≤ 100
and the non-negativity constraint is,
x1 ≥ 0, x2 ≥ 0
The LP formulation is
Maximize Z = 12x1 + 7x2
Subject to constraints,
2x1 + x2 ≤ 200 ..........................(i)
x1 ≤ 75 ..........................(ii)
x2 ≤ 100 ..........................(iii)
where x1, x2 ≥ 0
Example 7: Geetha Perfume Company produces both perfumes and body spray from two flower extracts F1 and F2. The following data is provided:
Table 4.4: Data Collected

                          Litres of Extract                Daily Availability
                          Perfume        Body Spray        (litres)
Flower extract F1         8              4                 20
Flower extract F2         2              3                 8
Profit per litre (Rs.)    7              5
The maximum daily demand of body spray is 20 bottles of 100 ml each. A market survey indicates that the daily demand of body spray cannot exceed that of perfume by more than 2 litres. The company wants to find out the optimal mix of perfume and body spray that maximizes the total daily profit. Formulate the problem as a linear programming model.
Solution: Let
x1 be the litres of perfume produced daily
x2 be the litres of body spray produced daily
Objective function: The company wants to increase the profit by optimal product mix
Zmax = 7x1 + 5x2
Constraints: The total availability of flower extract F1 and flower extract F2 are 20 and 8 litres respectively. The sum of flower extract F1 used for perfume and body spray must not exceed 20 litres. Similarly, flower extract F2 must not exceed 8 litres daily. The constraints are,
8x1 + 4x2 ≤ 20 (Flower extract F1)
2x1 + 3x2 ≤ 8 (Flower extract F2)
The daily demand of body spray x2 is limited to 20 bottles of 100 ml each (i.e., 20 × 100 = 2000 ml = 2 litres). Therefore,
x2 ≤ 2
Again, there is an additional restriction that the amount by which the daily production of body spray exceeds that of perfume, x2 - x1, does not exceed 2 litres, which is expressed as
x2 - x1 ≤ 2 (or) -x1 + x2 ≤ 2
The model for Geetha Perfume Company is,
Maximize Z = 7x1 + 5x2
Subject to constraints,
8x1 + 4x2 ≤ 20 ...(i)
2x1 + 3x2 ≤ 8 ...(ii)
-x1 + x2 ≤ 2 ...(iii)
x2 ≤ 2 ...(iv)
where x1, x2 ≥ 0
Feasible Solution: Any values of x1 and x2 that satisfy all the constraints of the model constitute a feasible solution. For example, in the above problem, if the values x1 = 2 and x2 = 1 are substituted in the constraints, we get
(i) 8(2) + 4(1) ≤ 20, i.e., 20 ≤ 20
(ii) 2(2) + 3(1) ≤ 8, i.e., 7 ≤ 8
(iii) -2 + 1 ≤ 2, i.e., -1 ≤ 2
(iv) 1 ≤ 2
All the above constraints (including the non-negativity constraint) are satisfied. The objective function value for x1 = 2 and x2 = 1 is
Zmax = 7(2) + 5(1) = 14 + 5 = Rs. 19.00
As said earlier, all the values that do not violate the constraints are feasible solutions. But the problem is to find the values of x1 and x2 that give the optimum feasible solution, the one that maximizes the profit. These optimum values of x1 and x2 can be found by using the Graphical Method or the Simplex Method. (The above problem is solved using the graphical method in Example 11.)
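The substitution check above can also be carried out mechanically. The following is a minimal sketch, assuming each constraint of the Geetha Perfume model is stored as a triple (a1, a2, rhs) meaning a1*x1 + a2*x2 ≤ rhs; the names CONSTRAINTS, is_feasible and profit are illustrative and do not come from the text.

```python
# Constraints of the Geetha Perfume model, each stored as (a1, a2, rhs),
# meaning a1*x1 + a2*x2 <= rhs. The storage format is an assumption.
CONSTRAINTS = [
    (8, 4, 20),   # (i)   flower extract F1
    (2, 3, 8),    # (ii)  flower extract F2
    (-1, 1, 2),   # (iii) body spray exceeds perfume by at most 2 litres
    (0, 1, 2),    # (iv)  demand limit on body spray
]


def is_feasible(x1, x2, constraints=CONSTRAINTS, tol=1e-9):
    """Return True if (x1, x2) satisfies every constraint and non-negativity."""
    if x1 < -tol or x2 < -tol:
        return False
    return all(a1 * x1 + a2 * x2 <= rhs + tol for a1, a2, rhs in constraints)


def profit(x1, x2):
    """Objective function Z = 7*x1 + 5*x2 of the model."""
    return 7 * x1 + 5 * x2


for point in [(2, 1), (3, 1)]:
    print(point, is_feasible(*point), profit(*point))
```

The point (2, 1) passes every test and earns Rs. 19, while (3, 1) fails constraint (i) because 8(3) + 4(1) = 28 exceeds 20.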
Objective Function: The objective is to maximize the profits. Given profits on corrugated box and carton box are Rs. 6 and Rs. 4 respectively.
The objective function is,
Zmax = 6x1 + 4x2
Constraints: The available machine-hours for each machine and the time consumed by each product are given. Therefore, the constraints are,
2x1 + 3x2 ≤ 120 ...(i)
2x1 + x2 ≤ 60 ...(ii)
where x1, x2 ≥ 0
Graphical Solution: As a first step, the inequality constraints are converted into equations by replacing each inequality sign with an equal-to sign:
2x1 + 3x2 = 120 ...(1)
2x1 + x2 = 60 ...(2)
Find the co-ordinates of the lines by substituting x1 = 0 and x2 = 0 in each equation. In equation (1), put x1 = 0 to get x2 and vice versa 2x1 + 3x2 = 120 2(0) + 3x2 = 120, x2 = 40 Similarly, put x2 = 0, 2x1 + 3x2 = 120 2x1 + 3(0) = 120, x1 = 60 The line 2x1 + 3x2 = 120 passes through co-ordinates (0, 40) (60, 0). The line 2x1 + x2 = 60 passes through co-ordinates (0,60)(30,0). The lines are drawn on a graph with horizontal and vertical axis representing boxes x1 and x2 respectively. Figure 4.1 shows the first line plotted.
The inequality constraint of the first line is of the ≤ (less than or equal to) type, which means the feasible solution zone lies towards the origin. The feasible area is shown in Figure 4.2. (Note: If the constraint is of the ≥ type, then the solution zone lies away from the origin, in the opposite direction.) Now the second constraint line is drawn.
Figure 4.2: Graph Showing Feasible Area
When the second constraint is drawn, you may notice that a portion of the feasible area is cut. This indicates that, when both constraints are considered, the feasible region is reduced further. Now any point in the shaded portion will satisfy both constraints. For example, let the solution point be (15, 20), which lies in the feasible region. If this point is substituted in the constraints, it should satisfy the conditions:
2x1 + 3x2 ≤ 120: 30 + 60 ≤ 120, i.e., 90 ≤ 120
2x1 + x2 ≤ 60: 30 + 20 ≤ 60, i.e., 50 ≤ 60
Now, the objective is to maximize the profit. The point that lies at the furthermost point of the feasible area will give the maximum profit. To locate the point, we need to plot the objective function (profit) line. Equate the objective function to any specific profit value Z. Consider a Z-value of 60, i.e.,
6x1 + 4x2 = 60
Substituting x1 = 0, we get x2 = 15, and if x2 = 0, then x1 = 10. Therefore, the co-ordinates for the objective function line are (0, 15) and (10, 0), as indicated by the dotted line L1 in Figure 4.2. The objective function line contains all combinations of values of x1 and x2 that give this profit. The line L1 does not give the maximum profit because the furthermost point of the feasible area lies above the line L1. Move the line (parallel to line L1) away from the origin to locate the furthermost point. The point P is the furthermost point, since no feasible area is seen beyond it. Take the corresponding values of x1 and x2 from point P, which are 15 and 30 respectively; these are the optimum feasible values of x1 and x2.
Therefore, we conclude that to maximize profit, 15 corrugated boxes and 30 carton boxes should be produced. Substituting x1 = 15 and x2 = 30 in the objective function, we get
Zmax = 6x1 + 4x2 = 6(15) + 4(30) = 90 + 120
Maximum profit: Rs. 210.00
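Sliding the profit line until it leaves the feasible area always ends at a corner of that area when the region is bounded, so the same answer can be obtained by listing the corner points and evaluating Z at each. The sketch below is not the graphical procedure itself but a corner-point enumeration substituted for it; the function name corner_points and the (a1, a2, rhs) storage of "≤" constraints are assumptions made for illustration.

```python
from itertools import combinations


def corner_points(constraints, tol=1e-9):
    """Corner points of {a1*x1 + a2*x2 <= b for every row, x1 >= 0, x2 >= 0}."""
    # Non-negativity is added as the boundary lines x1 = 0 and x2 = 0.
    lines = list(constraints) + [(-1.0, 0.0, 0.0), (0.0, -1.0, 0.0)]
    points = []
    for (a1, a2, b), (c1, c2, d) in combinations(lines, 2):
        det = a1 * c2 - a2 * c1
        if abs(det) < tol:              # parallel lines do not intersect
            continue
        # Cramer's rule for the intersection of the two boundary lines.
        x1 = (b * c2 - a2 * d) / det
        x2 = (a1 * d - b * c1) / det
        # Keep the intersection only if it satisfies every constraint.
        if all(p * x1 + q * x2 <= r + tol for p, q, r in lines):
            points.append((x1, x2))
    return points


# Packaging problem: 2x1 + 3x2 <= 120 and 2x1 + x2 <= 60, Z = 6x1 + 4x2.
box_constraints = [(2.0, 3.0, 120.0), (2.0, 1.0, 60.0)]
corners = corner_points(box_constraints)
best = max(corners, key=lambda p: 6 * p[0] + 4 * p[1])
print(best, 6 * best[0] + 4 * best[1])
```

For the packaging problem the feasible corners are the origin, (30, 0), (0, 40) and (15, 30), and the largest Z is found at (15, 30).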
Now, go to the Solve Menu, click Graphical in the 'solve problem' options, and then press Go to Output. The output screen is displayed with the graph grid on the right-hand side and the equations on the left-hand side. To plot the graphs one by one, click the first constraint equation. The line for the first constraint is drawn connecting the points (0, 40) and (60, 0). Now, click the second equation to draw the second line on the graph. You can notice that a portion of the graph is cut when the second constraint is also taken into consideration. This means the feasible area is reduced further. Click on the objective function equation. The objective function line locates the furthermost point (maximization) in the feasible area, which is (15, 30), as shown in Figure 4.4 below.
Example 9: A soft drink manufacturing company has 300 ml and 150 ml canned cola as its products with profit margin of Rs. 4 and Rs. 2 per unit respectively. Both the products have to undergo process in three types of machine. The following Table 4.5, indicates the time required on each machine and the available machine-hours per week.
Table 4.5: Available Data

Requirement    Cola 300 ml    Cola 150 ml    Available machine-hours per week
Machine 1      3              2              300
Machine 2      2              4              480
Machine 3      5              7              560
Formulate the linear programming problem specifying the product mix which will maximize the profits within the limited resources. Also solve the problem using computer.
Solution: Let x1 be the number of units of 300 ml cola and x2 be the number of units of 150 ml cola to be produced. Formulating the given problem, we get
Objective function:
Zmax = 4x1 + 2x2
Subject to constraints,
3x1 + 2x2 ≤ 300 ...(i)
2x1 + 4x2 ≤ 480 ...(ii)
5x1 + 7x2 ≤ 560 ...(iii)
where x1, x2 ≥ 0
The inequalities are removed to give the following equations:
3x1 + 2x2 = 300 ...(iv)
2x1 + 4x2 = 480 ...(v)
5x1 + 7x2 = 560 ...(vi)
Therefore,
Line 3x1 + 2x2 = 300 passes through (0, 150), (100, 0)
Line 2x1 + 4x2 = 480 passes through (0, 120), (240, 0)
Line 5x1 + 7x2 = 560 passes through (0, 80), (112, 0)
For the objective function, the line 4x1 + 2x2 = 0 passes through (-10, 20) and (10, -20). Plot the lines on the graph as shown in the computer output, Figure 4.5. The objective is to maximize the profit. Move the objective function line away from the origin by drawing parallel lines. The furthermost point of the feasible area that the line touches is (100, 0). Therefore, the values of x1 and x2 are 100 and 0 respectively.
Maximum Profit, Zmax = 4x1 + 2x2 = 4(100) + 2(0) = Rs. 400.00
Example 10: Solve the following LPP by graphical method.
Minimize Z = 18x1 + 12x2
Subject to constraints,
2x1 + 4x2 ≤ 60 ...(i)
3x1 + x2 ≥ 30 ...(ii)
8x1 + 4x2 ≥ 120 ...(iii)
where x1, x2 ≥ 0
Solution: The inequality constraints are removed to give the equations,
2x1 + 4x2 = 60 ...(iv)
3x1 + x2 = 30 ...(v)
8x1 + 4x2 = 120 ...(vi)
The equation lines pass through the co-ordinates as follows. For the constraints,
2x1 + 4x2 = 60 passes through (0, 15), (30, 0).
3x1 + x2 = 30 passes through (0, 30), (10, 0).
8x1 + 4x2 = 120 passes through (0, 30), (15, 0).
The objective function line 18x1 + 12x2 = 0 passes through (-10, 15), (10, -15).
Plot the lines on the graph as shown in Figure 4.6. Here the objective is minimization. Move the objective function line and locate the point in the feasible region which is nearest to the origin, i.e., at the shortest distance from the origin. This is the point P, which lies on the x axis. The co-ordinates of the point P are (15, 0), i.e., x1 = 15 and x2 = 0. The minimum value of Z is
Zmin = 18x1 + 12x2 = 18(15) + 12(0) = Rs. 270.00
If the solution point is a single point on a line, take the corresponding values of x1 and x2. If the solution point lies at the intersection of two lines, solve for x1 and x2 using the two equations. If the solution appears as a small line segment, multiple solutions exist. If the feasible region has no confined boundary in the direction of optimization, the solution is said to be an unbounded solution.
Example 11: Solve the Geetha Perfume Company problem (Example 7) graphically using computer. The formulated LP model is,
Zmax = 7x1 + 5x2
Subject to constraints,
8x1 + 4x2 ≤ 20 ...(i)
2x1 + 3x2 ≤ 8 ...(ii)
-x1 + x2 ≤ 2 ...(iii)
x2 ≤ 2 ...(iv)
where x1, x2 ≥ 0
Solution: The input values of the problem are given to obtain the output screen as shown in Figure 4.7.
Results:
Perfumes to be produced, x1 = 1.75 litres, or 17.5 (say 18) bottles of 100 ml each
Body sprays to be produced, x2 = 1.50 litres, or 15 bottles of 100 ml each
Maximum profit, Zmax = Rs. 19.75
Check Your Progress 4.2
Discuss the limitations of graphical method in solving LPP.
Notes:
(a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
The given problem is maximization one. The solution point should be located at the furthermost point of the feasible region.
The feasible zone (shaded area) shown in Figure 4.8 is open-ended, i.e., it has no confined boundary. This means that the maximization is not possible or the LPP has no finite solution, and hence the solution is unbounded. Example 13: Solve the given linear programming problem graphically using a computer. Maximize Z = 3x1 + 2x2 Subject to constraints x1 x2 1 x1 + x2 3 x1 , x2 0 Solution: The input as required is entered into the TORA input screen, the following output is obtained as shown in Figure 4.9 which shows that the solution is unbounded. ..........................(i) ..........................(ii)
Figure 4.9: Graphical LP Solution (Output Screen, TORA)
Check Your Progress 4.3
What is an unbounded LP problem?
Notes:
(a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
4.13 KEYWORDS
Linear Programming Graphical Method Maximisation Minimisation Constraints Profit Optimality
Briefly comment on the following statements:
(a) Formulation of LP is the representation of problem situation in a mathematical form.
(b) A model must have an objective function.
(c) When feasible zone lies towards the origin.
(d) LP techniques are used to optimize the resource for best result.
(e) LP techniques are used in analyzing the effect of changes.
3. Fill in the blanks:
(a) Organizations normally have _________ resources.
(b) A model has a _________ constraint.
(c) In real life, the two _________ problems are practiced very little.
(d) _________ refer to the products, workers, efficiency, and machines are assumed to be identical.
(e) The _________ function represents the aim or goal of the system.
Exercise Problems
1. For the problem given in Example 7, formulate the constraints for the following without any change in R.H.S.:
(a) The flower extract F1 must be used at most to 15 litres and at least 5 litres.
(b) The demand for perfume cannot be less than the demand for body spray.
(c) The daily demand of body spray exceeds that of perfume by at least 2 litres.
2. For the problem given in Example 7, determine the best feasible solution among the following values of x1 and x2:
(a) x1 = 2, x2 = 1
(b) x1 = 0, x2 = 3
(c) x1 = 3, x2 = 1
(d) x1 = 5, x2 = 1
(e) x1 = 2, x2 = 1
(f) x1 = 1.75, x2 = 1.50
3.
Determine the feasible space for each of the following constraints: (a) (b) (c) (d) 2x1 2x2 5 5x1 + 10x2 60 x1 x2 0 4x1 + 3x2 15
(e) (f) 4.
x2 5 x1 30
A company manufactures two types of products, A and B. Each product uses two processes, I and II. The processing time per unit of product A on process I is 6 hours and on the process II is 5 hours. The processing time per unit of product B on process I is 12 hours and on process II is 4 hours. The maximum number of hours available per week on process I and II are 75 and 55 hours respectively. The profit per unit of selling A and B are Rs.12 and Rs.10 respectively. (i) Formulate a linear programming model so that the profit is maximized. (ii) Solve the problem graphically and determine the optimum values of product A and B.
5.
6. A nutrition scheme for babies is proposed by a committee of doctors. Babies can be given two types of food (I and II) which are available in standard sized packets, weighing 50 gms. The cost per packet of these foods are Rs. 2 and Rs. 3 respectively. The vitamin availability in each type of food per packet and the minimum vitamin requirement for each type of vitamin are summarized in the table given. Develop a linear programming model to determine the optimal combination of food type with the minimum cost such that the minimum requirement of vitamin in each type is satisfied.
Details of food type

                       Vitamin availability per product    Minimum Daily
Vitamin                Food Type I      Food Type II        requirement
1                      1                1                   6
2                      7                1                   14
Cost/Packet (Rs.)      2                3
7.
8. Solve the Chandru Bag company problem graphically.
(a) Determine the values of x1, x2 and Zmax.
(b) If the company has increased the demand for ordinary bag from 100 to 150, what is the new Zmax value?
(c) If the demand for deluxe bags has reduced to 50 bags, determine the optimal profit value.
9.
Solve the following linear programming model graphically: Maximize Z = 30x1 + 100x2 Subject to constraints, 4x1 + 6x2 90 8x1 + 6x2 100 5x1 + 4x2 80 where x1 , x2 0
10. Solve the following LP graphically: Maximize Z = 8x1 + 10x2 Subject to constraints, 2x1 + 3x2 20 4x1 + 2x2 25 where 11. x1 , x2 0 Solve the two variable constraints using graphical method. Maximize Z = 50x1 + 40x2 Subject to constraints x1 20 x2 25 2x1 + x2 60 where x1 , x2 0 12. Solve the following LP graphically using TORA. Maximize Z = 1200x1 + 1000x2 Subject to constraints, 10x1 + 4x2 600 7x1 + 10x2 300 2x1 + 4x2 1000 9x1 + 7x2 2500 5x1 + 4x2 1200 where 13. Solve graphically: Maximize Z = 2x1 + 3x2 Subject to constraints, x1 x2 0 3x1 + x2 25 where x1 , x2 0
x1 , x2 0
14. Solve the following LP graphically: Maximize Z = 8x1 + 10x2 Subject to constraints, 0.5x1 + 0.5x2 150 0.6x1 + 0.4x2 145 x1 30 x1 150 x2 40 x2 200 where x1 , x 2 0 15. Determine the optimal values of x1 and x2 and hence find the maximum profits for the following LP problem: Maximize Z = 4x1 + 5x2 Subject to constraints x1 + 3x2 2 4x1 + 5x2 6 where x1 , x 2 0
ANSWERS TO QUESTIONS FOR
(b) True (c) False (d) True (e) False
(b) non-negative
LESSON 5
LINEAR PROGRAMMING: SIMPLEX METHOD
CONTENTS
5.0 Aims and Objectives
5.1 Introduction
5.2 Additional Variables used in Solving LPP
5.3 Maximization Case
5.4 Solving LP Problems Using Computer with TORA
5.5 Minimization LP Problems
5.6 Big M Method
5.7 Degeneracy in LP Problems
5.8 Unbounded Solutions in LPP
5.9 Multiple Solutions in LPP
5.10 Duality in LP Problems
5.11 Sensitivity Analysis
5.12 Let us Sum Up
5.13 Lesson-end Activities
5.14 Keywords
5.15 Questions for Discussion
5.16 Terminal Questions
5.17 Model Answers to Questions for Discussion
5.18 Suggested Readings
5.1 INTRODUCTION
In practice, most problems contain more than two variables and are consequently too large to be solved graphically. Therefore, an algebraic technique, the Simplex Method, is used to solve large problems. This method is carried out as a systematic, step-by-step iterative process until the maximum or minimum value of the objective function is attained.
The basic concepts of the simplex method are explained using Example 8, the packaging product mix problem illustrated in the previous lesson. The simplex method solves the linear programming problem in iterations, improving the value of the objective function at each step. The simplex approach not only yields the optimal solution but also other valuable information for economic and 'what if' analysis.
The above variables are used to convert the inequalities into equality equations, as given in the Table 5.1 below.
Table 5.1: Types of Additional Variables

     Constraint Type                  Variable added                                           Format
(a)  Less than or equal to (≤)        Add slack variable                                       +S
(b)  Greater than or equal to (≥)     Subtract surplus variable and add artificial variable    -S + a
(c)  Equal to (=)                     Add artificial variable                                  +a
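The rules of Table 5.1 can be applied row by row. The following is a rough sketch, assuming each constraint is given as (coefficients, relation, rhs); the function name to_standard_form and the naming of the added variables by row number (S1, a1, ...) are illustrative choices rather than the numbering used in the worked examples.

```python
def to_standard_form(constraints):
    """Turn inequalities into equations by adding the variables of Table 5.1.

    constraints: list of (coeffs, relation, rhs), relation in {"<=", ">=", "="}.
    Returns (rows, names). Each row holds the original coefficients, then one
    column per added variable, then the right-hand side.
    """
    added = []  # (row index, variable name, coefficient in that row)
    for i, (_, relation, _) in enumerate(constraints, start=1):
        if relation == "<=":
            added.append((i, f"S{i}", 1))       # slack: +S
        elif relation == ">=":
            added.append((i, f"S{i}", -1))      # surplus: -S
            added.append((i, f"a{i}", 1))       # artificial: +a
        elif relation == "=":
            added.append((i, f"a{i}", 1))       # artificial: +a
        else:
            raise ValueError(f"unknown relation: {relation}")
    rows = []
    for i, (coeffs, _, rhs) in enumerate(constraints, start=1):
        extra = [coef if row == i else 0 for row, _, coef in added]
        rows.append(list(coeffs) + extra + [rhs])
    names = [name for _, name, _ in added]
    return rows, names


# Three illustrative constraints, one of each type.
rows, names = to_standard_form([
    ((4, 1), "=", 4),
    ((5, 3), ">=", 7),
    ((3, 2), "<=", 6),
])
print(names)
for row in rows:
    print(row)
```

Each "≥" row receives two columns (a surplus with coefficient -1 and an artificial with +1), which matches the -S + a format in the table.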
If variables x1 and x2 are equated to zero, i.e., x1 = 0 and x2 = 0, then
S3 = 120
S4 = 60
This is the basic solution of the system; variables S3 and S4 are known as Basic Variables, SB, while x1 and x2 are known as Non-Basic Variables. If all the variables are non-negative, a basic solution of a linear programming problem is called a Basic Feasible Solution. Rewriting the constraints with slack variables gives us,
Zmax = 6x1 + 4x2 + 0S3 + 0S4
Subject to constraints,
2x1 + 3x2 + S3 = 120 ...(i)
2x1 + x2 + S4 = 60 ...(ii)
where x1, x2 ≥ 0
Though there are many forms of presenting the Simplex Table for calculation, we represent the coefficients of the variables in a tabular form as shown in Table 5.2.
Table 5.2: Co-efficients of Variables

Iteration   Basic       Solution   X1      X2    S3    S4    Minimum   Equation
Number      Variables   Value      (KC)                      Ratio
0           S3          120        2       3     1     0     60
            S4          60         2       1     0     1     30
            Zj          0          6       4     0     0
If the objective of the given problem is maximization, enter the coefficients of the objective function Zj with opposite sign, as shown in Table 5.3. Take the most negative coefficient of the objective function; its column is the key column, Kc. In this case, it is -6. Find the ratio between the solution value and the key column coefficient for each row and enter it in the minimum ratio column. The row with the minimum ratio (here 30, row S4) is the key row, Kr. The coefficient at the intersection of the key column and the key row is called the pivotal element, i.e., 2. The variable corresponding to the key column is the entering element of the next iteration table, and the variable corresponding to the key row is the leaving element. In other words, x1 replaces S4 in the next iteration table. Table 5.3 indicates the key column, key row and the pivotal element.
Table 5.3

Iteration   Basic       Solution   X1      X2    S3    S4    Minimum   Equation
Number      Variables   Value      (KC)                      Ratio
0           S3          120        2       3     1     0     60
     Kr     S4          60         2       1     0     1     30
            -Zj         0          -6      -4    0     0
In the next iteration, enter the basic variables by eliminating the leaving variable (i.e., key row) and introducing the entering variable (i.e., key column). Make the pivotal element 1 and enter the values of the other elements in that row accordingly. In this case, convert the pivotal element value 2 to 1 in the next iteration table: divide the pivotal element by 2, and similarly divide the other elements in that row by 2. The equation is S4 / 2. This row is called the Pivotal Equation Row, Pe. The other coefficients of the key column must be made zero in the next iteration table. For this, a solver, Q, is formed for easy calculation. Change the sign of the key column coefficient, multiply it by the pivotal equation element and add the result to the corresponding row to get the equation,
Solver, Q = SB + (-Kc × Pe)
For S3, Q = SB + (-Kc × Pe) = S3 + (-2 × Pe) = S3 - 2Pe ...(i)
For Z, Q = SB + (-Kc × Pe) = Z + (-(-6) × Pe) = Z + 6Pe ...(ii)
Using equations (i) and (ii), the values of S3 and Z for iteration 1 are found as shown in Table 5.4.
Table 5.4: S3 and Z Values Calculated

Iteration   Basic       Solution   X1     X2     S3    S4     Minimum   Equation
Number      Variables   Value                                 Ratio
0           S3          120        2      3      1     0      60
     Kr     S4          60         2      1      0     1      30
            -Zj         0          -6     -4     0     0
1    Kr     S3          60         0      2      1     -1     30        S3 - 2Pe
     Pe     x1          30         1      1/2    0     1/2    60        S4 / 2
            -Zj         180        0      -1     0     3                Z + 6Pe

The key column (KC) is X1 in iteration 0 and X2 in iteration 1. The equations for the rows of iteration 1 in Table 5.4 are S3 - 2Pe, S4 / 2 and Z + 6Pe.
Using these equations, enter the values of the basic variables SB and the objective function Z. If all the values in the objective function row are non-negative, the solution is optimal. Here, we have one negative value, -1. Repeat the steps to find the key row and pivotal equation values for iteration 2 and check for optimality. In iteration 2 (Table 5.5), all the values of Zj are non-negative, Zj ≥ 0, hence optimality is reached. The corresponding values of x1 and x2 in the final iteration table give the optimal values of the decision variables, i.e., x1 = 15, x2 = 30. Substituting these values in the objective function, we get
Zmax = 6x1 + 4x2 = 6(15) + 4(30) = 90 + 120 = Rs. 210.00
The solution is, x1 = 15 corrugated boxes are to be produced and x2 = 30 carton boxes are to be produced to yield a Profit, Zmax = Rs. 210.00
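The iterations described above (most negative objective entry as key column, minimum ratio as key row, division by the pivotal element, and elimination of the key column from the other rows) can be sketched in a few lines. This is an illustrative implementation under stated assumptions, not the procedure used by TORA: it handles only maximization with "≤" constraints and non-negative right-hand sides, it has no safeguard against cycling, and the function name simplex_max is invented for the example. Exact fractions are used so that the tableau entries match hand calculation.

```python
from fractions import Fraction


def simplex_max(c, A, b):
    """Maximise c.x subject to A x <= b, x >= 0 (all b >= 0) by the tableau method."""
    m, n = len(A), len(c)
    # Constraint rows: [coefficients of x | slack identity | solution value].
    rows = [[Fraction(v) for v in A[i]]
            + [Fraction(int(i == j)) for j in range(m)]
            + [Fraction(b[i])] for i in range(m)]
    # Objective row entered with opposite signs.
    z = [Fraction(-v) for v in c] + [Fraction(0)] * (m + 1)
    basis = [n + i for i in range(m)]          # the slacks start as basic variables
    while min(z[:-1]) < 0:
        # Key column: most negative entry of the objective row.
        kc = min(range(n + m), key=lambda j: z[j])
        # Minimum ratio test over rows with a positive key-column entry.
        ratios = [(rows[i][-1] / rows[i][kc], i)
                  for i in range(m) if rows[i][kc] > 0]
        if not ratios:
            raise ValueError("the problem is unbounded")
        _, kr = min(ratios)                    # key row
        pivot = rows[kr][kc]
        rows[kr] = [v / pivot for v in rows[kr]]           # pivotal equation row Pe
        for i in range(m):
            if i != kr:
                f = rows[i][kc]
                rows[i] = [v - f * p for v, p in zip(rows[i], rows[kr])]
        f = z[kc]
        z = [v - f * p for v, p in zip(z, rows[kr])]       # e.g. Z + 6Pe when f = -6
        basis[kr] = kc
    x = [Fraction(0)] * n
    for i, var in enumerate(basis):
        if var < n:
            x[var] = rows[i][-1]
    return x, z[-1]


x, z_max = simplex_max([6, 4], [[2, 3], [2, 1]], [120, 60])
print([str(v) for v in x], z_max)
```

For the packaging problem the routine passes through the same two pivots as the hand calculation (x1 enters for S4, then x2 enters for S3) and stops at x1 = 15, x2 = 30 with Zmax = 210.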
Step 12: Check the values in the objective function row. If there are negative values, the solution is not an optimal one; go to step 5. Otherwise, if all the values are non-negative, optimality is reached. The sign of the objective function value itself is not considered in this check. Write down the values of x1, x2, ..., xi and calculate the objective function for maximization or minimization.
Note:
(i) If there are no x1, x2 variables in the final iteration table, the values of x1 and x2 are zero.
(ii) Neglect the sign for objective function value in the final iteration table.
Figure 5.1: Solving LPP using Computer with TORA (Input Screen )
Click Solve Menu, and select Solve Problem → Algebraic → Iterations → All-Slack Starting Solution. Now, click Go To Output screen, then the first iteration table will be displayed. To select the entering variable, click a non-basic variable (if correct, the column turns green). Similarly, select the leaving variable (if correct, the row turns red), Figure 5.2.
Then click Next Iteration button to display the next iteration table as shown in Figure 5.3.
Again click next iteration button to get the third and final iteration table. A pop-up menu also indicates that the solution has reached the optimal level. Now we can notice that all the values in the objective function Zmax row are non-negative which indicates that the solution is optimal. The final Iteration Table is shown in Figure 5.4.
From the final Iteration Table, the values of X1, X2 and Zmax are taken from the corresponding entries in the solution column (last column) of the simplex table, i.e.,
Zmax = 210.00
X1 = 15.00
X2 = 30.00
Example 1: Solve the LP problem using Simplex method. Determine the following:
(a) What is the optimal solution?
(b) What is the value of the objective function?
(c) Which constraint has excess resources and how much?
Zmax = 5x1 + 6x2
Subject to constraints,
2x1 + x2 ≤ 2000 ...(i)
x1 ≤ 800 ...(ii)
x2 ≤ 200 ...(iii)
where x1, x2 ≥ 0
Solution: Converting the inequality constraints by introducing the slack variables,
Zmax = 5x1 + 6x2 + 0S3 + 0S4 + 0S5
2x1 + x2 + S3 = 2000
x1 + S4 = 800
x2 + S5 = 200
Equate x1 and x2 to zero to find the initial basic solution:
2(0) + 0 + S3 = 2000
0 + S4 = 800
0 + S5 = 200
The initial basic solution is,
S3 = 2000
S4 = 800
S5 = 200
Establish a simplex table to represent the co-efficients of the variables for optimal computation, as shown in Table 5.6.
Table 5.6: Simplex Table

Iteration   Basic      Solution   X1    X2    S3    S4    S5    Min      Equation
Number      Variable   Value                                    Ratio
0           S3         2000       2     1     1     0     0     2000
            S4         800        1     0     0     1     0     ∞
     Kr     S5         200        0     1     0     0     1     200
            -Z         0          -5    -6    0     0     0
1           S3         1800       2     0     1     0     -1    900      S3 - Pe
     Kr     S4         800        1     0     0     1     0     800      S4
     Pe     X2         200        0     1     0     0     1     ∞        S5
            -Z         1200       -5    0     0     0     6              Z + 6Pe
2           S3         200        0     0     1     -2    -1             S3 - 2Pe
     Pe     X1         800        1     0     0     1     0              S4
            X2         200        0     1     0     0     1              X2
            -Zj        5200       0     0     0     5     6              Z + 5Pe
In the final table, all the values of Zj are ≥ 0, hence optimality is reached. The optimum solution is,
(a) x1 = 800, x2 = 200
(b) Zmax = 5(800) + 6(200) = 5200
(c) In the final iteration of Table 5.6, slack variable S3 represents the first constraint; therefore this constraint has excess unused resources of 200 units.
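The unused amount of each resource can be cross-checked by substituting the final-table values x1 = 800 and x2 = 200 into the three constraints. A small sketch follows; the helper name slacks and the (coefficients, rhs) storage are assumptions made for illustration.

```python
def slacks(x, constraints):
    """Unused amount (rhs minus usage) of each <= constraint at the point x."""
    return [rhs - sum(a * v for a, v in zip(coeffs, x))
            for coeffs, rhs in constraints]


# Example 1: 2x1 + x2 <= 2000, x1 <= 800, x2 <= 200.
example_constraints = [((2, 1), 2000), ((1, 0), 800), ((0, 1), 200)]
unused = slacks((800, 200), example_constraints)
print(unused)  # [200, 0, 0]
```

Only the first constraint has a non-zero slack, which agrees with S3 = 200 in the final iteration.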
Solution: Introduce slack, surplus and auxiliary (artificial) variables to represent the problem in the standard form.
Constraint 4x1 + x2 = 4 is introduced by adding an artificial variable a1, i.e.,
4x1 + x2 + a1 = 4
Constraint 5x1 + 3x2 ≥ 7 is converted by subtracting a surplus variable S1 and adding an auxiliary variable a2:
5x1 + 3x2 - S1 + a2 = 7
Constraint 3x1 + 2x2 ≤ 6 is included with a slack variable S2:
3x1 + 2x2 + S2 = 6
The objective must also be altered if auxiliary variables exist. If the objective function is minimization, the co-efficient of each auxiliary variable is +M (and -M in case of maximization). The objective function here is minimization,
Minimize Z = 3x1 + x2 + 0S1 + 0S2 + Ma1 + Ma2
Zmin = 3x1 + x2 + Ma1 + Ma2
The initial feasible solution is (put x1, x2, S1 = 0)
a1 = 4
a2 = 7
S2 = 6
Establish a table as shown below and solve:
Table 5.7: Simplex Table

Iteration   Basic       Solution    X1          X2              S1      S2    Min      Equation
Number      Variables   Value                                                 Ratio
0    Kr     a1          4           4           1               0       0     1
            a2          7           5           3               -1      0     7/5
            S2          6           3           2               0       1     2
            Z1          -11M        -9M + 3     -4M + 1         M       0
1    Pe     X1          1           1           1/4             0       0     4        a1 / 4
     Kr     a2          2           0           7/4             -1      0     8/7      a2 - 5Pe
            S2          3           0           5/4             0       1     12/5     S2 - 3Pe
            Z1          -2M - 3     0           -7M/4 + 1/4     M       0              Z1 + (9M - 3)Pe
2           x1          5/7         1           0               1/7     0              X1 - Pe/4
     Pe     x2          8/7         0           1               -4/7    0
            S2          22/14       0           0               10/14   1
            Z1          -23/7       0           0               1/7     0              Z1 + (7M/4 - 1/4)Pe

(The columns of the artificial variables a1 and a2 are not shown.)
The solution is,
x1 = 5/7 or 0.71
x2 = 8/7 or 1.14
Zmin = 3 × 5/7 + 8/7 = 23/7 or 3.29
Check Your Progress 5.1
1. What are the different types of additional variables used in simplex method?
2. How will you introduce auxiliary variables in solving an LP problem?
Notes:
(a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Solution: Converting the inequality constraints by introducing the slack variables,
Maximize Z = 2x1 + x2
Subject to constraints,
4x1 + 3x2 + S3 = 12
4x1 + x2 + S4 = 8
4x1 - x2 + S5 = 8
With x1 = 0 and x2 = 0,
S3 = 12
S4 = 8
S5 = 8

Table 5.8: Illustrating Degeneracy

Iteration   Basic       Solution   X1    X2    S3    S4    S5    Minimum   Equation
Number      Variables   Value                                    Ratio
0           S3          12         4     3     1     0     0     3
            S4          8          4     1     0     1     0     2
            S5          8          4     -1    0     0     1     2
            -Z          0          -2    -1    0     0     0
After entering all the values in the first iteration table, the key column is the one with -2, and the corresponding variable is x1. In identifying the key row, there is a tie between row S4 and row S5, which have the same minimum ratio of 2; this means degeneracy in the solution. To break the tie, consider the column to the right of the key column and divide its values in the tied rows by the corresponding key column values. Here we consider column x2, whose values in the tied rows are 1 and -1. Dividing these by the key column values gives 1/4 and -1/4, i.e., 0.25 and -0.25. In selecting the key row, the minimum positive value is always chosen, i.e., row S4. Now S4 is the leaving variable and x1 is the entering variable of the next iteration table. The problem is solved using a computer and the solution is given in Figure 5.5.
Figure 5.5: LPP Solved Using Computer with TORA (Output Screen)
Procedure
Step 1: Convert the objective function, if maximization in the primal, into minimization in the dual and vice versa. Write the equation considering the transpose of the RHS of the constraints.
Step 2: The number of variables in the primal will be the number of constraints in the dual and vice versa.
Step 3: The co-efficients in the objective function of the primal will be the RHS of the constraints in the dual and vice versa.
Step 4: In forming the constraints for the dual, consider the transpose of the body matrix of the primal problem.
Note: Constraint inequality signs are reversed.
Example 4: Construct the dual to the primal problem
Maximize Z = 6x1 + 10x2
Subject to constraints,
2x1 + 8x2 ≤ 60
3x1 + 5x2 ≤ 45
5x1 - 6x2 ≤ 10
x2 ≤ 40
where x1, x2 ≥ 0
Solution:
Minimize W = 60y1 + 45y2 + 10y3 + 40y4
Subject to constraints,
2y1 + 3y2 + 5y3 + 0y4 ≥ 6
8y1 + 5y2 - 6y3 + y4 ≥ 10
where y1, y2, y3, y4 ≥ 0
Example 5: Construct a dual for the following primal
Minimize Z = 6x1 - 4x2 + 4x3
Subject to constraints,
6x1 - 10x2 + 4x3 ≥ 14 ...(i)
6x1 + 2x2 + 6x3 ≥ 10 ...(ii)
7x1 - 2x2 + 5x3 ≤ 20 ...(iii)
x1 - 4x2 + 5x3 ≥ 3 ...(iv)
4x1 + 7x2 - 4x3 ≥ 20 ...(v)
where x1, x2, x3 ≥ 0
Solution: Convert 'less than' constraints into 'greater than' type by multiplying by (-1) on both sides (here, constraint (iii)).
6x1 - 10x2 + 4x3 ≥ 14
6x1 + 2x2 + 6x3 ≥ 10
-7x1 + 2x2 - 5x3 ≥ -20
x1 - 4x2 + 5x3 ≥ 3
4x1 + 7x2 - 4x3 ≥ 20
The dual for the primal problem is,
Maximize W = 14y1 + 10y2 - 20y3 + 3y4 + 20y5
Subject to constraints,
6y1 + 6y2 - 7y3 + y4 + 4y5 ≤ 6
-10y1 + 2y2 + 2y3 - 4y4 + 7y5 ≤ -4
4y1 + 6y2 - 5y3 + 5y4 - 4y5 ≤ 4
where y1, y2, y3, y4 and y5 ≥ 0
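Steps 1 to 4 of the procedure amount to swapping the objective co-efficients with the right-hand sides and transposing the body matrix. The sketch below applies this to the primal of Example 4; it assumes a maximization primal whose constraints are all of the "≤" type, and the function name dual_of_max_primal is invented for the illustration.

```python
def dual_of_max_primal(c, A, b):
    """Dual of: maximise c.x subject to A x <= b, x >= 0.

    Returns (objective, matrix, rhs) of: minimise b.y subject to A^T y >= c, y >= 0.
    """
    A_transposed = [list(column) for column in zip(*A)]   # Step 4: transpose the body matrix
    return list(b), A_transposed, list(c)                 # Steps 1 and 3: swap RHS and objective


# Primal of Example 4.
c = [6, 10]
A = [[2, 8], [3, 5], [5, -6], [0, 1]]
b = [60, 45, 10, 40]

dual_objective, dual_matrix, dual_rhs = dual_of_max_primal(c, A, b)
print("Minimize W with co-efficients", dual_objective)
for row, rhs in zip(dual_matrix, dual_rhs):
    print(row, ">=", rhs)
```

The primal has two variables and four constraints, so the dual has four variables and two constraints, as Step 2 requires.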
How will a change in an objective function co-efficient affect the optimal solution? How will a change in a right-hand side value for a constraint affect the optimal solution?
For example, a company produces two products x1 and x2 with the use of three different materials 1, 2 and 3. The availability of materials 1, 2 and 3 is 175, 50 and 150 respectively. The profit per unit of product x1 is Rs. 40 and that of x2 is Rs. 30. The raw material requirements for the products are shown by the constraints given below.
Zmax = 40x1 + 30x2
Subject to constraints
4x1 + 5x2 ≤ 175 ...(i)
2x2 ≤ 50 ...(ii)
6x1 + 3x2 ≤ 150 ...(iii)
where x1, x2 ≥ 0
The problem is solved using TORA software and the output screen showing sensitivity analysis is given in Table 5.11. The optimal solution is
x1 = 12.50
x2 = 25.00
Zmax = 40 × 12.50 + 30 × 25.00 = Rs. 1250.00
Referring to the current objective co-efficients, if the objective function co-efficient of x1 decreases by up to 16 (down to the Min. obj. co-efficient of 24) or increases by up to 20 (up to the Max. obj. co-efficient of 60), there will not be any change in the optimal values of x1 = 12.50 and x2 = 25.00. But there will be a change in the value of the optimal solution, i.e. Zmax.
Note: This applies only when there is a change in any one of the co-efficients of the variables, i.e., x1 or x2. For simultaneous changes in the values of the co-efficients, the 100 Percent Rule for objective function co-efficients must be applied.
For x1,
Allowable decrease = Current value − Min. Obj. co-efficient = 40 − 24 = Rs. 16.00 ------------------ (i)
Allowable increase = Max. Obj. co-efficient − Current value = 60 − 40 = Rs. 20.00 ---------------- (ii)
Similarly, for x2,
Allowable decrease = Rs. 10.00 ---------------- (iii)
Allowable increase = Rs. 20.00 --------------- (iv)
For example, if the co-efficient of x1 is increased to 48, then the increase is 48 − 40 = Rs. 8.00. From (ii), the allowable increase is 20, thus the increase in the x1 co-efficient is 8/20 = 0.40 or 40%.
Similarly, if the co-efficient of x2 is decreased to 27, then the decrease is 30 − 27 = Rs. 3.00. From (iii), the allowable decrease is 10, thus the decrease in the x2 co-efficient is 3/10 = 0.30 or 30%.
Therefore, the percentage of increase in x1 and the percentage of decrease in x2 are 40 and 30 respectively, i.e. 40% + 30% = 70%.
For all the objective function co-efficients that are changed, sum the percentages of the allowable increase and allowable decrease. If the sum of the percentages is less than or equal to 100%, the optimal solution does not change, i.e., the x1 and x2 values will not change. But Zmax will change, i.e.,
= 48(12.50) + 27(25) = Rs. 1275.00
If the sum of the percentages of increase and decrease is greater than 100%, a different optimal solution exists. A revised problem must be solved in order to determine the new optimal values.

Change in the right-hand side constraints values and effect on optimal solution
Suppose an additional 40 kgs of material 3 is available, so that the right-hand side of the constraint increases from 150 to 190 kgs. Now, if the problem is solved, we get the optimal values as
x1 = 23.61, x2 = 16.11 and Zmax = 1427.78
From this, we can infer that an additional resource of 40 kgs increases the profit by
= 1427.78 − 1250 = Rs. 177.78
Therefore, for one kg or one unit increase, the profit will increase by
= 177.78 / 40 = Rs. 4.44
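The 100 Percent Rule arithmetic above can be sketched in a few lines. This is only an illustration of the calculation in the text; the function name percent_of_allowable and the tuple layout are assumptions, and the allowable ranges are the ones quoted in (i) to (iv).

```python
def percent_of_allowable(current, new, allowable_increase, allowable_decrease):
    """Return the change as a fraction of the allowable change in its direction."""
    change = new - current
    if change >= 0:
        return change / allowable_increase
    return -change / allowable_decrease


# (current co-efficient, new co-efficient, allowable increase, allowable decrease)
changes = [
    (40, 48, 20, 16),  # x1 raised from 40 to 48
    (30, 27, 20, 10),  # x2 lowered from 30 to 27
]

total = sum(percent_of_allowable(*c) for c in changes)   # 0.40 + 0.30
same_optimal_mix = total <= 1.0                          # rule satisfied?
new_profit = 48 * 12.50 + 27 * 25.00                     # Zmax at unchanged x1, x2

print(round(total * 100), same_optimal_mix, new_profit)
```

If the total exceeded 100%, the rule would give no conclusion and the revised problem would have to be solved again.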
Dual price is the improvement in the value of the optimal solution per unit increase in the right-hand side of a constraint. Hence, the dual price of material 3 is Rs 4.44 per kg.
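The dual price of material 3 can be reproduced numerically. The sketch below assumes, as the worked figures imply, that constraints (i) and (iii) are the binding ones both before and after the increase, so the optimum is the solution of those two equations. The helper names solve_binding and profit are hypothetical.

```python
def solve_binding(rhs1, rhs3):
    """Solve 4*x1 + 5*x2 = rhs1 and 6*x1 + 3*x2 = rhs3 by Cramer's rule."""
    det = 4 * 3 - 5 * 6
    x1 = (rhs1 * 3 - 5 * rhs3) / det
    x2 = (4 * rhs3 - 6 * rhs1) / det
    return x1, x2


def profit(x1, x2):
    """Objective function Z = 40*x1 + 30*x2."""
    return 40 * x1 + 30 * x2


base = profit(*solve_binding(175, 150))   # original availability of material 3
more = profit(*solve_binding(175, 190))   # 40 kgs more of material 3
dual_price = (more - base) / (190 - 150)

print(round(base, 2), round(more, 2), round(dual_price, 2))
```

The assumption about the binding constraints holds only inside the RHS range reported by the sensitivity analysis; outside it the problem must be re-solved.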
An increase in material 2 will simply increase the unused material 2 rather than increase the objective function. The RHS constraint values (the resources) cannot be increased without limit at the same dual price. If a limit is exceeded, there will be a change in the optimal values. The limit values are given in Table 2.10, i.e., the Min RHS and Max RHS values. For example, for material 3, the dual price of Rs. 4.44 applies only within the range 150 kgs to 262.50 kgs. Where there are simultaneous changes in more than one constraint RHS value, the 100 per cent Rule must be applied.

Reduced Cost
Reduced cost per unit of activity = (Cost of consumed resources per unit of activity) − (Profit per unit of activity)
If the activity's reduced cost per unit is positive, then its unit cost of consumed resources is higher than its unit profit, and the activity should be discarded. This means that the value of its associated variable in the optimum solution should be zero. Alternatively, an activity that is economically attractive will have a zero reduced cost in the optimum solution, signifying equilibrium between the output (unit profit) and the input (unit cost of consumed resources). In the problem, both x1 and x2 assume positive values in the optimum solution and hence have zero reduced cost.
Considering one more variable x3 with profit Rs. 50,
Zmax = 40x1 + 30x2 + 50x3
Subject to constraints,
4x1 + 5x2 + 6x3 ≤ 175 ....................(i)
2x2 + 1x3 ≤ 50 ....................(ii)
6x1 + 3x2 + 3x3 ≤ 150 ....................(iii)
where x1, x2, x3 ≥ 0
The sensitivity analysis of the problem is shown in the computer output below in Table 5.12.
Table 5.12: Sensitivity Analysis
The reduced cost indicates how much the objective function co-efficient for a particular variable would have to improve before that decision variable assumes a positive value in the optimal solution. The reduced cost of Rs. 12.50 for decision variable x2 tells us that its profit contribution would have to increase to at least 30 + 12.50 = 42.50 before x2 could assume a positive value in the optimal solution.
Check Your Progress 5.2
1. What is Duality concept?
2. What is meant by degeneracy in Linear Programming?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
5.14 KEYWORDS
Slack Simplex method Surplus
Variable Solution
Briefly comment on the following statements:
(a) Two or more entries in the ratio column.
(b) LP is a planning technique.
(c) LP techniques are used to optimise the resources for best result.
(d) LP is a part of management science.
(e) Algebraic technique is used to solve large problems using simplex method.
10. In sensitivity analysis, explain
i. The effect of change of objective function coefficients
ii. The effect of change in the right hand side of constraints
Exercise Problems
1. A company manufactures three products A, B and C, which require three raw materials I, II and III. The table given below shows the amount of raw materials required per kg of each product. The resource availability per day and the profit contribution for each product is also given.
A 4 5 2 9
B 1 6 4 10
C 6 8 1 6
i. Formulate the problem as a linear programming problem.
ii. Solve the problem and determine the optimal product mix.
2. A metal fabricator manufactures three types of windows. Each of the windows needs four processes. The time taken on the various machines differs due to the size of the windows. The time taken and available hours are given in the table below:
Window Type             Cutting   Heat Treating   Forging   Grinding
A                       5         7               1         4
B                       7         4               4         8
C                       4         8               6         2
Available time (Hrs)    20        24              28        22
The profit contributions for windows A, B and C are Rs. 3.00, Rs. 4.00 and Rs. 5.00 respectively.
a. Formulate the problem.
b. Solve the problem using simplex method to maximize the profit.
c. Determine the excess time available in each process and by how much.
3. Solve the following LPP using simplex method.
Maximize Z = 2x1 + x2
Subject to constraints,
4x1 + 3x2 ≤ 12 .....................(i)
4x1 + x2 ≤ 8 .....................(ii)
4x1 − x2 ≤ 8 .....................(iii)
where x1, x2 ≥ 0
4. Solve the following LPP:
Zmax = 20x1 + 28x2 + 23x3
Subject to constraints,
4x1 + 4x2 ≤ 75 ....................(i)
2x1 + x2 + 2x3 ≤ 100 ....................(ii)
3x1 + 2x2 + x3 ≤ 50 ....................(iii)
where x1, x2, x3 ≥ 0
5. Three high precision products are manufactured by a Hi-Tech Machine Tools Company. All the products must undergo process through three machining centers A, B and C. The machine hours required per unit are,
Machining Center    Product I    Product II    Product III
A                   2            4             6
B                   3            6             2
C                   3            2             1
a. Formulate the problem as a LPP.
b. Solve the problem to determine the optimal solution. What is the number of units to be made of each product?
c. Does machining center C have any extra time to spare? If so, how much spare time is available?
d. If an additional 10 machine hours are available with machining center A, then what is the optimal product mix? What is the change in the value of profit?
6. Raghu Constructions is considering four projects over the next 3 years. The expected returns of each project and the cash outlays for these projects are listed in the table given. All values are in lakhs of rupees.
                              Cash outlay (lakh Rs.)
Project                       Year 1    Year 2    Year 3    Return
1                             12.32     11.10     9.50      42.25
2                             11.15     9.75      8.11      31.20
3                             7.65      5.50      4.75      15.10
4                             10.71     10.31     7.77      12.05
Available funds (lakh Rs.)    110.00    40.00     35.00
Raghu has to decide which construction projects to undertake. Ignore the time value of money. As a consultant, what suggestion would you give Raghu in deciding which projects to select? Determine the solution using TORA.
7. Solve the following LP Problem using Big M Method.
Minimize Z = 2x1 + 9x2 + x3
Subject to constraints,
x1 + 4x2 + 2x3 ≥ 5 .......................(i)
3x1 + x2 + 2x3 ≥ 4 .......................(ii)
where x1, x2, x3 ≥ 0
8. Solve the following LPP
Zmin = 4x1 + x2
9. Solve the following LPP. Find whether multiple or alternate solution exists
Maximize Z = 2x1 + 4x2 + 6x3
Subject to constraints,
10x1 + 4x2 + 6x3 ≤ 150 ..................(i)
8x1 + 6x2 + 2x3 ≤ 100 ..................(ii)
x1 + 2x2 + x3 ≤ 120 ....................(iii)
where x1, x2, x3 ≥ 0
10. Write the dual of the following LP problem Minimize Z = 6x1 4x2 + 4x3 Subject to constraints, 3x1 + 10x2 + 4x3 15 12x1 + 2x2 + 5x3 4 5x1 4x2 2x3 10 x1 3x2 + 6x3 3 4x1 + 9x2 4x3 2 where 11. x1, x2, x3 0 Obtain the dual of the following linear programming problem Maximize Z , = 4x1 + 9x2 + 6x3 Subject to constraints, 10x1 + 10x2 2x3 6 -5x1 + 5x3 + 6x3 8 14x1 2x2 5x3 20 5x1 4x2 +7x3 3 8x1+ 10x2 5x3 = 2 where x1, x2, x3 0 .......................(i) .......................(ii) ........................(iii) ........................(iv) .......................(v) .......................(i) ...................(ii) ...................(iii) ...................(iv) ...................(v)
ANSWERS TO QUESTIONS FOR DISCUSSION
(b) True
(c) False
Bertsimas, D., and J. Tsitsiklis, Introduction to Linear Optimization, Belmont, Mass.: Athena Scientific, 1997.
Unit-II
LESSON
6
TRANSPORTATION MODEL
CONTENTS
6.0 Aims and Objectives
6.1 Introduction
6.2 Mathematical Formulation
6.3 Network Representation of Transportation Model
6.4 General Representation of Transportation Model
6.5 Use of Linear Programming to Solve Transportation Problem
6.6 Formulation of LP model
6.7 Solving Transportation Problem Using Computer
6.8 Balanced Transportation Problem
6.9 Unbalanced Transportation Problem
6.10 Procedure to Solve Transportation Problem
6.11 Degeneracy in Transportation Problems
6.12 Maximization Transportation Problem
6.13 Prohibited Routes Problem
6.14 Transhipment Problem
6.15 Let us Sum Up
6.16 Lesson-end Activity
6.17 Keywords
6.18 Questions for Discussion
6.19 Terminal Questions
6.20 Model Answers to Questions for Discussion
6.21 Suggested Readings
6.1 INTRODUCTION
The transportation problem is a particular class of linear programming, which is associated with day-to-day activities in our real life and mainly deals with logistics. It helps in solving problems on distribution and transportation of resources from one place to another. The goods are transported from a set of sources (e.g., factories) to a set of destinations (e.g., warehouses) to meet specific requirements. In other words, transportation problems deal with the transportation of a product manufactured at different plants (supply origins) to a number of different warehouses (demand destinations).
The objective is to satisfy the demand at the destinations, within the supply constraints, at the minimum possible transportation cost. To achieve this objective, we must know the quantity of available supplies and the quantities demanded. In addition, we must also know the locations, to find the cost of transporting one unit of commodity from the place of origin to the destination.
The model is useful for making strategic decisions involved in selecting optimum transportation routes so as to allocate the production of various plants to several warehouses or distribution centers. The transportation model can also be used in making location decisions. The model helps in locating a new facility, a manufacturing plant or an office when two or more locations are under consideration. The total transportation cost, distribution cost or shipping cost and production costs are to be minimized by applying the model.
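Minimize Z = Σ(i=1 to m) Σ(j=1 to n) cij xij
Subject to constraints,
Σ(j=1 to n) xij = ai, for i = 1, 2, ..., m
Σ(i=1 to m) xij = bj, for j = 1, 2, ..., n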
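[Figure: Network representation of the transportation model, with supply nodes S1, S2, ..., Sm on one side, demand nodes D1, D2, ..., Dn on the other, and each arc labelled cij : xij (for example, cmn : xmn on the arc from source m to destination n)]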
where m is the number of sources, n is the number of destinations, Sm is the supply at source m, Dn is the demand at destination n, cij is the cost of transportation per unit from source i to destination j, and xij is the number of units to be shipped from source i to destination j. The objective is to minimize the total transportation cost by determining the unknowns xij, i.e., the number of units to be shipped from the sources to the destinations, while satisfying all the supply and demand requirements.
                     Destination
Source     1      2      ...    n      Supply
1          C11    C12    ...    C1n    A1
2          C21    C22    ...    C2n    A2
.          .      .             .      .
.          .      .             .      .
m          Cm1    Cm2    ...    Cmn    Am
Demand     B1     B2     ...    Bn

Σ(i=1 to m) Ai = Σ(j=1 to n) Bj
If the total supply is equal to total demand, then the given transportation problem is a balanced one.
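[Figure 6.2: Network representation of the GM Textiles transportation model, showing the supply points, the destinations with their demands, and the transportation cost on each route]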
The network diagram shown in Figure 6.2 represents the transportation model of M/s GM Textiles units located at Chennai, Coimbatore and Madurai. GM Textiles produces ready-made garments at these locations with capacities 6000, 5000 and 4000 units per week at Chennai, Coimbatore and Madurai respectively. The textile unit distributes its ready-made garments through four of its wholesale distributors situated at four locations Bangalore, Hyderabad, Cochin and Goa. The weekly demand of the distributors are 5000, 4000, 2000 and 4000 units for Bangalore, Hyderabad, Cochin and Goa respectively. The cost of transportation per unit varies between different supply points and destination points. The transportation costs are given in the network diagram. The management of GM Textiles would like to determine the number of units to be shipped from each textile unit to satisfy the demand of each wholesale distributor. The supply, demand and transportation cost are as follows:
Table 6.2: Production Capacities
Supply    Textile Unit    Weekly Production (Units)
1         Chennai         6000
2         Coimbatore      5000
3         Madurai         4000
A linear programming model can be used to solve the transportation problem. Let,
X11 be the number of units shipped from source 1 (Chennai) to destination 1 (Bangalore).
X12 be the number of units shipped from source 1 (Chennai) to destination 2 (Hyderabad).
X13 be the number of units shipped from source 1 (Chennai) to destination 3 (Cochin).
X14 be the number of units shipped from source 1 (Chennai) to destination 4 (Goa), and so on.
Xij = number of units shipped from source i to destination j, where i = 1, 2, ..., m and j = 1, 2, ..., n.
Check Your Progress 6.1
1 2.
Write your answer in the space given below. Please go through the lesson sub-head thoroughly; you will get your answers in it. This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
x11 + x12 + x13 + x14 ≤ 6000 ......................(i)
x21 + x22 + x23 + x24 ≤ 5000
x31 + x32 + x33 + x34 ≤ 4000
x11 + x21 + x31 = 5000
x12 + x22 + x32 = 4000
x13 + x23 + x33 = 2000
x14 + x24 + x34 = 4000
where xij ≥ 0 for i = 1, 2, 3 and j = 1, 2, 3, 4.
Example 1: Consider the following transportation problem (Table 6.5) and develop a linear programming (LP) model.
Table 6.5: Transportation Problem
                Destination
Source     1      2      3      Supply
1          15     20     30     350
2          10     9      15     200
3          14     12     18     400
Demand     250    400    300
Solution: Let xij be the number of units to be transported from source i to destination j, where i = 1, 2, ..., m and j = 1, 2, ..., n. The linear programming model is
Minimize Z = 15x11 + 20x12 + 30x13 + 10x21 + 9x22 + 15x23 + 14x31 + 12x32 + 18x33
Subject to constraints,
x11 + x12 + x13 ≤ 350 ..................(i)
x21 + x22 + x23 ≤ 200 ...................(ii)
x31 + x32 + x33 ≤ 400 ...................(iii)
x11 + x21 + x31 = 250 ...................(iv)
x12 + x22 + x32 = 400 ...................(v)
x13 + x23 + x33 = 300 ...................(vi)
xij ≥ 0 for all i and j.
In the above LP problem, there are m × n = 3 × 3 = 9 decision variables and m + n = 3 + 3 = 6 constraints.
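Writing the model by hand becomes error-prone as the table grows, so the formulation can be generated from the table. The sketch below is illustrative only: it builds the objective and the constraints of Example 1 as text, and the names COST, SUPPLY, DEMAND and build_lp are assumptions rather than part of any solver.

```python
# Data of Table 6.5: COST[i][j] is the unit cost from source i+1 to destination j+1.
COST = [[15, 20, 30],
        [10, 9, 15],
        [14, 12, 18]]
SUPPLY = [350, 200, 400]
DEMAND = [250, 400, 300]


def build_lp(cost, supply, demand):
    """Return variable names, objective text and constraint texts."""
    m, n = len(supply), len(demand)
    names = [f"x{i + 1}{j + 1}" for i in range(m) for j in range(n)]
    objective = " + ".join(f"{cost[i][j]}x{i + 1}{j + 1}"
                           for i in range(m) for j in range(n))
    # One supply constraint per source (row sums).
    supply_rows = [" + ".join(f"x{i + 1}{j + 1}" for j in range(n))
                   + f" <= {supply[i]}" for i in range(m)]
    # One demand constraint per destination (column sums).
    demand_cols = [" + ".join(f"x{i + 1}{j + 1}" for i in range(m))
                   + f" = {demand[j]}" for j in range(n)]
    return names, objective, supply_rows + demand_cols


names, objective, constraints = build_lp(COST, SUPPLY, DEMAND)
print("Minimize Z =", objective)
for line in constraints:
    print(line)
```

The counts len(names) and len(constraints) give the m × n decision variables and m + n constraints mentioned above.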
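Balanced transportation problem: Σ(i=1 to m) ai = Σ(j=1 to n) bj
Unbalanced transportation problem: Σ(i=1 to m) ai ≠ Σ(j=1 to n) bj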
In real-life, supply and demand requirements will rarely be equal. This is because of variation in production from the supplier end, and variations in forecast from the customer
end. Supply variations may be because of shortage of raw materials, labour problems, improper planning and scheduling. Demand variations may be because of change in customer preference, change in prices and introduction of new products by competitors. These unbalanced problems can be easily solved by introducing dummy sources and dummy destinations. If the total supply is greater than the total demand, a dummy destination (dummy column) with demand equal to the supply surplus is added. If the total demand is greater than the total supply, a dummy source (dummy row) with supply equal to the demand surplus is added. The unit transportation cost for the dummy column and dummy row are assigned zero values, because no shipment is actually made in case of a dummy source and dummy destination. Example 2: Check whether the given transportation problem shown in Table 6.6 is a balanced one. If not, convert the unbalanced problem into a balanced transportation problem.
Table 6.6: Transportation Model with Supply Exceeding Demand
                Destination
Source     1      2      3      Supply
1          25     45     10     200
2          30     65     15     100
3          15     40     55     400
Demand     200    100    300
Solution: For the given problem, the total supply is not equal to the total demand.
Σ(i=1 to 3) ai ≠ Σ(j=1 to 3) bj
since Σ(i=1 to 3) ai = 700 and Σ(j=1 to 3) bj = 600
The given problem is an unbalanced transportation problem. To convert the unbalanced transportation problem into a balanced problem, add a dummy destination (dummy column). i.e., the demand of the dummy destination is equal to,
Σ(i=1 to 3) ai − Σ(j=1 to 3) bj = 700 − 600 = 100
Thus, a dummy destination is added to the table, with a demand of 100 units. The modified table is shown in Table 6.7 which has been converted into a balanced transportation table. The unit costs of transportation of dummy destinations are assigned as zero.
Table 6.7: Dummy Destination Added
                Destination
Source     1      2      3      4      Supply
1          25     45     10     0      200
2          30     65     15     0      100
3          15     40     55     0      400
Demand     200    100    300    100    700/700
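The balancing step can be sketched as a small helper. This is a minimal illustration, assuming the cost table is held as a list of rows; the function name balance is hypothetical. It adds a zero-cost dummy destination when supply exceeds demand and a zero-cost dummy source when demand exceeds supply.

```python
def balance(cost, supply, demand):
    """Return a balanced copy of (cost, supply, demand) using a zero-cost dummy."""
    cost = [row[:] for row in cost]          # copy so the inputs stay untouched
    supply, demand = supply[:], demand[:]
    total_supply, total_demand = sum(supply), sum(demand)
    if total_supply > total_demand:
        # Dummy destination (extra column) absorbs the surplus supply.
        for row in cost:
            row.append(0)
        demand.append(total_supply - total_demand)
    elif total_demand > total_supply:
        # Dummy source (extra row) covers the unmet demand.
        cost.append([0] * len(demand))
        supply.append(total_demand - total_supply)
    return cost, supply, demand


# Data of Table 6.6 (supply 700, demand 600).
cost, supply, demand = balance(
    [[25, 45, 10], [30, 65, 15], [15, 40, 55]],
    [200, 100, 400],
    [200, 100, 300],
)
print(demand)   # [200, 100, 300, 100]
```

The same helper covers the opposite case, where a dummy row is appended instead of a dummy column.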
Similarly, if Σ(j=1 to n) bj > Σ(i=1 to m) ai, a dummy source (dummy row) is added.
Demand Greater than Supply Example 3: Convert the transportation problem shown in Table 6.8 into a balanced problem.
Table 6.8: Demand Exceeding Supply
                Destination
Source     1      2      3      4      Supply
1          10     16     9      12     200
2          12     12     13     5      300
3          14     8      13     4      300
Demand     100    200    450    250    1000/800

Σ(j=1 to 4) bj > Σ(i=1 to 3) ai
since Σ(i=1 to 3) ai = 800 and Σ(j=1 to 4) bj = 1000
The given problem is an unbalanced one. To convert it into a balanced transportation problem, include a dummy source (dummy row) as shown in Table 6.9
Table 6.9: Balanced TP Model
                Destination
Source     1      2      3      4      Supply
1          10     16     9      12     200
2          12     12     13     5      300
3          14     8      13     4      300
4          0      0      0      0      200
Demand     100    200    450    250    1000/1000
Algorithm for North-West Corner Method (NWC)
(i) Select the North-west (i.e., upper left) corner cell of the table and allocate the maximum possible units between the supply and demand requirements. During allocation, the transportation cost is completely discarded (not taken into consideration).
(ii) Delete that row or column which has no values (fully exhausted) for supply or demand.
(iii) Now, with the new reduced table, again select the North-west corner cell and allocate the available values.
(iv) Repeat steps (ii) and (iii) until all the supply and demand values are zero.
(v) Obtain the initial basic feasible solution.

Algorithm for Least Cost Method (LCM)
(i) Select the smallest transportation cost cell available in the entire table and allocate the supply and demand.
(ii) Delete that row/column which has been exhausted. The deleted row/column must not be considered for further allocation.
(iii) Again select the smallest cost cell in the existing table and allocate. (Note: In case there is more than one smallest cost, select the cell where the maximum allocation can be made.)
(iv) Obtain the initial basic feasible solution.

Algorithm for Vogel's Approximation Method (VAM)
(i) Calculate penalties for each row and column by taking the difference between the smallest cost and the next highest cost available in that row/column. If there are two smallest costs, then the penalty is zero.
(ii) Select the row/column which has the largest penalty and make the allocation in the cell having the least cost in the selected row/column. If two or more equal penalties exist, select the one where the row/column contains the minimum unit cost. If there is again a tie, select the one where the maximum allocation can be made.
(iii) Delete the row/column which has satisfied the supply and demand.
(iv) Repeat steps (i) and (ii) until the entire supply and demands are satisfied.
(v) Obtain the initial basic feasible solution.

Remarks: The initial solution obtained by any of the three methods must satisfy the following conditions:
(a) The solution must be feasible, i.e., the supply and demand constraints must be satisfied (also known as rim conditions).
(b) The number of positive allocations, N, must be equal to m + n − 1, where m is the number of rows and n is the number of columns.
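As an illustration of the Least Cost Method steps and of remark (b), the sketch below applies the method to the balanced data of Table 6.5. It is a minimal sketch, not a reference implementation: the function name least_cost is hypothetical, indices are zero-based, ties are broken by the larger possible allocation and then by table order, and a row and column that are exhausted together are both deleted.

```python
def least_cost(cost, supply, demand):
    """Return {(row, col): units} for a balanced transportation table."""
    supply, demand = supply[:], demand[:]
    rows, cols = set(range(len(supply))), set(range(len(demand)))
    alloc = {}
    while rows and cols:
        # Smallest cost first; among equal costs prefer the larger allocation.
        i, j = min(
            ((r, c) for r in sorted(rows) for c in sorted(cols)),
            key=lambda cell: (cost[cell[0]][cell[1]],
                              -min(supply[cell[0]], demand[cell[1]])),
        )
        qty = min(supply[i], demand[j])
        alloc[(i, j)] = qty
        supply[i] -= qty
        demand[j] -= qty
        if supply[i] == 0:
            rows.discard(i)      # exhausted row is deleted
        if demand[j] == 0:
            cols.discard(j)      # exhausted column is deleted
    return alloc


COST = [[15, 20, 30], [10, 9, 15], [14, 12, 18]]
SUPPLY = [350, 200, 400]
DEMAND = [250, 400, 300]

allocation = least_cost(COST, SUPPLY, DEMAND)
total_cost = sum(COST[i][j] * q for (i, j), q in allocation.items())
non_degenerate = len(allocation) == len(SUPPLY) + len(DEMAND) - 1
print(allocation, total_cost, non_degenerate)
```

When fewer than m + n − 1 cells are allocated, non_degenerate is False and the degeneracy step that follows applies.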
Step 4: Resolving degeneracy
To resolve degeneracy at the initial solution, allocate a small positive quantity ε to one or more unoccupied cells that have the lowest transportation costs, so as to make m + n − 1 allocations (i.e., to satisfy the condition N = m + n − 1). The cell chosen for allocating ε must be in an independent position. In other words, the allocation of ε should avoid a closed loop and should not have a path. The following Table 6.10 shows independent allocations.
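Table 6.10: Independent Allocations
[Table: grid with allocated cells marked *, none of which form a closed loop]
The following Tables 6.10 (a), (b) and (c) show non-independent allocations.
Table 6.10 (a): Non-Independent Allocations
[Table: grid with allocated cells marked *, forming a closed loop]
Table 6.10 (b)
[Table: grid with allocated cells marked *, forming a closed loop]
Table 6.10 (c)
[Table: grid with allocated cells marked *, forming a closed loop]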
Optimal Solution
Step 5: Test for optimality
The solution is tested for optimality using the Modified Distribution (MODI) method (also known as the U-V method). Once an initial solution is obtained, the next step is to test its optimality. An optimal solution is one in which there are no other transportation routes that would reduce the total transportation cost, for which we have to evaluate each unoccupied cell in the table in terms of opportunity cost. In this process, if there is no negative opportunity cost, the solution is an optimal solution.
(i) Row 1, row 2, ..., row i of the cost matrix are assigned the variables U1, U2, ..., Ui and column 1, column 2, ..., column j are assigned the variables V1, V2, ..., Vj respectively.
(ii) Initially, assume any one of the Ui values as zero and compute the values for U1, U2, ..., Ui and V1, V2, ..., Vj by applying the formula for occupied cells.
For occupied cells, Cij + Ui + Vj = 0
(iii) Obtain the opportunity cost of every unoccupied cell by applying the formula for unoccupied cells.
For unoccupied cells, Opportunity Cost = Cij + Ui + Vj
If all the values are > 0, the basic initial feasible solution is optimal. Go to step 7.
If no value is negative but some value is = 0, the solution is optimal and multiple (alternative) optimal solutions exist. Go to step 7.
If any value is < 0, the basic initial feasible solution is not optimal. Go to step 6.
Step 6: Procedure for shifting of allocations
Select the cell which has the most negative value and introduce a positive quantity called θ in that cell. To balance that row, allocate −θ to an occupied cell in that row. Again, to balance that column, put +θ in an occupied cell and similarly −θ in that row. Connecting all the +θ and −θ cells, a closed loop is formed.
Two cases are represented in Tables 6.11(a) and 6.11(b). In Table 6.11(a), if all the θ allocations are joined by horizontal and vertical lines, a closed loop is obtained. The set of cells forming a closed loop is
CL = {(A, 1), (A, 3), (C, 3), (C, 4), (E, 4), (E, 1), (A, 1)}
The loop in Table 6.11(b) is not allowed because the cell (D, 3) appears twice.
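Table 6.11(a): Showing Closed Loop
[Table: grid with the loop cells marked *, joined by horizontal and vertical lines]
Table 6.11(b)
[Table: grid with the loop cells marked *, in which cell (D, 3) is visited twice]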
Conditions for forming a loop
(i) The start and end points of a loop must be the same.
(ii) The lines connecting the cells must be horizontal and vertical.
(iii) The turns must be taken at occupied cells only.
(iv) Take the shortest path possible (for easy calculations).
Remarks on forming a loop
(i) Every loop has an even number of cells and at least four cells.
(ii) Each row or column should have only one + and one − sign.
(iii) A closed loop may or may not be square in shape. It can also be a rectangle or a stepped shape.
(iv) It doesn't matter whether the loop is traced in a clockwise or anticlockwise direction.
Take the smallest allocation among the cells marked −θ as the value of θ, and shift the allocations accordingly by adding this value in the positive cells and subtracting it in the negative cells. This gives a new improved table. Then go to step 5 to test for optimality.
Step 7: Calculate the Total Transportation Cost.
Since all the values are positive, optimality is reached and hence the present allocations are the optimum allocations. Calculate the total transportation cost by summing the product of allocated units and unit costs.
Example 4: The costs of transportation per unit from three sources to four destinations are given in Table 6.12. Obtain the initial basic feasible solutions using the following methods.
(i) North-west corner method
(ii) Least cost method
(iii) Vogel's approximation method
Table 6.12: Transportation Model
                Destination
Source     1      2      3      4      Supply
1          4      2      7      3      250
2          3      7      5      8      450
3          9      4      3      1      500
Demand     200    400    300    300    1200
Solution: The problem given in Table 6.12 is a balanced one, as the total supply is equal to the total demand (1200 units each). The problem can be solved by all the three methods.
North-West Corner Method: In the given matrix, select the North-West corner cell, which is (1,1). The supply and demand values corresponding to cell (1,1) are 250 and 200 respectively. Allocate the maximum possible value to satisfy the demand from the supply. Hence allocate 200 to the cell (1,1) as shown in Table 6.13.

Table 6.13 (allocated units in parentheses)
                Destination
Source     1          2      3      4      Supply
1          4 (200)    2      7      3      250
2          3          7      5      8      450
3          9          4      3      1      500
Demand     200 → 0    400    300    300
Now, delete the exhausted column 1 which gives a new reduced table as shown in Tables 6.14 (a, b, c, d). Again repeat the steps.
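Table 6.14 (a): Exhausted Column 1 Deleted
                Destination
Source     2            3      4      Supply
1          2 (50)       7      3      50 → 0
2          7            5      8      450
3          4            3      1      500
Demand     400 → 350    300    300

Deleting the exhausted row 1 and allocating 350 units to cell (2,2) exhausts column 2 and leaves a supply of 100 at source 2. The next reduced table is:

                Destination
Source     3            4      Supply
2          5 (100)      8      100 → 0
3          3            1      500
Demand     300 → 200    300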
Now only source 3 is left. Allocating to destinations 3 and 4 satisfies the supply of 500. The initial basic feasible solution using North-west corner method is shown in Table 6.15
Table 6.15: Initial Basic Feasible Solution Using NWC Method (allocated units in parentheses)
                Destination
Source     1          2          3          4          Supply
1          4 (200)    2 (50)     7          3          250
2          3          7 (350)    5 (100)    8          450
3          9          4          3 (200)    1 (300)    500
Demand     200        400        300        300

Transportation cost
= (4 × 200) + (2 × 50) + (7 × 350) + (5 × 100) + (3 × 200) + (1 × 300)
= 800 + 100 + 2450 + 500 + 600 + 300
= Rs. 4,750.00
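The North-West Corner steps worked through above can be sketched as follows. This is a minimal illustration on the data of Table 6.12, assuming a balanced problem; the function name north_west_corner is hypothetical and indices are zero-based, so cell (0, 0) is cell (1,1) of the text.

```python
def north_west_corner(supply, demand):
    """Return {(row, col): units} allocated by the North-West Corner rule."""
    supply, demand = supply[:], demand[:]
    i = j = 0
    alloc = {}
    while i < len(supply) and j < len(demand):
        qty = min(supply[i], demand[j])   # maximum possible allocation
        alloc[(i, j)] = qty
        supply[i] -= qty
        demand[j] -= qty
        if supply[i] == 0:
            i += 1                        # row exhausted: move down
        else:
            j += 1                        # column exhausted: move right
    return alloc


COST = [[4, 2, 7, 3], [3, 7, 5, 8], [9, 4, 3, 1]]
allocation = north_west_corner([250, 450, 500], [200, 400, 300, 300])
total_cost = sum(COST[i][j] * q for (i, j), q in allocation.items())
print(allocation)
print(total_cost)   # 4750
```

The unit costs play no part in the allocation itself, which is why this method usually gives a costlier starting solution than the other two.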
Least Cost Method
Select the minimum cost cell from the entire table; the least cost cell is (3,4). The corresponding supply and demand values are 500 and 300 respectively. Allocate the maximum possible units. The allocation is shown in Table 6.16.

Table 6.16: Allocation of Maximum Possible Units
                Destination
Source     1      2      3      4          Supply
1          4      2      7      3          250
2          3      7      5      8          450
3          9      4      3      1 (300)    500 → 200
Demand     200    400    300    300 → 0
From the supply value of 500, the demand value of 300 is satisfied. Subtract 300 from the supply value of 500 and subtract 300 from the demand value of 300. The demand of destination 4 is fully satisfied. Hence, delete column 4; as a result we get the table shown in Table 6.17.

Table 6.17: Exhausted Column 4 Deleted
                Destination
Source     1      2            3      Supply
1          4      2 (250)      7      250 → 0
2          3      7            5      450
3          9      4            3      200
Demand     200    400 → 150    300
Now, again take the minimum cost value available in the existing table and allocate it with a value of 250 in the cell (1,2). The reduced matrix is shown in Table 6.18
Table 6.18: Exhausted Row 1 Deleted
                Destination
Source     1          2      3      Supply
2          3 (200)    7      5      450 → 250
3          9          4      3      200
Demand     200 → 0    150    300
In the reduced Table 6.18, the minimum value 3 exists in cell (2,1) and (3,3), which is a tie. If there is a tie, it is preferable to select a cell where maximum allocation can be made. In this case, the maximum allocation is 200 in both the cells. Choose a cell arbitrarily and allocate. The cell allocated in (2,1) is shown in Table 6.18. The reduced matrix is shown in Table 6.19.
Table 6.19: Reduced Matrix
                Destination
Source     2      3            Supply
2          7      5            250
3          4      3 (200)      200 → 0
Demand     150    300 → 100
Now, deleting the exhausted supply row 3, we get the matrix as shown in Table 6.20.

Table 6.20: Exhausted Row 3 Deleted
                Destination
Source     2          3          Supply
2          7 (150)    5 (100)    250 → 0
Demand     150 → 0    100 → 0
The initial basic feasible solution using least cost method is shown in a single Table 6.21
Table 6.21: Initial Basic Feasible Solution Using LCM Method
                Destination
Source     1          2          3          4          Supply
1          4          2 (250)    7          3          250
2          3 (200)    7 (150)    5 (100)    8          450
3          9          4          3 (200)    1 (300)    500
Demand     200        400        300        300
Transportation Cost = (2 × 250) + (3 × 200) + (7 × 150) + (5 × 100) + (3 × 200) + (1 × 300)
= 500 + 600 + 1050 + 500 + 600 + 300
= Rs. 3550

Vogel's Approximation Method (VAM): The penalties for each row and column are calculated (following the steps of the VAM algorithm given earlier). Choose the row/column which has the maximum penalty for allocation. In this case there are five penalties which have the maximum value 2. Among these, the cell with the least cost lies in row 3, and hence cell (3,4) is selected for allocation. The supply and demand are 500 and 300 respectively, and hence allocate 300 in cell (3,4) as shown in Table 6.22.
Table 6.22: Penalty Calculation for each Row and Column
                Destination
Source     1      2      3      4          Supply       Penalty
1          4      2      7      3          250          (1)
2          3      7      5      8          450          (2)
3          9      4      3      1 (300)    500 → 200    (2)
Demand     200    400    300    300 → 0
Penalty    (1)    (2)    (2)    (2)
Since the demand is satisfied for destination 4, delete column 4 . Now again calculate the penalties for the remaining rows and columns.
Table 6.23: Exhausted Column 4 Deleted
                Destination
Source     1      2            3      Supply      Penalty
1          4      2 (250)      7      250 → 0     (2)
2          3      7            5      450         (2)
3          9      4            3      200         (1)
Demand     200    400 → 150    300
Penalty    (1)    (2)          (2)
In Table 6.23, there are four maximum penalties of value 2. Among them, the least unit transportation cost is 2, in cell (1,2). The cell (1,2) is therefore selected for allocation as shown in Table 6.23. Table 6.24 shows the reduced table after deleting row 1. Here column 1 has the largest penalty, and 200 units are allocated to its least cost cell (2,1), which exhausts column 1.

Table 6.24: Row 1 Deleted
                Destination
Source     1          2      3      Supply       Penalty
2          3 (200)    7      5      450 → 250    (2)
3          9          4      3      200          (1)
Demand     200 → 0    150    300
Penalty    (6)        (3)    (2)
After deleting column 1 we get the table as shown in the Table 6.25 below.
Table 6.25: Column 1 Deleted
                Destination
Source     2          3      Supply      Penalty
2          7          5      250         (2)
3          4 (150)    3      200 → 50    (1)
Demand     150 → 0    300
Penalty    (3)        (2)
The remaining supply is then allocated in column 3: 250 units to cell (2,3) and 50 units to cell (3,3).

Transportation cost
= (2 × 250) + (3 × 200) + (5 × 250) + (4 × 150) + (3 × 50) + (1 × 300)
= 500 + 600 + 1250 + 600 + 150 + 300
= Rs. 3,400.00
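The VAM walk-through above can be sketched in code. This is a minimal sketch under stated assumptions, not a definitive implementation: the names penalty and vogel are hypothetical, indices are zero-based, the problem is assumed balanced, and a line with only one remaining cost is given that cost as its penalty (a convention chosen here, since the text does not cover that case). Ties are broken as in step (ii) of the algorithm: least unit cost, then largest possible allocation.

```python
def penalty(costs):
    """Difference between the two smallest costs of a row or column."""
    s = sorted(costs)
    return s[1] - s[0] if len(s) > 1 else s[0]


def vogel(cost, supply, demand):
    """Return {(row, col): units} allocated by Vogel's Approximation Method."""
    supply, demand = supply[:], demand[:]
    rows, cols = list(range(len(supply))), list(range(len(demand)))
    alloc = {}
    while rows and cols:
        candidates = []   # (penalty, row, col of the least cost cell in that line)
        for i in rows:
            j = min(cols, key=lambda c: cost[i][c])
            candidates.append((penalty([cost[i][c] for c in cols]), i, j))
        for j in cols:
            i = min(rows, key=lambda r: cost[r][j])
            candidates.append((penalty([cost[r][j] for r in rows]), i, j))
        # Largest penalty, then smallest unit cost, then largest allocation.
        _, i, j = max(
            candidates,
            key=lambda t: (t[0], -cost[t[1]][t[2]],
                           min(supply[t[1]], demand[t[2]])),
        )
        qty = min(supply[i], demand[j])
        alloc[(i, j)] = qty
        supply[i] -= qty
        demand[j] -= qty
        if supply[i] == 0:
            rows.remove(i)
        if demand[j] == 0:
            cols.remove(j)
    return alloc


COST = [[4, 2, 7, 3], [3, 7, 5, 8], [9, 4, 3, 1]]
allocation = vogel(COST, [250, 450, 500], [200, 400, 300, 300])
total_cost = sum(COST[i][j] * q for (i, j), q in allocation.items())
print(allocation)
print(total_cost)   # 3400
```

On this data the sketch makes the same sequence of allocations as Tables 6.22 to 6.25; other tie-breaking conventions may give a different, equally valid starting solution.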
Example 5: Find the initial basic solution for the transportation problem and hence solve it.
Table 6.28: Transportation Problem
                Destination
Source     1      2      3      4      Supply
1          4      2      7      3      250
2          3      7      5      8      450
3          9      4      3      1      500
Demand     200    400    300    300
Solution: Vogel's Approximation Method (VAM) is preferred to find the initial feasible solution. The advantage of this method is that it gives an initial solution which is nearer to an optimal solution or is the optimal solution itself.
Step 1: The given transportation problem is a balanced one, as the sum of supply equals the sum of demand.
Step 2: The initial basic solution is found by applying Vogel's Approximation Method and the result is shown in Table 6.29.

Table 6.29 (allocated units in parentheses)
                Destination
Source     1          2          3          4          Supply
1          4          2 (250)    7          3          250
2          3 (200)    7          5 (250)    8          450
3          9          4 (150)    3 (50)     1 (300)    500
Demand     200        400        300        300

Step 3: Calculate the Total Transportation Cost.
= (2 × 250) + (3 × 200) + (5 × 250) + (4 × 150) + (3 × 50) + (1 × 300)
= 500 + 600 + 1250 + 600 + 150 + 300
= Rs. 3,400
Step 4: Check for degeneracy. For this, verify the condition,
Number of allocations, N = m + n − 1
6 = 3 + 4 − 1
6 = 6
Since the condition is satisfied, degeneracy does not exist.
Step 5: Test for optimality using the modified distribution method. Compute the values of Ui and Vj for rows and columns respectively by applying the formula for occupied cells,
Cij + Ui + Vj = 0
Then, the opportunity cost for each unoccupied cell is calculated using the formula
Opportunity cost = Cij + Ui + Vj
and denoted at the left hand bottom corner of each unoccupied cell. The computed values of Ui and Vj are shown in Table 6.30.
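The optimality test of Step 5 can be sketched as follows, using this lesson's sign convention Cij + Ui + Vj = 0 for occupied cells. The function name modi and the anchor_row argument are illustrative assumptions; indices are zero-based, so anchor_row=2 corresponds to taking U3 = 0. The allocation is the VAM solution of Table 6.29.

```python
def modi(cost, alloc, anchor_row):
    """Return (u, v, opportunity costs of unoccupied cells)."""
    m, n = len(cost), len(cost[0])
    u, v = {anchor_row: 0}, {}
    # Propagate through occupied cells until every Ui and Vj is known.
    while len(u) < m or len(v) < n:
        progressed = False
        for (i, j) in alloc:
            if i in u and j not in v:
                v[j] = -cost[i][j] - u[i]
                progressed = True
            elif j in v and i not in u:
                u[i] = -cost[i][j] - v[j]
                progressed = True
        if not progressed:
            raise ValueError("too few occupied cells (degenerate solution)")
    opportunity = {
        (i, j): cost[i][j] + u[i] + v[j]
        for i in range(m) for j in range(n) if (i, j) not in alloc
    }
    return u, v, opportunity


COST = [[4, 2, 7, 3], [3, 7, 5, 8], [9, 4, 3, 1]]
ALLOC = {(0, 1): 250, (1, 0): 200, (1, 2): 250,
         (2, 1): 150, (2, 2): 50, (2, 3): 300}

u, v, opportunity = modi(COST, ALLOC, anchor_row=2)
is_optimal = all(value >= 0 for value in opportunity.values())
print(u, v)
print(opportunity, is_optimal)
```

A negative opportunity cost would mean the solution is not optimal and the allocations must be shifted along a closed loop as in Step 6.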
Table 6.30: Calculation of the Opportunity Cost
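(allocations in parentheses, opportunity costs in square brackets)
                    Destination
              1          2          3          4        Supply
Source  1     4 [5]      2 (250)    7 [6]      3 [4]     250     U1 = 2
        2     3 (200)    7 [1]      5 (250)    8 [5]     450     U2 = −2
        3     9 [8]      4 (150)    3 (50)     1 (300)   500     U3 = 0
Demand      200        400        300        300
            V1 = −1    V2 = −4    V3 = −3    V4 = −1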
Calculate the values of Ui and Vj using the formula for occupied cells. Assume any one of the Ui and Vj values to be zero (here U3 is taken as 0).
Cij + Ui + Vj = 0
4 + 0 + V2 = 0, V2 = −4
3 + 0 + V3 = 0, V3 = −3
1 + 0 + V4 = 0, V4 = −1
5 + U2 − 3 = 0, U2 = −2
3 − 2 + V1 = 0, V1 = −1
2 + U1 − 4 = 0, U1 = 2
Calculate the opportunity cost of each unoccupied cell using the formula Cij + Ui + Vj:
Cell (1, 1): 4 + 2 − 1 = 5
Cell (1, 3): 7 + 2 − 3 = 6
Cell (1, 4): 3 + 2 − 1 = 4
Cell (2, 2): 7 − 2 − 4 = 1
Cell (2, 4): 8 − 2 − 1 = 5
Cell (3, 1): 9 + 0 − 1 = 8
Since all the opportunity cost values are positive, the solution is optimum.
Total transportation cost = (2 × 250) + (3 × 200) + (5 × 250) + (4 × 150) + (3 × 50) + (1 × 300)
= 500 + 600 + 1250 + 600 + 150 + 300
= Rs. 3,400/-
Example 6: Find the initial basic feasible solution for the transportation problem given in Table 6.31.
Table 6.31: Transportation Problem
                       To
                A       B       C     Available
From   I       50      30     220        1
       II      90      45     170        3
       III    250     200      50        4
Requirement     4       2       2
Solution : The initial basic feasible solution using VAM is shown in Table 6.32.
Table 6.32: Initial Basic Feasible Solution Using VAM (allocations in parentheses)
                       To
                A           B           C          Available
From   I       50 (1)      30         220             1
       II      90 (3)      45         170             3
       III    250         200 (2)      50 (2)         4
Requirement     4           2           2
Penalty       (40) (40)   (15) (15)   (120) --
Check for degeneracy. The number of allocations, N, must be equal to m + n − 1. Here m + n − 1 = 3 + 3 − 1 = 5 but N = 4, and 4 ≠ 5; therefore degeneracy exists. To overcome degeneracy, the condition N = m + n − 1 is satisfied by allocating a very small quantity, close to zero, to an unoccupied independent cell (i.e., one that does not form a closed loop with the occupied cells), or to the cell having the lowest transportation cost. This quantity is denoted by e. It affects neither the total cost nor the supply and demand values. Table 6.33 shows the resolved degenerate table.
Table 6.33: Resolved Degenerate Table
(allocations in parentheses)
                       To
                A           B           C          Available
From   I       50 (1)      30         220             1
       II      90 (3)      45         170             3
       III    250 (e)     200 (2)      50 (2)         4
Requirement     4           2           2
Total transportation cost = (50 × 1) + (90 × 3) + (200 × 2) + (50 × 2) + (250 × e)
= 50 + 270 + 400 + 100 + 250e
= 820 + 250e
= Rs. 820, since e → 0
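The degeneracy condition N = m + n − 1 used in Examples 5 and 6 can be checked mechanically. The following is a minimal Python sketch, not part of the original lesson; the function name `is_degenerate` and the zero-based (row, column) cell keys are illustrative assumptions.

```python
def is_degenerate(allocations, m, n):
    """Return True when the number of occupied cells is below m + n - 1.

    allocations maps (row, column) -> quantity shipped; rows and columns
    are numbered from 0 (an assumption made for this sketch).
    """
    occupied = sum(1 for qty in allocations.values() if qty > 0)
    return occupied < m + n - 1


# Example 5: 3 sources, 4 destinations, 6 allocations.
example5 = {(0, 1): 250, (1, 0): 200, (1, 2): 250,
            (2, 1): 150, (2, 2): 50, (2, 3): 300}
example5_degenerate = is_degenerate(example5, 3, 4)

# Example 6: 3 sources, 3 destinations, only 4 allocations from VAM.
example6 = {(0, 0): 1, (1, 0): 3, (2, 1): 2, (2, 2): 2}
example6_degenerate = is_degenerate(example6, 3, 3)

print(example5_degenerate, example6_degenerate)
```

When the check reports degeneracy, the small quantity e is added to a suitable unoccupied cell, as done in Table 6.33.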
Example 7: Obtain an optimal solution for the transportation problem by MODI method given in Table 6.34.
Table 6.34: Transportation Problem
                    Destination
              D1     D2     D3     D4    Supply
Source  S1    19     30     50     10       7
        S2    70     30     40     60       9
        S3    40      8     70     20      18
Demand         5      8      7     14
Solution: Step 1: The initial basic feasible solution is found using Vogel's Approximation Method as shown in Table 6.35.
Table 6.35: Initial Basic Feasible Solution Using VAM (allocations in parentheses)
                    Destination
              D1        D2        D3        D4        Supply   Row penalties
Source  S1    19 (5)    30        50        10 (2)       7     (9) (9) (40) (40)
        S2    70        30        40 (7)    60 (2)       9     (10) (20) (20) (20)
        S3    40         8 (8)    70        20 (10)     18     (12) (20) (50) --
Demand         5         8         7        14
Column        (21)      (22)      (10)      (10)
penalties     (21)       --       (10)      (10)
               --        --       (10)      (10)
               --        --       (10)      (50)
Total transportation cost = (19 × 5) + (10 × 2) + (40 × 7) + (60 × 2) + (8 × 8) + (20 × 10)
= 95 + 20 + 280 + 120 + 64 + 200
= Rs. 779
Step 2:
To check for degeneracy, verify that the number of allocations N = m + n − 1. In this problem the number of allocations is 6, which is equal to m + n − 1:
N = m + n − 1
6 = 3 + 4 − 1
6 = 6
Therefore degeneracy does not exist.
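The VAM allocation of Step 1 can be reproduced with a short program. This is a minimal Python sketch under stated assumptions, not the procedure of any particular package: the function name `vam` is illustrative, the problem is assumed to be balanced, ties are broken in favour of the first candidate found, and when only one cost remains in a row or column that cost itself is used as the penalty.

```python
def vam(costs, supply, demand):
    """Vogel's Approximation Method for a balanced transportation problem."""
    supply, demand = list(supply), list(demand)
    rows = list(range(len(supply)))
    cols = list(range(len(demand)))
    allocation = {}

    def penalty(values):
        # Difference between the two smallest costs (assumption: a lone
        # remaining cost is used as its own penalty).
        ordered = sorted(values)
        return ordered[1] - ordered[0] if len(ordered) > 1 else ordered[0]

    while rows and cols:
        candidates = [(penalty([costs[i][j] for j in cols]), "row", i) for i in rows]
        candidates += [(penalty([costs[i][j] for i in rows]), "col", j) for j in cols]
        _, kind, k = max(candidates, key=lambda c: c[0])
        if kind == "row":
            i, j = k, min(cols, key=lambda c: costs[k][c])
        else:
            i, j = min(rows, key=lambda r: costs[r][k]), k
        qty = min(supply[i], demand[j])
        allocation[(i, j)] = qty
        supply[i] -= qty
        demand[j] -= qty
        if supply[i] == 0:
            rows.remove(i)
        if demand[j] == 0:
            cols.remove(j)
    return allocation


# Data of Table 6.34 (rows S1..S3, columns D1..D4, numbered from 0).
costs = [[19, 30, 50, 10],
         [70, 30, 40, 60],
         [40, 8, 70, 20]]
allocation = vam(costs, [7, 9, 18], [5, 8, 7, 14])
total = sum(costs[i][j] * qty for (i, j), qty in allocation.items())
print(allocation, total)
```

For this data every penalty comparison has a unique maximum, so the tie-breaking assumption does not influence the result.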
Step 3:
Test for optimality using the MODI method. In Table 6.36 the values of Ui and Vj are calculated by applying the formula Cij + Ui + Vj = 0 for occupied cells, and the opportunity costs are calculated as Cij + Ui + Vj for unoccupied cells.
Table 6.36: Optimality Test Using MODI Method (allocations in parentheses, opportunity costs in square brackets)
                    Destination
              D1         D2         D3         D4        Supply
Source  S1    19 (5)     30 [32]    50 [60]    10 (2)       7     U1 = 0
        S2    70 [1]     30 [−18]   40 (7)     60 (2)       9     U2 = −50
        S3    40 [11]     8 (8)     70 [70]    20 (10)     18     U3 = −10
Demand         5          8          7         14
              V1 = −19   V2 = 2     V3 = 10    V4 = −10
Initially assume U1 = 0.
Find the values of the dual variables Ui and Vj for occupied cells, using Cij + Ui + Vj = 0:
19 + 0 + V1 = 0, V1 = −19
10 + 0 + V4 = 0, V4 = −10
60 + U2 − 10 = 0, U2 = −50
20 + U3 − 10 = 0, U3 = −10
8 − 10 + V2 = 0, V2 = 2
40 − 50 + V3 = 0, V3 = 10
Find the opportunity cost of each unoccupied cell, using Cij + Ui + Vj:
Cell (S1, D2): 30 + 0 + 2 = 32
Cell (S1, D3): 50 + 0 + 10 = 60
Cell (S2, D1): 70 − 50 − 19 = 1
Cell (S2, D2): 30 − 50 + 2 = −18
Cell (S3, D1): 40 − 10 − 19 = 11
Cell (S3, D3): 70 − 10 + 10 = 70
In Table 6.36 the cell (S2, D2) has the most negative opportunity cost. This negative cost has to be converted to a positive cost without altering the supply and demand values.
Step 4: Construct a closed loop. Introduce a quantity +θ in the most negative cell (S2, D2) and put −θ in cell (S3, D2) in order to balance column D2. Now take a right-angle turn and locate an occupied cell in column D4. The occupied cell is (S3, D4); put +θ in that cell. Now put −θ in cell (S2, D4) to balance column D4. Join all the cells to form a complete closed path. The closed path is shown in Figure 6.5.
Figure 6.5: Closed Path
(S2, D2): +θ, allocation 0      (S2, D4): −θ, allocation 2
(S3, D2): −θ, allocation 8      (S3, D4): +θ, allocation 10
Now identify the allocations in the cells marked −θ, which are 2 and 8. Take the minimum value, 2, as the value of θ. This value is added to cells (S2, D2) and (S3, D4), which have + signs, and subtracted from cells (S2, D4) and (S3, D2), which have − signs. The process is shown in Figure 6.6.
Figure 6.6
                    Destination
              D1        D2           D3        D4            Supply
Source  S1    19 (5)    30           50        10 (2)           7
        S2    70        30 (0 + 2)   40 (7)    60 (2 − 2)       9
        S3    40         8 (8 − 2)   70        20 (10 + 2)     18
Demand         5         8            7        14

The revised allocation is:
                    Destination
              D1        D2        D3        D4        Supply
Source  S1    19 (5)    30        50        10 (2)       7
        S2    70        30 (2)    40 (7)    60           9
        S3    40         8 (6)    70        20 (12)     18
Demand         5         8         7        14
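The reallocation along the closed path can be written as a small routine. This is an illustrative Python sketch only; the function name `shift_along_loop` and the convention that the loop is listed starting from the entering cell, with signs alternating +, −, are assumptions of the sketch. It also drops every cell that falls to zero, which is adequate here but would need the small quantity e if two cells emptied at once.

```python
def shift_along_loop(allocations, loop):
    """Move theta units around a closed loop.

    loop lists the cells in path order, starting with the entering
    (unoccupied) cell; cells at even positions gain theta and cells at
    odd positions lose theta.
    """
    theta = min(allocations[cell] for cell in loop[1::2])
    updated = dict(allocations)
    for position, cell in enumerate(loop):
        change = theta if position % 2 == 0 else -theta
        updated[cell] = updated.get(cell, 0) + change
    # Keep only occupied cells (assumes a single cell leaves the basis).
    updated = {cell: qty for cell, qty in updated.items() if qty > 0}
    return theta, updated


# Initial VAM solution of Example 7; rows S1..S3 and columns D1..D4 from 0.
initial = {(0, 0): 5, (0, 3): 2, (1, 2): 7, (1, 3): 2, (2, 1): 8, (2, 3): 10}
# Loop: (S2, D2) +, (S3, D2) -, (S3, D4) +, (S2, D4) -
loop = [(1, 1), (2, 1), (2, 3), (1, 3)]
theta, revised = shift_along_loop(initial, loop)
print(theta, revised)
```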
Now, again check for degeneracy. Here the number of allocations is 6. Verify whether the number of allocations N = m + n − 1:
6 = 3 + 4 − 1
6 = 6
Therefore degeneracy does not exist. Again find the values of Ui, Vj and the opportunity costs for the revised allocation (the values are also shown in Table 6.39).
For occupied cells, Cij + Ui + Vj = 0:
19 + 0 + V1 = 0, V1 = −19
10 + 0 + V4 = 0, V4 = −10
20 + U3 − 10 = 0, U3 = −10
8 − 10 + V2 = 0, V2 = 2
30 + U2 + 2 = 0, U2 = −32
40 − 32 + V3 = 0, V3 = −8
For unoccupied cells, the opportunity cost is Cij + Ui + Vj:
Cell (S1, D2): 30 + 0 + 2 = 32
Cell (S1, D3): 50 + 0 − 8 = 42
Cell (S2, D1): 70 − 32 − 19 = 19
Cell (S2, D4): 60 − 32 − 10 = 18
Cell (S3, D1): 40 − 10 − 19 = 11
Cell (S3, D3): 70 − 10 − 8 = 52
All the opportunity cost values are positive. Hence optimality is reached. The final allocations are shown in Table 6.39.
Table 6.39: Final Allocation
(allocations in parentheses)
                    Destination
              D1         D2        D3        D4         Supply
Source  S1    19 (5)     30        50        10 (2)        7     U1 = 0
        S2    70         30 (2)    40 (7)    60            9     U2 = −32
        S3    40          8 (6)    70        20 (12)      18     U3 = −10
Demand         5          8         7        14
              V1 = −19   V2 = 2    V3 = −8   V4 = −10
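The optimality test can be verified numerically. The sketch below follows the sign convention of this lesson (Cij + Ui + Vj = 0 for occupied cells, opportunity cost Cij + Ui + Vj for unoccupied cells). It is a minimal Python illustration: the function name `modi_opportunity_costs` is an assumption, U1 is fixed at 0, and the solution is assumed to be non-degenerate so that every Ui and Vj can be determined.

```python
def modi_opportunity_costs(costs, occupied):
    """Return (u, v, opportunity costs) using C + U + V = 0 on occupied cells."""
    m, n = len(costs), len(costs[0])
    u, v = [None] * m, [None] * n
    u[0] = 0  # initial assumption, as in the lesson
    changed = True
    while changed:
        changed = False
        for i, j in occupied:
            if u[i] is not None and v[j] is None:
                v[j] = -costs[i][j] - u[i]
                changed = True
            elif v[j] is not None and u[i] is None:
                u[i] = -costs[i][j] - v[j]
                changed = True
    opportunity = {(i, j): costs[i][j] + u[i] + v[j]
                   for i in range(m) for j in range(n)
                   if (i, j) not in occupied}
    return u, v, opportunity


costs = [[19, 30, 50, 10],
         [70, 30, 40, 60],
         [40, 8, 70, 20]]
# Occupied cells of the final allocation (rows and columns numbered from 0).
occupied = [(0, 0), (0, 3), (1, 1), (1, 2), (2, 1), (2, 3)]
u, v, opportunity = modi_opportunity_costs(costs, occupied)
is_optimal = all(value >= 0 for value in opportunity.values())
print(u, v, opportunity, is_optimal)
```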
Total transportation cost = (19 × 5) + (10 × 2) + (30 × 2) + (40 × 7) + (8 × 6) + (20 × 12)
= 95 + 20 + 60 + 280 + 48 + 240
= Rs. 743
Example 8: Solve the transportation problem
Supply 10 8 5 23
Here the supply does not meet the demand and is short of 2 units. To convert it to a balanced transportation problem add a dummy row and assume the unit cost for the dummy cells as zero as shown in Table 6.40 and solve.
Table 6.40: Dummy Row Added to TP
The company has five warehouses. The demands at these warehouses and the transportation costs per unit are given in Table 6.42 below. The selling price per unit is Rs. 30/-.
Table 6.42: Transportation Problem
              Transportation cost (Rs), unit-wise
Warehouse      A      B      C      D     Demand
    1          6      9      5      3      100
    2          8     10      7      7      200
    3          2      6      3      8      120
    4         11      6      2      9       80
    5          3      4      8     10       70
(i) Formulate the problem to maximize profits.
(ii) Determine the solution using TORA.
Solution: (i) The objective is to maximize the profits. The formulation of the transportation problem as a profit matrix is shown in Table 6.43. The profit values are arrived at as follows:
Profit = Selling price − Production cost − Transportation cost
Table 6.43: Profit Matrix
Destination    A      B      C      D     Demand
    1          6      4     10     15      100
    2          4      3      8     11      200
    3         10      7     12     10      120
    4          1      7     13      9       80
    5          9      9      7      8       70
Supply       150    250    100     70      570
Convert the profit matrix to an equivalent loss matrix by subtracting all the profit values from the highest profit value, 15. The loss matrix obtained is shown in Table 6.44.
Table 6.44: Loss Matrix
Destination    A      B      C      D     Demand
    1          9     11      5      0      100
    2         11     12      7      4      200
    3          5      8      3      5      120
    4         14      8      2      6       80
    5          6      6      8      7       70
Supply       150    250    100     70      570
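The conversion from the profit matrix to the loss matrix is a single subtraction per cell. The following minimal Python sketch reproduces it; the variable names are illustrative, and the rows are taken as warehouses 1 to 5 with columns A to D.

```python
# Profit matrix of Table 6.43 (rows: warehouses 1..5, columns: A..D).
profit = [[6, 4, 10, 15],
          [4, 3, 8, 11],
          [10, 7, 12, 10],
          [1, 7, 13, 9],
          [9, 9, 7, 8]]

# Subtract every profit from the highest profit to obtain relative losses.
highest = max(max(row) for row in profit)
loss = [[highest - value for value in row] for row in profit]

for row in loss:
    print(row)
```

Minimizing the total of these loss values selects the same shipments as maximizing the total profit, because every cell is shifted by the same constant per unit shipped.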
Output Screen:
The first iteration itself is optimal, hence optimality is reached.
(iii) To find the total profit: The maximum total profit associated with the solution is
Total profit = (6 × 10) + (4 × 20) + (10 × 120) + (3 × 180) + (9 × 70) + (10 × 20) + (13 × 80) + (15 × 70)
= 60 + 80 + 1200 + 540 + 630 + 200 + 1040 + 1050
= Rs. 4,800.00
                 Destination
              A       B       C     Supply
Source  1    25      21      19      120
        2    15       7      --      150
        3    10      12      16       80
Demand      150     125      75
Solution: The entries of the transportation cost are made using TORA
Input Screen:
Output Screen:
From the output schedule, no goods are to be shipped from source 2 to destination C. The total transportation cost is Rs. 4,600/-.
Consider a company with its manufacturing facilities situated at two places, Coimbatore and Pune. The units produced at each facility are shipped to either of the company's warehouse hubs located at Chennai and Mumbai. The company has its own retail outlets in Delhi, Hyderabad, Bangalore and Thiruvananthapuram. The network diagram representing the nodes and the transportation cost per unit is shown in Figure 6.11. The supply and demand requirements are also given.
Figure 6.11 (network diagram):
Origin nodes (manufacturing facilities): 1 Coimbatore, 2 Pune
Transshipment nodes (warehouses): 3 Chennai, 4 Mumbai
Destination nodes (retail outlets): 5 Delhi, 6 Hyderabad, 7 Bangalore, 8 Thiruvananthapuram
Objective
The objective is to minimize the total cost:
Minimize Z = 4X13 + 7X14 + 6X23 + 3X24 + 7X35 + 4X36 + 3X37 + 5X38 + 5X45 + 6X46 + 7X47 + 8X48
Constraints: The number of units shipped from Coimbatore must be less than or equal to 800, because the supply from the Coimbatore facility is 800 units. Therefore, the constraint equation is as follows:
X13 + X14 ≤ 800 ...(i)
Similarly, for the Pune facility,
X23 + X24 ≤ 600 ...(ii)
Now consider node 3. The number of units shipped into node 3 from nodes 1 and 2 is X13 + X23, and the number of units shipped out from node 3 is X35 + X36 + X37 + X38. The number of units shipped in must be equal to the number of units shipped out, therefore
X13 + X23 = X35 + X36 + X37 + X38
Bringing all the variables to one side, we get
−X13 − X23 + X35 + X36 + X37 + X38 = 0 ...(iii)
Similarly, for node 4,
−X14 − X24 + X45 + X46 + X47 + X48 = 0 ...(iv)
Now consider the retail outlet nodes. The demand requirements of each outlet must be satisfied. Therefore, for retail node 5, the constraint equation is
X35 + X45 = 350
Similarly, for nodes 6, 7 and 8, we get
X36 + X46 = 200
X37 + X47 = 400
X38 + X48 = 450
Linear programming formulation:
Minimize Z = 4X13 + 7X14 + 6X23 + 3X24 + 7X35 + 4X36 + 3X37 + 5X38 + 5X45 + 6X46 + 7X47 + 8X48
subject to the constraints
Origin constraints:
X13 + X14 ≤ 800
X23 + X24 ≤ 600
Transshipment constraints:
−X13 − X23 + X35 + X36 + X37 + X38 = 0
−X14 − X24 + X45 + X46 + X47 + X48 = 0
Destination constraints:
X35 + X45 = 350
X36 + X46 = 200
X37 + X47 = 400
X38 + X48 = 450
1. Is the transportation model an example of decision-making under certainty or decision-making under uncertainty?
2. How can the travelling salesman problem be solved using the transportation model?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
6.17 KEYWORDS
Origin: The origin of a TP is the point from which shipments are dispatched.
Destination: The destination of a TP is a point to which shipments are transported.
Source: The supply location in a TP.
Northwest corner: A systematic procedure for establishing an initial feasible solution.
Degeneracy: A situation that occurs when the number of occupied squares in any solution is less than the number of rows plus the number of columns minus one in a transportation table.
Unbalanced problem: A situation in which demand is not equal to supply.
Vogel's Approximation Method: An iterative procedure for finding a feasible solution.
3. What are the methods used to find the initial transportation cost?
4. Which of the initial three methods gives a near-optimal solution?
5. Explain Vogel's approximation method of finding the initial solution.
6. What is degeneracy in a transportation problem? How is it resolved?
7. What are the conditions for forming a closed loop?
8. How are maximization problems solved using the transportation model?
9. How is optimality tested in solving transportation problems?
Exercise Problems
1. Develop a network representation of the transportation problem for a company that manufactures products at three plants and ships them to three warehouses. The plant capacities and warehouse demands are shown in the following table: The transportations cost per unit (in Rs.) is given in matrix.
                         Warehouse
Plant                 W1     W2     W3     Plant capacity (no. of units)
P1                    22     18     26        350
P2                    12     12     10        450
P3                    14     20     10        200
Warehouse demand     250    450    300
(no. of units)
2. Determine whether a dummy source or a dummy destination is required to balance the model given.
(a) Supply a1 = 15, a2 = 5, a3 = 4, a4 = 6; Demand b1 = 4, b2 = 15, b3 = 6, b4 = 10
(b) Supply a1 = 27, a2 = 13, a3 = 10; Demand b1 = 30, b2 = 10, b3 = 6, b4 = 10
(c) Supply a1 = 2, a2 = 3, a3 = 5; Demand b1 = 3, b2 = 2, b3 = 2, b4 = 2, b5 = 1
3.
A state has three power plants with generating capacities of 30, 40 and 25 million KWH that supply electricity to three cities located in the same state. The demand requirements (maximum) of the three cities are 35, 40 and 20 million KWH. The distribution cost (Rs. in thousand) per million unit for the three cities are given in the table below:
                 City
              1      2      3
Plant   1    60     75     45
        2    35     35     40
        3    55     50     45
Formulate the problem as a transportation model. Determine an economical distribution plan. If the demand is estimated to increase by 15%, what is your revised plan? If the transmission loss of 5% is considered, determine the optimal plan.
4. Find the initial transportation cost for the transportation matrix given, using the North-West corner method, the Least cost method and Vogel's Approximation Method.
                    Destination
              1      2      3      4    Supply
Source  A     5      6      7      8      25
        B     7      5      4      2      75
        C     6      1      3      2      15
Demand       50     30     20     15
5. In problem No. 4, if the demand for destination 4 increases from 15 units to 25 units, develop the transportation schedule incorporating the change.
6. Find the initial solution using all the three methods and hence find the optimal solution using the TORA package for the following transportation problem. The unit transportation cost is given in the following matrix:
                         Warehouse
               1      2      3      4      5      6    Supply
Factory  A    10     25     35     16     18     22      70
         B    11     22     16     18     22     19      60
         C    21     32     41     20     20     11      50
         D    25     24     23     22     23     24      85
         E    16     21     18     20     19     16      45
Demand        55     45     35     40     70     65
7. The Sharp Manufacturing Company produces three types of monoblock pumps for domestic use. Five machines are used for manufacturing the pumps. The production rate varies for each machine and also the unit product cost. Daily demand and machine availability are given below.
Demand Information
Product             A        B        C
Demand (units)    2000    15000     700
Determine the minimum production schedule for the products and machines.
8. A company has plants at locations A, B and C with the daily capacity to produce chemicals to a maximum of 3000 kg, 1000 kg and 2000 kg respectively. The cost of production (per kg) are Rs. 800, Rs. 900 and Rs. 7.50 respectively. Customers' requirement of chemicals per day is as follows:
Customer     Chemical required     Price offered
   1              2000                 200
   2              1000                 215
   3              2500                 225
   4              1000                 200
Transportation cost (in rupees) per kg from plant locations to customers' place is given in the table.
                 Customer
              1      2      3      4
Plant   A     5      7     10     12
        B     7      3      4      2
        C     4      6      3      9
Find the transportation schedule that minimizes the total transportation cost.
9. A transportation model has four supplies and five destinations. The following table shows the cost of shipping one unit from a particular supply to a particular destination.
                    Destination
Source        1      2      3      4      5    Supply
   1         13      6      9      6     10      13
   2          8      2      7      7      9      15
   3          2     12      5      8      7      13
Demand       10     15      7     10      2
The following feasible transportation pattern is proposed: x11 = 10, x12 = 3, x22 = 9, x23 = 6, x33 = 9, x34 = 4, x44 = 9, x45 = 5. Test whether these allocations involve least transportation cost. If not, determine the optimal solution.
10. A linear programming model is given:
Minimize Z = 8x11 + 12x12 + 9x22 + 10x23 + 7x31 + 6x32 + 15x33,
subject to the constraints
Supply constraints:
x11 + x12 + x13 = 60
x21 + x22 + x23 = 50
x31 + x32 + x33 = 30
Demand constraints:
x11 + x21 + x31 = 20
x12 + x22 + x32 = 60
x13 + x23 + x33 = 30
Formulate and solve as a transportation problem to minimize the transportation cost.
11. A company has four factories situated in four different locations in the state and four company showrooms in four other locations outside the state. The per unit sale price, transportation cost and cost of production is given in the table below, along with the weekly requirement.
Factory 1 A B C D Factory A B C D 9 4 4 8 2 4 4 6 7 Showrooms 3 5 4 5 7 Weekly Capacity (units) 15 20 25 20 4 3 4 6 4 12 17 19 17 Weekly demand (units) 10 14 20 22 Cost of production (Rs)
Determine the weekly distribution schedule to maximize the sales profits.
12. Solve the given transportation problem to maximize profit.
                       Profit / unit
Source        1      2      3      4      5      6    Supply
   A         65     30     77     31     65     51     200
   B         60     51     65     42     64     76     225
   C         70     62     21     71     45     52     125
Demand       45     55     40     60     25     70
Use TORA to solve the problem.
13. A computer manufacturer has decided to launch an advertising campaign on television, magazines and radio. It is estimated that maximum exposure for these media will be 70, 50, and 40 million respectively. According to a market survey, it was found that the minimum desired exposures within age groups 15-20, 21-25, 26-30, 31-35 and above 35 are 10, 20, 25, 35 and 55 million respectively. The table below gives the estimated cost in paise per exposure for each of the media. Determine an advertising plan to minimize the cost.
Solve the problem and find the optimal solution, i.e., maximum coverage at minimum cost.
14. A garment manufacturer has 4 units I, II, III, and IV, the production from which are received by 4 direct customers. The weekly production of each manufacturing unit is 1200 units and all the units are of the same capacity. The company supplies the entire production from one unit to one supplier. Since the customers are situated at different locations, the transportation cost per unit varies. The unit cost of transportation is given in the table. As per the company's policy, the supply from unit B is restricted to customer 2 and 4, and from unit D to customer 1 and 3. Solve the problem to cope with the supply and demand constraints.
Manufacturing unit A B C D 1 4 4 6 2 6 5 7 3 8 5 5 4 3 9 6
15. Check whether the following transportation problem has an optimal allocation:
Warehouse 1 A B C D Dummy Demand 150 50 50 100 100 25 25 50 50 100 50 100 150 2 3 4 5 Supply 100 25 75 200 100
16. A company dealing in home appliances has a sales force of 20 men who operate from three distribution centers. The sales manager feels that 5 salesmen are needed to distribute product line I, 6 to distribute product line II, 5 for product line III and 4 to distribute product line IV. The cost (in Rs) per day of assigning salesmen from each of the offices are as follows:
                 Product Line
              I      II     III     IV
Source  A    10      12      13      9
        B     9      11      12     13
        C     7       8       9     10
Currently, 8 salesmen are available at center A, 5 at center B and 7 at center C. How many salesmen should be assigned from each center to sell each product line, in order to minimize the cost? Is the solution unique?
18. Three water distribution tanks with daily capacities of 7, 6 and 9 lakh litres respectively, supply three distribution areas with daily demands of 5, 8 and 9 lakh litres respectively. Water is transported to the distribution areas through an underground network of pipelines. The cost of transportation is Rs 0.50 per 1000 litres per pipeline kilometer. The table shows the pipeline lengths between the water tanks and the distribution areas.
                 Distribution Area
              1       2       3
Source  A    75      95     120
        B   250     150      80
        C   300     250     140
A. Formulate the transportation model.
B. Use TORA to determine the optimum distribution schedule.
19. In problem 18, if the demand for distribution area 3 increases to 11 lakh litres, determine a suitable distribution plan to meet the excess demand and minimize the distribution cost. Use TORA to solve the problem.
20. Formulate a linear programming model for the following transshipment network given below.
[Network diagram: origin nodes O1, O2; transshipment nodes T3, T4; destination nodes D5, D6, D7]
ANSWERS
False (b) least (c) (c)
TO
QUESTIONS
(e) (e)
FOR
True equal
LESSON
7
ASSIGNMENT MODEL
CONTENTS
7.0 Aims and Objectives
7.1 Introduction
7.2 Mathematical Structure of Assignment Problem
7.3 Network Representation of Assignment Problem
7.4 Use of Linear Programming to Solve Assignment Problem
7.5 Types of Assignment Problem
7.6 Hungarian Method for Solving Assignment Problem
7.7 Unbalanced Assignment Problem
7.8 Restricted Assignment Problem
7.9 Multiple and Unique Solutions
7.10 Maximization Problem
7.11 Travelling Salesman Problem
7.12 Solving Problems on the Computer with TORA
7.13 Solving Unbalanced Assignment Problem using Computer
7.14 Solving Maximization Problems Using Computers
7.15 Let us Sum Up
7.16 Lesson-end Activity
7.17 Keywords
7.18 Questions for Discussion
7.19 Terminal Questions
7.20 Model Answers to Questions for Discussion
7.21 Suggested Readings
7.1 INTRODUCTION
The basic objective of an assignment problem is to assign n resources to n activities so as to minimize the total cost or to maximize the total profit of allocation, in such a way that the measure of effectiveness is optimized. The problem of assignment arises because available resources such as men, machines, etc., have varying degrees of efficiency for performing different activities such as jobs. Therefore the cost, profit or time for performing the different activities is different. Hence the problem is: how should the assignments be made so as to optimize (maximize or minimize) the given objective? The assignment model can be applied in many decision-making processes, such as determining the optimum processing time for machine operators and jobs, the effectiveness of teachers and subjects, the design of a good plant layout, etc. This technique is also found suitable for routing travelling salesmen to minimize the total travelling cost or to maximize the sales.
1 Job 2 . I . n
Let n be the number of jobs and the number of operators, and let tij be the processing time of job i taken by operator j. A few applications of the assignment problem are:
i. assignment of employees to machines.
ii. assignment of operators to jobs.
iii. effectiveness of teachers and subjects.
iv. allocation of machines for optimum utilization of space.
v. salesmen to different sales areas.
vi. clerks to various counters.
In all the cases, the objective is to minimize the total time and cost or otherwise maximize the sales and returns.
The assignment problem is a special case of transportation problem where all sources and demand are equal to 1.
[Network diagram: operators A, B and C as sources (supply 1 each) connected to Jobs 1, 2 and 3 as destinations (demand 1 each); the arc labels give the processing times]
and so on. Formulating the equations for the time taken by each operator,
10 x11 + 16 x12 + 7 x13 = time taken by operator A.
9 x21 + 17 x22 + 6 x23 = time taken by operator B.
6 x31 + 13 x32 + 5 x33 = time taken by operator C.
The constraint in this assignment problem is that each operator must be assigned to only one job and, similarly, each job must be performed by only one operator. Taking this into account, the constraint equations are as follows:
x11 + x12 + x13 ≤ 1 (operator A)
x21 + x22 + x23 ≤ 1 (operator B)
x31 + x32 + x33 ≤ 1 (operator C)
x11 + x21 + x31 = 1 (Job 1)
x12 + x22 + x32 = 1 (Job 2)
x13 + x23 + x33 = 1 (Job 3)
Objective function: The objective is to minimize the time taken to complete all the jobs. Using the cost data table, the objective function is
Minimize Z = 10 x11 + 16 x12 + 7 x13 + 9 x21 + 17 x22 + 6 x23 + 6 x31 + 13 x32 + 5 x33
The linear programming model for the problem will be:
Minimize Z = 10 x11 + 16 x12 + 7 x13 + 9 x21 + 17 x22 + 6 x23 + 6 x31 + 13 x32 + 5 x33
subject to the constraints
x11 + x12 + x13 ≤ 1 ....(i)
x21 + x22 + x23 ≤ 1 ....(ii)
x31 + x32 + x33 ≤ 1 ....(iii)
x11 + x21 + x31 = 1 ....(iv)
x12 + x22 + x32 = 1 ....(v)
x13 + x23 + x33 = 1 ....(vi)
where xij ≥ 0 for i = 1, 2, 3 and j = 1, 2, 3.
The problem is solved on a computer, using the transportation model in the TORA package. The input screen and output screens are shown in Figure 7.1 and Figure 7.2 respectively.
Figure 7.3: TORA, Output Screen
The objective function value = 28 mins.
Table 7.3: The Assignment Schedule
Men       Job     Time Taken (in mins.)
 1         2           16
 2         3            6
 3         1            6
Total                  28
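For a problem this small, the schedule in Table 7.3 can be cross-checked by trying every possible assignment. The following Python sketch is an illustration only and is not how TORA works internally; the function name `best_assignment` is an assumption, and rows and columns are numbered from 0.

```python
from itertools import permutations


def best_assignment(cost):
    """Try every one-to-one assignment and return the cheapest one.

    The returned tuple gives, for each row, the column assigned to it.
    Suitable only for small matrices, since n! assignments are examined.
    """
    n = len(cost)
    best = min(permutations(range(n)),
               key=lambda p: sum(cost[i][p[i]] for i in range(n)))
    return best, sum(cost[i][best[i]] for i in range(n))


# Processing times: rows are operators A, B, C; columns are Jobs 1, 2, 3.
times = [[10, 16, 7],
         [9, 17, 6],
         [6, 13, 5]]
assignment, total = best_assignment(times)
print(assignment, total)
```

The result (operator A to Job 2, B to Job 3, C to Job 1) agrees with the 28 minutes reported above.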
Step 2: Reduce the matrix by selecting the smallest element in each row and subtracting it from the other elements in that row.
Step 3: Reduce the new matrix column-wise using the same method as given in step 2.
Step 4: Draw the minimum number of lines to cover all zeros.
Step 5: If the number of lines drawn = order of matrix, then optimality is reached, so proceed to step 7. If optimality is not reached, then go to step 6.
Step 6: Select the smallest element of the whole matrix which is NOT COVERED by lines. Subtract this smallest element from all the other elements that are NOT COVERED by lines and add it to the elements at the intersection of lines. Leave the elements covered by a single line as they are. Now go to step 4.
Step 7: Take any row or column which has a single zero and assign by squaring it. Strike off the remaining zeros, if any, in that row and column (X). Repeat the process until all the assignments have been made.
Step 8: Write down the assignment results and find the minimum cost/time.
Note: While assigning, if no row or column has a single zero, choose any one zero and assign it. Strike off the remaining zeros in that column or row, and repeat the same for the other assignments also. If there is no single-zero allocation, it means that multiple solutions exist, but the cost will remain the same for the different sets of allocations.
Example 1: Assign the four tasks to four operators. The assigning costs are given in Table 7.4.
Table 7.4: Assignment Problem
                 Operators
              1      2      3      4
Tasks   A    20     28     19     13
        B    15     30     31     28
        C    40     21     20     17
        D    21     28     26     12
Solution:
Step 1: The given matrix is a square matrix and it is not necessary to add a dummy row/column.
Step 2: Reduce the matrix by selecting the smallest value in each row and subtracting it from the other values in that row. In row A the smallest value is 13, in row B it is 15, in row C it is 17 and in row D it is 12. The row-wise reduced matrix is shown in Table 7.5.
Table 7.5: Row-wise Reduction
                 Operators
              1      2      3      4
Tasks   A     7     15      6      0
        B     0     15     16     13
        C    23      4      3      0
        D     9     16     14      0
Step 3: Reduce the new matrix given in Table 7.5 by selecting the smallest value in each column and subtracting it from the other values in that column. In column 1 the smallest value is 0, in column 2 it is 4, in column 3 it is 3 and in column 4 it is 0. The column-wise reduction matrix is shown in Table 7.6.
Table 7.6: Column-wise Reduction Matrix
                 Operators
              1      2      3      4
Tasks   A     7     11      3      0
        B     0     11     13     13
        C    23      0      0      0
        D     9     12     11      0
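The row-wise and column-wise reductions of Steps 2 and 3 are easy to verify by computation. This is a minimal Python sketch covering only the two reduction steps, not the full Hungarian method; the function names `reduce_rows` and `reduce_columns` are illustrative assumptions.

```python
def reduce_rows(matrix):
    """Subtract the smallest element of each row from every element of that row."""
    return [[value - min(row) for value in row] for row in matrix]


def reduce_columns(matrix):
    """Subtract the smallest element of each column from every element of that column."""
    minima = [min(column) for column in zip(*matrix)]
    return [[value - minima[j] for j, value in enumerate(row)] for row in matrix]


# Cost matrix of Table 7.4 (rows: tasks A..D, columns: operators 1..4).
cost = [[20, 28, 19, 13],
        [15, 30, 31, 28],
        [40, 21, 20, 17],
        [21, 28, 26, 12]]
row_reduced = reduce_rows(cost)
column_reduced = reduce_columns(row_reduced)
for row in column_reduced:
    print(row)
```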
Step 4: Draw the minimum number of lines possible to cover all the zeros in the matrix, as given in Table 7.7.
Table 7.7: Matrix with all Zeros Covered
                 Operators
              1      2      3      4
Tasks   A     7     11      3      0
        B     0     11     13     13
        C    23      0      0      0
        D     9     12     11      0
The first line is drawn crossing row C, covering three zeros; the second line is drawn crossing column 4, covering two zeros; and the third line is drawn crossing column 1 (or row B), covering a single zero.
Step 5: Check whether the number of lines drawn is equal to the order of the matrix. Here 3 ≠ 4, therefore optimality is not reached. Go to step 6.
Step 6: Take the smallest element of the matrix that is not covered by a line, which is 3. Subtract 3 from all the other values that are not covered and add 3 at the intersections of lines. Leave the values which are covered by a single line unchanged. Table 7.8 shows the details.
Table 7.8: Subtracted or Added to Uncovered Values and Intersection Lines Respectively
                 Operators
              1      2      3      4
Tasks   A     7      8      0      0
        B     0      8     10     13
        C    26      0      0      3
        D     9      9      8      0
Step 7: Now draw the minimum number of lines to cover all the zeros and check for optimality. Here, in Table 7.9, the minimum number of lines drawn is 4, which is equal to the order of the matrix. Hence optimality is reached.
Table 7.9: Optimality Matrix
                 Operators
              1      2      3      4
Tasks   A     7      8      0      0
        B     0      8     10     13
        C    26      0      0      3
        D     9      9      8      0
Step 8: Assign the tasks to the operators. Select a row that has a single zero and assign by squaring it. Strike off the remaining zeros, if any, in that row or column. Repeat the assignment for the other tasks. The final assignment is shown in Table 7.10.
Table 7.10: Final Assignment
(× marks a zero that has been struck off)
                 Operators
              1      2      3      4
Tasks   A     7      8      0      0 ×
        B     0      8     10     13
        C    26      0      0 ×    3
        D     9      9      8      0

Task       Operator     Cost
 A            3          19
 B            1          15
 C            2          21
 D            4          12
Total Cost = Rs. 67.00
Example 2: Solve the following assignment problem shown in Table 7.11 using the Hungarian method. The matrix entries are the processing time of each man in hours.
Table 7.11: Assignment Problem
                    Men
              1      2      3      4      5
Job    I     20     15     18     20     25
       II    18     20     12     14     15
       III   21     23     25     27     25
       IV    17     18     21     23     20
       V     18     18     16     19     20
Matrix with minimum number of lines drawn to cover all zeros is shown in Table 7.14.
Table 7.14: Matrix with all Zeros Covered
The number of lines drawn is 5, which is equal to the order of matrix. Hence optimality is reached. The optimal assignments are shown in Table 7.15.
Table 7.15: Optimal Assignment
                 Machines
              A      B      C      D      E
Job     1     5      7     11      6      7
        2     8      5      5      6      5
        3     6      7     10      7      3
        4    10      4      8      2      4
Solution: Convert the 4 × 5 matrix into a square matrix by adding a dummy row D5.
                 Machines
              A      B      C      D      E
Job     1     5      7     11      6      7
        2     8      5      5      6      5
        3     6      7     10      7      3
        4    10      4      8      2      4
        D5    0      0      0      0      0
The row-wise reduced matrix is:
                 Machines
              A      B      C      D      E
Job     1     0      2      6      1      2
        2     3      0      0      1      0
        3     3      4      7      4      0
        4     8      2      6      2      0
        D5    0      0      0      0      0
Column-wise reduction is not necessary since all columns contain a single zero. Now, draw minimum number of lines to cover all the zeros, as shown in Table 7.19.
Table 7.19: All Zeros in the Matrix Covered
                 Machines
              A      B      C      D      E
Job     1     0      2      6      1      2
        2     3      0      0      1      0
        3     3      4      7      4      0
        4     8      2      6      2      0
        D5    0      0      0      0      0
Number of lines drawn ≠ order of matrix. Hence the solution is not optimal. Select the least uncovered element, i.e., 1, subtract it from the other uncovered elements, add it to the elements at the intersection of lines and leave the elements that are covered by a single line unchanged, as shown in Table 7.20.
                 Machines
              A      B      C      D      E
Job     1     0      1      5      0      2
        2     4      0      0      1      1
        3     3      3      6      3      0
        4     8      1      5      1      0
        D5    1      0      0      0      1
Repeating the procedure gives:
                 Machines
              A      B      C      D      E
Job     1     0      1      5      0      3
        2     4      0      0      1      2
        3     2      2      5      2      0
        4     7      0      4      0      0
        D5    1      0      0      0      2
Number of lines drawn = Order of matrix. Hence optimality is reached. Now assign the jobs to machines, as shown in Table 7.22.
Table 7.22: Assigning Jobs to Machines
(× marks a zero that has been struck off)
                 Machines
              A      B      C      D      E
Job     1     0      1      5      0 ×    3
        2     4      0      0 ×    1      2
        3     2      2      5      2      0
        4     7      0 ×    4      0      0 ×
        D5    1      0 ×    0      0 ×    2
Hence, the optimal solution is:
Job        Machine     Cost
 1            A          5
 2            B          5
 3            E          3
 4            D          2
 D5           C          0
Total Cost = Rs. 15.00
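The dummy-row device of Example 3 can be illustrated with a few lines of code. The sketch below pads the 4 × 5 cost matrix with a zero-cost dummy row and then examines every assignment. It is a Python illustration, not the Hungarian method itself; the names `pad_to_square` and `cheapest_total` are assumptions of this sketch.

```python
from itertools import permutations


def pad_to_square(cost, fill=0):
    """Add dummy rows or columns of `fill` until the matrix is square."""
    rows, cols = len(cost), len(cost[0])
    size = max(rows, cols)
    padded = [row + [fill] * (size - cols) for row in cost]
    padded += [[fill] * size for _ in range(size - rows)]
    return padded


def cheapest_total(cost):
    """Minimum total cost over all one-to-one assignments (small matrices only)."""
    n = len(cost)
    return min(sum(cost[i][p[i]] for i in range(n))
               for p in permutations(range(n)))


# Cost matrix of Example 3 (rows: jobs 1..4, columns: machines A..E).
cost = [[5, 7, 11, 6, 7],
        [8, 5, 5, 6, 5],
        [6, 7, 10, 7, 3],
        [10, 4, 8, 2, 4]]
square = pad_to_square(cost)
total = cheapest_total(square)
print(square[-1], total)
```

The machine paired with the dummy row simply remains unused, which is why its cost is taken as zero.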
Example 4: In a plant layout, four different machines M1, M2, M3 and M4 are to be erected in a machine shop. There are five vacant areas A, B, C, D and E. Because of limited space, Machine M2 cannot be erected at area C and Machine M4 cannot be erected at area A. The cost of erection of machines is given in the Table 7.23.
Table 7.23: Assignment Problem
                    Area
              A      B      C      D      E
Machine M1    4      5      9      4      5
        M2    6      4     --      4      3
        M3    4      5      8      5      1
        M4   --      2      6      1      2
Find the optimal assignment plan. Solution: As the given matrix is not balanced, add a dummy row D5 with zero cost values. Assign a high cost H for (M2, C) and (M4, A). While selecting the lowest cost element neglect the high cost assigned H, as shown in Table 7.24 below.
Table 7.24: Dummy Row D5 Added
                    Area
              A      B      C      D      E
Machine M1    4      5      9      4      5
        M2    6      4      H      4      3
        M3    4      5      8      5      1
        M4    H      2      6      1      2
        D5    0      0      0      0      0
The row-wise reduced matrix is:
                    Area
              A      B      C      D      E
Machine M1    0      1      5      0      1
        M2    3      1      H      1      0
        M3    3      4      7      4      0
        M4    H      1      5      0      1
        D5    0      0      0      0      0
Note: Column-wise reduction is not necessary, as each column has at least one single zero. Now, draw minimum number of lines to cover all the zeros, see Table 7.26.
Table 7.26: Lines Drawn to Cover all Zeros
                  Area
Machine   A    B    C    D    E
M1        0    1    5    0    1
M2        3    1    H    1    0
M3        3    4    7    4    0
M4        H    1    5    0    1
D5        0    0    0    0    0

Number of lines drawn ≠ order of matrix. Hence the solution is not optimal. Select the smallest uncovered element, in this case 1. Subtract 1 from all the other uncovered elements and add 1 to the elements at the intersections of lines. The elements covered by a single line remain unchanged. These changes are shown in Table 7.27. Now try to draw the minimum number of lines to cover all the zeros.
Table 7.27: Added or Subtracted 1 from Elements
                  Area
Machine   A    B    C    D    E
M1        0    1    5    1    2
M2        2    0    H    1    0
M3        2    3    6    4    0
M4        H    0    4    0    1
D5        0    0    0    1    1
Now number of lines drawn = Order of matrix, hence optimality is reached. Optimal assignment of machines to areas are shown in Table 7.28.
Table 7.28
                  Area
Machine   A    B    C    D    E
M1       [0]   1    5    1    2
M2        2   [0]   H    1    0
M3        2    3    6    4   [0]
M4        H    0    4   [0]   1
D5        0    0   [0]   1    1

(Assigned zeros are shown in brackets; the remaining zeros are crossed out.)
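The result of the Hungarian method for Example 4 can be cross-checked by simply trying every possible assignment. The sketch below is only an illustration, not part of the lesson's procedure: it uses the costs of Table 7.24, and the value chosen for H (10**6) and the helper name best_assignment are assumptions made for this example.

```python
from itertools import permutations

H = 10**6  # assumed large penalty standing in for the prohibited cells (H)

areas = ["A", "B", "C", "D", "E"]
machines = ["M1", "M2", "M3", "M4", "D5"]  # D5 is the dummy row
cost = [
    [4, 5, 9, 4, 5],  # M1
    [6, 4, H, 4, 3],  # M2 cannot be erected at area C
    [4, 5, 8, 5, 1],  # M3
    [H, 2, 6, 1, 2],  # M4 cannot be erected at area A
    [0, 0, 0, 0, 0],  # D5 (dummy, zero cost)
]


def best_assignment(cost):
    """Try every row-to-column assignment and keep the cheapest one."""
    n = len(cost)
    best_total, best_perm = None, None
    for perm in permutations(range(n)):
        total = sum(cost[row][perm[row]] for row in range(n))
        if best_total is None or total < best_total:
            best_total, best_perm = total, perm
    return best_total, best_perm


total, perm = best_assignment(cost)
plan = {machines[row]: areas[col] for row, col in enumerate(perm)}
print(total, plan)
```

Enumeration is practical only for small matrices (5! = 120 assignments here); the Hungarian method reaches the same optimum without listing them all.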
Solution: Assign large value to the restricted combinations or introduce M, see Table 7.30.
Table 7.30: Large Value Assignment to Restricted Combinations
                  Job
Men       1    2    3    4    5
1         16   12   11   M    15
2         13   15   11   16   18
3         20   21   18   19   17
4         16   13   M    16   12
5         20   19   18   17   19

Table 7.31: Row-wise Reduced Matrix
                  Job
Men       1    2    3    4    5
1         5    1    0    M    4
2         2    4    0    5    7
3         3    4    1    2    0
4         4    1    M    4    0
5         3    2    1    0    1

Table 7.32: Column-wise Reduced Matrix
                  Job
Men       1    2    3    4    5
1         3    0    0    M    4
2         0    3    0    5    7
3         1    3    1    2    0
4         2    0    M    4    0
5         1    1    1    0    1
Draw minimum number of lines to cover all zeros, see Table 7.33.
Table 7.33: All Zeros Covered
                  Job
Men       1    2    3    4    5
1         3    0    0    M    4
2         0    3    0    5    7
3         1    3    1    2    0
4         2    0    M    4    0
5         1    1    1    0    1
Now, number of lines drawn = order of matrix, hence optimality is reached. The allocation of jobs to men is shown in Table 7.34.

Table 7.34: Job Allocation to Men
                  Job
Men       1    2    3    4    5
1         3    0   [0]   M    4
2        [0]   3    0    5    7
3         1    3    1    2   [0]
4         2   [0]   M    4    0
5         1    1    1   [0]   1

(Assigned zeros are shown in brackets; the remaining zeros are crossed out.)
As per the restriction conditions given in the problem, Man 1 and Man 4 are not assigned to Job 4 and Job 3 respectively.
Example 6: A marketing manager has five salesmen and sales districts. Considering the capabilities of the salesmen and the nature of districts, the marketing manager estimates that sales per month (in hundred rupees) for each salesman in each district would be as follows (Table 7.36). Find the assignment of salesmen to districts that will result in maximum sales.
Table 7.36: Sales per Month (in hundred rupees)
                  District
Salesman  A    B    C    D    E
1         32   38   40   28   40
2         40   24   28   21   36
3         41   27   33   30   37
4         22   38   41   36   36
5         29   33   40   35   39
Solution: The given maximization problem is converted into a minimization problem (Table 7.37) by subtracting every element of the given table from the highest sales value (i.e., 41).
Table 7.37: Conversion to Minimization Problem
                  District
Salesman  A    B    C    D    E
1         9    3    1    13   1
2         1    17   13   20   5
3         0    14   8    11   4
4         19   3    0    5    5
5         12   8    1    6    2

Table 7.38: Row-wise Reduced Matrix
                  District
Salesman  A    B    C    D    E
1         8    2    0    12   0
2         0    16   12   19   4
3         0    14   8    11   4
4         19   3    0    5    5
5         11   7    0    5    1
Reduce the matrix column-wise and draw the minimum number of lines to cover all the zeros in the matrix, as shown in Table 7.39.

Table 7.39: Column-wise Reduced Matrix
                  District
Salesman  A    B    C    D    E
1         8    0    0    7    0
2         0    14   12   14   4
3         0    12   8    6    4
4         19   1    0    0    5
5         11   5    0    0    1

Number of lines drawn ≠ order of matrix. Hence the solution is not optimal. Select the least uncovered element, i.e., 4, subtract it from the other uncovered elements, add it to the elements at the intersection of lines and leave the elements that are covered by a single line unchanged (Table 7.40).
Table 7.40: Added & Subtracted the least Uncovered Element
                  District
Salesman  A    B    C    D    E
1         12   0    0    7    0
2         0    10   8    10   0
3         0    8    4    2    0
4         23   1    0    0    5
5         15   5    0    0    1
Now, number of lines drawn = Order of matrix, hence optimality is reached. There are two alternative assignments due to presence of zero elements in cells (4, C), (4, D), (5, C) and (5, D).
Table 7.41: Two Alternative Assignments
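Assignment 1
                  District
Salesman  A    B    C    D    E
1         12  [0]   0    7    0
2        [0]   10   8    10   0
3         0    8    4    2   [0]
4         23   1   [0]   0    5
5         15   5    0   [0]   1

Assignment 2
                  District
Salesman  A    B    C    D    E
1         12  [0]   0    7    0
2         0    10   8    10  [0]
3        [0]   8    4    2    0
4         23   1   [0]   0    5
5         15   5    0   [0]   1

(Assigned zeros are shown in brackets.)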
Therefore,

Assignment 1
Salesman   District   Sales (in 00) Rs.
1          B          38
2          A          40
3          E          37
4          C          41
5          D          35

Assignment 2
Salesman   District   Sales (in 00) Rs.
1          B          38
2          E          36
3          A          41
4          C          41
5          D          35
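As a cross-check on Example 6, the maximum total sales can also be found by listing every salesman-to-district assignment. This is a minimal sketch using the sales figures given in the problem; the names total_sales, best and optimal are chosen here for illustration only.

```python
from itertools import permutations

districts = "ABCDE"
sales = [  # rows: salesmen 1 to 5, columns: districts A to E (in hundred rupees)
    [32, 38, 40, 28, 40],
    [40, 24, 28, 21, 36],
    [41, 27, 33, 30, 37],
    [22, 38, 41, 36, 36],
    [29, 33, 40, 35, 39],
]


def total_sales(perm):
    """Total sales when salesman s+1 is sent to district perm[s]."""
    return sum(sales[s][d] for s, d in enumerate(perm))


best = max(total_sales(p) for p in permutations(range(5)))

# Every assignment that reaches the best total, written as the districts
# of salesmen 1..5 in order, e.g. "BAECD" means 1-B, 2-A, 3-E, 4-C, 5-D.
optimal = ["".join(districts[d] for d in p)
           for p in permutations(range(5)) if total_sales(p) == best]
print(best, optimal)
```

Both assignments listed above appear among the optimal ones, each giving the same total sales.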
(Note: If more than one zero is available for assignment in a row or column of the final matrix, there are alternative optimal solutions. Calculate the value for each allocation to confirm the optimal solution.)

Example 7: A travelling salesman has to visit five cities. He wishes to start from a particular city, visit each city once and then return to his starting point. The travelling cost (in Rs.) between each pair of cities is given below.
Table 7.42: Travelling Salesman Problem
What should be the sequence of the salesman's visit, so that the cost is minimum?
Solution: The problem is solved as an assignment problem using Hungarian method; an optimal solution is reached as shown in Table 7.43.
Table 7.43: Optimal Solution Reached Using Hungarian Method
B 1 3
0
C 3
0
D 6 6
0
E
0
0 ! 3 1
1 0 !
This assignment means that the travelling salesman will start from city A, go to city E and return to city A without visiting the other cities. The cycle is not complete. To overcome this situation, the next smallest non-zero element can be assigned to start with. In this case it is 1, and there are three 1s. Therefore, consider all these 1s one by one and find the route which completes the cycle.

Case 1: Make the assignment for the cell (A, B), which has the value 1. Now, make the assignments for zeros in the usual manner. The resulting assignments are shown in Table 7.44.
Table 7.44: Resulting Assignment
B 1 3 0 ! 2
C 3
0
D 6 6
0
E 0 ! 0 ! 3 1
1 0 !
The assignment shown in Table 7.44 gives the route sequence A → B, B → C, C → D, D → E and E → A. The travelling cost for this solution is 2000 + 3000 + 4000 + 5000 + 1000 = Rs. 15,000.00.

Case 2: If the assignment is made for cell (D, C) instead of (D, E), a feasible solution cannot be obtained. The route for the assignment will be A → B → C → D → C. In this case, the salesman visits city C twice and the cycle is not complete. Therefore, the feasible sequence for this assignment is A → B → C → D → E → A, with a travelling cost of Rs. 15,000.00.
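Whether an assignment forms a complete cycle can be checked mechanically by following each city to its successor. The sketch below is illustrative only: the function name is_complete_tour is hypothetical, and in Case 2 the successor of city E (taken as A) is an assumption, since the text does not state it.

```python
def is_complete_tour(successor, start):
    """Follow successor[...] from start; True only if every city is
    visited exactly once before the route returns to start."""
    visited = []
    city = start
    while city not in visited:
        visited.append(city)
        city = successor[city]
    return city == start and len(visited) == len(successor)


# Case 1: A -> B -> C -> D -> E -> A
case1 = {"A": "B", "B": "C", "C": "D", "D": "E", "E": "A"}
# Case 2: (D, C) chosen instead of (D, E); E -> A is assumed for illustration
case2 = {"A": "B", "B": "C", "C": "D", "D": "C", "E": "A"}

leg_costs = [2000, 3000, 4000, 5000, 1000]  # costs of the five legs in Case 1

print(is_complete_tour(case1, "A"), sum(leg_costs))
print(is_complete_tour(case2, "A"))
```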
Output screen:
From the output screen, the objective is to minimize cost = Rs. 67.00
Output screen:
From the output obtained, the objective function value is Rs.15.00. The assignment schedule is given in the Table 7.46 below.
Table 7.46: Assignment Schedule
                  District
Salesman  A    B    C    D    E
1         32   38   40   28   40
2         40   24   28   21   36
3         41   27   33   30   37
4         22   38   41   36   36
5         29   33   40   35   39
Taking the highest value in the given maximization matrix, i.e., 41 and subtracting all other values, we get the following input matrix:
                  District
Salesman  A    B    C    D    E
1         9    3    1    13   1
2         1    17   13   20   5
3         0    14   8    11   4
4         19   3    0    5    5
5         12   8    1    6    2
Input screen:
The output given by TORA is the assignment schedule with the objective of minimization. The given problem is to maximize the sales. To arrive at the maximum sales value, add the assigned values from the given matrix, as shown in Table 7.48.
Table 7.48: Assignment Schedule
1. How can an assignment problem be solved using the transportation approach?
2. Describe the approach you would use to solve an assignment problem, with the help of an illustration.

Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
7.17 KEYWORDS
Balanced Assignment Problem
Unbalanced Assignment Problem
Hungarian Method
Restricted Assignment Problem
Dummy job
Opportunity cost
Basic objective of an AP is to assign n-number of resources to a number of activities.
Application of AP is an allocation of machine for optimum utilization of space.
Hungarian method could also be applicable to transportation model.
Assignment problem does not consider the allocation of a number of jobs to a number of persons.
An optimal assignment is found, if the number of assigned cells equals the number of rows (columns).
Assignment problems are of the types balanced and unbalanced.
Cost or time values for the dummy cells are assumed to be zero.
Maximization problem objective is to maximize profit.
Assignment model can be applied in many ______________.
If the given matrix is not a _____________ matrix, the AP is called an __________ problem.
Transportation model is used for _____________ values.
A dummy job is an ______________ job.
What is meant by matrix reduction?
Describe the approach of the Hungarian method.
10. Discuss how assignment problems are solved using transportation model.
Exercise Problems
1. Consider the assignment problem having the following cost table:
a. Draw the network representation of the problem.
b. Solve the problem and determine the optimal assignment for each man.
2. Consider the assignment problem having the following table. Use TORA to find the optimal solution that minimizes the total cost:

                  Job
Operator  1    2    3    4    5
A         12   14   16   11   10
B         9    13   17   9    7
C         10   12   20   7    8
D         13   10   21   6    12
E         15   9    15   11   13
3. Four trucks are used for transporting goods to four locations. Because of varying costs of loading and unloading the goods, the cost of transportation also varies for each truck. The cost details (in Rs.) are given in the table below. There is no constraint, and any truck can be sent to any location. The objective is to assign the four trucks so as to minimize the total transportation cost. Formulate and solve the problem using TORA.

                  Location
Truck     1     2     3     4
A         525   825   320   200
B         600   750   250   175
C         500   900   270   150
D         620   800   300   160
4. A two-wheeler service station head has four workmen and four tasks to be performed daily as routine work. Before assigning the work, the service station head carried out a test by giving each work to all the workmen. The time taken by the workmen is given in the table below.

          Time Taken (in mins)
                 Workman
Work      1    2    3    4
A         20   28   19   13
B         15   30   16   23
C         40   17   20   13
D         17   28   22   8
How should the service station head assign the work to each workman so as to minimize the total time?

5. Consider an unbalanced assignment problem having the following cost table:

                  Task
Operator  1    2    3    4
A         12   14   15   16
B         10   11   13   21
C         8    9    17   23
6.
a. Draw the network representation of the assignment problem.
b. Formulate a linear programming model for the assignment problem.

7. Five operators have to be assigned to five machines. Depending on the efficiency and skill, the time taken by the operators differs. Operator B cannot operate machine 4 and operator D cannot operate machine 2. The time taken is given in the following table.

                  Machine
Operator  1    2    3    4    5
A         6    6    3    5    5
B         6    7    2    --   3
C         5    6    4    6    4
D         7    --   7    6    7
E         5    4    3    6    5
Determine the optimal assignment using TORA.

8. A consumer durables manufacturing company has plans to increase its product line, namely, washing machine, refrigerator, television and music system. The company is setting up new plants and considering four locations. The demand forecast per month for washing machine, refrigerator, television and music system are 1000, 750, 850 and 1200, respectively. The company decides to produce the forecasted demand. The fixed and variable cost per unit for each location and item is given in the following table. The management has decided not to set up more than one unit in one location.

              Fixed cost (lakhs)       Variable cost / unit
Location      WM    RF    TV    MS     WM    RF    TV    MS
Chennai       30    35    18    16     4     3     6     2
Coimbatore    25    40    16    12     3     2     4     4
Madurai       35    32    15    10     4     2     7     6
Selam         20    25    14    12     2     1     3     7
Determine the location and product combinations so that the total cost is minimized.
9. Solve the following travelling salesman problem so as to minimize the cost of travel.
10. Solve the travelling salesman problem for the given matrix cell values which represent the distances between cities. c12 = 31, c21 = 9, c34 = 9, c13 = 10, c23 = 12, c41 = 18, c14 = 15, c31 = 10, c42 = 25.
There is no route between cities i and j if value for cij is not given.
ANSWERS
(b) True (c) (b) (d)
TO
False imaginary
QUESTIONS
(d) False
FOR
(e) True
decision-making
square, unbalanced
Unit-III
LESSON 8
NETWORK MODEL
CONTENTS
8.0 Aims and Objectives
8.1 Introduction
8.2 PERT / CPM Network Components
8.3 Errors to be avoided in Constructing a Network
8.4 Rules in Constructing a Network
8.5 Procedure for Numbering the Events Using Fulkerson's Rule
8.6 Critical Path Analysis
8.7 Determination of Float and Slack Times
8.8 Solving CPM Problems using Computer
8.9 Project Evaluation Review Technique, PERT
8.10 Solving PERT Problems using Computer
8.11 Cost Analysis
8.12 Let us Sum Up
8.13 Lesson-end Activity
8.14 Keywords
8.15 Questions for Discussion
8.16 Terminal Questions
8.17 Model Answers to Questions for Discussion
8.18 Suggested Readings
8.1 INTRODUCTION
Any project involves planning, scheduling and controlling a number of interrelated activities with the use of limited resources, namely, men, machines, materials, money and time. The projects may be extremely large and complex, such as construction of a power plant, a highway, a shopping complex, ships and aircraft, introduction of new products and research and development projects. Managers must have a dynamic planning and scheduling system to produce the best possible results and also to react immediately to changing conditions and make the necessary changes in the plan and schedule. The convenient analytical and visual techniques of PERT and CPM prove extremely valuable in assisting managers in managing projects.
Both the techniques use similar terminology and have the same purpose. PERT stands for Program Evaluation and Review Technique (also expanded as Project Evaluation and Review Technique), developed during the 1950s. The technique was developed and used in conjunction with the planning and designing of the Polaris missile project. CPM stands for Critical Path Method, which was developed by DuPont Company and applied first to construction projects in the chemical industry. Both PERT and CPM are used for the analysis of project scheduling problems and are similar in terms of concepts. The basic difference lies in the time estimates: CPM has a single time estimate for each activity, whereas PERT has three time estimates for each activity and uses probability theory to find the chance of reaching the scheduled time.

Project management generally consists of three phases.

Planning: Planning involves setting the objectives of the project, identifying the various activities to be performed and determining the requirement of resources such as men, materials, machines, etc. The cost and time for all the activities are estimated, and a network diagram is developed showing the sequential interrelationships (predecessor and successor) between the various activities during the planning stage.

Scheduling: Based on the time estimates, the start and finish times for each activity are worked out by applying forward and backward pass techniques, and the critical path is identified, along with the slack and float for the non-critical paths.

Controlling: Controlling refers to analyzing and evaluating the actual progress against the plan. Reallocation of resources, crashing and review of projects with periodical reports are carried out.
Activity: An activity represents an action and consumption of resources (time, money, energy) required to complete a portion of a project. Activity is represented by an arrow, (Figure 8.1).
Figure 8.1: An Activity (A is called an activity; it is drawn as an arrow from event i to event j)
Event: An event (or node) will always occur at the beginning and end of an activity. The event has no resources and is represented by a circle. The ith event and jth event are the tail event and head event respectively, (Figure 8.2).
Figure 8.2: An Event (i is the tail event and j is the head event of activity A)
One or more activities can start and end simultaneously at an event (Figure 8.3 a, b).
Figure 8.3 (a), (b): Activities Starting and Ending at an Event
Dummy Activity
An imaginary activity which does not consume any resource and time is called a dummy activity. Dummy activities are simply used to represent a connection between events in order to maintain a logic in the network. It is represented by a dotted line in a network, see Figure 8.5.
Figure 8.5: Dummy Activity
Figure 8.6: Correct and Incorrect Activities
b. Loops should not be formed in a network, as a loop represents performance of activities repeatedly in a cyclic manner, as shown in Figure 8.7.
c. In a network, there should be only one start event and one ending event, as shown in Figure 8.8.
d. The direction of arrows should flow from left to right, avoiding mixing of directions, as shown in Figure 8.9.
5. Some conventions of network diagrams are shown in Figure 8.10 (a), (b), (c), (d) below:
(a)
B C
Activity B can be performed only after completing activity A, and activity C can be performed only after completing activity B.
(b)
(c)
(d)
Activity C must start only after completing activities A and B. But activity D can start after completion of activity B.
Figure 8.10 (a), (b), (c), (d): Some Conventions followed in making Network Diagrams
Step3:
Example 1: Draw a network for a house construction project. The sequence of activities with their predecessors are given in Table 8.1, below.
Table 8.1: Sequence of Activities for House Construction Project
Name of the   Starting and       Description of activity       Predecessor   Time duration
activity      finishing event                                                (days)
A             (1,2)              Prepare the house plan        --            4
B             (2,3)              Construct the house           A             58
C             (3,4)              Fix the door / windows        B             2
D             (3,5)              Wiring the house              B             2
E             (4,6)              Paint the house               C             1
F             (5,6)              Polish the doors / windows    D             1
Solution:
Figure 8.11: Network diagram representing house construction project.
The network diagram in Figure 8.11 shows the precedence relationship between the activities. Activity A (preparation of house plan) has a start event 1 as well as an ending event 2. Activity B (construction of house) begins at event 2 and ends at event 3. Activity B cannot start until activity A has been completed. Activities C and D cannot begin until activity B has been completed, but they can be performed simultaneously. Similarly, activities E and F can start only after completion of activities C and D respectively. Both activities E and F finish at event 6.

Example 2: Consider the project given in Table 8.2 and construct a network diagram.
Table 8.2: Sequence of Activities for Building Construction Project
Activity   Description                      Predecessor
A          Purchase of Land                 --
B          Preparation of building plan     --
C          Level or clean the land          A
D          Register and get approval        A, B
E          Construct the building           C
F          Paint the building               D
Solution: The activities C and D have a common predecessor A. The network representations shown in Figure 8.12 (a), (b) violate the rule that no two activities can begin and end at the same events. It appears as if activity B is a predecessor of activity C, which is not the case. To construct the network in a logical order, it is necessary to introduce a dummy activity, as shown in Figure 8.13.

Figure 8.12 (a), (b): Network Representations without a Dummy Activity
Figure 8.13: Correct representation of Network using Dummy Activity
Example 3: Construct a network for a project whose activities and their predecessor relationship are given in Table 8.3.
Table 8.3: Activity Sequence for a Project
Solution: The network diagram for the given problem is shown in Figure 8.14 with activities A, B and C starting simultaneously.
Solution: An activity network diagram describing the project is shown in Figure 8.15, below:
Figure 8.15: Activity Network Diagram
Step 4:
Backward Pass Computations (to calculate Latest Time TL)

Procedure

Step 1: Begin from the end event and move towards the start event. Assume that the direction of arrows is reversed.
Step 2: The latest time TL for the last event is the earliest time TE of the last event.
Step 3: Go to the next event. If there is an incoming activity, subtract the activity duration time from the TL value of the previous event. The value arrived at is the TL for that event. If there is more than one incoming activity, take the minimum TL value.
Step 4: Repeat the same procedure from Step 2 till the start event.

Check Your Progress 8.1

1. What are the differences between critical and non-critical activities?
2. Discuss the procedural steps of the Hungarian method for solving an assignment problem.

Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Total Float TFij: The total float of an activity is the difference between the latest start time and the earliest start time of that activity.

$$TF_{ij} = LS_{ij} - ES_{ij} \tag{1}$$

or

$$TF_{ij} = (TL_j - TE_i) - t_{ij} \tag{2}$$

Free Float FFij: The time by which the completion of an activity can be delayed from its earliest finish time without affecting the earliest start time of the succeeding activity is called free float.

$$FF_{ij} = (E_j - E_i) - t_{ij}$$

$$FF_{ij} = \text{Total float} - \text{Head event slack}$$

where head event slack $= L_j - E_j$, and E and L denote the earliest and latest event times (TE and TL).

Independent Float IFij: The amount of time by which the start of an activity can be delayed without affecting the earliest start time of any immediately following activities, assuming that the preceding activity has finished at its latest finish time.

$$IF_{ij} = (E_j - L_i) - t_{ij}$$

$$IF_{ij} = \text{Free float} - \text{Tail event slack}$$

where tail event slack $= L_i - E_i$. A negative value of independent float is considered to be zero.

Critical Path: After determining the earliest and the latest scheduled times for various activities, the minimum time required to complete the project is calculated. In a network, among the various paths, the longest path, which determines the total time duration of the project, is called the critical path. The following conditions must be satisfied in locating the critical path of a network. An activity is said to be critical only if both the conditions are satisfied.

$$1.\quad TL_i - TE_i = 0 \tag{3}$$

$$2.\quad TL_j - t_{ij} - TE_i = 0 \tag{4}$$
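The three floats can be computed directly from the event times of an activity. The following is a minimal sketch of the formulas above; the function name floats and the sample event times are illustrative assumptions, not values taken from a particular table.

```python
def floats(te_i, tl_i, te_j, tl_j, t_ij):
    """Return (total, free, independent) float of activity i-j.

    te_i, tl_i: earliest and latest times of the tail event i
    te_j, tl_j: earliest and latest times of the head event j
    t_ij: duration of the activity
    """
    total = (tl_j - te_i) - t_ij
    free = (te_j - te_i) - t_ij
    # A negative independent float is taken as zero
    independent = max(0, (te_j - tl_i) - t_ij)
    return total, free, independent


# Illustrative activity: tail event (TE = 4, TL = 9), head event (TE = 5, TL = 10),
# duration 1 day
print(floats(4, 9, 5, 10, 1))
```

For this activity the head event slack is 10 - 5 = 5, so the free float equals the total float minus 5, which agrees with the second form of the free float formula.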
Example 8.5: A project schedule has the following characteristics as shown in Table 8.5
Table 8.5: Project Schedule
Activity   Name   Time (days)
1-2        A      4
1-3        B      1
2-4        C      1
3-4        D      1
3-5        E      6
4-9        F      5
5-6        G      4
5-7        H      8
6-8        I      1
7-8        J      2
8-10       K      5
9-10       L      7
Construct the PERT network. Compute TE and TL for each activity. Find the critical path.

Solution:

(i) From the data given in the problem, the activity network is constructed as shown in Figure 8.16.

(ii) To determine the critical path, compute the earliest time TE and the latest time TL for each of the activities of the project. The calculations of TE and TL are as follows:

To calculate TE for all activities,

$$\begin{aligned}
TE_1 &= 0 \\
TE_2 &= TE_1 + t_{1,2} = 0 + 4 = 4 \\
TE_3 &= TE_1 + t_{1,3} = 0 + 1 = 1 \\
TE_4 &= \max(TE_2 + t_{2,4},\ TE_3 + t_{3,4}) = \max(4 + 1,\ 1 + 1) = \max(5, 2) = 5 \text{ days} \\
TE_5 &= TE_3 + t_{3,5} = 1 + 6 = 7 \\
TE_6 &= TE_5 + t_{5,6} = 7 + 4 = 11 \\
TE_7 &= TE_5 + t_{5,7} = 7 + 8 = 15 \\
TE_8 &= \max(TE_6 + t_{6,8},\ TE_7 + t_{7,8}) = \max(11 + 1,\ 15 + 2) = \max(12, 17) = 17 \text{ days} \\
TE_9 &= TE_4 + t_{4,9} = 5 + 5 = 10 \\
TE_{10} &= \max(TE_9 + t_{9,10},\ TE_8 + t_{8,10}) = \max(10 + 7,\ 17 + 5) = \max(17, 22) = 22 \text{ days}
\end{aligned}$$

To calculate TL for all activities,

$$\begin{aligned}
TL_{10} &= TE_{10} = 22 \\
TL_9 &= TL_{10} - t_{9,10} = 22 - 7 = 15 \\
TL_8 &= TL_{10} - t_{8,10} = 22 - 5 = 17 \\
TL_7 &= TL_8 - t_{7,8} = 17 - 2 = 15 \\
TL_6 &= TL_8 - t_{6,8} = 17 - 1 = 16 \\
TL_5 &= \min(TL_6 - t_{5,6},\ TL_7 - t_{5,7}) = \min(16 - 4,\ 15 - 8) = \min(12, 7) = 7 \text{ days} \\
TL_4 &= TL_9 - t_{4,9} = 15 - 5 = 10 \\
TL_3 &= \min(TL_4 - t_{3,4},\ TL_5 - t_{3,5}) = \min(10 - 1,\ 7 - 6) = \min(9, 1) = 1 \text{ day} \\
TL_2 &= TL_4 - t_{2,4} = 10 - 1 = 9 \\
TL_1 &= \min(TL_2 - t_{1,2},\ TL_3 - t_{1,3}) = \min(9 - 4,\ 1 - 1) = 0
\end{aligned}$$
Table 8.6: Various Activities and their Floats
Activity   Activity   Normal   Earliest Time      Latest Time        Total
           Name       Time     Start    Finish    Start    Finish    Float
1-2        A          4        0        4         5        9         5
1-3        B          1        0        1         0        1         0
2-4        C          1        4        5         9        10        5
3-4        D          1        1        2         9        10        8
3-5        E          6        1        7         1        7         0
4-9        F          5        5        10        10       15        5
5-6        G          4        7        11        12       16        5
5-7        H          8        7        15        7        15        0
6-8        I          1        11       12        16       17        5
7-8        J          2        15       17        15       17        0
8-10       K          5        17       22        17       22        0
9-10       L          7        10       17        15       22        5
(iii) From Table 8.6, we observe that the activities 1-3, 3-5, 5-7, 7-8 and 8-10 are critical activities, as their floats are zero.
The critical path is 1-3-5-7-8-10 (shown in double line in Figure 8.17) with the project duration of 22 days.
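The forward pass, backward pass and total float calculations for this example can be reproduced with a short program. This is a sketch under one stated assumption: every activity runs from a lower-numbered event to a higher-numbered event (as Fulkerson's rule ensures), so processing the activities in sorted order is enough. The variable names are chosen here for illustration.

```python
# (tail event, head event): duration in days, from Table 8.5
activities = {
    (1, 2): 4, (1, 3): 1, (2, 4): 1, (3, 4): 1, (3, 5): 6, (4, 9): 5,
    (5, 6): 4, (5, 7): 8, (6, 8): 1, (7, 8): 2, (8, 10): 5, (9, 10): 7,
}
events = sorted({event for arc in activities for event in arc})
last_event = events[-1]

# Forward pass: TE of an event is the largest (TE of tail + duration)
te = {event: 0 for event in events}
for (i, j), t in sorted(activities.items()):
    te[j] = max(te[j], te[i] + t)

# Backward pass: TL of an event is the smallest (TL of head - duration)
tl = {event: te[last_event] for event in events}
for (i, j), t in sorted(activities.items(), reverse=True):
    tl[i] = min(tl[i], tl[j] - t)

# Total float of activity i-j and the critical activities (zero float)
total_float = {(i, j): tl[j] - te[i] - t for (i, j), t in activities.items()}
critical = sorted(arc for arc, f in total_float.items() if f == 0)

print("Project duration:", te[last_event])
print("Critical activities:", critical)
```

The zero-float activities returned form the path 1-3-5-7-8-10, in agreement with the hand calculation.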
Check Your Progress 8.2
What does a critical path actually signify in a project, i.e., in what ways does it differ from any other path? And in what ways are its activities particularly important?

Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.

__________________________________________________________________
Figure 8.18: Solving Network Problem on Computer Using TORA (Input Screen)
Now select SOLVE MENU and GO TO OUTPUT SCREEN. There are two options for output; select CPM calculations. For step-by-step calculation of the earliest time and latest time using the forward pass and backward pass procedure, click the NEXT STEP button. To get all the values instantly, press the ALL STEPS button. The screen gives all the required values to analyze the problem. You may note that at the bottom of the table, the critical activities are highlighted in red colour. The output screen is shown in Figure 8.19, below:
Figure 8.19: Solving Network Problem on Computer Using TORA (Output Screen)
Example 5: The following Table 8.7 gives the activities in construction project and time duration.
a. Draw the activity network of the project.
b. Find the total float and free float for each activity.

Solution:

a. From the activity relationship given, the activity network is shown in Figure 8.20.
b. The total and free floats for each activity are calculated as shown in Table 8.8.
Table 8.8: Calculation of Total and Free Floats
Activity
Float Free 0 5 0 3 0 0
Example 6: Draw the network for the following project given in Table 8.9.
Table 8.9: Project Schedule
Activity   Duration (weeks)
a          10
b          9
c          7
d          6
e          12
f          6
g          8
h          8
i          4
j          11
k          5
l          7
Number the events by Fulkerson's rule and find the critical path. Also find the time for completing the project.

Solution: The network is drawn as shown in Figure 8.21 using the data provided. The events are numbered using Fulkerson's rule, and the earliest time, latest time and total float are computed for each activity to find the critical path, as given in Table 8.10.

Table 8.10: TE, TL and TFij Calculated

Activity   Duration   Earliest Time      Latest Time        Total
           (weeks)    Start    Finish    Start    Finish    Float
a          10         0        10        0        10        0
b          9          10       19        16       25        6
c          7          10       17        10       17        0
d          6          19       25        25       31        6
e          12         19       31        25       37        6
f          6          17       23        17       23        0
g          8          17       25        23       31        6
h          8          23       31        23       31        0
i          4          25       29        31       35        6
j          11         31       42        31       42        0
k          5          31       36        37       42        6
l          7          29       36        35       42        6
Figure 8.21: Network Diagram with Earliest and Latest Event Times

The critical path is a-c-f-h-j and the minimum time for the completion of the project is 42 weeks.
probabilistic method using three time estimates for an activity, rather than a single estimate, as shown in Figure 8.22.
Figure 8.22: Beta Curve

Optimistic time to: It is the shortest time taken to complete the activity. It means that if everything goes well, there is a chance of completing the activity within this time.

Most likely time tm: It is the normal time taken to complete an activity, if the activity were frequently repeated under the same conditions.

Pessimistic time tp: It is the longest time that an activity would take to complete. It is the worst time estimate that an activity would take if unexpected problems are faced.

Taking all these time estimates into consideration, the expected time of an activity is arrived at. The average or mean (ta) value of the activity duration is given by,

$$t_a = \frac{t_o + 4t_m + t_p}{6} \tag{5}$$

The variance of the activity time is given by,

$$\sigma^2 = \left(\frac{t_p - t_o}{6}\right)^2 \tag{6}$$

The standard normal variate for a scheduled time Ts, where Te is the expected project duration along the critical path, is given by,

$$Z_0 = \frac{T_s - T_e}{\sqrt{\sum \sigma^2 \text{ in critical path}}} \tag{7}$$

Probability of completing the project within the scheduled time is,

$$P(T < T_s) = P(Z < Z_0) \quad \text{(from normal tables)} \tag{8}$$
Example 8: An R & D project has a list of tasks to be performed whose time estimates are given in the Table 8.11, as follows. Time expected for each activity is calculated using the formula (5):
$$t_a = \frac{t_o + 4t_m + t_p}{6} = 6 \text{ days for activity A}$$

Similarly, the expected time is calculated for all the activities. The variance of activity time is calculated using the formula (6).

$$\sigma^2 = \left(\frac{t_p - t_o}{6}\right)^2$$
Similarly, variances of all the activities are calculated. Construct a network diagram and calculate the time earliest, TE and time Latest TL for all the activities.
a. Draw the project network.
b. Find the critical path.
c. Find the probability that the project is completed in 19 days. If the probability is less than 20%, find the probability of completing it in 24 days.
Solution: Calculate the time average ta and variances of each activity as shown in Table 8.12.
Table 8.12: Ta and σ² Calculated

Activity   To    Tm    Tp    Ta    σ²
1-2        4     6     8     6     0.444
1-3        2     3     10    4     1.777
1-4        6     8     16    9     2.777
2-4        1     2     3     2     0.111
3-4        6     7     8     7     0.111
3-5        6     7     14    8     1.777
4-6        3     5     7     5     0.444
4-7        4     11    12    10    1.777
5-7        2     4     6     4     0.444
6-7        2     9     10    8     1.777
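The Ta and σ² columns follow from the three time estimates by the standard PERT formulas, ta = (to + 4tm + tp) / 6 and σ² = ((tp - to) / 6)². A minimal sketch that recomputes them is given below; the function names expected_time and variance are illustrative. Note that the table truncates the variances to three decimals (for example 1.777 for 1.7777...).

```python
# activity: (optimistic, most likely, pessimistic) times, from Table 8.12
estimates = {
    "1-2": (4, 6, 8), "1-3": (2, 3, 10), "1-4": (6, 8, 16),
    "2-4": (1, 2, 3), "3-4": (6, 7, 8), "3-5": (6, 7, 14),
    "4-6": (3, 5, 7), "4-7": (4, 11, 12), "5-7": (2, 4, 6),
    "6-7": (2, 9, 10),
}


def expected_time(to, tm, tp):
    """Mean activity time: ta = (to + 4*tm + tp) / 6."""
    return (to + 4 * tm + tp) / 6


def variance(to, tp):
    """Variance of activity time: ((tp - to) / 6) squared."""
    return ((tp - to) / 6) ** 2


for name, (to, tm, tp) in estimates.items():
    print(name, expected_time(to, tm, tp), round(variance(to, tp), 3))
```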
From the network diagram Figure 8.24, the critical path is identified as 1-4, 4-6, 6-7, with a project duration of 22 days. The probability of completing the project within 19 days is given by P (Z < Z0). To find Z0,

$$Z_0 = \frac{T_s - T_e}{\sqrt{\sum \sigma^2 \text{ in critical path}}} = \frac{19 - 22}{\sqrt{2.777 + 0.444 + 1.777}} = \frac{-3}{\sqrt{5}} = -1.3416$$

We know,

$$P(Z < Z_0) = 0.5 - \psi(1.3416) = 0.5 - 0.4099 = 0.0901 = 9.01\%$$

(from normal tables, ψ(1.3416) = 0.4099)

Thus, the probability of completing the R & D project in 19 days is 9.01%. Since the probability of completing the project in 19 days is less than 20%, we find the probability of completing it in 24 days.

$$Z_0 = \frac{24 - 22}{\sqrt{5}} = \frac{2}{\sqrt{5}} = 0.8944$$

$$P(Z < Z_0) = 0.5 + \psi(0.8944) = 0.5 + 0.3133 = 0.8133 = 81.33\%$$

(from normal tables, ψ(0.8944) = 0.3133)
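The two probabilities can also be obtained without normal tables by evaluating the normal distribution function numerically. The sketch below assumes the project duration of 22 days and the three critical-path variances quoted in the solution; the function names are illustrative. Because the function is evaluated exactly rather than read from a table at a rounded Z value, the results differ slightly in the third decimal place from the hand calculation.

```python
from math import erf, sqrt


def normal_cdf(z):
    """P(Z < z) for a standard normal variable."""
    return 0.5 * (1 + erf(z / sqrt(2)))


def completion_probability(ts, te, variances):
    """Return (Z0, probability) of finishing within the scheduled time ts."""
    z0 = (ts - te) / sqrt(sum(variances))
    return z0, normal_cdf(z0)


critical_variances = [2.777, 0.444, 1.777]  # activities 1-4, 4-6, 6-7
expected_duration = 22                      # days, as stated in the solution

z19, p19 = completion_probability(19, expected_duration, critical_variances)
z24, p24 = completion_probability(24, expected_duration, critical_variances)
print(round(z19, 4), round(p19, 4))
print(round(z24, 4), round(p24, 4))
```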
Figure 8.24: Solving PERT Problem Using Computer with TORA (Input Screen)
Now, go to solve menu and click. In the output screen, select Activity mean / Variance option in select output option. The following screen appears as shown in Figure 8.25.
On selecting the PERT calculations option, the following screen appears. This shows the average duration and standard deviation for the activities.
Figure 8.26: TORA (Output Screen) Showing Average Durations and Standard Deviation for Activities
activities. But if the construction has to be finished earlier, it requires additional cost to complete the project. We need to arrive at a time / cost trade-off between the total cost of the project and the total time required to complete it.

Normal time: Normal time is the time required to complete the activity under normal conditions and at normal cost.

Crash time: Crash time is the shortest possible activity time; crashing an activity below its normal time will increase the direct cost.
Cost Slope
Cost slope is the increase in cost per unit of time saved by crashing. A linear cost curve is shown in Figure 8.27.
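Figure 8.27: Linear Cost Curve

$$\text{Cost slope} = \frac{C_c - N_c}{N_t - C_t} \tag{9}$$

where Cc is the crash cost, Nc is the normal cost, Nt is the normal time and Ct is the crash time.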
Example 8: An activity takes 4 days to complete at a normal cost of Rs. 500.00. If it is possible to complete the activity in 2 days at a crash cost of Rs. 700.00, what is the incremental cost of the activity?

Solution:

$$\text{Incremental cost or cost slope} = \frac{C_c - N_c}{N_t - C_t} = \frac{700 - 500}{4 - 2} = \text{Rs. } 100.00$$

It means that for every day by which the activity is shortened, we have to spend an extra Rs. 100/-.
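The cost slope calculation is a one-line formula and is easy to put into a small helper. The function name cost_slope below is an illustrative choice, not part of the lesson.

```python
def cost_slope(normal_time, normal_cost, crash_time, crash_cost):
    """Extra cost per unit of time saved by crashing an activity."""
    return (crash_cost - normal_cost) / (normal_time - crash_time)


# Example 8: normal 4 days at Rs. 500, crash 2 days at Rs. 700
print(cost_slope(4, 500, 2, 700))
```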
Project Crashing
Procedure for crashing
Step 1: Draw the network diagram and mark the normal time and crash time.
Step 2: Calculate TE and TL for all the activities.
Step 3: Find the critical path and other paths.
Step 4: Find the slope for all activities and rank them in ascending order.
Step 5: Establish a tabular column with the required fields.
Step 6: Select the lowest ranked activity; check whether it is a critical activity. If so, crash the activity, else go to the next highest ranked activity. Note: The critical path must remain critical while crashing.
Step 7: Calculate the total cost of the project for each crashing.
Step 8: Repeat Step 6 until all the activities in the critical path are fully crashed.
Example 9: The following Table 8.13 gives the activities of a construction project and other data.
Table 8.13: Construction Project Data
Activity | Normal Time (days) | Normal Cost (Rs) | Crash Time (days) | Crash Cost (Rs)
1-2 | 6 | 50 | 4 | 80
1-3 | 5 | 80 | 3 | 150
2-4 | 5 | 60 | 2 | 90
2-5 | 8 | 100 | 6 | 300
3-4 | 5 | 140 | 2 | 200
4-5 | 2 | 60 | 1 | 80
If the indirect cost is Rs. 20 per day, crash the activities to find the minimum duration of the project and the project cost associated. Solution: From the data provided in the table, draw the network diagram (Figure 8.28) and find the critical path.
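[Figure 8.28: Network diagram for the construction project. Event times (TE, TL): event 1 (0, 0), event 2 (6, 6), event 3 (5, 7), event 4 (11, 12), event 5 (14, 14)]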
From the diagram, we observe that the critical path is 1-2-5 with a project duration of 14 days. The cost slope for all activities and their rank is calculated as shown in Table 8.14. For example, for activity 1-2,

$$\text{Cost slope} = \frac{80 - 50}{6 - 4} = \frac{30}{2} = 15$$
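As a rough check of the critical path and the cost slopes, the data of Table 8.13 can be put into a small Python sketch. The names `ACTIVITIES`, `PATHS`, `path_duration` and `cost_slope` are illustrative assumptions; the three paths are simply the routes from event 1 to event 5 in this network.

```python
# Table 8.13: activity -> (normal time, normal cost, crash time, crash cost)
ACTIVITIES = {
    "1-2": (6, 50, 4, 80),
    "1-3": (5, 80, 3, 150),
    "2-4": (5, 60, 2, 90),
    "2-5": (8, 100, 6, 300),
    "3-4": (5, 140, 2, 200),
    "4-5": (2, 60, 1, 80),
}

# All routes from event 1 to event 5
PATHS = [("1-2", "2-5"), ("1-2", "2-4", "4-5"), ("1-3", "3-4", "4-5")]


def path_duration(path):
    """Sum of normal times along a path."""
    return sum(ACTIVITIES[activity][0] for activity in path)


def cost_slope(activity):
    """Cost increase per day saved for one activity."""
    normal_time, normal_cost, crash_time, crash_cost = ACTIVITIES[activity]
    return (crash_cost - normal_cost) / (normal_time - crash_time)


durations = {path: path_duration(path) for path in PATHS}
critical_path = max(durations, key=durations.get)
ranked = sorted(ACTIVITIES, key=cost_slope)  # ascending cost slope

print(durations)
print("critical path:", critical_path)
print([(activity, cost_slope(activity)) for activity in ranked])
```

The longest path is the critical path, and the ranking lists the cheapest activity to crash first, which is the order used in Step 4 of the procedure.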
The available paths of the network are listed down in Table 8.15 indicating the sequence of crashing (see Figure 8.29).
Table 8.15: Sequence of Crashing
Path | Normal duration (days) | Duration after 1st crashing | Duration after 2nd crashing | Duration after 3rd crashing
1-2-5 | 14 | 12 | 11 | 10
1-2-4-5 | 13 | 11 | 11 | 10
1-3-4-5 | 12 | 12 | 11 | 10

Figure 8.29: Network Diagram Indicating Sequence of Crashing
The sequence of crashing and the total cost involved is given in Table 8.16. Initial direct cost = sum of all normal costs given = Rs. 490.00
Table 8.16: Sequence of Crashing & Total Cost
Activity Crashed | Project Duration (days) | Critical Path | Direct Cost (Rs.) | Indirect Cost (Rs.) | Total Cost (Rs.)
- | 14 | 1-2-5 | 490 | 14 × 20 = 280 | 770
1-2 (2) | 12 | 1-2-5 | 490 + (2 × 15) = 520 | 12 × 20 = 240 | 760
2-5 (1), 3-4 (1) | 11 | | 640 | 11 × 20 = 220 | 860
 | 10 | | 770 | 10 × 20 = 200 | 970
It is not possible to crash the project below 10 days, as all the activities in the critical path are fully crashed. Hence the minimum project duration is 10 days with a total cost of Rs. 970.00.
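The cost figures in Table 8.16 can be retraced with a short Python sketch. Several things here are assumptions rather than material from the lesson: the cost slopes are computed from Table 8.13, and the activities crashed in the last stage (2-5, 3-4 and 2-4 by one day each) are inferred because they reproduce the stated duration of 10 days and total cost of Rs. 970.

```python
NORMAL_TIME = {"1-2": 6, "1-3": 5, "2-4": 5, "2-5": 8, "3-4": 5, "4-5": 2}
# Cost slopes (Rs. per day saved) derived from Table 8.13
COST_SLOPE = {"1-2": 15, "1-3": 35, "2-4": 10, "2-5": 100, "3-4": 20, "4-5": 20}
PATHS = [("1-2", "2-5"), ("1-2", "2-4", "4-5"), ("1-3", "3-4", "4-5")]
INDIRECT_PER_DAY = 20
INITIAL_DIRECT_COST = 490

# Days crashed per activity at each stage; the last stage is an assumption
STAGES = [
    {"1-2": 2},
    {"2-5": 1, "3-4": 1},
    {"2-5": 1, "3-4": 1, "2-4": 1},
]


def crash_schedule():
    """Return (duration, direct cost, total cost) before and after each stage."""
    time = dict(NORMAL_TIME)
    direct = INITIAL_DIRECT_COST
    rows = []
    for stage in [{}] + STAGES:
        for activity, days in stage.items():
            time[activity] -= days
            direct += days * COST_SLOPE[activity]
        duration = max(sum(time[a] for a in path) for path in PATHS)
        rows.append((duration, direct, direct + duration * INDIRECT_PER_DAY))
    return rows


for row in crash_schedule():
    print(row)
```

Under these assumptions the total cost is lowest (Rs. 760) at 12 days, while the shortest duration of 10 days costs Rs. 970.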
Check Your Progress 8.3
If an activity has zero free float, does this mean that a delay in completing that activity is likely to delay the completion date of the project as a whole?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
8.14 KEYWORDS
Critical path : Is a network and a continuous chain of activities that connect the initial event to the terminal event.
Activity : An activity represents an action and consumption of resources.
PERT : Project Evaluation Review Technique is a unique and important controlling device. PERT takes into consideration three types of time: optimistic time, pessimistic time and most likely time.
CPM : Critical Path Method is a diagrammatical technique for planning and scheduling of projects.
Float : Is used in the context of network analysis. Float may be positive or negative.
Arrow : Direction shows the general progression in time.
Slack : Normally associated with events. It indicates the amount of latitude.
Project : Is a series of related activities which result in some product (or services).
Network : It is a pictorial presentation of the various events and activities covering a project.
Event : An event represents the start or completion of an activity.
10. What is project crashing? Explain the procedure for crashing of project activities.
Exercise Problems
1. You are required to prepare a network diagram for constructing a 5 floor apartment. The major activities of the project are given as follows:
Activity A B C D E F G H I Description Selection of site Preparation of drawings Arranging the for finance Selection of contractor Getting approval from Govt Laying the foundation Start construction Advertise in newspaper Allocation of tenants Immediate Predecessor A A A E D, F B, C G, H
2. For problem No. 1, the time estimates in days are given. Determine the earliest time and latest time, and the critical activities.

Activity | A | B | C | D | E | F | G | H | I
Time (days) | 3 | 5 | 7 | 2 | 5 | 20 | 60 | 2 | 10
3. An assembly has the following sequence of activities, given along with their predecessors in the table below. Draw a network diagram for the assembly.

Activity | Description | Predecessor
A | Pick bolt & washer | -
B | Insert washer in screw | A
C | Fix the bolt in flange | A
D | Screw the nut with bolt | B, C
E | Pick the spanner | D
F | Tighten the nut | E
G | Place the assembly apart | F
4.
Predecessor
5. Determine the critical path and project duration for the following project:
Activity A B C D E F G Immediate Predecessor A B C,D A E,F Time (days) 3 7 4 2 5 6 3
6. A national conference is planned in a college. The activities are listed along with their predecessors and time taken. Prepare a network diagram and determine the critical activities.
Description Confirm lead speaker and topic Prepare brochure Send letters to other speakers Get confirmation from speakers Send letters to participants Obtain travel plans from speakers Arrange for accommodation for speakers Get handouts from speakers Finalize registrations Arrange hall and AV Conduct of programme Immediate Predecessor A B C D E F G H I J K B C C,D D F F G,H I J 5 1 2 5 2 2 1 4 10 1 1 Duration (days)
Activity
7.
J K L M N
I I J K L
3 4 3 5 5
a. Construct the project network diagram.
b. Compute the earliest start time and earliest finish time.
c. Find the latest start and latest finish time.
d. Find the slack for each activity.
e. Determine the critical path and project duration. Use TORA to compare and check the answer.
8. You are alone at home and have to prepare a bread sandwich for yourself. The preparation activities and time taken are given in the table below:
Task A B C D E F G H Description Purchase of bread Take cheese and apply on bread Get onions from freezer Fry onions with pepper Purchase sauce for bread Toast Bread Assemble bread and fried onions Arrange in plate Predecessor A A B,C B,C F G Time (minutes) 20 5 1 6 15 4 2 1
a. Determine the critical activities and preparation time for the tasks given in the table.
b. Find the earliest time and latest time for all activities.
c. While purchasing sauce, you met a friend and spoke to him for 6 minutes. Did this cause any delay in preparation?
9. An amusement park is planned at a suitable location. The various activities are listed with time estimates. Using TORA, determine the critical path. Also, find whether the amusement park can be opened to the public within 35 days from the start of the project work.

Activity | A | B | C | D | E | F | G | H | I | J | K
Time (days) | 9 | 6 | 2 | 7 | 10 | 3 | 6 | 1 | 7 | 2 | 5
10. Draw the network from the following activity and find the critical path and total duration of project.
Activity | 1-2 | 1-3 | 1-4 | 2-3 | 2-5 | 3-5 | 4-5
Duration (days) | 5 | 3 | 6 | 8 | 7 | 2 | 6
11.
12. Determine the critical path and project duration for the network given.
A (5) 1
2 D (0)
3 C (8) B (6)
13. For the PERT problem find the critical path and project duration. What is the probability that the project will be completed in 25 days?
Activity Predecessor Optimistic A B C D E F G H I A A C D B E,F G 2 1 0 1 3 3 1 5 3 Time Most likely 5 10 0 4 10 5 2 10 6 Pessimistic 14 12 6 7 15 7 3 15 9
14. The following table lists the jobs of a network along with their estimates.
Activity | Normal Time (weeks) | Crash Time (weeks) | Normal Cost (Rs) | Crash Cost (Rs)
1-2 | 9 | 4 | 1300 | 2400
1-3 | 15 | 13 | 1000 | 1380
2-3 | 7 | 4 | 7000 | 1540
2-4 | 7 | 3 | 1200 | 1920
2-5 | 12 | 6 | 1700 | 2240
3-6 | 12 | 11 | 600 | 700
4-5 | 6 | 2 | 1000 | 1600
5-6 | 9 | 6 | 900 | 1200

a. Draw the project network diagram.
b. Calculate the length and variance of the critical path.
c. What is the probability that the jobs on the critical path can be completed in 41 days?
15. The following table gives data at normal time and cost crashed time and project cost.
Activity | Normal Time (weeks) | Crash Time (weeks) | Normal Cost (Rs) | Crash Cost (Rs)
1-2 | 9 | 4 | 1300 | 2400
1-3 | 15 | 13 | 1000 | 1380
2-3 | 7 | 4 | 7000 | 1540
2-4 | 7 | 3 | 1200 | 1920
2-5 | 12 | 6 | 1700 | 2240
3-6 | 12 | 11 | 600 | 700
4-5 | 6 | 2 | 1000 | 1600
5-6 | 9 | 6 | 900 | 1200

Find the optimum project time and corresponding minimum total project cost by crashing appropriate activities in proper order. Show the network on a time-scale at each step. Indirect cost per day is Rs. 400.00.
16. Solve the following project, and find the optimum project time and project cost.
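Activity | t0 (weeks) | tm (weeks) | tp (weeks) | Crash time (weeks) | Normal Cost (Rs.) | Crash Cost (Rs.)
1-2 | 1 | 3 | 5 | 1 | 500 | 900
2-3 | 1 | 4 | 7 | 3 | 800 | 1400
2-4 | 1 | 3 | 5 | 2 | 400 | 600
2-5 | 5 | 8 | 11 | 7 | 500 | 600
3-6 | 2 | 4 | 6 | 2 | 300 | 500
4-6 | 5 | 6 | 7 | 4 | 200 | 360
5-7 | 4 | 5 | 6 | 4 | 1000 | 1400
6-7 | 1 | 3 | 5 | 1 | 700 | 1060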
ANSWERS
(b) (b) False (c)
TO
True
QUESTIONS
(d) (c) False unit (e)
FOR
False
single, three
(d) optimistic
LESSON
9
WAITING MODEL (QUEUING THEORY)
CONTENTS
9.0 Aims and Objectives
9.1 Introduction
9.2 Queuing Systems
9.3 Characteristics of Queuing System
9.3.1 The Arrival Pattern
9.3.2 The Service Mechanism
9.3.3 The Queue Discipline
9.3.4 The Number of Customers allowed in the System
9.3.5 The Number of Service Channels
9.3.6 Attitude of Customers
9.4 Poisson and Exponential Distribution
9.5 Symbols and Notations
9.6 Single Server Queuing Model
9.7 Solving the Problem Using Computer with TORA
9.8 Let us Sum Up
9.9 Lesson-end Activity
9.10 Keywords
9.11 Questions for Discussion
9.12 Terminal Questions
9.13 Model Answers to Questions for Discussion
9.14 Suggested Readings
9.1 INTRODUCTION
Queuing theory deals with problems that involve waiting (or queuing). Queues occur every day in our daily life. Examples of queues or long waiting lines might be
• Waiting for service in banks and at reservation counters.
• Waiting for a train or a bus.
• Waiting for checking out at the supermarket.
• Waiting at the telephone booth or a barber's saloon.
Whenever customers arrive at a service facility, some of them usually have to wait before they receive the desired service. This forms a queue or waiting line, and customers feel discomfort, either mentally or physically, because of the long wait. We infer that queues form because the service facilities are inadequate. If service facilities are to be increased, the question arises: by how much? For example, how many buses would be needed to avoid queues? How many reservation counters would be needed to reduce the queue? An increase in the number of buses and reservation counters requires additional resources. At the same time, costs due to customer dissatisfaction must also be considered. A queuing system should therefore balance service to customers (short queues) against economic considerations (not too many servers). Queuing theory explores and measures performance in a queuing situation, such as the average number of customers waiting in the queue, the average waiting time of a customer and the average server utilization.
Figure 9.2: Arrangements of Service Facilities (a, b, c)
i. Probability that an arrival is observed during a small time interval (say of length v) is proportional to the length of the interval. Let the proportionality constant be λ, so that the probability is λv.
ii. Probability of two or more arrivals in such a small interval is zero.
iii. Number of arrivals in any time interval is independent of the number in any non-overlapping time interval.
These assumptions may be combined to yield the probability distribution of the number of arrivals, which is the Poisson distribution. Suppose the function P is defined as follows: P(n, t) = the probability that n arrivals will be observed in a time interval of length t. Then,

$$P(n, t) = \frac{(\lambda t)^n e^{-\lambda t}}{n!}, \qquad n = 0, 1, 2, \ldots \qquad (1)$$

This is the Poisson probability distribution for the discrete random variable n, the number of arrivals, where the length of the time interval, t, is assumed to be given. This situation in queuing theory is called Poisson arrivals. Since the arrivals alone are considered (not departures), it is called a pure birth process. The time between successive arrivals is called the inter-arrival time. In the case where the number of arrivals in a given time interval has a Poisson distribution, inter-arrival times can be shown to have the exponential distribution. If the inter-arrival times are independent random variables, they must follow an exponential distribution with density f(t) where

$$f(t) = \lambda e^{-\lambda t}, \qquad t \ge 0 \qquad (2)$$

Thus for Poisson arrivals at the constant rate λ per unit time, the time between successive arrivals (inter-arrival time) has the exponential distribution. The average inter-arrival time is denoted by I. By integration, it can be shown that

$$E(t) = I = \frac{1}{\lambda} \qquad (3)$$
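The integration behind the mean inter-arrival time is not written out in the lesson; a short sketch, assuming the exponential density $f(t) = \lambda e^{-\lambda t}$ for $t \ge 0$ and using integration by parts, is:

```latex
\begin{aligned}
E(t) &= \int_0^\infty t \, \lambda e^{-\lambda t} \, dt \\
     &= \Big[ -t \, e^{-\lambda t} \Big]_0^\infty + \int_0^\infty e^{-\lambda t} \, dt \\
     &= 0 + \Big[ -\tfrac{1}{\lambda} e^{-\lambda t} \Big]_0^\infty \\
     &= \frac{1}{\lambda}
\end{aligned}
```

The boundary term vanishes because $t e^{-\lambda t} \to 0$ as $t \to \infty$ for any $\lambda > 0$.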
If the arrival rate λ = 30/hour, the average time between two successive arrivals is 1/30 hour or 2 minutes. For example, in the following arrival situations, the average arrival rate per hour, λ, and the average inter-arrival time in hours, I, are determined.
(i) One arrival comes every 15 minutes.
Average arrival rate, λ = 60/15 = 4 arrivals per hour.
Average inter-arrival time, I = 15 minutes = 1/4 or 0.25 hour.
(ii) Three arrivals occur every 6 minutes.
Average arrival rate, λ = 30 arrivals per hour.
Average inter-arrival time, I = 6/3 = 2 minutes = 1/30 or 0.033 hour.
(iii) Average interval between successive arrivals is 0.2 hour.
Average arrival rate, λ = 1/0.2 = 5 arrivals per hour.
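These unit conversions are easy to get wrong, so a small Python sketch may help. The helper names `arrival_rate_per_hour` and `inter_arrival_hours` are assumptions introduced here for illustration only.

```python
def arrival_rate_per_hour(arrivals, minutes):
    """Average arrival rate per hour when `arrivals` occur every `minutes`."""
    return arrivals * 60 / minutes


def inter_arrival_hours(rate_per_hour):
    """Average time between arrivals, in hours, for a given hourly rate."""
    return 1 / rate_per_hour


print(arrival_rate_per_hour(1, 15))   # one arrival every 15 minutes
print(arrival_rate_per_hour(3, 6))    # three arrivals every 6 minutes
print(inter_arrival_hours(30) * 60)   # inter-arrival time in minutes
```

The same two helpers apply to service rates and service times, since the relationship between a rate and its mean interval is identical.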
Similarly, in the following service situations, the average service rate per hour, μ, and the average service time in hours, S, are determined.
(i) Average service time, S = 10 minutes or 0.166 hour.
(ii) Number of customers served in 15 minutes is 4.
Average service rate, μ = (4 × 60)/15 = 16 services per hour.
Average service time, S = 15/4 = 3.75 mins or 0.0625 hour.
(iii) Average service time is 0.25 hour.
Average service rate, μ = 4 services per hour.
Average service time, S = 15 mins or 0.25 hour.
Example 1: In a factory, the machines break down and require service according to a Poisson distribution at the average of four per day. What is the probability that exactly six machines break down in two days?
Solution: Given λ = 4, n = 6, t = 2. We know

$$P(n, t) = \frac{(\lambda t)^n e^{-\lambda t}}{n!}$$

so that

$$P(6, 2) = \frac{(4 \times 2)^6 e^{-4 \times 2}}{6!} = \frac{8^6 e^{-8}}{720} = 0.1221$$
Solving the Problem using Computer
Example 1 is solved using computer with TORA. Enter into the TORA package and select the Queuing Analysis option. Press 'go to input screen' to enter the values. The input screen is shown in Figure 9.3 given below. The number of scenarios is 1 and the value of Lambda is λt = 4 × 2 = 8.
Press 'solve' to view the Queuing Analysis output. Select the Scenario 1 option to get the result, as shown in Figure 9.4.
In the output screen, for n = 6 the probability, Pn, is given as 0.12214.
Example 2: On an average, 6 customers arrive in a coffee shop per hour. Determine the probability that exactly 3 customers will arrive in a 30 minute period, assuming that the arrivals follow a Poisson distribution.
Solution: Given λ = 6 customers/hour, t = 30 minutes = 0.5 hour, n = 3. We know

$$P(n, t) = \frac{(\lambda t)^n e^{-\lambda t}}{n!}$$

so that

$$P(3, 0.5) = \frac{(6 \times 0.5)^3 e^{-6 \times 0.5}}{3!} = 0.22404$$
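The Poisson probabilities in Examples 1 and 2 can be checked with a few lines of Python. This is a minimal sketch using only the standard library; the function name `poisson_pmf` is an assumption made for illustration.

```python
from math import exp, factorial


def poisson_pmf(n, rate, t):
    """Probability of exactly n arrivals in time t at a constant arrival rate."""
    mean = rate * t
    return mean ** n * exp(-mean) / factorial(n)


# Example 1: 4 breakdowns per day, exactly 6 in 2 days
print(round(poisson_pmf(6, 4, 2), 5))
# Example 2: 6 customers per hour, 30-minute period
print(round(poisson_pmf(3, 6, 0.5), 5))
print(round(poisson_pmf(2, 6, 0.5), 5))
```

For Example 2 the mean number of arrivals is 3, and for that mean the probabilities of exactly 2 and exactly 3 arrivals happen to coincide, so both give 0.22404.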
Similarly, when the times taken to serve different customers are independent, the probability that no more than t periods would be required to serve a customer is given by the exponential distribution as follows:

$$p(\text{not more than } t \text{ time periods}) = 1 - e^{-\mu t}$$

where μ = average service rate.
Example 3: A manager of a fast food restaurant observes that an average of 9 customers are served by a waiter in a one-hour time period. Assuming that the service time has an exponential distribution, what is the probability that
(a) A customer shall be free within 15 minutes.
(b) A customer shall be serviced in more than 25 minutes.
Solution:
(a) Given μ = 9 customers/hour, t = 15 minutes = 0.25 hour. Therefore,

$$p(\text{less than 15 minutes}) = 1 - e^{-\mu t} = 1 - e^{-9 \times 0.25} = 0.8946$$

(b) Given μ = 9 customers/hour, t = 25 minutes = 0.4166 hour. Therefore,

$$P(\text{more than 25 minutes}) = e^{-\mu t} = e^{-9 \times 0.4166} = 0.0235$$

To analyze queuing situations, the questions of interest that are typically concerned with measures of queuing system performance include,
• What will be the waiting time for a customer before service is complete?
• What will be the average length of the queue?
• What will be the probability that the queue length exceeds a certain length?
• How can a system be designed at minimum total cost?
• How many servers should be employed?
• Should priorities of the customers be considered?
• Is there sufficient waiting area for the customers?
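Before these questions are taken up, the exponential service-time probabilities of Example 3 can be verified with a short Python sketch. The function names are assumptions made here; the calculation takes the service rate as 9 customers per hour and the time limits as 15 and 25 minutes, as used in the solution.

```python
from math import exp


def prob_service_within(t_hours, mu):
    """P(service time <= t) for exponential service at rate mu per hour."""
    return 1 - exp(-mu * t_hours)


def prob_service_longer(t_hours, mu):
    """P(service time > t) for exponential service at rate mu per hour."""
    return exp(-mu * t_hours)


print(round(prob_service_within(15 / 60, 9), 4))  # within 15 minutes
print(round(prob_service_longer(25 / 60, 9), 4))  # more than 25 minutes
```

The two functions are complements of each other, so for any t their values add up to 1.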
Pn (t) =
(iii) Arrivals are from an infinite population.
(iv) Customers are served on a First-in, First-out basis (FIFO).
(v) There is only a single server.
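$$L_s = \sum_{n=1}^{\infty} n P_n = \sum_{n=1}^{\infty} n \left(1 - \frac{\lambda}{\mu}\right) \left(\frac{\lambda}{\mu}\right)^n = \frac{\lambda}{\mu - \lambda} \qquad (2)$$

$$L_q = \sum_{n=1}^{\infty} (n - 1) P_n = \sum_{n=1}^{\infty} n P_n - \sum_{n=1}^{\infty} P_n = \frac{\lambda^2}{\mu(\mu - \lambda)} = \frac{\rho^2}{1 - \rho} \qquad (3)$$

where ρ = λ/μ.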
With an average arrival rate λ, the average time between the arrivals is 1/λ. Therefore, the mean waiting time in queue, W_q, is the product of the average time between the arrivals and the average queue length,

$$W_q = \left(\frac{1}{\lambda}\right) L_q \qquad (4)$$

Substituting for L_q,

$$W_q = \left(\frac{1}{\lambda}\right) \left(\frac{\lambda^2}{\mu(\mu - \lambda)}\right) = \frac{\lambda}{\mu(\mu - \lambda)} \qquad (5)$$

Similarly, the mean time in the system is W_s = (1/λ) L_s. Putting L_s = λ/(μ − λ), we get

$$W_s = \frac{1}{\mu - \lambda}$$
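The single-server formulas derived above can be collected into one small Python function. This is a sketch under the assumptions of this model (Poisson arrivals, exponential service, one server, λ < μ); the function name `mm1_metrics` and the dictionary keys are illustrative choices, and the rates in the usage line are arbitrary.

```python
def mm1_metrics(lam, mu):
    """Steady-state measures of a single-server queue with rates lam < mu."""
    if lam >= mu:
        raise ValueError("arrival rate must be less than service rate")
    rho = lam / mu
    return {
        "rho": rho,                           # server utilization
        "P0": 1 - rho,                        # probability the system is empty
        "Ls": lam / (mu - lam),               # mean number in system
        "Lq": lam ** 2 / (mu * (mu - lam)),   # mean number in queue
        "Ws": 1 / (mu - lam),                 # mean time in system
        "Wq": lam / (mu * (mu - lam)),        # mean waiting time in queue
    }


# Illustrative rates: 15 arrivals and 24 services per hour
for name, value in mm1_metrics(15, 24).items():
    print(name, round(value, 4))
```

Times come out in the same unit as the rates (hours here), and the results satisfy L_s = λ W_s and L_q = λ W_q, which is a quick consistency check on any hand calculation.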
Queuing Equations
The evaluation of Model I is listed below:
1. Expected number of customers in the system,

$$L_s = \frac{\lambda}{\mu - \lambda} = \frac{\rho}{1 - \rho}$$
2.
3.
4.
5.
6.
7.
, !) *1 ' + (
8.
9.
Example 4: Consider a situation where the mean arrival rate (λ) is one customer every 4 minutes and the mean service time (1/μ) is 2½ minutes. Calculate the average number of customers in the system, the average queue length, the time taken by a customer in the system and the average time a customer waits before being served.
Solution: Given,
Average arrival rate λ = 1 customer every 4 minutes or 15 customers per hour
Average service rate μ = 1 customer every 2½ minutes or 24 customers per hour
(i) The average number of customers in the system,

$$L_s = \frac{\lambda}{\mu - \lambda} = \frac{15}{24 - 15} = 1.66 \text{ customers}$$

(ii) The average queue length,

$$L_q = \left(\frac{\lambda}{\mu}\right) \left(\frac{\lambda}{\mu - \lambda}\right) = \frac{15}{24} \times \frac{15}{24 - 15} = 1.04 \text{ customers}$$

(iii) The average time a customer spends in the system,

$$W_s = \frac{1}{\mu - \lambda} = \frac{1}{24 - 15} = 0.111 \text{ hour} = 6.66 \text{ minutes}$$

(iv) The average time a customer waits before being served,

$$W_q = \frac{\lambda}{\mu(\mu - \lambda)} = \frac{15}{24(24 - 15)} = 0.0694 \text{ hour} = 4.16 \text{ minutes}$$

Example 5: Trucks at a single platform weigh-bridge arrive according to a Poisson probability distribution. The time required to weigh a truck follows an exponential probability distribution. The mean arrival rate is 12 trucks per day, and the mean service rate is 18 trucks per day. Determine the following:
(a) What is the probability that no trucks are in the system?
(b) What is the average number of trucks waiting for service?
(c) What is the average time a truck waits for weighing service to begin?
(d) What is the probability that an arriving truck will have to wait for service?
Solution: Given λ = 12 trucks per day, μ = 18 trucks per day.
(a) Probability that no trucks are in the system,

$$P_0 = 1 - \frac{\lambda}{\mu} = 1 - \frac{12}{18} = 0.3333$$

(b) Average number of trucks waiting for service,

$$L_q = \frac{\lambda^2}{\mu(\mu - \lambda)} = \frac{12^2}{18(18 - 12)} = 1.33 \text{ trucks}$$

(c) Average time a truck waits for weighing service to begin,

$$W_q = \frac{\lambda}{\mu(\mu - \lambda)} = \frac{12}{18(18 - 12)} = 0.1111 \text{ days}$$

or 53.3 minutes (taking an 8-hour working day).
(d) Probability that an arriving truck will have to wait for service,

$$P(n > 0) = 1 - P_0 = 1 - 0.3333 = 0.6667 \text{ or } 66.67\%$$
Check Your Progress 9.1
1. Explain Queuing Theory giving a few examples.
2. Both the Poisson and Exponential distributions play a prominent role in queuing theory. Justify the statement.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Press Solve to get the output screen and select scenario 1 option in the select output option menu. The output screen for the problem is displayed as shown in Figure 9.6.
The values are
(a) P0 = 0.3333 (for n = 0)
(b) Lq = 1.33
(c) Wq = 0.1111
(d) Pb (or) ρ/C = 0.66667
In the same problem, to determine the probability that there are 2 trucks in the system, we use the formula,

$$P_n = \left(\frac{\lambda}{\mu}\right)^n \left(1 - \frac{\lambda}{\mu}\right)$$

$$P_2 = \left(\frac{12}{18}\right)^2 \left(1 - \frac{12}{18}\right) = 0.4444 \times 0.3333 = 0.14815 \text{ or } 14.81\%$$

This can also be read in the output screen: for n = 2 the probability Pn = 0.14815. Similarly, the probabilities for different values of n can be directly read.
Example 6: A TV repairman finds that the time spent on his jobs has an exponential distribution with mean 30 minutes. If he repairs TV sets in the order in which they come in, and if the arrivals follow approximately a Poisson distribution with an average rate of 10 per 8-hour day, what is the repairman's expected idle time each day? How many jobs are ahead of the average set just brought in?
Solution: Given λ = 10 TV sets per day, μ = 16 TV sets per day.
(i) The probability for the repairman to be idle is P0 = 1 − ρ. We know ρ = λ/μ = 10/16 = 0.625, so
P0 = 1 − ρ = 1 − 0.625 = 0.375
Expected idle time per day = 8 × 0.375 = 3 hours.
(ii) Number of jobs ahead of the average set just brought in,

$$L_s = \frac{\lambda}{\mu - \lambda} = \frac{10}{16 - 10} = \frac{10}{6} = 1.66, \text{ say 2 jobs.}$$

Example 7: Auto car service provides a single channel water wash service. The incoming arrivals occur at the rate of 4 cars per hour and the mean service rate is 8 cars per hour. Assume that arrivals follow a Poisson distribution and the service rate follows an exponential probability distribution. Determine the following measures of performance:
(a) What is the average time that a car waits for water wash to begin?
(b) What is the average time a car spends in the system?
(c) What is the average number of cars in the system?
Solution: Given λ = 4 cars per hour, μ = 8 cars per hour.
(a) Average time that a car waits for water wash to begin,

$$W_q = \frac{\lambda}{\mu(\mu - \lambda)} = \frac{4}{8(8 - 4)} = 0.125 \text{ hours or } 7.5 \text{ mins.}$$

(b) Average time a car spends in the system,

$$W_s = \frac{1}{\mu - \lambda} = \frac{1}{8 - 4} = \frac{1}{4} = 0.25 \text{ hours or } 15 \text{ mins.}$$

(c) Average number of cars in the system,

$$L_s = \frac{\lambda}{\mu - \lambda} = \frac{4}{8 - 4} = \frac{4}{4} = 1 \text{ car.}$$
Example 8: Arrivals at a telephone booth are considered to be Poisson distributed with an average time of 10 minutes between one arrival and the next. The length of a phone call is assumed to be distributed exponentially, with mean 3 minutes.
(i) What is the probability that a person arriving at the booth will have to wait?
(ii) The telephone department will install a second booth when convinced that an arrival would expect to wait at least 3 minutes for a phone call. By how much should the flow of arrivals increase in order to justify a second booth?
(iii) What is the average length of the queue that forms from time to time?
(iv) What is the probability that it will take him more than 10 minutes altogether to wait for the phone and complete his call?
Solution: Given λ = 1/10 = 0.10 person per minute, μ = 1/3 = 0.33 person per minute.
(i) Probability that a person arriving at the booth will have to wait,

$$P(w > 0) = 1 - P_0 = 1 - \left(1 - \frac{\lambda}{\mu}\right) = \frac{\lambda}{\mu} = \frac{0.10}{0.33} = 0.3$$

(ii) The installation of a second booth will be justified when the arrival rate rises enough for the expected waiting time in the queue to reach 3 minutes. Expected waiting time in the queue will be,

$$W_q = \frac{\lambda'}{\mu(\mu - \lambda')}$$

where W_q = E(w) = 3 and λ' is the increased arrival rate that justifies the second booth. Simplifying, we get λ' = 0.16. Hence the increase in arrival rate is 0.16 − 0.10 = 0.06 arrivals per minute.
(iii) Average number of units in the system is given by,

$$L_s = \frac{\rho}{1 - \rho} = \frac{0.3}{1 - 0.3} = 0.43 \text{ customers}$$

(iv) Probability of waiting for 10 minutes or more,

$$P(w \ge 10) = \int_{10}^{\infty} \frac{\lambda}{\mu} (\mu - \lambda) e^{-(\mu - \lambda)t} \, dt = 0.03$$

This shows that 3 percent of the arrivals on an average will have to wait for 10 minutes or more before they can use the phone.
Example 9: A bank has decided to open a single server drive-in banking facility at its main branch office. It is estimated that 20 customers arrive each hour on an average. The time required to serve a customer is 2.5 minutes on an average. Assume that arrivals follow a Poisson distribution and the service rate follows an exponential probability distribution. The bank manager is interested in knowing the following:
(a) What will be the average waiting time of a customer to get the service?
(b) The proportion of time that the system will be idle.
(c) The space required to accommodate all the arrivals, on an average, if the space taken by each car that is waiting for service is 10 feet.
Solution: Given λ = 20 customers per hour and, taking the mean service time as 2.5 minutes, μ = 60/2.5 = 24 customers per hour.
(a) Average waiting time of a customer to get the service,

$$W_q = \frac{\lambda}{\mu(\mu - \lambda)} = \frac{20}{24(24 - 20)} = \frac{20}{96} = 0.208 \text{ hour or } 12.5 \text{ mins.}$$

(b) The proportion of time that the system will be idle,

$$P_0 = 1 - \frac{\lambda}{\mu} = 1 - \frac{20}{24} = 0.1667 \text{ or } 16.67\%$$

(c) Average number of customers waiting in the queue,

$$L_q = \frac{\lambda^2}{\mu(\mu - \lambda)} = \frac{20^2}{24(24 - 20)} = \frac{400}{96} = 4.17 \text{ customers.}$$

10 feet is required for 1 customer. Hence, for 4.17 customers, the space required is 10 × 4.17 = 41.7 feet.
Example 10: In a bank, customers arrive to deposit cash at a single counter server every 15 minutes. The bank staff on an average takes 10 minutes to serve a customer. The manager of the bank noticed that on an average at least one customer was waiting at the counter. To eliminate the customer waiting time, the manager provided an automatic currency counting machine to the staff. This decreased the service time to 5 minutes on an average for every customer. Determine whether this rate of service will satisfy the manager's interest. Also use computer with TORA for solving the problem.
Solution:
Case 1: λ = 60/15 = 4 customers per hour, μ = 60/10 = 6 customers per hour.
Case 2: λ = 4 customers per hour, μ = 60/5 = 12 customers per hour.
Since no customers are standing in the queue the manager's interest is satisfied.
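The two cases of Example 10 can be compared numerically with a short Python sketch. The function names `queue_length` and `system_length` are assumptions made here; they apply the single-server formulas for L_q and L_s with λ = 4 per hour and service rates of 6 and 12 per hour (10-minute and 5-minute service).

```python
def queue_length(lam, mu):
    """Mean number waiting in queue, Lq, for a single-server queue."""
    return lam ** 2 / (mu * (mu - lam))


def system_length(lam, mu):
    """Mean number in the system, Ls, for a single-server queue."""
    return lam / (mu - lam)


for label, mu in (("Case 1", 6), ("Case 2", 12)):
    print(label, round(queue_length(4, mu), 2), round(system_length(4, mu), 2))
```

With the counting machine the average queue length falls from about 1.33 to about 0.17 customers, which rounds to no customer waiting.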
The problem is worked out using TORA. Enter the values as shown in the input screen below in Figure 9.7.
Press Solve and go to output screen. Select comparative analysis option in the queuing output analysis menu. The following output screen is displayed (Figure 9.8).
Figure 9.8: Comparative Analysis of Queuing Output Analysis Using TORA (Output Screen)
Now, on comparing scenario 1 and scenario 2, Ls, i.e., the average number of customers in the system, is 2 and 0.5 respectively. In the first scenario, this means that, on an average, one customer is waiting in the queue while another is being served. In scenario 2, on an average at most one customer is in the system and being served, and no customer is waiting.
Example 11: 12 counters are available in a computerized railway reservation system. The arrival rate during peak hours is 90 customers per hour. It takes 5 minutes to serve a customer on an average. Assume that the arrivals joining a queue will not be jockeying (i.e., moving to another queue). How many counters have to be opened if customers should not have to wait for more than 15 minutes?
Solution: The problem is to be solved as one system comprising 'n' single server queuing models.
Arrival rate, λ = 90 customers per hour
Service rate, μ = 60/5 = 12 per hour
Average waiting time, Wq = 15/60 = 0.25 hours, i.e.,

$$W_q = \frac{\lambda}{\mu(\mu - \lambda)} = 0.25 \qquad (i)$$

Let the number of counters be x. Considering the single server queuing system, the arrival rate at each of the counters required to serve 90 arrivals per hour is λ = 90/x. Substituting λ = 90/x in equation (i),

$$0.25 = \frac{90/x}{12\left(12 - \dfrac{90}{x}\right)} = \frac{90}{12(12x - 90)}$$

0.25 × 12(12x − 90) = 90
3(12x − 90) = 90
36x − 270 = 90
36x = 360
x = 360/36 = 10 counters
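The same answer can be found by trial, opening one more counter at a time until the waiting-time target is met. This Python sketch follows the solution's assumption that arrivals split evenly across independent single-server counters; the function names are illustrative.

```python
def waiting_time(lam, mu):
    """Mean waiting time in queue, Wq, for one single-server counter."""
    return lam / (mu * (mu - lam))


def counters_needed(total_rate, mu, max_wait):
    """Smallest number of counters keeping Wq at or below max_wait."""
    counters = 1
    while True:
        lam = total_rate / counters          # arrivals per counter
        if lam < mu and waiting_time(lam, mu) <= max_wait + 1e-12:
            return counters
        counters += 1


print(counters_needed(90, 12, 0.25))  # rates per hour, wait in hours
```

With 9 counters the wait per counter is 10/(12 × 2), about 0.42 hour, so 9 counters are not enough.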
Hence, 10 counters are required so that an average arrival will wait no more than 15 minutes.
Example 12: In a single pump petrol station, vehicles arrive at the rate of 20 customers per hour and petrol filling takes 2 minutes on an average. Assuming that arrivals follow a Poisson probability distribution and service times are exponentially distributed, determine
(a) What is the probability that no vehicles are in the petrol station?
(b) What is the probability that 1 customer is filling and no one is waiting in the queue?
(c) What is the probability that 1 customer is filling and 2 customers are waiting in the queue?
(d) What is the probability that more than 2 customers are waiting?
Solution: Given λ = 20 vehicles per hour, μ = 60/2 = 30 vehicles per hour.
(a) Probability that no vehicles are in the petrol station,

$$P_0 = 1 - \frac{\lambda}{\mu} = 1 - \frac{20}{30} = 0.3334 \text{ or } 33.34\%$$

(b) Probability that 1 customer is filling and no one is waiting in the queue,

$$P_1 = \left(\frac{\lambda}{\mu}\right) P_0 = \left(\frac{\lambda}{\mu}\right)^1 \left(1 - \frac{\lambda}{\mu}\right) = 0.6666 \times 0.3334 = 0.2222 \text{ or } 22.22\%$$

(c) Probability that 1 customer is filling and 2 customers are waiting in the queue, i.e., there are 3 customers in the system,

$$P_3 = \left(\frac{20}{30}\right)^3 \left(1 - \frac{20}{30}\right) = 0.2963 \times 0.3334 = 0.09878 \text{ or } 9.87\%$$

(d) Probability that more than 2 customers are waiting, i.e., more than 3 customers are in the system,

$$P(n > 3) = \sum_{n=4}^{\infty} P_n = \left(\frac{\lambda}{\mu}\right)^4 = \left(\frac{20}{30}\right)^4 = 0.1975 \text{ or } 19.75\%$$

Note that the single term P4 = 0.1975 × 0.3334 = 0.0658 is the probability of exactly 4 customers in the system.
The calculation made for the above problem is represented in the TORA output screen shown below in Figure 9.9.
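The state probabilities of Example 12 can be checked with a brief Python sketch. The function names `prob_n` and `prob_more_than` are assumptions introduced here; the second one uses the fact that, for this single-server model, the probabilities P_n form a geometric series, so the probability of more than n customers in the system is (λ/μ)^(n+1).

```python
def prob_n(n, lam, mu):
    """Probability of exactly n customers in a single-server system."""
    rho = lam / mu
    return rho ** n * (1 - rho)


def prob_more_than(n, lam, mu):
    """Probability of more than n customers in the system."""
    return (lam / mu) ** (n + 1)


lam, mu = 20, 30  # vehicles per hour
for n in (0, 1, 3, 4):
    print(n, round(prob_n(n, lam, mu), 4))
print("more than 3:", round(prob_more_than(3, lam, mu), 4))
```

Here the probability of exactly 4 customers is about 0.0658, while the probability of more than 3 customers is about 0.1975; the two should not be confused when answering part (d).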
1. The assumptions of queuing theory are so restrictive as to render behaviour prediction of queuing systems practically worthless. Discuss.
2. Explain the meaning of a queue and state the object of queuing analysis. Briefly describe, with the help of a hypothetical example, the elements of the queuing system.
3. Give examples of five situations/circumstances in which there is a limited or finite waiting line.
4. Elaborate the vital operating characteristics of a queuing system.
5. What are the modules of the following queuing systems? Draw and explain the configuration of each.
(a) General store
(b) Big Bazar
(c) Railway reservation
(d) Car wash at the service center.
Write your answer in the space given below. Please go through the lesson sub-head thoroughly; you will get your answers in it. This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
9.10 KEYWORDS
Balking : A customer may not like to join the queue seeing it very long and he may not like to wait.
Reneging : He may leave the queue due to impatience after joining.
Collusion : Several customers may collaborate and only one of them may stand in the queue.
Jockeying : If there are a number of queues then one may leave one queue to join another.
Queue length : No. of customers waiting in the queue.
Queuing system : System consisting of arrival of customers, waiting in queue, being picked up for service according to a certain discipline, being serviced and departure of customers.
Service station : Point where service is provided.
Customer : Person or unit arriving at a station for service. Customer may be a machine or person.
Waiting time : Time a customer spends in the queue before being serviced.
Exercise Problems
1. A Bank operates a single facility ATM machine. Customers arrive at the rate of 10 customers per hour according to Poisson probability distribution. The time taken for an ATM transaction is exponential which means 3 minutes on an average. Find the following: (a) (b) (c) 2. Average waiting time of a customer before service. Average number of customers in the system. Probability that the ATM is idle.
At an average 12 cars per hour arrive at a single-server, drive-in teller. The average service time for each customer is 4 minutes, and the arrivals and services are Poisson and exponentially distributed respectively. Answer the following questions: (a) (b) (c) What is the proportion that the teller is idle? What is the time spent by a customer to complete his transaction? What is the probability that an arriving car need not wait to take-up service?
3.
At a single facility security check at an airport, passengers arrive at the checkpoint on an average of 8 passengers per minute and follows a Poisson probability distribution. The checking time for a customer entering security check area takes 10 passengers per minute and follows an exponential probability distribution. Determine the following: (a) (b) On an average, how many passengers are waiting in queue to enter the checkpoint? On an average, what is the time taken by a customer leaving the checkpoint?
4.
In a college computer lab, computers are interconnected to one laser printer. The printer receives data files for printing from these 25 computers interconnected to it. The printer prints the files received from these 25 computers at the rate of 5 data files per minute. The average time required to print a data file is 6 minutes. Assuming the arrivals are Poisson distributed and service times are exponentially distributed, determine (a) (b) (c) What is the probability that the printer is busy? On an average, how much time must a computer operator wait to take a print-out? On an average, what is the expected number of operators that will be waiting to take a print-out?
5. Skyline Pizza is a famous restaurant operating a number of outlets. The restaurant uses a toll-free telephone number to book pizzas at any of its outlets. It was found that an average of 15 calls are received per hour and the average time to handle each call is 2.5 minutes. Determine the following:
(a) What is the average waiting time of an incoming caller?
(b) What is the probability that a caller gets connected immediately?
(c) If the restaurant manager feels that an average waiting time of more than 5 minutes will lead to customer loss, so that the restaurant will have to go in for a second toll-free facility, what should be the new arrival rate in order to justify another facility?
6. From historical data, a two-wheeler service station observes that bikes arrive only for water wash at the rate of 7 per hour per 8-hour shift. The manager has a record that it takes 5 minutes for water service and another 2 minutes for greasing and general check. Assuming that one bike is washed at a time, find the following:
(a) Average number of bikes in line.
(b) Average time a bike waits before it is washed.
(c) Average time a bike spends in the system.
(d) Utilization rate of the bike wash.
(e) Probability that no bikes are in the system.
7. In a departmental store, an automated coffee vending machine is installed. Customers arrive at a rate of 3 per minute and it takes an average time of 10 seconds to dispense a cup of coffee:
(a) Determine the number of customers in the queue.
(b) Determine the waiting time of a customer.
(c) Find the probability that there are exactly 10 customers in the system.
8. At a toll gate, vehicles arrive at a rate of 120 per hour. The average time for a vehicle to get a pass is 25 seconds. The arrivals follow a Poisson distribution and service times follow an exponential distribution.
(a) Find the average number of vehicles waiting and the idle time of the check-post.
(b) If the idle time of the check-post is less than 10%, the check-post authorities will install a second gate. Suggest whether a second gate is necessary.
9. A hospital has an X-ray lab where patients (both in-patient and out-patient) arrive at a rate of 5 per minute. Due to variation in requirement, the time taken for one patient is 3 minutes and follows an exponential distribution.
(a) What is the probability that the system is busy?
(b) What is the probability that nobody is in the system?
10. In the production shop of a company, the breakdown of machines is found to be Poisson with an average rate of 3 machines per hour. Breakdown time at one machine costs Rs. 40 per hour to the company. There are two choices before the company for hiring a repairman: one of the repairmen is slow but cheap, the other is fast but expensive. The slow-cheap repairman demands Rs. 20 per hour and will repair the broken-down machines exponentially at the rate of 4 per hour. The fast-expensive repairman demands Rs. 30 per hour and will repair exponentially at an average rate of 6 per hour. Which repairman should be hired?
ANSWERS TO QUESTIONS FOR DISCUSSION
(b) (e) (b) (d) True False (c) False
Unit-IV
LESSON 10
PROBABILITY
CONTENTS
10.0 Aims and Objectives
10.1 Introduction
10.2 Classical Definition of Probability
10.3 Counting Techniques
10.4 Statistical or Empirical Definition of Probability
10.5 Axiomatic or Modern Approach to Probability
10.6 Theorems on Probability-I
10.7 Theorems on Probability-II
10.8 Let us Sum Up
10.9 Lesson-end Activity
10.10 Keywords
10.11 Questions for Discussion
10.12 Terminal Questions
10.13 Model Answers to Questions for Discussion
10.14 Suggested Readings
10.1 INTRODUCTION
The concept of probability originated from the analysis of games of chance in the 17th century. The subject has now developed to the extent that it is very difficult to imagine a discipline, be it from the social or natural sciences, that can do without it. The theory of probability is a study of Statistical or Random Experiments. It is the backbone of Statistical Inference and Decision Theory, which are essential tools for the analysis of most modern business and economic problems. Often, in our day-to-day life, we hear sentences like 'it may rain today', 'Mr X has fifty-fifty chances of passing the examination', 'India may win the forthcoming cricket match against Sri Lanka', 'the chances of making profits by investing in shares of company A are very bright', etc. Each of the above sentences involves an element of uncertainty.
A phenomenon or an experiment which can result in more than one possible outcome is called a random phenomenon, random experiment or statistical experiment. Although we may be aware of all the possible outcomes of a random experiment, it is not possible to predetermine the outcome associated with a particular experimentation or trial. Consider, for example, the toss of a coin. The result of a toss can be a head or a tail; therefore, it is a random experiment. Here we know that either a head or a tail would occur as a result of the toss; however, it is not possible to predetermine the outcome. With the use of probability theory, it is possible to assign a quantitative measure to express the extent of uncertainty associated with the occurrence of each possible outcome of a random experiment.
If n is the number of equally likely, mutually exclusive and exhaustive outcomes of a random experiment, out of which m outcomes are favourable to the occurrence of an event A, then the probability that A occurs, denoted by P(A), is given by:
$P(A) = \frac{m}{n} = \frac{\text{number of outcomes favourable to } A}{\text{number of exhaustive outcomes}}$
Various terms used in the above definition are explained below:
1. Equally likely outcomes: The outcomes of a random experiment are said to be equally likely or equally probable if the occurrence of none of them is expected in preference to others. For example, if an unbiased coin is tossed, the two possible outcomes, a head or a tail, are equally likely.
2. Mutually exclusive outcomes: Two or more outcomes of an experiment are said to be mutually exclusive if the occurrence of one of them precludes the occurrence of all others in the same trial. For example, the two possible outcomes of the toss of a coin are mutually exclusive. Similarly, the occurrences of the numbers 1, 2, 3, 4, 5, 6 in the roll of a six-faced die are mutually exclusive.
3. Exhaustive outcomes: It is the totality of all possible outcomes of a random experiment. The number of exhaustive outcomes in the roll of a die is six. Similarly, there are 52 exhaustive outcomes in the experiment of drawing a card from a pack of 52 cards.
4. Event: The occurrence or non-occurrence of a phenomenon is called an event. For example, in the toss of two coins, there are four exhaustive outcomes, viz. (H, H), (H, T), (T, H), (T, T). The events associated with this experiment can be defined in a number of ways. For example, (i) the event of occurrence of head on both the coins, (ii) the event of occurrence of head on at least one of the two coins, (iii) the event of non-occurrence of head on the two coins, etc.
An event can be simple or composite depending upon whether it corresponds to a single outcome of the experiment or not. In the example given above, the event defined by (i) is simple, while those defined by (ii) and (iii) are composite events.
Example 1: What is the probability of obtaining a head in the toss of an unbiased coin?
Solution: This experiment has two possible outcomes, i.e., occurrence of a head or a tail. These two outcomes are mutually exclusive and exhaustive. Since the coin is given to be unbiased, the two outcomes are equally likely. Thus, all the conditions of the classical definition are satisfied.
No. of cases favourable to the occurrence of head = 1
No. of exhaustive cases = 2
∴ Probability of obtaining a head, $P(H) = \frac{1}{2}$.
Example 2: What is the probability of obtaining at least one head in the simultaneous toss of two unbiased coins?
Solution: The equally likely, mutually exclusive and exhaustive outcomes of the experiment are (H, H), (H, T), (T, H) and (T, T), where H denotes a head and T denotes a tail. Thus, n = 4. Let A be the event that at least one head occurs. This event corresponds to the first three outcomes of the random experiment. Therefore, m = 3. Hence, the probability that A occurs is $P(A) = \frac{3}{4}$.
Example 3: Find the probability of obtaining an odd number in the roll of an unbiased die.
Solution: The number of equally likely, mutually exclusive and exhaustive outcomes is n = 6. There are three odd numbers out of the numbers 1, 2, 3, 4, 5 and 6. Therefore, m = 3. Thus, the probability of occurrence of an odd number = $\frac{3}{6} = \frac{1}{2}$.
Example 4: What is the chance of drawing a face card in a draw from a pack of 52 well-shuffled cards?
Solution: Total possible outcomes n = 52. Since the pack is well-shuffled, these outcomes are equally likely. Further, since only one card is to be drawn, the outcomes are mutually exclusive. There are 12 face cards, ∴ m = 12. Thus, the probability of drawing a face card = $\frac{12}{52} = \frac{3}{13}$.
Example 5: What is the probability that a leap year selected at random will contain 53 Sundays?
Solution: A leap year has 366 days. It contains 52 complete weeks, i.e., 52 Sundays. The remaining two days of the year could be any one of the following pairs: (Monday, Tuesday), (Tuesday, Wednesday), (Wednesday, Thursday), (Thursday, Friday), (Friday, Saturday), (Saturday, Sunday), (Sunday, Monday). Thus, there are seven possibilities, out of which the last two are favourable to the occurrence of a 53rd Sunday. Hence, the required probability = $\frac{2}{7}$.
Example 6: Find the probability of throwing a total of six in a single throw with two unbiased dice.
Solution: The number of exhaustive cases n = 36, because with two dice all the possible outcomes are : (1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6), (2, 1), (2, 2), (2, 3), (2, 4), (2, 5), (2, 6), (3, 1), (3, 2), (3, 3), (3, 4), (3, 5), (3, 6), (4, 1), (4, 2), (4, 3), (4, 4), (4, 5), (4, 6), (5, 1), (5, 2), (5, 3), (5, 4), (5, 5), (5, 6), (6, 1), (6, 2), (6, 3), (6, 4), (6, 5), (6, 6).
Out of these outcomes, the cases favourable to the event A of getting a total of 6 are: (1, 5), (2, 4), (3, 3), (4, 2), (5, 1). Thus, we have m = 5.
∴ $P(A) = \frac{5}{36}$
Example 7: A bag contains 15 tickets marked with numbers 1 to 15. One ticket is drawn at random. Find the probability that:
(i) the number on it is greater than 10,
(ii) the number on it is even,
(iii) the number on it is a multiple of 2 or 5.
Solution: Number of exhaustive cases n = 15.
(i) Tickets with a number greater than 10 are 11, 12, 13, 14 and 15. Therefore, m = 5 and hence the required probability = $\frac{5}{15} = \frac{1}{3}$.
(ii) Number of even numbered tickets m = 7. ∴ Required probability = $\frac{7}{15}$.
(iii) The multiples of 2 are: 2, 4, 6, 8, 10, 12, 14 and the multiples of 5 are: 5, 10, 15. ∴ m = 9 (note that 10, which appears among both sets of multiples, is counted only once). Thus, the required probability = $\frac{9}{15} = \frac{3}{5}$.
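The classical definition P(A) = m/n can be checked mechanically by listing the exhaustive outcomes and counting the favourable ones. The following is a minimal Python sketch of that idea; the helper name `classical_probability` is introduced here purely for illustration and is not part of the text.

```python
from fractions import Fraction
from itertools import product


def classical_probability(outcomes, is_favourable):
    """Return m/n for a finite collection of equally likely outcomes."""
    outcomes = list(outcomes)
    m = sum(1 for outcome in outcomes if is_favourable(outcome))
    return Fraction(m, len(outcomes))


# Example 2: at least one head in a toss of two coins
two_coins = list(product("HT", repeat=2))
p_at_least_one_head = classical_probability(two_coins, lambda o: "H" in o)

# Example 6: a total of six with two dice
two_dice = list(product(range(1, 7), repeat=2))
p_total_six = classical_probability(two_dice, lambda o: sum(o) == 6)

# Example 7 (iii): ticket number is a multiple of 2 or 5
tickets = range(1, 16)
p_multiple_2_or_5 = classical_probability(
    tickets, lambda t: t % 2 == 0 or t % 5 == 0
)

print(p_at_least_one_head, p_total_six, p_multiple_2_or_5)
```

Using `Fraction` keeps the answers in the same reduced form as the worked solutions instead of decimal approximations.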
(b) Permutations of n objects taking r at a time: In terms of the example considered above, we now have n persons to be seated on r chairs, where r ≤ n. Thus,
$^nP_r = n(n-1)(n-2) \cdots [n-(r-1)] = n(n-1)(n-2) \cdots (n-r+1)$.
Multiplying and dividing the right-hand side by $(n-r)!$, we get
$^nP_r = \frac{n(n-1)(n-2) \cdots (n-r+1)\,(n-r)!}{(n-r)!} = \frac{n!}{(n-r)!}$
(c) Permutations of n objects taking r at a time when any object may be repeated any number of times: Here, each of the r places can be filled in n ways. Therefore, the total number of permutations is $n^r$.
(d) Permutations of n objects in a circular order: Suppose that there are three persons A, B and C, to be seated on three chairs 1, 2 and 3, in a circular order. Then the following three arrangements are identical:
[Figure 10.1: three identical circular arrangements of A, B and C]
Similarly, if n objects are seated in a circle, there will be n identical arrangements of the above type. Thus, in order to obtain the distinct permutations of n objects in circular order, we divide $^nP_n$ by n, where $^nP_n$ denotes the number of permutations in a row. Hence, the number of permutations in a circular order is $\frac{n!}{n} = (n-1)!$.
(e) Permutations with restrictions: If out of n objects $n_1$ are alike of one kind, $n_2$ are alike of another kind, ...... $n_k$ are alike, the number of permutations is $\frac{n!}{n_1!\,n_2! \cdots n_k!}$. Since the permutation of $n_i$ objects which are alike is only one (i = 1, 2, ...... k), n! is to be divided by $n_1!, n_2!, \ldots, n_k!$ to get the required permutations.
Example 8: What is the total number of ways of simultaneous throwing of (i) 3 coins, (ii) 2 dice and (iii) 2 coins and a die?
Solution:
(i) Each coin can be thrown in any one of two ways, i.e., a head or a tail; therefore, the number of ways of simultaneous throwing of 3 coins = $2^3$ = 8.
(ii) Similarly, the total number of ways of simultaneous throwing of two dice is equal to $6^2$ = 36, and
(iii) the total number of ways of simultaneous throwing of 2 coins and a die is equal to $2^2 \times 6$ = 24.
Example 9: A person can go from Delhi to Port-Blair via Allahabad and Calcutta using the following modes of transport:
Delhi to Allahabad      Allahabad to Calcutta      Calcutta to Port-Blair
By Rail                 By Rail                    By Air
By Bus                  By Bus                     By Ship
By Car                  By Car
By Air                  By Air
In how many different ways can the journey be planned?
Solution: The journey from Delhi to Port-Blair can be treated as three operations: from Delhi to Allahabad, from Allahabad to Calcutta and from Calcutta to Port-Blair. Using the fundamental principle of counting, the journey can be planned in 4 × 4 × 2 = 32 ways.
Example 10: In how many ways can the first, second and third prizes be given to 10 competitors?
Solution: There are 10 ways of giving the first prize, nine ways of giving the second prize and eight ways of giving the third prize. Therefore, the total number of ways is 10 × 9 × 8 = 720.
Alternative method: Here n = 10 and r = 3, ∴ $^{10}P_3 = \frac{10!}{(10-3)!}$ = 720.
Example 11:
(a) There are 5 doors in a room. In how many ways can three persons enter the room using different doors?
(b) A lady is asked to rank 5 types of washing powders according to her preference. Calculate the total number of possible rankings.
(c) In how many ways can 6 passengers be seated on 15 available seats?
(d) If there are six different trains available for a journey between Delhi and Kanpur, calculate the number of ways in which a person can complete his return journey by using a different train in each direction.
(e) In how many ways can the President, Vice-President, Secretary and Treasurer of an association be nominated at random out of 130 members?
Solution:
(a) The first person can use any of the 5 doors and hence can enter the room in 5 ways. Similarly, the second person can enter in 4 ways and the third person can enter in 3 ways. Thus, the total number of ways is $^5P_3 = \frac{5!}{2!}$ = 60.
(b) The total number of rankings is $^5P_5 = \frac{5!}{0!}$ = 120. (Note that 0! = 1.)
(c) The total number of ways of seating 6 passengers on 15 seats is $^{15}P_6 = \frac{15!}{9!}$ = 36,03,600.
(d) The total number of ways of performing the return journey, using a different train in each direction, is 6 × 5 = 30, which is also equal to $^6P_2$.
(e) The total number of ways of nominating for the 4 posts of the association is $^{130}P_4 = \frac{130!}{126!}$ = 130 × 129 × 128 × 127 = 27,26,13,120.
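The permutation counts in Examples 10 and 11 all follow from the single formula n!/(n − r)!. As a rough check, the sketch below evaluates that formula in Python; the function name `n_p_r` and the variable names are illustrative choices, and `math.perm` (available from Python 3.8) is used only as a cross-check.

```python
import math


def n_p_r(n, r):
    """Permutations of n objects taking r at a time: n! / (n - r)!."""
    return math.factorial(n) // math.factorial(n - r)


prizes = n_p_r(10, 3)           # Example 10
doors = n_p_r(5, 3)             # Example 11 (a)
rankings = n_p_r(5, 5)          # Example 11 (b), relies on 0! = 1
seats = n_p_r(15, 6)            # Example 11 (c)
return_journey = n_p_r(6, 2)    # Example 11 (d)
office_bearers = n_p_r(130, 4)  # Example 11 (e)

# Cross-check against the standard-library helper
cases = [(10, 3), (5, 3), (5, 5), (15, 6), (6, 2), (130, 4)]
assert all(n_p_r(n, r) == math.perm(n, r) for n, r in cases)

print(prizes, doors, rankings, seats, return_journey, office_bearers)
```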
Example 12: Three prizes are awarded, one each for getting more than 80% marks, 98% attendance and good behaviour in the college. In how many ways can the prizes be awarded if 15 students of the college are eligible for the three prizes?
Solution: Note that all the three prizes can be awarded to the same student. The prize for getting more than 80% marks can be awarded in 15 ways, the prize for 98% attendance can be awarded in 15 ways and the prize for good behaviour can also be awarded in 15 ways. Thus, the total number of ways is $n^r = 15^3$ = 3,375.
Example 13:
(a) In how many ways can the letters of the word EDUCATION be arranged?
(b) In how many ways can the letters of the word STATISTICS be arranged?
(c) In how many ways can 20 students be allotted to 4 tutorial groups of 4, 5, 5 and 6 students respectively?
(d) In how many ways can 10 members of a committee be seated at a round table if (i) they can sit anywhere, (ii) the president and the secretary must not sit next to each other?
Solution:
(a) The given word EDUCATION has 9 letters. Therefore, the number of permutations of 9 letters is 9! = 3,62,880.
(b) The word STATISTICS has 10 letters in which there are 3 S's, 3 T's, 2 I's, 1 A and 1 C. Thus, the required number of permutations = $\frac{10!}{3!\,3!\,2!\,1!\,1!}$ = 50,400.
(c) The required number of ways = $\frac{20!}{4!\,5!\,5!\,6!}$.
(d) (i) The number of permutations of 10 persons at a round table = 9! = 3,62,880.
(ii) We first find the number of permutations when the president and the secretary must sit together. For this we consider the president and the secretary as one person. The 9 persons can then be seated at a round table in 8! = 40,320 ways, and the president and the secretary can be arranged between themselves in 2! ways, giving 2! × 8! = 80,640 permutations. ∴ The number of permutations when the president and the secretary must not sit together = 3,62,880 − 80,640 = 2,82,240.
Example 14:
(a) In how many ways can 4 men and 3 women be seated in a row such that the women occupy the even places?
(b) In how many ways can 4 men and 4 women be seated such that men and women occupy alternate places?
Solution:
(a) 4 men can be seated in 4! ways and 3 women can be seated in 3! ways. Since each arrangement of men is associated with each arrangement of women, the required number of permutations = 4! × 3! = 144.
(b) There are two ways in which 4 men and 4 women can be seated: MWMWMWMW or WMWMWMWM. ∴ The required number of permutations = 2 × 4! × 4! = 1,152.
Example 15: There are 3 different books of economics, 4 different books of commerce and 5 different books of statistics. In how many ways can these be arranged on a shelf when
(a) all the books are arranged at random,
(b) books of each subject are arranged together,
(c) books of only statistics are arranged together, and
(d) books of statistics and books of other subjects are arranged together?
Solution:
(a) The required number of permutations = 12!
(b) The economics books can be arranged in 3! ways, commerce books in 4! ways and statistics books in 5! ways. Further, the three groups can be arranged in 3! ways. ∴ The required number of permutations = 3! × 4! × 5! × 3! = 1,03,680.
(c) Consider the 5 books of statistics as one book. Then 8 books can be arranged in 8! ways and the 5 books of statistics can be arranged among themselves in 5! ways. ∴ The required number of permutations = 8! × 5! = 48,38,400.
(d) There are two groups, which can be arranged in 2! ways. The books of other subjects can be arranged in 7! ways and the books of statistics can be arranged in 5! ways. Thus, the required number of ways = 2! × 7! × 5! = 12,09,600.
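Round-table counts are easy to get wrong, because a pair treated as one block can itself be ordered in 2! ways. The sketch below is a brute-force check, under the assumption that rotations of a seating are identical; it confirms for small tables that n persons have (n − 1)! circular seatings, of which (n − 1)! − 2 × (n − 2)! keep two particular persons apart. The labels 0 and 1 for the two particular persons (say, president and secretary) and the function name are illustrative only.

```python
from itertools import permutations
from math import factorial


def circular_counts(n):
    """Count seatings of persons 0..n-1 round a table by enumeration.

    Rotations are treated as identical by fixing person 0 in the first seat.
    Returns (all seatings, seatings where persons 0 and 1 are not neighbours).
    """
    total = 0
    apart = 0
    for rest in permutations(range(1, n)):
        seating = (0,) + rest
        total += 1
        position_of_1 = seating.index(1)
        # Person 0 sits at index 0; the neighbouring seats are 1 and n - 1.
        if position_of_1 not in (1, n - 1):
            apart += 1
    return total, apart


for n in range(4, 8):
    total, apart = circular_counts(n)
    assert total == factorial(n - 1)
    assert apart == factorial(n - 1) - 2 * factorial(n - 2)

# The same formula applied to a committee of 10 members
not_adjacent_10 = factorial(9) - 2 * factorial(8)
print(not_adjacent_10)
```

Enumeration is only practical for small n, which is why the 10-member case is evaluated from the formula rather than by listing seatings.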
Combination
When no attention is given to the order of arrangement of the selected objects, we get a combination. We know that the number of permutations of n objects taking r at a time is $^nP_r$. Since r objects can be arranged among themselves in r! ways, there are r! permutations corresponding to one combination. Thus, the number of combinations of n objects taking r at a time, denoted by $^nC_r$, can be obtained by dividing $^nP_r$ by r!, i.e.,
$^nC_r = \frac{^nP_r}{r!} = \frac{n!}{r!\,(n-r)!}$.
Note: (a) Since $^nC_r = {}^nC_{n-r}$, $^nC_r$ is also equal to the number of combinations of n objects taking (n − r) at a time.
(b) The total number of combinations of n distinct objects taking 1, 2, ...... n respectively, at a time is $^nC_1 + {}^nC_2 + \cdots + {}^nC_n = 2^n - 1$.
Example 16:
(a) In how many ways can two balls be selected from 8 balls?
(b) In how many ways can a group of 12 persons be divided into two groups of 7 and 5 persons respectively?
(c) A committee of 8 teachers is to be formed out of 6 science teachers, 8 arts teachers and a physical instructor. In how many ways can the committee be formed if
1. any teacher can be included in the committee;
2. there should be 3 science and 4 arts teachers on the committee such that (i) any science teacher and any arts teacher can be included, (ii) one particular science teacher must be on the committee, (iii) three particular arts teachers must not be on the committee?
Solution:
(a) 2 balls can be selected from 8 balls in $^8C_2 = \frac{8!}{2!\,6!}$ = 28 ways.
(b) Since $^nC_r = {}^nC_{n-r}$, the number of groups of 7 persons out of 12 is also equal to the number of groups of 5 persons out of 12. Hence, the required number of groups is $^{12}C_7 = \frac{12!}{7!\,5!}$ = 792.
Alternative Method: We may regard 7 persons as being of one type and the remaining 5 persons as being of another type. The required number of groups is equal to the number of permutations of 12 persons where 7 are alike of one type and 5 are alike of another type.
(c) 1. 8 teachers can be selected out of 15 teachers in $^{15}C_8 = \frac{15!}{8!\,7!}$ = 6,435 ways.
2. (i) 3 science teachers can be selected out of 6 teachers in $^6C_3$ ways, 4 arts teachers can be selected out of 8 in $^8C_4$ ways and the physical instructor can be selected in $^1C_1$ way. Therefore, the required number of ways = $^6C_3 \times {}^8C_4 \times {}^1C_1$ = 20 × 70 × 1 = 1400.
(ii) 2 additional science teachers can be selected in $^5C_2$ ways. The number of selections of other teachers is the same as in (i) above. Thus, the required number of ways = $^5C_2 \times {}^8C_4 \times {}^1C_1$ = 10 × 70 × 1 = 700.
(iii) 3 science teachers can be selected in $^6C_3$ ways and 4 arts teachers out of the remaining 5 arts teachers can be selected in $^5C_4$ ways. ∴ The required number of ways = $^6C_3 \times {}^5C_4$ = 20 × 5 = 100.
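The combination counts of Example 16 and the two notes on $^nC_r$ can be verified numerically. The short Python sketch below uses `math.comb` from the standard library (Python 3.8 or later); the variable names are illustrative and n = 6 in the identity check is an arbitrary choice.

```python
from math import comb

# Example 16 (a) and (b)
two_balls = comb(8, 2)
groups_of_seven = comb(12, 7)

# Note (a): choosing r objects is the same as leaving out n - r objects
assert comb(12, 7) == comb(12, 5)

# Example 16 (c)
any_teacher = comb(15, 8)
case_i = comb(6, 3) * comb(8, 4) * comb(1, 1)
case_ii = comb(5, 2) * comb(8, 4) * comb(1, 1)
case_iii = comb(6, 3) * comb(5, 4) * comb(1, 1)

# Note (b): nC1 + nC2 + ... + nCn = 2^n - 1, checked here for n = 6
n = 6
assert sum(comb(n, r) for r in range(1, n + 1)) == 2**n - 1

print(two_balls, groups_of_seven, any_teacher, case_i, case_ii, case_iii)
```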
Ordered Partitions
1. Ordered Partitions (distinguishable objects)
(a) The total number of ways of putting n distinct objects into r compartments which are marked as 1, 2, ...... r is equal to $r^n$, since the first object can be put in any of the r compartments in r ways, the second can be put in any of the r compartments in r ways, and so on.
(b) The number of ways in which n objects can be put into r compartments such that the first compartment contains $n_1$ objects, the second contains $n_2$ objects and so on, the rth compartment contains $n_r$ objects, where $n_1 + n_2 + \cdots + n_r = n$, is given by $\frac{n!}{n_1!\,n_2! \cdots n_r!}$.
To illustrate this, let r = 3. Then $n_1$ objects can be put in the first compartment in $^nC_{n_1}$ ways. Out of the remaining $n - n_1$ objects, $n_2$ objects can be put in the second compartment in $^{n-n_1}C_{n_2}$ ways. Finally, the remaining $n - n_1 - n_2 = n_3$ objects can be put in the third compartment in one way. Thus, the required number of ways is
$^nC_{n_1} \times {}^{n-n_1}C_{n_2} = \frac{n!}{n_1!\,n_2!\,n_3!}$
2. Ordered Partitions (identical objects)
(a) The total number of ways of putting n identical objects into r compartments marked as 1, 2, ...... r, is $^{n+r-1}C_{r-1}$, where each compartment may have none or any number of objects. We can think of n objects being placed in a row and partitioned by (r − 1) vertical lines into r compartments. This is equivalent to permutations of (n + r − 1) objects out of which n are of one type and (r − 1) of another type. The required number of permutations is $\frac{(n+r-1)!}{n!\,(r-1)!}$, i.e., $^{n+r-1}C_n$ or $^{n+r-1}C_{r-1}$.
(b) The total number of ways of putting n identical objects into r compartments is $^{n-1}C_{r-1}$ or $^{n-1}C_{n-r}$, where each compartment must have at least one object. In order that each compartment has at least one object, we first put one object in each of the r compartments. Then the remaining (n − r) objects can be placed as in (a) above.
(c) The formula given in (b) above can be generalised. If each compartment is supposed to have at least k objects, the total number of ways is $^{(n-kr)+(r-1)}C_{r-1}$, where k = 0, 1, 2, .... etc. such that $k < \frac{n}{r}$.
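The three formulas for identical objects are all instances of $^{(n-kr)+(r-1)}C_{r-1}$ with k = 0, k = 1 and general k. A small brute-force comparison, sketched below in Python, can make this concrete; the function names `brute_force` and `formula` and the test cases are assumptions made for illustration.

```python
from itertools import product
from math import comb


def brute_force(n, r, k=0):
    """Count ways to put n identical objects into r marked compartments,
    each holding at least k objects, by direct enumeration."""
    return sum(
        1
        for counts in product(range(n + 1), repeat=r)
        if sum(counts) == n and min(counts) >= k
    )


def formula(n, r, k=0):
    """Closed form: C((n - k*r) + (r - 1), r - 1)."""
    return comb((n - k * r) + (r - 1), r - 1)


# (n, r, k) triples chosen small enough for enumeration
cases = [(5, 3, 0), (5, 3, 1), (7, 3, 2), (6, 4, 1)]
for n, r, k in cases:
    assert brute_force(n, r, k) == formula(n, r, k)

print([formula(n, r, k) for n, r, k in cases])
```

The enumeration grows as (n + 1)^r, so it is meant only as a check on small cases, not as a way of computing large counts.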
Example 17: 4 couples occupy eight seats in a row at random. What is the probability that all the ladies are sitting next to each other?
Solution: Eight persons can be seated in a row in 8! ways. We can treat the 4 ladies as one person. Then, five persons can be seated in a row in 5! ways. Further, the 4 ladies can be seated among themselves in 4! ways. ∴ The required probability = $\frac{5!\,4!}{8!} = \frac{1}{14}$.
Example 18: 12 persons are seated at random (i) in a row, (ii) in a ring. Find the probabilities that three particular persons are sitting together.
Solution:
(i) The required probability = $\frac{10!\,3!}{12!} = \frac{1}{22}$.
(ii) The required probability = $\frac{9!\,3!}{11!} = \frac{3}{55}$.
Example 19: 5 red and 2 black balls, each of different sizes, are randomly laid down in a row. Find the probability that
(i) the two end balls are black,
(ii) there are three red balls between the two black balls, and
(iii) the two black balls are placed side by side.
Solution: The seven balls can be placed in a row in 7! ways.
(i) The black balls can be placed at the ends in 2! ways and, in between them, the 5 red balls can be placed in 5! ways. ∴ The required probability = $\frac{2!\,5!}{7!} = \frac{1}{21}$.
(ii) We can treat BRRRB as one ball. Therefore, this ball along with the remaining two balls can be arranged in 3! ways. The sequence BRRRB can be arranged in 2! × 3! ways and the three red balls of the sequence can be obtained from 5 balls in $^5C_3$ ways. ∴ The required probability = $\frac{3!\,2!\,3!}{7!} \times {}^5C_3 = \frac{1}{7}$.
(iii) The 2 black balls can be treated as one and, therefore, this ball along with the 5 red balls can be arranged in 6! ways. Further, the 2 black balls can be arranged in 2! ways. ∴ The required probability = $\frac{6!\,2!}{7!} = \frac{2}{7}$.
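Since 7! = 5040 is small, the three probabilities of Example 19 can also be obtained by listing every arrangement and counting. The sketch below does this in Python; the ball labels B1, B2, R1, ..., R5 and the helper names are illustrative assumptions standing in for "balls of different sizes".

```python
from fractions import Fraction
from itertools import permutations

# Seven distinguishable balls: two black, five red
balls = ["B1", "B2", "R1", "R2", "R3", "R4", "R5"]
rows = list(permutations(balls))  # all 7! arrangements in a row


def black_positions(row):
    """Indices (0-based) of the two black balls in a row."""
    return [i for i, ball in enumerate(row) if ball.startswith("B")]


def prob(event):
    """Classical probability: favourable rows / all rows."""
    return Fraction(sum(1 for row in rows if event(row)), len(rows))


def gap(row):
    """Distance between the positions of the two black balls."""
    first, second = black_positions(row)
    return second - first


p_ends_black = prob(lambda row: black_positions(row) == [0, 6])
p_three_reds_between = prob(lambda row: gap(row) == 4)
p_side_by_side = prob(lambda row: gap(row) == 1)

print(p_ends_black, p_three_reds_between, p_side_by_side)
```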
Example 20: Each of the two players, A and B, gets 26 cards at random. Find the probability that each player has an equal number of red and black cards.
Solution: Each player can get 26 cards at random in $^{52}C_{26}$ ways. In order that a player gets an equal number of red and black cards, he should have 13 cards of each colour; note that there are 26 red cards and 26 black cards in a pack of playing cards. This can be done in $^{26}C_{13} \times {}^{26}C_{13}$ ways. Hence, the required probability = $\frac{^{26}C_{13} \times {}^{26}C_{13}}{^{52}C_{26}}$.
Example 21: 8 distinguishable marbles are distributed at random into 3 boxes marked as 1, 2 and 3. Find the probability that they contain 3, 4 and 1 marbles respectively.
Solution: Since the first, second, .... 8th marble can each go to any of the three boxes in 3 ways, the total number of ways of putting 8 distinguishable marbles into three boxes is $3^8$. The number of ways of putting the marbles so that the first box contains 3 marbles, the second contains 4 and the third contains 1 is $\frac{8!}{3!\,4!\,1!}$ = 280. ∴ The required probability = $\frac{8!}{3!\,4!\,1!} \times \frac{1}{3^8} = \frac{280}{6561}$.
Example 22: 12 'one rupee' coins are distributed at random among 5 beggars A, B, C, D and E. Find the probability that:
(i) They get 4, 2, 0, 5 and 1 coins respectively.
(ii) Each beggar gets at least two coins.
(iii) None of them goes empty handed.
Solution: The total number of ways of distributing 12 one rupee coins among 5 beggars is $^{12+5-1}C_{5-1} = {}^{16}C_4$ = 1820.
(i) Since the distribution 4, 2, 0, 5, 1 is one way out of 1820 ways, the required probability = $\frac{1}{1820}$.
(ii) After distributing two coins to each of the five beggars, we are left with two coins, which can be distributed among five beggars in $^{2+5-1}C_{5-1} = {}^6C_4$ = 15 ways. ∴ The required probability = $\frac{15}{1820} = \frac{3}{364}$.
(iii) No beggar goes empty handed if each gets at least one coin. The 7 coins that are left after giving one coin to each of the five beggars can be distributed among five beggars in $^{7+5-1}C_{5-1} = {}^{11}C_4$ = 330 ways. ∴ The required probability = $\frac{330}{1820} = \frac{33}{182}$.
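The counts in Example 22 can be reproduced by listing every way of splitting 12 identical coins among 5 beggars. The Python sketch below follows the convention of the example, namely that each distinct distribution (a, b, c, d, e) is treated as equally likely; the variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

coins, beggars = 12, 5

# Every tuple (a, b, c, d, e) of non-negative counts adding up to 12
distributions = [
    d for d in product(range(coins + 1), repeat=beggars) if sum(d) == coins
]
total = len(distributions)

# (i) one specific distribution
p_specific = Fraction(distributions.count((4, 2, 0, 5, 1)), total)

# (ii) each beggar gets at least two coins
p_at_least_two = Fraction(sum(1 for d in distributions if min(d) >= 2), total)

# (iii) nobody goes empty handed
p_none_empty = Fraction(sum(1 for d in distributions if min(d) >= 1), total)

print(total, p_specific, p_at_least_two, p_none_empty)
```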
(iii) When various outcomes of a random experiment are not equally likely.
(iv) This definition doesn't lead to any mathematical treatment of probability.
In view of the above shortcomings of the classical definition, an attempt was made to establish a correspondence between relative frequency and the probability of an event when the total number of trials becomes sufficiently large.
This definition also suffers from the following shortcomings:
(i) The conditions of the experiment may not remain identical, particularly when the number of trials is sufficiently large.
(ii) The relative frequency, $\frac{m}{n}$, may not attain a unique value no matter how large the total number of trials is.
(iii) It may not be possible to repeat an experiment a large number of times.
(iv) Like the classical definition, this definition doesn't lead to any mathematical treatment of probability.
Discrete and Continuous Sample Space
A discrete sample space consists of a finite or countably infinite number of elements. The sample spaces discussed so far are some examples of discrete sample spaces. Contrary to this, a continuous sample space consists of an uncountable number of elements. This type of sample space is obtained when the result of an experiment is a measurement on a continuous scale, like measurements of weight, height, area, volume, time, etc.
Event
An event is any subset of a sample space. In the experiment of the roll of a die, the sample space is S = {1, 2, 3, 4, 5, 6}. It is possible to define various events on this sample space, as shown below:
Let A be the event that an odd number appears on the die. Then A = {1, 3, 5} is a subset of S. Further, let B be the event of getting a number greater than 4. Then B = {5, 6} is another subset of S. Similarly, if C denotes the event of getting the number 3 on the die, then C = {3}.
It should be noted here that the events A and B are composite while C is a simple or elementary event.
Occurrence of an Event
An event is said to have occurred whenever the outcome of the experiment is an element of its set. For example, if we throw a die and obtain 5, then both the events A and B, defined above, are said to have occurred.
It should be noted here that the sample space is certain to occur since the outcome of the experiment must always be one of its elements.
Definition of Probability (Modern Approach)
Let S be a sample space of an experiment and A be any event of this sample space. The probability of A, denoted by P(A), is defined as a real-valued set function which associates a real value with each subset A of the sample space S. In order that P(A) denotes a probability function, the following rules, popularly known as axioms or postulates of probability, must be satisfied.
Axiom I: For any event A in sample space S, we have $0 \le P(A) \le 1$.
Axiom II: $P(S) = 1$.
Axiom III: If $A_1, A_2, \ldots, A_k$ are k mutually exclusive events (i.e., $A_i \cap A_j = \phi$ for $i \ne j$, where $\phi$ denotes a null set) of the sample space S, then
$P(A_1 \cup A_2 \cup \cdots \cup A_k) = \sum_{i=1}^{k} P(A_i)$
The first axiom implies that the probability of an event is a non-negative number less than or equal to unity. The second axiom implies that the probability of an event that is certain to occur must be equal to unity. Axiom III gives a basic rule of addition of probabilities when events are mutually exclusive.
The above axioms provide a set of basic rules that can be used to find the probability of any event of a sample space.
Probability of an Event
Let there be a sample space consisting of n elements, i.e., S = {e1, e2, ...... en}. Since the elementary events e1, e2, ...... en are mutually exclusive, we have, according to axiom III, $P(S) = \sum_{i=1}^{n} P(e_i)$. Similarly, if A = {e1, e2, ...... em} is any subset of S consisting of m elements, where m ≤ n, then $P(A) = \sum_{i=1}^{m} P(e_i)$. Thus, the probability of a sample space or an event is equal to the sum of the probabilities of its elementary events.
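To see the axioms at work on a concrete case, the following Python sketch defines P(A) as the sum of the probabilities of the elementary events in A and checks Axioms I to III. The loaded-die probabilities used here are assumed values chosen only for illustration; they do not come from the text.

```python
from fractions import Fraction
from itertools import combinations

# Assumed elementary-event probabilities for a loaded die (illustrative only)
p = {
    1: Fraction(1, 12),
    2: Fraction(1, 6),
    3: Fraction(1, 6),
    4: Fraction(1, 6),
    5: Fraction(1, 6),
    6: Fraction(1, 4),
}
S = set(p)


def P(event):
    """Probability of an event = sum of probabilities of its elementary events."""
    return sum((p[e] for e in event), Fraction(0))


# Axiom I: 0 <= P(A) <= 1 for every subset A of S
all_events = [set(c) for r in range(len(S) + 1) for c in combinations(S, r)]
assert all(0 <= P(A) <= 1 for A in all_events)

# Axiom II: P(S) = 1
assert P(S) == 1

# Axiom III: for mutually exclusive events the probabilities add
A, C = {1, 3, 5}, {2, 4}
assert A & C == set()
assert P(A | C) == P(A) + P(C)

print(P(A), P(C), P(A | C))
```

Because the elementary probabilities are not all equal, this example also shows that the axiomatic approach does not require equally likely outcomes.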
It is obvious from the above that the probability of an event can be determined if the probabilities of the elementary events belonging to it are known.
The Assignment of Probabilities to various Elementary Events
The assignment of probabilities to various elementary events of a sample space can be done in any one of the following three ways:
1. Using Classical Definition: We know that various elementary events of a random experiment, under the classical definition, are equally likely and, therefore, can be assigned equal probabilities. Thus, if there are n elementary events in the sample space of an experiment, and in view of the fact that $P(S) = \sum_{i=1}^{n} P(e_i) = 1$ (from axiom II), we can assign a probability equal to $\frac{1}{n}$ to every elementary event or, using symbols, we can write $P(e_i) = \frac{1}{n}$ for i = 1, 2, .... n. Further, if A is an event consisting of m elementary events, then
$P(A) = \frac{1}{n} + \frac{1}{n} + \cdots + \frac{1}{n}$ (m times) $= \frac{m}{n}$.
We note that the above expression is similar to the formula obtained under the classical definition.
2. Using Statistical Definition: Using this definition, the assignment of probabilities to various elementary events of a sample space can be done by repeating an experiment a large number of times or by using past records.
3. Subjective Assignment: The assignment of probabilities on the basis of the statistical and the classical definitions is objective. Contrary to this, it is also possible to have subjective assignment of probabilities. Under the subjective assignment, the probabilities of various elementary events are assigned on the basis of the expectations or the degree of belief of the statistician. These probabilities, also known as personal probabilities, are very useful in the analysis of various business and economic problems.
It is obvious from the above that the Modern Definition of probability is a general one which includes the classical and the statistical definitions as its particular cases. Besides this, it provides a set of mathematical rules that are useful for further mathematical treatment of the subject of probability.
Check Your Progress 10.1
3.
1 2.
Explain Exhaustive outcomes with examples. What are combinational methods? Write your answer in the space given below. Please go through the lesson sub-head thoroughly you will get your answers in it. This Chek Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only. (b) (c)
Notes: (a)
__________________________________________________________________
Theorem 2: $P(\bar{A}) = 1 - P(A)$, where $\bar{A}$ is the complement of A.

Proof: We can write $A \cup \bar{A} = S$ or $P(A \cup \bar{A}) = P(S)$.

Since A and $\bar{A}$ are mutually exclusive, we can write

$P(A) + P(\bar{A}) = P(S) = 1$. Hence, $P(\bar{A}) = 1 - P(A)$.
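Theorem 3: For any two events A and B in a sample space S,

$P(\bar{A} \cap B) = P(B) - P(A \cap B)$

Proof: From the Venn diagram (Figure 10.2), we can write

$B = (\bar{A} \cap B) \cup (A \cap B)$ or $P(B) = P[(\bar{A} \cap B) \cup (A \cap B)]$

Since $(\bar{A} \cap B)$ and $(A \cap B)$ are mutually exclusive, we have

$P(B) = P(\bar{A} \cap B) + P(A \cap B)$

or $P(\bar{A} \cap B) = P(B) - P(A \cap B)$.

Similarly, it can be shown that $P(A \cap \bar{B}) = P(A) - P(A \cap B)$.

Figure 10.2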
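Theorem 4 (Addition of Probabilities): For any two events A and B in a sample space S,

$P(A \cup B) = P(A) + P(B) - P(A \cap B)$

Proof: The event $A \cup B$ can be written as the union of the two mutually exclusive events A and $\bar{A} \cap B$, so that $P(A \cup B) = P(A) + P(\bar{A} \cap B)$. Substituting $P(\bar{A} \cap B) = P(B) - P(A \cap B)$ from theorem 3, we get

$P(A \cup B) = P(A) + P(B) - P(A \cap B)$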
Remarks:
1. If A and B are mutually exclusive, i.e., $A \cap B = \phi$, then according to theorem 1, we have $P(A \cap B) = 0$. The addition rule, in this case, becomes $P(A \cup B) = P(A) + P(B)$, which is in conformity with axiom III.
2. The event $A \cup B$ denotes the occurrence of either A or B or both. Alternatively, it implies the occurrence of at least one of the two events.
3. The event $A \cap B$ is a compound event that denotes the simultaneous occurrence of the two events.
4. Alternatively, the event $A \cup B$ is also denoted by A + B and the event $A \cap B$ by AB.
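Corollaries:
1. From the Venn diagram, we can write $P(A \cup B) = 1 - P(\bar{A} \cap \bar{B})$, where $P(\bar{A} \cap \bar{B})$ is the probability that neither of the events A and B occurs.
2. $P(\text{exactly one of A and B occurs}) = P[(A \cap \bar{B}) \cup (\bar{A} \cap B)]$
$= P(A \cap \bar{B}) + P(\bar{A} \cap B)$, since $(A \cap \bar{B}) \cap (\bar{A} \cap B) = \phi$
$= P(A) - P(A \cap B) + P(B) - P(A \cap B)$ (using theorem 3)
$= P(A) + P(B) - 2P(A \cap B)$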
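3. The addition theorem can be generalised for more than two events. If A, B and C are three events of a sample space S, then the probability of occurrence of at least one of them is given by

$P(A \cup B \cup C) = P[A \cup (B \cup C)]$
$= P(A) + P(B \cup C) - P[A \cap (B \cup C)]$
$= P(A) + P(B \cup C) - P[(A \cap B) \cup (A \cap C)]$
$= P(A) + P(B) + P(C) - P(A \cap B) - P(B \cap C) - P(A \cap C) + P(A \cap B \cap C)$ .... (1)

Alternatively, the probability of occurrence of at least one of the three events can also be written as

$P(A \cup B \cup C) = 1 - P(\bar{A} \cap \bar{B} \cap \bar{C})$ .... (2)

If A, B and C are mutually exclusive, then equation (1) can be written as

$P(A \cup B \cup C) = P(A) + P(B) + P(C)$ .... (3)

If A1, A2, ...... An are n events of a sample space S, the respective equations (1), (2) and (3) can be modified as

$P(A_1 \cup A_2 \cup \cdots \cup A_n) = \sum P(A_i) - \sum\sum P(A_i \cap A_j) + \sum\sum\sum P(A_i \cap A_j \cap A_k) - \cdots + (-1)^{n-1} P(A_1 \cap A_2 \cap \cdots \cap A_n)$ (i < j < k, etc.)

$P(A_1 \cup A_2 \cup \cdots \cup A_n) = 1 - P(\bar{A}_1 \cap \bar{A}_2 \cap \cdots \cap \bar{A}_n)$

$P(A_1 \cup A_2 \cup \cdots \cup A_n) = \sum_{i=1}^{n} P(A_i)$ (if the events are mutually exclusive)

4. The probability of occurrence of at least two of the three events can be written as

$P[(A \cap B) \cup (B \cap C) \cup (A \cap C)] = P(A \cap B) + P(B \cap C) + P(A \cap C) - 3P(A \cap B \cap C) + P(A \cap B \cap C)$
$= P(A \cap B) + P(B \cap C) + P(A \cap C) - 2P(A \cap B \cap C)$

5. The probability of occurrence of exactly two of the three events can be written as

$P[(A \cap B \cap \bar{C}) \cup (A \cap \bar{B} \cap C) \cup (\bar{A} \cap B \cap C)] = P[(A \cap B) \cup (B \cap C) \cup (A \cap C)] - P(A \cap B \cap C)$
$= P(A \cap B) + P(B \cap C) + P(A \cap C) - 3P(A \cap B \cap C)$

6. The probability of occurrence of exactly one of the three events can be written as

$P[(A \cap \bar{B} \cap \bar{C}) \cup (\bar{A} \cap B \cap \bar{C}) \cup (\bar{A} \cap \bar{B} \cap C)] = P(\text{at least one of the three events occurs}) - P(\text{at least two of the three events occur})$
$= P(A) + P(B) + P(C) - 2P(A \cap B) - 2P(B \cap C) - 2P(A \cap C) + 3P(A \cap B \cap C)$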
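The formulae for at least one, at least two, exactly two and exactly one of three events can be checked by direct counting. The following Python sketch is only an illustration: the sample space (two fair dice) and the three events A, B and C are assumptions chosen for the check and do not come from the text.

```python
from fractions import Fraction
from itertools import product

# Assumed sample space for illustration: two fair dice, 36 equally likely outcomes.
S = set(product(range(1, 7), repeat=2))

def prob(event):
    """Classical probability n(E)/n(S) for a subset E of S."""
    return Fraction(len(event), len(S))

# Three illustrative events (hypothetical, chosen only for this check).
A = {s for s in S if s[0] % 2 == 0}      # first die shows an even number
B = {s for s in S if s[0] + s[1] >= 9}   # sum of spots is at least 9
C = {s for s in S if s[0] == s[1]}       # doublet

pairs = prob(A & B) + prob(B & C) + prob(A & C)
triple = prob(A & B & C)

# Formulae of corollaries 3 to 6.
at_least_one = prob(A) + prob(B) + prob(C) - pairs + triple
at_least_two = pairs - 2 * triple
exactly_two = pairs - 3 * triple
exactly_one = prob(A) + prob(B) + prob(C) - 2 * pairs + 3 * triple

def count_hits(s):
    """Number of the events A, B, C that contain the outcome s."""
    return (s in A) + (s in B) + (s in C)

# Direct counting: probability that exactly k of the three events occur.
direct = {k: prob({s for s in S if count_hits(s) == k}) for k in range(4)}

print(at_least_one, at_least_two, exactly_two, exactly_one)
```

Each formula value agrees with the corresponding figure obtained by counting outcomes, for example `exactly_one == direct[1]`.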
Example 23: In a group of 1,000 persons, there are 650 who can speak Hindi, 400 can speak English and 150 can speak both Hindi and English. If a person is selected at random, what is the probability that he speaks (i) Hindi only, (ii) English only, (iii) only one of the two languages, (iv) at least one of the two languages?

Solution: Let A denote the event that a person selected at random speaks Hindi and B denote the event that he speaks English. Thus, we have n(A) = 650, n(B) = 400, $n(A \cap B) = 150$ and n(S) = 1000, where n(A), n(B), etc. denote the number of persons belonging to the respective event.

(i) The probability that a person selected at random speaks Hindi only, is given by

$P(A \cap \bar{B}) = \frac{n(A) - n(A \cap B)}{n(S)} = \frac{650 - 150}{1000} = \frac{1}{2}$

(ii) The probability that a person selected at random speaks English only, is given by

$P(\bar{A} \cap B) = \frac{n(B) - n(A \cap B)}{n(S)} = \frac{400 - 150}{1000} = \frac{1}{4}$

(iii) The probability that a person selected at random speaks only one of the languages, is given by

$P[(A \cap \bar{B}) \cup (\bar{A} \cap B)] = P(A) + P(B) - 2P(A \cap B)$ (see corollary 2)

$= \frac{n(A) + n(B) - 2n(A \cap B)}{n(S)} = \frac{650 + 400 - 300}{1000} = \frac{3}{4}$

(iv) The probability that a person selected at random speaks at least one of the languages, is given by

$P(A \cup B) = \frac{650 + 400 - 150}{1000} = \frac{9}{10}$

Alternative Method: The above probabilities can easily be computed by the following nine-square table:

             A      $\bar{A}$    Total
B           150     250      400
$\bar{B}$   500     100      600
Total       650     350     1000

From the above table, we can write

$P(A \cap \bar{B}) = \frac{500}{1000} = \frac{1}{2}$, $P(\bar{A} \cap B) = \frac{250}{1000} = \frac{1}{4}$

$P[(A \cap \bar{B}) \cup (\bar{A} \cap B)] = \frac{500 + 250}{1000} = \frac{3}{4}$, $P(A \cup B) = 1 - P(\bar{A} \cap \bar{B}) = 1 - \frac{100}{1000} = \frac{9}{10}$.
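The nine-square table of Example 23 can also be built mechanically from n(A), n(B), n(A ∩ B) and n(S). The sketch below is a minimal illustration; the function name `nine_square` and the dictionary keys are hypothetical names introduced here, not notation from the text.

```python
from fractions import Fraction

def nine_square(n_a, n_b, n_ab, n_s):
    """Return the four inner cell counts of the nine-square table."""
    return {
        "A_and_B": n_ab,                    # speaks both languages
        "A_only": n_a - n_ab,               # A occurs, B does not
        "B_only": n_b - n_ab,               # B occurs, A does not
        "neither": n_s - n_a - n_b + n_ab,  # neither A nor B occurs
    }

n_s = 1000
cells = nine_square(n_a=650, n_b=400, n_ab=150, n_s=n_s)

hindi_only = Fraction(cells["A_only"], n_s)
english_only = Fraction(cells["B_only"], n_s)
only_one = hindi_only + english_only
at_least_one = 1 - Fraction(cells["neither"], n_s)

print(hindi_only, english_only, only_one, at_least_one)  # 1/2 1/4 3/4 9/10
```

The same function can be reused for the other two-event examples of this lesson by changing the four counts.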
Example 24: What is the probability of drawing a black card or a king from a well-shuffled pack of playing cards?

Solution: There are 52 cards in a pack, ∴ n(S) = 52. Let A be the event that the drawn card is black and B be the event that it is a king. We have to find $P(A \cup B)$.

Since there are 26 black cards, 4 kings and two black kings in a pack, we have n(A) = 26, n(B) = 4 and $n(A \cap B) = 2$. Thus, $P(A \cup B) = \frac{26 + 4 - 2}{52} = \frac{7}{13}$

Alternative Method: The given information can be written in the form of the following table:

             B     $\bar{B}$    Total
A            2      24       26
$\bar{A}$    2      24       26
Total        4      48       52

From the above, we can write $P(A \cup B) = 1 - P(\bar{A} \cap \bar{B}) = 1 - \frac{24}{52} = \frac{7}{13}$
Example 25: A pair of unbiased dice is thrown. Find the probability that (i) the sum of spots is either 5 or 10, (ii) either there is a doublet or a sum less than 6.

Solution: Since the first die can be thrown in 6 ways and the second also in 6 ways, both can be thrown in 36 ways (fundamental principle of counting). Since both the dice are given to be unbiased, the 36 elementary outcomes are equally likely.

(i) Let A be the event that the sum of spots is 5 and B be the event that their sum is 10. Thus, we can write A = {(1, 4), (2, 3), (3, 2), (4, 1)} and B = {(4, 6), (5, 5), (6, 4)}. We note that $A \cap B = \phi$, i.e., A and B are mutually exclusive.

∴ By addition theorem, we have $P(A \cup B) = P(A) + P(B) = \frac{4}{36} + \frac{3}{36} = \frac{7}{36}$.

(ii) Let C be the event that there is a doublet and D be the event that the sum is less than 6. Thus, we can write C = {(1, 1), (2, 2), (3, 3), (4, 4), (5, 5), (6, 6)} and D = {(1, 1), (1, 2), (1, 3), (1, 4), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (4, 1)}. Further, $C \cap D$ = {(1, 1), (2, 2)}.

By addition theorem, we have $P(C \cup D) = \frac{6}{36} + \frac{10}{36} - \frac{2}{36} = \frac{7}{18}$.

Alternative Methods:
(i) It is given that n(A) = 4, n(B) = 3 and n(S) = 36. Also $n(A \cap B) = 0$. Thus, the corresponding nine-square table can be written as follows:

             B     $\bar{B}$    Total
A            0       4        4
$\bar{A}$    3      29       32
Total        3      33       36

From the above table, we have $P(A \cup B) = 1 - \frac{29}{36} = \frac{7}{36}$.

(ii) Here n(C) = 6, n(D) = 10, $n(C \cap D) = 2$ and n(S) = 36, so the nine-square table is:

             D     $\bar{D}$    Total
C            2       4        6
$\bar{C}$    8      22       30
Total       10      26       36

Thus, $P(C \cup D) = 1 - \frac{22}{36} = \frac{7}{18}$.
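The two answers of Example 25 can be confirmed by listing all 36 outcomes. The following Python sketch is a simple enumeration; the helper name `prob` is introduced here only for illustration.

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes of throwing a pair of unbiased dice.
S = list(product(range(1, 7), repeat=2))

def prob(predicate):
    """Probability of the event described by predicate, by counting outcomes."""
    favourable = sum(1 for s in S if predicate(s))
    return Fraction(favourable, len(S))

# (i) the sum of spots is either 5 or 10
p_sum_5_or_10 = prob(lambda s: sum(s) in (5, 10))

# (ii) either there is a doublet or the sum is less than 6
p_doublet_or_small = prob(lambda s: s[0] == s[1] or sum(s) < 6)

print(p_sum_5_or_10, p_doublet_or_small)  # 7/36 7/18
```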
Example 26: Two unbiased coins are tossed. Let A1 be the event that the first coin shows a tail and A2 be the event that the second coin shows a head. Are A1 and A2 mutually exclusive? Obtain $P(A_1 \cap A_2)$ and $P(A_1 \cup A_2)$. Further, let A1 be the event that both coins show heads and A2 be the event that both show tails. Are A1 and A2 mutually exclusive? Find $P(A_1 \cap A_2)$ and $P(A_1 \cup A_2)$.

Solution: The sample space of the experiment is S = {(H, H), (H, T), (T, H), (T, T)}

(i) A1 = {(T, H), (T, T)} and A2 = {(H, H), (T, H)}. Also $A_1 \cap A_2$ = {(T, H)}. Since $A_1 \cap A_2 \neq \phi$, A1 and A2 are not mutually exclusive. Further, the coins are given to be unbiased, therefore, all the elementary events are equally likely.

∴ $P(A_1) = \frac{2}{4} = \frac{1}{2}$, $P(A_2) = \frac{2}{4} = \frac{1}{2}$, $P(A_1 \cap A_2) = \frac{1}{4}$

Thus, $P(A_1 \cup A_2) = \frac{1}{2} + \frac{1}{2} - \frac{1}{4} = \frac{3}{4}$.

(ii) When both the coins show heads, A1 = {(H, H)}. When both the coins show tails, A2 = {(T, T)}. Here $A_1 \cap A_2 = \phi$, ∴ A1 and A2 are mutually exclusive and $P(A_1 \cap A_2) = 0$.

Thus, $P(A_1 \cup A_2) = \frac{1}{4} + \frac{1}{4} = \frac{1}{2}$.

Alternatively, the problem can also be attempted by making the following nine-square tables for the two cases:

(i)
               A1    $\bar{A}_1$   Total
A2              1       1        2
$\bar{A}_2$     1       1        2
Total           2       2        4

(ii)
               A1    $\bar{A}_1$   Total
A2              0       1        1
$\bar{A}_2$     1       2        3
Total           1       3        4
Theorem 5: Multiplication or Compound Probability Theorem: A compound event is the result of the simultaneous occurrence of two or more events. For convenience, we assume that there are two events, however, the results can be easily generalised. The probability of the compound event would depend upon whether the events are independent or not. Thus, we shall discuss two theorems; (a) Conditional Probability Theorem, and (b) Multiplicative Theorem for Independent Events. (a) Conditional Probability Theorem: For any two events A and B in a sample space S, the probability of their simultaneous occurrence, is given by
$P(A \cap B) = P(A) \cdot P(B/A)$

or equivalently $= P(B) \cdot P(A/B)$

Here, P(B/A) is the conditional probability of B given that A has already occurred. A similar interpretation can be given to the term P(A/B).

Proof: Let all the outcomes of the random experiment be equally likely. Therefore,

$P(A \cap B) = \frac{n(A \cap B)}{n(S)}$

For the event B/A, the sample space is the set of elements in A and, out of these, the number of cases favourable to B is given by $n(A \cap B)$.

∴ $P(B/A) = \frac{n(A \cap B)}{n(A)}$

If we divide the numerator and denominator of the above expression by n(S), we get

$P(B/A) = \frac{n(A \cap B)/n(S)}{n(A)/n(S)} = \frac{P(A \cap B)}{P(A)}$

or $P(A \cap B) = P(A) \cdot P(B/A)$.
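The other result can also be shown in a similar way.

Note: To avoid mathematical complications, we have assumed that the elementary events are equally likely. However, the above results will hold true even for the cases where the elementary events are not equally likely.

(b) Multiplicative Theorem for Independent Events: If A and B are independent, the probability of their simultaneous occurrence is given by $P(A \cap B) = P(A) \cdot P(B)$.

Proof: We can write $A = (A \cap B) \cup (A \cap \bar{B})$.

Since $(A \cap B)$ and $(A \cap \bar{B})$ are mutually exclusive, we have

$P(A) = P(A \cap B) + P(A \cap \bar{B}) = P(B) \cdot P(A/B) + P(\bar{B}) \cdot P(A/\bar{B})$

If A and B are independent, the occurrence or non-occurrence of B does not affect the probability of A, i.e., $P(A/B) = P(A/\bar{B})$.

∴ $P(A) = P(A/B)[P(B) + P(\bar{B})] = P(A/B)$

Substituting this value in the conditional probability theorem, we get $P(A \cap B) = P(B) \cdot P(A/B) = P(A) \cdot P(B)$.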
Corollaries:
1. (i) If A and B are mutually exclusive and P(A).P(B) > 0, then they cannot be independent since $P(A \cap B) = 0$.
(ii) If A and B are independent and P(A).P(B) > 0, then they cannot be mutually exclusive since $P(A \cap B) > 0$.

2. Generalisation of Multiplicative Theorem: If A, B and C are three events, then

$P(A \cap B \cap C) = P(A) \cdot P(B/A) \cdot P(C/A \cap B)$

3. If A and B are independent, then A and $\bar{B}$, $\bar{A}$ and B, $\bar{A}$ and $\bar{B}$ are also independent. We can write

$P(A \cap \bar{B}) = P(A) - P(A \cap B)$ (by theorem 3)
$= P(A) - P(A) \cdot P(B) = P(A)[1 - P(B)] = P(A) \cdot P(\bar{B})$,

which shows that A and $\bar{B}$ are independent. The other results can also be shown in a similar way.

4. The probability of occurrence of at least one of the events A1, A2, ...... An, is given by

$P(A_1 \cup A_2 \cup \cdots \cup A_n) = 1 - P(\bar{A}_1 \cap \bar{A}_2 \cap \cdots \cap \bar{A}_n)$.

If A1, A2, ...... An are independent then their complements will also be independent, therefore, the above result can be modified as

$P(A_1 \cup A_2 \cup \cdots \cup A_n) = 1 - P(\bar{A}_1) \cdot P(\bar{A}_2) \cdots P(\bar{A}_n)$.
Pair-wise and Mutual Independence

Three events A, B and C are said to be mutually independent if the following conditions are simultaneously satisfied:

$P(A \cap B) = P(A) \cdot P(B)$, $P(B \cap C) = P(B) \cdot P(C)$, $P(A \cap C) = P(A) \cdot P(C)$ and $P(A \cap B \cap C) = P(A) \cdot P(B) \cdot P(C)$.
If the last condition is not satisfied, the events are said to be pair-wise independent. From the above we note that mutually independent events will always be pair-wise independent but not vice-versa. Example 27: Among 1,000 applicants for admission to M.A. economics course in a University, 600 were economics graduates and 400 were non-economics graduates; 30% of economics graduate applicants and 5% of non-economics graduate applicants obtained admission. If an applicant selected at random is found to have been given admission, what is the probability that he/she is an economics graduate?
Solution: Let A be the event that the applicant selected at random is an economics graduate and B be the event that he/she is given admission. We are given n(S) = 1000, n(A) = 600, $n(\bar{A}) = 400$.

Also, $n(B) = \frac{600 \times 30}{100} + \frac{400 \times 5}{100} = 200$ and $n(A \cap B) = \frac{600 \times 30}{100} = 180$

Thus, the required probability is given by $P(A/B) = \frac{n(A \cap B)}{n(B)} = \frac{180}{200} = \frac{9}{10}$
Example 28: A bag contains 2 black and 3 white balls. Two balls are drawn at random one after the other without replacement. Obtain the probability that (a) Second ball is black given that the first is white, (b) First ball is white given that the second is black.

Solution: First ball can be drawn in any one of the 5 ways and then a second ball can be drawn in any one of the 4 ways. Therefore, two balls can be drawn in 5 × 4 = 20 ways. Thus, n(S) = 20.

(a) Let A1 be the event that first ball is white and A2 be the event that second is black. We want to find $P(A_2/A_1)$.

First white ball can be drawn in any of the 3 ways and then a second ball can be drawn in any of the 4 ways, ∴ n(A1) = 3 × 4 = 12. Further, first white ball can be drawn in any of the 3 ways and then a black ball can be drawn in any of the 2 ways, ∴ $n(A_1 \cap A_2) = 3 \times 2 = 6$.

Thus, $P(A_2/A_1) = \frac{n(A_1 \cap A_2)}{n(A_1)} = \frac{6}{12} = \frac{1}{2}$.

(b) Here we have to find $P(A_1/A_2)$. The second black ball can be drawn in the following two mutually exclusive ways:
(i) First ball is white and second is black, or
(ii) both the balls are black.

Thus, n(A2) = 3 × 2 + 2 × 1 = 8, ∴ $P(A_1/A_2) = \frac{n(A_1 \cap A_2)}{n(A_2)} = \frac{6}{8} = \frac{3}{4}$.

Alternative Method: The given problem can be summarised into the following nine-square table:

               A2    $\bar{A}_2$   Total
A1              6       6       12
$\bar{A}_1$     2       6        8
Total           8      12       20

The required probabilities can be directly written from the above table.
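The conditional probabilities of Example 28 can be verified by listing the 20 ordered draws. In the sketch below the ball labels B1, B2, W1, W2, W3 and the helper `conditional` are hypothetical names used only for this check.

```python
from fractions import Fraction
from itertools import permutations

# 2 black and 3 white balls; labels are illustrative.
balls = ["B1", "B2", "W1", "W2", "W3"]

# Ordered draws of two balls without replacement: 5 x 4 = 20 outcomes.
S = list(permutations(balls, 2))

A1 = {s for s in S if s[0].startswith("W")}  # first ball is white
A2 = {s for s in S if s[1].startswith("B")}  # second ball is black

def conditional(event, given):
    """P(event / given) = n(event and given) / n(given) for equally likely outcomes."""
    return Fraction(len(event & given), len(given))

p_a2_given_a1 = conditional(A2, A1)
p_a1_given_a2 = conditional(A1, A2)

print(len(S), len(A1), len(A2), len(A1 & A2))  # 20 12 8 6
print(p_a2_given_a1, p_a1_given_a2)            # 1/2 3/4
```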
Example 29: Two unbiased dice are tossed. Let w denote the number on the first die and r denote the number on the second die. Let A be the event that $w + r \leq 4$ and B be the event that $w + r \leq 3$. Are A and B independent?

Solution: The sample space of this experiment consists of 36 elements, i.e., n(S) = 36. Also, A = {(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (3, 1)} and B = {(1, 1), (1, 2), (2, 1)}. From the above, we can write

$P(A) = \frac{6}{36} = \frac{1}{6}$, $P(B) = \frac{3}{36} = \frac{1}{12}$

Also, $A \cap B$ = {(1, 1), (1, 2), (2, 1)}, ∴ $P(A \cap B) = \frac{3}{36} = \frac{1}{12}$

Since $P(A \cap B) \neq P(A) \cdot P(B)$, A and B are not independent.
Example 30: It is known that 40% of the students in a certain college are girls and 50% of the students are above the median height. If 2/3 of the boys are above median height, what is the probability that a randomly selected student who is below the median height is a girl? Solution: Let A be the event that a randomly selected student is a girl and B be the event that he/she is above median height. The given information can be summarised into the following table :
             A     $\bar{A}$    Total
B           10      40       50
$\bar{B}$   30      20       50
Total       40      60      100

From the above table, we can write $P(A/\bar{B}) = \frac{30}{50} = 0.6$.
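Example 31: A problem in statistics is given to three students A, B and C, whose chances of solving it independently are $\frac{1}{2}$, $\frac{1}{3}$ and $\frac{1}{4}$ respectively. Find the probability that
(a) the problem is solved.
(b) at least two of them are able to solve the problem.
(c) exactly two of them are able to solve the problem.
(d) exactly one of them is able to solve the problem.

Solution: Let A be the event that student A solves the problem. Similarly, we can define the events B and C. Further, A, B and C are given to be independent.

(a) The problem is solved if at least one of them is able to solve it. This probability is given by

$P(A \cup B \cup C) = 1 - P(\bar{A}) \cdot P(\bar{B}) \cdot P(\bar{C}) = 1 - \frac{1}{2} \times \frac{2}{3} \times \frac{3}{4} = \frac{3}{4}$

(b) The probability that at least two of them are able to solve the problem is given by

$P(A)P(B) + P(B)P(C) + P(A)P(C) - 2P(A)P(B)P(C)$
$= \frac{1}{2} \times \frac{1}{3} + \frac{1}{3} \times \frac{1}{4} + \frac{1}{2} \times \frac{1}{4} - 2 \times \frac{1}{2} \times \frac{1}{3} \times \frac{1}{4} = \frac{7}{24}$

(c) The probability that exactly two of them are able to solve the problem is given by

$P(A)P(B) + P(B)P(C) + P(A)P(C) - 3P(A)P(B)P(C) = \frac{1}{6} + \frac{1}{12} + \frac{1}{8} - \frac{1}{8} = \frac{1}{4}$.

(d) The required probability is given by

$P[(A \cap \bar{B} \cap \bar{C}) \cup (\bar{A} \cap B \cap \bar{C}) \cup (\bar{A} \cap \bar{B} \cap C)]$
$= P(A) + P(B) + P(C) - 2P(A)P(B) - 2P(B)P(C) - 2P(A)P(C) + 3P(A)P(B)P(C)$
$= \frac{1}{2} + \frac{1}{3} + \frac{1}{4} - \frac{1}{3} - \frac{1}{6} - \frac{1}{4} + \frac{1}{8} = \frac{11}{24}$.

Note that the formulae used in (a), (b), (c) and (d) above are the modified forms of corollaries 3, 4, 5 and 6 (following theorem 4) respectively.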
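The four answers of Example 31 can be cross-checked by listing the eight solve/fail combinations of the three independent students and weighting each combination by the product of the individual probabilities. The function name `solved_count_distribution` is a hypothetical helper introduced for this sketch.

```python
from fractions import Fraction
from itertools import product

# Chances of solving the problem, as given in Example 31.
p = {"A": Fraction(1, 2), "B": Fraction(1, 3), "C": Fraction(1, 4)}

def solved_count_distribution(p):
    """Distribution of the number of students who solve the problem.

    Independence is assumed, so the probability of each solve/fail
    pattern is the product of the individual probabilities.
    """
    dist = {}
    names = list(p)
    for outcome in product([True, False], repeat=len(names)):
        weight = Fraction(1)
        for name, solved in zip(names, outcome):
            weight *= p[name] if solved else 1 - p[name]
        k = sum(outcome)  # number of students who solved it
        dist[k] = dist.get(k, Fraction(0)) + weight
    return dist

dist = solved_count_distribution(p)

problem_solved = 1 - dist[0]          # (a) at least one solves
at_least_two = dist[2] + dist[3]      # (b)
exactly_two = dist[2]                 # (c)
exactly_one = dist[1]                 # (d)

print(problem_solved, at_least_two, exactly_two, exactly_one)  # 3/4 7/24 1/4 11/24
```

As a consistency check, the probabilities of exactly one and at least two add up to the probability that the problem is solved: 11/24 + 7/24 = 3/4.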
Example 32: A bag contains 2 red and 1 black ball and another bag contains 2 red and 2 black balls. One ball is selected at random from each bag. Find the probability of drawing (a) at least a red ball, (b) a black ball from the second bag given that ball from the first is red; (c) show that the event of drawing a red ball from the first bag and the event of drawing a red ball from the second bag are independent. Solution: Let A1 be the event of drawing a red ball from the first bag and A2 be the event of drawing a red ball from the second bag. Thus, we can write:
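$n(A_1 \cap A_2) = 2 \times 2 = 4$, $n(A_1 \cap \bar{A}_2) = 2 \times 2 = 4$, $n(\bar{A}_1 \cap A_2) = 1 \times 2 = 2$, $n(\bar{A}_1 \cap \bar{A}_2) = 1 \times 2 = 2$

Also, $n(S) = n(A_1 \cap A_2) + n(A_1 \cap \bar{A}_2) + n(\bar{A}_1 \cap A_2) + n(\bar{A}_1 \cap \bar{A}_2) = 12$

Writing the given information in the form of a nine-square table, we get

               A2    $\bar{A}_2$   Total
A1              4       4        8
$\bar{A}_1$     2       2        4
Total           6       6       12

(a) The probability of drawing at least a red ball is given by

$P(A_1 \cup A_2) = 1 - \frac{n(\bar{A}_1 \cap \bar{A}_2)}{n(S)} = 1 - \frac{2}{12} = \frac{5}{6}$

(b) We have to find $P(\bar{A}_2/A_1)$

$P(\bar{A}_2/A_1) = \frac{n(A_1 \cap \bar{A}_2)}{n(A_1)} = \frac{4}{8} = \frac{1}{2}$

(c) A1 and A2 will be independent if $P(A_1 \cap A_2) = P(A_1) \cdot P(A_2)$.

Now $P(A_1 \cap A_2) = \frac{n(A_1 \cap A_2)}{n(S)} = \frac{4}{12} = \frac{1}{3}$

and $P(A_1) \cdot P(A_2) = \frac{n(A_1)}{n(S)} \cdot \frac{n(A_2)}{n(S)} = \frac{8}{12} \times \frac{6}{12} = \frac{1}{3}$

Hence, A1 and A2 are independent.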
Example 33: An urn contains 3 red and 2 white balls. 2 balls are drawn at random. Find the probability that either both of them are red or both are white. Solution: Let A be the event that both the balls are red and B be the event that both the balls are white. Thus, we can write
$n(S) = {}^{5}C_2 = 10$, $n(A) = {}^{3}C_2 = 3$, $n(B) = {}^{2}C_2 = 1$, also $n(A \cap B) = 0$

∴ The required probability is $P(A \cup B) = \frac{n(A) + n(B)}{n(S)} = \frac{3 + 1}{10} = \frac{2}{5}$
Example 34: A bag contains 10 red and 8 black balls. Two balls are drawn at random. Find the probability that (a) both of them are red, (b) one is red and the other is black. Solution: Let A be the event that both the balls are red and B be the event that one is red and the other is black. Two balls can be drawn from 18 balls in 18 C2 equally likely ways.
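∴ $n(S) = {}^{18}C_2 = \frac{18!}{2!\,16!} = 153$

(a) Two red balls can be drawn from 10 red balls in ${}^{10}C_2$ ways.

∴ $n(A) = {}^{10}C_2 = \frac{10!}{2!\,8!} = 45$

Thus, $P(A) = \frac{n(A)}{n(S)} = \frac{45}{153} = \frac{5}{17}$

(b) One red ball can be drawn in ${}^{10}C_1$ ways and one black ball can be drawn in ${}^{8}C_1$ ways.

∴ $n(B) = {}^{10}C_1 \times {}^{8}C_1 = 10 \times 8 = 80$. Thus, $P(B) = \frac{80}{153}$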
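Example 35: Five cards are drawn in succession and without replacement from an ordinary deck of 52 well-shuffled cards:
(a) What is the probability that there will be no ace among the five cards?
(b) What is the probability that first three cards are aces and the last two cards are kings?
(c) What is the probability that only first three cards are aces?
(d) What is the probability that an ace will appear only on the fifth draw?

Solution:
(a) $P(\text{there is no ace}) = \frac{48 \times 47 \times 46 \times 45 \times 44}{52 \times 51 \times 50 \times 49 \times 48} = 0.66$

(b) $P(\text{first three cards are aces and the last two are kings}) = \frac{4 \times 3 \times 2 \times 4 \times 3}{52 \times 51 \times 50 \times 49 \times 48} = 0.0000009$

(c) $P(\text{only first three cards are aces}) = \frac{4 \times 3 \times 2 \times 48 \times 47}{52 \times 51 \times 50 \times 49 \times 48} = 0.00017$

(d) $P(\text{an ace appears only on the fifth draw}) = \frac{48 \times 47 \times 46 \times 45 \times 4}{52 \times 51 \times 50 \times 49 \times 48} = 0.0599$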
Example 36: Two cards are drawn in succession from a pack of 52 well-shuffled cards. Find the probability that:
(a) Only first card is a king.
(b) First card is jack of diamond or a king.
(c) At least one card is a picture card.
(d) Not more than one card is a picture card.
(e) Cards are not of the same suit.
(f) Second card is not a spade.
(g) Second card is not a spade given that first is a spade.
(h) The cards are aces or diamonds or both.

Solution:
(a) $P(\text{only first card is a king}) = \frac{4 \times 48}{52 \times 51} = \frac{16}{221}$.

(b) $P(\text{first card is a jack of diamond or a king}) = \frac{5 \times 51}{52 \times 51} = \frac{5}{52}$.

(c) $P(\text{at least one card is a picture card}) = 1 - \frac{40 \times 39}{52 \times 51} = \frac{7}{17}$.

(d) $P(\text{not more than one card is a picture card}) = \frac{40 \times 39}{52 \times 51} + \frac{12 \times 40}{52 \times 51} + \frac{40 \times 12}{52 \times 51} = \frac{210}{221}$.

(e) $P(\text{cards are not of the same suit}) = \frac{52 \times 39}{52 \times 51} = \frac{13}{17}$.

(f) $P(\text{second card is not a spade}) = \frac{13 \times 39}{52 \times 51} + \frac{39 \times 38}{52 \times 51} = \frac{3}{4}$.

(g) $P(\text{second card is not a spade given that first is a spade}) = \frac{39}{51} = \frac{13}{17}$.

(h) $P(\text{the cards are aces or diamonds or both}) = \frac{16 \times 15}{52 \times 51} = \frac{20}{221}$.
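Several parts of Example 36 can be checked by listing all 52 × 51 ordered pairs of cards. The sketch below assumes, as the solution does, that the picture cards are the jacks, queens and kings (12 cards); the rank and suit labels and the helper names are illustrative only.

```python
from fractions import Fraction
from itertools import permutations, product

ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["spade", "heart", "diamond", "club"]
deck = list(product(ranks, suits))  # each card is a (rank, suit) pair

# Two cards drawn in succession without replacement: 52 x 51 ordered pairs.
S = list(permutations(deck, 2))

def prob(predicate):
    """Probability of an event on the ordered pair (first, second)."""
    favourable = sum(1 for first, second in S if predicate(first, second))
    return Fraction(favourable, len(S))

def is_picture(card):
    # Assumption: picture cards are J, Q and K only.
    return card[0] in ("J", "Q", "K")

only_first_king = prob(lambda f, s: f[0] == "K" and s[0] != "K")           # part (a)
at_least_one_picture = prob(lambda f, s: is_picture(f) or is_picture(s))   # part (c)
different_suits = prob(lambda f, s: f[1] != s[1])                          # part (e)
second_not_spade = prob(lambda f, s: s[1] != "spade")                      # part (f)

print(only_first_king, at_least_one_picture, different_suits, second_not_spade)
# 16/221 7/17 13/17 3/4
```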
Example 37: The odds are 9 : 7 against a person A, who is now 35 years of age, living till he is 65 and 3 : 2 against a person B, now 45 years of age, living till he is 75. Find the chance that at least one of these persons will be alive 30 years hence.

Solution: Note: If a is the number of cases favourable to an event A and $\bar{a}$ is the number of cases favourable to its complement event ($a + \bar{a} = n$), then odds in favour of A are $a : \bar{a}$ and odds against A are $\bar{a} : a$.

Obviously $P(A) = \frac{a}{a + \bar{a}}$ and $P(\bar{A}) = \frac{\bar{a}}{a + \bar{a}}$.

Let A be the event that person A will be alive 30 years hence and B be the event that person B will be alive 30 years hence.

∴ $P(A) = \frac{7}{9 + 7} = \frac{7}{16}$ and $P(B) = \frac{2}{3 + 2} = \frac{2}{5}$

We have to find $P(A \cup B)$. Note that A and B are independent.

∴ $P(A \cup B) = \frac{7}{16} + \frac{2}{5} - \frac{7}{16} \times \frac{2}{5} = \frac{53}{80}$

Alternative Method:

$P(A \cup B) = 1 - P(\bar{A}) \cdot P(\bar{B}) = 1 - \frac{9}{16} \times \frac{3}{5} = \frac{53}{80}$
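The conversion from odds to probability used in Example 37, and the two routes to the final answer, can be written as a short Python sketch. The function name `prob_from_odds_against` is a hypothetical helper; the survival of the two persons is treated as independent, as in the worked solution.

```python
from fractions import Fraction

def prob_from_odds_against(against, in_favour):
    """Odds of against : in_favour against an event give P = in_favour / (against + in_favour)."""
    return Fraction(in_favour, against + in_favour)

p_a = prob_from_odds_against(9, 7)  # odds 9 : 7 against A
p_b = prob_from_odds_against(3, 2)  # odds 3 : 2 against B

# At least one alive, by the addition theorem (with independence) ...
by_addition = p_a + p_b - p_a * p_b
# ... and by the complement rule.
by_complement = 1 - (1 - p_a) * (1 - p_b)

print(p_a, p_b, by_addition, by_complement)  # 7/16 2/5 53/80 53/80
```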
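Example 38: If A and B are two events such that $P(A) = \frac{2}{3}$, $P(\bar{A} \cap B) = \frac{1}{6}$ and $P(A \cap B) = \frac{1}{3}$, find P(B), $P(A \cup B)$, P(A/B), P(B/A), $P(\bar{A} \cup B)$, $P(\bar{A} \cap \bar{B})$ and $P(\bar{B})$.

Also examine whether the events A and B are: (a) Equally likely, (b) Exhaustive, (c) Mutually exclusive and (d) Independent.

Solution: The probabilities of various events are obtained as follows:

$P(B) = P(\bar{A} \cap B) + P(A \cap B) = \frac{1}{6} + \frac{1}{3} = \frac{1}{2}$

$P(A \cup B) = P(A) + P(B) - P(A \cap B) = \frac{2}{3} + \frac{1}{2} - \frac{1}{3} = \frac{5}{6}$

$P(A/B) = \frac{P(A \cap B)}{P(B)} = \frac{1}{3} \times \frac{2}{1} = \frac{2}{3}$

$P(B/A) = \frac{P(A \cap B)}{P(A)} = \frac{1}{3} \times \frac{3}{2} = \frac{1}{2}$

$P(\bar{A} \cup B) = P(\bar{A}) + P(B) - P(\bar{A} \cap B) = \frac{1}{3} + \frac{1}{2} - \frac{1}{6} = \frac{2}{3}$

$P(\bar{A} \cap \bar{B}) = 1 - P(A \cup B) = 1 - \frac{5}{6} = \frac{1}{6}$

$P(\bar{B}) = 1 - P(B) = 1 - \frac{1}{2} = \frac{1}{2}$

(a) Since $P(A) \neq P(B)$, A and B are not equally likely events.
(b) Since $P(A \cup B) \neq 1$, A and B are not exhaustive events.
(c) Since $P(A \cap B) \neq 0$, A and B are not mutually exclusive.
(d) Since $P(A) \cdot P(B) = P(A \cap B)$, A and B are independent events.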
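Example 39: Two players A and B toss an unbiased die alternately. He who first throws a six wins the game. If A begins, what is the probability that B wins the game?

Solution: Let Ai and Bi be the respective events that A and B throw a six in the ith toss, i = 1, 2, .... . B will win the game if any one of the following mutually exclusive events occurs: $\bar{A}_1 B_1$ or $\bar{A}_1 \bar{B}_1 \bar{A}_2 B_2$ or $\bar{A}_1 \bar{B}_1 \bar{A}_2 \bar{B}_2 \bar{A}_3 B_3$, etc.

Thus, $P(\text{B wins}) = \frac{5}{6} \times \frac{1}{6} + \frac{5}{6} \times \frac{5}{6} \times \frac{5}{6} \times \frac{1}{6} + \frac{5}{6} \times \frac{5}{6} \times \frac{5}{6} \times \frac{5}{6} \times \frac{5}{6} \times \frac{1}{6} + ......$

$= \frac{5}{36}\left[1 + \left(\frac{5}{6}\right)^2 + \left(\frac{5}{6}\right)^4 + ......\right] = \frac{5}{36} \times \frac{1}{1 - \left(\frac{5}{6}\right)^2} = \frac{5}{36} \times \frac{36}{11} = \frac{5}{11}$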
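The infinite series in Example 39 is a geometric series with first term (5/6)(1/6) and common ratio (5/6)². The sketch below compares the closed form with a truncated sum; the cut-off of 200 terms is an arbitrary choice made for illustration.

```python
from fractions import Fraction

p = Fraction(1, 6)  # probability of throwing a six
q = 1 - p           # probability of not throwing a six

# B wins on his (k+1)-th toss if the first 2k+1 tosses show no six
# and the next one shows a six: q**(2k+1) * p for k = 0, 1, 2, ...
closed_form = q * p / (1 - q ** 2)

# Truncated sum of the same series in floating point (200 terms).
partial = sum(float(q) ** (2 * k + 1) * float(p) for k in range(200))

print(closed_form)  # 5/11
print(abs(partial - float(closed_form)) < 1e-12)  # True
```

By the same argument, the probability that A wins is p/(1 − q²) = 6/11, so the two probabilities add up to 1.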
Example 40: A bag contains 5 red and 3 black balls and a second bag contains 4 red and 5 black balls.
(a) If one ball is selected at random from each bag, what is the probability that both of them are of same colour?
(b) If a bag is selected at random and two balls are drawn from it, what is the probability that they are of (i) same colour, (ii) different colours?

Solution:
(a) Required Probability = [Probability that balls from both bags are red] + [Probability that balls from both bags are black]

$= \frac{5}{8} \times \frac{4}{9} + \frac{3}{8} \times \frac{5}{9} = \frac{35}{72}$

(b) Let A be the event that first bag is drawn so that $\bar{A}$ denotes the event that second bag is drawn. Since the two events are equally likely, mutually exclusive and exhaustive, we have $P(A) = P(\bar{A}) = \frac{1}{2}$.

(i) Let R be the event that two drawn balls are red and B be the event that they are black. The required probability is given by

$P(A)[P(R/A) + P(B/A)] + P(\bar{A})[P(R/\bar{A}) + P(B/\bar{A})]$

$= \frac{1}{2}\left[\frac{{}^{5}C_2 + {}^{3}C_2}{{}^{8}C_2}\right] + \frac{1}{2}\left[\frac{{}^{4}C_2 + {}^{5}C_2}{{}^{9}C_2}\right] = \frac{1}{2}\left[\frac{13}{28} + \frac{16}{36}\right] = \frac{229}{504}$

(ii) Let C denote the event that the drawn balls are of different colours. The required probability is given by

$P(C) = P(A) P(C/A) + P(\bar{A}) P(C/\bar{A})$

$= \frac{1}{2}\left[\frac{5 \times 3}{{}^{8}C_2}\right] + \frac{1}{2}\left[\frac{4 \times 5}{{}^{9}C_2}\right] = \frac{1}{2}\left[\frac{15}{28} + \frac{20}{36}\right] = \frac{275}{504}$
Example 41: There are two urns U1 and U2. U1 contains 9 white and 4 red balls and U2 contains 3 white and 6 red balls. Two balls are transferred from U1 to U2 and then a ball is drawn from U2. What is the probability that it is a white ball? Solution: Let A be the event that the two transferred balls are white, B be the event that they are red and C be the event that one is white and the other is red. Further, let W be the event that a white ball is drawn from U2. The event W can occur with any one of the mutually exclusive events A, B and C.
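$P(W) = P(A) \cdot P(W/A) + P(B) \cdot P(W/B) + P(C) \cdot P(W/C)$

$= \frac{{}^{9}C_2}{{}^{13}C_2} \times \frac{5}{11} + \frac{{}^{4}C_2}{{}^{13}C_2} \times \frac{3}{11} + \frac{9 \times 4}{{}^{13}C_2} \times \frac{4}{11} = \frac{57}{143}$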
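The computation in Example 41 is an application of P(W) = P(A)P(W/A) + P(B)P(W/B) + P(C)P(W/C). It can be reproduced with exact fractions as in the sketch below, where the list name `cases` is an illustrative choice.

```python
from fractions import Fraction
from math import comb

# U1 holds 9 white and 4 red balls; two balls are transferred to U2,
# which then holds 11 balls in all (3 white and 6 red before the transfer).
total = comb(13, 2)  # ways of choosing the two transferred balls

# Each case: (probability of the transfer, P(white from U2 / transfer)).
cases = [
    (Fraction(comb(9, 2), total), Fraction(5, 11)),  # A: both white -> 5 white in U2
    (Fraction(comb(4, 2), total), Fraction(3, 11)),  # B: both red   -> 3 white in U2
    (Fraction(9 * 4, total), Fraction(4, 11)),       # C: one of each -> 4 white in U2
]

# The three transfer events are mutually exclusive and exhaustive.
total_weight = sum(p for p, _ in cases)

p_white = sum(p * cond for p, cond in cases)
print(total_weight, p_white)  # 1 57/143
```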
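Example 42: A bag contains tickets numbered as 112, 121, 211 and 222. One ticket is drawn at random from the bag. Let Ei (i = 1, 2, 3) be the event that the ith digit on the ticket is 2. Show that E1, E2 and E3 are pair-wise independent but not mutually independent.

Solution: The event E1 occurs if the number on the drawn ticket is 211 or 222, therefore, $P(E_1) = \frac{1}{2}$. Similarly, $P(E_2) = \frac{1}{2}$ and $P(E_3) = \frac{1}{2}$.

Any two of these events occur together only when the ticket numbered 222 is drawn, so that $P(E_i \cap E_j) = \frac{1}{4} = P(E_i) \cdot P(E_j)$ for $i \neq j$. Hence E1, E2 and E3 are pair-wise independent.

Further, $P(E_1 \cap E_2 \cap E_3) = \frac{1}{4} \neq P(E_1) \cdot P(E_2) \cdot P(E_3) = \frac{1}{8}$. Hence E1, E2 and E3 are not mutually independent.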
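The distinction between pair-wise and mutual independence in Example 42 can be tested directly on the four tickets. The sketch below assumes, as the solution indicates (E1 occurs for tickets 211 and 222), that Ei is the event that the ith digit on the ticket is 2.

```python
from fractions import Fraction
from itertools import combinations

tickets = ["112", "121", "211", "222"]  # four equally likely tickets

def prob(event):
    """Classical probability of a set of tickets."""
    return Fraction(len(event), len(tickets))

# Assumed reading of the example: E[i] holds when the (i+1)-th digit is 2.
E = [{t for t in tickets if t[i] == "2"} for i in range(3)]

# Pair-wise independence: P(Ei and Ej) = P(Ei) P(Ej) for every pair.
pairwise = all(
    prob(E[i] & E[j]) == prob(E[i]) * prob(E[j])
    for i, j in combinations(range(3), 2)
)

# Mutual independence additionally needs the product rule for all three.
mutual = pairwise and (
    prob(E[0] & E[1] & E[2]) == prob(E[0]) * prob(E[1]) * prob(E[2])
)

print(pairwise, mutual)  # True False
```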
Example 43: Probability that an electric bulb will last for 150 days or more is 0.7 and that it will last at the most 160 days is 0.8. Find the probability that it will last between 150 to 160 days.

Solution: Let A be the event that the bulb will last for 150 days or more and B be the event that it will last at the most 160 days. It is given that P(A) = 0.7 and P(B) = 0.8. The event $A \cup B$ is a certain event because at least one of A or B is bound to occur. Thus, $P(A \cup B) = 1$. We have to find $P(A \cap B)$. This probability is given by

$P(A \cap B) = P(A) + P(B) - P(A \cup B) = 0.7 + 0.8 - 1.0 = 0.5$
Example 44: The odds that A speaks the truth are 2 : 3 and the odds that B speaks the truth are 4 : 5. In what percentage of cases are they likely to contradict each other on an identical point?

Solution: Let A and B denote the respective events that A and B speak the truth. It is given that $P(A) = \frac{2}{5}$ and $P(B) = \frac{4}{9}$.

The event that they contradict each other on an identical point is given by $(A \cap \bar{B}) \cup (\bar{A} \cap B)$, where $(A \cap \bar{B})$ and $(\bar{A} \cap B)$ are mutually exclusive. Also A and B are independent events. Thus, we have

$P[(A \cap \bar{B}) \cup (\bar{A} \cap B)] = P(A \cap \bar{B}) + P(\bar{A} \cap B) = P(A) \cdot P(\bar{B}) + P(\bar{A}) \cdot P(B)$

$= \frac{2}{5} \times \frac{5}{9} + \frac{3}{5} \times \frac{4}{9} = \frac{22}{45} = 0.49$

Hence, A and B are likely to contradict each other in 49% of the cases.

Example 45: The probability that a student A solves a mathematics problem is $\frac{2}{5}$ and the probability that a student B solves it is $\frac{2}{3}$. What is the probability that (a) the problem is not solved, (b) the problem is solved, (c) both A and B, working independently of each other, solve the problem?

Solution: Let A and B be the respective events that students A and B solve the problem. We note that A and B are independent events.

(a) $P(\bar{A} \cap \bar{B}) = P(\bar{A}) \cdot P(\bar{B}) = \frac{3}{5} \times \frac{1}{3} = \frac{1}{5}$

(b) $P(A \cup B) = 1 - P(\bar{A} \cap \bar{B}) = 1 - \frac{1}{5} = \frac{4}{5}$

(c) $P(A \cap B) = P(A) \cdot P(B) = \frac{2}{5} \times \frac{2}{3} = \frac{4}{15}$
Example 46: A bag contains 8 red and 5 white balls. Two successive drawings of 3 balls each are made such that (i) balls are replaced before the second trial, (ii) balls are not replaced before the second trial. Find the probability that the first drawing will give 3 white and the second 3 red balls. Solution: Let A be the event that all the 3 balls obtained at the first draw are white and B be the event that all the 3 balls obtained at the second draw are red. (a) When balls are replaced before the second draw, we have
$P(A) = \frac{{}^{5}C_3}{{}^{13}C_3} = \frac{5}{143}$ and $P(B) = \frac{{}^{8}C_3}{{}^{13}C_3} = \frac{28}{143}$

The required probability is given by $P(A \cap B)$, where A and B are independent. Thus, we have

$P(A \cap B) = P(A) \cdot P(B) = \frac{5}{143} \times \frac{28}{143} = \frac{140}{20449}$

(b) When the balls are not replaced before the second draw, we have $P(B/A) = \frac{{}^{8}C_3}{{}^{10}C_3} = \frac{7}{15}$. Thus, we have

$P(A \cap B) = P(A) \cdot P(B/A) = \frac{5}{143} \times \frac{7}{15} = \frac{7}{429}$
Example 47: Computers A and B are to be marketed. A salesman who is assigned the job of finding customers for them has 60% and 40% chances respectively of succeeding in case of computer A and B. The two computers can be sold independently. Given that the salesman is able to sell at least one computer, what is the probability that computer A has been sold? Solution: Let A be the event that the salesman is able to sell computer A and B be the event that he is able to sell computer B. It is given that P(A) = 0.6 and P(B) = 0.4. The probability that the salesman is able to sell at least one computer, is given by
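$P(A \cup B) = P(A) + P(B) - P(A \cap B) = P(A) + P(B) - P(A) \cdot P(B)$
$= 0.6 + 0.4 - 0.6 \times 0.4 = 0.76$

Now the required probability, the probability that computer A is sold given that the salesman is able to sell at least one computer, is given by

$P(A / A \cup B) = \frac{P[A \cap (A \cup B)]}{P(A \cup B)} = \frac{P(A)}{P(A \cup B)} = \frac{0.60}{0.76} = 0.789$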
Example 48: Two men M1 and M2 and three women W1, W2 and W3, in a big industrial firm, are trying for promotion to a single post which falls vacant. Those of the same sex have equal probabilities of getting promotion but each man is twice as likely to get the promotion as any woman.
(a) Find the probability that a woman gets the promotion.
(b) If M2 and W2 are husband and wife, find the probability that one of them gets the promotion.

Solution: Let p be the probability that a particular woman gets the promotion, therefore 2p will be the probability that a particular man gets the promotion. Thus, we can write, P(M1) = P(M2) = 2p and P(W1) = P(W2) = P(W3) = p, where P(Mi) denotes the probability that the ith man gets the promotion (i = 1, 2) and P(Wj) denotes the probability that the jth woman gets the promotion. Since the post is to be given only to one of the five persons, the events M1, M2, W1, W2 and W3 are mutually exclusive and exhaustive.

∴ $P(M_1 \cup M_2 \cup W_1 \cup W_2 \cup W_3) = P(M_1) + P(M_2) + P(W_1) + P(W_2) + P(W_3) = 1$

⇒ 2p + 2p + p + p + p = 1 or $p = \frac{1}{7}$

(a) The probability that a woman gets the promotion is

$P(W_1 \cup W_2 \cup W_3) = P(W_1) + P(W_2) + P(W_3) = \frac{3}{7}$

(b) The probability that M2 or W2 gets the promotion is

$P(M_2 \cup W_2) = P(M_2) + P(W_2) = \frac{2}{7} + \frac{1}{7} = \frac{3}{7}$
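Example 49: An unbiased die is thrown 8 times. What is the probability of getting a six in at least one of the throws?

Solution: Let Ai be the event that a six is obtained in the ith throw (i = 1, 2, ...... 8). Therefore, $P(A_i) = \frac{1}{6}$. The event that a six is obtained in at least one of the throws is represented by $A_1 \cup A_2 \cup \cdots \cup A_8$. Since the throws are independent, we have

$P(A_1 \cup A_2 \cup \cdots \cup A_8) = 1 - P(\bar{A}_1 \cap \bar{A}_2 \cap \cdots \cap \bar{A}_8) = 1 - P(\bar{A}_1) \cdot P(\bar{A}_2) \cdots P(\bar{A}_8) = 1 - \left(\frac{5}{6}\right)^8$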
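The complement rule for independent events used in Example 49 can be wrapped in a small function. The name `at_least_one` is a hypothetical helper; it assumes that the events whose probabilities are passed in are independent.

```python
from fractions import Fraction
from math import prod

def at_least_one(probabilities):
    """P(at least one of several independent events) = 1 - product of P(complements)."""
    return 1 - prod(1 - p for p in probabilities)

# Example 49: a six in at least one of 8 throws of an unbiased die.
p_six = at_least_one([Fraction(1, 6)] * 8)
print(p_six, round(float(p_six), 3))  # 1288991/1679616 0.767

# The same rule gives part (a) of Example 31.
p_solved = at_least_one([Fraction(1, 2), Fraction(1, 3), Fraction(1, 4)])
print(p_solved)  # 3/4
```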
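Example 50: Two students X and Y are very weak students of mathematics and their chances of solving a problem correctly are 0.11 and 0.14 respectively. If the probability of their making a common mistake is 0.081 and they get the same answer, what is the chance that their answer is correct?

Solution: Let A be the event that both the students get a correct answer, B be the event that both get an incorrect answer by making a common mistake and C be the event that both get the same answer. Thus, we have

$P(A \cap C) = P(\text{X gets correct answer}) \times P(\text{Y gets correct answer}) = 0.11 \times 0.14 = 0.0154$

Similarly,

$P(B \cap C) = P(\text{X gets incorrect answer}) \times P(\text{Y gets incorrect answer}) \times P(\text{X and Y make a common mistake})$
$= (1 - 0.11) \times (1 - 0.14) \times 0.081 = 0.89 \times 0.86 \times 0.081 = 0.0620$

Further, $C = (A \cap C) \cup (B \cap C)$, where $(A \cap C)$ and $(B \cap C)$ are mutually exclusive, so that

$P(C) = P(A \cap C) + P(B \cap C) = 0.0154 + 0.0620 = 0.0774$

∴ $P(A/C) = \frac{P(A \cap C)}{P(C)} = \frac{0.0154}{0.0774} = 0.199$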
Example 51: Given below are the daily wages (in rupees) of six workers of a factory: 77, 105, 91, 100, 90, 83. If two of these workers are selected at random to serve as representatives, what is the probability that at least one will have a wage lower than the average?

Solution: The average wage $\bar{X} = \frac{77 + 105 + 91 + 100 + 90 + 83}{6} = 91$

Let A be the event that the two workers selected at random have their wages greater than or equal to the average wage. Three of the six workers (with wages 105, 91 and 100) have such wages.

∴ $P(A) = \frac{{}^{3}C_2}{{}^{6}C_2} = \frac{1}{5}$

Thus, the probability that at least one of the workers has a wage less than the average $= 1 - \frac{1}{5} = \frac{4}{5}$
Example 52: There are two groups of subjects one of which consists of 5 science subjects and 3 engineering subjects and the other consists of 3 science subjects and 5 engineering subjects. An unbiased die is cast. If the number 3 or 5 turns up, a subject from the first group is selected at random otherwise a subject is randomly selected from the second group. Find the probability that an engineering subject is selected ultimately. Solution: Let A be the event that an engineering subject is selected and B be the event that 3 or 5 turns on the die. The given information can be summarised into symbols, as given below :
$P(B) = \frac{1}{3}$, $P(A/B) = \frac{3}{8}$ and $P(A/\bar{B}) = \frac{5}{8}$

To find P(A), we write

$P(A) = P(A \cap B) + P(A \cap \bar{B}) = P(B) \cdot P(A/B) + P(\bar{B}) \cdot P(A/\bar{B})$

$= \frac{1}{3} \times \frac{3}{8} + \frac{2}{3} \times \frac{5}{8} = \frac{13}{24}$
Example 53: Find the probability of obtaining two heads in the toss of two unbiased coins when (a) at least one of the coins shows a head, (b) second coin shows a head. Solution: Let A be the event that both coins show heads, B be the event that at least one coin shows a head and C be the event that second coin shows a head. The sample space and the three events can be written as :
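S = {(H, H), (H, T), (T, H), (T, T)}, A = {(H, H)}, B = {(H, H), (H, T), (T, H)} and C = {(H, H), (T, H)}

Further, $A \cap B$ = {(H, H)} and $A \cap C$ = {(H, H)}

Since the coins are given to be unbiased, the elementary events are equally likely, therefore

$P(A) = \frac{1}{4}$, $P(B) = \frac{3}{4}$, $P(C) = \frac{1}{2}$, $P(A \cap B) = P(A \cap C) = \frac{1}{4}$

(a) We have to determine P(A/B)

$P(A/B) = \frac{P(A \cap B)}{P(B)} = \frac{1}{4} \times \frac{4}{3} = \frac{1}{3}$

(b) We have to determine P(A/C)

$P(A/C) = \frac{P(A \cap C)}{P(C)} = \frac{1}{4} \times \frac{2}{1} = \frac{1}{2}$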
Hint: Two aces can be drawn from four aces in ${}^{4}C_{2}$ ways.
2. Two cards are drawn at random from a deck of 52 well-shuffled cards. What is the probability that one of them is an ace and the other is a queen?
3. What is the probability of getting all the four heads in four throws of an unbiased coin?
4. What is the probability of getting 5 on each of the two throws of a six faced unbiased die?
5. Four cards are drawn at random without replacement from a pack of 52 cards. What is the probability that:
(a) All of them are aces?
(b) All of them are of different suits?
(c) All of them are picture cards or spades or both?
Hint: See example 36.
6. Find the probability of throwing an even number from a single throw of a pair of unbiased dice.
Hint: An even number is obtained if both dice show either odd or even numbers.
7. A bag contains 50 balls serially numbered from 1 to 50. One ball is drawn at random from the bag. What is the probability that the number on it is a multiple of 3 or 4?
Hint: The number of serial numbers that are multiples of both 3 and 4 is the integral part of $\frac{50}{\text{L.C.M. of 3 and 4}}$.
8. A bag contains 4 white and 5 red balls. Two balls are drawn in succession at random. What is the probability that (a) both the balls are white, (b) both are red, (c) one of them is red and the other is white?
9. A bag contains 5 red, 8 white and 3 blue balls. If three balls are drawn at random, find the probability that (a) all the balls are blue, (b) each ball is of different colour, (c) the drawn balls are in the order red, white and blue, (d) none of the balls are white.
Hint: (b) This event is the same as that of drawing one ball of each colour. (c) n(S) = 16 × 15 × 14.
10. 4 cards are drawn at random from a pack of 52 well-shuffled cards. Find the chance that (i) each card is of a different suit, (ii) they consist of a Jack, Queen, King and an Ace, (iii) they are 4 honours of the same suit.
Hint: Honours of a suit are its Jack, Queen, King and Ace.
11. In how many ways can the letters of the following words be arranged? MANAGEMENT, ASSESSMENT, COMMITTEE
Hint: See example 13.
12. How many distinct words can be formed from the letters of the word MEERUT? How many of these words start at M and end at T?
Hint: Fixing M and T, determine the number of permutations of remaining letters.
13. In a random arrangement of letters of the word DROUGHT, find the probability that vowels come together.
Hint: See example 15.
14. The letters of the word STUDENT are arranged at random. Find the probability that the word so formed: (a) starts with S, (b) starts with S and ends with T, (c) the vowels occupy odd positions only, (d) the vowels occupy even positions only.
Hint: See examples 14 and 15.
15. How many triangles can be formed by joining 12 points in a plane, given that 7 points are on one line?
Hint: No. of triangles = ${}^{12}C_{3} - {}^{7}C_{3}$.
16. In a random arrangement of 10 members of a committee, find the probability that there are exactly 3 members sitting between the president and secretary when the arrangement is done (i) in a row, (ii) in a ring.
Hint: Considering 5 members as one, there are 6 members. No. of permutations (i) $2! \times {}^{8}C_{3} \times 3! \times 6!$, (ii) $2! \times {}^{8}C_{3} \times 3! \times 5!$.
17. A six digit number is formed by the digits 5, 9, 0, 7, 1, 3; no digit being repeated. Find the probability that the number formed is (i) divisible by 5, (ii) not divisible by 5.
Hint: 0 cannot come at the sixth place of a six digit number.
18. If 30 blankets are distributed at random among 10 beggars, find the probability that a particular beggar receives 5 blankets.
Hint: A particular beggar can receive 5 blankets in ${}^{30}C_{5}$ ways and the remaining 25 blankets can be distributed among the other 9 beggars in $9^{25}$ ways.
19. A statistical experiment consists of asking 3 housewives, selected at random, if they wash their dishes with brand X detergent. List the elements of the sample space S using the letter Y for 'yes' and N for 'no'. Also list the elements of the event: "The second woman interviewed uses brand X". Find the probability of this event if it is assumed that all the elements of S are equally likely to occur.
Hint: The sample space would consist of eight 3-tuples of the type (Y, Y, Y), etc.
20. n persons are sitting in a row. If two persons are picked up at random, what is the probability that they are sitting adjacent to each other?
Hint: Two adjacent persons can be picked up in (n - 1) ways.
21. A committee of 5 persons is to be formed out of 7 Indians and 5 Japanese. Find the probability that (a) the committee is represented only by the Indians, (b) there are at least two Japanese on the committee, (c) there are at least two Japanese and two Indians on the committee.
Hint: See example 16.
22. 4 letters are placed at random in 4 addressed envelopes. Find the probability that all the letters are not placed in right envelopes.
Hint: The letters can be placed in their respective envelopes in one way.
23. Find the probability that a family with 4 children has (a) 2 boys and 2 girls, (b) no boy, (c) at the most two boys, (d) at least a girl. Assume equal probability for boys and girls.
Hint: (a) The event can occur in ${}^{4}C_{2}$ mutually exclusive ways, each with probability $\left(\frac{1}{2}\right)^{4}$.
24. One child is selected at random from each of the three groups of children, namely, 3 girls and 1 boy, 2 girls and 2 boys, 1 girl and 3 boys. Find the probability of selecting 1 girl and 2 boys.
Hint: The event can occur in any one of the following mutually exclusive ways: BBG, BGB, GBB.
25. A can hit a target in 3 out of 4 attempts while B can hit it in 2 out of 3 attempts. If both of them try simultaneously, what is the probability that the target will be hit?
Hint: Find the probability of hitting the target at least once.
26. A and B played 12 chess matches out of which A won 6 matches, B won 4 matches and 2 resulted in draw. If they decide to play 3 more matches, what is the probability that (a) A wins all the three matches, (b) two matches end in draw, (c) B wins at least a match, (d) A wins at least a match, (e) A and B win alternately?
Hint: (b) P(two matches end in draw) = $\frac{2}{12} \times \frac{2}{12} \times \frac{10}{12} \times 3$.
27. A and B, who are equally perfect players of badminton, stopped playing a match when their scores were 12 and 13 respectively. If 15 points are needed to win this match, what are their respective probabilities of winning?
Hint: A can win in following mutually exclusive ways: AAA, BAAA, ABAA, AABA.
28. A problem in accountancy is given to five students. Their chances of solving it are $\frac{1}{2}, \frac{1}{3}, \frac{1}{4}, \frac{1}{5}$ and $\frac{1}{6}$ respectively. What is the probability that the problem will be solved?
29. (a) A guard of 12 soldiers is to be formed out of n soldiers. Find the probability that (i) two particular soldiers A and B are together on the guard, (ii) three particular soldiers C, D and E are together on the guard. (iii) Also find n if A and B are 3 times as often together on the guard as C, D and E.
(b) A has 6 shares in a lottery in which there are 3 prizes and 10 blanks. B has 2 shares in a lottery in which there are 4 prizes and 8 blanks. Which of them has a better chance to win a prize?
Hint: (a) When A and B are on the guard, remaining 10 soldiers can be selected in ${}^{n-2}C_{10}$ ways.
(b) $P(A) = 1 - \frac{{}^{10}C_{6}}{{}^{13}C_{6}}$.
30. It is 8 to 5 against a person, who is now 40 years old, living till he is 70 and 4 to 3 against a person, now 50 years old, living till he is 80. Find the probability that at least one of them would be alive 30 years hence.
Hint: See example 37.
31. A candidate is selected for interview for 3 posts. There are 3 candidates for the first, 4 for the second and 2 for the third post. What is the chance of his getting at least one post?
Hint: Probability that he gets the first post is $\frac{1}{3}$, etc.
32. A bag contains 6 Rupee and 9 Dollar coins. Two drawings of 4 coins each are made without replacement. What is the probability that first draw will give 4 Rupee coins and second 4 Dollar coins?
Hint: See example 46.
33. Three tokens marked as 1, 2 and 3 are placed in a bag and one is drawn and replaced, the operation being repeated three times. What is the probability of obtaining a total of 6?
Hint: A total of 6 can be obtained if a different number is obtained in each operation or 2 is obtained in all the three operations. There are 3! ways of obtaining different numbers.
34. A certain player, say X, is known to win with probability 0.3 if the track is fast and with probability 0.4 if the track is slow. On Monday, there is a 0.7 probability of a fast track. What is the probability that X will win on Monday?
Hint: Let A be the event that the track is fast and B be the event that X wins, then $P(B) = P(A \cap B) + P(\bar{A} \cap B)$.
35. The probability that a vacuum-cleaner salesman will succeed in persuading a customer on the first call is 0.4. If he fails, the probability of success on the second call is 0.2. If he fails on the first two calls, the probability of success on the third and last call is 0.1. Find the probability that the salesman makes a sale of vacuum-cleaner to a customer.
Hint: Try as in exercise 34 above.
36. There are two contractors A and B, for the completion of a project. Contractor A does the first part of the project and then contractor B, by doing the second part, completes the project. B cannot start until A has finished. If A finishes on time, B has 85% chance of completing the project on time. If A doesn't finish on time, then B has only 30% chance of completing the project on time. If A has 70% chance of finishing his work on time, what is the probability that the project will be finished on time?
Hint: Find $P(A \cap B) + P(\bar{A} \cap B)$.
37. The probability that a person stopping at a petrol pump will ask to have his tyres checked is 0.12, the probability that he will ask to have his oil checked is 0.29 and the probability that he will ask to have both of them checked is 0.07.
(i) What is the probability that a person stopping at the petrol pump will have either tyres or oil checked?
(ii) What is the probability that a person who has tyres checked will also have oil checked?
(iii) What is the probability that a person who has oil checked will also have tyres checked?
Hint: See example 32.
38. There are three brands, say X, Y and Z, of an item available in the market. A consumer chooses exactly one of them for his use. He never buys two or more brands simultaneously. The probabilities that he buys brands X, Y and Z are 0.20, 0.16 and 0.45 respectively.
(i) What is the probability that he doesn't buy any of the brands?
(ii) Given that the consumer buys some brand, what is the probability that he buys brand X?
Hint: (i) The required probability $= 1 - P(X \cup Y \cup Z)$, where X, Y and Z are mutually exclusive.
39. A person applies for the post of manager in two firms A and B. He estimates that the probability of his being selected in firm A is 0.75, the probability of being rejected in firm B is 0.45 and the probability of rejection in at least one of the firms is 0.55. What is the probability that he will be selected in at least one of the firms?
Hint: $P(A \cap B) = 1 - P(\bar{A} \cup \bar{B})$.
40. (a) A student is given a true-false examination with 10 questions. If he gets 8 or more correct answers, he passes the examination. Given that he guesses at an answer to each question, compute the probability that he passes the examination.
(b) In a multiple choice question, there are four alternative answers out of which one or more are correct. A candidate will get marks in the question only if he ticks all the correct answers. If he is allowed up to three chances to answer the question, find the probability that he will get marks in the question.
Hint: (a) $n(S) = 2^{10}$. No. of favourable cases is ${}^{10}C_{8} + {}^{10}C_{9} + {}^{10}C_{10}$.
(b) Total no. of ways in which the student can tick the answers in one attempt $= 2^{4} - 1$ (since at least one of the answers is correct, it is not possible that he will leave all the answers unticked). The total no. of ways of selecting three solutions from 15 is ${}^{15}C_{3}$. Note that it will be in the interest of the candidate to select a different solution in each attempt. Since out of 15 solutions only one (way of marking the question) is correct, the no. of ways of selecting incorrect solutions is ${}^{14}C_{3}$, so that the probability that all three attempts are incorrect is $\frac{{}^{14}C_{3}}{{}^{15}C_{3}}$.
41. 200 students were admitted to an under graduate course through an entrance test out of which only 150 completed it successfully. On the examination of their admission data, it was found that 70% of those who passed and 50% of those who failed had a first division in their senior secondary examination. Find (a) the probability that a student with first division in the senior secondary examination is successful in the under graduate course, (b) the probability that a student without first division in senior secondary examination is successful in the under graduate course, (c) the probability that an admitted student is a first divisioner in senior secondary examination, (d) the probability that an admitted student is unsuccessful in the under graduate course.
Hint: See example 27.
42. 300 employees of a firm were asked if they would favour increasing their working day by one hour so that they could have a five day week. The results are given in the following table:

             Favour (F)   Disfavour (D)   Neutral (N)
Men (M)         102            90              48
Women (W)        42             6              12

Find (a) P(M), (b) P(W), (c) P(F), (d) P(D), (e) P(N), (f) $P(M \cap F)$, (g) $P(W \cap F)$, (h) $P(N \cap M)$, (i) $P(W \cap N)$, (j) P(F/M), (k) P(W/F), (l) P(D/W), (m) P(M/N), (n) P(N/W), (o) $P(M \cup F)$, (p) $P(W \cup D)$, (q) $P(M \cup D)$, (r) $P(F \cup D)$, (s) $P(M \cup W)$, (t) $P(M \cup F \cup D)$.
Hint: See example 27.
43. In a bridge game of playing cards, 4 players are distributed one card each by turn so that each player gets 13 cards. What is the probability that a specified player gets a black ace and a king?
Hint: No. of favourable cases are ${}^{2}C_{1} \times {}^{4}C_{1} \times {}^{46}C_{11}$.
44. A bag contains 4 white and 2 black balls. Two balls are drawn successively one after another without replacement. What is the probability that (a) the first ball is white and the second is black, (b) the first is black and second is white?
Hint: Use conditional probability theorem.
45. (a) What is the probability that out of 3 friends, Ram, Shyam and Mohan, at least two have the same birthday?
(b) What is the probability that out of a group of 4 persons, all born in the month of April, at least three have same birthday?
Hint: Suppose that Ram states his birthday, then the probability of Shyam having a different birthday is $\frac{364}{365}$ and then the probability of Mohan having a different birthday is $\frac{363}{365}$, etc. The required probability is $1 - \frac{364}{365} \times \frac{363}{365}$.
46. The probability that a man aged 70 years will die in a year is 2 . Find the probability that out of 5 men A1, A2, A3, A4 and A5, each aged 70 years, A1 will die in a year and will be the first to die.
Hint: P(A1 dies first out of 5 men) = 1 . Multiply this by the probability that at least one
47. The probability of rain tomorrow is 0.65 and the probability that the temperature will rise above 35°C is 0.8. The probability of there being no rain and the temperature remaining below 35°C is 0.1.
(a) What is the probability of rain if temperature rises above 35°C?
(b) What is the probability that temperature remains below 35°C, given that there is no rain?
Hint: Try as in exercise 38 above.
48. A bag contains 4 red and 2 black balls. Three men X, Y and Z draw a ball in succession, without replacement, until a black ball is obtained. Find their respective chances of getting first black ball.
Hint: X can get first black ball in the following two mutually exclusive ways: B or RRRB, etc., where R and B denote a red and a black ball respectively.
49. A and B are two candidates for admission to a certain course. The probability that A is selected is 0.80 and the probability that both A and B are selected is at the most 0.25. Is it possible that probability of selection of B is 0.50?
Hint: $P(A \cup B) \le 1$.
50. Delhi has three independent reserved sources of electric power to use to prevent a blackout in the event that its regular source fails. The probability that any reserved source is available when its regular source fails is 0.7. What is the probability of not having a blackout if the regular source fails?
Hint: The required probability = 1 - the probability that power is not available from any of the reserved sources.
51. In a locality, out of 5,000 people residing, 1,200 are above 30 years of age and 3,000 are females. Out of 1,200, who are above 30 years, 200 are females. If a person selected at random is a female, what is the probability that she is above 30 years of age?
Hint: See example 27.
52. The probability that both the events A and B occur simultaneously is probability of occurrence of neither of them is
1 and the 5
P(B) on the assumption that the events are independent.
Hint: Let P(A) = x and P(B) = y. Use the equation $1 - P(A \cup B) = P(\bar{A} \cap \bar{B})$ to find x + y. Find x - y from it by using the equation $(x - y)^{2} = (x + y)^{2} - 4xy$.
53. Two factories A and B manufacture the same machine part. Each part is classified as having 0, 1, 2 or 3 manufacturing defects. The joint probabilities are as follows:

Number of defects      0         1         2         3
Factory A           0.1250    0.0625    0.1875    0.1250
Factory B           0.0625    0.0625    0.1250    0.2500
(i) A part is observed to have no defects. What is the probability that it was produced by factory A?
(ii) A part is known to have been produced by factory A. What is the probability that the part has no defects?
(iii) A part is known to have two or more defects. What is the probability that it was manufactured by factory A?
(iv) A part is known to have one or more defects. What is the probability that it was manufactured by factory B?
Hint: See example 30.
54. A man is dealt 4 spade cards from an ordinary pack of 52 cards. If he is given three more cards, find the probability that at least one of the additional cards is also a spade.
Hint: The probability that no spade is obtained from the remaining 48 cards is $\frac{{}^{39}C_{3}}{{}^{48}C_{3}}$.
55. An unbiased die is thrown three times. Find the probability of (a) throwing 4 on the first die if the sum of numbers obtained in three throws is 15, (b) obtaining a sum of 15 when first die shows 4. Hint: (a) There are 10 ways of obtaining the sum 15 out of which 2 are favourable, (b) there are 36 cases in which first die shows 4, out of which only two are favourable.
56. A committee of 4 has to be formed from among 3 economists, 4 engineers, 2 statisticians and 1 doctor.
(i) What is the probability that each of the four professions is represented on the committee?
(ii) What is the probability that the committee consists of the doctor and at least one economist?
Hint: (ii) The required probability is obtained by finding the probabilities of the following mutually exclusive events: {1 doc, 1 eco, 2 others}, {1 doc, 2 eco, 1 other} and {1 doc, 3 eco}.
57. Six persons toss a coin turn by turn. The game is won by the player who first throws a head. Find the probability of success of the fifth player.
Hint: See example 39.
58. Find the probability that an assessee files his tax return and cheats on it, given that 70% of all the assessees file returns and 20% of those who file, cheat.
Hint: See example 27.
59. Two persons A and B throw three unbiased dice. If A throws 14, find B's chances of throwing a higher number.
Hint: The event that A throws 14 is independent of the event that B throws a higher number.
60. A is one of 6 horses entered for a race and is to be ridden by one of the jockeys B and C. It is 2 : 1 that B rides A, in which case all the horses are equally likely to win; if C rides A, his chances are trebled; what are the odds against his winning?
Hint: P(A wins given that he is ridden by jockey B) = $\frac{1}{6}$, P(A wins given that he is ridden by jockey C) = $\frac{3}{6}$.
61. What is the probability that over a two day period the number of requests would either be 11 or 12 if at a motor garage the records of service requests along with their probabilities are given below?

Daily demand :   5      6      7
Probability  :  0.25   0.65   0.10
62. The probability that a T.V. of a company fails during the first month of its use is 0.02. Of those that do not fail during the first month, the probability of failure in the next five months is 0.01. Of those that do not fail during the first six months, the probability of failure by the end of the first year is 0.001. The company replaces, free of charge, any set that fails during its warranty period. If 2,000 sets are sold, how many will have to be replaced if the warranty period is (a) six months, (b) one year?
Hint: Probability that a set fails during first year = 0.02 + 0.98 × 0.01 + 0.9702 × 0.001, where 0.9702 = 0.98 × 0.99 is the probability that a set does not fail during the first six months.
63. A salesman has 60% chances of making sales to each customer. The behaviour of each successive customer is assumed to be independent. If two customers A and B enter, what is the probability that the salesman will make sales to A or B?
Hint: $P(A \cup B) = 1 - P(\bar{A} \cap \bar{B})$.
64. A box contains 24 bulbs out of which 4 are defective. A customer draws a sample of 3 bulbs at random in succession and rejects the box if the sample contains one or more defectives. What is the probability that the box is rejected?
Hint: The box will be rejected if the sample contains at least one defective.
$$P(A_k/D) = \frac{P(A_k) \cdot P(D/A_k)}{\sum_{i=1}^{n} P(A_i) \cdot P(D/A_i)}$$
Proof: Since A1, A2, ...... An are n exhaustive events, therefore, $S = A_1 \cup A_2 \cup \ldots \cup A_n$. Since D is another event that can occur in combination with any of the mutually exclusive and exhaustive events A1, A2, ...... An, we can write
$$D = (A_1 \cap D) \cup (A_2 \cap D) \cup \ldots \cup (A_n \cap D)$$
Since the events on the right hand side are mutually exclusive, we have
$$P(D) = \sum_{i=1}^{n} P(A_i \cap D) = \sum_{i=1}^{n} P(A_i) \cdot P(D/A_i) \qquad .... (1)$$
The conditional probability of an event Ak given that D has already occurred, is given by
$$P(A_k/D) = \frac{P(A_k \cap D)}{P(D)} = \frac{P(A_k) \cdot P(D/A_k)}{P(D)} \qquad .... (2)$$
Substituting the value of P(D) from (1), we get
$$P(A_k/D) = \frac{P(A_k) \cdot P(D/A_k)}{\sum_{i=1}^{n} P(A_i) \cdot P(D/A_i)} \qquad .... (3)$$
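Equations (1) and (3) translate directly into a small routine. The following is only a sketch: the function name `bayes_posteriors` and the priors and likelihoods used to exercise it are hypothetical values chosen for illustration, not data from the text.

```python
from fractions import Fraction

def bayes_posteriors(priors, likelihoods):
    """Return P(D) and the list of P(A_k/D) for mutually exclusive,
    exhaustive events A_1..A_n, given P(A_i) and P(D/A_i)."""
    if sum(priors) != 1:
        raise ValueError("priors must sum to 1 (the A_i are exhaustive)")
    # Joint probabilities P(A_i and D) = P(A_i).P(D/A_i)
    joint = [p * l for p, l in zip(priors, likelihoods)]
    p_d = sum(joint)                       # equation (1)
    return p_d, [j / p_d for j in joint]   # equation (3)

# Hypothetical illustration with three events.
priors = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]
likelihoods = [Fraction(1, 10), Fraction(1, 5), Fraction(3, 10)]
p_d, posteriors = bayes_posteriors(priors, likelihoods)
print(p_d, posteriors)
```

The posterior probabilities always add up to 1, which provides a quick check on any hand computation.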
Example 54: A manufacturing firm purchases a certain component, for its manufacturing process, from three sub-contractors A, B and C. These supply 60%, 30% and 10% of the firm's requirements, respectively. It is known that 2%, 5% and 8% of the items supplied by the respective suppliers are defective. On a particular day, a normal shipment arrives from each of the three suppliers and the contents get mixed. A component is chosen at random from the day's shipment:
(a) What is the probability that it is defective?
(b) If this component is found to be defective, what is the probability that it was supplied by (i) A, (ii) B, (iii) C?
Solution: Let A be the event that the item is supplied by A. Similarly, B and C denote the events that the item is supplied by B and C respectively. Further, let D be the event that the item is defective. It is given that: P(A) = 0.6, P(B) = 0.3, P(C) = 0.1, P(D/A) = 0.02, P(D/B) = 0.05, P(D/C) = 0.08.
(a) We have to find P(D). From equation (1), we can write
$$P(D) = P(A \cap D) + P(B \cap D) + P(C \cap D)$$
$$= P(A) P(D/A) + P(B) P(D/B) + P(C) P(D/C) = 0.6 \times 0.02 + 0.3 \times 0.05 + 0.1 \times 0.08 = 0.035$$
(b) (i) We have to find P(A/D)
$$P(A/D) = \frac{P(A) P(D/A)}{P(D)} = \frac{0.6 \times 0.02}{0.035} = 0.343$$
(ii) Similarly,
$$P(B/D) = \frac{P(B) P(D/B)}{P(D)} = \frac{0.3 \times 0.05}{0.035} = 0.429$$
(iii) and
$$P(C/D) = \frac{P(C) P(D/C)}{P(D)} = \frac{0.1 \times 0.08}{0.035} = 0.229$$
Alternative Method: The above problem can also be attempted by writing various probabilities in the form of following table:

              A        B        C      Total
$D$         0.012    0.015    0.008    0.035
$\bar{D}$   0.588    0.285    0.092    0.965
Total       0.600    0.300    0.100    1.000

Thus $P(A/D) = \frac{0.012}{0.035} = 0.343$, etc.
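The table of joint probabilities used in the alternative method of Example 54 can be generated mechanically. The sketch below assumes nothing beyond the data of the example; the dictionary names are illustrative.

```python
from fractions import Fraction

# Data of Example 54: supply shares and defect rates of sub-contractors A, B, C.
priors = {"A": Fraction("0.6"), "B": Fraction("0.3"), "C": Fraction("0.1")}
defect_rate = {"A": Fraction("0.02"), "B": Fraction("0.05"), "C": Fraction("0.08")}

# Rows of the joint probability table: P(supplier and D), P(supplier and not D).
joint_defective = {s: priors[s] * defect_rate[s] for s in priors}
joint_good = {s: priors[s] - joint_defective[s] for s in priors}

p_defective = sum(joint_defective.values())          # row total, P(D)
posterior = {s: joint_defective[s] / p_defective for s in priors}

print(float(p_defective))
for s in priors:
    print(s, float(joint_defective[s]), float(joint_good[s]),
          round(float(posterior[s]), 3))
```

Each posterior is simply a cell of the D row divided by the row total, which is what the table method does by inspection.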
Example 55: A box contains 4 identical dice out of which three are fair and the fourth is loaded in such a way that the face marked as 5 appears in 60% of the tosses. A die is selected at random from the box and tossed. If it shows 5, what is the probability that it was a loaded die?
Solution: Let A be the event that a fair die is selected and B be the event that the loaded die is selected from the box. Then, we have $P(A) = \frac{3}{4}$ and $P(B) = \frac{1}{4}$. Further, let D be the event that 5 is obtained on the die, then
$$P(D/A) = \frac{1}{6} \quad \text{and} \quad P(D/B) = \frac{6}{10}$$
Thus, $P(D) = \frac{3}{4} \times \frac{1}{6} + \frac{1}{4} \times \frac{6}{10} = \frac{11}{40}$
We have to find P(B/D), which is given by
$$P(B/D) = \frac{P(B \cap D)}{P(D)} = \frac{1}{4} \times \frac{6}{10} \times \frac{40}{11} = \frac{6}{11}$$
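The answer 6/11 of Example 55 can be cross-checked by simulating the experiment many times and looking only at the trials in which a 5 appears. This is a rough Monte Carlo sketch; the function name, the number of trials and the seed are arbitrary choices made here, and the estimate agrees with 6/11 only approximately.

```python
import random

def simulate_loaded_given_five(trials, seed=0):
    """Estimate P(loaded die / a 5 is shown) by repeated random trials."""
    rng = random.Random(seed)
    fives = 0
    loaded_fives = 0
    for _ in range(trials):
        loaded = rng.randrange(4) == 0       # one die in four is the loaded one
        p_five = 0.6 if loaded else 1 / 6    # chance of a 5 on the chosen die
        if rng.random() < p_five:
            fives += 1
            loaded_fives += loaded           # counts only 5s from the loaded die
    return loaded_fives / fives

estimate = simulate_loaded_given_five(200_000)
print(round(estimate, 3), "against the exact value", round(6 / 11, 3))
```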
Example 56: A bag contains 6 red and 4 white balls. Another bag contains 3 red and 5 white balls. A fair die is tossed for the selection of bag. If the die shows 1 or 2, the first bag is selected otherwise the second bag is selected. A ball is drawn from the selected bag and is found to be red. What is the probability that the first bag was selected? Solution: Let A be the event that first bag is selected, B be the event that second bag is selected and D be the event of drawing a red ball. Then, we can write
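$$P(A) = \frac{1}{3}, \quad P(B) = \frac{2}{3}, \quad P(D/A) = \frac{6}{10}, \quad P(D/B) = \frac{3}{8}$$
Further, $P(D) = \frac{1}{3} \times \frac{6}{10} + \frac{2}{3} \times \frac{3}{8} = \frac{9}{20}$.
$$\therefore \quad P(A/D) = \frac{P(A \cap D)}{P(D)} = \frac{1}{3} \times \frac{6}{10} \times \frac{20}{9} = \frac{4}{9}$$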
Example 57: In a certain recruitment test there are multiple-choice questions. There are 4 possible answers to each question, out of which only one is correct. An intelligent student knows 90% of the answers while a weak student knows only 20% of the answers.
(i) An intelligent student gets the correct answer, what is the probability that he was guessing?
(ii) A weak student gets the correct answer, what is the probability that he was guessing?
Solution: Let A be the event that an intelligent student knows the answer, B be the event that the weak student knows the answer and C be the event that the student gets a correct answer.
(i) We have to find $P(\bar{A}/C)$. We can write
$$P(\bar{A}/C) = \frac{P(\bar{A} \cap C)}{P(C)} = \frac{P(\bar{A}) P(C/\bar{A})}{P(\bar{A}) P(C/\bar{A}) + P(A) P(C/A)} \qquad .... (1)$$
It is given that P(A) = 0.90, $P(C/\bar{A}) = \frac{1}{4} = 0.25$ and P(C/A) = 1.0. From the above, we can also write $P(\bar{A}) = 0.10$. Substituting these values, we get
$$P(\bar{A}/C) = \frac{0.10 \times 0.25}{0.10 \times 0.25 + 0.90 \times 1.0} = \frac{0.025}{0.925} = 0.027$$
(ii) We have to find $P(\bar{B}/C)$. Replacing $\bar{A}$ by $\bar{B}$, in equation (1), we can get this probability. It is given that P(B) = 0.20, $P(C/\bar{B}) = 0.25$ and P(C/B) = 1.0. From the above, we can also write $P(\bar{B}) = 0.80$. Thus, we get
$$P(\bar{B}/C) = \frac{0.80 \times 0.25}{0.80 \times 0.25 + 0.20 \times 1.0} = \frac{0.20}{0.40} = 0.5$$
Example 58: An electronic manufacturer has two lines A and B assembling identical electronic units. 5% of the units assembled on line A and 10% of those assembled on line B are defective. All defective units must be reworked at a significant increase in cost. During the last eight-hour shift, line A produced 200 units while the line B produced 300 units. One unit is selected at random from the 500 units produced and is found to be defective. What is the probability that it was assembled (i) on line A, (ii) on line B? Answer the above questions if the selected unit was found to be non-defective.
Solution: Let A be the event that the unit is assembled on line A, B be the event that it is assembled on line B and D be the event that it is defective. Thus, we can write
$$P(A) = \frac{200}{500} = \frac{2}{5}, \quad P(B) = \frac{300}{500} = \frac{3}{5}, \quad P(D/A) = \frac{5}{100}, \quad P(D/B) = \frac{10}{100}$$
Further, we have
$$P(A \cap D) = \frac{2}{5} \times \frac{5}{100} = \frac{1}{50}, \quad P(B \cap D) = \frac{3}{5} \times \frac{10}{100} = \frac{3}{50}$$
The remaining probabilities are obtained by subtraction and can be written in the form of the following table:

              A        B      Total
$D$          1/50     3/50     4/50
$\bar{D}$   19/50    27/50    46/50
Total       20/50    30/50      1

From the above table, we can write
$$P(A/D) = \frac{1}{50} \times \frac{50}{4} = \frac{1}{4}, \quad P(B/D) = \frac{3}{50} \times \frac{50}{4} = \frac{3}{4}$$
$$P(A/\bar{D}) = \frac{19}{50} \times \frac{50}{46} = \frac{19}{46}, \quad P(B/\bar{D}) = \frac{27}{50} \times \frac{50}{46} = \frac{27}{46}$$
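Because Example 58 gives actual production figures, the same answers can be reached by counting expected units rather than multiplying probabilities. The sketch below takes this route; the helper `share` and the dictionary names are illustrative assumptions.

```python
from fractions import Fraction

# Data of Example 58: units produced in the shift and defect rates per line.
produced = {"A": 200, "B": 300}
defect_rate = {"A": Fraction(5, 100), "B": Fraction(10, 100)}

# Expected number of defective and non-defective units from each line.
defective = {line: produced[line] * defect_rate[line] for line in produced}
good = {line: produced[line] - defective[line] for line in produced}

def share(counts, line):
    """Fraction of the units in `counts` that came from `line`."""
    return counts[line] / sum(counts.values())

print(share(defective, "A"), share(defective, "B"))  # 1/4 3/4
print(share(good, "A"), share(good, "B"))            # 19/46 27/46
```

Of the 40 defective units expected, 10 come from line A and 30 from line B, which gives the ratios 1/4 and 3/4 found above.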
Hint: $P(B_1) = P(B_2) = P(B_3) = \frac{1}{3}$, $P(S/B_1) = 0$, $P(S/B_2) = 1$, $P(S/B_3) = \frac{1}{2}$.
4. In a factory producing bolts, Machines A, B and C manufacture 25%, 35% and 40% of total output. Of their output, 5%, 4% and 2% are defective respectively. A bolt is drawn at random from the product and is found to be defective. What is the probability that it was manufactured by machine A?
Hint: Apply Bayes' Rule.
5. Consider a population of consumers consisting of two types. The upper class of consumers comprise 35% of the population and each member has a probability 0.8 of purchasing brand A of a product. Each member of the rest of the population has a probability 0.3 of purchasing brand A of the product. A consumer, chosen at random, is found to be buyer of brand A. What is the probability that the buyer belongs to the middle and lower class of consumers?
6. At an electric plant, it is known from the past experience that the probability is 0.86 that a new worker who has attended the company's training programme will meet his production quota and that the corresponding probability is 0.35 for a new worker who has not attended the company's training programme. If 80% of the new workers attend the training programme, what is the probability that a new worker will meet his production quota?
Hint: Apply $P(D) = P(A) \cdot P(D/A) + P(B) \cdot P(D/B)$.
7. A talcum powder manufacturing company had launched a new type of advertisement. The company estimated that a person who comes across the advertisement will buy their product with a probability of 0.7 and those who do not see the advertisement will buy the product with a probability of 0.3. If in an area of 1,000 people, 70% had come across the advertisement, what is the probability that a person who buys the product (a) has not come across the advertisement (b) has come across the advertisement?
8. There are two boxes, of identical appearance, each containing 4 sparkplugs. It is known that box I contains only one defective sparkplug, while all the four sparkplugs of box II are non-defective. A sparkplug, drawn at random from a box selected at random, is found to be non-defective. What is the probability that it came from box I?
9. A man has 5 one rupee coins and one of them is known to have two heads. He takes out a coin at random and tosses it 5 times; it always falls head upward. What is the probability that it is a coin with two heads?
1. Give Statistical or Empirical Definition of Probability.
2. Explain Permutation with restrictions.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better.
(d) Try to write answers for them, but do not submit your answers to the university for assessment.
(e) These are for your practice only.
$${}^{n}C_{r} = \frac{n!}{r!(n - r)!}$$
2. (a) The probability of occurrence of at least one of the two events A and B is given by: $P(A \cup B) = P(A) + P(B) - P(A \cap B) = 1 - P(\bar{A} \cap \bar{B})$.
(b) The probability of occurrence of exactly one of the events A or B is given by: $P(A \cap \bar{B}) + P(\bar{A} \cap B)$ or $P(A \cup B) - P(A \cap B)$.
3. (a) The probability of simultaneous occurrence of the two events A and B is given by: $P(A \cap B) = P(A) \cdot P(B/A)$ or $= P(B) \cdot P(A/B)$.
4. Bayes' Theorem:
$$P(A_k/D) = \frac{P(A_k) \cdot P(D/A_k)}{\sum_{i=1}^{n} P(A_i) \cdot P(D/A_i)}, \quad (k = 1, 2, \ldots, n)$$
Here A1, A2, ...... An are n mutually exclusive and exhaustive events.
10.10 KEYWORDS
Probability Event Outcome Occurrence Combination Inverse probability
Distinguish between
Permutation and Combination
Prior possibilities or Inverse possibilities
It is not possible to predetermine the outcome associated with a particular experiment.
The total no. of permutations of n distinct objects is n!
Each element of the set is called a sample point.
A compound event is simultaneous occurrence of only two events.
The assignment of probabilities on the basis of statistical and classical events is objective.
6. Explain the meaning of a statistical experiment and corresponding sample space. Write down the sample space of an experiment of simultaneous toss of two coins and a die.
8. State and prove Bayes' theorem on inverse probability.
9. What is the probability of getting exactly two heads in three throws of an unbiased coin?
10. What is the probability of getting a sum of 2 or 8 or 12 in single throw of two unbiased dice?
11. Two cards are drawn at random from a pack of 52 cards. What is the probability that the first is a king and second is a queen?
12. What is the probability of successive drawing of an ace, a king, a queen and a jack from a pack of 52 well shuffled cards? The drawn cards are not replaced.
13. 5 unbiased coins with faces marked as 2 and 3 are tossed. Find the probability of getting a sum of 12.
14. If 15 chocolates are distributed at random among 5 children, what is the probability that a particular child receives 8 chocolates? 15. A and B stand in a ring with 10 other persons. If arrangement of 12 persons is at random, find the chance that there are exactly three persons between A and B. 16. Two different digits are chosen at random from the set 1, 2, 3, 4, 5, 6, 7, 8. Find the probability that sum of two digits exceeds 13. 17. From each of the four married couples one of the partner is selected at random. What is the probability that they are of the same sex? 18. A bag contains 5 red and 4 green balls. Two draws of three balls each are done with replacement of balls in the first draw. Find the probability that all the three balls are red in the first draw and green in the second draw. 19. Two die are thrown two times. What is the probability of getting a sum 10 in the first and 11 in the second throw? 20. 4 cards are drawn successively one after the other without replacement. What is the probability of getting cards of the same denominations? 21. A bag contains 4 white and 2 black balls. Two balls are drawn one after another without replacement. What is the probability that first ball is white and second is black or first is black and second is white? 22. A bag contains 4 white and 3 red balls. Another bag contains 3 white and 5 red balls. One ball is drawn at random from each bag. What is the probability that (a) both balls are white, (b) both are red, (c) one of them is white and the other is red? 23. What is the probability of a player getting all the four aces, when playing cards are uniformly distributed among the four players? 24. A bag contains 10 white and 6 red balls. Two balls are drawn one after another with replacement. Find the probability that both balls are red. 25. Three persons A, B and C successively draw one card from a pack of 52 cards with replacement of the card drawn earlier. The first to obtain a card of spade wins. 
What are their respective chances of winning? 26. A bag contains 6 red and 4 green balls. A ball is drawn at random and replaced and a second ball is drawn at random. Find the probability that the two balls drawn are of different colours. 27. The letters of the word GANESHPURI are arranged at random. Find the probability that in the word so formed: (a) The letter G always occupies the first place. (b) The letters P and I respectively occupy the first and last places. (c) The vowels are always together. (d) The letters E, H, P are never together. (e) The vowels always occupy even places (i.e., 2nd, 4th, etc.).
28. 5-letter words are formed from the letters of the word ORDINATES. What is the probability that the word so formed consists of 2 vowels and 3 consonants? 29. Maximum number of different committees are formed out of 100 teachers, including principal, of a college such that each committee consists of the same number of members. What is the probability that principal is a member of any committee? 30. Letters of the word INTERMEDIATE are arranged at random to form different words. What is the probability that :
(a) First letter of the word is R? (b) First letter is M and last letter is E? (c) All the vowels come together? (d) The vowels are never together?
Hint: (d) The event will occur if the letters are arranged as VCVCVCVCVCVCV where V and C denote vowels and consonants respectively. 6 places for vowels can be chosen in $^7C_6$ ways. 31. Five persons entered the lift cabin on the ground floor of an 8-floor building. Suppose that each of them, independently and with equal probability, can leave the lift at any floor beginning with the first. Find the probability of all the persons leaving at different floors. Hint: There are 7 floors along with ground floor. 32. A team of first eleven players is to be selected at random from a group of 15 players. What is the probability that (a) a particular player is included, (b) a particular player is excluded? 33. Out of 18 players of a cricket club there are 2 wicket keepers, 5 bowlers and the rest batsmen. What is the probability of selection of a team of 11 players including one wicket keeper and at least 3 bowlers? 34. Four persons are selected at random from a group consisting of 3 men, 2 women and 4 children. Find the chance that exactly 2 of them are children. 35. A committee of 6 is chosen from 10 men and 7 women so as to contain at least 3 men and 2 women. Find the probability that 2 particular women don't serve on the same committee. 36. If n persons are seated around a round table, find the probability that in no two ways a man has the same neighbours. 37. 6 teachers, of whom 2 are from science, 2 from arts and 2 from commerce, are seated in a row. What is the probability that the teachers of the same discipline are sitting together? 38. (a) If P(A) = 0.5, P(B) = 0.4 and $P(A \cup B) = 0.7$, find P(A/B) and $P(\bar{A} \cup B)$, where $\bar{A}$ is the complement of A. State whether A and B are independent. (b) If $P(A) = \frac{1}{3}$, $P(B) = \frac{1}{2}$, $P(A/B) = \frac{1}{6}$, find P(B/A) and $P(B/\bar{A})$. (c) If A, B and C are three mutually exclusive events, find P(B) if $\frac{1}{3}P(C) = \frac{1}{2}P(A) = P(B)$.
39. Let A be the event that a business executive selected at random has stomach ulcer and B be the event that he has a heart disease. Interpret the following events: (i) $A^c \cup A$, (ii) $A^c \cap A$, (iii) $A^c \cap B$, (iv) $A \cap B^c$, (v) $A \cap B$, where c stands for complement. 40. Let A, B and C be three events. Write down the following events in usual set notation: (i) A and B occur together, (ii) both A and B occur but not C, (iii) all the three events occur, (iv) at least one event occurs and (v) at least two events occur.
If an examinee is selected at random from this group, find (i) the probability that he is a commerce graduate, (ii) the probability that he is a science graduate, given that his score is above 60 and (iii) the probability that his score is below 50, given that he is B.A. 42. It is given that $P(A + B) = \frac{5}{6}$, $P(AB) = \frac{1}{3}$ and $P(\bar{B}) = \frac{1}{2}$, where $P(\bar{B})$ stands for the probability that event B doesn't happen. Determine P(A) and P(B). Hence, show that the events A and B are independent.
43. A can solve 75% of the problems in accountancy while B can solve 70% of the problems. Find the probability that a problem selected at random from an accountancy book (a) will be solved by both A and B, (b) will be solved by A or B, (c) will be solved by one of them. 44. (a) One card is drawn from each of two ordinary sets of 52 cards. Find the probability that at least one of them will be the ace of hearts. (b) Two cards are drawn simultaneously from a set of 52 cards. Find the probability that at least one of them will be the ace of hearts.
45. An article manufactured by a company consists of two parts X and Y. In the process of manufacture of part X, 9 out of 104 parts may be defective. Similarly, 5 out of 100 are likely to be defective in the manufacture of part Y. Compute the probability that the assembled product will not be defective. 46. A salesman has 80% chance of making a sale to each customer. The behaviour of each customer is independent. If two customers A and B enter, what is the probability that the salesman will make a sale to A or B? 47. A problem in economics is given to 3 students whose chances of solving it are $\frac{2}{3}$, $\frac{3}{4}$ and $\frac{4}{5}$ respectively. What is the probability that the problem will be solved?
48. A man and a woman appear in an interview for two vacancies in the same post. The probability of man's selection is the probability that (a) both of them will be selected? (b) only one of them will be selected? (c) none of them will be selected? (d) at least one of them will be selected?
49. What is the chance that a non-leap year selected at random will contain 53 Sundays? 50. In a group of equal number of men and women 15% of men and 30% of women are unemployed. What is the probability that a person selected at random is employed? 51. An anti-aircraft gun can take a maximum of four shots at enemy plane moving away from it. The probabilities of hitting the plane at first, second, third and fourth shot are 0.4, 0.3, 0.2 and 0.1 respectively. What is the probability that the gun hits the plane? 52. A piece of equipment will function only when all the three components A, B, C are working. The probability of A failing during one year is 0.15 and that of B failing is 0.05 and of C failing is 0.10. What is the probability that the equipment will fail before the end of the year? 53. A worker attends three machines each of which operates independently of the other two. The probabilities of events that machines will not require operator's intervention during a shift are p1 = 0.4, p2 = 0.3 and p3 = 0.2. Find the probability that at least one machine will require worker's intervention during a shift. 54. The probability that a contractor will get a plumbing contract is 2/3 and the probability that he will not get a electric contract is 5/9. If the probability of getting at least one of the contract is 4/5, what is the probability that he will get both? 55. An M.B.A. applies for job in two firms X and Y. The probability of his being selected in firm X is 0.7 and being rejected in the firm Y is 0.5. The probability of at least one of his application being rejected is 0.6. What is the probability that he will be selected in one of the firms? 56. A researcher has to consult a recently published book. The probability of its being available is 0.5 for library A and 0.7 for library B. Assuming the two events to be statistically independent, find the probability of book being available in library A and not available in library B. 57. 
An investment consultant predicts that the odds against the price of certain stock will go up next week are 2:1 and odds in favour of price remaining same are 1:3. What is the probability that price of the stock will go down during the week? 58. In a random sample of 1,000 residents of a city 700 read newspaper A and 400 read newspaper B. If the habit of reading newspaper A and B is independent, what is the probability that a person selected at random would be reading (a) both the papers, (b) exactly one of the papers, (c) at least one of the papers? Also find the absolute number of persons in each of the cases (a), (b) and (c). 59. The odds that a book will be reviewed favourably by three independent experts are 5 to 2, 4 to 3 and 3 to 4 respectively. What is the probability that of the three reviews a majority will be favourable? 60. In a certain city two newspapers, A and B, are published. It is known that 25% of the city population reads A and 20% reads B while 8% reads both A and B. It is also known that 30% of those who read A but not B look into advertisements and 40% of those who read B but not A look into advertisements while 50% of those who read both A and B look into advertisements. What is percentage of population who reads an advertisement? 61. The probability that a new entrant to a college will be a student of economics is 1/3, that he will be a student of political science is 7/10 and that he will not be a student of economics and political science is 1/5. If one of the new entrants is selected at random, what is the probability that (a) he will be a student of economics and political science, (b) he will be a student of economics if he is a student of
political science? Comment upon the independence of two events: a student of economics and a student of political science. 62. 20% of all students at a university are graduates and 80% are undergraduates. The probability that a graduate student is married is 0.5 and the probability that an undergraduate student is married is 0.1. One student is selected at random. (a) What is the probability that he is married? (b) What is the probability that he is a graduate if he is found to be married?
63. In a city three daily newspapers X, Y and Z are published. 40% of the people of the city read X, 50% read Y, 30% read Z, 20% read both X and Y, 15% read X and Z, 10% read Y and Z and 24% read all the 3 papers. Calculate the percentage of people who do not read any of the 3 newspapers. 64. A bag contains 4 red and 3 blue balls. Two drawings of 2 balls are made. Find the probability of drawing first 2 red balls and the second 2 blue balls (i) if the balls are returned to the bag after the first draw, (ii) if the balls are not returned after the first draw.
65. A die is loaded in such a way that each odd number is twice as likely to occur as each even number. Find (i) the probability that the number rolled is a perfect square and (ii) the probability that the number rolled is a perfect square provided it is greater than 3. 66. There are 100 students in a college class of which 36 boys are studying statistics and 13 girls are not studying statistics. If there are 55 girls in all, find the probability that a boy picked at random is not studying statistics. 67. If a pair of dice is thrown, find the probability that (i) the sum is neither 7 nor 11, (ii) the sum is neither 8 nor 10, (iii) the sum is greater than 12. 68. Three horses A, B and C are in a race. A is twice as likely to win as B, and B is twice as likely to win as C. What are the respective probabilities of winning? 69. A sample of 3 items is selected at random from a box containing 12 items of which 3 are defective. Find the possible number of defective combinations of the said 3 selected items along with their respective probabilities. 70. In an examination 30% of students have failed in mathematics, 20% of the students have failed in chemistry and 10% have failed in both mathematics and chemistry. A student is selected at random. (i) What is the probability that the student has failed either in mathematics or in chemistry? (ii) What is the probability that the student has failed in mathematics if it is known that he has failed in chemistry?
71. There are two bags. The first contains 2 red and 1 white balls whereas the second bag contains 1 red and 2 white balls. One ball is taken out at random from the first bag and is being put in the second. Then, a ball is chosen at random from the second bag. What is the probability that this ball is red? 72. From the sale force of 150 people, one will be chosen to attend a special meeting. If 52 are single and 72 are college graduates, and 3/4 of 52 that are single are college graduates, what is the probability that a sales person, selected at random, will be neither single nor a college graduate?
73. Data on readership of a certain magazine indicate that the proportion of male readers over 30 years old is 0.20. The proportion of male readers under 30 is 0.40. If the proportion of readers under 30 is 0.70, what is the proportion of subscribers that are male? Also find the probability that a randomly selected male subscriber is under 30. 74. Two union leaders and 10 directors of a company sit randomly to decide upon the wage hike as demanded by the union. Find the probability that there will be exactly three directors between the two union leaders. 75. Suppose a company hires both MBAs and non-MBAs for the same kind of managerial task. After a period of employment some of each category are promoted and some are not. Table below gives the proportion of company's managers among the said classes :
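| Promotional Status | MBA (A) | Non-MBA ($\bar{A}$) | Total |
|---|---|---|---|
| Promoted (B) | 0.42 | 0.28 | 0.70 |
| Not Promoted ($\bar{B}$) | 0.18 | 0.12 | 0.30 |
| Total | 0.60 | 0.40 | 1.00 |

(Columns give the academic qualification; rows give the promotional status.)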
Calculate P(A/B) and P(B/A), and find out whether A and B are independent events. 76. Each of A, B and C throws with two dice for a prize. The highest throw wins, but if equal highest throws occur the players with these throws continue. If A throws 10 find his chance of winning. 77. The probability of a man hitting a target is 1/4. How many times must he fire so that the probability of hitting the target at least once is greater than 2/3? 78. Find the probability that an assessee files his tax return and cheats on it, given that 70% of all assessees file returns and 25% of those who file, cheat. 79. The probability of an aircraft engine failure is 0.10. With how many engines should the aircraft be equipped to be 0.999 sure against an engine failure? Assume that only one engine is needed for successful operation of the aircraft. 80. A market research firm is interested in surveying certain attitudes in a small community. There are 125 households broken down according to income, ownership of a telephone and ownership of a T.V.

| | Income Rs. 1,00,000 or less: Telephone subscriber | Income Rs. 1,00,000 or less: No Telephone | Income above Rs. 1,00,000: Telephone subscriber | Income above Rs. 1,00,000: No Telephone |
|---|---|---|---|---|
| Own T.V. set | 59 | 10 | 40 | 5 |
| No T.V. set | 2 | 4 | 4 | 1 |

(a) If a person is selected at random, what is the probability that he is a T.V. owner?
(b) If the person selected at random is found to be having income greater than 1,00,000 and a telephone subscriber, what is the probability that he is a T.V. owner?
(c) What is the conditional probability of drawing a household that owns a T.V., given that he is a telephone subscriber?
81. An investment firm purchases three stocks for one week trading purposes. It assesses the probability that the stock will increase in value over the week as 0.8, 0.7 and 0.6 respectively. What is the chance that (a) all the three stocks will increase, and (b) at least two stocks will increase? (Assume that the movements of these stocks are independent.) 82. A company has two plants to manufacture scooters. Plant I manufactures 70% of the scooters and plant II manufactures 30%. At plant I 80% of the scooters are rated standard quality and at plant II 90% of the scooters are rated standard quality. A scooter is picked up at random and is found to be standard quality. What is the chance that it has been produced by plant I? 83. A person has 4 coins each of a different denomination. How many different sums of money can be formed? 84. Two sets of candidates are competing for the positions on the Board of Directors of a company. The probabilities of winning are 0.7 and 0.3 for the two. If the first set wins, they will introduce a new product with the probability 0.4. Similarly, the probability that the second set will introduce a new product is 0.8. If the new product has been introduced, what is the chance that the first set of candidates has won? 85. By examining the chest X-ray, the probability that T.B. is detected when a person is actually suffering is 0.99. The probability that the doctor diagnoses incorrectly, that a person has T.B., on the basis of X-ray is 0.001. In a certain city, 1 in 1000 persons suffers from T.B. A person selected at random is diagnosed to have T.B. What is the chance that he actually has T.B.? 86. The compressors used in refrigerators are manufactured by XYZ company at three factories located at Pune, Nasik and Nagpur. It is known that the Pune factory produces twice as many compressors as the Nasik one, which produces the same number as the Nagpur one (during the same period).
Experience also shows that 0.2% of the compressors produced at Pune and Nasik and 0.4% of those produced at Nagpur are defective. A quality control engineer while maintaining a refrigerator finds a defective compressor. What is the probability that Nasik factory is not to be blamed? 87. A company estimates that the probability of a person buying its product after seeing the advertisement is 0.7. If 60% of the persons have come across the advertisement, What is the probability that the person, who buys the product, has not come across the advertisement? 88. In an automobile factory, certain parts are to be fixed to the chassis in a section before it moves into another section. On a given day, one of the three persons A, B or C carries out this task. A has 45%, B has 35% and C has 20% chance of doing it. The probabilities that A, B or C will take more than the allotted time are
$\frac{1}{16}$, $\frac{1}{10}$ and $\frac{1}{20}$ respectively. If it is found that one of them has taken more time, what is the probability that A has taken more time? 89. The probabilities of X, Y and Z becoming managers are $\frac{4}{9}$, $\frac{2}{9}$ and $\frac{1}{3}$ respectively. The probabilities that the Bonus Scheme will be introduced if X, Y or Z become manager are $\frac{3}{10}$, $\frac{1}{2}$ and $\frac{4}{5}$ respectively. (a) What is the probability that the Bonus Scheme will be introduced? (b) What is the probability that X was appointed as manager given that the Bonus Scheme has been introduced?
90. There are 3 bags. The first bag contains 5 red and 3 black balls, the second contains 4 red and 5 black balls and the third contains 3 red and 4 black balls. A bag is selected at random and the two balls drawn, at random, are found to be red. Revise the probabilities of selection of each bag in the light of this observation. 91. On an average, 20% of the persons going to a handicraft emporium are foreigners and the remaining 80% are local persons. 75% of foreigners and 50% of local persons are found to make purchases. If a bundle of purchased items is sent to the cash counter, what is the probability that the purchaser is a foreigner? 92. The chance that doctor A will diagnose disease B correctly is 60%. The chance that a patient will die by his treatment after correct diagnosis is 40% and the chance of death by wrong diagnosis is 70%. A patient of doctor A, who had disease B, died. What is the chance that his disease was correctly diagnosed? 93. A company has four production sections S1, S2, S3 and S4 which contribute 30%, 20%, 28% and 22%, respectively, to the total output. It was observed that these sections produced 1%, 2%, 3% and 4% defective units respectively. If a unit is selected at random and found to be defective, what is the probability that it has come from either S1 or S4? 94. A factory produces certain type of output by three machines. The respective daily production figures are : machine A = 3,000 units, machine B = 2,500 units, machine C = 4,500 units. Past experience shows that 1 % of the output produced by machine A is defective. The corresponding fractions of defectives for the other two machines are 1.2 and 2% respectively. An item is selected at random from a day's production and is found to be defective. What is the probability that it came from the output of (i) machine A, (ii) machine B, (iii) machine C? 95. 
It is known that 20% of the males and 5% of the females are unemployed in a certain town consisting of an equal number of males and females. A person selected at random is found to be unemployed. What is the probability that he/she is a (i) male, (ii) female? 96. In a typing-pool, three typists share the total work in the ratio 30%, 35% and 35% of the total work. The first, second and the third typist spoil the work to the extent of 3%, 4% and 5% respectively. A completed work is selected at random and found to be spoiled. What is the probability that the work was done by the third typist? 97. An organisation dealing with consumer products, wants to introduce a new product in the market. Based on their past experience, it has a chance of 65% of being successful and 35% of not being successful. In order to help them to make a decision on the new product, i.e., whether to introduce the new product or not, it decides to get additional information on consumers' attitude towards the product. For this purpose, the organisation decides to conduct a survey. In the past, when the product of this type were successful, the surveys yielded favourable indications 85% of the times, whereas unsuccessful products received favourable indications 30% of the time. Determine the probability of the product being a success given the survey information. 98. In a class of 75 students, 15 were considered to be very intelligent, 45 as medium and the rest below average. The probability that a very intelligent student fails in a viva-voce examination is 0.005; the medium student failing has a probability 0.05; and the corresponding probability for a below average student is 0.15. If a student is known to have passed the viva-voce examination, what is the probability that he is below average?
99. Comment on the following statements:
(a) Since accident statistics show that the probability that a person will be involved in a road accident is 0.02, the probability that he will be involved in 2 accidents in that year is 0.0004.
(b) For three mutually exclusive events A, B and C of a sample space S, $P(A) = \frac{1}{3}$, $P(B) = \frac{3}{5}$ and $P(C) = \frac{1}{5}$.
(c) A and B are two events in a sample space S where $P(A) = \frac{5}{6}$, $P(B) = \frac{2}{3}$ and $P(A \cap B) = \frac{2}{5}$.
(d) Four persons are asked the same question by an interviewer. If each has, independently, probability of 1/6 of answering correctly, the probability that at least one answers correctly is $4 \times \frac{1}{6} = \frac{2}{3}$.
(e) The probability that A and B, working independently, will solve a problem is $\frac{2}{3}$ and the probability that A will solve the problem is $\frac{1}{3}$.
(f) For a biased dice the probabilities for different faces to turn up are as given in the following table:
Number on the dice: 1 2 3 4 5 6
(g) If the probability of A to fail in an examination is 0.15 and that for B is 0.27, then the probability that either A or B fails in examination is 0.42.
(h) If the probability that Congress wins from a constituency is 0.40 and that B.J.P. wins from the same constituency is 0.42, then the probability that either Congress or B.J.P. wins from that constituency is 0.82.
(i) The probability of occurrence of event A is 0.6 and the probability of occurrence of at least one of the four events A, B, C and D is 0.5.
100. Four alternative answers are given to each question. Point out the correct answer:
(a) If A and B are any two events of a sample space S, then $P(A \cup B) + P(A \cap B)$ equals
(i) $P(A) + P(B)$ (ii) $1 - P(\bar{A} \cup \bar{B})$ (iii) $1 - P(\bar{A} \cap \bar{B})$ (iv) none of the above.
(b) If A and B are independent and mutually exclusive events, then
(i) $P(A) = P(A/B)$ (ii) $P(B) = P(B/A)$
(iii) either P(A) or P(B) or both must be zero. (iv) none of the above.
(c) If A and B are independent events, then $P(A \cap B)$ equals
(i) $P(A) + P(B)$ (ii) $P(A) \cdot P(B/A)$ (iii) $P(B) \cdot P(A/B)$ (iv) $P(A) \cdot P(B)$
(d) If A and B are independent events, then $P(A \cup B)$ equals
(i) $P(A) \cdot P(B) + P(B)$ (ii) $P(A) \cdot P(\bar{B}) + P(B)$ (iii) $P(\bar{A}) \cdot P(\bar{B}) + P(A)$
(iv) none of the above.
(e) If A and B are two events such that $P(A \cup B) = \frac{5}{6}$, $P(A \cap B) = \frac{1}{3}$, $P(A) = \frac{1}{3}$, the events are
(i) dependent (ii) independent
(iii) mutually exclusive (iv) none of the above. 101. Which of the following statements are TRUE or FALSE : (i) (ii) The probability of an impossible event is always zero. The number of permutations is always greater than the number of combinations.
(iii) If two events are independent, then they will also be mutually exclusive. (iv) If P(A) and P(B) are non-zero and A and B are independent, then they cannot be mutually exclusive. (v) If P(A) and P(B) are non-zero and A and B are mutually exclusive, then they may be independent.
(vi) The probability that the roof of a room will fall on the floor can be determined with the help of Classical definition. (vii) Personal judgement or experience cannot be used in the assignment of probabilities. (viii) Revision of the past probabilities of various events is possible on the basis of the outcome of the experiment. (ix) The probability of occurrence of an event cannot be a negative number. (x) The probability of occurrence of an event that is sure to occur can be greater than unity.
(a) The probability of getting a number greater than 4 from the throw of an unbiased dice is
(i) $\frac{1}{2}$ (ii) $\frac{1}{3}$ (iii) $\frac{1}{4}$
(b) The probability of getting exactly one tail in the toss of two unbiased coins is
(i) $\frac{1}{2}$ (ii) $\frac{2}{3}$ (iii) $\frac{1}{4}$
(c) (ii) $\frac{3}{8}$ (iii) $\frac{5}{8}$
(d) Four dice and six coins are tossed simultaneously. The number of elements in the sample space are
(i) $4^6 \times 6^2$ (ii) $2^6 \times 6^2$ (iii) $6^4 \times 2^6$ (iv) none of these.
(e) Two cards are drawn successively without replacement from a well-shuffled pack of 52 cards. The probability that one of them is king and the other is queen is
(i) $\frac{8}{13 \times 51}$ (ii) $\frac{4}{13 \times 51}$ (iii) $\frac{1}{13 \times 17}$
(f) Two unbiased dice are rolled. The chance of obtaining an even sum is
(i) $\frac{1}{4}$ (ii) $\frac{1}{2}$ (iii) $\frac{1}{3}$ (iv) none of these.
(g) Two unbiased dice are rolled. The chance of obtaining a six only on the second die is
(i) $\frac{5}{6}$ (ii) $\frac{1}{6}$ (iii) $\frac{1}{4}$
(h) If P(A)
(i) 1 : 4 (ii) 5 : 4 (iii) 4 : 5
(i) The probability of occurrence of an event A is 0.60 and that of B is 0.25. If A and B are mutually exclusive events, then the probability of occurrence of neither of them is
(i) 0.35 (ii) 0.75 (iii) 0.15
(j) The probability of getting at least one head in 3 throws of an unbiased coin is
(i) $\frac{1}{8}$ (ii) $\frac{7}{8}$ (iii) $\frac{3}{8}$ (iv) none of these.
MODEL ANSWERS TO QUESTIONS FOR DISCUSSION
(c) enumeration (e) True
LESSON
11
THEORETICAL PROBABILITY DISTRIBUTIONS
CONTENTS
11.0 Aims and Objectives
11.1 Introduction
11.2 Probability Distribution
11.3 Binomial Distribution
11.4 Hypergeometric Distribution
11.5 Pascal Distribution
11.6 Geometrical Distribution
11.7 Uniform Distribution (Discrete Random Variable)
11.8 Poisson Distribution
11.9 Exponential Distribution
11.10 Uniform Distribution (Continuous Variable)
11.11 Normal Distribution
11.12 Let us Sum Up
11.13 Lesson-end Activity
11.14 Keywords
11.15 Questions for Discussion
11.16 Terminal Questions
11.17 Model Answers to Questions for Discussion
11.18 Suggested Readings
11.1 INTRODUCTION
A manager is usually forced to make decisions when there is uncertainty as to what will happen after the decisions are made. In this situation the mathematical theory of probability furnishes a tool that can be of great help to the decision maker. A probability function is a rule that assigns probabilities to each element of a set of events that may occur. A probability distribution can be either discrete or continuous. A discrete probability distribution is described by a probability mass function and a continuous one by a probability density function.
Since out of n trials any r trials can be successes, the number of sequences showing any r trials as success and the remaining (n - r) trials as failure is $^nC_r$, where the probability of each such sequence is $p^r q^{n-r}$. Hence, the required probability is $P(r) = {}^nC_r\, p^r q^{n-r}$, where r = 0, 1, 2, ...... n.

| r | 0 | 1 | 2 | ...... | n | Total |
|---|---|---|---|---|---|---|
| P(r) | ${}^nC_0\, p^0 q^n$ | ${}^nC_1\, p\, q^{n-1}$ | ${}^nC_2\, p^2 q^{n-2}$ | ...... | ${}^nC_n\, p^n q^0$ | 1 |
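As a quick numerical check of the probability function above, the following Python sketch (not part of the original text) evaluates $P(r) = {}^nC_r\, p^r q^{n-r}$ for every r and confirms that the probabilities add up to 1. The function name `binomial_pmf` and the values n = 5, p = 0.3 are illustrative assumptions.

```python
from math import comb


def binomial_pmf(r, n, p):
    """Probability of exactly r successes in n independent trials."""
    q = 1 - p  # probability of failure in a single trial
    return comb(n, r) * p**r * q ** (n - r)


# Illustrative values only; they do not come from the text.
n, p = 5, 0.3
probabilities = [binomial_pmf(r, n, p) for r in range(n + 1)]
total = sum(probabilities)

for r, prob in enumerate(probabilities):
    print(r, round(prob, 5))
print("Total:", round(total, 5))
```

Any other n and 0 < p < 1 can be substituted; the total should remain 1 up to floating-point rounding.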
It should be noted here that the probabilities obtained for various values of r are the terms in the binomial expansion of $(q + p)^n$ and thus, the distribution is termed as Binomial Distribution. $P(r) = {}^nC_r\, p^r q^{n-r}$ is termed as the probability function or probability mass function (p.m.f.) of the distribution.
Summary Measures of Binomial Distribution
(a) Mean: The mean of a binomial variate r, denoted by $\mu$, is equal to E(r), i.e.,

$$\mu = E(r) = \sum_{r=0}^{n} r\,{}^nC_r\, p^r q^{n-r} = np \sum_{r=1}^{n} \frac{(n-1)!}{(r-1)!\,(n-r)!}\, p^{r-1} q^{n-r} = np\,(q+p)^{n-1} = np \qquad [\because q + p = 1]$$

(b) Variance: The variance of r, denoted by $\sigma^2$, is given by

$$\sigma^2 = E(r - np)^2 = E(r^2) - n^2p^2 \qquad ....\ (1)$$

Thus, to find $\sigma^2$, we first determine $E(r^2)$. Now,

$$E(r^2) = \sum_{r=1}^{n} r^2\,{}^nC_r\, p^r q^{n-r} = \sum_{r=1}^{n} \left[r(r-1) + r\right] {}^nC_r\, p^r q^{n-r}$$

$$= \sum_{r=2}^{n} \frac{r(r-1)\, n!}{r!\,(n-r)!}\, p^r q^{n-r} + np = \sum_{r=2}^{n} \frac{n!}{(r-2)!\,(n-r)!}\, p^r q^{n-r} + np$$

$$= \sum_{r=2}^{n} \frac{n(n-1)\,(n-2)!}{(r-2)!\,(n-r)!}\, p^r q^{n-r} + np = n(n-1)p^2 \sum_{r=2}^{n} \frac{(n-2)!}{(r-2)!\,(n-r)!}\, p^{r-2} q^{n-r} + np$$

$$= n(n-1)p^2 (q+p)^{n-2} + np = n(n-1)p^2 + np$$

Substituting this value in equation (1), we get

$$\sigma^2 = n(n-1)p^2 + np - n^2p^2 = np(1-p) = npq$$

Or the standard deviation $= \sqrt{npq}$
Remarks: $\sigma^2 = npq = \text{mean} \times q$, which shows that $\sigma^2 < \text{mean}$, since 0 < q < 1.
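The results mean = np and variance = npq can be cross-checked by summing directly over the probability function. The sketch below is an illustration only: the helper name `binomial_moments` and the values n = 10, p = 1/4 are assumptions, and exact fractions are used so that the comparison is free of rounding error.

```python
from fractions import Fraction
from math import comb


def binomial_moments(n, p):
    """Return (mean, variance) computed by direct summation over r = 0..n."""
    q = 1 - p
    pmf = [comb(n, r) * p**r * q ** (n - r) for r in range(n + 1)]
    mean = sum(r * pr for r, pr in enumerate(pmf))            # E(r)
    second_moment = sum(r * r * pr for r, pr in enumerate(pmf))  # E(r^2)
    variance = second_moment - mean**2                        # E(r^2) - [E(r)]^2
    return mean, variance


# Illustrative values only; they do not come from the text.
n, p = 10, Fraction(1, 4)
mean, variance = binomial_moments(n, p)

print("mean:", mean, "np:", n * p)
print("variance:", variance, "npq:", n * p * (1 - p))
```

For these values both routes give mean 5/2 and variance 15/8, and the variance is smaller than the mean, as the remark above states.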
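(c) Skewness and kurtosis: The third and fourth central moments of the binomial distribution are $\mu_3 = npq(q - p)$ and $\mu_4 = 3n^2p^2q^2 + npq(1 - 6pq)$. Also,

$$\beta_1 = \frac{\mu_3^2}{\mu_2^3} = \frac{n^2p^2q^2 (q-p)^2}{n^3p^3q^3} = \frac{(q-p)^2}{npq}$$

Since $\mu_3$ has the sign of (q - p), the above result shows that the distribution is symmetrical when $p = q = \frac{1}{2}$, negatively skewed if q < p, and positively skewed if q > p.

$$\beta_2 = \frac{\mu_4}{\mu_2^2} = \frac{3n^2p^2q^2 + npq(1 - 6pq)}{n^2p^2q^2} = 3 + \frac{1 - 6pq}{npq}$$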
The above result shows that the distribution is leptokurtic if 6pq < 1, platykurtic if 6pq > 1 and mesokurtic if 6pq = 1.
(d) Mode: Mode is that value of the random variable for which probability is maximum. If r is mode of a binomial distribution, then we have

$$P(r-1) \le P(r) \ge P(r+1)$$

Consider the inequality $P(r) \ge P(r+1)$

or $${}^nC_r\, p^r q^{n-r} \ge {}^nC_{r+1}\, p^{r+1} q^{n-r-1}$$

or $$\frac{n!}{r!\,(n-r)!}\, p^r q^{n-r} \ge \frac{n!}{(r+1)!\,(n-r-1)!}\, p^{r+1} q^{n-r-1}$$

or $$\frac{1}{n-r} \cdot q \ge \frac{1}{r+1} \cdot p \quad \text{or} \quad qr + q \ge np - pr$$

Solving the above inequality for r, we get

$$r \ge (n+1)p - 1 \qquad ....\ (1)$$

Similarly, on solving the inequality $P(r-1) \le P(r)$ for r, we get

$$r \le (n+1)p \qquad ....\ (2)$$

Combining inequalities (1) and (2), we get

$$(n+1)p - 1 \le r \le (n+1)p$$

Case I: When (n + 1)p is not an integer
When (n + 1)p is not an integer, then (n + 1)p - 1 is also not an integer. Therefore, mode will be an integer between (n + 1)p - 1 and (n + 1)p, or mode will be the integral part of (n + 1)p.
Case II: When (n + 1)p is an integer
When (n + 1)p is an integer, the distribution will be bimodal and the two modal values would be (n + 1)p - 1 and (n + 1)p.
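The two cases above can be sketched in Python and compared with a brute-force search for the largest probability. This is an illustration under stated assumptions: the function names `binomial_modes` and `modes_by_search` are hypothetical, 0 < p < 1 is assumed, and p is passed as an exact fraction so that the integer test on (n + 1)p is reliable.

```python
from fractions import Fraction
from math import comb, floor


def binomial_modes(n, p):
    """Mode(s) from the rule (n + 1)p - 1 <= r <= (n + 1)p, assuming 0 < p < 1."""
    m = (n + 1) * p
    if m == floor(m):
        # Case II: (n + 1)p is an integer, so the distribution is bimodal.
        return [int(m) - 1, int(m)]
    # Case I: the mode is the integral part of (n + 1)p.
    return [floor(m)]


def modes_by_search(n, p):
    """Mode(s) found by evaluating every P(r) and keeping the largest."""
    q = 1 - p
    pmf = [comb(n, r) * p**r * q ** (n - r) for r in range(n + 1)]
    peak = max(pmf)
    return [r for r, pr in enumerate(pmf) if pr == peak]


# Illustrative values only: (n + 1)p = 7/5 (Case I) and (n + 1)p = 3 (Case II).
print(binomial_modes(6, Fraction(1, 5)), modes_by_search(6, Fraction(1, 5)))
print(binomial_modes(5, Fraction(1, 2)), modes_by_search(5, Fraction(1, 2)))
```

With floating-point p the equality test on (n + 1)p may fail because of rounding, which is why exact fractions are used in this sketch.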
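Example 1: An unbiased die is tossed three times. Find the probability of obtaining (a) no six, (b) one six, (c) at least one six, (d) two sixes and (e) three sixes.
Solution: The three tosses of a die can be taken as three repeated trials which are independent. Let the occurrence of six be termed as a success. Therefore, r will denote the number of sixes obtained. Further, n = 3 and $p = \frac{1}{6}$.
(a) Probability of obtaining no six, i.e.,
$$P(r = 0) = {}^3C_0\, p^0 q^3 = 1 \cdot \left(\frac{1}{6}\right)^0 \left(\frac{5}{6}\right)^3 = \frac{125}{216}$$
(b) $$P(r = 1) = {}^3C_1\, p^1 q^2 = 3 \cdot \left(\frac{1}{6}\right) \left(\frac{5}{6}\right)^2 = \frac{25}{72}$$
(c) $$P(r \ge 1) = 1 - P(r = 0) = 1 - \frac{125}{216} = \frac{91}{216}$$
(d) $$P(r = 2) = {}^3C_2\, p^2 q = 3 \cdot \left(\frac{1}{6}\right)^2 \left(\frac{5}{6}\right) = \frac{5}{72}$$
(e) $$P(r = 3) = {}^3C_3\, p^3 q^0 = \left(\frac{1}{6}\right)^3 = \frac{1}{216}$$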
Example 2: Assuming that it is true that 2 in 10 industrial accidents are due to fatigue, find the probability that:
(a) Exactly 2 of 8 industrial accidents will be due to fatigue.
(b) At least 2 of the 8 industrial accidents will be due to fatigue.

Solution: Eight industrial accidents can be regarded as Bernoulli trials each with probability of success p = 2/10 = 1/5. The random variable r denotes the number of accidents due to fatigue.

(a) $$P(r = 2) = {}^{8}C_{2}\left(\tfrac{1}{5}\right)^{2}\left(\tfrac{4}{5}\right)^{6} = 0.294$$

(b) We have to find $P(r \ge 2)$. We can write $P(r \ge 2) = 1 - P(0) - P(1)$; thus, we first find P(0) and P(1). We have

$$P(0) = {}^{8}C_{0}\left(\tfrac{1}{5}\right)^{0}\left(\tfrac{4}{5}\right)^{8} = 0.168 \quad \text{and} \quad P(1) = {}^{8}C_{1}\left(\tfrac{1}{5}\right)^{1}\left(\tfrac{4}{5}\right)^{7} = 0.336$$

∴ $P(r \ge 2) = 1 - 0.168 - 0.336 = 0.496$

Example 3: The proportion of male and female students in a class is found to be 1 : 2. What is the probability that out of 4 students selected at random with replacement, 2 or more will be females?

Solution: Let the selection of a female student be termed as a success. Since the selection of a student is made with replacement, the selection of 4 students can be taken as 4 repeated trials each with probability of success p = 2/3.

Thus, $P(r \ge 2) = P(r = 2) + P(r = 3) + P(r = 4)$
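$$= {}^{4}C_{2}\left(\tfrac{2}{3}\right)^{2}\left(\tfrac{1}{3}\right)^{2} + {}^{4}C_{3}\left(\tfrac{2}{3}\right)^{3}\left(\tfrac{1}{3}\right) + {}^{4}C_{4}\left(\tfrac{2}{3}\right)^{4} = \tfrac{8}{9}$$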
Note that $P(r \ge 2)$ can alternatively be found as $1 - P(0) - P(1)$.

Example 4: The probability of a bomb hitting a target is 1/5. Two bombs are enough to destroy a bridge. If six bombs are aimed at the bridge, find the probability that the bridge is destroyed.

Solution: Here n = 6 and p = 1/5.

The bridge will be destroyed if at least two bombs hit it. Thus, we have to find $P(r \ge 2)$. This is given by

$$P(r \ge 2) = 1 - P(0) - P(1) = 1 - {}^{6}C_{0}\left(\tfrac{4}{5}\right)^{6} - {}^{6}C_{1}\left(\tfrac{1}{5}\right)\left(\tfrac{4}{5}\right)^{5} = \tfrac{1077}{3125}$$
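Example 5: An insurance salesman sells policies to 5 men all of identical age and good health. According to the actuarial tables, the probability that a man of this particular age will be alive 30 years hence is 2/3. Find the probability that 30 years hence (i) at least 1 man will be alive, (ii) at least 3 men will be alive.

Solution: Let the event that a man will be alive 30 years hence be termed as a success. Therefore, n = 5 and p = 2/3.

(i) $$P(r \ge 1) = 1 - P(r = 0) = 1 - {}^{5}C_{0}\left(\tfrac{2}{3}\right)^{0}\left(\tfrac{1}{3}\right)^{5} = \tfrac{242}{243}$$

(ii) $$P(r \ge 3) = {}^{5}C_{3}\left(\tfrac{2}{3}\right)^{3}\left(\tfrac{1}{3}\right)^{2} + {}^{5}C_{4}\left(\tfrac{2}{3}\right)^{4}\left(\tfrac{1}{3}\right) + {}^{5}C_{5}\left(\tfrac{2}{3}\right)^{5} = \tfrac{192}{243} = \tfrac{64}{81}$$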
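Example 6: Ten percent of items produced on a machine are usually found to be defective. What is the probability that in a random sample of 12 items (i) none, (ii) one, (iii) two, (iv) at the most two, (v) at least two items are found to be defective?

Solution: Let the event that an item is found to be defective be termed as a success. Thus, we are given n = 12 and p = 0.1.

(i) $P(r = 0) = {}^{12}C_{0}\,(0.1)^{0}(0.9)^{12} = 0.2824$
(ii) $P(r = 1) = {}^{12}C_{1}\,(0.1)^{1}(0.9)^{11} = 0.3766$
(iii) $P(r = 2) = {}^{12}C_{2}\,(0.1)^{2}(0.9)^{10} = 0.2301$
(iv) $P(r \le 2) = 0.2824 + 0.3766 + 0.2301 = 0.8891$
(v) $P(r \ge 2) = 1 - P(0) - P(1) = 1 - 0.2824 - 0.3766 = 0.3410$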
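The five probabilities asked for in Example 6 can be verified with a few lines of Python. This is only a sketch of the binomial formula ${}^{n}C_{r}\, p^{r} q^{n-r}$; the helper name `binomial_pmf` and the dictionary `results` are assumptions made for illustration.

```python
from math import comb

def binomial_pmf(r, n, p):
    """P(r successes in n independent trials, each with success probability p)."""
    return comb(n, r) * p**r * (1 - p)**(n - r)

n, p = 12, 0.1                       # data of Example 6
p0, p1, p2 = (binomial_pmf(r, n, p) for r in range(3))
results = {
    "none": p0,
    "one": p1,
    "two": p2,
    "at the most two": p0 + p1 + p2,
    "at least two": 1 - p0 - p1,
}
for label, value in results.items():
    print(f"{label:>16}: {value:.4f}")
```

The same helper can be reused for the other binomial examples of this lesson by changing n and p.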
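Example 7: In a large group of students 80% have a recommended statistics book. Three students are selected at random. Find the probability distribution of the number of students having the book. Also compute the mean and variance of the distribution.

Solution: Let the event that 'a student selected at random has the book' be termed as a success. Since the group of students is large, 3 trials, i.e., the selection of 3 students, can be regarded as independent with probability of a success p = 0.8. Thus, the conditions of the given experiment satisfy the conditions of binomial distribution. The probability mass function is

$$P(r) = {}^{3}C_{r}\,(0.8)^{r}(0.2)^{3-r}, \quad \text{where } r = 0, 1, 2 \text{ and } 3$$

The mean is np = 3 × 0.8 = 2.4 and variance is npq = 2.4 × 0.2 = 0.48.

Example 8:
(a) The mean and variance of a discrete random variable X are 6 and 2 respectively. Assuming X to be a binomial variate, find $P(5 \le X \le 7)$.
(b) In a binomial distribution consisting of 5 independent trials, the probability of 1 and 2 successes are 0.4096 and 0.2048 respectively. Calculate the mean, variance and mode of the distribution.

Solution:
(a) It is given that np = 6 and npq = 2

∴ $q = \dfrac{npq}{np} = \dfrac{2}{6} = \dfrac{1}{3}$, so that $p = \dfrac{2}{3}$ and $n = 6 \times \dfrac{3}{2} = 9$.

$$P(5 \le X \le 7) = {}^{9}C_{5}\left(\tfrac{2}{3}\right)^{5}\left(\tfrac{1}{3}\right)^{4} + {}^{9}C_{6}\left(\tfrac{2}{3}\right)^{6}\left(\tfrac{1}{3}\right)^{3} + {}^{9}C_{7}\left(\tfrac{2}{3}\right)^{7}\left(\tfrac{1}{3}\right)^{2} = \tfrac{4672}{6561} \approx 0.7121$$

(b) Let p be the probability of a success. It is given that ${}^{5}C_{1}\, p(1-p)^{4} = 0.4096$ and ${}^{5}C_{2}\, p^{2}(1-p)^{3} = 0.2048$. Dividing the second equation by the first, we get

$$\frac{10\, p^{2}(1-p)^{3}}{5\, p\,(1-p)^{4}} = \frac{2p}{1-p} = \frac{0.2048}{0.4096} = \frac{1}{2}, \quad \text{so that } p = \frac{1}{5}$$

Thus, mean = np = 5 × 1/5 = 1 and variance = npq = 1 × 4/5 = 0.8. Since (n + 1)p = 6 × 1/5 = 1.2 is not an integer, mode is its integral part, i.e., 1.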
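Example 9: 5 unbiased coins are tossed simultaneously and the occurrence of a head is termed as a success. Write down various probabilities for the occurrence of 0, 1, 2, 3, 4, 5 successes. Find mean, variance and mode of the distribution.

Solution: Here n = 5 and p = q = 1/2.

The probability mass function is $P(r) = {}^{5}C_{r}\left(\tfrac{1}{2}\right)^{5}$, where r = 0, 1, 2, 3, 4, 5. The probabilities of various values of r are:

r    :  0     1     2      3      4     5
P(r) :  1/32  5/32  10/32  10/32  5/32  1/32

Mean = np = 5 × 1/2 = 2.5 and variance = npq = 2.5 × 1/2 = 1.25. Since (n + 1)p = 6 × 1/2 = 3 is an integer, the distribution is bimodal with the two modes at r = 2 and r = 3.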
Fitting of Binomial Distribution The fitting of a distribution to given data implies the determination of expected (or theoretical) frequencies for different values of the random variable on the basis of this data. The purpose of fitting a distribution is to examine whether the observed frequency distribution can be regarded as a sample from a population with a known probability distribution. To fit a binomial distribution to the given data, we find its mean. Given the value of n, we can compute the value of p and, using n and p, the probabilities of various values of the random variable can be computed. These probabilities are multiplied by total frequency to give the required expected frequencies. In certain cases, the value of p may be determined by the given conditions of the experiment. Example 10: The following data give the number of seeds germinating (X) out of 10 on damp filter for 80 sets of seed. Fit a binomial distribution to the data.
X : 0   1   2   3   4   5   6   7   8   9   10
f : 6   20  28  12  8   6   0   0   0   0   0

Solution: Here the random variable X denotes the number of seeds germinating out of a set of 10 seeds. The total number of trials n = 10. The mean of the given data

$$\bar{X} = \frac{0 \times 6 + 1 \times 20 + 2 \times 28 + 3 \times 12 + 4 \times 8 + 5 \times 6}{80} = \frac{174}{80} = 2.175$$

Since mean of a binomial distribution is np, ∴ np = 2.175. Thus, we get

$$p = \frac{2.175}{10} = 0.22 \text{ (approx.)}. \quad \text{Further, } q = 1 - 0.22 = 0.78.$$

Using these values, we can compute $P(X) = {}^{10}C_{X}\,(0.22)^{X}(0.78)^{10-X}$ and then expected frequency [= N × P(X)] for X = 0, 1, 2, ...... 10. The calculated probabilities and the respective expected frequencies are shown in the following table:

X       P(X)      N × P(X)    Approximated Frequency
0       0.0834      6.67        6
1       0.2351     18.81       19
2       0.2984     23.87       24
3       0.2244     17.96       18
4       0.1108      8.86        9
5       0.0375      3.00        3
6       0.0088      0.71        1
7       0.0014      0.11        0
8                   0.01        0
9                   0.00        0
10                  0.00        0
Total                          80
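The fitting procedure of Example 10 (find the mean, set p = mean/n, multiply each probability by the total frequency N) can be sketched in Python as follows. The variable names are illustrative assumptions; p is rounded to two decimals only to follow the approximation used in the text.

```python
from math import comb

# Observed data of Example 10: X = seeds germinating out of 10, f = frequency
freq = {0: 6, 1: 20, 2: 28, 3: 12, 4: 8, 5: 6, 6: 0, 7: 0, 8: 0, 9: 0, 10: 0}
n = 10
N = sum(freq.values())                              # total frequency
mean = sum(x * f for x, f in freq.items()) / N      # sample mean, equated to np
p = round(mean / n, 2)                              # 0.2175 taken as 0.22, as in the text
q = 1 - p

# Expected frequency for each X is N * P(X)
expected = {x: N * comb(n, x) * p**x * q**(n - x) for x in range(n + 1)}
for x, e in expected.items():
    print(f"X = {x:2d}  observed = {freq[x]:2d}  expected = {e:6.2f}")
```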
Features of Binomial Distribution

1. It is a discrete probability distribution.

2. It depends upon two parameters n and p. It may be pointed out that a distribution is known if the values of its parameters are known.

3. The total number of possible values of the random variable is n + 1. The successive binomial coefficients are ${}^{n}C_{0}, {}^{n}C_{1}, {}^{n}C_{2}, \ldots, {}^{n}C_{n}$. Further, since ${}^{n}C_{r} = {}^{n}C_{n-r}$, these coefficients are symmetric. The values of these coefficients, for various values of n, can be obtained directly by using Pascal's triangle.

PASCAL'S TRIANGLE

n = 1 :            1   1
n = 2 :          1   2   1
n = 3 :        1   3   3   1
n = 4 :      1   4   6   4   1
n = 5 :    1   5  10  10   5   1

We can note that it is very easy to write this triangle. In the first row, both the coefficients will be unity because ${}^{1}C_{0} = {}^{1}C_{1} = 1$. To write the second row, we write 1 in the beginning and the end and the value of the middle coefficient is obtained by adding the coefficients of the first row. Other rows of the Pascal's triangle can be written in a similar way.

4. (a) The shape and location of binomial distribution changes as the value of p changes for a given value of n. It can be shown that for a given value of n, if p is increased gradually in the interval (0, 0.5), the distribution changes from a positively skewed to a symmetrical shape. When p = 0.5, the distribution is perfectly symmetrical. Further, for larger values of p the distribution tends to become more and more negatively skewed.

(b) For a given value of p, which is neither too small nor too large, the distribution becomes more and more symmetrical as n becomes larger and larger.
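The rule for writing Pascal's triangle described under feature 3 (1 at both ends, every middle coefficient the sum of the two coefficients above it) can be sketched as a short Python generator. The function name `pascal_rows` is an illustrative assumption.

```python
def pascal_rows(n_max):
    """Yield the rows nC0, nC1, ..., nCn for n = 1, 2, ..., n_max."""
    row = [1, 1]                                   # first row: 1C0 and 1C1
    for _ in range(n_max):
        yield row
        # next row: 1, sums of adjacent coefficients of the current row, 1
        row = [1] + [a + b for a, b in zip(row, row[1:])] + [1]

for row in pascal_rows(5):
    print(" ".join(str(c) for c in row).center(24))
```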
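$$P(r) = \frac{{}^{k}C_{r} \times {}^{N-k}C_{n-r}}{{}^{N}C_{n}}, \quad \text{where } r = 0, 1, 2, \ldots, n. \text{ Also } n \le k.$$

The mean and variance of the distribution are $np$ and $\left(\dfrac{N-n}{N-1}\right) npq$ respectively, where $p = \dfrac{k}{N}$ and $q = 1 - p$.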
Example 11: A retailer has 10 identical television sets of a company out of which 4 are defective. If 3 televisions are selected at random, construct the probability distribution of the number of defective television sets.

Solution: Let the random variable r denote the number of defective televisions. In terms of notations, we can write N = 10, k = 4 and n = 3.

The distribution of r is hypergeometric. This distribution can also be written in a tabular form as given below:

r    :  0     1      2     3     Total
P(r) :  5/30  15/30  9/30  1/30  1
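The table of Example 11 can be reproduced with exact fractions. The sketch below assumes the usual hypergeometric formula ${}^{k}C_{r} \times {}^{N-k}C_{n-r} / {}^{N}C_{n}$; the name `hypergeom_pmf` is illustrative.

```python
from fractions import Fraction
from math import comb

def hypergeom_pmf(r, N, k, n):
    """P(r defectives in a sample of n drawn without replacement
    from N items of which k are defective)."""
    return Fraction(comb(k, r) * comb(N - k, n - r), comb(N, n))

N, k, n = 10, 4, 3                                 # data of Example 11
dist = {r: hypergeom_pmf(r, N, k, n) for r in range(n + 1)}
for r, pr in dist.items():
    print(r, pr)                                   # fractions print in lowest terms
print("mean =", sum(r * pr for r, pr in dist.items()))
```

Note that Python prints the fractions in lowest terms, so 5/30 appears as 1/6 and 15/30 as 1/2.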
$$P(r) = \frac{{}^{80}C_{r} \times {}^{120}C_{5-r}}{{}^{200}C_{5}}, \quad r = 0, 1, 2, 3, 4, 5$$

The probabilities for various values of r are given in the following table:

[Table: r against P(r)]

(ii) $$P(r) = {}^{5}C_{r}\,(0.4)^{r}(0.6)^{5-r}, \quad r = 0, 1, 2, 3, 4, 5$$

The probabilities for various values of r are given in the following table:

[Table: r against P(r)]

We note that these probabilities are in close conformity with the hypergeometric probabilities.
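The closeness of the two sets of probabilities can be illustrated numerically. The sketch below assumes N = 200, k = 80 and n = 5 for the hypergeometric case (values inferred from the surviving formula fragments and from p = 0.4, not stated explicitly in the text) and compares them with the binomial probabilities for n = 5 and p = 0.4.

```python
from math import comb

N, k, n = 200, 80, 5        # assumed values: k = 80 is inferred from p = k/N = 0.4
p = k / N

hyper = [comb(k, r) * comb(N - k, n - r) / comb(N, n) for r in range(n + 1)]
binom = [comb(n, r) * p**r * (1 - p)**(n - r) for r in range(n + 1)]

print(" r   hypergeometric   binomial")
for r in range(n + 1):
    print(f"{r:2d}   {hyper[r]:14.4f}   {binom[r]:8.4f}")
```

Here the sample size is only 2.5% of N, which is why drawing without replacement makes little difference.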
Check Your Progress 11.1

1. Write the characteristics of Binomial Distribution.
2. What is the use of Hypergeometric Distribution?

Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
$$P(n) = {}^{n-1}C_{r-1}\, p^{r-1} q^{n-r} \cdot p = {}^{n-1}C_{r-1}\, p^{r} q^{n-r}$$

It can be shown that the mean and variance of Pascal distribution are $\dfrac{r}{p}$ and $\dfrac{rq}{p^{2}}$ respectively.

This distribution is also known as Negative Binomial Distribution because various values of P(n) are given by the terms of the binomial expansion of $p^{r}(1 - q)^{-r}$.

When r = 1, the Pascal distribution can be written as

$$P(n) = {}^{n-1}C_{0}\, p\, q^{n-1} = p\, q^{n-1}$$

Here n is a random variable which denotes the number of trials required to get a success. This distribution is known as geometrical distribution. The mean and variance of the distribution are $\dfrac{1}{p}$ and $\dfrac{q}{p^{2}}$ respectively.
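The mean and variance of the geometrical distribution can be checked numerically by summing the series over a large number of trials. This is a minimal sketch with an arbitrarily chosen p = 0.25; the function names and the truncation at 400 trials are assumptions made for illustration.

```python
from math import comb

def pascal_pmf(n, r, p):
    """P(the r-th success occurs on trial n), for n = r, r + 1, ..."""
    return comb(n - 1, r - 1) * p**r * (1 - p)**(n - r)

def geometric_pmf(n, p):
    """Special case r = 1: first success on trial n, i.e. p * q**(n - 1)."""
    return pascal_pmf(n, 1, p)

p = 0.25                                  # illustrative value, not from the text
trials = range(1, 400)                    # the neglected tail is vanishingly small
mean = sum(n * geometric_pmf(n, p) for n in trials)
variance = sum(n**2 * geometric_pmf(n, p) for n in trials) - mean**2

print(round(mean, 6), 1 / p)              # compare with 1/p
print(round(variance, 6), (1 - p) / p**2) # compare with q/p^2
```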
Hint: Mean = np and Variance = npq.

3. (a) The probability of a man hitting a target is ... the probability of his hitting the target at least twice? (ii) How many times must he fire so that the probability of his hitting the target at least once is greater than 2/3?
(b) How many dice must be thrown so that there is better than even chance of obtaining at least one six?

Hint: (a) (ii) Probability of hitting the target at least once in n trials is $1 - \left(\tfrac{3}{4}\right)^{n}$.
4. A machine produces an average of 20% defective bolts. A batch is accepted if a sample of 5 bolts taken from the batch contains no defective and rejected if the sample contains 3 or more defectives. In other cases, a second sample is taken. What is the probability that the second sample is required?

Hint: A second sample is required if the first sample is neither rejected nor accepted.

5. A multiple choice test consists of 8 questions with 3 answers to each question (of which only one is correct). A student answers each question by throwing a balanced die and checking the first answer if he gets 1 or 2, the second answer if he gets 3 or 4 and the third answer if he gets 5 or 6. To get a distinction, the student must secure at least 75% correct answers. If there is no negative marking, what is the probability that the student secures a distinction?

6. What is the most probable number of times an ace will appear if a die is tossed (i) 50 times, (ii) 53 times?

7. Out of 320 families with 5 children each, what percentage would be expected to have (i) 2 boys and 3 girls, (ii) at least one boy, (iii) at the most one girl? Assume equal probability for boys and girls.

8. Fit a binomial distribution to the following data:

X : 0   1   2   3   4
f : 28  62  46  10  4
9.
A question paper contains 6 questions of equal value divided into two sections of three questions each. If each question poses the same amount of difficulty to Mr. X, an examinee, and he has only 50% chance of solving it correctly, find the answer to the following : (i) (ii) If Mr. X is required to answer only three questions from any one of the two sections, find the probability that he will solve all the three questions correctly. If Mr. X is given the option to answer the three questions by selecting one question out of the two standing at serial number one in the two sections, one question out of the two standing at serial number two in the two sections and one question out of the two standing at serial number three in the two sections, find the probability that he will solve all three questions correctly.
Hint: (i) A section can be selected in 2 C1 ways and the probability of attempting all the
1 3 three questions correctly is C3
F I H 2K 1 C F I H 2K
2 1
10. A binomial random variable satisfies the relation 9P(X = 4) = P(X = 2) for n = 6. Find the value of the parameter p. Hint: P ( X = 2) = 6C2 p 2 q 4 etc. 11. Three fair coins are tossed 3,000 times. Find the frequency distributions of the number of heads and tails and tabulate the results. Also calculate mean and standard deviation of each distribution.
Hint: See example 9. 12. Take 100 sets of 10 tosses of an unbiased coin. In how many cases do you expect to get (i) 6 heads and 4 tails and (ii) at least 9 heads? Hint: Use binomial distribution with n = 10 and p = 0.5. 13. In a binomial distribution consisting of 5 independent trials, the probabilities of 1 and 2 successes are 0.4096 and 0.2048 respectively. Find the probability of success. Hint: Use the condition P(1) = 2P(2). 14. For a binomial distribution, the mean and variance are respectively 4 and 3. Calculate the probability of getting a nonzero value of this distribution. Hint: Find P(r 0). 15. (a) There are 300, seemingly identical, tyres with a dealer. The probability of a tyre being defective is 0.3. If 2 tyres are selected at random, find the probability that there is non defective tyre. If instead of 300 tyres the dealer had only 10 tyres out of which 3 are defective, find the probability that no tyre is defective in a random sample of 2 tyres. Write down the probability distribution of the number of defectives in each case.
(b) (c)
Hint: Use (a) binomial (b) hypergeometric distributions. 16. Write down the mean and variance of a binomial distribution with parameters n and p. If the mean and variance are 4 and 8/3 respectively, find the values of n and p. State whether it is symmetric for these values? Hint: Binomial distribution is symmetric when p = 0.5.
17. Evaluate k if f(x) = k, when x = 1, 2, 3, 4, 5, 6 and = 0 elsewhere, is a probability mass function. Find its mean and standard deviation.

Hint: $\sum f(x) = 1$.

18. If a die is thrown 6 times, calculate the probability that:
(i) a score of 3 or less occurs on exactly 2 throws;
(ii) a score of more than 2 occurs on exactly 3 throws;
(iii) a score of 5 or less occurs at least once; (iv) a score of 2 or less occurs on at least 5 throws. Hint: Use binomial distribution with n = 6 and a different value of p in each case. 19. If we take 1,280 sets each of 10 tosses of a fair coins, in how many sets should we expect to get 7 heads and 3 tails? Hint: See example 9. 20. If a production unit is made up from 20 identical components and each component has a probability of 0.25 of being defective, what is the average number of defective components in a unit? Further, what is the probability that in a unit (i) less than 3 components are defective? (ii) exactly 3 components are defective? Hint: Take n = 20 and p = 0.25. 21. It is known from the past experience that 80% of the students in a school do their home work. Find the probability that during a random check of 10 students (i) (ii) all have done their home work, at the most 2 have not done their home work,
(iii) at least one has not done the home work. Hint: Take n = 10 and p = 0.8. 22. There are 24 battery cells in a box containing 6 defective cells that are randomly mixed. A customer buys 3 cells. What is the probability that he gets one defective cell? Hint: Use hypergeometric distribution. 23. There are 400 tyres in the stock of a wholesaler among which 40 tyres, having slight defects, are randomly mixed. A retailer purchases 6 tyres from this stock. What is the probability that he gets at least 4 non defective tyres? Hint: n is less than 5% of N.
Poisson Process
Let us assume that on an average 3 telephone calls are received per 10 minutes at a telephone exchange desk and we want to find the probability of receiving a telephone call in the next 10 minutes. In an effort to apply binomial distribution, we can divide the interval of 10 minutes into 10 intervals of 1 minute each so that the probability of receiving a telephone call (i.e., a success) in each minute (i.e., trial) becomes 3/10 ( note that p = m/n, where m denotes mean). Thus, there are 10 trials which are independent, each with probability of success = 3/10. However, the main difficulty with this formulation is that, strictly speaking, these trials are not Bernoulli trials. One essential requirement of such trials, that each trial must result into one of the two possible outcomes, is violated here. In the above example, a trial, i.e. an interval of one minute, may result into 0, 1, 2, ...... successes depending upon whether the exchange desk receives none, one, two, ...... telephone calls respectively. One possible way out is to divide the time interval of 10 minutes into a large number of small intervals so that the probability of receiving two or more telephone calls in an interval becomes almost zero. This is illustrated by the following table which shows that the probabilities of receiving two calls decreases sharply as the number of intervals are increased, keeping the average number of calls, 3 calls in 10 minutes in our example, as constant.
n                        :  10    100    1,000    10,000
P(one call is received)  :  0.3   0.03   0.003    0.0003
Using symbols, we may note that as n increases, p automatically declines in such a way that the mean m (= np) is always equal to a constant. Such a process is termed as a Poisson Process. The chief characteristics of Poisson process can be summarised as given below:

1. The number of occurrences in an interval is independent of the number of occurrences in another interval.
2. The expected number of occurrences in an interval is constant.
3. It is possible to identify a small interval so that the occurrence of more than one event, in any interval of this size, becomes extremely unlikely.
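The probability mass function of Poisson distribution can be obtained as the limit of the binomial probability function, with p = m/n, as n → ∞:

$$\begin{aligned}
P(r) &= \lim_{n \to \infty} {}^{n}C_{r}\, p^{r} q^{n-r} = \lim_{n \to \infty} \frac{n!}{r!\,(n-r)!}\left(\frac{m}{n}\right)^{r}\left(1 - \frac{m}{n}\right)^{n-r} \\
&= \frac{m^{r}}{r!} \lim_{n \to \infty}\left[\frac{n(n-1)\cdots(n-r+1)}{n^{r}}\left(1 - \frac{m}{n}\right)^{n-r}\right] \\
&= \frac{m^{r}}{r!} \lim_{n \to \infty}\left[\left(1 - \frac{1}{n}\right)\left(1 - \frac{2}{n}\right)\cdots\left(1 - \frac{r-1}{n}\right)\left(1 - \frac{m}{n}\right)^{-r}\left(1 - \frac{m}{n}\right)^{n}\right] \\
&= \frac{m^{r}}{r!} \lim_{n \to \infty}\left(1 - \frac{m}{n}\right)^{n}, \text{ since each of the remaining terms will tend to unity as } n \to \infty \\
&= \frac{m^{r} e^{-m}}{r!}, \text{ since } \lim_{n \to \infty}\left(1 - \frac{m}{n}\right)^{n} = \lim_{n \to \infty}\left[\left(1 - \frac{m}{n}\right)^{-n/m}\right]^{-m} = e^{-m}
\end{aligned}$$

Thus, the probability mass function of Poisson distribution is

$$P(r) = \frac{e^{-m} m^{r}}{r!}, \quad \text{where } r = 0, 1, 2, \ldots$$

Here e is a constant with value = 2.71828... . Note that Poisson distribution is a discrete probability distribution with single parameter m.

$$\text{Total probability} = \sum_{r=0}^{\infty} \frac{e^{-m} m^{r}}{r!} = e^{-m}\left(1 + \frac{m}{1!} + \frac{m^{2}}{2!} + \frac{m^{3}}{3!} + \ldots\right) = e^{-m} \cdot e^{m} = 1$$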
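The limiting argument can be illustrated numerically with the telephone example (m = 3 calls per 10 minutes). The sketch below, with illustrative function names, computes the binomial probability of exactly 2 calls for p = m/n and increasing n, and compares it with the Poisson value $e^{-m} m^{r}/r!$.

```python
from math import comb, exp, factorial

def binomial_pmf(r, n, p):
    return comb(n, r) * p**r * (1 - p)**(n - r)

def poisson_pmf(r, m):
    return exp(-m) * m**r / factorial(r)

m, r = 3, 2                                   # mean of 3 calls, probability of 2 calls
limit = poisson_pmf(r, m)
gaps = []
for n in (10, 100, 1000, 10000):              # number of sub-intervals (trials)
    value = binomial_pmf(r, n, m / n)         # p = m/n keeps the mean np fixed at m
    gaps.append(abs(value - limit))
    print(f"n = {n:6d}   binomial = {value:.5f}")
print(f"Poisson limit       = {limit:.5f}")
```

The gap between the binomial and the Poisson value shrinks steadily as the interval is divided more finely.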
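(a) Mean:

$$E(r) = \sum_{r=0}^{\infty} r \cdot \frac{e^{-m} m^{r}}{r!} = m\, e^{-m} \sum_{r=1}^{\infty} \frac{m^{r-1}}{(r-1)!} = m\, e^{-m} e^{m} = m$$

(b) Variance:

$$E(r^{2}) = \sum_{r=0}^{\infty} \left[r(r-1) + r\right] \frac{e^{-m} m^{r}}{r!} = e^{-m} \sum_{r=2}^{\infty} \frac{m^{r}}{(r-2)!} + m = m^{2} + m$$

∴ Variance $= E(r^{2}) - [E(r)]^{2} = m^{2} + m - m^{2} = m$. Thus, the mean and variance of a Poisson variate are equal.

(c) It can be shown that $\mu_{3} = m$ and $\mu_{4} = m + 3m^{2}$.

$$\therefore \beta_{1} = \frac{\mu_{3}^{2}}{\mu_{2}^{3}} = \frac{m^{2}}{m^{3}} = \frac{1}{m}$$

Since m is a positive quantity, therefore, $\beta_{1}$ is always positive and hence the Poisson distribution is always positively skewed. We note that $\beta_{1} \to 0$ as $m \to \infty$, therefore the distribution tends to become more and more symmetrical for large values of m. Further, $\beta_{2} = \dfrac{\mu_{4}}{\mu_{2}^{2}} = \dfrac{m + 3m^{2}}{m^{2}} = 3 + \dfrac{1}{m} \to 3$ as $m \to \infty$. This result shows that the distribution becomes normal for large values of m.

(d) Mode: As in binomial distribution, a Poisson variate r will be mode if

$$P(r-1) \le P(r) \ge P(r+1)$$

The inequality $P(r-1) \le P(r)$ can be written as

$$\frac{e^{-m} m^{r-1}}{(r-1)!} \le \frac{e^{-m} m^{r}}{r!} \;\Rightarrow\; 1 \le \frac{m}{r} \;\Rightarrow\; r \le m \qquad .... (1)$$

Similarly, the inequality $P(r) \ge P(r+1)$ can be shown to imply that

$$r \ge m - 1 \qquad .... (2)$$

Combining (1) and (2), we can write $m - 1 \le r \le m$.

Case I: When m is not an integer
The integral part of m will be mode.

Case II: When m is an integer
The distribution is bimodal with values m and m - 1.

Example 13: The average number of customer arrivals per minute at a super bazaar is 2. Find the probability that during one particular minute (i) exactly 3 customers will arrive, (ii) at the most two customers will arrive, (iii) at least one customer will arrive.

Solution: It is given that m = 2. Let the number of arrivals per minute be denoted by the random variable r. The required probability is given by

(i) $$P(r = 3) = \frac{e^{-2}\, 2^{3}}{3!} = 0.1804$$

(ii) $$P(r \le 2) = \sum_{r=0}^{2} \frac{e^{-2}\, 2^{r}}{r!} = e^{-2}\left[1 + 2 + \frac{4}{2}\right] = 0.6767$$

(iii) $$P(r \ge 1) = 1 - P(r = 0) = 1 - e^{-2} = 0.8647$$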
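The three probabilities of Example 13 can be computed directly from the Poisson formula. The following Python sketch uses two small helpers, `poisson_pmf` and `poisson_cdf`, whose names are assumptions made for illustration.

```python
from math import exp, factorial

def poisson_pmf(r, m):
    """P(exactly r occurrences) for a Poisson variate with mean m."""
    return exp(-m) * m**r / factorial(r)

def poisson_cdf(k, m):
    """P(at most k occurrences)."""
    return sum(poisson_pmf(r, m) for r in range(k + 1))

m = 2                                      # mean arrivals per minute (Example 13)
exactly_three = poisson_pmf(3, m)
at_most_two = poisson_cdf(2, m)
at_least_one = 1 - poisson_pmf(0, m)

print(f"(i)   {exactly_three:.4f}")
print(f"(ii)  {at_most_two:.4f}")
print(f"(iii) {at_least_one:.4f}")
```

The same helpers apply to the remaining Poisson examples, for instance `1 - poisson_cdf(3, 5)` for the probability of more than 3 calls when the mean is 5.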
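Example 14: An executive makes, on an average, 5 telephone calls per hour at a cost which may be taken as Rs 2 per call. Determine the probability that in any hour the telephone calls' cost (i) exceeds Rs 6, (ii) remains less than Rs 10.

Solution: The number of telephone calls per hour is a random variable with mean = 5. The required probability is given by

(i) The cost exceeds Rs 6 when more than 3 calls are made.

$$P(r > 3) = 1 - P(r \le 3) = 1 - \sum_{r=0}^{3} \frac{e^{-5}\, 5^{r}}{r!} \approx 0.7350$$

(ii) The cost remains less than Rs 10 when at the most 4 calls are made.

$$P(r \le 4) = \sum_{r=0}^{4} \frac{e^{-5}\, 5^{r}}{r!} \approx 0.4405$$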
Example 15: A company makes electric toys. The probability that an electric toy is defective is 0.01. What is the probability that a shipment of 300 toys will contain exactly 5 defectives?

Solution: Since n is large and p is small, Poisson distribution is applicable. The random variable is the number of defective toys with mean m = np = 300 × 0.01 = 3. The required probability is given by

$$P(r = 5) = \frac{e^{-3}\, 3^{5}}{5!} \approx 0.1008$$
Example 16: In a town, on an average 10 accidents occur in a span of 50 days. Assuming that the number of accidents per day follows Poisson distribution, find the probability that there will be three or more accidents in a day.

Solution: The random variable denotes the number of accidents per day. Thus, we have $m = \dfrac{10}{50} = 0.2$. The required probability is given by

$$P(r \ge 3) = 1 - P(r \le 2) = 1 - e^{-0.2}\left[1 + 0.2 + \frac{(0.2)^{2}}{2!}\right] \approx 0.0011$$
Example 17: A car hire firm has two cars which it hires out every day. The number of demands for a car on each day is distributed as a Poisson variate with mean 1.5. Calculate the proportion of days on which neither car is used and the proportion of days on which some demand is refused. [$e^{-1.5} = 0.2231$]

Solution: When both cars are not used, r = 0

∴ $P(r = 0) = e^{-1.5} = 0.2231$. Hence the proportion of days on which neither car is used is 22.31%.

Further, some demand is refused when more than 2 cars are demanded, i.e., r > 2

$$\therefore P(r > 2) = 1 - P(r \le 2) = 1 - \sum_{r=0}^{2} \frac{e^{-1.5}(1.5)^{r}}{r!} = 1 - 0.2231\left[1 + 1.5 + \frac{(1.5)^{2}}{2!}\right] = 0.1913$$

Hence the proportion of days is 19.13%.

Example 18: A firm produces articles of which 0.1 percent are usually defective. It packs them in cases each containing 500 articles. If a wholesaler purchases 100 such cases, how many cases are expected to be free of defective items and how many are expected to contain one defective item?

Solution: The Poisson variate is number of defective items with mean

$$m = \frac{0.1}{100} \times 500 = 0.5$$

$P(r = 0) = e^{-0.5} = 0.6065$. Hence the number of cases having no defective items = 0.6065 × 100 = 60.65.

Similarly, $P(r = 1) = e^{-0.5} \times 0.5 = 0.6065 \times 0.5 = 0.3033$. Hence the number of cases having one defective item is 30.33.

Example 19: A manager accepts the work submitted by his typist only when there is no mistake in the work. The typist has to type on an average 20 letters per day of about 200 words each. Find the chance of her making a mistake (i) if less than 1% of the letters submitted by her are rejected; (ii) if on 90% of days all the work submitted by her is accepted. [As the probability of making a mistake is small, you may use Poisson distribution. Take e = 2.72].

Solution: Let p be the probability of making a mistake in typing a word.

(i) Let the random variable r denote the number of mistakes per letter. Since 20 letters are typed, r will follow Poisson distribution with mean = 20p. Since less than 1% of the letters are rejected, it implies that the probability of making at least one mistake is less than 0.01, i.e.,

$P(r \ge 1) \le 0.01$ or $1 - P(r = 0) \le 0.01$

⇒ $1 - e^{-20p} \le 0.01$ or $e^{-20p} \ge 0.99$

Taking log of both sides

$-20p \cdot \log 2.72 \ge \log 0.99$

$-20 \times 0.4346\, p \ge \bar{1}.9956 = -0.0044$

$$\therefore p \le \frac{0.0044}{20 \times 0.4346} \approx 0.00051$$

(ii) In this case r is a Poisson variate which denotes the number of mistakes per day. Since the typist has to type 20 × 200 = 4000 words per day, the mean number of mistakes = 4000p. It is given that there is no mistake on 90% of the days, i.e.,

$P(r = 0) = 0.90$ or $e^{-4000p} = 0.90$

Taking log of both sides, we have

$-4000p \log 2.72 = \log 0.90$ or $-4000 \times 0.4346\, p = \bar{1}.9542 = -0.0458$

$$\therefore p = \frac{0.0458}{4000 \times 0.4346} \approx 0.000026$$
Example 20: A manufacturer of pins knows that on an average 5% of his product is defective. He sells pins in boxes of 100 and guarantees that not more than 4 pins will be defective. What is the probability that the box will meet the guaranteed quality?

Solution: The number of defective pins in a box is a Poisson variate with mean equal to 5. A box will meet the guaranteed quality if $r \le 4$. Thus, the required probability is given by

$$P(r \le 4) = e^{-5} \sum_{r=0}^{4} \frac{5^{r}}{r!} \approx 0.4405$$
(ii)
Example 22: The following mistakes per page were observed in a book:

No. of mistakes per page :  0    1   2   3
Frequency                :  211  90  19  5

Fit a Poisson distribution to find the theoretical frequencies.

Solution: The mean of the given frequency distribution is

$$m = \frac{0 \times 211 + 1 \times 90 + 2 \times 19 + 3 \times 5}{211 + 90 + 19 + 5} = \frac{143}{325} = 0.44$$

We can write $P(r) = \dfrac{e^{-0.44}(0.44)^{r}}{r!}$. Substituting r = 0, 1, 2 and 3, we get the probabilities for various values of r, as shown in the following table:

r       P(r)      N × P(r)    Expected Frequencies (approximated to the nearest integer)
0       0.6440    209.30      210
1       0.2834     92.10       92
2       0.0623     20.25       20
3       0.0091      2.96        3
Total                         325
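The fitting of Example 22 can be reproduced with a short Python sketch: the mean of the observed data is taken as m, and each Poisson probability is multiplied by the total frequency N. The variable names are illustrative; small differences in the second decimal place arise because the text rounds P(r) to four decimals before multiplying.

```python
from math import exp, factorial

# Observed data of Example 22: mistakes per page and number of pages
freq = {0: 211, 1: 90, 2: 19, 3: 5}
N = sum(freq.values())                                   # 325 pages
m = round(sum(r * f for r, f in freq.items()) / N, 2)    # 143/325 = 0.44

# Expected (theoretical) frequency for each r is N * P(r)
expected = {r: N * exp(-m) * m**r / factorial(r) for r in freq}
for r, e in expected.items():
    print(f"r = {r}  observed = {freq[r]:3d}  expected = {e:7.2f}")
```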
(ii)
(iii) The range of the random variable is $0 \le r < \infty$.
(iv) The Poisson distribution is a positively skewed distribution. The skewness decreases as m increases.
Hint: m = 2.
2 ( 200 = 4. 100
If r is a Poisson variate such that P(r) = P(r + 1), what are the mean and standard deviation of r? The number of arrivals of telephone calls at a switch board follows a Poisson process at an average rate of 8 calls per 10 minutes. The operator leaves for a 5 minutes tea break. Find the probability that (a) at the most two calls go unanswered and (b) 3 calls go unanswered, while the operator is away. What probability model is appropriate to describe a situation where 100 misprints are distributed randomly throughout the 100 pages of a book? For this model, what is the probability that a page observed at random will contain (i) no misprint, (ii) at the most two misprints, (iii) at least three misprints? If the probability of getting a defective transistor in a consignment is 0.01, find the mean and standard deviation of the number of defective transistors in a large consignment of 900 transistors. What is the probability that there is at the most one defective transistor in the consignment? In a certain factory turning out blades, there is a small chance 1/500 for any one blade to be defective. The blades are supplied in packets of 10. Use Poisson distribution to compute the approximate number of packets containing no defective, one defective, two defective, three defective blades respectively in a consignment of 10,000 packets. A manufacturer knows that 0.3% of items produced in his factory are defective. If the items are supplied in boxes, each containing 250 items, what is the probability that a box contains (i) no defective, (ii) at the most two defective items?
Hint: m = 4. 4.
Hint: The random variable is the number of defective blades in a packet of 10 blades. 7.
Hint: m = 8.
A random variable r follows Poisson distribution, where P(r = 2) = P(r = 3). Find (i) P(r = 0), (ii) $P(1 \le r \le 3)$.
9.
If X is a Poisson variate such that P(X = 2) = 9P(X = 4) + 90P(X = 6), find the mean and variance of X.
Hint: mean = Variance. 10. Lots of 400 wall-clocks are purchased by a retailer. The retailer inspects sample of 20 clocks from each lot and returns the lot to the supplier if there are more than two defectives in the sample. Suppose a lot containing 30 defective clocks is received by the retailer, what is the probability that it will be returned to the supplier? Hint: n = 20 and p = 30/400. 11. An industrial area has power breakdown once in 15 days, on the average. Assuming that the number of breakdowns follow a Poisson process, what is the probability of (i) no power breakdown in the next six days, (ii) more than one power breakdown in the next six days?
Hint: The random variable is the number of power breakdowns in six days. 12. After correcting the proofs of first 50 pages or so of a book, it is found that on the average there are 3 errors per 5 pages. Use Poisson probabilities and estimate the number of pages with 0, 1, 2, 3, errors in the whole book of 1,000 pages. [Given that e- 0.6 = 0.5488]. Hint: Take random variable as the number of errors per page. 13. Between 2 and 4 p.m., the number of phone calls coming into the switch board of a company is 300. Find the probability that during one particular minute there will be (i) no phone call at all, (ii) exactly 3 calls, (iii) at least 7 calls. [Given e- 2 = 0.13534 and e-0.5 = 0.60650]. Hint: Random variable is the number of calls per minute. 14. It is known that 0.5% of ball pen refills produced by a factory are defective. These refills are dispatched in packagings of equal numbers. Using Poisson distribution determine the number of refills in a packing to be sure that at least 95% of them contain no defective refills. Hint: Let n be the number of refills in a package, then m = 0.005n. 15. Records show that the probability is 0.00002 that a car will have a flat tyre while driving over a certain bridge. Find the probability that out of 20,000 cars driven over the bridge, not more than one will have a flat tyre. Hint: The random variable is number of cars driven over the bridge having flat tyre. 16. A radioactive source emits on the average 2.5 particles per second. Calculate the probability that two or more particles will be emitted in an interval of 4 seconds. Hint: m = 2.5 ( 4. 17. The number of accidents in a year attributed to taxi drivers in a city follows Poisson distribution with mean 3. Out of 1,000 taxi drivers, find approximately the number of drivers with (i) no accident in a year, (ii) more than 3 accidents in a year. [Given e-1 = 0.3679, e- 2 = 0.1353, e- 3 = 0.0498]. Hint: Number of drivers = probability ! 1000. 18. 
A big industrial plant has to be shut down for repairs on an average of 3 times in a month. When more than 5 shut downs occur for repairs in a month, the production schedule cannot be attained. Find the probability that production schedule cannot be attained in a given month, assuming that the number of shut downs are a Poisson variate.
19. A manager receives an average of 12 telephone calls per 8-hour day. Assuming that the number of telephone calls received by him follows a Poisson variate, what is the probability that he will not be interrupted by a call during a meeting lasting 2 hours?
Hint: Take m = 3.

20. Assuming that the probability of a fatal accident in a factory during a year is 1/1200, calculate the probability that in a factory employing 300 workers, there will be at least two fatal accidents in a year. [Given $e^{-0.25} = 0.7788$].
Hint: The average number of accidents per year in the factory = 0.25.

21. If 2% of electric bulbs manufactured by a certain company are defective, find the probability that in a sample of 200 bulbs (i) less than 2 bulbs are defective, (ii) more than 3 bulbs are defective. [Given $e^{-4} = 0.0183$].
Hint: m = 4.

22. If for a Poisson variate X, P(X = 1) = P(X = 2), find P(X = 1 or 2). Also find its mean and standard deviation.
Hint: Find m from the given condition.

23. If 5% of the families in Calcutta do not use gas as a fuel, what will be the probability of selecting 10 families in a random sample of 100 families who do not use gas as a fuel? You may assume Poisson distribution. [Given $e^{-5} = 0.0067$].
Hint: m = 5, find P(r = 10).

24. The probability that a Poisson variate X takes a positive value is $1 - e^{-1.5}$. Find the variance and also the probability that X lies between -1.5 and 1.5.
Hint: $1 - e^{-1.5} = P(r > 0)$. Find P(-1.5 < X < 1.5) = P(X = 0) + P(X = 1).

25. 250 passengers have made reservations for a flight from Delhi to Mumbai. If the probability that a passenger, who has reservation, will not turn up is 0.016, find the probability that at the most 3 passengers will not turn up.
Hint: The number of passengers who do not turn up is a Poisson variate.
Let $P(\bar{A})$ be the probability that no event occurs in the time interval t. Since the mean number of occurrence of events in time t is mt, we have, by Poisson distribution,

$$P(\bar{A}) = P(r = 0) = \frac{e^{-mt}(mt)^{0}}{0!} = e^{-mt}$$

Thus, we get $F(t) + e^{-mt} = 1$

or $P(0 \text{ to } t) = F(t) = 1 - e^{-mt}$ .... (1)

To get the probability density function, we differentiate equation (1) with respect to t. Thus,

$$f(t) = F'(t) = m\, e^{-mt} \text{ when } t > 0, \quad = 0 \text{ otherwise.}$$

It can be verified that the total probability is unity:

$$\int_{0}^{\infty} m\, e^{-mt}\, dt = m \left[\frac{e^{-mt}}{-m}\right]_{0}^{\infty} = \left[-e^{-mt}\right]_{0}^{\infty} = 0 + 1 = 1$$

The mean of t is

$$E(t) = \int_{0}^{\infty} t \cdot m \cdot e^{-mt}\, dt = \frac{1}{m},$$

where m denotes the average number of occurrence of events per unit of time or distance.
Example 23: A telephone operator attends on an average 150 telephone calls per hour. Assuming that the distribution of time between consecutive calls follows an exponential distribution, find the probability that (i) the time between two consecutive calls is less than 2 minutes, (ii) the next call will be received only after 3 minutes.
Solution: Here m = the average number of calls per minute $= \frac{150}{60} = 2.5$.
(i) $P(t \le 2) = \int_0^2 2.5\,e^{-2.5t}\,dt = F(2)$
We know that $F(t) = 1 - e^{-mt}$, $\therefore F(2) = 1 - e^{-2.5 \times 2} = 0.9933$
(ii) $P(t > 3) = 1 - P(t \le 3) = 1 - F(3) = 1 - [1 - e^{-2.5 \times 3}] = 0.0006$

Example 24: The average number of accidents in an industry during a year is estimated to be 5. If the distribution of time between two consecutive accidents is known to be exponential, find the probability that there will be no accidents during the next two months.
Solution: Here m denotes the average number of accidents per month $= \frac{5}{12}$.
$\therefore P(t > 2) = 1 - F(2) = e^{-\frac{5}{12} \times 2} = e^{-0.833} = 0.4347.$

Example 25: The distribution of life, in hours, of a bulb is known to be exponential with mean life of 600 hours. What is the probability that (i) it will not last more than 500 hours, (ii) it will last more than 700 hours?
Solution: Since the random variable denotes hours, therefore $m = \frac{1}{600}$.
(i) $P(t \le 500) = F(500) = 1 - e^{-\frac{1}{600} \times 500} = 1 - e^{-0.833} = 0.5653.$
(ii) $P(t > 700) = 1 - F(700) = e^{-\frac{700}{600}} = e^{-1.1667} = 0.3114.$
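The three exponential examples above all reduce to evaluating $F(t) = 1 - e^{-mt}$. The following is a minimal Python sketch of that calculation; the function name `exp_cdf` and the variable names are illustrative choices, not part of the text.

```python
import math


def exp_cdf(t, m):
    """Return F(t) = 1 - e^(-m*t): probability that the gap between events is at most t."""
    return 1.0 - math.exp(-m * t)


# Example 23: 150 calls per hour, i.e. m = 2.5 calls per minute.
m_calls = 150 / 60
p_gap_under_2 = exp_cdf(2, m_calls)          # P(t <= 2)
p_gap_over_3 = 1.0 - exp_cdf(3, m_calls)     # P(t > 3)

# Example 24: 5 accidents per year, i.e. m = 5/12 accidents per month.
p_no_accident = 1.0 - exp_cdf(2, 5 / 12)     # P(t > 2 months)

# Example 25: mean life 600 hours, i.e. m = 1/600 per hour.
p_fail_by_500 = exp_cdf(500, 1 / 600)        # P(t <= 500)
p_last_over_700 = 1.0 - exp_cdf(700, 1 / 600)  # P(t > 700)

for label, value in [
    ("Ex 23 (i)", p_gap_under_2),
    ("Ex 23 (ii)", p_gap_over_3),
    ("Ex 24", p_no_accident),
    ("Ex 25 (i)", p_fail_by_500),
    ("Ex 25 (ii)", p_last_over_700),
]:
    print(f"{label}: {value:.4f}")
```

Small differences in the fourth decimal place relative to the worked answers can arise because the text rounds the exponent (for instance 0.833) before evaluating it.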
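UNIFORM DISTRIBUTION (CONTINUOUS VARIABLE)

A continuous random variable X is said to be uniformly distributed in a closed interval $(\alpha, \beta)$ with probability density function p(X) if

$p(X) = \frac{1}{\beta - \alpha}$ for $\alpha \le X \le \beta$
and $= 0$ otherwise.

The uniform distribution is alternatively known as rectangular distribution. The diagram of the probability density function is shown in Figure 11.1.

Figure 11.1 (axes: X, p(X))

Note that the total area under the curve is unity, i.e.,

$\int_\alpha^\beta \frac{1}{\beta - \alpha}\,dX = \frac{1}{\beta - \alpha}\Big[X\Big]_\alpha^\beta = 1$

Further, $E(X) = \frac{1}{\beta - \alpha}\int_\alpha^\beta X\,dX = \frac{1}{\beta - \alpha}\left[\frac{X^2}{2}\right]_\alpha^\beta = \frac{\alpha + \beta}{2}$

$E(X^2) = \frac{1}{\beta - \alpha}\int_\alpha^\beta X^2\,dX = \frac{\beta^3 - \alpha^3}{3(\beta - \alpha)} = \frac{1}{3}\left(\beta^2 + \alpha\beta + \alpha^2\right)$

$\therefore Var(X) = \frac{1}{3}\left(\beta^2 + \alpha\beta + \alpha^2\right) - \frac{(\alpha + \beta)^2}{4} = \frac{(\beta - \alpha)^2}{12}$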
Example 26: The buses on a certain route run after every 20 minutes. If a person arrives at the bus stop at random, what is the probability that
(a) he has to wait between 5 to 15 minutes,
(b) he gets a bus within 10 minutes,
(c) he has to wait at least 15 minutes.
Solution: Let the random variable X denote the waiting time, which follows a uniform distribution with p.d.f.
$f(X) = \frac{1}{20}$ for $0 \le X \le 20$
(a) $P(5 \le X \le 15) = \frac{1}{20}\int_5^{15} dX = \frac{1}{20}(15 - 5) = \frac{1}{2}$
(b) $P(0 \le X \le 10) = \frac{1}{20} \times 10 = \frac{1}{2}$
(c) $P(15 \le X \le 20) = \frac{20 - 15}{20} = \frac{1}{4}$.
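The bus-stop example can be checked with a short Python sketch. It assumes the waiting time is uniform on [0, 20]; the helper names `uniform_prob`, `uniform_mean` and `uniform_var` are hypothetical and simply encode the formulas for the rectangular distribution.

```python
def uniform_prob(lo, hi, a=0.0, b=20.0):
    """P(lo <= X <= hi) for X uniform on [a, b]; the interval is clipped to [a, b]."""
    lo, hi = max(lo, a), min(hi, b)
    return max(hi - lo, 0.0) / (b - a)


def uniform_mean(a, b):
    """E(X) = (a + b) / 2 for the uniform distribution on [a, b]."""
    return (a + b) / 2


def uniform_var(a, b):
    """Var(X) = (b - a)^2 / 12 for the uniform distribution on [a, b]."""
    return (b - a) ** 2 / 12


p_a = uniform_prob(5, 15)    # wait between 5 and 15 minutes
p_b = uniform_prob(0, 10)    # bus arrives within 10 minutes
p_c = uniform_prob(15, 20)   # wait at least 15 minutes

print(p_a, p_b, p_c)
print(uniform_mean(0, 20), uniform_var(0, 20))
```

Because the density is constant, each probability is just the length of the interval divided by the length of the support, which is why no integration routine is needed.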
Since Gauss used this curve to describe the theory of accidental errors of measurements involved in the calculation of orbits of heavenly bodies, it is also called the Gaussian curve.
(ii)
(iii) Condition of independence: The factors affecting observations must act independently of each other.
(iv) Condition of symmetry: Various factors operate in such a way that the deviations of observations above and below mean are balanced with regard to their magnitude as well as their number.

Random variables observed in many phenomena related to economics, business and other social as well as physical sciences are often found to be distributed normally. For example, observations relating to the life of an electrical component, weight of packages, height of persons, income of the inhabitants of a certain area, diameter of wire, etc., are affected by a large number of factors and hence tend to follow a pattern that is very similar to the normal curve. In addition to this, when the number of observations becomes large, a number of probability distributions like Binomial, Poisson, etc., can also be approximated by this distribution.
$p(X) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{X - \mu}{\sigma}\right)^2}$, where $-\infty < X < \infty$.

Here $\pi$ and e are absolute constants with values 3.14159.... and 2.71828.... respectively. It may be noted here that this distribution is completely known if the values of mean $\mu$ and standard deviation $\sigma$ are known. Thus, the distribution has two parameters, viz. mean and standard deviation.
Figure 11.2
It should be noted here that although we seldom encounter variables that have a range from $-\infty$ to $\infty$, as shown by the normal curve, nevertheless the curves generated by the relative frequency histograms of various variables closely resemble the shape of the normal curve.
$\int_{-\infty}^{\infty} \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{X - \mu}{\sigma}\right)^2} dX = 1.$

5. Since median $= \mu$, the ordinate at $X = \mu$ divides the area under the normal curve into two equal parts, i.e.,
$\int_{-\infty}^{\mu} p(X)\,dX = \int_{\mu}^{\infty} p(X)\,dX = 0.5$
6. The value of p(X) is always non-negative for all values of X, i.e., the whole curve lies above X axis.
7. The points of inflexion (the points at which curvature changes) of the curve are at $X = \mu \pm \sigma$.
8. The quartiles are equidistant from median, i.e., $M_d - Q_1 = Q_3 - M_d$, by virtue of symmetry. Also $Q_1 = \mu - 0.6745\sigma$, $Q_3 = \mu + 0.6745\sigma$, quartile deviation $= 0.6745\sigma$ and mean deviation $= 0.8\sigma$, approximately.
9. Since the distribution is symmetrical, all odd ordered central moments are zero.
10. The successive even ordered central moments are related according to the following recurrence formula
$\mu_{2n} = (2n - 1)\,\sigma^2\,\mu_{2n-2}$ for $n = 1, 2, 3, ......$
11.
$\beta_2 = \frac{\mu_4}{\mu_2^2} = \frac{3\sigma^4}{\sigma^4} = 3.$
Note that the above expression makes use of property 10.
13. Additive or reproductive property: If $X_1, X_2, ...... X_n$ are n independent normal variates with means $\mu_1, \mu_2, ...... \mu_n$ and variances $\sigma_1^2, \sigma_2^2, ...... \sigma_n^2$, respectively, then their linear combination $a_1X_1 + a_2X_2 + ...... + a_nX_n$ is also a normal variate with mean $\sum_{i=1}^{n} a_i\mu_i$ and variance $\sum_{i=1}^{n} a_i^2\sigma_i^2$. In particular, $\sum X_i$ is a normal variate with mean $\sum \mu_i$ and variance $\sum \sigma_i^2$.
14. Area property: The area under the normal curve is distributed by its standard deviation in the following manner :
Figure 11.3
(i) The area between the ordinates at $\mu - \sigma$ and $\mu + \sigma$ is 0.6826. This implies that for a normal distribution about 68% of the observations will lie between $\mu - \sigma$ and $\mu + \sigma$.
(ii) The area between the ordinates at $\mu - 2\sigma$ and $\mu + 2\sigma$ is 0.9544. This implies that for a normal distribution about 95% of the observations will lie between $\mu - 2\sigma$ and $\mu + 2\sigma$.
(iii) The area between the ordinates at $\mu - 3\sigma$ and $\mu + 3\sigma$ is 0.9974. This implies that for a normal distribution about 99.7% of the observations will lie between $\mu - 3\sigma$ and $\mu + 3\sigma$. This result shows that, practically, the range of the distribution is $6\sigma$ although, theoretically, the range is from $-\infty$ to $\infty$.
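The area property can be verified numerically. The sketch below is only an illustration: it uses the error function from Python's standard library to evaluate the standard normal distribution function, and the names `std_normal_cdf` and `area_within` are assumptions made for this example.

```python
import math


def std_normal_cdf(z):
    """Area under the standard normal curve from -infinity to z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def area_within(k):
    """Area between the ordinates at (mean - k*sd) and (mean + k*sd)."""
    return std_normal_cdf(k) - std_normal_cdf(-k)


for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {area_within(k):.4f}")
```

The computed areas agree with the tabulated figures 0.6826, 0.9544 and 0.9974 up to the rounding of the four-figure table, since the table values are obtained by doubling rounded half-areas.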
$P(X_1 \le X \le X_2) = \int_{X_1}^{X_2} \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{X - \mu}{\sigma}\right)^2} dX$
In terms of figure, this probability is equal to the area under the normal curve between the ordinates at X = X 1 and X = X 2 respectively.
Figure 11.4
Note: It may be recalled that the probability that a continuous random variable takes a particular value is defined to be zero even though the event is not impossible.

It is obvious from the above that, to find $P(X_1 \le X \le X_2)$, we have to evaluate an integral, which might be a cumbersome and time-consuming task. Fortunately, an alternative procedure is available for performing this task. To devise this procedure, we define a new variable $z = \frac{X - \mu}{\sigma}$.

We note that $E(z) = E\left(\frac{X - \mu}{\sigma}\right) = \frac{1}{\sigma}\left[E(X) - \mu\right] = 0$

and $Var(z) = Var\left(\frac{X - \mu}{\sigma}\right) = \frac{1}{\sigma^2} Var(X - \mu) = \frac{1}{\sigma^2} Var(X) = 1.$
Further, from the reproductive property, it follows that the distribution of z is also normal.
Thus, we conclude that if X is a normal variate with mean $\mu$ and standard deviation $\sigma$, then $z = \frac{X - \mu}{\sigma}$ is a normal variate with mean zero and standard deviation unity. Since the parameters of the distribution of z are fixed, it is a known distribution and is termed as standard normal distribution (s.n.d.). Further, z is termed as a standard normal variate (s.n.v.).

It is obvious from the above that the distribution of any normal variate X can always be transformed into the distribution of standard normal variate z. This fact can be utilised to evaluate the integral given above. We can write

$P(X_1 \le X \le X_2) = P(z_1 \le z \le z_2)$, where $z_1 = \frac{X_1 - \mu}{\sigma}$ and $z_2 = \frac{X_2 - \mu}{\sigma}$.
In terms of figure, this probability is equal to the area under the standard normal curve between the ordinates at $z = z_1$ and $z = z_2$.

Figure 11.5

Since the distribution of z is fixed, the probabilities of z lying in various intervals are tabulated. These tables can be used to write down the desired probability.

Example 27: Using the table of areas under the standard normal curve, find the following probabilities:
(i) $P(0 \le z \le 1.3)$ (ii) $P(-1 \le z \le 0)$ (iii) $P(-1 \le z \le 2)$
(iv) $P(z \ge 1.54)$ (v) $P(|z| > 2)$ (vi) $P(|z| < 2)$
Solution: The required probability, in each question, is indicated by the shaded area of the corresponding figure.
(i) From the table, we can write $P(0 \le z \le 1.3) = 0.4032$.
(ii) We can write $P(-1 \le z \le 0) = P(0 \le z \le 1)$, because the distribution is symmetrical. From the table, we can write $P(-1 \le z \le 0) = P(0 \le z \le 1) = 0.3413$.
(iii) We can write $P(-1 \le z \le 2) = P(-1 \le z \le 0) + P(0 \le z \le 2) = P(0 \le z \le 1) + P(0 \le z \le 2) = 0.3413 + 0.4772 = 0.8185$.
(iv) We can write $P(z \ge 1.54) = 0.5000 - P(0 \le z \le 1.54) = 0.5000 - 0.4382 = 0.0618$.
(v) $P(|z| > 2) = P(z > 2) + P(z < -2) = 2P(z > 2) = 2[0.5000 - P(0 \le z \le 2)] = 1 - 2P(0 \le z \le 2) = 1 - 2 \times 0.4772 = 0.0456$.
(vi) $P(|z| < 2) = P(-2 \le z \le 0) + P(0 \le z \le 2) = 2P(0 \le z \le 2) = 2 \times 0.4772 = 0.9544$.

Example 28: Determine the value or values of z in each of the following situations:
(a) Area between 0 and z is 0.4495.
(b) Area between $-\infty$ to z is 0.1401.
(c) Area between $-\infty$ to z is 0.6103.
(d) Area between -1.65 and z is 0.0173.
(e) Area between -0.5 and z is 0.5376.
Solution:
(a) On locating the value of z corresponding to an entry of area 0.4495 in the table of areas under the normal curve, we have z = 1.64. We note that the same situation may correspond to a negative value of z. Thus, z can be 1.64 or -1.64.
(b) Since the area between $-\infty$ to z < 0.5, z will be negative. Further, the area between z and 0 = 0.5000 - 0.1401 = 0.3599. On locating the value of z corresponding to this entry in the table, we get z = -1.08.
(c) Since the area between $-\infty$ to z > 0.5000, z will be positive. Further, the area between 0 to z = 0.6103 - 0.5000 = 0.1103. On locating the value of z corresponding to this entry in the table, we get z = 0.28.
(d) Since the area between -1.65 and z < the area between -1.65 and 0 (which, from table, is 0.4505), z is negative. Further, z can be to the right or to the left of the value -1.65. Thus, when z lies to the right of -1.65, its value, corresponding to an area (0.4505 - 0.0173) = 0.4332, is given by z = -1.5 (from table). Further, when z lies to the left of -1.65, its value, corresponding to an area (0.4505 + 0.0173) = 0.4678, is given by z = -1.85 (from table).
(e) Since the area between -0.5 to z > area between -0.5 to 0 (which, from table, is 0.1915), z is positive. The value of z, located corresponding to an area (0.5376 - 0.1915) = 0.3461, is given by 1.02.
Example 29: If X is a random variate which is distributed normally with mean 60 and standard deviation 5, find the probabilities of the following events:
(i) $60 \le X \le 70$, (ii) $50 \le X \le 65$, (iii) $X > 45$, (iv) $X \le 50$.
Solution: It is given that $\mu = 60$ and $\sigma = 5$.
(i) Given $X_1 = 60$ and $X_2 = 70$, we can write
$z_1 = \frac{X_1 - \mu}{\sigma} = \frac{60 - 60}{5} = 0$ and $z_2 = \frac{X_2 - \mu}{\sigma} = \frac{70 - 60}{5} = 2.$
$\therefore P(60 \le X \le 70) = P(0 \le z \le 2) = 0.4772$
(ii) $z_1 = \frac{50 - 60}{5} = -2$ and $z_2 = \frac{65 - 60}{5} = 1.$
$\therefore P(50 \le X \le 65) = P(-2 \le z \le 0) + P(0 \le z \le 1) = 0.4772 + 0.3413 = 0.8185$
(iii) $P(X > 45) = P\left(z \ge \frac{45 - 60}{5}\right) = P(z \ge -3)$
$= P(-3 \le z \le 0) + P(0 \le z \le \infty) = P(0 \le z \le 3) + P(0 \le z \le \infty)$
$= 0.4987 + 0.5000 = 0.9987$
(iv) $P(X \le 50) = P\left(z \le \frac{50 - 60}{5}\right) = P(z \le -2)$
$= 0.5000 - P(-2 \le z \le 0) = 0.5000 - P(0 \le z \le 2)$
$= 0.5000 - 0.4772 = 0.0228$
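The table look-ups used for a normal variate with mean 60 and standard deviation 5 can be reproduced in code. The following Python sketch is an illustration only: it replaces the printed table with the error function, and the helper names `normal_cdf` and `prob_between` are hypothetical.

```python
import math


def normal_cdf(x, mu, sigma):
    """P(X <= x) for a normal variate, via the standardised value z = (x - mu) / sigma."""
    z = (x - mu) / sigma
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def prob_between(x1, x2, mu, sigma):
    """P(x1 <= X <= x2) as a difference of two cumulative areas."""
    return normal_cdf(x2, mu, sigma) - normal_cdf(x1, mu, sigma)


mu, sigma = 60, 5
p_i = prob_between(60, 70, mu, sigma)     # P(60 <= X <= 70)
p_ii = prob_between(50, 65, mu, sigma)    # P(50 <= X <= 65)
p_iii = 1.0 - normal_cdf(45, mu, sigma)   # P(X > 45)
p_iv = normal_cdf(50, mu, sigma)          # P(X <= 50)

for label, p in [("(i)", p_i), ("(ii)", p_ii), ("(iii)", p_iii), ("(iv)", p_iv)]:
    print(f"{label} {p:.4f}")
```

The results match the table-based answers to within the rounding of a four-figure table.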
Example 30: The average monthly sales of 5,000 firms are normally distributed with mean Rs 36,000 and standard deviation Rs 10,000. Find:
(i) The number of firms with sales of over Rs 40,000.
(ii) The percentage of firms with sales between Rs 38,500 and Rs 41,000.
(iii) The number of firms with sales between Rs 30,000 and Rs 40,000.
Solution: Let X be the normal variate which represents the monthly sales of a firm. Thus X ~ N(36,000, 10,000).
(i) $P(X > 40000) = P\left(z > \frac{40000 - 36000}{10000}\right) = P(z > 0.4)$
$= 0.5000 - P(0 \le z \le 0.4) = 0.5000 - 0.1554 = 0.3446.$
Thus, the number of firms having sales over Rs 40,000 $= 0.3446 \times 5000 = 1723$
(ii) $P(38500 \le X \le 41000) = P\left(\frac{38500 - 36000}{10000} \le z \le \frac{41000 - 36000}{10000}\right)$
$= P(0.25 \le z \le 0.5) = P(0 \le z \le 0.5) - P(0 \le z \le 0.25) = 0.1915 - 0.0987 = 0.0928.$
Thus, the required percentage of firms $= 0.0928 \times 100 = 9.28\%$.
(iii) $P(30000 \le X \le 40000) = P\left(\frac{30000 - 36000}{10000} \le z \le \frac{40000 - 36000}{10000}\right)$
$= P(-0.6 \le z \le 0.4) = P(0 \le z \le 0.6) + P(0 \le z \le 0.4) = 0.2258 + 0.1554 = 0.3812.$
Thus, the required number of firms $= 0.3812 \times 5000 = 1906$
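Example 31: In a large institution, 2.28% of employees have income below Rs 4,500 and 15.87% of employees have income above Rs 7,500 per month. Assuming the distribution of income to be normal, find its mean and standard deviation.
Solution: Let the mean and standard deviation of the given distribution be $\mu$ and $\sigma$ respectively.
Since 2.28% of the area lies below X = 4,500, the area between the corresponding z and 0 is 0.5000 - 0.0228 = 0.4772, so that z = -2. Thus,
$\frac{4500 - \mu}{\sigma} = -2$ or $4500 - \mu = -2\sigma$    .... (1)
Similarly, since 15.87% of the area lies above X = 7,500, the area between 0 and the corresponding z is 0.5000 - 0.1587 = 0.3413, so that z = 1. Thus,
$\frac{7500 - \mu}{\sigma} = 1$ or $7500 - \mu = \sigma$    .... (2)
Solving (1) and (2) simultaneously, we get $\mu$ = Rs 6,500 and $\sigma$ = Rs 1,000.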
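Finding the mean and standard deviation from two tail percentages amounts to solving two linear equations in the two unknowns. The sketch below illustrates this with Python's standard `statistics.NormalDist`, whose `inv_cdf` method plays the role of the reverse table look-up; the function name `fit_from_two_tails` is a hypothetical helper, not something defined in the text.

```python
from statistics import NormalDist


def fit_from_two_tails(x_low, p_below, x_high, p_above):
    """Solve (x_low - mu)/sigma = z_low and (x_high - mu)/sigma = z_high for mu and sigma."""
    std = NormalDist()                      # standard normal: mean 0, sd 1
    z_low = std.inv_cdf(p_below)            # z with area p_below to its left
    z_high = std.inv_cdf(1.0 - p_above)     # z with area p_above to its right
    sigma = (x_high - x_low) / (z_high - z_low)
    mu = x_low - z_low * sigma
    return mu, sigma


# 2.28% of incomes below Rs 4,500 and 15.87% above Rs 7,500.
mu, sigma = fit_from_two_tails(4500, 0.0228, 7500, 0.1587)
print(f"mean = {mu:.1f}, standard deviation = {sigma:.1f}")
```

The computed values differ very slightly from Rs 6,500 and Rs 1,000 because the percentages 2.28 and 15.87 are themselves rounded versions of the areas at z = -2 and z = 1.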
Example 32: Marks in an examination are approximately normally distributed with mean 75 and standard deviation 5. If the top 5% of the students get grade A and the bottom 25% get grade F, what mark is the lowest A and what mark is the highest F? Solution: Let A be the lowest mark in grade A and F be the highest mark in grade F. From the given information, we can write
$P(X \ge A) = 0.05$ or $P\left(z \ge \frac{A - 75}{5}\right) = 0.05$
On locating the value of z corresponding to an area 0.4500 (0.5000 - 0.0500), we can write
$\frac{A - 75}{5} = 1.645 \Rightarrow A = 83.225$
Similarly, $P(X \le F) = 0.25$ or $P\left(z \le \frac{F - 75}{5}\right) = 0.25$
On locating the value of z corresponding to an area 0.2500 (0.5000 - 0.2500), we can write
$\frac{F - 75}{5} = -0.675 \Rightarrow F = 71.625$
Example 33: The mean inside diameter of a sample of 200 washers produced by a machine is 5.02 mm and the standard deviation is 0.05 mm. The purpose for which these washers are intended allows a maximum tolerance in the diameter of 4.96 to 5.08 mm, otherwise the washers are considered as defective. Determine the percentage of defective washers produced by the machine on the assumption that diameters are normally distributed.
Solution: Let X denote the diameter of the washer. Thus, X ~ N(5.02, 0.05).
The probability that a washer is defective $= 1 - P(4.96 \le X \le 5.08)$
$= 1 - P\left[\left(\frac{4.96 - 5.02}{0.05}\right) \le z \le \left(\frac{5.08 - 5.02}{0.05}\right)\right]$
$= 1 - P(-1.2 \le z \le 1.2) = 1 - 2P(0 \le z \le 1.2) = 1 - 2 \times 0.3849 = 0.2302$
Thus, the percentage of defective washers = 23.02.
$P(457.50 \le X \le 645.00) = P\left(\frac{457.50 - 532.50}{75} \le z \le \frac{645.00 - 532.50}{75}\right)$
$= P(-1 \le z \le 1.5) = P(0 \le z \le 1) + P(0 \le z \le 1.5) = 0.34134 + 0.43319 = 0.77453$
Thus, the percentage of days = 77.453
(ii)
Example 35: The distribution of 1,000 examinees according to marks percentage is given below:

% Marks           No. of examinees
less than 40            430
40 - 75                 420
75 or more              150
Total                  1000

Assuming the marks percentage to follow a normal distribution, calculate the mean and standard deviation of marks. If not more than 300 examinees are to fail, what should be the passing marks?
Solution: Let X denote the percentage of marks and its mean and S.D. be $\mu$ and $\sigma$ respectively. From the given table, we can write
$P(X < 40) = 0.43$ and $P(X \ge 75) = 0.15$, which can also be written as
$\frac{40 - \mu}{\sigma} = -0.175$ or $40 - \mu = -0.175\sigma$
and
$\frac{75 - \mu}{\sigma} = 1.04$ or $75 - \mu = 1.04\sigma$
Solving the above equations simultaneously, we get $\mu = 45.04$ and $\sigma = 28.81$. Let $X_1$ be the percentage of marks required to pass the examination.
Example 36: In a certain book, the frequency distribution of the number of words per page may be taken as approximately normal with mean 800 and standard deviation 50. If three pages are chosen at random, what is the probability that none of them has between 830 and 845 words each?
Solution: Let X be a normal variate which denotes the number of words per page. It is given that X ~ N(800, 50).
The probability that a page, selected at random, does not have number of words between 830 and 845, is given by
$1 - P(830 < X < 845) = 1 - P\left(\frac{830 - 800}{50} < z < \frac{845 - 800}{50}\right)$
$= 1 - P(0.6 < z < 0.9) = 1 - P(0 < z < 0.9) + P(0 < z < 0.6)$
$= 1 - 0.3159 + 0.2257 = 0.9098 \approx 0.91$
Thus, the probability that none of the three pages, selected at random, has number of words lying between 830 and 845 $= (0.91)^3 = 0.7536$.

Example 37: At a petrol station, the mean quantity of petrol sold to a vehicle is 20 litres per day with a standard deviation of 10 litres. If on a particular day, 100 vehicles took 25 or more litres of petrol, estimate the total number of vehicles who took petrol from the station on that day. Assume that the quantity of petrol taken from the station by a vehicle is a normal variate.
Solution: Let X denote the quantity of petrol taken by a vehicle. It is given that X ~ N(20, 10).
$\therefore P(X \ge 25) = P\left(z \ge \frac{25 - 20}{10}\right) = P(z \ge 0.5)$
$= 0.5000 - P(0 \le z \le 0.5) = 0.5000 - 0.1915 = 0.3085.$
Let N be the total number of vehicles taking petrol on that day. Then $N \times 0.3085 = 100$, which gives $N = \frac{100}{0.3085} \approx 324$.
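$z = \frac{X - np}{\sqrt{npq}} \sim N(0, 1)$.

It may be noted here that as X varies from 0 to n, the standard normal variate z would vary from $-\infty$ to $\infty$ because, when X = 0,

$\lim_{n \to \infty}\left(\frac{-np}{\sqrt{npq}}\right) = \lim_{n \to \infty}\left(-\sqrt{\frac{np}{q}}\right) = -\infty$

and, when X = n,

$\lim_{n \to \infty}\left(\frac{n - np}{\sqrt{npq}}\right) = \lim_{n \to \infty}\left(\sqrt{\frac{nq}{p}}\right) = \infty$

Since the number of successes is a discrete variable, to use normal approximation, we have to make corrections for continuity. For example, $P(X_1 \le X \le X_2)$ is to be corrected as $P\left(X_1 - \frac{1}{2} \le X \le X_2 + \frac{1}{2}\right)$, while using normal approximation to binomial, since the gap between successive values of a binomial variate is unity. Similarly, $P(X_1 < X < X_2)$ is to be corrected as $P\left(X_1 + \frac{1}{2} \le X \le X_2 - \frac{1}{2}\right)$, since $X_1 < X$ does not include $X_1$ and $X < X_2$ does not include $X_2$.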
Note: The normal approximation to binomial probability mass function is good when $n \ge 50$ and neither p nor q is less than 0.1.

Example 38: An unbiased die is tossed 600 times. Use normal approximation to binomial to find the probability of obtaining
(i) more than 125 aces,
(ii) number of aces between 80 and 110,
(iii) exactly 120 aces.
Solution: Let X denote the number of successes, i.e., the number of aces.
$\therefore \mu = np = 600 \times \frac{1}{6} = 100$ and $\sigma = \sqrt{npq} = \sqrt{600 \times \frac{1}{6} \times \frac{5}{6}} = 9.1$
(i) To make correction for continuity, we can write $P(X > 125) = P(X \ge 125 + 0.5)$
Thus, $P(X \ge 125.5) = P\left(z \ge \frac{125.5 - 100}{9.1}\right) = P(z \ge 2.80)$
$= 0.5000 - P(0 \le z \le 2.80) = 0.5000 - 0.4974 = 0.0026.$
(ii) In a similar way, the probability of the number of aces between 80 and 110 is given by
$P(79.5 \le X \le 110.5) = P\left(\frac{79.5 - 100}{9.1} \le z \le \frac{110.5 - 100}{9.1}\right)$
$= P(-2.25 \le z \le 1.15) = P(0 \le z \le 2.25) + P(0 \le z \le 1.15)$
$= 0.4878 + 0.3749 = 0.8627$
(iii) $P(X = 120) = P(119.5 \le X \le 120.5) = P\left(\frac{19.5}{9.1} \le z \le \frac{20.5}{9.1}\right)$
$= P(2.14 \le z \le 2.25) = P(0 \le z \le 2.25) - P(0 \le z \le 2.14) = 0.4878 - 0.4838 = 0.0040$
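The continuity correction used for the die-tossing example can be compared with the exact binomial probabilities. The Python sketch below is illustrative only: the function names are hypothetical, the exact probability mass function is evaluated through logarithms to avoid overflow for n = 600, and the standard deviation is taken as the unrounded $\sqrt{npq}$ rather than 9.1.

```python
import math


def std_normal_cdf(z):
    """Area under the standard normal curve to the left of z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def binom_pmf(k, n, p):
    """Exact binomial probability of k successes, computed via log-gamma for stability."""
    log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
               + k * math.log(p) + (n - k) * math.log(1.0 - p))
    return math.exp(log_pmf)


def binom_prob(lo, hi, n, p):
    """Exact P(lo <= X <= hi) for a binomial variate."""
    return sum(binom_pmf(k, n, p) for k in range(lo, hi + 1))


def normal_approx(lo, hi, n, p):
    """Normal approximation to P(lo <= X <= hi) with the half-unit continuity correction."""
    mu = n * p
    sigma = math.sqrt(n * p * (1.0 - p))
    return (std_normal_cdf((hi + 0.5 - mu) / sigma)
            - std_normal_cdf((lo - 0.5 - mu) / sigma))


n, p = 600, 1 / 6
cases = {
    "more than 125 aces": (126, n),
    "between 80 and 110 aces": (80, 110),
    "exactly 120 aces": (120, 120),
}
results = {}
for label, (lo, hi) in cases.items():
    results[label] = (normal_approx(lo, hi, n, p), binom_prob(lo, hi, n, p))
    approx, exact = results[label]
    print(f"{label}: normal approx = {approx:.4f}, exact binomial = {exact:.4f}")
```

Writing "more than 125" as the range 126 to n makes the correction automatic: the lower limit becomes 125.5, exactly as in part (i) of the solution.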
$P\left(z \ge \frac{29.5 - 25}{5}\right) = P(z \ge 0.9)$
(a) Method of Ordinates: In this method, the ordinates f(X) of the normal curve, for various values of the random variate X, are obtained by using the table of ordinates for a standard normal variate. We can write

$f(X) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{X - \mu}{\sigma}\right)^2} = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}z^2} = \frac{1}{\sigma}\,\phi(z)$

where $z = \frac{X - \mu}{\sigma}$ and $\phi(z) = \frac{1}{\sqrt{2\pi}}\, e^{-\frac{1}{2}z^2}$.

$y = N \cdot f(X) = \frac{N}{\sigma}\,\phi(z)$ and therefore, the expected frequency of a class $= y \times h$, where h is the class interval.
Solution: First we compute mean and standard deviation of the given data.

Class        Mid-values   Frequency   d = (X - 45)/10     fd     fd^2
Intervals       (X)          (f)
10-20            15            2           -3              -6      18
20-30            25           11           -2             -22      44
30-40            35           24           -1             -24      24
40-50            45           33            0               0       0
50-60            55           20            1              20      20
60-70            65            8            2              16      32
70-80            75            2            3               6      18
Total                        100                          -10     156

Note: If the class intervals are not continuous, they should first be made so.

$\therefore \mu = 45 - 10 \times \frac{10}{100} = 44$

and $\sigma = 10\sqrt{\frac{156}{100} - \left(\frac{10}{100}\right)^2} = 10\sqrt{1.55} = 12.4$.

Class        Mid-values   z = (X - μ)/σ   φ(z)           f_e *
Intervals       (X)                       (from table)
10-20            15          -2.34         0.0258           2
20-30            25          -1.53         0.1238          10
30-40            35          -0.73         0.3056          25
40-50            45           0.08         0.3977          32
50-60            55           0.89         0.2685          22
60-70            65           1.69         0.0957           8
70-80            75           2.50         0.0175           1

Here $y = \frac{N}{\sigma}\,\phi(z)$ and the expected frequency is $f_e = y \times h$ with h = 10.
(b) Method of Areas: Under this method, the probabilities or the areas of the random variable lying in various intervals are determined. These probabilities are then multiplied by N to get the expected frequencies. This procedure is explained below for the data of the above example.

Class        Lower Limit   z = (X - 44)/12.4   Area from   Area under    f_e *
Intervals       (X)                             0 to z      the class
10-20            10             -2.74           0.4969       0.0231        2
20-30            20             -1.94           0.4738       0.1030       10
30-40            30             -1.13           0.3708       0.2453       25
40-50            40             -0.32           0.1255       0.3099       31
50-60            50              0.48           0.1844       0.2171       22
60-70            60              1.29           0.4015       0.0806        8
70-80            70              2.10           0.4821       0.0160        2
                 80              2.90           0.4981
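The method of areas can be sketched in Python for the same frequency table. This is an illustration under stated assumptions: the mean and standard deviation are computed from the class mid-values without rounding (giving a standard deviation of about 12.45 rather than the 12.4 used in the text), and the areas come from the error function instead of the printed table, so individual expected frequencies may differ from the tabulated ones by a fraction of a unit.

```python
import math


def std_normal_cdf(z):
    """Area under the standard normal curve to the left of z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# Class limits and observed frequencies from the worked example.
limits = [10, 20, 30, 40, 50, 60, 70, 80]
observed = [2, 11, 24, 33, 20, 8, 2]

n_total = sum(observed)
mids = [(lo + hi) / 2 for lo, hi in zip(limits[:-1], limits[1:])]

# Mean and standard deviation of the grouped data.
mean = sum(f * x for f, x in zip(observed, mids)) / n_total
variance = sum(f * (x - mean) ** 2 for f, x in zip(observed, mids)) / n_total
sd = math.sqrt(variance)

# Expected frequency of a class = N * (area under the fitted curve over the class).
cdf_at_limits = [std_normal_cdf((x - mean) / sd) for x in limits]
expected = [n_total * (hi - lo) for lo, hi in zip(cdf_at_limits[:-1], cdf_at_limits[1:])]

print(f"mean = {mean:.2f}, sd = {sd:.2f}")
for lo, hi, f_obs, f_exp in zip(limits[:-1], limits[1:], observed, expected):
    print(f"{lo}-{hi}: observed {f_obs:3d}, expected {f_exp:6.2f}")
```

Rounding the expected values to whole numbers gives frequencies close to those in the table; any remaining discrepancy reflects the two-decimal rounding of z and of the standard deviation in the hand calculation.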
Hint: Apply correction for continuity.

There are 600 business students in the post graduate department of a university and the probability for any student to need a copy of a particular text book from the university library on any day is 0.05. How many copies of the book should be kept in the library so that the probability that none of the students, needing a copy, has to come back disappointed is greater than 0.90? (Use normal approximation to binomial.)

The grades on a short quiz in biology were 0, 1, 2, 3, ...... 10 points, depending upon the number of correct answers out of 10 questions. The mean grade was 6.7 with standard deviation of 1.2. Assuming the grades to be normally distributed, determine (i) the percentage of students scoring 6 points, (ii) the maximum grade of the lowest 10% of the class.

The following rules are followed in a certain examination. "A candidate is awarded a first division if his aggregate marks are 60% or above, a second division if his aggregate marks are 45% or above but less than 60% and a third division if the aggregate marks are 30% or above but less than 45%. A candidate is declared failed if his aggregate marks are below 30% and awarded a distinction if his aggregate marks are 80% or above." At such an examination, it is found that 10% of the candidates have failed and 5% have obtained distinction. Calculate the percentage of students who were placed in the second division. Assume that the distribution of marks is normal. The areas under the standard normal curve from 0 to z are:

z    :  1.28    1.64    0.41    0.47
Area :  0.4000  0.4500  0.1591  0.1808

Hint: First find parameters of the distribution on the basis of the given information.
9. For a certain normal distribution, the first moment about 10 is 40 and the fourth moment about 50 is 48. What is the mean and standard deviation of the distribution?
Hint: Use the condition $\beta_2 = 3$, for a normal distribution.
10. In a test of clerical ability, a recruiting agency found that the mean and standard deviation of scores for a group of fresh candidates were 55 and 10 respectively. For another experienced group, the mean and standard deviation of scores were found to be 62 and 8 respectively. Assuming a cut-off score of 70, (i) what percentage of the experienced group is likely to be rejected, (ii) what percentage of the fresh group is likely to be selected, (iii) what will be the likely percentage of fresh candidates in the selected group? Assume that the scores are normally distributed.
Hint: See example 33.
11. 1,000 light bulbs with mean life of 120 days are installed in a new factory. Their length of life is normally distributed with standard deviation of 20 days. (i) How many bulbs will expire in less than 90 days? (ii) If it is decided to replace all the bulbs together, what interval should be allowed between replacements if not more than 10 percent bulbs should expire before replacement?
Hint: (ii) $P(X \ge X_1) = 0.9$.
12. The probability density function of a random variable is expressed as
$p(X) = \left(\frac{2}{\pi}\right)^{1/2} e^{-2(X - 3)^2}$, $(-\infty < X < \infty)$
(i) Identify the distribution.
(ii) Determine the mean and standard deviation of the distribution.
(iii) Write down two important properties of the distribution.
Hint: Normal distribution.
13. The weekly wages of 2,000 workers in a factory are normally distributed with a mean of Rs 200 and a variance of 400. Estimate the lowest weekly wages of the 197 highest paid workers and the highest weekly wages of the 197 lowest paid workers.
Hint: See example 32.
14. Among 10,000 random digits, in how many cases do we expect that the digit 3 appears at the most 950 times? (The area under the standard normal curve for z = 1.667 is 0.4525 approximately.)
Hint: $\mu = 10000 \times 0.10$ and $\sigma^2 = 1000 \times 0.9$.
15. Marks obtained by certain number of students are assumed to be normally distributed with mean 65 and variance 25. If three students are taken at random, what is the probability that exactly two of them will have marks over 70?
Hint: Find the probability (p) that a student gets marks over 70. Then find ${}^3C_2\, p^2 q$.
16. The wage distribution of workers in a factory is normal with mean Rs 400 and standard deviation Rs 50. If the wages of 40 workers be less than Rs 350, what is the total number of workers in the factory? [ given (0,1).]
Hint: N $\times$ Probability that wage is less than 350 = 40.
17. The probability density function of a continuous random variable X is given by
f(X) = kX(2 - X), 0 < X < 2
= 0 elsewhere.
Calculate the value of the constant k and E(X).
Hint: To find k, use the fact that total probability is unity.
18. f(X) = ... distribution with mean zero and variance 1/50. Is the statement true?
Hint: Transform X into standard normal variate z.
19. The income of a group of 10,000 persons was found to be normally distributed with mean Rs 1,750 p.m. and standard deviation Rs 50. Show that about 95% of the persons of the group had income exceeding Rs 1,668 and only 5% had income exceeding Rs 1,832.
Hint: See example 30.
20. A complex television component has 1,000 joints soldered by a machine which is known to produce on an average one defective in forty. The components are examined and the faulty soldering corrected by hand. If the components requiring more than 35 corrections are discarded, what proportion of the components will be thrown away?
Hint: Use normal approximation to Poisson distribution.
21. The average number of units produced by a manufacturing concern per day is 355 with a standard deviation of 50. It makes a profit of Rs 1.50 per unit. Determine the percentage of days when its total profit per day is (i) between Rs 457.50 and Rs 645.00, (ii) greater than Rs 628.50.
Hint: Find the probabilities of producing 457.50/1.50 to 645/1.50 units etc.
22. A tyre manufacturing company wants 90% of its tyres to have a wear life of at least 40,000 kms. If the standard deviation of the wear lives is known to be 3,000 kms, find the lowest acceptable average wear life that must be achieved by the company. Assume that the wear life of tyres is normally distributed.
25. An automobile company buys nuts of a specified mean diameter $\mu$. A nut is classified as defective if its diameter differs from $\mu$ by more than 0.2 mm. The company requires that not more than 1% of the nuts may be defective. What should be the maximum variability that the manufacturer can allow in the production of nuts so as to satisfy the automobile company?
Hint: Find S.D.
Check Your Progress 11.2
1. What are the characteristics of Poisson Distribution?
2. What are the objectives for fitting a normal curve?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
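Distribution        p.m.f./p.d.f.                                                                   Parameters
Hypergeometric      $P(r) = \frac{{}^{k}C_r \cdot {}^{N-k}C_{n-r}}{{}^{N}C_n}$                       n, N and k
Poisson             $P(r) = \frac{e^{-m} m^r}{r!}$                                                   m
Exponential         $f(t) = m e^{-mt}$                                                               m
Normal              $p(X) = \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{1}{2}\left(\frac{X - \mu}{\sigma}\right)^2}$   $\mu$ and $\sigma$
Standard normal     $p(z) = \frac{1}{\sqrt{2\pi}} e^{-\frac{1}{2}z^2}$                               0 and 1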
11.14 KEYWORDS
Binomial Distribution Random Variable Poisson Distribution
Poisson distribution is not used as a model.
The curve used to describe the accidental errors is the Gaussian curve.
Fitting of Binomial Distribution
Pascal Distribution
Poisson Distribution
Geometrical Distribution
Normal Distribution
1. What do you understand by a theoretical probability distribution? How is it useful in business decision-making?
2. Define a binomial distribution. State the conditions under which binomial probability model is appropriate.
3. What are the parameters of a binomial distribution? Obtain expressions for mean and variance of the binomial variate in terms of these parameters.
4. What is a 'Poisson Process'? Obtain probability mass function of Poisson variate as a limiting form of the probability mass function of binomial variate.
5. Obtain mean and standard deviation of a Poisson random variate. Discuss some business and economic situations where Poisson probability model is appropriate.
6. How will you use Poisson distribution as an approximation to binomial? Explain with the help of an example.
7. Under what conditions will a random variable follow a normal distribution? State some important features of a normal probability curve.
8. What is a standard normal distribution? Discuss the importance of normal distribution in statistical theory.
9. State clearly the assumptions under which a binomial distribution tends to Poisson and to normal distribution.
10. Assume that the probability that a bomb dropped from an aeroplane will strike a target is 1/5. If six bombs are dropped, find the probability that (i) exactly two will strike the target, (ii) at least two will strike the target. 11. An unbiased coin is tossed 5 times. Find the probability of getting (i) two heads, (ii) at least two heads.
12. An experiment succeeds twice as many times as it fails. Find the probability that in 6 trials there will be (i) no successes, (ii) at least 5 successes, (iii) at the most 5 successes. 13. In an army battalion 60% of the soldiers are known to be married and remaining unmarried. If p(r) denotes the probability of getting r married soldiers from 5 soldiers, calculate p(0), p(1), p(2), p(3), p(4) and p(5). If there are 500 rows each consisting of 5 soldiers, approximately how many rows are expected to contain (i) all married soldiers, (ii) all unmarried soldiers? 14. A company has appointed 10 new secretaries out of which 7 are trained. If a particular executive is to get three secretaries, selected at random, what is the chance that at least one of them will be untrained? 15. The overall pass rate in a university examination is 70%. Four candidates take up such examination. What is the probability that (i) at least one of them will pass (ii) at the most 3 will pass (iii) all of them will pass, the examination? 16. 20% of bolts produced by a machine are defective. Deduce the probability distribution of the number of defectives in a sample of 5 bolts. 17. 25% employees of a firm are females. If 8 employees are chosen at random, find the probability that (i) 5 of them are males (ii) more than 4 are males (iii) less than 3 are females. 18. A supposed coffee connoisseur claims that he can distinguish between a cup of instant coffee and a cup of percolator coffee 75% of the times. It is agreed that his claim will be accepted if he correctly identifies at least 5 of the 6 cups. Find, (i) his chance of having the claim accepted if he is in fact only guessing and, (ii) his chance of having the claim rejected when he does not have the ability he claims. 19. It is known that 10% of the accounts of a firm contain errors. An auditor selects 5 accounts of the firm at random and finds that 3 of them contain errors. What is the probability of this result? 
What do you conclude on the basis of this result? 20. The incidence of an occupational disease in an industry is such that the workers have a 20% chance of suffering from it. What is the probability that out of 6 workmen, 4 or more will contract the disease? 21. A local politician claims that the assessed value of houses, for house tax purposes by the Municipal Corporation of Delhi, is not correct in 90% of the cases. Assuming this claim to be true, what is the probability that out of a sample of 4 houses selected at random (i) at least one will be found to be correctly assessed (ii) at least one will be found to be wrongly assessed? 22. There are 64 beds in a garden and 3 seeds of a particular type are sown in each bed. The probability of a flower being white is 0.25. Find the number of beds with 3, 2, 1 and 0 white flowers. 23. Suppose that half the population of a town are consumers of rice. 100 investigators are appointed to find out its truth. Each investigator interviews 10 individuals. How many investigators do you expect to report that three or less of the people interviewed are consumers of rice? 24. If the probability of a success in a trial is 0.2, find (a) mean, (b) variance, (c) moment coefficient of skewness and kurtosis of the number of successes in 100 trials.
25. There are 5 machines in a factory which may require adjustment from time to time during a day of their use. Two of these machines are of type I, each having a probability of 1/5 of needing adjustment during a day and 3 are of type II, having corresponding probability of 1/10. Assuming that no machine needs adjustment twice on the same day, find the probability that on a particular day (i) just 2 machines of type II and none of type I need adjustment, (ii) if just 2 machines need an adjustment, they are of the same type.
x : 0 1 2 3 4
f : 28 62 46 10 4
27. Five coins were tossed 100 times and the outcomes are recorded as given below. Compute the expected frequencies.
No. of heads       : 0 1 2 3 4 5
Observed frequency : 2 10 24 38 18 8
28. The administrator of a large airport is interested in the number of aircraft departure delays that are attributable to inadequate control facilities. A random sample of 10 aircraft take-offs is to be thoroughly investigated. If the true proportion of such delays in all departures is 0.40, what is the probability that 4 of the sample departures are delayed because of control inadequacies? Also find mean, variance and mode of the random variable. 29. A company makes T.Vs., of which 15% are defective. 15 T.Vs. are shipped to a dealer. If each T.V. assembled is considered an independent trial, what is the probability that the shipment of 15 T.Vs. contains (i) no defective (ii) at the most one defective T.V.? 30. If 2% bulbs manufactured by a company are defective and the random variable denotes the number of defective bulbs, find mean, variance, measures of moment coefficient of skewness and kurtosis in a total of 50 bulbs. 31. 4096 families having just 4 children were chosen at random. Assuming the probability of a male birth equal to 1/2, compute the expected number of families having 0, 1, 2, 3 and 4 male children. 32. If the number of telephone calls an operator receives from 9.00 A.M. to 9.30 A.M. follows a Poisson distribution with mean, m = 2, what is the probability that the operator will not receive a phone call in the same interval tomorrow? 33. (a) Write down the probability mass function of a Poisson distribution with parameter 3. What are the values of mean and variance of the corresponding random variable? (b) If X is a Poisson variate with parameter 2, find $P(3 \le X \le 5)$. (c) The standard deviation of a Poisson variate is 2, find $P(1 \le X < 2)$.
34. Suppose that a telephone switch board handles 240 calls on the average during a rush hour and that the board can make at the most 10 connections per minute. Using Poisson distribution, estimate the probability that the board will be over taxed during a given minute. 35. An automatic machine makes paper clips from coils of a wire. On the average, 1 in 400 paper clips is defective. If the paper clips are packed in boxes of 100, what is the probability that any given box of clips will contain (i) no defective (ii) one or more defectives, (iii) less than two defectives?
36. Certain mass produced articles, of which 0.5% are defective, are packed in cartons each containing 100 articles. What proportion of the cartons is expected to be free from defective articles and what proportion contains 2 or more defective articles? 37. A certain firm uses a large fleet of delivery vehicles. Their records over a long period of time (during which their fleet size utilisation may be assumed to have remained constant) show that the average number of vehicles serviced per day is 3. Estimate the probability that on a given day (i) no vehicle will be serviceable, (ii) at the most 3 vehicles will be serviceable, (iii) more than 2 vehicles will be unserviceable. 38. Suppose that a local electrical appliances shop has found from experience that the demand for tube lights is distributed as a Poisson variate with a mean of 4 tube lights per week. If the shop keeps 6 tube lights during a particular week, what is the probability that the demand will exceed the supply during that week? 39. In a Poisson distribution, the probability of zero success is 15%. Find its mean and standard deviation. 40. Four unbiased coins are tossed 1,600 times. Using Poisson distribution, find the approximate probability of getting 4 heads r times. 41. The number of road accidents on a highway during a month follows a Poisson distribution with mean 6. Find the probability that in a certain month the number of accidents will be (i) not more than 3, (ii) between 2 and 4. 42. A random variable X follows Poisson law such that P(X = k) = P(X = k + 1). Find its mean and variance. 43. The probability that a man aged 45 years will die within a year is 0.012. What is the probability that of 10 such men at least 9 will reach their 46th birthday? (Given $e^{-0.12} = 0.88692$). 44. During a period, persons arrive at a railway booking counter at the rate of 30 per hour. What is the probability that two or less persons will arrive in a period of 5 minutes? 45. An insurance company insures 4,000 people against loss of both eyes in a car accident. Based upon previous data, the rates were computed on the assumption that on the average 10 persons in 1,00,000 will have car accidents each year with this type of injury. What is the probability that more than 3 of the insured will collect on their policy in a given year? 46. A manufacturer, who produces medicine bottles, finds that 0.1% of the bottles are defective. The bottles are packed in boxes containing 500 bottles. A drug manufacturer buys 100 boxes from the producer of the bottles.
Use Poisson distribution to find the number of boxes containing (i) no defective bottles (ii) at least two defective bottles. 47. A factory turning out lenses, supplies them in packets of 1,000. The packet is considered by the purchaser to be unacceptable if it contains 50 or more defective lenses. If a purchaser selects 30 lenses at random from a packet and adopts the criterion of rejecting the packet if it contains 3 or more defectives, what is the probability that the packet (i) will be accepted, (ii) will not be accepted? 48. 800 employees of a company are covered under the medical group insurance scheme. Under the terms of coverage, 40 employees are identified as belonging to 'High Risk' category. If 50 employees are selected at random, what is the probability that
(i) none of them is in the high risk category, (ii) at the most two are in the high risk category? (You may use Poisson approximation to Binomial). 49. In Delhi with 100 municipal wards, each having approximately the same population, the distribution of meningitis cases in 1987 were recorded as follows:
No. of Cases : 0 1 2 3 4
No. of Wards : 63 28 6 2 1
Fit a Poisson distribution to the above data. 50. The following table gives the number of days in 50 day-period during which automobile accidents occurred in a certain part of the city. Fit a Poisson distribution to the data.
No. of accidents : 0 1 2 3 4
No. of days      : 19 18 8 4 1
51. A sample of woollen balls has a mean weight of 3.2 oz. and standard deviation of 1 oz. Assuming that the weight of woollen balls is distributed normally, (i) How many balls are expected to weigh between 2.7 and 3.7 oz., (ii) what is the probability that weight of a ball is less than 1.5 oz. and (iii) what is the probability that the weight of the ball will exceed 4.7 oz.? 52. The weekly wages of 2,000 workers are normally distributed. Its mean and standard deviation are Rs 140 and Rs 10 respectively. Estimate the number of workers with weekly wages (i) between Rs 120 and Rs 130, (ii) more than Rs 170, (iii) less than Rs 165. 53. Find the probability that the value of an item drawn at random from a normal distribution with mean 20 and standard deviation 10 will be between (i) 10 and 15, (ii) 5 and 10 and (iii) 15 and 25. 54. In a manufacturing organisation the distribution of wages was perfectly normal and the number of workers employed in the organisation was 5,000. The mean wages of the workers were calculated as Rs 800 p.m. and the standard deviation was worked out to be Rs 200. Estimate (i) the number of workers getting wages between Rs 700 and Rs 900, (ii) the percentage of workers getting wages above Rs 1,000, (iii) the percentage of workers getting wages below Rs 600. 55. Suppose that the waist measurements W of 800 girls are normally distributed with mean 66 cms and standard deviation 5 cms. Find the number of girls with waists; (i) between 65 and 70 cms. (ii) greater than or equal to 72 cms. 56. (a) A normal distribution has 77.0 as its mean. Find its standard deviation if 20% of the area under the curve lies to the right of 90.0. (b) A random variable has a normal distribution with 10 as its standard deviation. Find its mean if the probability that the random variable takes on a value less than 80.5 is 0.3264.
57. In a particular examination an examinee can get marks ranging from 0 to 100. Last year, 1,00,000 students took this examination. The marks obtained by them followed a normal distribution. What is the probability that the marks obtained by a student selected at random would be exactly 63?
58. A collection of human skulls is divided into three classes according to the value of a 'length breadth index' x. Skulls with x < 75 are classified as 'long', those with 75 < x < 80 as 'medium' and those with x > 80 as 'short'. The percentage of skulls in the three classes in this collection are respectively 58, 38 and 4. Find, approximately, the mean and standard deviation of x on the assumption that it is normally distributed. 59. A wholesale distributor of a fertiliser product finds that the annual demand for one type of fertiliser is normally distributed with a mean of 120 tonnes and standard deviation of 16 tonnes. If he orders only once a year, what quantity should be ordered to ensure that there is only a 5% chance of running short? 60. A multiple choice quiz has 200 questions, each with 4 possible answers of which only one is correct. What is the probability (using normal approximation to binomial distribution) that sheer guess work yields from 25 to 30 correct answers for 80 questions (out of 200 questions) about which the student has no knowledge? 61. In a normal distribution 31% of the items are under 45 and 8% are over 64. Find the mean and standard deviation of the distribution. 62. The mean life of the bulbs manufactured by a company is estimated to be 2,025 hours. By using normal approximation to Poisson distribution, estimate the percentage of bulbs that are expected to last for (i) less than 2,100 hours, (ii) between 1,900 and 2,000 hours and (iii) more than 2,000 hours. 63. Find mean and standard deviation if a score of 51 is 2 standard deviations above mean and a score of 42 is 1 standard deviation below mean. Assume that the scores are normally distributed. 64. (a) A manufacturer requires washers with specification of its inner diameter as $3.30 \pm 0.04$ mm. If the inner diameters of the washers supplied by some suppliers are distributed normally with mean $\mu = 3.31$ mm. and $\sigma = 0.02$ mm., what percentage of the washers, supplied in a lot, are expected to meet the required specification? (b) A cylinder making machine has $\sigma = 0.5$ mm. At what value of $\mu$ should the machine be set to ensure that 2.5% of the cylinders have diameters of 25.48 mm. or more?
65. The mean life of a pair of shoes manufactured by a company is estimated to be 2.59 years with a standard deviation of 3 months. What should be fixed as guarantee period so that the company has not to replace more than 5% of the pairs? 66. In a large group of men, it is found that 5% are under 60 inches and 40% are between 60 and 65 inches in height. Assuming the distribution to be exactly normal, find the mean and standard deviation of the height. The values of z for area equal to 0.45 and 0.05 between 0 to z are 1.645 and 0.125 respectively. 67. Packets of a certain washing powder are filled with an automatic machine with an average weight of 5 kg. and a standard deviation of 50 gm. If the weights of packets are normally distributed, find the percentage of packets having weight above 5.10 kg. 68. For a normal distribution with mean 3 and variance 16, find the value of y such that the probability of the variate lying in the interval (3, y) is 0.4772. 69. The mean income of people working in an industrial city is approximated by a normal distribution with a mean of Rs 24,000 and a standard deviation of Rs 3,000. What percentage of the people in this city have income exceeding Rs 28,500? In a random sample of 50 employed persons of this city, about how many can be expected to have income less than Rs 19,500?
70. The burning time of an experimental rocket is a random variable which has normal distribution with $\mu = 4.36$ seconds and $\sigma = 0.04$ seconds. What are the probabilities that this kind of rocket will burn for (i) less than 4.5 seconds, (ii) more than 4.40 seconds, (iii) between 4.30 and 4.42 seconds. 71. A company manufactures batteries and guarantees them for a life of 24 months. (i) If the average life has been found in tests to be 33 months with a standard deviation of 4 months, how many batteries will have to be replaced under guarantee if the life of the batteries follows a normal distribution? (ii) If annual sales are 10,000 batteries at a profit of Rs 50 each and each replacement costs the company Rs 100, find the net profit.
(iii) Would it be worth its while to extend the guarantee to 27 months if the sales were to be increased by this extra offer to 12,000 batteries? 72. The distribution of total time a light bulb will last from the moment it is first put into service is known to be exponential with mean time between failure of the bulbs equal to 1,000 hours. What is the probability that the bulb will last for more than 1,000 hours? 73. An editor of a publishing company calculates that it requires 11 months on an average to complete the publication process from the manuscript to finished book with a standard deviation of 2.4 months. He believes that the distribution of publication time is well described by a normal distribution. Determine, out of 190 books that he will handle this year, how many will complete the process in less than a year? 74. The I.Q.'s of army volunteers in a given year are normally distributed with mean = 110 and standard deviation = 10. The army wants to give advanced training to 20% of those recruits with the highest scores. What is the lowest I.Q. score acceptable for the advanced training? 75. If 60% of the voters in a constituency favour a particular candidate, find the probability that in a sample of 300 voters, more than 170 voters would favour the candidate. Use normal approximation to the binomial. 76. From the past experience, a committee for admission to certain course consisting of 200 seats, has estimated that 5% of those granted admission do not turn up. If 208 letters of intimation of admission are issued, what is the probability that seat is available for all those who turn up? Use normal approximation to the binomial. 77. The number of customer arrivals at a bank is a Poisson process with average of 6 customers per 10 minutes. (a) What is the probability that the next customer will arrive within 3 minutes? (b) What is the probability that the time until the next customer arrives will be from 2 to 3 minutes? 
(c) What is the probability that the next customer will arrive after more than 4 minutes? 78. Comment on the following statements : (i) The mean of a normal distribution is 10 and the third order central moment is 1.5. (ii) The mean of a Poisson variate is 4 and standard deviation is 3. (iii) The mean of a binomial variate is 10 and standard deviation is 4. (iv) The probability that a discrete random variable X takes a value X = a is equal to P(X = a), where P(X) is probability mass function of the random variable. (v) The probability that a continuous random variable X takes a value X = a is equal to f(X = a), where f(X) is probability density function of the random variable. (vi) The second raw moment of a Poisson distribution is 2. The probability $P(X = 0) = e^{-1}$. (vii) The variance of a binomial distribution cannot exceed $\frac{n}{4}$. (viii) If for a Poisson variate X, P(X = 1) = P(X = 2), then E(X) = 2. (ix) If for a Poisson variate X, P(X = 0) = P(X = 1), then $P(X > 0) = e^{-1}$. (x) $\beta_1 = 0$ and $\beta_2 = 3$ for a normal distribution.
79. State whether the following statements are True/False : (i) Mean of a binomial variate is always greater than its variance. (ii) Mean of a Poisson variate may or may not be equal to its variance. (iii) Time required for the arrival of two telephone calls at a desk is a Poisson variate. (iv) A normal distribution is always symmetrical. (v) A binomial distribution with p = q is always symmetrical. (vi) The probability function of a continuous random variable is called a probability mass function. (vii) The parameters of a distribution completely determine the distribution. (viii) Any normal variate with given mean and standard deviation can be transformed into a standard normal variate. (ix) The number of suicide cases in a given year is a binomial variate. (x) Since the probability that a continuous random variable takes a particular value is zero, the event is said to be impossible.
80. Fill in the blanks : (i) If three balls are drawn, successively with replacement, from a bag containing 4 red and 3 black balls, the number of red balls is a ...... random variable. (ii) A standard normal variate has mean equal to ...... and standard deviation equal to ...... . (iii) When 1 - p > p, the binomial distribution is ...... skewed. (iv) The ...... of a binomial variate with mean = 4 and standard deviation = $\sqrt{3}$ are 16 and $\frac{1}{4}$. (v) A normal variate obtained by subtracting its mean and dividing by its standard deviation is known as ...... variate. (vi) If the expected value of a Poisson variate is 9, its ...... is 3. (vii) The number of defects per unit of length of a wire is a ...... variate. (viii) The time of occurrence of an event is an ...... variate. (ix) The number of trials needed to get a given number of successes is a ...... variate. (x) Normal distribution is also known as the normal law of ...... .
ANSWERS TO QUESTIONS FOR
(d) Bell (d) True
LESSON 12
PROBABILITY DISTRIBUTION OF A RANDOM VARIABLE
CONTENTS
12.0 Aims and Objectives
12.1 Introduction
12.2 Probability Distribution of a Random Variable
12.2.1 Discrete and Continuous Probability Distributions
12.2.2 Cumulative Probability Function or Distribution Function
12.3 Mean and Variance of a Random Variable
12.4 Theorems on Expectation
12.5 Joint Probability Distribution
12.5.1 Marginal Probability Distribution
12.5.2 Conditional Probability Distribution
12.5.3 Expectation of the Sum or Product of two Random Variables
12.5.4 Expectation of a Function of Random Variables
12.6 Decision Analysis under Certainty
12.7 Decision-making under Uncertainty
12.8 Decision-making under Risk
12.9 Expected Value with Perfect Information (EVPI)
12.10 Use of Subjective Probabilities in Decision-making
12.11 Use of Posterior Probabilities in Decision-making
12.12 Let us Sum Up
12.13 Lesson-end Activity
12.14 Keywords
12.15 Questions for Discussion
12.16 Terminal Questions
12.17 Model Answers to Questions for Discussion
12.18 Suggested Readings
12.1 INTRODUCTION
A random variable X is a real valued function of the elements of sample space S, i.e., different values of the random variable are obtained by associating a real number with each element of the sample space. A random variable is also known as a stochastic or chance variable. Mathematically, we can write X = F(e), where $e \in S$ and X is a real number. We can note here that the domain of this function is the set S and the range is a set or subset of real numbers.
Example 1: Three coins are tossed simultaneously. Write down the sample space of the random experiment. What are the possible values of the random variable X, if it denotes the number of heads obtained?
Solution: The sample space of the experiment can be written as S = {(H,H,H), (H,H,T), (H,T,H), (T,H,H), (H,T,T), (T,H,T), (T,T,H), (T,T,T)}
We note that the first element of the sample space denotes 3 heads, therefore, the corresponding value of the random variable will be 3. Similarly, the value of the random variable corresponding to each of the second, third and fourth element will be 2 and it will be 1 for each of the fifth, sixth and seventh element and 0 for the last element. Thus, the random variable X, defined above can take four possible values, i.e., 0, 1, 2 and 3. It may be pointed out here that it is possible to define another random variable on the above sample space.
$$P(X = 0) = \frac{1}{8}, \quad P(X = 1) = \frac{3}{8}, \quad P(X = 2) = \frac{3}{8} \quad \text{and} \quad P(X = 3) = \frac{1}{8}.$$
The set of all possible values of the random variable X along with their respective probabilities is termed as Probability Distribution of X. The probability distribution of X, defined in example 1 above, can be written in a tabular form as given below:
X    : 0    1    2    3    Total
p(X) : 1/8  3/8  3/8  1/8  1
Note that the total probability is equal to unity. In general, the set of n possible values of a random variable X, i.e., {X1, X2, ...... Xn} along with their respective probabilities p(X1), p(X2), ...... p(Xn), where $\sum_{i=1}^{n} p(X_i) = 1$, is called a probability distribution of X. The expression p(X) is called the probability function of X.
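The distribution in example 1 can be checked by listing the sample space and counting heads. The following is a minimal sketch, assuming the three coins are unbiased so that all 8 outcomes are equally likely; the names `sample_space` and `distribution` are illustrative only.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Sample space of three coin tosses: 8 outcomes, assumed equally likely.
sample_space = list(product("HT", repeat=3))

# The random variable X maps each outcome to its number of heads.
counts = Counter(outcome.count("H") for outcome in sample_space)

# p(X) = (number of outcomes giving that value of X) / (total outcomes).
distribution = {
    x: Fraction(n, len(sample_space)) for x, n in sorted(counts.items())
}

for x, p in distribution.items():
    print(f"P(X = {x}) = {p}")
print("Total probability =", sum(distribution.values()))
```

Exact fractions are used instead of floating point numbers so that the total probability can be compared with unity without rounding error.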
The random variable defined in example 1 is a discrete random variable. However, if X denotes the measurement of heights of persons or the time interval of arrival of a specified number of calls at a telephone desk, etc., it would be termed as a continuous random variable. The distribution of a discrete random variable is called the Discrete Probability Distribution and the corresponding probability function p(X) is called a Probability Mass Function. In order that any discrete function p(X) may serve as probability function of a discrete random variable X, the following conditions must be satisfied :
(i) $p(X_i) \ge 0 \;\; \forall \; i = 1, 2, ...... n$ and
(ii) $\sum_{i=1}^{n} p(X_i) = 1$
In a similar way, the distribution of a continuous random variable is called a Continuous Probability Distribution and the corresponding probability function p(X) is termed as the Probability Density Function. The conditions for any function of a continuous variable to serve as a probability density function are :
(i) $p(X) \ge 0 \;\; \forall$ real values of X, and
(ii) $\int_{-\infty}^{\infty} p(X)\,dX = 1$
Remarks:
1. When X is a continuous random variable, there are an infinite number of points in the sample space and thus, the probability that X takes a particular value is always defined to be zero even though the event is not regarded as impossible. Hence, we always talk of the probability of a continuous random variable lying in an interval.
2. The concept of a probability distribution is not new. In fact it is another way of representing a frequency distribution. Using statistical definition, we can treat the relative frequencies of various values of the random variable as the probabilities.
Example 2: Two unbiased dice are thrown. Let the random variable X denote the sum of points obtained. Construct the probability distribution of X.
Solution: The possible values of the random variable are : 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12. The probabilities of various values of X are shown in the following table :
Probability Distribution of X
X    : 2     3     4     5     6     7     8     9     10    11    12    Total
p(X) : 1/36  2/36  3/36  4/36  5/36  6/36  5/36  4/36  3/36  2/36  1/36  1
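The table of example 2 can be reproduced by counting, for each possible sum, how many of the 36 equally likely ordered pairs of faces give that sum. This is a small sketch under that assumption; `dice_distribution` is an illustrative name.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

faces = range(1, 7)

# Count how many ordered pairs (first die, second die) give each sum.
sum_counts = Counter(a + b for a, b in product(faces, repeat=2))

# Each of the 36 pairs has probability 1/36 for unbiased dice.
dice_distribution = {
    total: Fraction(n, 36) for total, n in sorted(sum_counts.items())
}

for total, p in dice_distribution.items():
    # Fraction reduces automatically, e.g. 6/36 is shown as 1/6.
    print(f"P(X = {total}) = {p}")
```

Note that `Fraction` prints each probability in lowest terms, so 2/36 appears as 1/18; the values are the same as those in the table.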
Example 3: Three marbles are drawn at random from a bag containing 4 red and 2 white marbles. If the random variable X denotes the number of red marbles drawn, construct the probability distribution of X.
Solution: The given random variable can take 3 possible values, i.e., 1, 2 and 3. Thus, we can compute the probabilities of various values of the random variable as given below :
$$P(X = 1) = \frac{{}^4C_1 \times {}^2C_2}{{}^6C_3} = \frac{4}{20}$$
$$P(X = 2) = \frac{{}^4C_2 \times {}^2C_1}{{}^6C_3} = \frac{12}{20}$$
$$P(X = 3) = \frac{{}^4C_3}{{}^6C_3} = \frac{4}{20}$$
Note: In the event of white balls being greater than 2, the possible values of the random variable would have been 0, 1, 2 and 3.
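The counting argument of example 3, together with the note on the range of X, can be sketched as follows. The function name `red_marble_distribution` and its parameters are illustrative assumptions, not notation used in the text.

```python
from fractions import Fraction
from math import comb


def red_marble_distribution(red, white, drawn):
    """Distribution of the number of red marbles when `drawn` marbles
    are taken without replacement from `red` + `white` marbles."""
    total_ways = comb(red + white, drawn)
    # At least (drawn - white) red marbles must appear when there are
    # too few white marbles to fill the whole draw.
    lowest = max(0, drawn - white)
    highest = min(red, drawn)
    return {
        k: Fraction(comb(red, k) * comb(white, drawn - k), total_ways)
        for k in range(lowest, highest + 1)
    }


example_3 = red_marble_distribution(red=4, white=2, drawn=3)
for k, p in example_3.items():
    print(f"P(X = {k}) = {p}")

# With 3 white marbles the value X = 0 becomes possible as well.
print(sorted(red_marble_distribution(red=4, white=3, drawn=3)))
```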
$$F(x) = P(X \le x) = \int_{-\infty}^{x} p(X)\,dX$$
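The mean of a random variable or its probability distribution is often denoted by $\mu$, i.e., $E(X) = \mu$.
Remarks: The mean of a frequency distribution can be written as
$$\bar{X} = X_1 \cdot \frac{f_1}{N} + X_2 \cdot \frac{f_2}{N} + ...... + X_n \cdot \frac{f_n}{N},$$
in which each relative frequency $\frac{f_i}{N}$ plays the role of the probability $p(X_i)$.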
Variance: The concept of variance of a random variable or its probability distribution is also similar to the concept of the variance of a frequency distribution. The variance of a frequency distribution is given by
$$\sigma^2 = \frac{1}{N} \sum f_i (X_i - \bar{X})^2 = \sum (X_i - \bar{X})^2 \cdot \frac{f_i}{N} = \text{Mean of } (X_i - \bar{X})^2 \text{ values.}$$
The expression for variance of a probability distribution with mean $\mu$ can be written in a similar way, as given below :
$$\sigma^2 = E(X - \mu)^2 = \sum_{i=1}^{n} (X_i - \mu)^2 \, p(X_i)$$
Remarks: If X is a continuous random variable with probability density function p(X), then
$$E(X) = \int_{-\infty}^{\infty} X \cdot p(X)\,dX$$
$$\sigma^2 = E(X - \mu)^2 = \int_{-\infty}^{\infty} (X - \mu)^2 \cdot p(X)\,dX$$
Moments
The rth moment of a discrete random variable about its mean is defined as:
$$\mu_r = E(X - \mu)^r = \sum_{i=1}^{n} (X_i - \mu)^r \, p(X_i)$$
Similarly, the rth moment about any arbitrary value A, can be written as
$$\mu'_r = E(X - A)^r = \sum_{i=1}^{n} (X_i - A)^r \, p(X_i)$$
The expressions for the central and the raw moments, when X is a continuous random variable, can be written as
$$\mu_r = E(X - \mu)^r = \int_{-\infty}^{\infty} (X - \mu)^r \, p(X)\,dX$$
and
$$\mu'_r = E(X - A)^r = \int_{-\infty}^{\infty} (X - A)^r \, p(X)\,dX$$
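The discrete moment formulas translate directly into a short function. The sketch below applies them to the distribution of example 1 (number of heads in three tosses); the helper name `moment` and its arguments are hypothetical.

```python
from fractions import Fraction


def moment(distribution, r, about):
    """r-th moment of a discrete distribution about the value `about`:
    the sum of (X_i - about)**r * p(X_i) over all values X_i."""
    return sum((x - about) ** r * p for x, p in distribution.items())


# Distribution of the number of heads in three coin tosses (example 1).
coins = {
    0: Fraction(1, 8),
    1: Fraction(3, 8),
    2: Fraction(3, 8),
    3: Fraction(1, 8),
}

mean = moment(coins, 1, about=0)      # first raw moment about A = 0
mu_2 = moment(coins, 2, about=mean)   # second central moment (variance)
mu_3 = moment(coins, 3, about=mean)   # third central moment

print("mean =", mean)
print("mu_2 =", mu_2)
print("mu_3 =", mu_3)
```

The third central moment comes out as zero here because the distribution is symmetrical about its mean.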
$$E(aX) = \sum_{i=1}^{n} a X_i \, p(X_i) = a \sum_{i=1}^{n} X_i \, p(X_i) = aE(X)$$
Combining the results of theorems 1 and 2, we can write E(aX + b) = aE(X) + b
Remarks : Using the above result, we can write an alternative expression for the variance of X, as given below :
$$\sigma^2 = E(X - \mu)^2 = E(X^2 - 2\mu X + \mu^2) = E(X^2) - 2\mu E(X) + \mu^2 = E(X^2) - 2\mu^2 + \mu^2$$
$$= E(X^2) - \mu^2 = E(X^2) - [E(X)]^2 = \text{Mean of Squares} - \text{Square of the Mean}$$
We note that the above expression is identical to the expression for the variance of a frequency distribution.
Theorems on Variance
Theorem 1: The variance of a constant is zero.
Proof: Let b be the given constant. We can write the expression for the variance of b as: $Var(b) = E[b - E(b)]^2 = E[b - b]^2 = 0$.
Theorem 2: Var(X + b) = Var(X).
Proof: We can write $Var(X + b) = E[X + b - E(X + b)]^2 = E[X + b - E(X) - b]^2 = E[X - E(X)]^2 = Var(X)$. Similarly, it can be shown that Var(X - b) = Var(X).
Remarks: The above theorem shows that variance is independent of change of origin.
Theorem 3: $Var(aX) = a^2 Var(X)$
Proof: We can write $Var(aX) = E[aX - E(aX)]^2 = E[aX - aE(X)]^2 = a^2 E[X - E(X)]^2 = a^2 Var(X)$.
Combining the results of theorems 2 and 3, we can write $Var(aX + b) = a^2 Var(X)$. This result shows that the variance is independent of change of origin but not of change of scale.
Remarks:
1. On the basis of the theorems on expectation and variance, we can say that if X is a random variable, then its linear combination, aX + b, is also a random variable with mean aE(X) + b and variance equal to $a^2 Var(X)$.
2. The above theorems can also be proved for a continuous random variable.
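The combined results E(aX + b) = aE(X) + b and Var(aX + b) = a²Var(X) can be verified numerically on any discrete distribution. The sketch below uses an unbiased die and the arbitrary, purely illustrative constants a = 3 and b = -2; all function names are assumptions.

```python
from fractions import Fraction


def expectation(distribution):
    """E(X) = sum of X_i * p(X_i)."""
    return sum(x * p for x, p in distribution.items())


def variance(distribution):
    """Var(X) = E[X - E(X)]^2."""
    mean = expectation(distribution)
    return sum((x - mean) ** 2 * p for x, p in distribution.items())


def linear_transform(distribution, a, b):
    """Distribution of aX + b (a must be non-zero so values stay distinct)."""
    return {a * x + b: p for x, p in distribution.items()}


die = {face: Fraction(1, 6) for face in range(1, 7)}
a, b = 3, -2  # illustrative constants only
transformed = linear_transform(die, a, b)

print(expectation(transformed), "=", a * expectation(die) + b)
print(variance(transformed), "=", a ** 2 * variance(die))
```

Changing b alone leaves the variance untouched, while changing a rescales it by a², which is the change of origin and change of scale behaviour stated above.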
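Example 4: Compute mean and variance of the probability distributions obtained in examples 1, 2 and 3.
Solution:
(a) The probability distribution of X in example 1 was obtained as
X    : 0    1    2    3
p(X) : 1/8  3/8  3/8  1/8
$$E(X) = 0 \times \frac{1}{8} + 1 \times \frac{3}{8} + 2 \times \frac{3}{8} + 3 \times \frac{1}{8} = \frac{12}{8} = 1.5$$
$$E(X^2) = 0 \times \frac{1}{8} + 1 \times \frac{3}{8} + 4 \times \frac{3}{8} + 9 \times \frac{1}{8} = \frac{24}{8} = 3$$
Thus, $Var(X) = 3 - (1.5)^2 = 0.75$
(b) The probability distribution of X in example 2 was obtained as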
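X    : 2     3     4     5     6     7     8     9     10    11    12    Total
p(X) : 1/36  2/36  3/36  4/36  5/36  6/36  5/36  4/36  3/36  2/36  1/36  1
$$\therefore E(X) = 2 \times \frac{1}{36} + 3 \times \frac{2}{36} + 4 \times \frac{3}{36} + 5 \times \frac{4}{36} + 6 \times \frac{5}{36} + 7 \times \frac{6}{36} + 8 \times \frac{5}{36} + 9 \times \frac{4}{36} + 10 \times \frac{3}{36} + 11 \times \frac{2}{36} + 12 \times \frac{1}{36} = \frac{252}{36} = 7$$
$$\text{Further, } E(X^2) = 4 \times \frac{1}{36} + 9 \times \frac{2}{36} + 16 \times \frac{3}{36} + 25 \times \frac{4}{36} + 36 \times \frac{5}{36} + 49 \times \frac{6}{36} + 64 \times \frac{5}{36} + 81 \times \frac{4}{36} + 100 \times \frac{3}{36} + 121 \times \frac{2}{36} + 144 \times \frac{1}{36} = \frac{1974}{36} = 54.8$$
Thus, Var(X) = 54.8 - 49 = 5.8
(c) The probability distribution of X in example 3 was obtained as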
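X    : 1     2      3
p(X) : 4/20  12/20  4/20
$$E(X) = 1 \times \frac{4}{20} + 2 \times \frac{12}{20} + 3 \times \frac{4}{20} = 2$$
$$\text{and } E(X^2) = 1 \times \frac{4}{20} + 4 \times \frac{12}{20} + 9 \times \frac{4}{20} = 4.4$$
$\therefore$ Var(X) = 4.4 - 4 = 0.4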
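The three computations of example 4 follow one pattern, mean of squares minus square of the mean, and can be sketched with a single helper. The name `mean_and_variance` and the dictionary names are illustrative assumptions.

```python
from fractions import Fraction


def mean_and_variance(distribution):
    """Return (E(X), Var(X)) using Var(X) = E(X^2) - [E(X)]^2."""
    mean = sum(x * p for x, p in distribution.items())
    mean_of_squares = sum(x * x * p for x, p in distribution.items())
    return mean, mean_of_squares - mean ** 2


# (a) number of heads in three coin tosses
example_1 = {0: Fraction(1, 8), 1: Fraction(3, 8),
             2: Fraction(3, 8), 3: Fraction(1, 8)}
# (b) sum of points on two dice: p(s) = (6 - |s - 7|) / 36
example_2 = {s: Fraction(6 - abs(s - 7), 36) for s in range(2, 13)}
# (c) number of red marbles among three drawn
example_3 = {1: Fraction(4, 20), 2: Fraction(12, 20), 3: Fraction(4, 20)}

for label, dist in [("(a)", example_1), ("(b)", example_2), ("(c)", example_3)]:
    mean, var = mean_and_variance(dist)
    print(label, "mean =", float(mean), "variance =", round(float(var), 2))
```

For part (b) the exact variance is 35/6, which the worked solution rounds to 5.8.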
Expected Monetary Value (EMV)
When a random variable is expressed in monetary units, its expected value is often termed as expected monetary value and symbolised by EMV.
Example 5: If it rains, an umbrella salesman earns Rs 100 per day. If it is fair, he loses Rs 15 per day. What is his expectation if the probability of rain is 0.3?
Solution: Here the random variable X takes only two values, X1 = 100 with probability 0.3 and X2 = - 15 with probability 0.7. Thus, the expectation of the umbrella salesman $= 100 \times 0.3 - 15 \times 0.7 = 19.5$
The above result implies that his average earning in the long run would be Rs 19.5 per day.
Example 6: A person plays a game of throwing an unbiased die under the condition that he could get as many rupees as the number of points obtained on the die. Find the expectation and variance of his winning. How much should he pay to play in order that it is a fair game?
Solution: The probability distribution of the number of rupees won by the person is given below :
X (Rs) : 1    2    3    4    5    6
p(X)   : 1/6  1/6  1/6  1/6  1/6  1/6
$$\text{Thus, } E(X) = 1 \times \frac{1}{6} + 2 \times \frac{1}{6} + 3 \times \frac{1}{6} + 4 \times \frac{1}{6} + 5 \times \frac{1}{6} + 6 \times \frac{1}{6} = \frac{7}{2} = \text{Rs } 3.5$$
$$\text{and } E(X^2) = 1 \times \frac{1}{6} + 4 \times \frac{1}{6} + 9 \times \frac{1}{6} + 16 \times \frac{1}{6} + 25 \times \frac{1}{6} + 36 \times \frac{1}{6} = \frac{91}{6}$$
$$\therefore \sigma^2 = \frac{91}{6} - \left(\frac{7}{2}\right)^2 = \frac{35}{12} = 2.92$$
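Since E(X) is positive, the player would win Rs 3.5 per game in the long run. Such a game is said to be favourable to the player. In order that the game is fair, the expectation of the player should be zero. Thus, he should pay Rs 3.5 before the start of the game so that the possible values of the random variable become 1 - 3.5 = - 2.5, 2 - 3.5 = - 1.5, 3 - 3.5 = - 0.5, 4 - 3.5 = 0.5, etc. and their expected value is zero.
Example 7: Two persons A and B throw, alternately, a six faced die for a prize of Rs 55 which is to be won by the person who first throws 6. If A has the first throw, what are their respective expectations?
Solution: Let A be the event that A gets a 6 and B be the event that B gets a 6. Thus, $P(A) = \frac{1}{6}$ and $P(B) = \frac{1}{6}$.
If A starts the game, the probability of his winning is given by :
$$P(A \text{ wins}) = P(A) + P(\bar{A}) P(\bar{B}) P(A) + P(\bar{A}) P(\bar{B}) P(\bar{A}) P(\bar{B}) P(A) + ......$$
$$= \frac{1}{6} + \left(\frac{5}{6}\right)^2 \frac{1}{6} + \left(\frac{5}{6}\right)^4 \frac{1}{6} + ...... = \frac{1}{6} \times \frac{1}{1 - \frac{25}{36}} = \frac{6}{11}$$
Since the probability that A wins is $\frac{6}{11}$, the random variable takes a value 55 with probability $\frac{6}{11}$ and value 0 with probability $\frac{5}{11}$. Hence, $E(A) = 55 \times \frac{6}{11} + 0 \times \frac{5}{11}$ = Rs 30.
Similarly, B wins with probability $1 - \frac{6}{11} = \frac{5}{11}$, so that $E(B) = 55 \times \frac{5}{11} + 0 \times \frac{6}{11}$ = Rs 25.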
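The geometric series behind example 7 can be summed in closed form and the two expectations computed from it. The following is a minimal sketch; `first_player_win_probability` is a hypothetical helper name.

```python
from fractions import Fraction


def first_player_win_probability(p):
    """Probability that the player who throws first is the first to
    succeed, when each throw succeeds independently with probability p.

    Sums the series p + q^2 p + q^4 p + ... = p / (1 - q^2), q = 1 - p.
    """
    q = 1 - p
    return p / (1 - q * q)


p_six = Fraction(1, 6)
prize = 55

p_a_wins = first_player_win_probability(p_six)
p_b_wins = 1 - p_a_wins  # the game ends with probability 1

expectation_a = prize * p_a_wins
expectation_b = prize * p_b_wins

print("P(A wins) =", p_a_wins, " E(A) = Rs", expectation_a)
print("P(B wins) =", p_b_wins, " E(B) = Rs", expectation_b)
```

The two expectations add up to the prize of Rs 55, which is a quick consistency check on the result.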
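Example 8: An unbiased die is thrown until a four is obtained. Find the expected value and variance of the number of throws.
Solution: Let X denote the number of throws required to get a four. Thus, X will take values 1, 2, 3, 4, ...... with respective probabilities
$$\frac{1}{6}, \quad \frac{5}{6} \times \frac{1}{6}, \quad \left(\frac{5}{6}\right)^2 \times \frac{1}{6}, \quad \left(\frac{5}{6}\right)^3 \times \frac{1}{6}, \; ...... \text{ etc.}$$
$$\therefore E(X) = 1 \cdot \frac{1}{6} + 2 \cdot \frac{5}{6} \cdot \frac{1}{6} + 3 \cdot \left(\frac{5}{6}\right)^2 \frac{1}{6} + 4 \cdot \left(\frac{5}{6}\right)^3 \frac{1}{6} + ......$$
$$= \frac{1}{6} \left[ 1 + 2 \cdot \frac{5}{6} + 3 \cdot \left(\frac{5}{6}\right)^2 + 4 \cdot \left(\frac{5}{6}\right)^3 + ...... \right]$$
Let
$$S = 1 + 2 \cdot \frac{5}{6} + 3 \cdot \left(\frac{5}{6}\right)^2 + 4 \cdot \left(\frac{5}{6}\right)^3 + ......$$
Multiplying both sides by $\frac{5}{6}$, we get
$$\frac{5}{6} S = \frac{5}{6} + 2 \cdot \left(\frac{5}{6}\right)^2 + 3 \cdot \left(\frac{5}{6}\right)^3 + 4 \cdot \left(\frac{5}{6}\right)^4 + ......$$
$$\therefore S - \frac{5}{6} S = 1 + (2 - 1) \frac{5}{6} + (3 - 2) \left(\frac{5}{6}\right)^2 + (4 - 3) \left(\frac{5}{6}\right)^3 + ......$$
$$\frac{1}{6} S = 1 + \frac{5}{6} + \left(\frac{5}{6}\right)^2 + \left(\frac{5}{6}\right)^3 + ...... = \frac{1}{1 - \frac{5}{6}} = 6 \qquad .... (1)$$
Thus, S = 36 and hence $E(X) = \frac{1}{6} \times 36 = 6$.
$$\text{Further, } E(X^2) = \frac{1}{6} \left[ 1 + 2^2 \cdot \frac{5}{6} + 3^2 \cdot \left(\frac{5}{6}\right)^2 + 4^2 \cdot \left(\frac{5}{6}\right)^3 + ...... \right]$$
Let
$$S = 1 + 2^2 \cdot \frac{5}{6} + 3^2 \cdot \left(\frac{5}{6}\right)^2 + 4^2 \cdot \left(\frac{5}{6}\right)^3 + ......$$
Multiplying both sides by $\frac{5}{6}$ and subtracting from S, we get
$$\frac{1}{6} S = 1 + (2^2 - 1) \frac{5}{6} + (3^2 - 2^2) \left(\frac{5}{6}\right)^2 + (4^2 - 3^2) \left(\frac{5}{6}\right)^3 + ......$$
$$= 1 + 3 \cdot \frac{5}{6} + 5 \cdot \left(\frac{5}{6}\right)^2 + 7 \cdot \left(\frac{5}{6}\right)^3 + ......$$
Further, multiply both sides by $\frac{5}{6}$ and subtract
$$\frac{1}{6} S - \frac{5}{36} S = 1 + (3 - 1) \frac{5}{6} + (5 - 3) \left(\frac{5}{6}\right)^2 + (7 - 5) \left(\frac{5}{6}\right)^3 + ......$$
$$\frac{1}{36} S = 1 + 2 \cdot \frac{5}{6} \left[ 1 + \frac{5}{6} + \left(\frac{5}{6}\right)^2 + ...... \right] = 1 + \frac{5}{3} \times 6 = 11 \qquad .... (2)$$
$$\therefore E(X^2) = \frac{1}{6} \times 36 \times 11 = 66$$
Hence, Variance $= E(X^2) - [E(X)]^2 = 66 - 36 = 30$
Generalisation: Let p be the probability of getting 4, then from equation (1) we can write
$$pS = \frac{1}{1 - q} = \frac{1}{p} \quad \text{or} \quad S = \frac{1}{p^2}. \quad \text{Therefore, } E(X) = p \cdot \frac{1}{p^2} = \frac{1}{p}$$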
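The infinite series of example 8 can be checked numerically by summing a large but finite number of terms. The sketch below is an approximation, not an exact proof; the function name and the cut-off of 2,000 terms are assumptions chosen so that the neglected tail is negligible for p = 1/6. The comparison value q/p² is the standard variance of the number of trials up to the first success, stated here as a known result rather than one derived in the text.

```python
def geometric_mean_variance(p, terms=2000):
    """Approximate E(X) and Var(X) for the number of throws X needed to
    get the first success, by truncating the series after `terms` terms."""
    q = 1.0 - p
    mean = 0.0
    mean_of_squares = 0.0
    for k in range(1, terms + 1):
        prob = q ** (k - 1) * p        # P(X = k)
        mean += k * prob
        mean_of_squares += k * k * prob
    return mean, mean_of_squares - mean ** 2


p = 1 / 6
mean, variance = geometric_mean_variance(p)
print("E(X)   ~", round(mean, 6), " compare 1/p   =", 1 / p)
print("Var(X) ~", round(variance, 6), " compare q/p^2 =", (1 - p) / p ** 2)
```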
$$P(X = X_i / Y = Y_1) = \frac{\text{Joint probability of } X_i \text{ and } Y_1}{\text{Marginal probability of } Y_1} = \frac{p_{i1}}{P'_1} \quad (\text{for } i = 1, 2, ...... m).$$
This gives us a conditional probability distribution of X given that Y = Y1. This distribution can be written in a tabular form as shown below :
X           : X1       X2       ...  Xm       Total Probability
Probability : p11/P'1  p21/P'1  ...  pm1/P'1  1
The conditional distribution of X given some other value of Y can be constructed in a similar way. Further, we can construct the conditional distributions of Y for various given values of X.
Remarks: It can be shown that if the conditional distribution of a random variable is same as its marginal distribution, the two random variables are independent. Thus, if for the conditional distribution of X given Y1 we have $\frac{p_{i1}}{P'_1} = P_i$ for all i, then X and Y are independent. It should be noted here that if one conditional distribution satisfies the condition of independence of the random variables, then all the conditional distributions would also satisfy this condition.
Example 9: Let two unbiased dice be tossed. Let a random variable X take the value 1 if first die shows 1 or 2, value 2 if first die shows 3 or 4 and value 3 if first die shows 5 or 6. Further, let Y be a random variable which denotes the number obtained on the second die. Construct a joint probability distribution of X and Y. Also determine their marginal probability distributions and find E(X) and E(Y) respectively. Determine the conditional distribution of X given Y = 5 and of Y given X = 2. Find the expected values of these conditional distributions. Determine whether X and Y are independent.
Solution: For the given random experiment, the random variable X takes values 1, 2 and 3 and the random variable Y takes values 1, 2, 3, 4, 5 and 6. Their joint probability distribution is shown in the following table :
Y \ X                   1       2       3       Marginal Dist. of Y
1                       1/18    1/18    1/18    1/6
2                       1/18    1/18    1/18    1/6
3                       1/18    1/18    1/18    1/6
4                       1/18    1/18    1/18    1/6
5                       1/18    1/18    1/18    1/6
6                       1/18    1/18    1/18    1/6
Marginal Dist. of X     1/3     1/3     1/3     1
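From the above table, we can write the marginal distribution of X as given below :

X    :  1      2      3      Total
Pi   :  1/3    1/3    1/3    1

$$\therefore E(X) = 1 \cdot \frac{1}{3} + 2 \cdot \frac{1}{3} + 3 \cdot \frac{1}{3} = 2 \quad \text{and} \quad E(Y) = 1 \cdot \frac{1}{6} + 2 \cdot \frac{1}{6} + 3 \cdot \frac{1}{6} + 4 \cdot \frac{1}{6} + 5 \cdot \frac{1}{6} + 6 \cdot \frac{1}{6} = \frac{21}{6} = 3.5$$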
Y              :  1      2      3      4      5      6      Total
Pj* / X = 2    :  1/6    1/6    1/6    1/6    1/6    1/6    1

$$\therefore E(Y / X = 2) = \frac{1}{6}(1 + 2 + 3 + 4 + 5 + 6) = 3.5$$
Since the conditional distribution of X is same as its marginal distribution (or equivalently the conditional distribution of Y is same as its marginal distribution), X and Y are independent random variables. Example 10: Two unbiased coins are tossed. Let X be a random variable which denotes the total number of heads obtained on a toss and Y be a random variable which takes a value 1 if head occurs on first coin and takes a value 0 if tail occurs on it. Construct the joint probability distribution of X and Y. Find the conditional distribution of X when Y = 0. Are X and Y independent random variables? Solution: There are 4 elements in the sample space of the random experiment. The possible values that X can take are 0, 1 and 2 and the possible values of Y are 0 and 1. The joint probability distribution of X and Y can be written in a tabular form as follows :
Y \ X     0      1      2      Total
0         1/4    1/4    0      2/4
1         0      1/4    1/4    2/4
Total     1/4    2/4    1/4    1
The conditional distribution of X when Y = 0 is

X               :  0      1      2      Total
P(X / Y = 0)    :  1/2    1/2    0      1

and the marginal distribution of X is

X     :  0      1      2      Total
Pi    :  1/4    1/2    1/4    1
Since the conditional and the marginal distributions are different, X and Y are not independent random variables.
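The check made in Example 10 can also be carried out mechanically. The following is a minimal Python sketch, not part of the original lesson; the function names `marginal`, `conditional_x_given_y` and `independent` are illustrative choices. It rebuilds the joint distribution from the four equally likely outcomes and tests whether every joint probability equals the product of the two marginal probabilities.

```python
from fractions import Fraction
from itertools import product

# Joint distribution of Example 10, keyed by (x, y).
# X = number of heads in two tosses, Y = 1 if the first coin shows a head.
joint = {}
for first, second in product("HT", repeat=2):
    x = (first == "H") + (second == "H")
    y = 1 if first == "H" else 0
    joint[(x, y)] = joint.get((x, y), Fraction(0)) + Fraction(1, 4)

def marginal(joint, axis):
    """Sum the joint probabilities over the other variable (axis 0 = X, 1 = Y)."""
    out = {}
    for key, p in joint.items():
        out[key[axis]] = out.get(key[axis], Fraction(0)) + p
    return out

def conditional_x_given_y(joint, y):
    """P(X = x / Y = y) = joint probability / marginal probability of y."""
    p_y = marginal(joint, 1)[y]
    return {x: joint.get((x, y), Fraction(0)) / p_y
            for x in sorted(marginal(joint, 0))}

def independent(joint):
    """X and Y are independent only if p_ij = P_i * P_j* in every cell."""
    px, py = marginal(joint, 0), marginal(joint, 1)
    return all(joint.get((x, y), Fraction(0)) == px[x] * py[y]
               for x in px for y in py)

marg_x = marginal(joint, 0)
cond_x = conditional_x_given_y(joint, 0)
print(cond_x)              # conditional distribution of X given Y = 0
print(independent(joint))  # False for this example
```

Exact fractions are used so that the comparison in `independent` is not disturbed by floating-point rounding.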
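$$E(X + Y) = \sum_{i=1}^{m} X_i \sum_{j=1}^{n} p_{ij} + \sum_{j=1}^{n} Y_j \sum_{i=1}^{m} p_{ij}$$

$$= \sum_{i=1}^{m} X_i P_i + \sum_{j=1}^{n} Y_j P_j^{*} \qquad \left(\text{Here } \sum_{j=1}^{n} p_{ij} = P_i \text{ and } \sum_{i=1}^{m} p_{ij} = P_j^{*}\right)$$

$$= E(X) + E(Y)$$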
The above result can be generalised. If there are k random variables X1, X2, ...... Xk, then E(X1 + X2 + ...... + Xk) = E(X1) + E(X2) + ...... + E(Xk).

Remarks: The above result holds irrespective of whether X1, X2, ...... Xk are independent or not.

Theorem 2: If X and Y are two independent random variables, then E(X.Y) = E(X).E(Y)

Proof: Let the random variable X take values X1, X2, ...... Xm and the random variable Y take values Y1, Y2, ...... Yn such that P(X = Xi and Y = Yj) = pij (i = 1 to m, j = 1 to n).

By definition $E(XY) = \sum_{i=1}^{m} \sum_{j=1}^{n} X_i Y_j \, p_{ij}$

Since X and Y are independent, we have $p_{ij} = P_i \cdot P_j^{*}$

$$\therefore E(XY) = \sum_{i=1}^{m} \sum_{j=1}^{n} X_i Y_j \, P_i \, P_j^{*} = \sum_{i=1}^{m} X_i P_i \times \sum_{j=1}^{n} Y_j P_j^{*} = E(X) \cdot E(Y)$$

The above result can be generalised. If there are k independent random variables X1, X2, ...... Xk, then E(X1. X2. ...... Xk) = E(X1).E(X2). ...... E(Xk)
$$E[\varphi(X, Y)] = \sum_{i=1}^{m} \sum_{j=1}^{n} \varphi(X_i, Y_j) \, p_{ij}$$
The above expression, which is the mean of the product of deviations of values from their respective means, is known as the Covariance of X and Y, denoted as Cov(X, Y) or $\sigma_{XY}$. Thus, we can write

$$\text{Cov}(X, Y) = E[(X - \mu_X)(Y - \mu_Y)]$$
An alternative expression of Cov(X, Y)

$$\text{Cov}(X, Y) = E[\{X - E(X)\}\{Y - E(Y)\}] = E[X\{Y - E(Y)\}] - E(X) \cdot E[\{Y - E(Y)\}] = E(XY) - E(X) \cdot E(Y)$$

Note that E[{Y - E(Y)}] = 0, the sum of deviations of values from their arithmetic mean.

Remarks: If X and Y are independent random variables, the right hand side of the above equation will be zero. Thus, covariance between independent variables is always equal to zero.

II. Mean and Variance of a Linear Combination

Let $Z = \varphi(X, Y) = aX + bY$ be a linear combination of the two random variables X and Y, then using the theorem of addition of expectation, we can write
$$\mu_Z = E(Z) = E(aX + bY) = aE(X) + bE(Y) = a\mu_X + b\mu_Y$$

Further, the variance of Z is given by

$$\sigma_Z^2 = E[Z - E(Z)]^2 = E[aX + bY - a\mu_X - b\mu_Y]^2 = E[a(X - \mu_X) + b(Y - \mu_Y)]^2$$

$$= a^2 E(X - \mu_X)^2 + b^2 E(Y - \mu_Y)^2 + 2ab \, E(X - \mu_X)(Y - \mu_Y)$$

$$= a^2 \sigma_X^2 + b^2 \sigma_Y^2 + 2ab \, \sigma_{XY}$$
Remarks:

1. The above results indicate that any function of random variables is also a random variable.

2. If X and Y are independent, then $\sigma_{XY} = 0$, $\therefore \sigma_Z^2 = a^2 \sigma_X^2 + b^2 \sigma_Y^2$

3. If Z = aX - bY, then we can write $\sigma_Z^2 = a^2 \sigma_X^2 + b^2 \sigma_Y^2 - 2ab \, \sigma_{XY}$. However, $\sigma_Z^2 = a^2 \sigma_X^2 + b^2 \sigma_Y^2$, if X and Y are independent.

4. The above results can be generalised. If X1, X2, ...... Xk are k independent random variables with means $\mu_1, \mu_2, \ldots, \mu_k$ and variances $\sigma_1^2, \sigma_2^2, \ldots, \sigma_k^2$ respectively, then

$$E(X_1 \pm X_2 \pm \ldots \pm X_k) = \mu_1 \pm \mu_2 \pm \ldots \pm \mu_k$$

and

$$\text{Var}(X_1 \pm X_2 \pm \ldots \pm X_k) = \sigma_1^2 + \sigma_2^2 + \ldots + \sigma_k^2$$

Notes:

1. The general result on expectation of the sum or difference will hold even if the random variables are not independent.

2. The above result can also be proved for continuous random variables.
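The variance formula for a linear combination can be checked numerically. The sketch below is an illustration only: it reuses the joint distribution of Example 10 (where X and Y are not independent), and the coefficients a = 2 and b = 3 are arbitrary assumptions. It compares the variance of Z = aX + bY computed directly from the definition with the value given by $a^2 \sigma_X^2 + b^2 \sigma_Y^2 + 2ab \, \sigma_{XY}$.

```python
from fractions import Fraction as F

# Joint distribution of Example 10, keyed by (x, y).
joint = {(0, 0): F(1, 4), (1, 0): F(1, 4), (1, 1): F(1, 4), (2, 1): F(1, 4)}

def expect(g):
    """E[g(X, Y)] = sum of g(x, y) * p(x, y) over the joint distribution."""
    return sum(g(x, y) * p for (x, y), p in joint.items())

mu_x = expect(lambda x, y: x)
mu_y = expect(lambda x, y: y)
var_x = expect(lambda x, y: (x - mu_x) ** 2)
var_y = expect(lambda x, y: (y - mu_y) ** 2)
cov_xy = expect(lambda x, y: (x - mu_x) * (y - mu_y))

a, b = 2, 3  # arbitrary illustrative coefficients
mu_z = expect(lambda x, y: a * x + b * y)
var_z_direct = expect(lambda x, y: (a * x + b * y - mu_z) ** 2)
var_z_formula = a**2 * var_x + b**2 * var_y + 2 * a * b * cov_xy

print(cov_xy)                       # covariance from the definition
print(var_z_direct, var_z_formula)  # the two values agree
```

Because the covariance here is not zero, dropping the $2ab \, \sigma_{XY}$ term would give a wrong variance, which is why that term vanishes only for independent variables.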
X       :  -2     -1     0      1      2
p(X)    :  1/6    p      1/4    p      1/6
Solution: Since the total probability under a probability distribution is equal to unity, the value of p should be such that

$$\frac{1}{6} + p + \frac{1}{4} + p + \frac{1}{6} = 1 \quad \therefore p = \frac{5}{24}$$

Further,

$$E(X) = -2 \cdot \frac{1}{6} - 1 \cdot \frac{5}{24} + 0 \cdot \frac{1}{4} + 1 \cdot \frac{5}{24} + 2 \cdot \frac{1}{6} = 0,$$

$$E(X^2) = 4 \cdot \frac{1}{6} + 1 \cdot \frac{5}{24} + 0 \cdot \frac{1}{4} + 1 \cdot \frac{5}{24} + 4 \cdot \frac{1}{6} = \frac{7}{4},$$

$$E(X + 2) = E(X) + 2 = 0 + 2 = 2$$

and

$$E(2X^2 + 3X + 5) = 2E(X^2) + 3E(X) + 5 = 2 \cdot \frac{7}{4} + 0 + 5 = 8.5$$
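The arithmetic of the above solution can be reproduced with exact fractions. This is a small Python sketch under the assumption that the two unknown probabilities are equal (both are written as p in the table); the helper name `expect` is illustrative.

```python
from fractions import Fraction as F

values = [-2, -1, 0, 1, 2]
known = {-2: F(1, 6), 0: F(1, 4), 2: F(1, 6)}  # probabilities given in the table

# The two unknown probabilities are both p and the total must equal 1.
p = (1 - sum(known.values())) / 2
dist = {x: known.get(x, p) for x in values}

def expect(g):
    """E[g(X)] = sum of g(x) * p(x) over the distribution."""
    return sum(g(x) * prob for x, prob in dist.items())

e_x = expect(lambda x: x)
e_x2 = expect(lambda x: x * x)
e_poly = expect(lambda x: 2 * x * x + 3 * x + 5)

print(p, e_x, e_x2, e_poly)
```

Computing `e_poly` directly from the distribution gives the same value as 2E(X²) + 3E(X) + 5, which illustrates the linearity of expectation used in the solution.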
Example 12: A dealer of ceiling fans has estimated the following probability distribution of the price of a ceiling fan in the next summer season :

Price (P)          :  800     825     850     875     900
Probability (p)    :  0.15    0.25    0.30    0.20    0.10

If the demand (x) of his ceiling fans follows a linear relation x = 6000 - 4P, find expected demand of fans and expected total revenue of the dealer.

Solution: Since P is a random variable, x = 6000 - 4P is also a random variable. Further, Total Revenue TR = P.x = 6000P - 4P² is also a random variable.

From the given probability distribution, we have

E(P) = 800 × 0.15 + 825 × 0.25 + 850 × 0.30 + 875 × 0.20 + 900 × 0.10 = Rs 846.25

and E(P²) = (800)² × 0.15 + (825)² × 0.25 + (850)² × 0.30 + (875)² × 0.20 + (900)² × 0.10 = 717031.25

Thus, E(x) = 6000 - 4E(P) = 6000 - 4 × 846.25 = 2615 fans.

And E(TR) = 6000E(P) - 4E(P²) = 6000 × 846.25 - 4 × 717031.25 = Rs 22,09,375.00

Example 13: A person applies for equity shares of Rs 10 each to be issued at a premium of Rs 6 per share; Rs 8 per share being payable along with the application and the balance at the time of allotment. The issuing company may issue 50 or 100 shares to those who apply for 200 shares, the probability of issuing 50 shares being 0.4 and that of issuing 100 shares being 0.6. In either case, the probability of an application being selected for allotment of any shares is 0.2. The allotment usually takes three months and the market price per share is expected to be Rs 25 at the time of allotment. Find the expected rate of return of the person per month.

Solution: Let A be the event that the application of the person is considered for allotment, B1 be the event that he is allotted 50 shares and B2 be the event that he is allotted 100 shares. Further, let R1 denote the rate of return (per month) when 50 shares are allotted, R2 be the rate of return when 100 shares are allotted and R = R1 + R2 be the combined rate of return.
We are given that P(A) = 0.2, P(B1/A) = 0.4 and P(B2/A) = 0.6.

(a) When 50 shares are allotted

The return on investment in 3 months = (25 - 16) × 50 = 450

∴ Monthly rate of return = 450/3 = 150

The probability that he is allotted 50 shares = P(A ∩ B1) = P(A).P(B1/A) = 0.2 × 0.4 = 0.08

Thus, the random variable R1 takes a value 150 with probability 0.08 and it takes a value 0 with probability 1 - 0.08 = 0.92

∴ E(R1) = 150 × 0.08 + 0 = 12.00

(b) When 100 shares are allotted

The return on investment in 3 months = (25 - 16) × 100 = 900

∴ Monthly rate of return = 900/3 = 300

The probability that he is allotted 100 shares = P(A ∩ B2) = P(A).P(B2/A) = 0.2 × 0.6 = 0.12

Thus, the random variable R2 takes a value 300 with probability 0.12 and it takes a value 0 with probability 1 - 0.12 = 0.88

∴ E(R2) = 300 × 0.12 + 0 = 36

Hence, E(R) = E(R1 + R2) = E(R1) + E(R2) = 12 + 36 = 48

Example 14: What is the mathematical expectation of the sum of points on n unbiased dice?

Solution: Let Xi denote the number obtained on the i th die. Therefore, the sum of points on n dice is S = X1 + X2 + ...... + Xn and E(S) = E(X1) + E(X2) + ...... + E(Xn). Further, the number on the i th die, i.e., Xi follows the following distribution :
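Xi       :  1      2      3      4      5      6
p(Xi)    :  1/6    1/6    1/6    1/6    1/6    1/6

$$\therefore E(X_i) = \frac{1}{6}(1 + 2 + 3 + 4 + 5 + 6) = \frac{7}{2} \quad (i = 1, 2, \ldots, n)$$

Thus, $E(S) = n \times \frac{7}{2} = \frac{7n}{2}$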
Since E(X) is less than Rs 600, the cost of testing the machine, hence, it is more profitable to install the machine without testing.
Hint: Random variable takes 3 values 14, 21 and 28. 2. ABC company estimates the net profit on a new product, that it is launching, to be Rs 30,00,000 if it is successful, Rs 10,00,000 if it is moderately successful and a loss of Rs 10,00,000 if it is unsuccessful. The firm assigns the following probabilities to the different possibilities : Successful 0.15, moderately successful 0.25 and unsuccessful 0.60. Find the expected value and variance of the net profits.
Hint: See example 5.

3. There are 4 different choices available to a customer who wants to buy a transistor set. The first type costs Rs 800, the second type Rs 680, the third type Rs 880 and the fourth type Rs 760. The probabilities that the customer will buy these types are 1/3, 1/6, 1/4 and 1/4 respectively. The retailer of these sets gets a commission @ 20%, 12%, 25% and 15% on the respective sets. What is the expected commission of the retailer?

Hint: Take commission as random variable.

4. Three cards are drawn at random successively, with replacement, from a well shuffled pack of cards. Getting a card of diamond is termed as a success. Tabulate the probability distribution of the number of successes (X). Find the mean and variance of X.

Hint: The random variable takes values 0, 1, 2 and 3.

5. A discrete random variable can take all possible integral values from 1 to k each with probability 1/k. Find the mean and variance of the distribution.

Hint: $E(X^2) = \frac{1}{k}\left(1^2 + 2^2 + \ldots + k^2\right) = \frac{1}{k} \cdot \frac{k(k + 1)(2k + 1)}{6}$.

6. An insurance company charges, from a man aged 50, an annual premium of Rs 15 on a policy of Rs 1,000. If the death rate is 6 per thousand per year for this age group, what is the expected gain for the insurance company?
Hint: Random variable takes values 15 and - 985. 7. On buying a ticket, a player is allowed to toss three fair coins. He is paid number of rupees equal to the number of heads appearing. What is the maximum amount the player should be willing to pay for the ticket.
Hint: The maximum amount is equal to expected value. 8. The following is the probability distribution of the monthly demand of calculators :
Demand (x)          :  15      16      17      18      19      20
Probability p(x)    :  0.10    0.15    0.35    0.25    0.08    0.07

Calculate the expected demand for calculators. If the cost c of producing x calculators is given by the relation c = 4x² - 15x + 200, find expected cost.

Hint: See example 12.

9. Firm A wishes to bid for the supply of 800 chairs to an educational institution at the rate of Rs 500 per chair. The firm, which has two competitors B and C, has estimated that the probability that firm B will bid less than Rs 500 per chair is 0.4 and that the firm C will bid less than Rs 500 per chair is 0.6. If the lowest bidder gets business and the firms bid independently, what is the expected value of the contract to firm A?

Hint: Firm A gets the contract only if neither B nor C bids less than Rs 500. The random variable therefore takes value 500 × 800 with probability (1 - 0.4) × (1 - 0.6) = 0.24 and value 0 with probability 1 - 0.24 = 0.76.

10. A game is played by throwing a six faced die for which the incomplete probability distribution of the number obtained is given below :

X       :  1       2       3    4    5       6
p(X)    :  0.09    0.30    m    n    0.28    0.09

The conditions of the game are : If the die shows an even number, the player gets rupees equal to the number obtained; if the die shows 3 or 5, he loses rupees equal to the number obtained, while if 1 is obtained the player neither gains nor loses. Complete the probability distribution if the game is given to be fair.

Hint: E(X) = 0 for a fair game.

11. There are three bags which contain 4 red and 3 black, 6 red and 4 black and 8 red and 2 black balls respectively. One ball is drawn from each urn. What is the expected number of red balls obtained?
Hint: Find the expected number of red balls from each urn and add. 12. A survey conducted over last 25 years indicated that in 10 years the winter was mild, in 8 years it was cold and in the remaining 7 years it was very cold. A company sells 1,000 woollen coats in mild cold year, 1,300 in a cold year and 2,000 in a very cold year. You are required to find the yearly expected profit of the company if a woollen coat costs Rs 173 and is sold to stores for Rs 248. Hint: The random variable can take 3 possible values. 13. You have been offered the chance to play a dice game in which you will receive Rs 20 each time the point total of a toss of two dice is 6. If it costs you Rs 2.50 per toss to participate, should you play or not? Will it make any difference in your decision if it costs Rs 3.00 per toss instead of Rs 2.50? Hint: Compare the cost of participation with the expected value of the receipt. 14. The probability that a house of a certain type will be on fire in a year is 0.005. An insurance company offers to sell the owner of such a house Rs 1,00,000 one year term insurance policy for a premium of Rs 600. What is the expected gain of the company? Hint: See exercise 6. 15. Three persons A, B and C in that order draw a ball, without replacement, from a bag containing 2 red and 3 white balls till someone is able to draw a red ball. One who draws a red ball wins Rs 400. Determine their expectations. Hint: A wins if he gets a red ball on the first draw or all the three get white ball in their respective first draws, etc. 16. A coin is tossed until a head appears. What is the expected number and standard deviation of tosses? Hint: The random variable takes values 1, 2, 3, .... with respective probabilities p, (1 - p)p, (1 - p)2p, etc., where p is the probability of getting a head.
17. A box contains 8 tickets. 3 of the tickets carry a prize of Rs 5 each and the remaining 5 a prize of Rs 2 each. (i) If one ticket is drawn at random, what is the expected value of the prize? (ii) If two tickets are drawn at random, what is the expected value of the prize?

Hint: (i) The random variable can take values 5 or 2, (ii) It can take values 4, 7 or 10.

18. 4 unbiased coins are tossed 256 times. Find the frequency distribution of heads and tabulate the result. Calculate the mean and standard deviation of the number of heads.

Hint: The random variable takes values 0, 1, 2, 3 and 4.

19. Throwing two unbiased coins simultaneously, Mr X bets with Mrs X that he will receive Rs 4 from her if he gets 2 heads and he will give Rs 4 to her otherwise. Find Mr X's expectation.

Hint: The random variable takes values 4 and -4.

20. A man runs an ice cream parlor in a holiday resort. If the summer is mild, he can sell 2,500 cups of ice cream; if it is hot, he can sell 4,000 cups; if it is very hot, he can sell 5,000 cups. It is known that for any year the probability of the summer to be mild is 1/7 and to be hot is 4/7. A cup of ice cream costs Rs 2 and is sold for Rs 3.50. What is his expected profit?

Hint: See example 5.

21. Comment on the validity of the following statement : For a random variable X, $E(X^2) \geq E(X)$.

Hint: $\sigma^2 = E(X^2) - [E(X)]^2$.

Check Your Progress 12.1

1. What is a stochastic variable?

2. How is bi-variate probability different from multi variable probability distribution?

Notes: (a) Write your answer in the space given below. (b) Please go through the lesson sub-head thoroughly; you will get your answers in it. (c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Events →      S1      S2      ...    Sj      ...    Sn
Actions ↓
A1            X11     X12     ...    X1j     ...    X1n
A2            X21     X22     ...    X2j     ...    X2n
:             :       :              :              :
Ai            Xi1     Xi2     ...    Xij     ...    Xin
:             :       :              :              :
Am            Xm1     Xm2     ...    Xmj     ...    Xmn

Given the payoff matrix for a decision problem, the process of decision-making depends upon the situation under which the decision is being made. These situations can be classified into three broad categories : (a) Decision-making under certainty, (b) Decision-making under uncertainty and (c) Decision-making under risk.
Example 17: Let there be a situation in which a decision-maker has three possible alternatives A1, A2 and A3, where the outcome of each of them can be affected by the occurrence of any one of the four possible events S1, S2, S3 and S4. The monetary payoffs of each combination of Ai and Sj are given in the following table :
Payoff Matrix

Events →      S1     S2     S3     S4     Min. Payoff    Max. Payoff
Actions ↓
A1            27     12     14     26     12             27
A2            45     17     35     20     17             45
A3            52     36     29     15     15             52

Solution: Since 17 is maximum out of the minimum payoffs, the optimal action is A2.

2. Maximax Criterion: This criterion, also known as the criterion of optimism, is used when the decision-maker is optimistic about the future. Maximax implies the maximisation of maximum payoff. The optimistic decision-maker locates the maximum payoff for each possible course of action. The maximum of these payoffs is identified and the corresponding course of action is selected. The optimal course of action in the above example, based on this criterion, is A3.

3. Regret Criterion: This criterion focuses upon the regret that the decision-maker might have from selecting a particular course of action. Regret is defined as the difference between the best payoff we could have realised, had we known which state of nature was going to occur, and the realised payoff. This difference, which measures the magnitude of the loss incurred by not selecting the best alternative, is also known as opportunity loss or the opportunity cost.
From the payoff matrix (given in 12.6), the payoffs corresponding to the actions A1, A2, ...... Am under the state of nature Sj are X1j, X2j, ...... Xmj respectively. Of these, assume that X2j is maximum. Then the regret in selecting Ai, to be denoted by Rij, is given by X2j - Xij, i = 1 to m. We note that the regret in selecting A2 is zero. The regrets for various actions under different states of nature can also be computed in a similar way. The regret criterion is based upon the minimax principle, i.e., the decision-maker tries to minimise the maximum regret. Thus, the decision-maker selects the maximum regret for each of the actions and out of these the action which corresponds to the minimum regret is regarded as optimal. The regret matrix of example 17 can be written as given below:

Regret Matrix

Events →      S1     S2     S3     S4     Max. Regret
Actions ↓
A1            25     24     21     0      25
A2            7      19     0      6      19
A3            0      0      6      11     11
From the maximum regret column, we find that the regret corresponding to the course of action A3 is minimum. Hence, A3 is optimal.

4. Hurwicz Criterion: The maximax and the maximin criteria, discussed above, assume that the decision-maker is either optimistic or pessimistic. A more realistic approach would, however, be to take into account the degree or index of optimism or pessimism of the decision-maker in the process of decision-making. If α, a constant lying between 0 and 1, denotes the degree of optimism, then the degree of pessimism will be 1 - α. Then a weighted average of the maximum and minimum payoffs of an action, with α and 1 - α as respective weights, is computed. The action with the highest average is regarded as optimal. We note that a value of α nearer to unity indicates that the decision-maker is optimistic while a value nearer to zero indicates that he is pessimistic. If α = 0.5, the decision-maker is said to be neutralist. We apply this criterion to the payoff matrix of example 17. Assume that the index of optimism α = 0.7.

Action    Max. Payoff    Min. Payoff    Weighted Average
A1        27             12             27 × 0.7 + 12 × 0.3 = 22.5
A2        45             17             45 × 0.7 + 17 × 0.3 = 36.6
A3        52             15             52 × 0.7 + 15 × 0.3 = 40.9

Since the average for A3 is maximum, it is optimal.

5. Laplace Criterion: In the absence of any knowledge about the probabilities of occurrence of various states of nature, one possible way out is to assume that all of them are equally likely to occur. Thus, if there are n states of nature, each can be assigned a probability of occurrence = 1/n. Using these probabilities, we compute the expected payoff for each course of action and the action with maximum expected value is regarded as optimal.
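The five criteria for decision-making under uncertainty can be compared side by side on the payoff matrix of example 17. The following Python sketch is only an illustration; the function names are assumptions made here and each function simply returns the label of the action that the corresponding criterion selects.

```python
# Payoff matrix of example 17: one row per action, one column per event S1..S4.
payoffs = {
    "A1": [27, 12, 14, 26],
    "A2": [45, 17, 35, 20],
    "A3": [52, 36, 29, 15],
}

def best(scores, pick=max):
    """Return the action with the largest score (smallest if pick=min)."""
    return pick(scores, key=scores.get)

def maximin(payoffs):
    return best({a: min(row) for a, row in payoffs.items()})

def maximax(payoffs):
    return best({a: max(row) for a, row in payoffs.items()})

def regret_matrix(payoffs):
    """Regret = best payoff under each event minus the payoff actually obtained."""
    col_best = [max(col) for col in zip(*payoffs.values())]
    return {a: [m - x for m, x in zip(col_best, row)]
            for a, row in payoffs.items()}

def minimax_regret(payoffs):
    regrets = regret_matrix(payoffs)
    return best({a: max(row) for a, row in regrets.items()}, pick=min)

def hurwicz(payoffs, alpha):
    """Weighted average of max and min payoff, alpha = degree of optimism."""
    return best({a: alpha * max(r) + (1 - alpha) * min(r)
                 for a, r in payoffs.items()})

def laplace(payoffs):
    """All events equally likely, so the expected payoff is the row mean."""
    return best({a: sum(r) / len(r) for a, r in payoffs.items()})

print(maximin(payoffs), maximax(payoffs), minimax_regret(payoffs),
      hurwicz(payoffs, 0.7), laplace(payoffs))
```

On this matrix only the maximin criterion selects A2; the other four criteria select A3, which shows how the choice depends on the attitude assumed for the decision-maker.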
          S1      S2      S3
A1        -20     200     400
A2        -50     -100    600
A3        200     -50     300

The probabilities of the states of nature are 0.3, 0.4 and 0.3 respectively. Determine the optimal act using the Bayesian Criterion.

Solution:

Computation of Expected Monetary Value

          S1      S2      S3      EMV
P(S)      0.3     0.4     0.3
A1        -20     200     400     -20 × 0.3 + 200 × 0.4 + 400 × 0.3 = 194
A2        -50     -100    600     -50 × 0.3 - 100 × 0.4 + 600 × 0.3 = 125
A3        200     -50     300     200 × 0.3 - 50 × 0.4 + 300 × 0.3 = 130

From the above table, we find that the act A1 is optimal.

The problem can alternatively be attempted by finding minimum EOL, as shown below:

Computation of Expected Opportunity Loss

          S1      S2      S3      EOL
P(S)      0.3     0.4     0.3
A1        220     0       200     220 × 0.3 + 0 × 0.4 + 200 × 0.3 = 126
A2        250     300     0       250 × 0.3 + 300 × 0.4 + 0 × 0.3 = 195
A3        0       250     300     0 × 0.3 + 250 × 0.4 + 300 × 0.3 = 190
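The two tables above can be generated from the payoff matrix and the probabilities alone. The sketch below is an illustrative Python version; the variable names are assumptions, and exact fractions are used for the probabilities so that the totals come out without rounding error.

```python
from fractions import Fraction as F

probs = [F("0.3"), F("0.4"), F("0.3")]          # P(S1), P(S2), P(S3)
payoffs = {
    "A1": [-20, 200, 400],
    "A2": [-50, -100, 600],
    "A3": [200, -50, 300],
}

def expected(row):
    """Probability-weighted average of one row of the table."""
    return sum(p * x for p, x in zip(probs, row))

# Opportunity loss = best payoff under a state minus the payoff of the act.
col_best = [max(col) for col in zip(*payoffs.values())]
losses = {a: [m - x for m, x in zip(col_best, row)]
          for a, row in payoffs.items()}

emv = {a: expected(row) for a, row in payoffs.items()}
eol = {a: expected(row) for a, row in losses.items()}

best_by_emv = max(emv, key=emv.get)
best_by_eol = min(eol, key=eol.get)
print(best_by_emv, best_by_eol)
```

Both criteria pick the same act, and for every act the sum EMV + EOL is the same constant, which is why maximising EMV and minimising EOL always agree.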
Similarly, if the decision-maker is certain that the state of nature S2 will be in effect, his course of action would be A1 and if he is certain that the state of nature S3 will be in effect, his course of action would be A2. The maximum payoffs associated with the actions are Rs 200 and Rs 600 respectively. The weighted average of these payoffs with weights equal to the probabilities of respective states of nature is termed as Expected Payoff under Certainty (EPC).

Thus, EPC = 200 × 0.3 + 200 × 0.4 + 600 × 0.3 = 320

The difference between EPC and EMV of optimal action is the amount of profit foregone due to uncertainty and is equal to EVPI.

Thus, EVPI = EPC - EMV of optimal action = 320 - 194 = 126

It is interesting to note that EVPI is also equal to EOL of the optimal action.

Cost of Uncertainty

This concept is similar to the concept of EVPI. Cost of uncertainty is the difference between the EOL of optimal action and the EOL under perfect information. Given the perfect information, the decision-maker would select an action with minimum opportunity loss under each state of nature. Since minimum opportunity loss under each state of nature is zero, therefore, EOL under certainty = 0 × 0.3 + 0 × 0.4 + 0 × 0.3 = 0.

Thus, the cost of uncertainty = EOL of optimal action = EVPI

Example 19: A group of students raise money each year by selling souvenirs outside the stadium of a cricket match between teams A and B. They can buy any of three different types of souvenirs from a supplier. Their sales are mostly dependent on which team wins the match. A conditional payoff (in Rs.) table is as under :

Type of Souvenir →    I        II      III
Team A wins           1200     800     300
Team B wins           250      700     1100

(i) Construct the opportunity loss table.

(ii) Which type of souvenir should the students buy if the probability of team A's winning is 0.6?

(iii) Compute the cost of uncertainty.

Solution:

(i) The Opportunity Loss Table

Type of Souvenir →    I       II      III
Team A wins           0       400     900
Team B wins           850     400     0

(ii) EOL of buying type I Souvenir = 0 × 0.6 + 850 × 0.4 = 340

EOL of buying type II Souvenir = 400 × 0.6 + 400 × 0.4 = 400

EOL of buying type III Souvenir = 900 × 0.6 + 0 × 0.4 = 540

Since the EOL of buying Type I Souvenir is minimum, the optimal decision is to buy Type I Souvenir.

(iii) Cost of uncertainty = EOL of the optimal action = Rs 340
Example 20: The following is the information concerning a product X : (i) Per unit profit is Rs 3. (ii) Salvage loss per unit is Rs 2.

From the above probability distribution, it is obvious that the optimum order would lie between and including 5 to 9. Let A denote the number of units ordered and D denote the number of units demanded per day. If D ≥ A, profit per day = 3A, and if D < A, profit per day = 3D - 2(A - D) = 5D - 2A. Thus, the profit matrix can be written as

Units demanded (D)    5       6       7       8       9
Probability           0.10    0.20    0.30    0.25    0.15    EMV
Action (A)
5                     15      15      15      15      15      15.00
6                     13      18      18      18      18      17.50
7                     11      16      21      21      21      19.00
8                     9       14      19      24      24      19.00
9                     7       12      17      22      27      17.75

From the above table, we note that the maximum EMV = 19.00, which corresponds to the order of 7 or 8 units. Since the order of the 8th unit adds nothing to the EMV, i.e., marginal EMV is zero, therefore, order of 8 units per day is optimal.

(ii) Expected profit under certainty = (5 × 0.10 + 6 × 0.20 + 7 × 0.30 + 8 × 0.25 + 9 × 0.15) × 3 = Rs 21.45
Alternative Method: The work of computations of EMV's, in the above example, can be reduced considerably by the use of the concept of expected marginal profit. Let π be the marginal profit and λ be the marginal loss of ordering an additional unit of the product. Then, the expected marginal profit of ordering the Ath unit is given by

$$\pi \cdot P(D \geq A) - \lambda \cdot P(D < A) = \pi \cdot P(D \geq A) - \lambda[1 - P(D \geq A)] = (\pi + \lambda) P(D \geq A) - \lambda \quad .... (1)$$

In our example, π = 3 and λ = 2. Thus, the expression for the expected marginal profit of the Ath unit

$$= (3 + 2) P(D \geq A) - 2 = 5P(D \geq A) - 2.$$

The computations of EMV, for alternative possible values of A, are shown in the following table :

Table for Computations

Action (A)    P(D ≥ A)*    EMP = 5P(D ≥ A) - 2         Total profit or EMV
5             1.00         5 × 1.00 - 2 = 3.00         5 × 3.00 = 15.00
6             0.90         5 × 0.90 - 2 = 2.50         15.00 + 2.50 = 17.50
7             0.70         5 × 0.70 - 2 = 1.50         17.50 + 1.50 = 19.00
8             0.40         5 × 0.40 - 2 = 0.00         19.00 + 0.00 = 19.00
9             0.15         5 × 0.15 - 2 = -1.25        19.00 - 1.25 = 17.75

Since the expected marginal profit (EMP) of the 8th unit is zero, therefore, optimal order is 8 units.
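Both routes used in Example 20 (the full profit matrix and the expected marginal profit) can be sketched in a few lines of Python. This is an illustration only; the names `profit`, `emv` and `emp` are assumptions, and the demand distribution is the one shown in the profit matrix of the example.

```python
from fractions import Fraction as F

# Daily demand distribution used in Example 20.
demand_dist = {5: F("0.10"), 6: F("0.20"), 7: F("0.30"), 8: F("0.25"), 9: F("0.15")}
PROFIT, LOSS = 3, 2   # per unit profit and per unit salvage loss (Rs)

def profit(order, demand):
    """3A if D >= A, otherwise 3D - 2(A - D)."""
    sold = min(order, demand)
    return PROFIT * sold - LOSS * (order - sold)

def emv(order):
    """Expected monetary value of ordering `order` units."""
    return sum(p * profit(order, d) for d, p in demand_dist.items())

def emp(order):
    """Expected marginal profit of the order-th unit: 5 P(D >= A) - 2."""
    p_ge = sum(p for d, p in demand_dist.items() if d >= order)
    return (PROFIT + LOSS) * p_ge - LOSS

emv_table = {a: emv(a) for a in demand_dist}
emp_table = {a: emp(a) for a in demand_dist}
for a in demand_dist:
    print(a, float(emp_table[a]), float(emv_table[a]))
```

Each EMV equals the previous EMV plus the expected marginal profit of the additional unit, which is the relation the table for computations relies on.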
Marginal Analysis
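Marginal analysis is used when the number of states of nature is considerably large. Using this analysis, it is possible to locate the optimal course of action without the computation of EMV's of various actions. An order of A units is said to be optimal if the expected marginal profit of the Ath unit is non-negative and the expected marginal profit of the (A + 1)th unit is negative. Using equation (1), we can write

$$(\pi + \lambda) P(D \geq A) - \lambda \geq 0 \quad .... (2)$$

and

$$(\pi + \lambda) P(D \geq A + 1) - \lambda < 0 \quad .... (3)$$

From equation (2), we get

$$P(D \geq A) \geq \frac{\lambda}{\pi + \lambda} \quad \text{or} \quad 1 - P(D < A) \geq \frac{\lambda}{\pi + \lambda}$$

or

$$P(D < A) \leq 1 - \frac{\lambda}{\pi + \lambda} \quad \text{or} \quad P(D \leq A - 1) \leq \frac{\pi}{\pi + \lambda} \quad .... (4)$$

[P(D ≤ A - 1) = P(D < A), since A is an integer]

Further, equation (3) gives

$$P(D \geq A + 1) < \frac{\lambda}{\pi + \lambda} \quad \text{or} \quad 1 - P(D < A + 1) < \frac{\lambda}{\pi + \lambda}$$

or

$$P(D < A + 1) > 1 - \frac{\lambda}{\pi + \lambda} \quad \text{or} \quad P(D \leq A) > \frac{\pi}{\pi + \lambda}$$

Combining the two results, we get

$$P(D \leq A - 1) \leq \frac{\pi}{\pi + \lambda} < P(D \leq A).$$

Writing the probability distribution, given in example 20, in the form of less than type cumulative probabilities which is also known as the distribution function F(D), we get

Units demanded (D)    :  5      6      7      8       9
F(D)                  :  0.1    0.3    0.6    0.85    1.00

We are given

$$\frac{\pi}{\pi + \lambda} = \frac{3}{5} = 0.6$$

Since the next cumulative probability, i.e., 0.85, corresponds to 8 units, hence, the optimal order is 8 units.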
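The rule derived above amounts to scanning the cumulative distribution until it first exceeds the ratio π/(π + λ). A minimal Python sketch follows; the function name `optimal_order` is an assumption, demand is taken to assume integer values only, and exact fractions are used because a floating-point sum such as 0.1 + 0.2 + 0.3 need not compare equal to 0.6.

```python
from fractions import Fraction as F

def optimal_order(demand_dist, unit_profit, unit_loss):
    """Return the A satisfying P(D <= A - 1) <= pi/(pi + lambda) < P(D <= A)."""
    ratio = F(unit_profit, unit_profit + unit_loss)
    cumulative = F(0)
    for units in sorted(demand_dist):
        cumulative += demand_dist[units]   # F(D) up to and including `units`
        if cumulative > ratio:
            return units
    raise ValueError("cumulative probability never exceeds the critical ratio")

demand_dist = {5: F("0.10"), 6: F("0.20"), 7: F("0.30"), 8: F("0.25"), 9: F("0.15")}
print(optimal_order(demand_dist, unit_profit=3, unit_loss=2))
```

With π = 3 and λ = 2 the cumulative probability at 7 units equals the ratio exactly, so the scan moves on to 8 units, in agreement with the result obtained above.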
PROBABILITIES
IN
When the objective probabilities of the occurrence of various states of nature are not known, the same can be assigned on the basis of the expectations or the degree of belief of the decision-maker. Such probabilities are known as subjective or personal probabilities. It may be pointed out that different individuals may assign different probability values to given states of nature. This indicates that a decision problem under uncertainty can always be converted into a decision problem under risk by the use of subjective probabilities. Such an approach is also termed as Subjectivists' Approach. Example 21: The conditional payoff (in Rs) for each action-event combination are as under:
Action →     1      2      3      4
Event ↓
A            4      -2     7      8
B            0      6      3      5
C            -5     9      2      -3
D            3      1      4      5
E            6      6      3      2

(i) Which is the best action in accordance with the Maximin Criterion?

(ii) Which is the best action in accordance with the EMV Criterion, assuming that all the events are equally likely?

Solution:

(i) The minimum payoffs for various actions are :

Action 1 = -5
Action 2 = -2
Action 3 = 2
Action 4 = -3

Since the payoff for action 3 is maximum, therefore, A3 is optimal on the basis of maximin criterion.

(ii) Since there are 5 equally likely events, the probability of each of them would be 1/5.

Thus, the EMV of action 1, i.e., EMV1 = (4 + 0 - 5 + 3 + 6)/5 = 8/5 = 1.6

Similarly, EMV2 = 20/5 = 4.0, EMV3 = 19/5 = 3.8 and EMV4 = 17/5 = 3.4

Thus, action 2 is optimal.
PROBABILITIES
IN
The probability values of various states of nature, discussed so far, were prior probabilities. Such probabilities are either computed from the past data or assigned subjectively. It is possible to revise these probabilities in the light of current information available by using the Bayes' Theorem. The revised probabilities are known as posterior probabilities.

Example 22: A manufacturer of detergent soap must determine whether or not to expand his productive capacity. His profit per month, however, depends upon the potential demand for his product which may turn out to be high or low. His payoff matrix is given below:

                 Do not Expand    Expand
High Demand      Rs 5,000         Rs 7,500
Low Demand       Rs 5,000         Rs 2,100

On the basis of past experience, he has estimated the probability that demand for his product being high in future is only 0.4. Before taking a decision, he also conducts a market survey. From the past experience he knows that when the demand has been high, such a survey had predicted it correctly only 60% of the times and when the demand has been low, the survey predicted it correctly only 80% of the times. If the current survey predicts that the demand of his product is going to be high in future, determine whether the manufacturer should increase his production capacity or not. What would have been his decision in the absence of survey?

Solution: Let H be the event that the demand will be high. Therefore,

From the above table, we can write

$$P(H / D) = \frac{0.24}{0.36} = \frac{2}{3} \quad \text{and} \quad P(\bar{H} / D) = \frac{0.12}{0.36} = \frac{1}{3}$$

The EMV of the act 'don't expand' = 5000 × 2/3 + 5000 × 1/3 = Rs 5,000

and the EMV of the act 'expand' = 7500 × 2/3 + 2100 × 1/3 = Rs 5,700
Since the EMV of the act 'expand' > the EMV of the act 'don't expand', the manufacturer should expand his production capacity. It can be shown that, in the absence of survey the EMV of the act 'don't expand' is Rs 5,000 and the EMV of the act expand is Rs 4,260. Hence, the optimal act is 'don't expand'.
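The revision of the prior probabilities in Example 22 can be written out step by step. The following Python sketch is an illustration; the dictionary names are assumptions, and the likelihood of a 'high' prediction when demand is actually low is taken as 1 - 0.8 = 0.2, as implied by the 80% accuracy stated in the example.

```python
from fractions import Fraction as F

prior = {"high": F("0.4"), "low": F("0.6")}
# P(survey predicts high demand / actual demand)
likelihood = {"high": F("0.6"), "low": 1 - F("0.8")}
payoff = {
    "do not expand": {"high": 5000, "low": 5000},
    "expand":        {"high": 7500, "low": 2100},
}

# Bayes' Theorem: posterior = prior * likelihood / total probability of the prediction.
joint = {s: prior[s] * likelihood[s] for s in prior}
p_predict_high = sum(joint.values())
posterior = {s: joint[s] / p_predict_high for s in prior}

def emv(act, probs):
    """Expected monetary value of an act under the given state probabilities."""
    return sum(probs[s] * payoff[act][s] for s in probs)

emv_with_survey = {act: emv(act, posterior) for act in payoff}
emv_without_survey = {act: emv(act, prior) for act in payoff}
print(posterior)
print(emv_with_survey, emv_without_survey)
```

The same payoff table leads to opposite decisions under the prior and the posterior probabilities, which is the point of the example.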
The decision tree diagrams are often used to understand and solve a decision problem. Using such diagrams, it is possible to describe the sequence of actions and chance events. A decision node is represented by a square and various action branches stem from it. Similarly, a chance node is represented by a circle and various event branches stem from it. Various steps in the construction of a decision tree can be summarised as follows :

(i) Show the appropriate action-event sequence beginning from left to right of the page.

(ii) Write the probabilities of various events along their respective branches stemming from each chance node.

(iii) Write the payoffs at the end of each of the right-most branch.

(iv) Moving backward, from right to left, compute EMV of each chance node, wherever encountered. Enter this EMV in the chance node. When a decision node is encountered, choose the action branch having the highest EMV. Enter this EMV in the decision node and cut off the other action branches.

Following this approach, we can describe the decision problem of the above example as given below:

Case I: When the survey predicts that the demand is going to be high

Thus, the optimal act is to expand capacity.

Case II: In the absence of survey
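The backward computation described in step (iv) can be sketched as a short recursive function. This is an illustrative Python sketch, not the diagram of the original text: the tuple-based tree representation and the name `rollback` are assumptions made here, and the probabilities are those of Example 22 (posterior 2/3 and 1/3 for Case I, prior 0.4 and 0.6 for Case II).

```python
from fractions import Fraction as F

def rollback(node):
    """Return (EMV, chosen action) of a node by moving from right to left."""
    kind = node[0]
    if kind == "payoff":                      # ("payoff", amount)
        return node[1], None
    if kind == "chance":                      # ("chance", [(probability, child), ...])
        return sum(p * rollback(child)[0] for p, child in node[1]), None
    if kind == "decision":                    # ("decision", {action: child, ...})
        values = {a: rollback(child)[0] for a, child in node[1].items()}
        choice = max(values, key=values.get)  # keep the branch with highest EMV
        return values[choice], choice
    raise ValueError(f"unknown node type: {kind}")

def tree(p_high):
    """Decision tree of Example 22 for a given probability of high demand."""
    p_low = 1 - p_high
    return ("decision", {
        "expand": ("chance", [(p_high, ("payoff", 7500)), (p_low, ("payoff", 2100))]),
        "do not expand": ("chance", [(p_high, ("payoff", 5000)), (p_low, ("payoff", 5000))]),
    })

case_1 = rollback(tree(F(2, 3)))   # survey predicts high demand
case_2 = rollback(tree(F(2, 5)))   # no survey, prior probability 0.4
print(case_1, case_2)
```

Chance nodes are averaged and decision nodes are maximised, so a larger tree with several stages could be evaluated by the same function without change.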
437
The profit or loss (in Rs) under the three states is estimated as X 30,000 20,000 10,000 Y 60,000 30,000 20,000 Z 40,000 10,000 % 15,000 Prepare the expected value table and advise the management about the choice of product. Hint: Compute expected profit for each commodity. 3. A pig breeder can either produce 20 or 30 pigs. The total production of his competitors can be either 5,000 or 10,000 pigs. If they produce 5,000 pigs, his profit per pig is Rs 60; if they produce 10,000 pigs, his profit per pig is Rs 45 only. Construct a payoff table and also state what should the pig breeder decide?
Hint: This is a decision problem under uncertainty where the courses of action are to produce 20 or 30 pigs while the states of nature are the production of 5,000 or 10,000 pigs by his competitors.
4. Mr X quite often flies from town A to town B. He can use the airport bus which costs Rs 13 but if he takes it, there is a 0.08 chance that he will miss the flight. A hotel limousine costs Rs 27 with a 0.96 chance of being on time for the flight. For Rs 50 he can use a taxi which will make 99 of 100 flights. If Mr X catches the flight on time, he will conclude a business transaction which will produce a profit of Rs 1,000; otherwise he will lose it. Which mode of transportation should Mr X use? Answer on the basis of EMV criterion.
Hint: EMV of using airport bus = (1000 - 13) × 0.92 - 13 × 0.08, etc.
5. A distributor of a certain product incurs holding cost of Rs 100 per unit per week and a shortage cost of Rs 300 per unit. The data on the sales of the product are given below:
Weekly Sales : 0 1 2 3 4 5 6 7 8
No. of Weeks : 0 0 5 10 15 15 5 0 0
Find his optimal stock.
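The hint for the transport problem of Mr X (exercise 4) can be sketched in a few lines of Python. This is only an illustrative sketch: the names `modes`, `PROFIT` and `emv` are assumptions introduced here, and the probability 0.92 for the bus is taken as 1 - 0.08, as in the hint.

```python
# Data from exercise 4: mode -> (fare in Rs, probability of catching the flight)
modes = {
    "airport bus": (13, 0.92),      # 0.92 = 1 - 0.08 chance of missing
    "hotel limousine": (27, 0.96),
    "taxi": (50, 0.99),             # makes 99 of 100 flights
}
PROFIT = 1000  # profit in Rs if the flight is caught


def emv(fare, p_on_time, profit=PROFIT):
    """EMV = (profit - fare) * p - fare * (1 - p), as in the hint."""
    return (profit - fare) * p_on_time - fare * (1 - p_on_time)


emv_table = {mode: emv(fare, p) for mode, (fare, p) in modes.items()}
best_mode = max(emv_table, key=emv_table.get)

for mode, value in emv_table.items():
    print(f"{mode:16s} EMV = Rs {value:.2f}")
print("Mode with the highest EMV:", best_mode)
```

The same pattern (payoff for each outcome, weighted by its probability, then the maximum over actions) applies to the other EMV exercises in this set.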
1. Distinguish between Hurwicz Criterion and Laplace Criterion.
2. What is the use of subjective and posterior probabilities in decision-making?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
$\mathrm{Var}(X) = E[X - E(X)]^2 = E(X^2) - [E(X)]^2$
$E(b) = b$, where b is a constant
$E(aX + b) = aE(X) + b$
$\mathrm{Var}(aX + b) = a^2 \mathrm{Var}(X)$
$E(X + Y) = E(X) + E(Y)$
$E(X \cdot Y) = E(X) \cdot E(Y)$, if X and Y are independent.
Bayesian Decision Criterion: An action with maximum EMV or minimum EOL is said to be optimal.
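The expectation and variance results listed above can be checked numerically on a small example. The sketch below uses a hypothetical three-point distribution and hypothetical constants a = 3, b = 5; none of these numbers come from the lesson, and exact fractions are used so that the equalities hold without rounding.

```python
from fractions import Fraction as F

# Hypothetical distribution of X: value -> probability (illustrative only)
dist = {0: F(1, 4), 1: F(1, 2), 2: F(1, 4)}


def expect(g, dist):
    """E[g(X)] for a discrete distribution."""
    return sum(g(x) * p for x, p in dist.items())


def variance(g, dist):
    """Var[g(X)] = E[(g(X) - E[g(X)])^2]."""
    mean = expect(g, dist)
    return expect(lambda x: (g(x) - mean) ** 2, dist)


a, b = 3, 5  # hypothetical constants

e_x = expect(lambda x: x, dist)
var_x = variance(lambda x: x, dist)
var_shortcut = expect(lambda x: x * x, dist) - e_x ** 2  # E(X^2) - [E(X)]^2

e_y = expect(lambda x: a * x + b, dist)      # should equal a*E(X) + b
var_y = variance(lambda x: a * x + b, dist)  # should equal a^2 * Var(X)

print("E(X) =", e_x, " Var(X) =", var_x, " shortcut =", var_shortcut)
print("E(aX+b) =", e_y, " Var(aX+b) =", var_y)
```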
12.14 KEYWORDS
Variable Decision Analysis Variance Theorems Marginal Analysis
Distinguish Between:
Explain the concept of random variable and its probability distribution by using a simple example. What is mathematical expectation of a random variable? If Y = aX + b, where X is a random variable, show that E(Y) = aE(X) + b. If X and Y are two independent random variables, show that (a) E(X + Y) = E(X) + E(Y) (b) E(X.Y) = E(X).E(Y) A bag contains 3 rupee coins, 6 fifty paise coins and 4 twenty-five paise coins. A man draws a coin at random. What is the expectation of his draw? A box contains five tickets; two of which carry a prize of Rs 8 each and the other three of Rs 3 each. If two tickets are drawn at random, find the expected value of the prize.
6. Obtain the probability distribution of the number of aces in simultaneous throws of two unbiased dice.
7. You are told that the time to service a car at a service station is uncertain with the following probability density function: $f(x) = 3x - 2x^2 + 1$ for $0 \le x \le 2$, and $f(x) = 0$ otherwise. Examine whether this is a valid probability density function.
8.
9.
An urn contains 4 white and 3 black balls. 3 balls are drawn at random. Write down the probability distribution of the number of white balls. Find mean and variance of the distribution.
10. A consignment is offered to two firms A and B for Rs 50,000. The following table shows the probability at which the firm will be able to sell it at different prices :
Selling Price (in Rs) : 40,000 45,000 55,000 70,000
Prob. of A : 0.3 0.4 0.2 0.1
Prob. of B : 0.1 0.2 0.4 0.3
Which of the two firms will be more inclined towards the offer? 11. If the probability that the value of a certain stock will remain same is 0.46, the probabilities that its value will increase by Re. 0.50 or Re. 1.00 per share are respectively 0.17 and 0.23 and the probability that its value will decrease by Re. 0.25 per share is 0.14, what is the expected gain per share?
12. In a college fete a stall is run where on buying a ticket a person is allowed one throw of two dice. If this gives a double six, 10 times the ticket money is refunded and in other cases nothing is refunded. Will it be profitable to run such a stall? What is the expectation of the player? State clearly the assumptions if any, for your answer. 13. The proprietor of a food stall has introduced a new item of food. The cost of making it is Rs 4 per piece and because of its novelty, it would be sold for Rs 8 per piece. It is, however, perishable and pieces remaining unsold at the end of the day are a dead loss. He expects the daily demand to be variable and has drawn up the following probability distribution expressing his estimates:
No. of pieces demanded : 50 51 52 53 54 55
Probability : 0.05 0.07 0.20 0.35 0.25 0.08
Compute his expected profit or loss if he prepares 53 pieces on a particular day.
14. The probability that there is at least one error in an accounts statement prepared by A is 0.2 and for B and C are 0.25 and 0.4 respectively. A, B and C prepare 10, 16 and 20 statements respectively. Find the expected number of correct statements in all. 15. Three coins whose faces are marked as 1 and 2 are tossed. What is the expectation of the total value of numbers on their faces? 16. A person has the choice of running hot snack stall or an ice cream and cold drink shop at a certain holiday resort during the coming summer season. If the weather during the season is cool and rainy, he can expect to make a profit of Rs 15,000 and if it is warm, he can expect to make a profit of Rs 3,000 only, by running a hot snack stall. On the other hand, if his choice is to run an ice cream and cold drink
shop, he can expect to make a profit of Rs 18,000 if the weather is warm and only Rs 3,000 if the weather is cool and rainy. The meteorological authorities predict that there is 40% chance of the weather being warm during the coming season. You are to advise him as to the choice between the two types of stalls. Base your argument on the expectation of the result of the two courses of action and show the result in a tabular form. 17. Show that the expectation of the number of failures preceding the first success in an infinite series of independent trials is q/p, where p is the probability of success in a single trial and q = 1 - p. 18. If X is a random variable with expected value 50 and standard deviation 4, find the values of a and b such that the expected value of Y = aX + b is zero and standard deviation is 6. 19. A discrete random variable X has the following probability distribution:
X : 0 1 2 3 4 5
p(X) : k 2k 3k 5k 4k 3k
Find (a) the value of k, (b) $P(X \ge 3)$, (c) the value of m such that $P(X \le m) = 5/6$ and (d) write the distribution function of X.
20. A company introduces a new product in the market and expects to make a profit of Rs 2.5 lacs during first year if the demand is 'good', Rs 1.5 lacs if the demand is 'moderate' and a loss of Rs 1 lac if the demand is 'poor'. Market research studies indicate that the probabilities for the demand to be good and moderate are 0.2 and 0.5 respectively. Find the company's expected profit and standard deviation.
21. If it rains, a taxi driver can earn Rs 100 per day. If it is fair, he can lose Rs 10 per day. What is his expectation if the probability of rain is 0.4?
22. A player tosses 3 fair coins. He wins Rs 10 if three heads appear, Rs 6 if two heads appear, Rs 2 if one head appears and loses Rs 25 if no head appears. Find the expected gain of the player.
23. A player tosses 3 fair coins. He wins Rs 12 if three tails occur, Rs 7 if two tails occur and Rs 2 if only one tail occurs. How much should he win or lose in case of occurrence of no tail if the game is given to be fair?
24. A firm plans to bid Rs 300 per tonne for a contract to supply 1,000 tonnes of a metal. It has two competitors A and B and it assumes that the probability that A will bid less than Rs 300 per tonne is 0.3 and that B will bid less than Rs 300 per tonne is 0.7. If the lowest bidder gets all the business and the firms bid independently, what is the expected value of the contract to the firm?
25. A certain production process produces items that are 10 percent defective. Each item is inspected before being supplied to customers but the inspector incorrectly classifies an item 10 percent of the time. Only items classified as good are supplied. If 820 items in all have been supplied, how many of these are expected to be defective?
Hint: Let A be the event that an item is supplied. P(A) = 0.10 × 0.10 + 0.90 × 0.90 = 0.82. Let B be the event that a defective item is supplied. P(B) = 0.10 × 0.10 = 0.01. Therefore P(B/A) = 0.01/0.82.
26. You are given the following payoffs of three acts A1, A2 and A3 and the states of nature S1, S2 and S3:
States of Nature    Acts
                    A1     A2     A3
S1                  25     10     125
S2                  400    440    400
S3                  650    740    750
The probabilities of the three states of nature are 0.1, 0.7 and 0.2 respectively. Compute and tabulate the EMV and determine the optimal act. 27. Given is the following payoff (in Rs) matrix :
State of Nature    Probability    Decision
                                  Do not Expand    Expand 200 units    Expand 400 units
                                  2500             3500                5000
                                  2500             3500                2500
                                  2500             1500                1000
What should be the decision if we use (i) the EMV criterion, (ii) the minimax criterion and (iii) the maximin criterion?
28. The proprietor of a food stall has invented a new food delicacy which he calls WHIM. He has calculated that the cost of manufacture is Re 1 per piece and because of its novelty, it can be sold for Rs 3 per piece. It is, however, perishable and the goods unsold at the end of the day are a dead loss. He expects the demand to be variable and has drawn up the following probability distribution of his estimate:
No. of pieces demanded : 10 11 12 13 14 15
Probability : 0.07 0.10 0.23 0.38 0.12 0.10
(i) Find an expression for his net profit or loss if he manufactures m pieces and only n are demanded. Consider separately the two cases $n \le m$ and $n > m$.
(ii) Assume that he manufactures 12 pieces. Using the results in (i) above, find his net profit or loss for each level of demand.
(iii) Using the probability distribution, calculate his expected net profit or loss if he manufactures 12 pieces.
(iv) Calculate the expected profit or loss for each of the levels of manufacture ($10 \le m \le 15$).
(v) How many pieces should be manufactured so that his expected profit is maximum?
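The steps of exercise 28 can be sketched in Python as follows. This is a hedged illustration rather than a model answer: the function names `net_profit` and `expected_profit` are assumptions introduced here, and the net profit is taken as sales revenue minus manufacturing cost, with unsold pieces counted as a dead loss as the exercise states.

```python
COST, PRICE = 1, 3  # Re 1 to manufacture, Rs 3 selling price (from the exercise)

# Demand distribution given in the exercise: pieces demanded -> probability
demand_probs = {10: 0.07, 11: 0.10, 12: 0.23, 13: 0.38, 14: 0.12, 15: 0.10}


def net_profit(m, n):
    """Net profit when m pieces are made and n are demanded.

    If n <= m only n pieces are sold; if n > m all m pieces are sold.
    """
    sold = min(m, n)
    return PRICE * sold - COST * m


def expected_profit(m):
    """Expected net profit for a manufacturing level m."""
    return sum(p * net_profit(m, n) for n, p in demand_probs.items())


expected = {m: expected_profit(m) for m in range(10, 16)}
best_m = max(expected, key=expected.get)

for m, value in expected.items():
    print(f"m = {m}: expected net profit = Rs {value:.2f}")
print("Level of manufacture with maximum expected profit:", best_m)
```

The same structure (a payoff function of stock and demand, averaged over the demand distribution) carries over to the vaccine, magazine, boat and berry problems that follow, once the salvage or return value of unsold units is added to `net_profit`.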
29. A physician purchases a particular vaccine on Monday of each week. The vaccine must be used in the current week, otherwise it becomes worthless. The vaccine costs Rs 2 per dose and the physician charges Rs 4 per dose. In the past 50 weeks, the physician has administered the vaccine in the following quantities :
Doses per week : 20 25 40 60
No. of weeks : 5 15 25 5
Determine the number of doses the physician should buy every week. 30. The marketing staff of a certain industrial organisation has submitted the following payoff table, giving profits in million rupees, concerning a proposal depending upon the rate of technological advance in the next three years :
Technological advance    Accept Proposal    Reject Proposal
Much                     2                  3
Little                   5                  2
None                     1                  4
The probabilities are 0.2, 0.5 and 0.3 for Much, Little and None technological advance respectively. What decision should be taken?
31. A newspaper distributor assigns probabilities to the demand for a magazine as follows:
Copies Demanded : 1 2 3 4
Probability : 0.4 0.3 0.2 0.1
A copy of magazine sells for Rs 7 and costs Rs 6. What can be the maximum possible expected monetary value (EMV) if the distributor can return the unsold copies for Rs 5 each? Also find EVPI. 32. A management is faced with the problem of choosing one of the three products for manufacturing. The potential demand for each product may turn out to be good, fair or poor. The probabilities for each type of demand were estimated as follows:
Demand 9 Product 8 A B C
The estimated profit or loss (in Rs) under the three states of demand in respect of each product may be taken as :
A    35,000    15,000    5,000
B    50,000    20,000    3,000
C    60,000    30,000    20,000
Prepare the expected value table and advise the management about the choice of the product. 33. The payoffs of three acts A, B and C and the states of nature P, Q and R are given as :
States of Nature    Payoffs (in Rs)
                    A       B       C
P                   -35     120     -100
Q                   250     -350    200
R                   550     650     700
The probabilities of the states of nature are 0.5, 0.1 and 0.4 respectively. Tabulate the Expected Monetary Values for the above data and state which can be chosen as the best act? Calculate expected value of perfect information also. 34. A manufacturing company is faced with the problem of choosing from four products to manufacture. The potential demand for each product may turn out to be good, satisfactory or poor. The probabilities estimated of each type of demand are given below :
Product    Probabilities of type of demand
           Good    Satisfactory    Poor
A          0.60    0.20            0.20
B          0.75    0.15            0.10
C          0.60    0.25            0.15
D          0.50    0.20            0.30
The estimated profit (in Rs) under different states of demand in respect of each product may be taken as :
A B C D
Prepare the expected value table and advise the company about the choice of product to manufacture.
35. A shopkeeper at a local stadium must determine whether to sell ice cream or coffee at today's game. The shopkeeper believes that the profit will depend upon the weather. The payoff table is as follows :
Event           Action
                Sell Coffee    Sell Ice cream
Cool Weather    Rs 40          Rs 20
Warm Weather    Rs 55          Rs 80
Based upon his past experience at this time of the year, the shopkeeper estimates the probability of warm weather as 0.60. Prior to making his decision, the shopkeeper decides to hear forecast of the local weatherman. In the past, when it has been cool, the weatherman has forecast cool weather 80% times. When it has been warm, the weatherman has forecast warm weather 70% times. If today's forecast is for cool weather, using Bayesian decision theory and EMV criterion, determine whether the shopkeeper should sell ice cream or coffee? 36. A producer of boats has estimated the following distribution of demand for a particular kind of boat :
Demand : 0 1 2 3 4 5 6
Probability : 0.14 0.27 0.27 0.18 0.09 0.04 0.01
Each boat costs him Rs 7,000 and he sells them for Rs 10,000 each. Any boats that are left unsold at the end of the season must be disposed of for Rs 6,000 each. How many boats should be kept in stock to maximise his expected profit?
37. A retailer purchases berries every morning at Rs 5 a case and sells them for Rs 8 a case. Any case remaining unsold at the end of the day can be disposed of the next day at a salvage value of Rs 2 per case (thereafter they have no value). Past sales have ranged from 15 to 18 cases per day. The following is the record of sales for the past 120 days:
(iii) Distribution function is another name of cumulative probability function.
(iv) Any function of a random variable is also a random variable.
(v) The expected value of the sum of two or more random variables is equal to the sum of their expected values only if they are independent.
(vi) In the process of decision-making, the decision-maker can also assign probabilities to various states of nature based upon his degree of belief.
39. Fill in blanks:
(i) The probability that a ........ random variable takes a particular value is always zero.
(ii) The mean of a random variable is also termed as its ........ value.
(iv) If the conditional distribution of X given Y is same as the marginal distribution of X, then X and Y are ........ random variables. (v) The selection of a particular decision criterion depends upon the ........ of the decision-maker.
ANSWERS TO QUESTIONS FOR DISCUSSION
1. (b) Joint (c) (X + Y) (d) Possible (e) Expected Value with Perfect Information (EVPI)
2. (a) True (b) True (c) False (d) True (e) True
Unit-V
LESSON 13
INVENTORY MODEL
CONTENTS
13.0 Aims and Objectives
13.1 Introduction
13.2 Need of Inventory Control
13.3 Advantages of Material Controls
13.4 Essential Factors of Material Control
13.5 ABC Analysis Technique
13.6 Process of Inventory Control
13.7 Minimum Stock Level
13.8 Maximum Stock Level
13.9 Ordering Level or Re-order Level
13.10 Average Stock level
13.11 Danger Level
13.12 Let us Sum Up
13.13 Lesson-end Activities
13.14 Keywords
13.15 Questions for Discussion
13.16 Terminal Questions
13.17 Model Answers to Questions for Discussion
13.18 Suggested Readings
13.1 INTRODUCTION
Inventory means the physical stock of goods kept in hand for the smooth and efficient running of the future affairs of an organisation, at the minimum cost of funds blocked in inventories. In a manufacturing organisation, inventory control plays a significant role because the total investment in inventories of various kinds is quite substantial. In this chapter we discuss the meaning of inventory, the need to control inventory, the advantages of material control, the essential factors of material control, the ABC analysis technique and the process of inventory control.
Inventory can be defined as the stock of goods, commodities or other resources that are stored at any given period for future production. In practice, inventory control is itself a process through which the demand for items, scheduling, purchasing, receiving, inspection, storage and despatch are arranged so that goods can be despatched to the production department at minimum cost and in minimum time. Inventory control makes the most effective use of available capital and ensures an adequate supply of goods for production.
1. Proper Co-ordination: There should be proper co-ordination between all the departments that use materials, such as the purchase department, store department, inspection department, accounts department, production department and sales department, so that there is neither a scarcity nor an excess of material.
2. Centralisation of Purchasing: An important requirement of a successful inventory control system is the appointment of intelligent and experienced personnel in the purchase department. These personnel should be experts in their field and in negotiating deals.
3. Proper Scheduling: All the requisitions made by the production department should be scheduled, so that material can be issued to it in time and production is not stopped.
4. Proper Classification: Inventories should be classified and identified by allotting a proper code number to each item and group, to facilitate prompt recording, locating and dealing.
5. Use of Standard Forms: Standard forms should be used so that any information can be sent to all departments without delay.
6. Internal Check System: An audit should be done by an independent party to check the effectiveness of the inventory control system.
7. Proper Storing System: Adequate and well-organised warehouse facilities with well-equipped handling facilities must be there. Such facilities will reduce the wastage due to leakage, wear and tear, dust and mishandling of materials. The store should be located between the purchase department and the production department, so that the cost of internal transportation can be minimised.
8. Proper Store Accounting: Efficient inventory control necessitates maintenance of proper inventory records. Any information regarding a particular item of inventory may be taken from such records.
9. Proper Issuing System: There should be a well-organised system for issuing material so that the production process does not suffer.
10. Perpetual Inventory System: Daily stock position should be taken in this system.
11. Fixing of Various Stock Levels: Minimum stock level, maximum stock level, re-order point, safety level, etc. should be pre-determined to ensure the continuity of smooth production.
12. Determination of Economic Order Quantity: Economic order quantity should be determined to minimise the cost of inventory.
13. Regular Reporting System: Information regarding the stock position, material quantities, etc. should be available to management regularly.
Steps in ABC Analysis
No definite procedure can be laid down for classifying inventories into A, B and C categories, as this depends upon a number of factors such as the nature and variety of items, the specific requirements of the business, the place of items in production, etc. These factors vary from business to business and from item to item. However, the following procedure can be followed:
(i) First, the quantity of each material expected to be used in a given period should be estimated.
(ii) Secondly, the money value of the items of materials so chosen should be calculated by multiplying the quantity of each item by its price.
(iii) Thirdly, the items should be rearranged in descending order of their value, irrespective of their quantities.
(iv) Fourthly, a running total of all the values and items should be taken, and the figures so obtained should be converted into percentages of the gross total.
(v) Fifthly and lastly, it will be found that a small number of the first few items amount to a large percentage of the total value of the items. The management will then have to decide the percentage of the total value, or of the total number of items, to be covered by the A, B and C categories.
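The five steps above can be sketched as a short Python routine. Everything numeric here is an assumption made for illustration: the item names, quantities and prices are hypothetical, and the cut-offs of 70% and 90% of cumulative value for the A and B categories stand in for the decision that step (v) leaves to management.

```python
# Hypothetical items: (name, quantity expected to be used, unit price in Rs)
items = [
    ("bearing", 100, 150.0),
    ("motor", 20, 3000.0),
    ("bolt", 5000, 1.0),
    ("gasket", 800, 5.0),
    ("casting", 50, 400.0),
    ("washer", 10000, 0.2),
]
A_CUTOFF, B_CUTOFF = 70.0, 90.0  # assumed cumulative-value cut-offs (percent)


def abc_classify(items, a_cutoff=A_CUTOFF, b_cutoff=B_CUTOFF):
    """Return a list of (name, value, cumulative % of total value, category)."""
    # Steps (i)-(ii): money value = quantity x price
    valued = [(name, qty * price) for name, qty, price in items]
    # Step (iii): descending order of value
    valued.sort(key=lambda pair: pair[1], reverse=True)
    # Step (iv): running total as a percentage of the gross total
    total = sum(value for _, value in valued)
    running = 0.0
    table = []
    for name, value in valued:
        running += value
        cum_pct = 100.0 * running / total
        # Step (v): assign categories using the assumed cut-offs
        if cum_pct <= a_cutoff:
            category = "A"
        elif cum_pct <= b_cutoff:
            category = "B"
        else:
            category = "C"
        table.append((name, value, cum_pct, category))
    return table


table = abc_classify(items)
classes = {name: category for name, _, _, category in table}
for name, value, cum_pct, category in table:
    print(f"{name:8s} value = {value:9.2f}  cumulative = {cum_pct:6.2f}%  class {category}")
```

With different cut-offs the same items fall into different categories, which is why the choice of percentages is treated as a management decision rather than a fixed rule.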
Advantages of ABC Analysis
These are as follows:
1. Increase in Profitability: ABC analysis ensures a close control over the items of A, B and C categories and, due to the control over A category items, the capital investment in inventory reduces.
2. Other Uses: The technique of ABC analysis is based on the principle of management by exception and can be used in areas like distribution, sales, etc.
Process of Purchasing of Materials
Its steps are as follows:
1. Establishment of Purchase Department: A separate department should be established for the purchase of materials. This department not only ensures the availability of raw material but also purchases machines, stationery, etc. Purchase of materials should be centralised, with all purchases under a single department. Centralised purchase is generally possible only in those industries which are located at a single place and whose production is of the same type. If an industry has production centres at different places, it becomes compulsory to follow a decentralised purchase system. Thus it is necessary to have complete knowledge about the nature of production, capacity of locality, etc.
2. Preparation of Purchasing Budget: First of all, the production target of the company should be determined, on the basis of which the budget for purchasing of material is prepared. The following points should be kept in mind while preparing the purchase budget:
(i) System to receive the materials.
(ii) The quantity and quality of the material according to the production requirements.
(iii) Source of supply.
(iv) Present balance of materials and predictions to receive the materials ordered.
(v) Available cash for debtors.
(vi) On which date the indent is made by the concerned department.
(vii) The conditions regarding the value of the material and rebate or discount on it.
3. Preparation of Purchase Requisition Slip: The initiation of a purchase begins with a formal request from the various sections or departments to the purchase department to order goods. The request is made in a prescribed form to the purchase department by the departments needing the goods, authorising the purchase department to procure the goods as per the specifications given in the slip by the date mentioned on it.
Specimen of a PRS
Katech Corporation Ltd
Purchase Requisition Slip
No. Pr .............................    Cost Centre .............................    Date: .............................
Please purchase for ............................. department
Item No.    Code No.    Description    Quantity Required    Remark
Checked by ........................    Approved by    Store keeper ........................
For use of department issuing this requisition: Item No.    Quantity in stock
For use of Purchase department: Purchase order no.    Supplier    Delivery Date
The requisitions are generally prepared in triplicate: the original copy is sent to the purchase department, the second copy is retained by the store or the department initiating the purchase requisition, and the third is sent to the costing department.
4. Obtaining the Tender: After the decision to purchase, tenders are invited from prospective suppliers. After studying the terms of supply and the quantity and quality of the goods, a vendor is selected through a comparative study of the tenderers. The following type of table may be used:
Specimen of Tenderer Table
Katech Corporation Ltd.
Schedule of Quotations
Material ........................    Date ........................    S.No ........................
Name of the party    Quantity offered    Rate/Unit    Terms    Time of delivery    Mode of delivery    Remarks
Date ........................
5. Sending Purchase Order: After comparing the different tenders, the best vendor is decided and the order for the required material is placed with him. The purchase order is prepared in a prescribed form by the purchase department and sent to the vendor, authorising him to supply a specified quantity and quality of the materials on the stipulated terms at the time and place mentioned therein. Generally a purchase order has the following information:
(i) Name of the purchaser, serial no. and date of order.
(ii) Name of vendor and address.
(iii) Full details of materials, quantity, etc.
(iv) Value, rebate and terms of payment, etc.
(v) Time and place of delivery.
(vi) Directions regarding packing and despatching.
(vii) Signature of purchaser.
(viii) Method of follow-up.
Specimen of a Purchase Order
Katech Corporation Ltd.
Cable ........................    Telephone ........................    Reg. No. ........................
To, M/s ........................ ........................ ........................
S. No. ........................    Date ........................    Our Ref. ........................
Please supply the following items in accordance with the terms and conditions mentioned herein ........................
Item No.    Description    Quantity    Price    Unit    Amount    Remarks
Terms and conditions: Delivery at ........................ Discount ........................ Excise Duty ........................ Sales Tax ........................ Freight ........................ Terms of Payment .................
For Katech Corporation Ltd. (Signature)
Acknowledgement
Kindly acknowledge the receipt of this order: Received on ........................ Date of Delivery ........................
6. Receiving and Inspection of Materials: When goods arrive, delivery is taken, the parcels or packets are unpacked, and the contents of the packages are checked by the receiving clerk against the order placed by the purchasing department with the vendor. After proper checking, the goods should be delivered to the laboratory or inspection department. The goods received note is prepared here.
Specimen of Goods Received Note
Katech Corporation Ltd.
Goods Received Note
From M/s (Supplier) ........................ ........................ ........................
Goods: Description    Code    Quantity    No. of Packets etc.    Order No.    Delivery Note No.    Demanded by department    Remarks
Inspection: Qty. rejected    Reason
Carrier    Received by    Store A ledger
7. Returning the Materials: If, on checking, any discrepancy is found as regards quality or quantity, it should immediately be referred to the purchasing department so that the discrepancy may be adjusted or steps may be taken to return the defective or damaged goods in exchange for material of proper quality, or against a credit note.
8. Payment of Purchased Material: After the required inspection etc., the final report is sent to the purchase officer, who sends it to the payment officer after making the required entries in the report. After checking the ledger, the payment officer authorises the accounts clerk to make the payment.
Quantity    Unit    Description    Code No.    Remarks
Authorised by ........    Issued by ........    Received by ........
Sigma Corporation Ltd.
Materials Requisition Slip
Job No. .......................    Department .......................
Please send the following materials.
Quantity    Code or Symbol    Description of Materials    Rate*    Amount*
* Both these entries are to be done by cost clerk.
Inter Departmental Transfer of Materials: (For details see Inventory Storing Procedure)
ABC Co. Ltd.
Materials Transfer Slips
Issuing Department ....................    Receiving Department ....................
Please receive the following materials.
Quantity    Code or Symbol    Description of Materials    Rate*    Amount    Reason for Transfer
* To be filled by Cost Clerk.
To Prepare Material Abstract: (For details see Inventory Storing System).
Sigma Corporation Ltd.
Material Abstract
Week ending on ....................
Materials Requisition Slips: Slip No.    Amount
Job Numbers: N1    N2    N3    N4    N5    N6
Total
6. Periodical Checking of Materials: To control the issue of materials, it is necessary that bin cards, store control records and store ledgers are checked regularly and, if any discrepancy is found, proper corrective action is taken.
7. Physical Stock Checking of Materials: Physical stock checking in stores should be done to prevent loss, damage and theft of materials. This checking can be done weekly, monthly, etc. Physical stock checking means the verification of the actual quantity in stores. It should be done by surprise or on a random basis. If any discrepancy is found, corrective action should be taken to reduce or eliminate it. The possible reasons may be wear and tear of materials, absorption of moisture, evaporation, waste, breakage, theft or wrong recording. This is assumed to be the best method of inventory control.
Maximum Level .........................    Minimum Level .........................    Danger Level .........................    Ordering Level .........................    Re-order Quantity .........................
Date    Qty.    Date    Qty.    Balance Qty.    Audit Date    Initial
2. Issue of Material from Store: The store undertakes the responsibility of issuing material to the using departments. In order to prevent malpractices, materials must be issued only against properly authorised requisition slips. These requisitions must be properly checked and scrutinised to avoid over-issue of materials. All requisitions received must be posted immediately, or daily, on the bin cards and on the stock control cards. Generally three copies of a requisition slip are prepared: the first two copies are given to the stores and the third copy is kept with the demanding department. The store in-charge keeps one copy of the requisition slip for himself and sends the other copy to the accounts department.
3. Return of Material to Store: If a department uses less material than it demanded, it returns the balance to stores. Goods return slips are sent along with the materials. The same specifications and details of materials are given in the goods return slips as were mentioned in the requisition slips. Three copies of goods return slips are prepared. The first two copies are sent to the stores department and the third copy is kept by the returning department itself. The store keeper sends one copy to the accounts department. Requisition slips and return slips are kept in different colours so that they can be identified easily.
4. Transfer of Material: The transfer of materials from one department to another is generally not appreciated, because it creates problems in the material control process. But sometimes, in an emergency, the transfer of material from one department to another is allowed. The department transferring the materials makes four copies of material transfer slips. The first copy is sent to the needy department along with the material. The second and third copies are sent to the stores department and accounts department for their information.
5. Material Abstract: In big industries where large quantities of materials are received, issued and transferred daily, a material abstract is prepared weekly or fortnightly to control the inventory. A physical verification of quantity in stores and other departments is done against the material abstract. If any discrepancy is found in the physical verification of quantity in the store or other departments, it is brought to the notice of top management. This type of check plays a very important role in inventory control. Thus a material abstract is a summary of materials received, issued and transferred for a given time period.
Check Your Progress 13.1
1. What are the main objectives of having Inventory Control?
2. Discuss ABC analysis techniques.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Minimum Stock Level = Re-order level - (Average rate of consumption × Lead time)
9. Inventory Turnover: In the case of slow-moving materials the maximum level is low, and in the case of quick-moving materials it is high.
10. Nature of Supply: If the supply is uncertain, the maximum level should be as high as possible.
11. Economic Order Quantity (EOQ): The maximum level largely depends on the economic order quantity because, unless otherwise contra-indicated, the economic order quantity decides the quantity ordered and hence the maximum level.
Computation of Ordering Level or Re-order Level. The formula is as follows:
Ordering Level or Re-order Level = Maximum usage per day × Maximum re-order period (or maximum delivery time)
Or
Re-order Level = Minimum Level + (Normal usage or average rate of consumption × Average re-order period or average delivery time)
Some concerns fix the danger level below the re-ordering level but above the minimum level. If action for purchase is taken as soon as the stock reaches the re-ordering level, the danger level bears no importance except that, when the stock reaches the danger level (but not yet the minimum level), a reference may be made to the purchase department to ensure that delivery is received before the actual stock reaches the minimum level. When the danger level is fixed below the minimum and the actual stock reaches it, a defect in the system is identified and corrective measures become necessary. When the danger level is fixed above the minimum and the actual stock reaches it, preventive measures are to be taken so that the stock does not go below the minimum level.

The danger level is the level below which the stock of material should never be allowed to fall. It is generally a level below the minimum level. As soon as the stock of material reaches this point, urgent action is needed to replenish the stock.

Determination of Danger Level. This is done as follows:
Danger Level = Two days of normal consumption

Re-order Quantity: The quantity which is ordered at the re-order point is called the re-order quantity. It is determined on the basis of the minimum stock level and the maximum stock level, and is normally expressed as the economic order quantity.
Check Your Progress 13.2
1. Differentiate:
(a) Minimum stock level and Maximum stock level
(b) Average Stock level and Danger Stock level
2. What is Re-order level? Write assumptions for ascertaining Re-order point.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Solution:
(i) Re-order Level = Maximum Usage × Maximum Re-order Period
For Component A = 75 × 6 = 450 Units
For Component B = 75 × 4 = 300 Units
(ii) Minimum Level = Re-order Level − (Normal Usage × Average Re-order Period)
For Component A = 450 − (50 × 5) = 200 Units
For Component B = 300 − (50 × 3) = 150 Units
Note: Average Re-order Period for Component A = (4 + 6) / 2 = 5
Average Re-order Period for Component B = (2 + 4) / 2 = 3
(iii) Maximum Level = (Re-order Level + Re-order Quantity) − (Minimum Usage × Minimum Re-order Period)
For Component A = (450 + 300) − (25 × 4) = 650 Units
For Component B = (300 + 500) − (25 × 2) = 750 Units

Example 2: From the following particulars, calculate: (a) Re-order Level, (b) Minimum Level, (c) Maximum Level, (d) Average Level:
Normal Usage: 100 units per day
Minimum Usage: 60 units per day
Maximum Usage: 130 units per day
Economic Order Quantity: 5,000 units
Re-order Period: 25 to 30 days
Solution:
(a) Re-order Level = Maximum Usage × Maximum Re-order Period = 130 × 30 = 3,900 units
(b) Minimum Level = Re-order Level − (Normal Usage × Average Re-order Period) = 3,900 − (100 × 27.5) = 1,150 units
Note: Average Re-order Period = (25 + 30) / 2 = 27.5 days
(c) Maximum Level = (Re-order Level + Re-order Quantity or EOQ) − (Minimum Usage × Minimum Re-order Period) = (3,900 + 5,000) − (60 × 25) = 7,400 Units
(d) Average Level =
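The level formulas used in Examples 1 and 2 can be checked with a short sketch. The function name `stock_levels` and its parameter names are illustrative assumptions, not notation from the lesson; only the three formulas themselves are taken from the text.

```python
def stock_levels(normal_usage, min_usage, max_usage,
                 min_period, max_period, reorder_qty):
    """Re-order, minimum and maximum stock levels as defined in the lesson."""
    avg_period = (min_period + max_period) / 2
    reorder_level = max_usage * max_period
    minimum_level = reorder_level - normal_usage * avg_period
    maximum_level = reorder_level + reorder_qty - min_usage * min_period
    return {
        "reorder_level": reorder_level,
        "minimum_level": minimum_level,
        "maximum_level": maximum_level,
    }

# Example 2: usage in units per day, re-order period of 25 to 30 days, EOQ of 5,000 units
example_2 = stock_levels(100, 60, 130, 25, 30, 5000)
# Example 1, Component A: usage 50 / 25 / 75, period 4 to 6, re-order quantity 300
component_a = stock_levels(50, 25, 75, 4, 6, 300)
print(example_2)
print(component_a)
```

The same call with Component B's figures (period 2 to 4, re-order quantity 500) reproduces the other column of Example 1.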
Example 3: A manufacturer buys costing equipment from outside suppliers at Rs. 30 per unit. Total annual needs are 800 units. The following data is available:
Annual Return on Investment: 10%
Rent, Insurance etc. per unit per year: Re. 1
Cost of Placing an order: Rs. 100
Determine the Economic Order Quantity.
Solution:
$EOQ = \sqrt{\dfrac{2 \times R \times C_p}{C_H}}$
Where, EOQ = Economic Order Quantity, R = Annual Requirement of Inventory, Cp = Cost of placing an order, CH = Annual holding or carrying cost per unit per year.
Given: R = 800 units, Cp = Rs. 100, CH = 10% of Rs. 30 + Re. 1 = Rs. 3 + Re. 1 = Rs. 4.
$EOQ = \sqrt{\dfrac{2 \times 800 \times 100}{4}} = \sqrt{40{,}000}$ = 200 units

Example 4: Fair Deal Limited uses Rs. 1,00,000 of materials per year. The administration cost per purchase is Rs. 100 and the carrying cost is 20% of the average inventory. The company has a purchase policy based on the economic order quantity but has been offered a discount of 0.5% if it purchases five times per year. Advise the company whether it should accept the new offer or not.
Solution: Given: R (in Rs.) = 1,00,000, Cp = Rs. 100, P = Re. 1.00, CH = 1.00 × 20% = Re. 0.20
E.O.Q. (in Rs.) =
$\sqrt{\dfrac{2 \times R \times C_p}{C_H}} = \sqrt{\dfrac{2 \times 1{,}00{,}000 \times 100}{0.20}} = \sqrt{10{,}00{,}00{,}000}$ = Rs. 10,000

Total inventory cost if each order is placed for Rs. 10,000:
(i) Cost of Materials = Rs. 1,00,000.00
(ii) Ordering Cost = (R / q0) × Cp = (1,00,000 / 10,000) × 100 = Rs. 1,000.00
(iii) Carrying Cost = (q0 / 2) × CH = (10,000 / 2) × 0.20 = Rs. 1,000.00
Total Cost = Rs. 1,02,000.00

Total cost if each order is placed for Rs. 19,900, i.e., Rs. 20,000 less 0.5% discount:
(i) Cost of Materials (19,900 × 5) = Rs. 99,500.00
(ii) Ordering Cost = (R / q0) × Cp = 5 × 100 = Rs. 500.00
(iii) Carrying Cost = (q0 / 2) × CH = (19,900 / 2) × 0.199 = Rs. 1,980.05
Total Cost = Rs. 1,01,980.05

[Note: Here P = Re. 1 − 0.5% of Re. 1 = Re. 0.995, and CH = 0.995 × 20% = Re. 0.199.]

On the basis of the above analysis the offer should be accepted, as it will save Rs. 1,02,000 − Rs. 1,01,980.05 = Rs. 19.95.

Example 5: A pharmaceutical factory consumes annually 6,000 kgms. of a chemical costing Rs. 5 per kgm. Placing each order costs Rs. 25 and the carrying cost is 6% per year per kgm. of average inventory. Find the Economic Order Quantity and the total inventory cost. The factory works for 300 days in a year. If the procurement time is 15 days and the safety stock 200 kgms., find the re-order point and the maximum and average inventory levels. If the supplier offers a discount of 5% on the cost price for a single order of the annual requirement, should the factory accept it?
Solution: Given: R = 6,000 kgms.; P = Rs. 5 per kgm.; Cp = Rs. 25; CH = 6% per kgm. per year of average inventory, i.e., Re. 0.30; No. of working days in a year = 300; Procurement time = 15 days; Safety Stock = 200 kgms.
E.O.Q. =
$\sqrt{\dfrac{2 \times R \times C_p}{C_H}} = \sqrt{\dfrac{2 \times 6{,}000 \times 25}{0.30}} = \sqrt{\dfrac{3{,}00{,}000}{0.30}} = \sqrt{10{,}00{,}000}$ = 1,000 kgms.

T.I.C. = (R × P) + (R / q0 × Cp) + (q0 / 2 × CH)
= (6,000 × 5) + (6,000 / 1,000 × 25) + (1,000 / 2 × 0.30)
= 30,000 + 150 + 150 = Rs. 30,300

Re-order Point = (R / No. of working days × Procurement time) + Safety Stock = (6,000 / 300 × 15) + 200 = 500 kgms.
Maximum Inventory = q0 + Safety Stock = 1,000 + 200 = 1,200 kgms.
Average Inventory = q0 / 2 + Safety Stock = 500 + 200 = 700 kgms.

TIC if a single order of 6,000 kgms. is placed:
Given: P = Rs. 5 − 5% of Rs. 5, i.e., 5 − 0.25 = Rs. 4.75
CH = 6% of average inventory, i.e., 4.75 × 6 / 100 = Re. 0.285
T.I.C. = (R × P) + (R / q0 × Cp) + (q0 / 2 × CH)
= (6,000 × 4.75) + (6,000 / 6,000 × 25) + (6,000 / 2 × 0.285)
= 28,500 + 25 + 855 = Rs. 29,380.
The company should accept the offer of 5% discount in purchase price by placing a single order of 6,000 kgms., because the total inventory cost in this case is less by Rs. 30,300 − Rs. 29,380 = Rs. 920 as compared to the total inventory cost without the discount offer.
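The EOQ and total inventory cost calculations in Examples 3 to 5 follow one pattern, which the following minimal sketch reproduces for Example 5. The function names `eoq` and `total_inventory_cost` are illustrative assumptions; the formulas are the ones quoted in the worked solutions.

```python
from math import sqrt

def eoq(annual_demand, order_cost, holding_cost):
    """Economic order quantity: sqrt(2 * R * Cp / CH)."""
    return sqrt(2 * annual_demand * order_cost / holding_cost)

def total_inventory_cost(annual_demand, unit_price, order_cost, holding_cost, order_qty):
    """Material cost + ordering cost + carrying cost for a given order size."""
    material = annual_demand * unit_price
    ordering = annual_demand / order_qty * order_cost
    carrying = order_qty / 2 * holding_cost
    return material + ordering + carrying

# Example 5: R = 6,000 kgms., P = Rs. 5, Cp = Rs. 25, CH = 6% of the price
R, P, CP = 6000, 5.0, 25.0
q0 = eoq(R, CP, 0.06 * P)
tic_eoq = total_inventory_cost(R, P, CP, 0.06 * P, q0)

# Single order of the whole annual requirement at a 5% discount
P_disc = P * 0.95
tic_single = total_inventory_cost(R, P_disc, CP, 0.06 * P_disc, R)

print(round(q0), round(tic_eoq, 2), round(tic_single, 2))
```

Comparing `tic_eoq` with `tic_single` gives the same decision rule as the text: accept the discount when the single-order cost is lower.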
Example 6: A trading company expects to sell 15,000 mixers during the coming year. The cost per mixer is Rs. 200. The cost of storing a mixer for 1 year is Rs. 5 and the ordering cost is Rs. 540 per order. Find the Economic Order Quantity. Would it be profitable for the company to accept a discount offer of 30% on a single order per year, the storing cost continuing to be Rs. 5 per mixer per year?
Solution: E.O.Q. =
$\sqrt{\dfrac{2 \times R \times C_P}{C_H}} = \sqrt{\dfrac{2 \times 15{,}000 \times 540}{5}} = \sqrt{32{,}40{,}000}$ = 1,800 mixers

T.I.C. = (R × P) + (R / q0 × CP) + (q0 / 2 × CH)
= (15,000 × 200) + (15,000 / 1,800 × 540) + (1,800 / 2 × 5)
= 30,00,000 + 4,500 + 4,500 = Rs. 30,09,000

T.I.C. if a single order is placed at 30% discount in price:
T.I.C. = (R × P) + (R / q0 × CP) + (q0 / 2 × CH)
= (15,000 × 140) + (15,000 / 15,000 × 540) + (15,000 / 2 × 5)
= 21,00,000 + 540 + 37,500 = Rs. 21,38,040
The company should accept the offer of 30% discount as it will save Rs. 30,09,000 − Rs. 21,38,040 = Rs. 8,70,960.

Example 7: A manufacturer requires 1,000 units of a raw material per month. The ordering cost is Rs. 15 per order. The carrying cost, in addition to Rs. 2 per unit, is estimated to be 15% of average inventory per unit per year. The purchase price of the raw material is Rs. 10 per unit. Find the Economic Lot Size and the total cost. The manufacturer is offered a 5% discount in purchase price for orders of 2,000 units or more but less than 5,000 units. A further 2% discount is available for orders of 5,000 or more units. Which of the three ways of purchase should he adopt?
Solution: Given: R = 1,000 units per month or 12,000 units per annum; Cp = Rs. 15 per order;
P = Rs. 10 per unit in case of orders for less than 2,000 units.
P = Rs. 10 − 5% of Rs. 10, i.e., Rs. 9.50 in case of orders for 2,000 or more units but less than 5,000 units.
P = Rs. 10 − 7% of Rs. 10, i.e., Rs. 9.30 in case of orders for 5,000 or more units.
CH = Rs. 2 + 15% of Rs. 10, i.e., Rs. 2 + 1.50 = Rs. 3.50 per unit per annum in case of orders for less than 2,000 units.
CH = Rs. 2 + 15% of Rs. 9.50 = Rs. 2 + 1.425 = Rs. 3.425 per unit per annum in case of orders for 2,000 units or more but less than 5,000 units.
CH = Rs. 2 + 15% of Rs. 9.30 = Rs. 2 + 1.395 = Rs. 3.395 per unit per annum in case of orders for 5,000 or more units.

Alternative I: In case of orders for less than 2,000 units:
E.O.Q. (q0) = $\sqrt{\dfrac{2 \times R \times C_P}{C_H}} = \sqrt{\dfrac{2 \times 12{,}000 \times 15}{3.50}} = \sqrt{\dfrac{3{,}60{,}000}{3.50}}$ = 321 units
T.I.C. = (R × P) + (R / q0 × CP) + (q0 / 2 × CH)
= (12,000 × 10) + (12,000 / 321 × 15) + (321 / 2 × 3.50)
= 1,20,000 + 561 + 562 = Rs. 1,21,123 (nearest to Rupee)

Alternative II: In case of orders for 2,000 or more units but less than 5,000 units:
E.O.Q. (q0) = $\sqrt{\dfrac{2 \times R \times C_P}{C_H}} = \sqrt{\dfrac{3{,}60{,}000}{3.425}} = \sqrt{1{,}05{,}109.49}$ = 324 units
As the Economic Lot Size (324 units) is less than the minimum ordering quantity (2,000 units), the company should order at least 2,000 units to get the 5% discount in purchase price. Thus, T.I.C. if q0 = 2,000 units:
T.I.C. = (R × P) + (R / q0 × CP) + (q0 / 2 × CH)
= (12,000 × 9.50) + (12,000 / 2,000 × 15) + (2,000 / 2 × 3.425)
= 1,14,000 + 90 + 3,425 = Rs. 1,17,515.

Alternative III: In case of orders of 5,000 or more units:
Economic Lot Size (q0) = $\sqrt{\dfrac{2 \times R \times C_P}{C_H}} = \sqrt{\dfrac{2 \times 12{,}000 \times 15}{3.395}} = \sqrt{\dfrac{3{,}60{,}000}{3.395}}$ = 326 units
As the Economic Lot Size (326 units) is less than the minimum ordering quantity (5,000 units), the company should order at least 5,000 units to get the 7% discount in purchase price. Thus, T.I.C. if q0 = 5,000 units:
T.I.C. = (R × P) + (R / q0 × CP) + (q0 / 2 × CH)
= (12,000 × 9.30) + (12,000 / 5,000 × 15) + (5,000 / 2 × 3.395)
= 1,11,600 + 36 + 8,487.50 = Rs. 1,20,123.50
On the basis of the above analysis we find that the T.I.C. is minimum (Rs. 1,17,515) in the second alternative. Hence the company should adopt this alternative.
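The three-alternative comparison in Example 7 can be sketched as a loop over price breaks. This is a simplified illustration under stated assumptions: the names `evaluate_price_breaks` and `total_inventory_cost` are hypothetical, the EOQ is rounded to whole units as in the text, and the sketch only raises the order size to the break quantity (it does not handle an EOQ that exceeds a break's upper limit, which does not arise here).

```python
from math import sqrt

def total_inventory_cost(R, price, CP, CH, q):
    """Material cost + ordering cost + carrying cost for order size q."""
    return R * price + (R / q) * CP + (q / 2) * CH

def evaluate_price_breaks(R, CP, fixed_holding, holding_rate, breaks):
    """breaks: list of (minimum order quantity, unit price).
    Returns one (order quantity, unit price, total cost) tuple per break."""
    results = []
    for min_qty, price in breaks:
        CH = fixed_holding + holding_rate * price   # e.g. Rs. 2 + 15% of price
        q = round(sqrt(2 * R * CP / CH))            # EOQ in whole units
        q = max(q, min_qty)                         # order at least the break quantity
        results.append((q, price, total_inventory_cost(R, price, CP, CH, q)))
    return results

# Example 7: R = 12,000 units a year, Cp = Rs. 15, CH = Rs. 2 + 15% of price
results = evaluate_price_breaks(12000, 15, 2.0, 0.15,
                                [(1, 10.0), (2000, 9.5), (5000, 9.3)])
best = min(results, key=lambda r: r[2])
for qty, price, cost in results:
    print(qty, price, round(cost, 2))
print("cheapest:", best[0], "units per order")
```

The first alternative comes out a fraction of a rupee below the text's Rs. 1,21,123 because the text rounds each cost component to the nearest rupee before adding.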
2.
13.14 KEYWORDS
Lead Time : Time between ordering and receiving the goods.
: Is a technique to maintain inventory at a desired level.
Maximum Level : Level of inventory beyond which inventory is not allowed.
Minimum Level : Level of inventory below which inventory is not allowed to fall.
: The next best alternative cost.
: The stock level which is sufficient for the lead time consumption.
Normal consumption: 625 units per day
Re-order Quantity: 8,800 units
Minimum period for receiving goods: 7 days
Maximum period for receiving goods: 15 days
Normal period for receiving goods: 10 days

8. A manufacturer's requirement for raw materials is 12,800 kgms. per annum. The purchase price is Rs. 50 per kgm. Ordering cost is Rs. 100 per order and carrying cost is 8% of average inventory. The manufacturer can procure the annual requirement of raw material either in one single lot or by ordering 400, 800, 1,600 or 3,200 kgms. at a time. Find which of these order quantities is the Economic Order Quantity using the tabular method.

9. The annual requirement of a product in a firm is 1,000 units. The purchase price per unit is Rs. 50; ordering cost is Rs. 150 per order and the carrying cost per unit of average inventory is 15%. The firm can procure its annual requirement either in one single lot or in various alternative lots of 100, 200, 250 or 500 units. Determine the Economic Order Quantity by the graphical method and, with the help of the three curves, show that at the EOQ level ordering and carrying costs are equal and total cost is minimum.

10. Calculate the Economic Order Quantity from the following information by using the tabular method, graphical method and mathematical method:
Annual usage: 10,000 units
Buying cost per order: Rs. 10
Cost per unit: Rs. 50
Cost of carrying inventory: 10% of Average Inventory

11. A company requires annually 12,000 lbs. of a chemical which costs Rs. 250 per lb. Placing each order costs the company Rs. 22.50, and the carrying cost is 15% of the cost of average inventory per annum.
(i) Find the Economic Order Quantity and total expenses on the chemical.
(ii) If in addition, the company decides to maintain a stock of 300 lbs., find the maximum as well as average inventory.

12. Calculate the Economic Order Quantity from the following information. Also state what will be the number of orders during the whole year:
Requirement of material per annum: 1,250 units
Cost of material per unit: Rs. 200
Cost of placing per order: Rs. 100
Holding cost per unit per annum: 8% of average inventory.

13. A manufacturer's requirement for a raw material is 2,000 units per year. The ordering costs are Rs. 10 per order while carrying costs are 16 paise per year per unit of average inventory. The purchase price of raw material is Re. 1 per unit.
(a) Find the Economic Order Quantity and the total inventory cost.
(b) If a discount of 5% is available for orders of 1,000 units, should the manufacturer accept this offer? (The carrying cost per unit per annum remains unchanged.)

14. A business unit expects to sell 60,500 units of a commodity during the coming year. The ordering cost per order is Rs. 840 and the cost per unit of the commodity is Rs. 200. The carrying cost per unit per annum is 0.5% of the average inventory. Find out the Economic Order Quantity. Would it be profitable for the business unit to accept a discount offer of 1% on a single order per year? In this case the storing cost per unit per year will increase to 0.75% of the average inventory.

15. A manufacturer requires 2,500 units of a raw material per month. The ordering cost is Rs. 20 per order. The carrying cost, in addition to Rs. 3 per unit, is estimated to be 10% of average inventory per unit per year. The purchase price of the raw material is Rs. 4 per unit. Find the Economic Lot Size and the Total Inventory Cost. The manufacturer is offered a discount in purchase price for orders of 1,000 units or more but less than 2,000 units. A further discount is available for orders of 2,000 or more units. Which of the three ways of purchase should he adopt?
MODEL ANSWERS TO QUESTIONS FOR DISCUSSION
(b) True (c) False (d) True
LESSON
14
GAME THEORY
CONTENTS
14.0 Aims and Objectives
14.1 Introduction
14.2 Two-person Zero-sum Game
14.3 Pure Strategies: Game with Saddle Point
14.4 Mixed Strategies: Games without Saddle Point
14.5 Dominance Property
14.6 Solving Problem on the Computer with TORA
14.7 Solving LP Model Games Graphically using Computer
14.8 Let us Sum Up
14.9 Lesson-end Activity
14.10 Keywords
14.11 Questions for Discussion
14.12 Terminal Questions
14.13 Model Answers to Questions for Discussion
14.14 Suggested Readings
14.1 INTRODUCTION
Game theory applies to those competitive situations which are technically known as competitive games or, in general, as games. A game is a competition involving two or more decision makers, each of whom is keen to win. The basic aim of this chapter is to study how optimal strategies are formulated in a conflict. Game theory is concerned less with finding a winning strategy for one particular conflict situation than with the general logic of rational decisions. After reading this unit, you should be able to take decisions under cut-throat competition, knowing that the outcome for a business enterprise depends on what the competitor will do.
In today's business world, decisions about many practical problems are made in a competitive situation, where two or more opponents are involved under conditions of competition and conflict. The outcome does not depend on the decision alone but also on the interaction between the decision-maker and the competitor. The objective of the theory of games is to determine the rules of rational behaviour in game situations, in which the outcomes depend on the actions of the interdependent players. A game refers to a situation in which two or more players are competing. A player may be an individual, a group or an organization. Game theory has formulated mathematical models that can be useful in decision-making in competitive situations. To get a better insight into the concept, we consider an example of a simple game. Let us assume that there are only two car manufacturers, company A and company B. The two companies have market shares for their product. Company A is planning to increase its market share for the next financial year. The vice-president of company A has come up with two strategies. One strategy is to modify the outer shape of the car and the other is to advertise on TV. Company B, knowing that these strategies, if adopted by company A, may lead to a decrease in its own market share, develops similar strategies to modify the shape of its car and to advertise on TV. Table 14.1 below gives the pay-off if both the companies adopt these strategies.
Table 14.1: The Pay Off if Both Companies Modify Shape & Advertise on TV
                               Company B
                          Modify shape   Advertise
Company A   Modify shape       4             6
            Advertise          8             5
The pay-off given is with respect to company A and represents company A's gain. Company B's pay-off is the opposite of each element. For example, when both companies choose the modification strategy, company A wins 4 and company B loses 4. In a game, each player has a set of strategies available. A strategy of a player is the list of all possible actions (courses of action) that are taken for every pay-off (outcome). The players also know the outcomes in advance. The players in the game strive for optimal strategies. An optimal strategy is the one which provides the best situation (maximum pay-off) to the player.

Payoff Matrix: Company A has strategies A1, A2, ..., Am, and Company B has strategies B1, B2, ..., Bn. The number of pay-offs or outcomes is m × n. The pay-off aij represents company A's gain from company B if company A selects strategy i and company B selects strategy j. At the same time, it is a loss for company B (−aij). The pay-off matrix is given (Table 14.2) with respect to company A. The game is zero-sum because the gain of one player is equal to the loss of the other and vice-versa.
Table 14.2: Pay-off Matrix
                            Company B Strategies
                        B1     B2     B3     ...    Bn
Company A         A1    a11    a12    a13    ...    a1n
Strategies        A2    a21    a22    a23    ...    a2n
                  A3    a31    a32    a33    ...    a3n
                  .     .      .      .      ...    .
                  Am    am1    am2    am3    ...    amn
iv.
                 Player B
                 1     2
Player A    1    1     3
            2    1     6
The game is worked out using minimax procedure. Find the smallest value in each row and select the largest value of these values. Next, find the largest value in each column and select the smallest of these numbers. The procedure is shown in Table 14.4.
Table 14.4: Minimax Procedure
                 Player B
                 1     2     Row Min
Player A    1    1     3     1
            2    1     6     1
Col Max          1     6
If the maximum of the row minima is equal to the minimum of the column maxima, then a saddle point exists.
Max Min = Min Max
1 = 1
Therefore, there is a saddle point. The strategies are:
Player A plays Strategy A1 (A → A1).
Player B plays Strategy B1 (B → B1).
Value of game = 1.
Example 2: Solve the game with the pay-off matrix for player A as given in Table 14.5.
Table 14.5: Game Problem
Player B B1 A1 Player A A2 A3 4 1 1 B2 0 4 5 B3 4 2 3
Solution: Find the smallest element in rows and largest elements in columns as shown in Table 14.6.
Table 14.6: Minimax Procedure
Select the largest of the row minima and the smallest of the column maxima. Check for the minimax criterion:
Max Min = Min Max
1 = 1
Therefore, there is a saddle point and it is a pure strategy.
Optimum Strategy:
Player A: A2 Strategy
Player B: B1 Strategy
The value of the game is 1.
Example 3: Check whether the game given in Table 14.7 is strictly determinable and fair.
Player B 1 2 1 Player A 2 0 8 7 0
Solution: The game is solved using maximin criteria as shown in Table 14.8.
Table 14.8: Maximin Procedure
The game is neither strictly determinable nor fair.
Example 4: Identify the optimal strategies for player A and player B for the game given below in Table 14.9. Also find whether the game is strictly determinable and fair.
Table 14.9: Game Problem
                 Player B
                 1     2     Row Min
Player A    1    4     0      0
            2    1    −3     −3
Col Max          4     0
Max Min = Min Max
0 = 0
The game is strictly determinable and fair. The saddle point exists and the game has a pure strategy. The optimal strategies are given in Table 14.10 (a, b).
Table 14.10: Optimal Strategies
(a) SA:   strategy      1     2
          probability   p1    p2
                        1     0
and
(b) SB:   strategy      1     2
          probability   q1    q2
                        0     1
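The maximin-minimax check used in Examples 1 to 4 can be written as a short sketch. The function name `find_saddle_point` is an illustrative assumption, and the Example 4 matrix is entered with the entry −3, which is assumed here because it is the only reading consistent with the row minimum and the game value of 0 quoted in the solution.

```python
def find_saddle_point(payoff):
    """Return (row, column, value) of a saddle point, or None if there is none.

    payoff holds Player A's gains; rows are A's strategies, columns are B's.
    Indices are zero-based.
    """
    row_mins = [min(row) for row in payoff]
    col_maxs = [max(col) for col in zip(*payoff)]
    maximin = max(row_mins)          # best of A's worst cases
    minimax = min(col_maxs)          # best of B's worst cases
    if maximin != minimax:
        return None                  # no pure-strategy solution
    return row_mins.index(maximin), col_maxs.index(minimax), maximin

example_4 = [[4, 0],
             [1, -3]]                # -3 is an assumed sign, see the note above
saddle = find_saddle_point(example_4)
print(saddle)                        # zero-based (row, column, value)

no_saddle = find_saddle_point([[5, 2],
                               [3, 4]])
print(no_saddle)
```

A game whose saddle-point value is 0, as here, is both strictly determinable and fair; a result of `None` means the players must use mixed strategies.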
Example 5 : Solve the game with the pay off matrix given in Table 14.11 and determine the best strategies for the companies A and B and find the value of the game for them.
Company B
2 Company A 1 2
4 5 6
2 4 2
Solution: The matrix is solved using maximin criteria, as shown in Table 14.12 below.
Table 14.12: Maximin Procedure
1 2 3
1 2 1 2 2
1. Discuss two-person zero-sum game.
2. What is minimax-minimin principle?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
                 Player B
                 B1     B2
Player A    A1   a11    a12
            A2   a21    a22

Let p1 and p2 be the probabilities for Player A. Let q1 and q2 be the probabilities for Player B. Let the optimal strategy be SA for player A and SB for player B. Then the optimal strategies are given in Tables 14.14 (a) and (b).

Table 14.14 (a), (b): Optimum Strategies
(a) SA:   strategy      A1    A2
          probability   p1    p2
and
(b) SB:   strategy      B1    B2
          probability   q1    q2

$p_1 = \dfrac{a_{22} - a_{21}}{(a_{11} + a_{22}) - (a_{12} + a_{21})}$ and $p_2 = 1 - p_1$

$q_1 = \dfrac{a_{22} - a_{12}}{(a_{11} + a_{22}) - (a_{12} + a_{21})}$ and $q_2 = 1 - q_1$

and the value of the game w.r.t. player A is given by

Value of the game, $v = \dfrac{a_{11} a_{22} - a_{12} a_{21}}{(a_{11} + a_{22}) - (a_{12} + a_{21})}$

Example 6: Solve the pay-off matrix given in Table 14.15 and determine the optimal strategies and the value of the game.
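Table 14.15: Game Problem
                 Player B
                 1     2
Player A    1    5     2
            2    3     4

Solution: Let the optimal strategies of SA and SB be as shown in Tables 14.16 (a, b).

Table 14.16 (a) and (b): Optimal Strategies
(a) SA:   strategy      A1    A2
          probability   p1    p2
and
(b) SB:   strategy      B1    B2
          probability   q1    q2

                 Player B
                 1     2     Row Min
Player A    1    5     2     2
            2    3     4     3
Col Max          5     4

Max Min = 3 and Min Max = 4, so there is no saddle point and the game has a mixed strategy.

$p_1 = \dfrac{4 - 3}{(5 + 4) - (2 + 3)} = \dfrac{1}{4}$, $p_2 = 1 - p_1 = \dfrac{3}{4}$

$q_1 = \dfrac{4 - 2}{(5 + 4) - (2 + 3)} = \dfrac{1}{2}$, $q_2 = 1 - q_1 = \dfrac{1}{2}$

Value of the game, $v = \dfrac{(5 \times 4) - (2 \times 3)}{(5 + 4) - (2 + 3)} = \dfrac{14}{4} = \dfrac{7}{2}$

The optimum mixed strategies are shown in Table 14.18 (a, b) below.

Table 14.18 (a) and (b): Optimum Mixed Strategies
(a) SA:   strategy      A1     A2
          probability   1/4    3/4
and
(b) SB:   strategy      B1     B2
          probability   1/2    1/2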
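The probability formulas for a 2 × 2 game without a saddle point can be checked with exact fractions. The sketch below is illustrative only: the function name `solve_2x2` is an assumption, and the formulas are valid only when the game has no saddle point. It is applied to the matrix with entries 5, 2, 3, 4 from Example 6.

```python
from fractions import Fraction

def solve_2x2(a11, a12, a21, a22):
    """Mixed-strategy solution of a 2 x 2 zero-sum game without a saddle point.

    Returns (p1, p2, q1, q2, v) as exact fractions.
    """
    d = (a11 + a22) - (a12 + a21)
    if d == 0:
        raise ValueError("formula does not apply: denominator is zero")
    p1 = Fraction(a22 - a21, d)
    q1 = Fraction(a22 - a12, d)
    v = Fraction(a11 * a22 - a12 * a21, d)
    return p1, 1 - p1, q1, 1 - q1, v

p1, p2, q1, q2, v = solve_2x2(5, 2, 3, 4)
print(p1, p2, q1, q2, v)
```

Using `Fraction` keeps results such as 1/4 and 7/2 exact, which makes them easy to compare with the hand calculation.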
                 Player B
                 1     2     3
Player A    1    1     7     2
            2    6     2     7
            3    5     1     6
Solution: Reduce the matrix by using the dominance property. In the given matrix for player A, all the elements in Row 3 are less than the corresponding elements of Row 2. Strategy 3 will not be selected by player A, because it gives less profit for player A than strategy 2. Row 3 is dominated by Row 2. Hence delete Row 3, as shown in Table 14.20.
Table 14.20: Reduced the Matrix by Using Dominance Property
                 Player B
                 1     2     3
Player A    1    1     7     2
            2    6     2     7
For Player B, Column 3 is dominated by Column 1 (here the dominance is opposite because Player B selects the minimum loss). Hence delete Column 3. We get the reduced 2 × 2 matrix as shown below in Table 14.21.

Table 14.21: Reduced 2 × 2 Matrix
                 Player B
                 1     2
Player A    1    1     7
            2    6     2

Now, solve the 2 × 2 matrix using the maximin criteria as shown below in Table 14.22.

Table 14.22: Maximin Procedure
                 Player B
                 1     2     Row Min
Player A    1    1     7     1
            2    6     2     2
Col Max          6     7

Therefore, there is no saddle point and the game has a mixed strategy. Applying the probability formula,

$p_1 = \dfrac{2 - 6}{(1 + 2) - (7 + 6)} = \dfrac{-4}{-10} = \dfrac{2}{5}$, $p_2 = 1 - p_1 = \dfrac{3}{5}$

$q_1 = \dfrac{2 - 7}{(1 + 2) - (7 + 6)} = \dfrac{-5}{-10} = \dfrac{1}{2}$, $q_2 = 1 - q_1 = \dfrac{1}{2}$

Value of the game, $v = \dfrac{(1 \times 2) - (7 \times 6)}{(1 + 2) - (7 + 6)} = \dfrac{-40}{-10} = 4$

The optimum mixed strategies are:
(a) SA:   strategy      A1     A2     A3
          probability   2/5    3/5    0
and
(b) SB:   strategy      B1     B2     B3
          probability   1/2    1/2    0
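The row and column deletions in this example can be automated. The sketch below is a minimal illustration, assuming the 3 × 3 matrix with rows (1, 7, 2), (6, 2, 7), (5, 1, 6); the function name `reduce_by_dominance` is hypothetical. It covers only dominance by a single row or column, not dominance by an average of two rows or columns.

```python
def reduce_by_dominance(payoff):
    """Repeatedly delete dominated rows (Player A) and columns (Player B).

    Returns the zero-based indices of the rows and columns that survive.
    """
    rows = list(range(len(payoff)))
    cols = list(range(len(payoff[0])))
    changed = True
    while changed:
        changed = False
        # A row is dominated if some other row is at least as good in every column.
        for r in rows:
            if any(s != r and all(payoff[s][c] >= payoff[r][c] for c in cols)
                   for s in rows):
                rows.remove(r)
                changed = True
                break
        if changed:
            continue
        # A column is dominated if some other column costs B no more in every row.
        for c in cols:
            if any(d != c and all(payoff[r][d] <= payoff[r][c] for r in rows)
                   for d in cols):
                cols.remove(c)
                changed = True
                break
    return rows, cols

matrix = [[1, 7, 2],
          [6, 2, 7],
          [5, 1, 6]]
kept_rows, kept_cols = reduce_by_dominance(matrix)
reduced = [[matrix[r][c] for c in kept_cols] for r in kept_rows]
print(kept_rows, kept_cols, reduced)
```

The surviving 2 × 2 matrix can then be solved with the probability formulas, as in the hand calculation.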
                 Player B
                 1      2      3      4
Player A    1    5    −10      9      0
            2    6      7      8      1
            3    8      7     15      1
            4    3      4     −1      4
Solution: Solve the given matrix using the maximin criteria as shown in Table 14.25.
Table 14.25: Maximin Procedure
                 Player B
                 1      2      3      4     Row Min
Player A    1    5    −10      9      0     −10
            2    6      7      8      1       1
            3    8      7     15      1       1
            4    3      4     −1      4      −1
Col Max          8      7     15      4
                 Player B
                 1      2      3      4
Player A    2    6      7      8      1
            3    8      7     15      1
            4    3      4     −1      4
Comparing column-wise, column 2 is dominated by column 4: every pay-off in column 2 is at least as large as the corresponding pay-off in column 4, so Player B, who seeks the minimum loss, never prefers column 2. Hence delete column 2. The matrix is further reduced as shown in Table 14.27.

Table 14.27: Matrix Further Reduced to 3 × 3 (Column 2 Deleted)
                 Player B
                 1      3      4
Player A    2    6      8      1
            3    8     15      1
            4    3     −1      4
Now, Row 2 is dominated by Row 3, hence delete Row 2, as shown in Table 14.28.
Table 14.28: Reduced Matrix (Row 2 Deleted)
                 Player B
                 1      3      4
Player A    3    8     15      1
            4    3     −1      4
Now, when comparing rows and columns, no single row or column dominates another. Since there is a tie while comparing the rows or columns, take the average of any two rows or columns and compare it with the remaining one. Averaging the columns in pairs gives the three combinations of matrices shown in Table 14.29 (a), (b) and (c).

Table 14.29 (a): average of columns 1 and 3, against column 4
                 (1 + 3)/2      4
Player A    3      11.5         1
            4       1           4

Table 14.29 (b): column 1, against the average of columns 3 and 4
                 1      (3 + 4)/2
Player A    3    8         8
            4    3         1.5

Table 14.29 (c): column 3, against the average of columns 1 and 4
                 3      (1 + 4)/2
Player A    3   15         4.5
            4   −1         3.5

When comparing column 1 and the average of column 3 and column 4, column 1 is dominated by the average of columns 3 and 4. Hence delete column 1. Finally, we get the 2 × 2 matrix as shown in Table 14.30.

Table 14.30
                 Player B
                 3      4
Player A    3   15      1
            4   −1      4
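The strategy for the matrix arrived at is a mixed strategy; using the probability formula, we find p1, p2 and q1, q2.

$p_1 = \dfrac{4 - (-1)}{(15 + 4) - (1 + (-1))} = \dfrac{5}{19}$, $p_2 = 1 - p_1 = \dfrac{14}{19}$

$q_1 = \dfrac{4 - 1}{(15 + 4) - (1 + (-1))} = \dfrac{3}{19}$, $q_2 = 1 - q_1 = \dfrac{16}{19}$

Value of the game, $v = \dfrac{(15 \times 4) - (1 \times (-1))}{(15 + 4) - (1 + (-1))} = \dfrac{61}{19}$

The optimum mixed strategies are given below in Table 14.31 (a, b).

(a) SA:   strategy      A1    A2    A3      A4
          probability   0     0     5/19    14/19
and
(b) SB:   strategy      B1    B2    B3      B4
          probability   0     0     3/19    16/19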
Figure 14.1: Solving Pure Strategy Problem Using TORA (Input Screen)
Now, go to the Solve menu and click. Another screen appears with Solved Problem. Select Solve Problem and click LP-based. Then select the output format screen and click Go to Output Screen. The following output screen is displayed, as shown in Figure 14.2.
Figure 14.2: Solving Pure Strategy Problem Using TORA (Output Screen)
The results of the problem can be read directly from the output screen.
Value of the Game to Player A = 1.00
Player A optimal strategies:
Strategies:    A1    A2    A3
Probability:   0     1     0
Player B optimal strategies:
Strategies:    B1    B2    B3
Probability:   1     0     0
The output also includes the linear programming formulation for Player A.
                 Player B
                 1     2
Player A    1    5     2
            2    3     4
Figure 14.3: Solving Mixed Strategy Problems Using TORA (Output Screen)
Here the players play both the strategies in what turns out to be a mixed strategy game.
              A1      A2
Player A :    0.25    0.75
              B1      B2
Player B :    0.5     0.5
Value of the game, v = 3.50.

Example 10: Solve the following 2 × 3 game given below in Table 14.33 graphically, using computer.

Table 14.33: Game Problem
                 Player B
                 B1    B2    B3
Player A   A1    1     3     10
           A2    9     5     2
Solution: The game does not possess any saddle point and hence the solution has mixed strategies. A's expected payoffs against B's pure moves are given in Table 14.34.

Table 14.34: Mixed Strategies Compared
B's pure strategy     A's expected payoff
B1                    p1 + 9 (1 − p1) = −8p1 + 9
B2                    3p1 + 5 (1 − p1) = −2p1 + 5
B3                    10p1 + 2 (1 − p1) = 8p1 + 2
The expected payoff equations are plotted as functions of p1, with the payoffs of each column represented as points on two vertical axes. Strategy B1 is plotted by joining value 1 on axis 2 with the value 9 on axis 1. Similarly, the other equations are drawn. The output using TORA is given in Figure 14.4 below:

Player A always wants to maximize his minimum expected payoff. Consider the highest point of intersection, I, on the lower envelope of A's expected payoff equations. The lines B2 and B3, passing through I, are the strategies that B needs to play. Therefore the given matrix is reduced to a 2 × 2 matrix as shown in Table 14.35.
Table 14.35: Reduced 2 × 2 Matrix
           B2    B3
A1         3     10
A2         5     2

Solving the 2 × 2 matrix, the optimal strategies are obtained using the usual method.

Table 14.36: Optimal Strategies
(a) SA:   strategy      A1      A2
          probability   0.30    0.70
and
(b) SB:   strategy      B1    B2      B3
          probability   0     0.80    0.20
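The graphical method for a 2 × n game amounts to finding the highest point on the lower envelope of the expected-payoff lines. The sketch below is one way to do this numerically, assuming the pay-offs (1, 3, 10) for A1 and (9, 5, 2) for A2; the function name `solve_2xn_graphically` is hypothetical and this is not a description of how TORA works internally.

```python
from fractions import Fraction
from itertools import combinations

def solve_2xn_graphically(payoff):
    """Maximise the lower envelope of A's expected-payoff lines over p1 in [0, 1].

    payoff has two rows (A1, A2) and one column per strategy of B.
    Returns (p1, value) as exact fractions.
    """
    row1, row2 = payoff
    # Column j gives the line a1j*p1 + a2j*(1 - p1) = (a1j - a2j)*p1 + a2j.
    lines = [(Fraction(a1 - a2), Fraction(a2)) for a1, a2 in zip(row1, row2)]

    def lower_envelope(p):
        return min(slope * p + intercept for slope, intercept in lines)

    # The maximum of the envelope lies at an end point or at a crossing of two lines.
    candidates = {Fraction(0), Fraction(1)}
    for (m1, c1), (m2, c2) in combinations(lines, 2):
        if m1 != m2:
            p = (c2 - c1) / (m1 - m2)
            if 0 <= p <= 1:
                candidates.add(p)
    best_p = max(candidates, key=lower_envelope)
    return best_p, lower_envelope(best_p)

p1, value = solve_2xn_graphically([[1, 3, 10],
                                   [9, 5, 2]])
print(p1, 1 - p1, value)
```

The value of p1 found this way agrees with the 0.30 shown for A1; the corresponding height of the envelope is the value of the game.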
                 Player B
                 B1     B2     B3
Player A   A1     4     −1     −4
           A2    −3      4     −2

The linear programming formulation is given by:
For Player A,
Maximize z = v
Subject to the constraints,
v − 4x1 + 3x2 ≤ 0 ......................(i)
v + x1 − 4x2 ≤ 0 ......................(ii)
v + 4x1 + 2x2 ≤ 0 ......................(iii)
x1 + x2 = 1 ......................(iv)
where x1, x2 ≥ 0 and v is unrestricted.
For Player B,
Minimize z = v
Subject to the constraints,
v − 4y1 + y2 + 4y3 ≥ 0 ......................(v)
v + 3y1 − 4y2 + 2y3 ≥ 0 ......................(vi)
y1 + y2 + y3 = 1 ......................(vii)
where y1, y2, y3 ≥ 0 and v is unrestricted.
The problem can be solved by using linear programming. It can also be solved as a two-person zero-sum game. The output result is given in Figure 14.5 below:
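A candidate solution of the two linear programs above can be checked without an LP solver: Player A's guaranteed payoff (the smallest column expectation) can never exceed Player B's guaranteed ceiling (the largest row expectation), so if the two are equal both strategies are optimal. The sketch below is illustrative; the pay-off signs (4, −1, −4) and (−3, 4, −2) and the exact fractions 1/9, 8/9 and 2/9, 0, 7/9 are assumptions chosen to match the constraints and the rounded probabilities 0.11, 0.89 and 0.22, 0, 0.78.

```python
from fractions import Fraction as F

payoff = [[4, -1, -4],
          [-3, 4, -2]]              # assumed signs, see the note above

x = [F(1, 9), F(8, 9)]              # Player A's mixed strategy (about 0.11, 0.89)
y = [F(2, 9), F(0), F(7, 9)]        # Player B's mixed strategy (about 0.22, 0, 0.78)

def guaranteed_for_a(payoff, x):
    """Largest v satisfying Player A's constraints for a fixed x."""
    n_cols = len(payoff[0])
    return min(sum(x[i] * payoff[i][j] for i in range(len(x)))
               for j in range(n_cols))

def guaranteed_for_b(payoff, y):
    """Smallest v satisfying Player B's constraints for a fixed y."""
    return max(sum(y[j] * payoff[i][j] for j in range(len(y)))
               for i in range(len(payoff)))

lower = guaranteed_for_a(payoff, x)
upper = guaranteed_for_b(payoff, y)
print(lower, upper, float(lower))
```

Because the two bounds coincide, the common number is the value of the game under these assumptions; its sign shows that the game favours Player B.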
The optimal strategies are:
              A1      A2
Player A :    0.11    0.89
              B1      B2    B3
Player B :    0.22    0     0.78
Value of the game, v = −2.22

Check Your Progress 14.2
Take a type of business problem of your choice in which game theory will be helpful.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
14.10 KEYWORDS
Two Person Game : A game that has only two players.
Zero Sum Game : A game in which one player wins and the other player loses.
Dominance : A process by which the size of the game will be reduced.
Strategy : The strategy of a player is the list of all possible actions that he takes for every pay-off. The strategy is classified into pure strategy and mixed strategy.
Pure Strategy : Pure strategy is always selecting a particular course of action with the probability of 1. For example, in case of two strategies, the probabilities of selecting the strategies for player A are p1 = 0 and p2 = 1.
Mixed Strategy : Mixed strategy is to choose at least two courses of action. The probability of selecting an individual strategy will be less than 1, but the sum of the probabilities will be 1. For example, if player A plays a mixed strategy, the probabilities of selection may be p1 = 0.45 and p2 = 0.55, whose sum is 0.45 + 0.55 = 1.
Saddle Point : Saddle point is a situation where both the players are facing pure strategies. When there is no saddle point, it indicates that the players will play both the strategies.
Minimax Criterion : Minimax criterion is selecting the strategies that minimize the loss for each player. In other words, the player always anticipates the worst possible outcome and chooses the strategy to get the maximum for profit and the minimum for loss.
Value of the Game : The value of the game is the expected gain of player A if both players use their best strategies. The best strategy is arrived at using the minimax criterion.
Graphical method can only be used in games with no saddle point. Concept of dominance is very useful for expanding the size of the matrix. Saddle point in a pay off matrix is one which is smallest value in its row and the largest value in its column. In two-person zero-sum game there will be more than two choices. Dominance occurs in the pay-off matrix. Best strategic are mixed strategies if there is no involvement of saddle point. Graphical method is feasible for Small values. When the game have no saddle point & also cannot be reduced by dominance. In game theory we determine the best strategies for each player. A saddle point is an element of the matrix. Game theory applies to those _________ situation which are technically known as competitive game. Strategy could be _________ or one. A game involving n-players is called a _________ game. Every course of action is a _________ strategy. In game theory all players act _________.
4. Write short notes on the following:
(a) The value of a game
(b) Zero-sum & non-zero-sum games
(c) Maximum & Minimum strategy
(d) Concept of dominance
(e) Pure strategy
(f) Mixed strategy
(g) Pay-off matrix
(h) Saddle point
(i) Optimum strategies
Exercise Problems
1. Using maximin criteria, identify whether the players play pure strategy or mixed strategies
(a) 1 1 2 (b) 1 1 2
2.
Player B 2 3 2 Player B 2 2 5 7 1 7 5
(a)                 Player B
                B1    B2    B3
Player A   A1    3     1     2
           A2    2     5     7
           A3    2     3     5
(b) B1 A1 Player A
3.
Player B B2 10 10 6 B3 8 14 10 B 3 10 20 4 1 A 2 3 4 15 10 15 1 1 3 4 0 2 2 2 2 1 3 2 0 3 3 4 1 1 2 2 6 4 4 B 1 1 2 5 10 30 5 20 20 40
A2 A3
2 3 4
20 30
5 10
4.
Company B B1 B2 B3 A1 15 25 35 A2 5 10 45 (b)
Company B B1 B2 B3 A1 7 Company A A2 1 A3 4 5 3 2 7 3 2
(a) Company A
A3 65 55 35
5. Consider the payoff matrix of player A and solve.
Player B 1 1 Player A 2
6.
2 3 11
3 7 8
4 4 4
5 6 7
6 8 9
6 7
Company A II 4 3 7 5 III 8 2 6 12 IV 18 12 16 10
I A Company B B C D 14 8 8 6
7.
        B1    B2    B3    B4    B5
A1       4     4     2     4     6
A2       8     6     8     4     0
A3      10     2     4     0    12
8.
Solve the following two-person zero-sum game to find the value of the game.
Company B 2 2 1 2 3 3 4 12 0 7 (b) 4 1 3 6 7 2 2 5 2
1 1 Company A 2 3 4
9.
2 6 -3 2
(a)
5 1
2 7 B
B1 A1 A A2
11.
B2 -2 2
B3 3 0
B4 1 1 Player B B 4 2 5 6 C 6 13 17 12 D 4 7 3 2
4 1
A 1 Player A 2 3 4 18 6 11 7
Player B 1 1 Player A 2 3 6 3 12 2 15 3 12 3 30 6 24 4 21 6 36 5 6 4 3
MODEL ANSWERS TO QUESTIONS FOR DISCUSSION
LESSON 15
SIMULATION
CONTENTS
15.0 Aims and Objectives
15.1 Introduction
15.2 Advantages and Disadvantages of Simulation
15.3 Monte Carlo Simulation
15.4 Simulation of Demand Forecasting Problem
15.5 Simulation of Queuing Problems
15.6 Simulation of Inventory Problems
15.7 Let us Sum Up
15.8 Lesson-end Activities
15.9 Keywords
15.10 Questions for Discussion
15.11 Terminal Questions
15.12 Model Answers to Questions for Discussion
15.13 Suggested Readings
15.1 INTRODUCTION
In the previous chapters, we formulated and analyzed various models of real-life problems. All these models were solved analytically using mathematical techniques. In certain cases, it might not be possible to formulate the entire problem or solve it through mathematical models. In such cases, simulation proves to be the most suitable method, and it offers a near-optimal solution. Simulation is a reflection of a real system, representing its characteristics and behaviour within a given set of conditions.
In simulation, the problem must be defined first. Secondly, the variables of the model are introduced, with the logical relationships among them. Then a suitable model is constructed. After developing the desired model, each alternative is evaluated by generating a series of values of the random variable, and the behaviour of the system is observed. Lastly, the results are examined and the best alternative is selected. The whole process is summarized in the flow chart in Figure 90.
The simulation technique is considered a valuable tool because of its wide area of application. It can be used to solve and analyze large and complex real-world problems. Simulation provides solutions to various problems in functional areas like production, marketing, finance, human resource, etc., and is useful in policy decisions through corporate planning models. Simulation experiments generate large amounts of data and information from a small sample of data, which considerably reduces the cost and time involved in the exercise. For example, if a study has to be carried out to determine the arrival rate of customers at a ticket booking counter, the data can be generated within a short span of time with the help of a computer.
[Figure 90: Flow chart of the simulation process. Boxes: Problem Definition; Introduction of Variables; Simulate; Examination of results; Selection of best alternative. Branch labels: Not Acceptable (two branches), Acceptable.]
15.2 ADVANTAGES AND DISADVANTAGES OF SIMULATION
Advantages
Simulation is best suited to analyze complex and large practical problems when it is not possible to solve them through a mathematical method. Simulation is flexible, hence changes in the system variables can be made to select the best solution among the various alternatives. In simulation, the experiments are carried out with the model without disturbing the system. Policy decisions can be made much faster by knowing the options well in advance and by reducing the risk of experimenting in the real system.
Disadvantages
Simulation does not generate optimal solutions. It may take a long time to develop a good simulation model. In certain cases simulation models can be very expensive. The decision-maker must provide all information (depending on the model) about the constraints and conditions for examination, as simulation does not give the answers by itself.
15.4 SIMULATION OF DEMAND FORECASTING PROBLEM
Example 1: An ice-cream parlor's record of the previous month's sales of a particular variety of ice cream is as follows (see Table 15.1).
Table 15.1: Simulation of Demand Problem
Demand (No. of Ice-creams)    No. of days
4                             5
5                             10
6                             6
7                             8
8                             1
Simulate the demand for the first 10 days of the month.
Solution: Find the probability distribution of demand by expressing the frequencies as proportions, i.e., divide each frequency by the total of 30 days. The demand per day has the distribution shown in Table 15.2.
Table 15.2: Probability Distribution of Demand
Demand    Probability
4         0.17
5         0.33
6         0.20
7         0.27
8         0.03
Find the cumulative probability and assign a set of random number intervals to various demand levels. The probability figures are in two digits, hence we use two digit random numbers taken from a random number table. The random numbers are selected from the table from any row or column, but in a consecutive manner and random intervals are set using the cumulative probability distribution as shown in Table 15.3.
To simulate the demand for ten days, select ten random numbers from random number tables. The random numbers selected are 17, 46, 85, 09, 50, 58, 04, 77, 69 and 74. The first random number selected, 17, lies in the random number interval 17-49, which corresponds to a demand of 5 ice-creams per day. Hence, the demand for day one is 5. Similarly, the demand for the remaining days is simulated as shown in Table 15.4.
Table 15.4: Demand Simulation
Day    Random Number    Demand
1      17               5
2      46               5
3      85               7
4      09               4
5      50               6
6      58               6
7      04               4
8      77               7
9      69               6
10     74               7
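The table lookup used in Example 1 can be sketched in Python. This is a minimal illustration, not the book's procedure verbatim: the function names `build_intervals` and `demand_for` are hypothetical, and the intervals are derived from the cumulative frequencies rounded to two digits, which reproduces the interval 17-49 quoted above for a demand of 5.

```python
from itertools import accumulate

DEMANDS = [4, 5, 6, 7, 8]        # demand levels from Table 15.1
DAYS = [5, 10, 6, 8, 1]          # number of days each demand was observed
RANDOM_NUMBERS = [17, 46, 85, 9, 50, 58, 4, 77, 69, 74]


def build_intervals(demands, frequencies):
    """Return (low, high, demand) tuples covering the two-digit numbers 00-99."""
    total = sum(frequencies)
    # Cumulative probability as a whole number of hundredths (17, 50, 70, 97, 100).
    cumulative = [round(100 * c / total) for c in accumulate(frequencies)]
    intervals, low = [], 0
    for demand, upper in zip(demands, cumulative):
        intervals.append((low, upper - 1, demand))
        low = upper
    return intervals


def demand_for(random_number, intervals):
    """Look up the demand whose interval contains the random number."""
    for low, high, demand in intervals:
        if low <= random_number <= high:
            return demand
    raise ValueError("random number must lie between 00 and 99")


intervals = build_intervals(DEMANDS, DAYS)
simulated = [demand_for(rn, intervals) for rn in RANDOM_NUMBERS]

for day, (rn, demand) in enumerate(zip(RANDOM_NUMBERS, simulated), start=1):
    print(f"Day {day:2d}: random number {rn:02d} -> demand {demand}")
```

The same two functions can be reused for any discrete distribution in this lesson by changing the demand levels and frequencies.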
Example 2: A dealer sells a particular model of washing machine for which the probability distribution of daily demand is as given in Table 15.5.
Table 15.5: Probability Distribution of Daily Demand
Demand/day    Probability
0             0.05
1             0.25
2             0.20
3             0.25
4             0.10
5             0.15
Find the average demand of washing machines per day.
Solution: Assign sets of two-digit random numbers to demand levels as shown in Table 15.6.
Table 15.6: Random Numbers Assigned to Demand
Demand    Probability    Cumulative Probability    Random Number Intervals
0         0.05           0.05                      00-04
1         0.25           0.30                      05-29
2         0.20           0.50                      30-49
3         0.25           0.75                      50-74
4         0.10           0.85                      75-84
5         0.15           1.00                      85-99
Ten random numbers that have been selected from random number tables are 68, 47, 92, 76, 86, 46, 16, 28, 35, 54. To find the demand for ten days see the Table 15.7 below.
Table 15.7: Ten Random Numbers Selected
Trial No    Random Number    Demand / day
1           68               3
2           47               2
3           92               5
4           76               4
5           86               5
6           46               2
7           16               1
8           28               1
9           35               2
10          54               3
Total                        28
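The lookup in Table 15.7 can be sketched in Python, together with the ten-day average and the theoretical expected demand it is compared against. The names `demand_for` and `long_run_average` are hypothetical, and the long-run experiment uses Python's pseudo-random generator in place of a printed random number table, which is an assumption of this sketch.

```python
import random
from bisect import bisect_left

DEMAND_LEVELS = [0, 1, 2, 3, 4, 5]
PROBABILITIES = [0.05, 0.25, 0.20, 0.25, 0.10, 0.15]
UPPER_BOUNDS = [4, 29, 49, 74, 84, 99]   # upper ends of the intervals in Table 15.6
RANDOM_NUMBERS = [68, 47, 92, 76, 86, 46, 16, 28, 35, 54]


def demand_for(random_number):
    """Return the demand whose random number interval contains the number."""
    return DEMAND_LEVELS[bisect_left(UPPER_BOUNDS, random_number)]


def long_run_average(trials, seed=0):
    """Average simulated demand over many pseudo-random two-digit numbers."""
    rng = random.Random(seed)
    total = sum(demand_for(rng.randrange(100)) for _ in range(trials))
    return total / trials


simulated = [demand_for(rn) for rn in RANDOM_NUMBERS]
average_demand = sum(simulated) / len(simulated)
expected_demand = sum(p * x for p, x in zip(PROBABILITIES, DEMAND_LEVELS))

print("Simulated demand:", simulated)
print("Ten-day average:", average_demand)
print("Expected demand:", round(expected_demand, 2))
print("Average over 100000 trials:", round(long_run_average(100_000, seed=1), 3))
```

Increasing the number of trials is one way to see the simulated average settle near the expected value.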
Average demand = 28/10 = 2.8 washing machines per day.
The expected demand per day can be computed as,
Expected demand per day = $\sum_{i} p_i x_i$ .......................(1)
where, $p_i$ = probability and $x_i$ = demand
= (0.05 × 0) + (0.25 × 1) + (0.20 × 2) + (0.25 × 3) + (0.10 × 4) + (0.15 × 5) = 2.55 washing machines.
The average demand of 2.8 washing machines obtained from the ten-day simulation differs significantly from the expected daily demand. If the simulation is repeated a large number of times, the average would get closer to the expected daily demand.
Example 3: A farmer has 10 acres of agricultural land and is cultivating tomatoes on the entire land. Due to fluctuation in water availability, the yield per acre differs. The probability distribution of yields is given in Table 15.8.
a. The farmer is interested to know the yield for the next 12 months if the same water availability exists. Simulate the average yield using the following random numbers: 50, 28, 68, 36, 90, 62, 27, 50, 18, 36, 61 and 21.
Table 15.8: Simulation Problem
Yield of tomatoes per acre (kg)    Probability
200                                0.15
220                                0.25
240                                0.35
260                                0.13
280                                0.12
b. Due to fluctuating market price, the price per kg of tomatoes varies from Rs. 5.00 to Rs. 10.00 per kg. The probability of price variations is given in Table 15.9 below. Simulate the price for the next 12 months to determine the revenue per acre. Also find the average revenue per acre. Use the following random numbers: 53, 74, 05, 71, 06, 49, 11, 13, 62, 69, 85 and 69.
Table 15.9: Simulation Problem
Price per kg (Rs)    Probability
5.50                 0.05
6.50                 0.15
7.50                 0.30
8.00                 0.25
10.00                0.15
Solution:
Table 15.10: Table for Random Number Interval for Yield
Yield of tomatoes per acre    Probability    Cumulative Probability    Random Number Interval
200                           0.15           0.15                      00-14
220                           0.25           0.40                      15-39
240                           0.35           0.75                      40-74
260                           0.13           0.88                      75-87
280                           0.12           1.00                      88-99
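Part (a) of Example 3 can be sketched in Python using the intervals of Table 15.10 and the twelve random numbers given in the problem. The variable names are hypothetical and the sketch covers only the yield, not the price or revenue.

```python
from bisect import bisect_left

YIELDS = [200, 220, 240, 260, 280]          # kg per acre
UPPER_BOUNDS = [14, 39, 74, 87, 99]         # upper ends of the intervals in Table 15.10
RANDOM_NUMBERS = [50, 28, 68, 36, 90, 62, 27, 50, 18, 36, 61, 21]

# Each random number selects the yield whose interval contains it.
monthly_yield = [YIELDS[bisect_left(UPPER_BOUNDS, rn)] for rn in RANDOM_NUMBERS]
average_yield = sum(monthly_yield) / len(monthly_yield)

for month, (rn, kg) in enumerate(zip(RANDOM_NUMBERS, monthly_yield), start=1):
    print(f"Month {month:2d}: random number {rn:02d} -> yield {kg} kg per acre")
print(f"Average yield per acre: {average_yield:.2f} kg")
```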
Simulated revenue per acre (Rs.) for the 12 months: 1960, 1760, 1560, 1760, 1820, 1800, 1430, 1560, 1760, 1760, 2400, 1760 (total 21330)
Average revenue per acre = 21330 / 12 = Rs. 1777.50
Example 4: J.M Bakers has to supply only 200 pizzas every day to their outlet situated in city bazaar. The production of pizzas varies due to the availability of raw materials and labor. The probability distribution of production, based on observation, is as follows:
Table 15.13: Simulation Problem
Production per day    Probability
196                   0.06
197                   0.09
198                   0.10
199                   0.16
200                   0.20
201                   0.21
202                   0.08
203                   0.07
204                   0.03
Simulate and find the average number of pizzas produced in excess of the requirement and the average shortage of pizzas supplied to the outlet.
Solution: Assign two-digit random numbers to the demand levels as shown in Table 15.14.
Table 15.14: Random Numbers Assigned to the Demand Levels
Demand    Probability    Cumulative Probability    Random Number Interval
196       0.06           0.06                      00-05
197       0.09           0.15                      06-14
198       0.10           0.25                      15-24
199       0.16           0.41                      25-40
200       0.20           0.61                      41-60
201       0.21           0.82                      61-81
202       0.08           0.90                      82-89
203       0.07           0.97                      90-96
204       0.03           1.00                      97-99
Select 15 random numbers from a random number table and simulate the production per day as shown in Table 15.15 below.
Table 15.15: Simulation of Production Per Day
Trial Number    Random Number    Production Per Day    No. of Pizzas Over-produced    No. of Pizzas Shortage
1               26               199                   -                              1
2               45               200                   -                              -
3               74               201                   1                              -
4               77               201                   1                              -
5               74               201                   1                              -
6               51               200                   -                              -
7               92               203                   3                              -
8               43               200                   -                              -
9               37               199                   -                              1
10              29               199                   -                              1
11              65               201                   1                              -
12              39               199                   -                              1
13              45               200                   -                              -
14              95               203                   3                              -
15              93               203                   3                              -
Total                                                  13                             4
The average number of pizzas produced more than the requirement = 13/15 = 0.87 per day
The average number of shortage of pizzas supplied = 4/15 = 0.27 per day
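The over-production and shortage counts of Example 4 can be recomputed with a short Python sketch. It assumes the random number intervals of Table 15.14 and the fifteen random numbers of Table 15.15; the variable names are hypothetical.

```python
from bisect import bisect_left

REQUIRED = 200                                   # pizzas to be supplied each day
PRODUCTION_LEVELS = list(range(196, 205))        # 196, 197, ..., 204
UPPER_BOUNDS = [5, 14, 24, 40, 60, 81, 89, 96, 99]
RANDOM_NUMBERS = [26, 45, 74, 77, 74, 51, 92, 43, 37, 29, 65, 39, 45, 95, 93]

produced = [PRODUCTION_LEVELS[bisect_left(UPPER_BOUNDS, rn)] for rn in RANDOM_NUMBERS]

# A day contributes to exactly one of the two columns (or to neither at 200).
over_produced = [max(p - REQUIRED, 0) for p in produced]
shortage = [max(REQUIRED - p, 0) for p in produced]

total_over = sum(over_produced)
total_short = sum(shortage)
trials = len(produced)

print("Production per day:", produced)
print(f"Average over-production per day: {total_over}/{trials} = {total_over / trials:.2f}")
print(f"Average shortage per day: {total_short}/{trials} = {total_short / trials:.2f}")
```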
Check Your Progress 15.1
1. Discuss the role of simulation in demand forecasting.
2. What is Monte Carlo simulation?
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
Mr. Srinivasan will implement the plan if the average waiting time of a customer in the system is less than 5 minutes. Before implementing the plan, Mr. Srinivasan would like to know the following:
i. Mean waiting time of customers, before service.
ii. Average service time.
iii. Average idle time of service.
iv. The time spent by the customer in the system.
Simulate the operation of the facility for a sample of 20 arriving cars, with the restaurant starting at 7.00 pm every day, and find whether Mr. Srinivasan will go for the plan.
Solution: Allot the random numbers to the various inter-arrival and service times as shown in Table 15.17.
Table 15.17: Random Numbers Allocated to Various Inter-Arrival Service Times
Sl. No.   Random Number   Inter Arrival   Arrival   Service     Random Number   Service      Service   Waiting Time (Min)
          (Arrival)       Time (Min)      Time at   Starts at   (Service)       Time (Min)   Ends at   Customer   Service
1         87              6               7.06      7.06        36              4            7.10      -          6
2         37              3               7.09      7.10        16              3            7.13      1          -
3         92              6               7.15      7.15        81              5            7.20      -          2
4         52              4               7.19      7.20        08              2            7.22      1          -
5         41              4               7.23      7.23        51              4            7.27      -          1
6         05              2               7.25      7.27        34              3            7.30      2          -
7         56              4               7.29      7.30        88              6            7.36      1          -
8         70              5               7.34      7.36        88              6            7.42      2          -
9         70              5               7.39      7.42        15              3            7.45      3          -
10        07              2               7.41      7.45        53              4            7.49      4          -
11        86              6               7.47      7.49        01              2            7.51      2          -
12        74              5               7.52      7.52        54              4            7.56      -          1
13        31              3               7.55      7.56        03              2            7.58      1          -
14        71              5               8.00      8.00        54              4            8.04      -          2
15        57              4               8.04      8.04        56              4            8.08      -          -
16        85              6               8.10      8.10        05              2            8.12      -          2
17        39              3               8.13      8.13        01              2            8.15      -          1
18        41              4               8.17      8.17        45              4            8.21      -          2
19        18              3               8.20      8.21        11              3            8.24      1          -
20        38              3               8.23      8.24        76              5            8.29      1          -
Total                     83                                                    72                     19         17
i. Mean waiting time of customer before service = 19/20 = 0.95 minutes
ii. Average service time = 72/20 = 3.6 minutes
iii. Average service idle time = 17/20 = 0.85 minutes
iv. Time spent by the customer in the system = 3.6 + 0.95 = 4.55 minutes.
Since the time spent by a customer in the system is less than 5 minutes, Mr. Srinivasan will implement the plan.
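The bookkeeping behind Table 15.17 can be sketched in Python. The sketch takes the simulated inter-arrival and service times from the table as given (it does not regenerate them from random numbers) and recomputes the customer waiting time and the server idle time directly from the clock times. The function names `simulate_single_server` and `to_clock` are hypothetical.

```python
INTER_ARRIVAL = [6, 3, 6, 4, 4, 2, 4, 5, 5, 2, 6, 5, 3, 5, 4, 6, 3, 4, 3, 3]
SERVICE = [4, 3, 5, 2, 4, 3, 6, 6, 3, 4, 2, 4, 2, 4, 4, 2, 2, 4, 3, 5]


def simulate_single_server(inter_arrival, service):
    """Single server, first come first served; times are minutes after opening."""
    rows = []
    arrival = 0
    server_free_at = 0
    for gap, duration in zip(inter_arrival, service):
        arrival += gap
        start = max(arrival, server_free_at)      # wait if the server is busy
        rows.append({
            "arrival": arrival,
            "start": start,
            "end": start + duration,
            "wait": start - arrival,              # customer waiting time
            "idle": start - server_free_at,       # server idle time before this job
        })
        server_free_at = start + duration
    return rows


def to_clock(minutes, opening_hour=7):
    """Format minutes after 7.00 pm in the table's h.mm style."""
    return f"{opening_hour + minutes // 60}.{minutes % 60:02d}"


rows = simulate_single_server(INTER_ARRIVAL, SERVICE)
n = len(rows)
total_wait = sum(r["wait"] for r in rows)
total_idle = sum(r["idle"] for r in rows)
mean_service = sum(SERVICE) / n

print("Last service ends at", to_clock(rows[-1]["end"]))
print("Mean waiting time:", total_wait / n)
print("Average service time:", mean_service)
print("Average idle time:", total_idle / n)
print("Time in system:", mean_service + total_wait / n)
```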
Example 6: Dr. Strong, a dentist schedules all his patients for 30 minute appointments. Some of the patients take more or less than 30 minutes depending on the type of dental work to be done. The following Table 15.18 shows the summary of the various categories of work, their probabilities and the time actually needed to complete the work.
Simulate the dentist's clinic for four hours and determine the average waiting time for the patients as well as the idleness of the doctor. Assume that all the patients show up at the clinic exactly at their scheduled arrival time, starting at 8.00 am. Use the following random numbers for handling the above problem: 40, 82, 11, 34, 25, 66, 17, 79.
Solution: Assign the random number intervals to the various categories of work as shown in Table 15.19.
Table 15.19: Random Number Intervals Assigned to the Various Categories
Category of work    Probability    Cumulative probability    Random Number Interval
Filling             0.40           0.40                      00-39
Crown               0.15           0.55                      40-54
Cleaning            0.15           0.70                      55-69
Extraction          0.10           0.80                      70-79
Check-up            0.20           1.00                      80-99
Assuming the dentist clinic starts at 8.00 am, the arrival pattern and the service category are shown in Table 15.20.
Table 15.20: Arrival Pattern of the Patients
Patient Number    Scheduled Arrival    Random Number    Service category    Service Time
1                 8.00                 40               Crown               60
2                 8.30                 82               Check-up            15
3                 9.00                 11               Filling             45
4                 9.30                 34               Filling             45
5                 10.00                25               Filling             45
6                 10.30                66               Cleaning            15
7                 11.00                17               Filling             45
8                 11.30                79               Extraction          45
Table 15.21: The arrival, departure patterns and patients' waiting time are tabulated.
Time     Event (Patient Number)    Patient Number (Time to go)    Waiting (Patient Number)
8.00     1 arrives                 1 (60)                         -
8.30     2 arrives                 1 (30)                         2
9.00     1 departs, 3 arrives      2 (15)                         3
9.15     2 departs                 3 (45)                         -
9.30     4 arrives                 3 (30)                         4
10.00    3 departs, 5 arrives      4 (45)                         5
10.30    6 arrives                 4 (15)                         5, 6
10.45    4 departs                 5 (45)                         6
11.00    7 arrives                 5 (30)                         6, 7
11.30    5 departs, 8 arrives      6 (15)                         7, 8
11.45    6 departs                 7 (45)                         8
12.00    End                       7 (30)                         8
The dentist was not idle during the simulation period. The waiting times for the patients are as given in Table 15.22 below.
Table 15.22: Patient's Waiting Time
Patient    Arrival Time    Service Starts    Waiting time (minutes)
1          8.00            8.00              0
2          8.30            9.00              30
3          9.00            9.15              15
4          9.30            10.00             30
5          10.00           10.45             45
6          10.30           11.30             60
7          11.00           11.45             45
8          11.30           12.30             60
Total                                        285
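The dentist example can be sketched in Python as follows. The random number intervals come from Table 15.19; the service time of each category is read off Table 15.20 (an assumption of this sketch, since the original summary table of times is not reproduced here). Names such as `category_for` are hypothetical.

```python
from bisect import bisect_left

CATEGORIES = ["Filling", "Crown", "Cleaning", "Extraction", "Check-up"]
UPPER_BOUNDS = [39, 54, 69, 79, 99]              # intervals from Table 15.19
# Minutes actually needed per category, as listed in Table 15.20.
SERVICE_MINUTES = {"Filling": 45, "Crown": 60, "Cleaning": 15,
                   "Extraction": 45, "Check-up": 15}
RANDOM_NUMBERS = [40, 82, 11, 34, 25, 66, 17, 79]
APPOINTMENT_GAP = 30                             # one patient every 30 minutes


def category_for(random_number):
    """Return the work category whose interval contains the random number."""
    return CATEGORIES[bisect_left(UPPER_BOUNDS, random_number)]


waits = []
idle_minutes = 0
dentist_free_at = 0                              # minutes after 8.00 am
for patient, rn in enumerate(RANDOM_NUMBERS):
    arrival = patient * APPOINTMENT_GAP          # patients arrive exactly on schedule
    start = max(arrival, dentist_free_at)
    idle_minutes += start - dentist_free_at
    waits.append(start - arrival)
    dentist_free_at = start + SERVICE_MINUTES[category_for(rn)]

total_wait = sum(waits)
average_wait = total_wait / len(waits)

print("Waiting times (minutes):", waits)
print("Total waiting time:", total_wait)
print("Average waiting time per patient:", average_wait)
print("Dentist idle time (minutes):", idle_minutes)
```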
The various costs involved are,
Ordering Cost = Rs. 50 per order
Holding Cost = Rs. 1 per unit per day
Shortage Cost = Rs. 20 per unit per day
The dealer is interested in having an inventory policy with two parameters, the re-order point and the order quantity, i.e., at what level of existing inventory an order should be placed and the number of units to be ordered. Evaluate a simulation plan for 35 days, which calls for a re-order quantity of 35 units and a re-order level of 20 units, with a beginning inventory balance of 45 units.
Solution: The assignment of random number intervals for the demand distribution and the lead-time distribution is shown in Tables 15.25 and 15.26 respectively.
Table 15.25: Random Numbers Assigned for Demand Per Day
Demand per day    Probability    Cumulative probability    Random Number Interval
2                 0.05           0.05                      00-04
3                 0.07           0.12                      05-11
4                 0.09           0.21                      12-20
5                 0.15           0.36                      21-35
6                 0.20           0.56                      36-55
7                 0.21           0.77                      56-76
8                 0.10           0.87                      77-86
9                 0.07           0.94                      87-93
10                0.06           1.00                      94-99
Simulated demand per day for days 1 to 35: 7, 6, 6, 6, 6, 6, 7, 5, 4, 6, 6, 7, 6, 4, 7, 6, 10, 5, 8, 8, 3, 4, 6, 6, 5, 6, 9, 4, 7, 5, 10, 3, 7, 2, 5
The simulation of 35 days, with an inventory policy of re-ordering 35 units when the inventory level at the end of the day reaches the re-order level of 20 units, is worked out in Table 15.27. The table shows the demand, inventory level, quantity received, ordering cost, holding cost and shortage cost for each day.
On completing the 35-day period, the costs are
Total ordering cost = 6 × 50 = Rs. 300.00
Total holding cost = Rs. 768.00
Since the demand for each day is satisfied, there is no shortage cost.
Therefore, Total cost = 300 + 768 = Rs. 1068.00
For a different set of parameters, with a re-order quantity of 30 units and the same re-order level of 20 units, if the 35-day simulation is performed, we get the total of the various costs as shown in Table 15.28.
Total ordering cost = 6 × 50 = Rs. 300.00
Total holding cost = Rs. 683.00
Total shortage cost = Rs. 20.00
Therefore, Total cost = 300 + 683 + 20 = Rs. 1003.00
If we compare the two sets of parameters, Case II has a lower total cost than Case I. At the same time, it does not satisfy the demand on the 33rd day, which might cause customer dissatisfaction and lead to some additional cost. In this type of problem, various combinations of the two parameter values are simulated a large number of times to find the total cost of each experiment; the total costs are then compared and the optimum alternative, i.e., the one which incurs the lowest cost, is selected.
Check Your Progress 15.2
1. Explain how computers make ideal aides in simulating complex tasks.
2. What are the two types of computer programming languages that are available to facilitate the simulation process?
3. Why is the computer necessary in conducting a real-world simulation?
4. Do you think the application of simulation will grow strongly in the coming 10 years?
5. Draw a flow diagram for the simulation of electric maintenance by the Power Corporation of India Ltd.
Notes: (a) Write your answer in the space given below.
(b) Please go through the lesson sub-head thoroughly; you will get your answers in it.
(c) This Check Your Progress will help you to understand the lesson better. Try to write answers for them, but do not submit your answers to the university for assessment. These are for your practice only.
15.9 KEYWORDS
Simulation : A management science analysis that brings into play the construction of a mathematical model that represents a real-world situation.
Random Number : A number whose digits are selected completely at random.
Flow Chart : A graphical means of representing the logic of a simulation model.
Simulation models are built for management problems and require management input.
All simulation models are very expensive.
Simulation is best suited to analyse complex & large practical problems.
Simulation generates optimal solutions.
A simulation model cannot be very expensive.
Simulation is one of the most widely used ________ analysis tools.
Simulation allows for the ________ of real-world complications.
System ________ is similar to business gaming.
The Monte Carlo method uses ________ numbers.
Simulation experiments generate large amounts of ________ and information.
The problems tackled by simulation may range from very simple to extremely complex.
Simulation allows us to study the interactive effect of individual components or variables in order to determine which one is important.
Simulation is a valuable technique for analysing various maintenance policies before actually implementing them.
The simulation technique is considered a valuable tool because of its wide area of application.
Simulation is nothing more or less than the technique of performing sampling experiments on the model of the system.
Exercise Problem
1. A sweet stall observed that the demand for the item Mysorpa per week, in one-kilogram packs, is as follows:
Demand / week (per kilo pack)    Frequency
5                                4
10                               22
15                               16
20                               42
25                               10
30                               6
Generate the demand for the next 10 weeks, and also find the average demand.
2. At a service station, cars arrive for water-wash daily. The probabilities of the number of cars that arrive are given in the table below. Simulate the number of cars that will arrive for the next 10 days. Use the following random numbers: 87, 01, 74, 11, 46, 82, 59, 94, 25 and 34.
Cars arrival per day    Probability
5                       0.20
6                       0.15
7                       0.30
8                       0.25
9                       0.05
10                      0.05
3. A private bank has installed an ATM in the city bazaar area. It was found that the time between an arrival and completion of transaction varies from one minute to seven minutes. The arrival and service distribution times are given below. Simulate the ATM operations for the next 30 arrivals.
Time (minutes)    Probability (Arrival)    Probability (Service)
1-2               0.10                     0.05
2-3               0.15                     0.15
3-4               0.30                     0.30
4-5               0.25                     0.20
5-6               0.10                     0.15
6-7               0.10                     0.15
Use Monte-Carlo simulation technique and determine:
a. Waiting time of the customers.
b. Idle time of the ATM.
4. The materials manager of a firm wishes to determine the expected mean demand for a particular item in stock during the re-order lead time. This information is needed to determine how far in advance to re-order, before the stock level is reduced to zero. However, both the lead time and the demand per day for the item are random variables, described by the probability distributions below.
Lead time (days):        1       2       3       4
Probability:             0.45    0.30    0.25
Demand / day (units):    1       2       3       4
Probability:             0.15    0.25    0.40    0.20
Manually simulate the problem for 30 re-orders, to estimate the demand during lead time.
5. A company has the capacity to produce around 300 bikes per day. Daily production varies from 295 to 304 depending upon getting the clearance from the final inspection department. The probability distribution of bikes passed through final inspection per day is given below:
Production per day    Probability
295                   0.03
296                   0.04
297                   0.10
298                   0.20
299                   0.25
300                   0.15
301                   0.09
302                   0.07
303                   0.05
304                   0.02
The finished bikes are transported in a long trailer lorry sufficient to accommodate 300 bikes. Simulate the process for 10 days and find:
a. The average number of bikes waiting in the factory yard.
b. The average empty space in the lorry.
6. In a single pump petrol station, it was observed that the inter-arrival times and service times are as given in the table. Using the random numbers given, simulate the queue behaviour for a period of 30 minutes and estimate the probability of the pump being idle and the mean time spent by a customer waiting to fill petrol.
Inter-arrival time                Service time
Minutes    Probability            Minutes    Probability
1          0.10                   2          0.10
3          0.17                   4          0.23
5          0.35                   6          0.35
7          0.23                   8          0.22
9          0.15                   10         0.10
Use the following random numbers: 93, 14, 72, 10, 21, 81, 87, 90, 38, 10, 29, 17, 11, 68, 10, 51, 40, 30, 52 & 71.
7. A one-man TV service station receives TV sets for repair. TV sets are repaired on a first come, first served basis. The observations of the study made over a 100-day period are given below.
Service request
No. of TV sets requiring service    Frequency of request
1                                   15
2                                   15
3                                   20
4                                   25
5                                   25
Servicing done
No. of TV sets serviced             Frequency of service
1                                   10
2                                   30
3                                   20
4                                   15
5                                   25
Simulate a 10-day period of arrival and service pattern.
8. ABC company stocks certain products. The following data is available:
a. No. of Units:    0      1      2      3
   Probability:     0.1    0.2    0.4    0.3
b. The variation of lead time has the following distribution
   Lead time (weeks):    1       2       3
   Probabilities:        0.30    0.40    0.30
The company wants to know (a) how much to order? and (b) when to order? Assume that the inventory in hand at the start of the experiment is 20 units and 15 units are ordered as soon as the inventory level falls to 10 units. No back orders are allowed. Simulate the situation for 25 weeks.
9. A box contains 100 balls of which 20 percent are white, 30 percent are black and the remaining are red. Simulate the process of drawing balls at random from the box, identifying and noting the colour, and then replacing the ball. Use the following 10 random numbers to simulate: 52, 60, 02, 33, 79, 79, 30, 36, 58 and 43.
10. Rahul, the captain of the cricket team, has the following observations on the number of runs scored against each type of ball. The probabilities of the types of ball bowled by a bowler are given below.
Type of bowling       Probability
Over pitched          0.10
Short-Pitched         0.30
Outside off stump     0.20
Outside leg stump     0.15
Bouncer               0.20
Attempted Yorker      0.05
The number of runs scored off each type of ball is shown in the table given below:
Type of bowling       Runs scored
Over pitched          1
Short-Pitched         4
Outside off stump     3
Outside leg stump     2
Bouncer               2
Attempted Yorker      0
Simulate the game for 3 overs (6 balls per over) and calculate the batting average of Rahul.
15.12 MODEL ANSWERS TO QUESTIONS FOR DISCUSSION
(b) False (c) True (d) False
(c) Simulation (d) Random