Data Envelopment Analysis (DEA) : Jean-Marc Huguenin
Data Envelopment Analysis (DEA) : Jean-Marc Huguenin
.......................................................7
..................................................................................32
.....................................................................40
...................................................66
LIST OF FIGURES
.................. 10
.................................... 12
............................................................. 14
............................................................................................................... 16
.....................................................................................18
.....................................................................................19
...............................................................................................21
................................22
...........................................33
................34
...........................................................................................36
.........................38
................................................................................................39
.....................................................................................42
..................44
..................45
..................46
........................47
...............53
............................................58
...................................................................................65
...................................................................................68
........................69
1 INTRODUCTION
This guide introduces Data Envelopment Analysis (DEA), a
performance measurement technique, in such a way as to be appropriate
to decision makers with little or no background in economics and
operational research. The use of mathematics is kept to a minimum. This
guide therefore adopts a strong practical approach in order to allow
decision makers to conduct their own efficiency analysis and to easily
interpret results.
DEA helps decision makers for the following reasons:
- By calculating an efficiency score, it indicates if a firm is
efficient or has capacity for improvement.
- By setting target values for input and output, it calculates how
much input must be decreased or output increased in order to
become efficient.
- By identifying the nature of returns to scale, it indicates if a
firm has to decrease or increase its scale (or size) in order to
minimize the average cost.
- By identifying a set of benchmarks, it specifies which other
firms’ processes need to be analysed in order to improve its
own practices.
After this introduction, Chapter 2 presents the essentials about DEA,
alongside a case study to intuitively understand its application.
Chapter 3 introduces Win4DEAP, a software package that conducts
efficiency analysis based on DEA methodology. Chapter 4 is dedicated
to more demanding readers interested in the methodical background of
DEA. Four advanced topics of DEA (adjustment to the environment;
preferences; sensitivity analysis; time series data) are presented in
Chapter 5. Finally, Chapter 6 shows how to program the Solver in
Microsoft Excel © in order to run a basic DEA efficiency analysis.
2 BASICS OF DEA
1
To keep it simple and to make this pedagogical guide more understandable, the term
‘firm’ is used in a generic way.
To better understand the mechanics behind DEA, this section develops a
simple practical case study. It includes only one input and one output,
although DEA can handle multiple inputs and multiple outputs.
Five register offices (A to E) produce one output (total number of
documents, such as marriage or birth certificates) with one input
(number of full-time equivalent public servants)2. The data are listed in
Table 1. For example, two public servants work in Register Office A.
They produce one document (during a certain period of time).
Input Output
Register Office
Public servants (x) Documents (y)
A 2 1
B 3 4
C 5 5
D 4 3
E 6 7
2 Note that DEA can handle more outputs and inputs. In order to represent this example
in a two-dimensional graph, we consider a total of two outputs and inputs of two (one
output, one input; no variable representing the quality of the variables).
not operating at an optimal scale. This is usually the case when
firms face imperfect competition, government regulations, etc.
The VRS model calculates an efficiency score called variable
returns to scale technical efficiency (VRSTE).
Comparison between the two models reveals the source of inefficiency.
Constant returns to scale technical efficiency corresponds to the global
measure of firm performance. It is composed by a ‘pure’ technical
efficiency measure (captured by the variable returns to scale technical
efficiency score) and a scale efficiency measure (SE). Section 4.2
demonstrates how these three notions (CRSTE, VRSTE and SE) relate
to each other.
7 E
6
Documents
5 C
4 B
3 D
ACRS-O
2
Production possibility set
1 A
S ACRS-I
T
0
0 1 2 3 4 5 6 7
Public servants
3 Note that the efficiency scores in a CRS model are always the same for an input or an
output orientation.
8
7
VRS efficient frontier E
6
Documents
5 C
DVRS-O
4 B
3 D
U DVRS-I
2
Production possibility set
1 A
0 V
0 1 2 3 4 5 6 7
Public servants
4 Note that the efficiency scores in a VRS model are different for an input or an output
orientation.
e) Case study 1 – CRS, VRS and scale efficiency
Figure 3 represents the CRS and the VRS efficient frontiers on the same
graph. Register Office B is CRS and VRS efficient, as it is located on
both frontiers. Register Offices A and E are efficient under the variable
returns to scale assumption but inefficient under the constant returns to
scale assumption. Finally, Register Offices D and C are both CRS and
VRS inefficient; they are located neither on the CRS nor on the VRS
frontiers.
7
E
5 C
4 B
DCRS-I
3 D
U DVRS-I
2
ACRS-I
1 A
0
0 1 2 3 4 5 6 7
Public servants
The gap observed between the CRS and the VRS frontiers is due to a
problem of scale. For example, Register Office A is VRS efficient. To
become CRS efficient, Register Office A should modify its scale (or
size). Only by operating at point ACRS-I would Register Office A be as
productive as Register Office B, which is the only CRS efficient
Register Office.
Some Register Offices (D and C) are not even located on the VRS
frontier. These Register Offices not only have a scale problem but are
also poorly managed. For example, Register Office D should move to
point DVRS-I located on the VRS frontier in order to become VRS
efficient (i.e. to eliminate the inefficiency attributable to poor
management). Furthermore, Register Office D should move from point
DVRS-I to point DCRS-I located on the CRS frontier in order to become
CRS efficient (i.e. to eliminate the inefficiency attributable to a problem
of scale).
As a result, the CRS efficiency (also called ‘total’ efficiency) can be
decomposed into two components: the VRS efficiency (also called
‘pure’ efficiency) and the scale efficiency. The following ratios
represent these three types of efficiency for Register Office D (input
orientation).
Technical efficiency of Technical efficiency of Scale efficiency of D
D under CRS D under VRS
UDCRS − I UDVRS − I UDCRS − I
TECRS = = 56.3% TEVRS = = 66.7% SE = = 84.4%
UD UD UDVRS − I
ECRS-I
7
E
VRS efficient frontier
6
DRS
Documents
CCRS-I
5 C
CVRS-I
4 B
DCRS-I
3 D
DVRS-I
2 IRS
ACRS-I
1 A
0
0 1 2 3 4 5 6 7
Public servants
To identify the nature of returns to scale, one has to focus on the slope
of the VRS efficient points A, DVRS-I, B, CVRS-I and E (or productivity).
Three situations can occur:
- A register office is located both on the CRS and the VRS
efficient frontiers (such as point B). Register Office B has the
highest productivity of all VRS efficient points (4 / 3 = 1.33). It
is facing constant returns to scale. Such a firm reaches its
optimal size (or efficient scale)5. It is operating at a point where
the scale (or size) has no impact on productivity. This situation
occurs when the average inputs consumption is minimized and
does not vary with output. In a situation of constant returns to
5 In the economic context, a firm operates at the optimal size (or efficient scale) when it
minimizes its average cost. In the context of DEA, we can measure efficiency in
physical or in monetary terms. Because cost and price information is not always
available or appropriate, the use of technical efficiency is often preferred. As this latter
measure is based on physical terms, we prefer to use the expression of average inputs
consumption instead of average cost.
scale, an increase in output of 1 percent requires a
proportionate increase in input (i.e. 1 percent).
- A register office (or the projected point of a register office) is
located at a point where the scale (or the size) has a positive
impact on productivity. Points A and DVRS-I are in such a
position (see Figure 5). The productivity of A (1 / 2 = 0.5) is
inferior to the productivity of DVRS - I (3 / 2.67 = 1.12). The ratio
of productivity is increasing with the scale. This situation
occurs until point B, which has a productivity of 1.33. Register
Offices A and D are therefore facing increasing returns to scale
(IRS) –or economies of scale–. In this situation, the average
inputs consumption declines whilst output rises. Register
Offices A and D have not yet reached their optimal size (or
efficient scale). To improve their scale efficiency, they have to
expand their output. In a situation of economies of scale, a
variation in output of 1 percent results in a variation in input of
less than 1 percent. Hence, an increase in output results in a
reduction of the average inputs consumption.
8
7
E
6
Documents
5 C
CVRS-I
4 B
3 D
DVRS-I
2
1 A
0
0 1 2 2.67 3 4 5 6 7
Public servants
7
E
6
Documents 5 C
CVRS-I
4 B
3 D
DVRS-I
2
1 A
0
0 1 2 3 4 5 6 7
Public servants
The specific cases of the five Register offices are described below (see
Figure 4):
- Register Office A is located on the VRS frontier but not on the
CRS frontier. Its inefficiency is due to an inappropriate scale. A
is facing increasing returns to scale. A variation in output of
1 percent results in a variation in input of less than 1 percent.
- Register Office D is neither located on the CRS nor on the VRS
frontier. Its inefficiency is due to an inappropriate scale and to
poor management. D is facing increasing returns to scale. A
variation in output of 1 percent results in a variation in input of
less than 1 percent.
- Register Office B is located both on the CRS and on the VRS
frontier. It has no inefficiency at all. B is facing constant
returns to scale. A variation in output of 1 percent results in a
variation in input of 1 percent.
- Register Office C is neither located on the CRS nor on the VRS
frontier. Its inefficiency is due to an inappropriate scale and to
poor management. C is facing decreasing returns to scale. A
variation in output of 1 percent results in a variation in input of
more than 1 percent.
- Register office E is located on the VRS frontier (but not on the
CRS frontier). Its inefficiency is due to an inappropriate scale.
E is evolving in a situation of decreasing returns to scale. A
variation in output of 1 percent results in a variation in input of
more than 1 percent.
7
E
VRS efficient frontier
6
Documents 5 C
CVRS-I
4 B
3 D
DVRS-I
2
1 A
0
0 1 2 3 4 5 6 7
Public servants
7
E
6 VRS efficient frontier
Documents
5
C
4
B
3
D
1
Slacks A
F
FVRS-I without slacks
0
0 1 2 3 4 5 6 7
Public servants
6 The higher the number of inputs and outputs that are taken into consideration for a
given number of firms, the more probable it is that each firm will be the best producer
of at least one of the outputs. Therefore, all firms could obtain a 100% efficiency score.
DEA models can also accommodate non-discretionary. In a second step,
efficiency scores can be adjusted to account for environmental variables
(i.e. such variables influence the efficiency of a firm but are not a
traditional input and are not under the control of the manager).
Moreover, variables should reflect both quantitative and qualitative
characteristics of firms’ resources and services. Although it may not be
easy to identify and to convert qualitative characteristics into numbers, it
is desirable to include such variables in the model in order to
appropriately benchmark firms.
7 This pedagogical guide will focus on the measurement of technical efficiency for two
main reasons: first, firms in the public sector are often not responsible for the age
pyramid of their employees; therefore taking into account the wages of the employees
(which often grow higher alongside seniority) would unfairly alter efficiency of a firm
with a greater proportion of senior employees; second, firms in the public sector do not
often produce commercial goods or services with a set price.
Technical efficiency is a global measure of firm performance. However,
it does not indicate the source of inefficiency. This source could be
twofold:
- First, the firm could be poorly managed and operated.
- Second, it could be penalised for not operating at the right
scale.
Technical efficiency can be decomposed into a ‘pure’ technical
efficiency measure and a scale efficiency measure to reflect these two
sources of inefficiency8.
8 The firm’s management team will definitely be held responsible for the ‘pure’ technical
efficiency score. In a situation where it does not have the discretionary power to modify
the firm’s size, it will likely not be accountable for the scale efficiency score. However,
especially in the private sector, one has the choice of the scale at which it operates: the
management team can easily downsize the firm and, with some efforts, upsize it also.
- How should one respond to managers who claim that their
firms are different from others, and therefore cannot be
compared to them?
Sometimes, inefficiencies can be explained by indisputable
environmental variables. But sometimes they cannot. Managers
often justify the low efficiency scores of their firms by arguing
that their situations are different compared to the situations of
the other firms. They claim to be a ‘special case’ (and therefore
it is acceptable to be inefficient). Actually, the majority of
firms could possibly claim to be different as most possess a
specificity that others do not have. However, it is likely that the
difference of one firm will be compensated by the difference of
another. More generally, it is up to the managers to prove that
they really face a hostile environment. If they cannot prove it,
management measures have to be taken to improve efficiency.
- Assume that a firm obtains an efficiency score of 86.3%.
Does this number have to be strictly applied?
Not really, it should be interpreted more as an order of
magnitude. This order of magnitude informs managers that they
have to increase their outputs or to decrease their inputs in
order to become more efficient. But one should not focus too
strictly on the capacity for 13.7% improvement. Such a number
could be interpreted by practitioners as too ‘accurate’ and may
offend their sensibilities. Therefore it is better to consider
efficiency scores more as more of an objective basis to hold an
open discussion about the way to improve firm efficiency
rather than a number to be strictly applied.
- A firm faces increasing returns to scale. It has economies of
scale. What does that concretely mean from a managerial
point of view?
Such a firm has not yet reached its optimal size. In order to
reduce its average cost (or its average inputs consumption), it
has to increase its size. Practically, this could be done either by
internal growth (i.e. producing more output) or by merging
with another firm which is also facing increasing returns to
scale. If, for some reason, managers cannot influence the scale
of a firm, they should not be held accountable for this source of
inefficiency.
- A firm faces decreasing returns to scale. It has
diseconomies of scale. What does that concretely mean from
a managerial point of view?
Such a firm is already oversized, having exceeded its optimal
size. In order to reduce its average cost (or its average inputs
consumption), it has to decrease its size. Practically, this could
be done either by internal decay (i.e. producing less output) or
by splitting the firm into two separate businesses. Note that
some of the production could be transferred to a firm facing
increasing returns to scale. If, for some reason, managers
cannot influence the scale of a firm, they should not be held
accountable for this source of inefficiency.
- Is efficiency the only criteria to assess a firm’s
performance?
Not necessarily. Basically, the assessment of a firm’s
performance will depend on the management objectives. Other
criteria such as effectiveness or equity are often considered
alongside efficiency. If this is the case, the overall performance
should be balanced with the various criteria.
- One firm obtains a score of 100% but all the others in the
dataset obtain much lower scores (for example, starting at
40% or lower). Is this realistic?
It could be realistic, but the gap appears to be important. In
such a case, data have to be carefully checked, and especially
data of the efficient firm. If a data problem is not identified,
such results mean that the efficient firm is likely to have
completely different processes than the other firms. It should
therefore be absolutely presented as a best practice model.
However, even if they are realistic, such results are likely to be
rejected by managers whose firms have low efficiency scores.
These managers are likely to be discouraged because it is
obviously unrealistic for them to improve their firm’s
efficiency by 60% (or more) in the short run. Therefore it is
better to exclude the efficient firm from the sample and to run a
new model.
- Almost all the firms obtain an efficiency score of 100%.
Does that mean that all of them are really efficient?
Yes, it could mean that all the firms are efficient. Such results
would be great! But they are unlikely. Here, the total number of
inputs and outputs is probably too high compared to the
number of firms in the dataset. In this case, one variable has to
be excluded and a new model has to be run. If the number of
firms obtaining a 100% score decreases, it indicates that the
number of variables was too high compared to the number of
firms. If not, all the firms are just efficient and must be
congratulated.
- The model does not show any results. What does that
mean?
Data has to be checked. This could happen when data with a
value of zero are in the set. Zeros have to be substituted by a
very small number (0.01).
Exercise 1
The following multiple choice questions test one’s knowledge on the
basics of DEA. Only one answer is correct. Solutions are listed at the
end of this exercise.
1. What is the main purpose of DEA?
a) DEA measures firms’ effectiveness
b) DEA measures firms’ efficiency
c) DEA measures firms’ profit
d) DEA measures firms’ productivity
2. A dataset includes information about input quantity, input cost and
output quantity. Which type of efficiency cannot be measured?
a) Technical efficiency
b) Cost efficiency
c) Revenue efficiency
d) Scale efficiency
3. ‘Pure’ technical efficiency reflects:
a) A global measure of firm performance
b) The efficiency of a firm operating at an incorrect scale
c) A measure of profit efficiency
d) The efficiency of a poorly managed firm
4. Firm A is inefficient. Who is its peer(s)?
a) One or several firms whose efficiency scores are worse than
firm A’s efficiency
b) One or several firms whose efficiency scores are better than
firm A’s efficiency, but which are not located on the
efficiency frontier
c) Any firm located on the efficiency frontier
d) One or several specific firms (i.e. a subgroup of efficient
DMUs) located on the efficiency frontier
5. A firm is producing laptops. Which input reflects quality?
a) The number of FTE employees with a Master’s degree
b) Total number of FTE employees, disregarding their
educational background
c) Total number of square meters of the factory
d) Energy consumption
6. A firm has diseconomies of scale. How can the management team
improve its efficiency?
a) By merging with another firm
b) By producing more output
c) By producing less output
d) By producing the same amount of output
7. A manager plans to measure efficiency using three inputs and two
outputs. What is the minimum number of firms that should be
included in the dataset?
a) 10
b) 6
c) 15
d) It does not matter
9 Zhu (2003) includes an earlier version of DEAFrontier, DEA Excel Solver, on a CD-
ROM. This software works only under Excel © 97, 2000 and 2003. It deals with an
unlimited number of DMUs and is available at little cost.
10 Cooper et al. (2006) include a CD-ROM with a DEA-Solver version limited at 50
DMUs. It is available at little cost.
11 As DEAP is a DOS program, a user friendly Windows interface has been developed for
it (Win4DEAP). These ‘twin’ software packages have to be both downloaded and
extracted to the same folder. Win4DEAP cannot work without DEAP.
12 DEAP Version 2.1: http://www.uq.edu.au/economics/cepa/deap.htm
Win4DEAP Version 1.1.3: http://www8.umoncton.ca/umcm-
deslierres_michel/dea/install.html
and coming from DEAP or Win4DEAP are reproduced by permission of
Timothy Coelli and Michel Deslierres.
Input Output
School
FTE teachers FTE adm. staff Computers Pupils
1 40.2 2.0 37.0 602.0
2 18.1 1.1 17.0 269.0
3 42.5 2.1 41.0 648.0
4 11.0 0.8 10.0 188.0
5 24.8 1.3 22.0 420.0
6 21.1 1.3 19.0 374.0
7 13.5 1.0 13.0 247.0
8 28.6 1.3 26.0 512.0
9 23.5 1.3 22.0 411.0
10 15.9 1.0 15.0 285.0
11 23.2 1.3 22.0 397.0
12 26.0 1.4 25.0 466.0
13 11.1 0.8 11.0 198.0
14 28.8 1.6 26.0 530.0
15 19.7 1.3 18.0 357.0
a) Case study 2 – Building a spreadsheet in Win4DEAP
Win4DEAP is launched by clicking the MD icon ( ). Firms (called
decision-making units or DMUs) are listed in the rows and variables
(outputs and inputs) in the columns. The opening spreadsheet contains
one decision-making unit (DMU1), one output (OUT1) and one input
(IN1) by default (see Figure 9).
To edit and name firms, outputs and inputs, the user has to click the
DMU1 ( ), OUT1 ( ) and IN1 ( ) icons, respectively. The
window reproduced in Figure 10 allows the user to (1) assign a long
name and a label (maximum of eight characters) to any variable and (2)
select the nature of the variables (either ‘input’ or ‘output’). Finally, the
user has to select the ‘with price’ option if he intends to measure cost,
revenue or profit efficiency (i.e. a ‘price’ column will be added to the
selected variable in the spreadsheet).
The icons enable the user to add firms (DMUs). The icons
enable the user to add variables (inputs or outputs). The icons
and are used to delete any existing DMUs or variables. Finally,
the following icons allow the user to reverse the order of
appearance of DMUs (rows) or variables (columns).
How to import Microsoft Excel © data into Win4DEAP
Note that data can be imported from an Excel © file into Win4DEAP by
following these steps:
- Save the Microsoft Excel © data (only numbers, no names of DMUs
or variables should be included) into the CSV format (Comma
delimited).
- In Win4DEAP, first select the ‘File’ menu, then the ‘Import’ option
and finally the ‘New data set’ application.
- Select the CSV file and open it.
- The data is now presented in the Win4DEAP spreadsheet, which
still has to be configured (DMUs and variables have to be named
and variables must be defined as inputs or outputs).
Exercise 2
The objective of this exercise is to correctly calibrate a spreadsheet in
Win4DEAP.
Tasks
a) Prepare a spreadsheet in Win4DEAP including 15 DMUs, 3 inputs
and one output. Name the DMUs ‘School 1’ to ‘School 15’. The
first input is ‘FTE teachers’, the second ‘FTE administrative staff’
and the third ‘Number of computers’. The output corresponds to the
number of pupils.
b) Feed the data appearing in Table 2 into the spreadsheet.
c) Save the file, preferably into the same folder containing
DEAP/Win4DEAP (menu ‘File’, option ‘Save as’).
To run a DEA model, the user has to click the ‘lightning’ icon ( ).
The window represented in Figure 12 will then appear. This window
allows a calibration of the model following steps 1 to 4 described below:
1. Select an input or an output orientation (Orientation box).
2. Select the assumption about returns to scale (Returns to scale box).
By ticking ‘constant’, one assumes constant returns to scale (CRS);
by ticking ‘variable’, one assumes variable returns to scale (VRS).
If one cannot be certain about the fact that firms are operating at an
optimal scale, running a VRS model is recommended.
3. Select a model (Calculate box). Three main options are available:
- To calculate technical efficiency (TE) or technical (CRS), ‘pure’
(VRS) and scale efficiency (SE), tick ‘DEA (multi-stage)’.
Options ‘DEA (1-stage)’, ‘DEA (2-stage)’ and ‘DEA (multi-
stage)’ correspond to different treatments of slacks. Following
Coelli (1998), the multi-stage treatment is recommended.
- To calculate cost, revenue or profit efficiency, tick ‘DEA-COST’.
For this option, cost and/or price information about variables
must be available and added to the spreadsheet.
- To calculate technical and scale efficiency when panel data are
available, tick ‘MALMQUIST’. See Section 5.4 to learn more
about this.
4. Choose how to display the results: only summarized or reported
firm by firm (Report box).
5. Click ‘Execute’ to run the model.
Exercise 3
The objective of this exercise is to run a DEA model in Win4DEAP
based on the schools case study. The following information is available:
- Schools are confronted with budget restrictions;
- The school system is heavily regulated;
- An obligatory school by school report is expected.
Prerequisites
Exercise 2
Tasks
a) Open the schools data spreadsheet (i.e. the calibrated spreadsheet in
exercise 2)
b) Calibrate the model
c) Execute the model
13 Note that if you had run a constant returns to scale model instead of a variable returns
to scale one, you would have obtained only one type of efficiency score in your results
file (technical efficiency –TE). Technical efficiency scores are strictly equal to constant
returns to scale technical efficiency scores obtained in the CRSTE column of your
variable returns to scale model.
- Firms associated with IRS are facing increasing returns to scale
(economies of scale).
- Firms associated with DRS are facing decreasing returns to
scale (diseconomies of scale).
- Firms associated with a dash are facing constant returns to
scale; they are operating at an optimal scale.
On average, schools efficiency scores are:
- 94% for CRSTE; overall, schools could reduce their inputs by
6% whilst educating the same number of pupils.
- 97.5% for VRSTE; a better school organization would be able
to reduce input consumption by 2.5%.
- 96.4% for SE; in adjusting their scale, schools could reduce
their inputs by 3.6%.
All subsequent tables displayed in the results file refer to the VRSTE
scores. These tables contain the following information:
- The number of the DMU under review (‘Results for firm’).
- The technical efficiency score (‘Technical efficiency’),
corresponding to the VRSTE when a VRS model has been run
or to the CRSTE when a CRS model has been run.
- The scale efficiency score (‘Scale efficiency’); note that the SE
is mentioned only when a VRS model has been run.
- The lines of the matrix represent the outputs and the inputs of
the model (‘output 1’, ‘output 2’, etc., ‘input 1’, ‘input 2’, etc.).
- The first column of the matrix recalls the original values of the
variables’ outputs and inputs (‘original values’).
- The second column of the matrix represents the movement an
inefficient DMU has to take in order to be located on the
frontier (‘radial movement’).
- The third column of the matrix is the additional movement a
DMU located on a segment of the frontier running parallel to
the axis has to take in order to become efficient (‘slack
movement’).
- The fourth column of the matrix lists the values of the variables
which enable the DMU to be efficient (‘projected value’); these
projected values take into account both the radial and the slack
movements.
- Finally, the listing of peers is mentioned. Each peer is
identified by a number and has an associated weight (‘lambda
weight’) representing the relative importance of the peer.
As illustrations, three individual school tables are specifically
commented on below: School # 1 (Figure 15), # 2 (Figure 16) and # 3
(Figure 17).
School # 1 (Figure 15) has a ‘pure’ efficiency score of 95.1% and a scale
efficiency score of 86.9%. It is facing decreasing returns to scale (DRS).
By improving the operation of the school, 4.9% (100 - 95.1) of inputs
could be saved. By adjusting the school to its optimal size, 13.1%
(100 - 86.9) of inputs could be saved.
The ‘original value’ column contains the original values of the school’s
variables: School # 1 educates 602 pupils with 40.2 teachers, 2
administrative staff and 37 computers. However, School # 1 could
‘produce’ the same quantity of output with fewer inputs: 37.186 teachers
instead of 40.2; 1.902 administrative staff instead of 2;
35.185 computers instead of 37 (see the ‘projected value’ column). The
decreases in inputs 2 and 3 are equal to 4.9 % of the original values:
(- 0.098 / 2) x 100 for input 2 and (- 1.815 / 37) x 100 for input 314. The
case of input 1 is slightly different: to become efficient, it has to
School # 2 (Figure 16) has a ‘pure’ efficiency score of 83.8% and a scale
efficiency score of 96.4%. It is facing increasing returns to scale (IRS).
By improving the operation of the school, 16.2% (100 - 83.8) of inputs
could be saved. By adjusting the school to its optimal size, 3.6%
(100 - 96.4) of inputs could be saved.
The ‘original value’ column contains the original values of the school’s
variables: School # 2 educates 269 pupils with 18.1 teachers,
1.1 administrative staff and 17 computers. However, School # 2 could
‘produce’ the same quantity of output with fewer inputs: 15.163 teachers
instead of 18.1, 0.922 administrative staff instead of 1.1;
14.242 computers instead of 17 (see the ‘projected value’ column). The
decreases in inputs 1, 2 and 3 are equal to 16.2% of the original values
(‘radial movement’ column). No slack movement is identified.
To improve its efficiency, School # 2 has to refer to Schools # 13, # 4,
# 14 and # 8, which are identified as its peers.
School # 3 (Figure 17) has a ‘pure’ efficiency score of 100% and a scale
efficiency score of 84.2%. It is facing decreasing returns to scale (DRS).
This school is well managed. It cannot improve its ‘pure’ efficiency. The
only capacity for improvement lies in a scale adjustment: 15.8%
(100 - 84.2) of inputs could be saved.
The ‘original value’ column contains the original values of the school’s
variables: School # 3 educates 648 pupils with 42.5 teachers,
2.1 administrative staff and 41 computers. These values are equal to the
projected ones (‘pure’ efficiency = 100%).
As School # 3 is purely efficient, it acts as its own peer.
Exercise 4
The objective of this exercise is to interpret DEA results. Figure 18
displays results for one of the 15 schools. It has been truncated in order
to hide the VRS technical efficiency score.
Tasks
Answer the following questions:
a) The variable returns to scale technical efficiency score has been
removed from the table. Find a way to calculate it.
Answer: For example, input 1 can be reduced by 0.864 (radial
movement). This represents 3.7% [(- 0.864 / 23.5) x 100]. Therefore
VRSTE corresponds to 96.3% (100 - 3.7). You would have obtained
the same result if you based your calculations on input 2. If you
based your calculations on input 3, be careful not to take into
account the slack movement.
b) Assume that the ‘pure’ efficiency score is equal to 96.3%. What is
the main feature in need for improvement: the school’s management
or the school’s scale?
Answer: The school’s management has the capacity to improve
efficiency by 3.7% (100 - 96.3). Modifying the school’s scale could
improve efficiency by 1.3% (100 - 98.7). Therefore the main feature
in need of improvement is school management.
c) Assume that the school has only time to analyze best practice from
one of its peers. Which one should it select?
Answer: School # 14. Among the three peers listed (13, 14 and 8),
School # 14 is associated with the highest weight (41.7%).
d) How much must the school reduce input 3 in order to be located on
the efficiency frontier?
Answer: - 1.184 [(- 0.809) + (- 0.375)].
4 DEA IN THE BLACK BOX
This section describes the two principal DEA models: the constant
returns to scale model (Charnes et al., 1978) and the variable returns to
scale model (Banker et al., 1984). DEA is based on the earlier work of
Dantzig (1951) and Farrell (1957), whose approach adopted an input
orientation. Zhu and Cook (2008), Cooper et al. (2007) or Coelli et al.
(2005) provide a comprehensive treatment of the methodology. By
2007, Emrouznejad et al. (2008) identified more than 4000 research
articles about DEA published in scientific journals or books.
DEA is a non-parametric method. Unlike parametric methods (such as
ordinary least square, maximum likelihood estimation or stochastic
frontier analysis), inputs and outputs are used to compute, using linear
programming methods, a hull to represent the efficiency frontier. As a
result, a non-parametric method does not require specification of a
functional form.
∑u y
r =1
r rk
TEk = m (1)
∑v x
i =1
i ik
15 This model is also known as the Charnes, Cooper & Rhodes model (CCR model).
Where:
TEk is the technical efficiency of firm k using m inputs to produce s
outputs;
yrk is the quantity of output r produced by firm k;
xik is the quantity of input i consumed by firm k;
ur is the weight of output r;
vi is the weight of input i.
n is the number of firms to be evaluated
s is the number of outputs
m is the number of inputs
The technical efficiency of firm k is maximized under two constraints.
First, the weights applied to outputs and inputs of firm k cannot generate
an efficiency score greater than 1 when applied to each firm in the
dataset (equation # 3). Second, the weights on the outputs and on the
inputs are strictly positive (equation # 4). The following linear
programming problem has to be solved for each firm:
s
∑u y
r =1
r rk
Maximize m (2)
∑v x
i =1
i ik
∑u
r =1
r yrj
Subject to m
≤1 j = 1, K , n (3)
∑v x
i =1
i ij
u r , vi > 0 ∀r = 1, K , s; i = 1, K , m (4)
Subject to Subject to
m s m s
∑v x − ∑u
i =1
i ij
r =1
r y rj ≥ 0 j = 1, K , n (6) ∑v x − ∑u
i =1
i ij
r =1
r yrj ≥ 0 j = 1, K , n (10)
s m
∑u
r =1
r yrk = 1 (7) ∑v x
i =1
i ik =1 (11)
Subject to Subject to
n n
φk y rk − ∑ λ j yrj ≤ 0 r = 1, K , s (14) y rk − ∑ λ j y rj ≤ 0 r = 1, K , s (18)
j =1 j =1
n n
xik − ∑ λ j xij ≥ 0 i = 1, K , m (15) θ k xik − ∑ λ j xij ≥ 0 i = 1, K , m (19)
j =1 j =1
λ j ≥ 0 ∀j = 1, K , n (16) λ j ≥ 0 ∀j = 1, K , n (20)
Where:
1 and θk represent the technical efficiency of firm k
φk
λj represents the associated weighting of outputs and inputs of firm j
16 Note that the input and output orientations refer to the dual equations of each model
(and not to the primal ones).
Every firm located on the sections’ envelope running parallel to the axes
has to be adjusted for output and input slacks. However, the preceding
formulation does not integrate the role of slacks in measuring efficiency.
Considering output slacks, sr, and input slacks, si, the above equations
become:
CRS output-oriented model CRS input-oriented model
Dual equation with slacks Dual equation with slacks
s m s m
Maximize φk + ε ∑ sr + ε ∑ si (21) Minimize θ k − ε ∑ sr − ε ∑ si (25)
r =1 i =1 r =1 i =1
Subject to Subject to
n n
φk yrk − ∑ λ j yrj + sr = 0 r = 1, K , s (22) y rk − ∑ λ j y rj + sr = 0 r = 1, K , s (26)
j =1 j =1
n n
xik − ∑ λ j xij − si = 0 i = 1, K , m (23) θ k xik − ∑ λ j xij − si = 0 i = 1, K , m (27)
j =1 j =1
λ j , sr , si ≥ 0 ∀j = 1, K , n; r = 1, K , s; i = 1,..., m λ j , sr , si ≥ 0 ∀j = 1, K , n; r = 1, K , s; i = 1, K , m
(24) (28)
∑λ
j =1
j = 1 in the dual equations).
Figure 19 represents the CRS efficiency frontier (the dashed line) and
the VRS efficiency frontier (the solid line) on the same graph to
illustrate a simple example with one output and one input. Only one
firm, B, is located on both frontiers. A and C are 100% efficient under
the VRS assumption, but inefficient under the CRS assumption. D and E
are inefficient under both specifications.
17 This model is also known as the Banker, Charnes & Cooper model (BCC model).
CRS efficient frontier
Output C’ C
W
VRS efficient frontier
E’’ E’ E
V
B
U
D’’ D’ D
T
S A
A’
0
Input
Subject to Subject to
m s m s
∑v x − ∑u
i =1
i ij
r =1
r yrj − ck ≥ 0 j = 1, K , n (30) ∑v x − ∑u
i =1
i ij
r =1
r yrj − ck ≥ 0 j = 1, K , n (34)
s m
∑u
r =1
r yrk = 1 (31) ∑v x
i =1
i ik =1 (35)
Subject to Subject to
n n
φk y rk − ∑ λ j yrj ≤ 0 r = 1, K , s (38) y rk − ∑ λ j y rj ≤ 0 r = 1, K , s (43)
j =1 j =1
n n
xik − ∑ λ j xij ≥ 0 i = 1, K , m (39) θ k xik − ∑ λ j xij ≥ 0 i = 1, K , m (44)
j =1 j =1
n n
∑λ
j =1
j =1 (40) ∑λ
j =1
j =1 (45)
λ j ≥ 0 ∀j = 1, K , n (41) λ j ≥ 0 ∀j = 1, K , n (46)
When slacks are added into the model, the dual linear programming
equations become:
Subject to Subject to
n n
φk yrk − ∑ λ j yrj + sr = 0 r = 1, K , s (48) y rk − ∑ λ j y rj + sr = 0 r = 1, K , s (53)
j =1 j =1
n n
xik − ∑ λ j xij − si = 0 i = 1, K , m (49) θ k xik − ∑ λ j xij − si = 0 i = 1, K , m (54)
j =1 j =1
n n
∑λ
j =1
j =1 (50) ∑λ
j =1
j =1 (55)
λ j , sr , si ≥ 0 ∀j = 1, K , n; r = 1, K , s; i = 1,..., m λ j , sr , si ≥ 0 ∀j = 1, K , n; r = 1, K , s; i = 1, K , m
(51) (56)
∑λ
j =1
j = 1 restriction is substituted by the ∑λ
j =1
j ≤ 1 constraint
(Coelli et al., 2005). In Figure 20, the NIRS efficiency frontier has been
added (the dotted line). It corresponds to the CRS frontier from the
origin to point B followed by the VRS frontier from point B. The nature
of the scale inefficiencies for each firm can be determined by comparing
technical efficiency scores under NIRS and VRS. If
NIRS TE ≠ VRS TE (as for firms A and D), increasing returns to scale
apply. If NIRS TE = VRS TE (but ≠ CRS TE) (as for firms E and C),
decreasing returns to scale apply. Finally, if
NIRS TE = VRS TE = CRS TE, as for firm B, constant returns to scale
apply.
CRS efficient frontier
Output C’ C
W
VRS efficient frontier
E’’ E’ E
V
B
U
D’’ D’ D
T
S A
A’
0
Input
5 EXTENSIONS OF DEA
In this section, a selection of four extensions of DEA is shortly
introduced: adjusting for the environment, preferences (weight
restrictions), sensitivity analysis and time series data. For a broader
overview of the major developments in DEA, see Cook and Seiford
(2008). For an up-to-date review of DEA, readers will refer to Cooper et
al. (2011).
For different reasons (e.g. the weights assigned to variables by DEA are
considered unrealistic for some firms; the management team may wish
to give priority to certain variables; etc.), preferences about the relative
importance of individual inputs and outputs can be set by the decision
maker. This is done by placing weight restrictions onto outputs and
inputs (also called multiplier restrictions). Cooper et al. (2011) and
Thanassoulis et al. (2004) provide a review of models regarding the use
of weights restrictions. An earlier review can be found in Allen et al.
(1997). Generally, the imposition of weight restrictions worsens
efficiency scores. Three main approaches are identified to accommodate
preferences:
- Dyson and Thanassoulis (1988) propose an approach which
imposes absolute upper and lower bounds on input and output
weights. This technique is applied in Roll et al. (1991) to
highway maintenance units or in Liu (2009) to garbage
clearance units.
- Charnes et al. (1990) develop the cone-ratio method. This
approach imposes a set of linear restrictions that define a
convex cone, corresponding to an ‘admissible’ region of
realistic weight restrictions. See Brockett et al. (1997) for an
application to banks.
- Thompson et al. (1986, 1990) propose the assurance region
method. This approach is actually a special case of the cone
ratio. It imposes constraints on the relative magnitude of the
weights. For example, a constraint on the ratio of weights for
input 1 and input 2 can be included, such as the following:
ν2
L1, 2 ≤ ≤ U1, 2 , where L1,2 and U1,2 are lower and upper
ν1
bounds for the ratio of the weight of input 2 (ν 2 ) to the weight
of input 1 (ν 1 ). As a result, the assurance region method limits
the ‘region’ of weights to a restricted area by prohibiting large
differences in the value of those weights. An application of this
model is provided by Sarica and Or (2007) in the assessment of
power plants.
In DEA, panel data are considered using two methods: window analysis
and the Malmquist index.
Window analysis, introduced by Charnes et al. (1985), examines the
changes in the efficiency scores of a set of firms over time. A ‘window’
of time periods is chosen for each firm. The same firm is treated as if it
represented a different firm in every time period. In this sense, window
analysis can also be considered as a sensitivity analysis method. For
instance, a model including n firms with annual data and a chosen
‘window’ of t years will result in n x t units to be evaluated. For each
firm, t different efficiency scores will be measured. The ‘window’ is
then shifted by one period (one year in our example) and the efficiency
analysis is repeated. Yue (1992) provides a didactical application of
window analysis. Other applications include Yang and Chang (2009),
Avkiran (2004) or Webb (2003).
The Malmquist total factor productivity index was first introduced by
Malmquist (1953) before being further developed in the frame of DEA.
It is used to measure the change in productivity over time. The
Malmquist index decomposes this productivity change into two
components:
- The first one is called ‘catch-up’. This captures the change in
technical efficiency over time.
- The second one is called ‘frontier-shift’. This captures the
change in technology which occurs over time (i.e. the
movement of efficiency frontiers over time).
Readers will refer to Färe et al. (2011) and Tone (2004) for actual
reviews. Applications of the Malmquist index can be found in Coelli and
Prasada Rao (2005) and Behera et al. (2011).
6 DEA WITH MICROSOFT EXCEL © SOLVER
18 In Microsoft Excel © 2010, the Solver has to be loaded by clicking the File button, then
the Excel Options and finally the Add-Ins button. In the Manage box, Excel Add-ins has
to be selected before clicking the Go button. In the Add-Ins box, the Solver Add-in has
to be selected. Finally, the OK button has to be clicked. Once the Solver is loaded, it is
located in the Analysis group on the Data tab.
Consider five register offices (A to E) producing two outputs (birth and
marriage certificates) with one input (full-time equivalent public
servant). The data are listed in Table 4. For example, one full-time
equivalent (FTE) public servant works in Register Office A. He
produces one birth and six marriage certificates during a certain period
of time.
Register Input Output
Office Public servant (x) Birth (y1) Marriage (y2)
A 1 1 6
B 1 3 8
C 1 4 3
D 1 5 6
E 1 6 2
The use of Excel © Solver is illustrated with the following CRS model.
m s
Subject to
∑v x − ∑u y
i =1
i ij
r =1
r rj ≥0 j = 1, K , n
m
∑v x
i =1
i ik =1
u r , vi > 0 ∀r = 1, K , s; i = 1, K , m
$F$2=1 (where the cell reference is $F$2, the sign is = and the
constraint is 1). Note that this constraint means that the given
level of input is kept constant.
Figure 23 represents the Solver Parameters defined above.
Exercise 6
The objective of this exercise consists in programming the following
CRS model using Microsoft Excel © Solver. The same data as above
(see Table 4) have to be used. Note that this CRS model is equivalent to
the one developed above. Instead of maximizing the weighted sum of
outputs, however, it minimizes the weighted sum of the input.
m
Minimize ∑v x
i =1
i ik
m s
Subject to
∑ vi xij − ∑ ur yrj ≥ 0
i =1 r =1
j = 1, K , n
s
∑u y
r =1
r rk =1
u r , vi > 0 ∀r = 1, K , s; i = 1, K , m
Tasks
a) Prepare an Excel © spreadsheet adapted to the use of
Excel © Solver.
b) Which cell has to be optimized?
Answer: The target cell to be optimized is $F$2
c) In this case, do you maximize output or minimize input?
Answer: Input minimization (the Min option has to be ticked in the
Solver)
d) Which equation of the CRS model is optimized by the Solver?
m
Answer: ∑v xi =1
i ik . This equation corresponds to the minimization
Answer: $G$6:$G$10>=0
∑u y
r =1
r rk = 1 in the CRS model?