Iso 20815-2018
Iso 20815-2018
STANDARD 20815
Second edition
2018-10
Reference number
ISO 20815:2018(E)
© ISO 2018
ISO 20815:2018(E)
Contents Page
Foreword......................................................................................................................................................................................................................................... iv
Introduction...................................................................................................................................................................................................................................v
1 Scope.................................................................................................................................................................................................................................. 1
2 Normative references....................................................................................................................................................................................... 1
3 Terms, definitions and abbreviated terms................................................................................................................................. 2
3.1 Terms and definitions........................................................................................................................................................................ 2
3.2 Abbreviations......................................................................................................................................................................................... 15
4 Production assurance and decision support.........................................................................................................................17
4.1 Users of this document.................................................................................................................................................................. 17
4.2 Framework conditions................................................................................................................................................................... 17
4.3 Optimization process....................................................................................................................................................................... 19
4.4 Production assurance programme...................................................................................................................................... 21
4.4.1 Objectives............................................................................................................................................................................. 21
4.4.2 Project risk categorization.................................................................................................................................... 22
4.4.3 Programme activities................................................................................................................................................. 23
4.5 Alternative standards...................................................................................................................................................................... 25
5 Production assurance processes and activities.................................................................................................................26
Annex A (informative) Contents of production assurance programme (PAP)........................................................28
Annex B (informative) Core production assurance processes and activities...........................................................30
Annex C (informative) Interacting production assurance processes and activities..........................................39
Annex D (informative) Production performance analyses..........................................................................................................43
Annex E (informative) Reliability and production performance data.............................................................................50
Annex F (informative) Performance objectives and requirements....................................................................................52
Annex G (informative) Performance measures for production availability..............................................................56
Annex H (informative) Relationship to major accidents...............................................................................................................69
Annex I (informative) Outline of techniques..............................................................................................................................................71
Bibliography.............................................................................................................................................................................................................................. 96
Foreword
ISO (the International Organization for Standardization) is a worldwide federation of national standards
bodies (ISO member bodies). The work of preparing International Standards is normally carried out
through ISO technical committees. Each member body interested in a subject for which a technical
committee has been established has the right to be represented on that committee. International
organizations, governmental and non-governmental, in liaison with ISO, also take part in the work.
ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of
electrotechnical standardization.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the
different types of ISO documents should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of
any patent rights identified during the development of the document will be in the Introduction and/or
on the ISO list of patent declarations received (see www.iso.org/patents).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT) see www.iso
.org/iso/foreword.html.
This document was prepared by Technical Committee ISO/TC 67, Materials, equipment and offshore
structures for petroleum, petrochemical and natural gas industries.
This second edition cancels and replaces the first edition (ISO 20815:2008), which has been technically
revised. The main changes compared to the previous edition are as follows:
— Clause 3: several new terms, definitions and abbreviations;
— Clause 4: new 4.1 and new Figure 2;
— Annexes A, B, C and E: minor changes;
— Annex D: various new text and new figures;
— Annex F: new text in Clause F.3, new Clause F.4, and new figure;
— Annex G and H: some changes in Clauses G.2, G.3, H.1 and H.2;
— Annex I: various changes in Clauses I.7 to I.10, I.18 to I.22, and new Clauses I.23 to I.26.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html.
Introduction
The petroleum, petrochemical and natural gas industries involve large capital investment costs as well
as operational expenditures. The profitability of these industries is dependent upon the reliability,
availability and maintainability of the systems and components that are used. Therefore, for optimal
production availability in the oil and gas business, a standardized, integrated reliability approach is
required.
The concept of production assurance, introduced in this document, enables a common understanding
with respect to use of reliability technology in the various life cycle phases and covers the activities
implemented to achieve and maintain a performance level that is at its optimum in terms of the overall
economy and, at the same time, consistent with applicable regulatory and framework conditions.
Annexes A to I are for information only.
1 Scope
This document describes the concept of production assurance within the systems and operations
associated with exploration drilling, exploitation, processing and transport of petroleum, petrochemical
and natural gas resources. This document covers upstream (including subsea), midstream and
downstream facilities, petrochemical and associated activities. It focuses on production assurance of
oil and gas production, processing and associated activities and covers the analysis of reliability and
maintenance of the components. This includes a variety of business categories and associated systems/
equipment in the oil and gas value chain. Production assurance addresses not only hydrocarbon
production, but also associated activities such as drilling, pipeline installation and subsea intervention.
This document provides processes and activities, requirements and guidelines for systematic
management, effective planning, execution and use of production assurance and reliability technology.
This is to achieve cost-effective solutions over the life cycle of an asset development project structured
around the following main elements:
— production assurance management for optimum economy of the facility through all of its life cycle
phases, while also considering constraints arising from health, safety, environment, and quality;
— planning, execution and implementation of reliability technology;
— application of reliability and maintenance data;
— reliability-based technology development, design and operational improvement.
The IEC 60300-3 series addresses equipment reliability and maintenance performance in general.
This document designates 12 processes, of which seven are defined as core production assurance
processes and addressed in this document. The remaining five processes are denoted as interacting
processes and are outside the scope of this document. The interaction of the core production assurance
processes with these interacting processes, however, is within the scope of this document as the
information flow to and from these latter processes is required to ensure that production assurance
requirements can be fulfilled.
The only requirement mandated by this document is the establishment and execution of the production
assurance programme (PAP). It is important to reflect the PAP in the overall project management in the
project for which it applies.
This document recommends that the listed processes and activities be initiated only if they can be
considered to add value.
2 Normative references
The following documents are referred to in the text in such a way that some or all of their content
constitutes requirements of this document. For dated references, only the edition cited applies. For
undated references, the latest edition of the referenced document (including any amendments) applies.
ISO 14224:2016, Petroleum, petrochemical and natural gas industries — Collection and exchange of
reliability and maintenance data for equipment
Note 2 to entry: ISO 14224:2016 distinguishes between the terms mean active repair time (MART), mean time to
repair (MTTR), mean time to restoration (MTTRes), and mean overall repairing time (MRT). See ISO 14224:2016,
3.59, 3.63, 3.64 and 3.61 for further details.
Note 3 to entry: The mean active repair time (MART) is defined as “expected active repair time” in ISO/TR
12489:2013, 3.1.34. See also ISO/TR 12489:2013, Figures 5 and 6.
[SOURCE: ISO 14224:2016, 3.2, modified — Notes 1 to 2 to entry have been added.]
3.1.2
availability
ability to be in a state to perform as required
Note 1 to entry: For a binary item, the measure of the availability is the probability to be in up state (i.e. in a state
belonging to the up state class), see 3.1.59.
Note 2 to entry: In 3.1.4, the figure shows the system is available at time t 1 and unavailable at time t 2.
Note 3 to entry: See ISO 14224:2016, Annex C for a more detailed description and interpretation of availability.
Note 4 to entry: Technical or operational availability (see ISO 14224:2016, C.2.3.2 and Table E.3) or system
availability can be used as derived performance measures. Case specific definition of system availability is
needed to reflect the system being addressed.
EXAMPLE 1 A usual item with an up state (3.1.59) and a down state (3.1.10) is a binary item. Components A
and B in the figure below are binary items.
EXAMPLE 2 A system made up of two redundant binary items, A and B, has four states: S1 (both A and B in
up state), S2 (A in up state and B in down state), S3 (A in down state and B in up state), S4 (both A and B in down
state). If the system is able to operate as required in states S1, S2 and S3 and not able in state S4, it is a binary item
with the up state class {S1, S2, S3} and the down class {S4}. This is illustrated in the Figure showing availability
behaviour of an 1oo2 system.
3.1.5
common cause failure
failures of multiple items, which would otherwise be considered independent of one another, resulting
from a single cause
Note 1 to entry: See also Notes to entry for common cause failures in ISO 14224:2016, 3.5.
[SOURCE: IEC 60050-192:2015, 192-03-18, modified — Note 1 to entry has been added.]
3.1.6
condition monitoring
obtaining information about physical state or operational parameters
Note 1 to entry: Condition monitoring is used to determine when preventive maintenance may be required.
Note 2 to entry: Condition monitoring may be conducted automatically during operation or at planned intervals.
Note 3 to entry: Condition monitoring is part of condition-based maintenance. See also ISO 14224:2016, Figure 6.
3.1.9
design life
planned usage time for the total system
Note 1 to entry: to entry It is important not to confuse design life with the ‘mean time to failure’ (MTTF), which
is comprised of several items that might be allowed to fail within the design life of the system as long as repair or
replacement is feasible.
3.1.10
down state
unavailable state
internally disabled state
internal disabled state
<of an item> state of being unable to perform as required, due to internal fault, or preventive
maintenance
Note 1 to entry: This concept is related to a binary item (3.1.4), which can have several down states forming the
down state class of the item. All the states in the down state class are considered to be equivalent with regard to
the unavailability of the considered item.
Note 2 to entry: See also Notes to entry for down state in ISO 14224:2016, 3.15.
EXAMPLE In the figure in 3.1.4, the down state class of the system S comprises only one state {S4} and the
system S is in down state at time t 2.
Note 2 to entry: Down time can be equipment down time (see Figure 4 and Table 4 in ISO 14224:2016), production
down time (see Figures I.1 and I.2) or down time for other operations (e.g. drilling). It is important to distinguish
between the equipment down time itself and the down time of the plant to which the equipment belongs.
3.1.13
failure
<of an item> loss of ability to perform as required
Note 1 to entry: A failure of an item is an event that results in a fault (i.e. a state) of that item (see 3.1.18). This is
illustrated in the figure in 3.1.50 for a binary system S comprising two redundant components A and B.
[SOURCE: IEC 60050‐192:2015, 192‐03‐01, modified — Note 1 to entry has been added.]
3.1.14
failure cause
root cause
set of circumstances that leads to failure
Note 1 to entry: A failure cause can originate during specification, design, manufacture, installation, operation or
maintenance of an item.
Note 2 to entry: See also ISO 14224:2016, B.2.3 and Table B.3, which define failure causes for all equipment
classes.
[SOURCE: IEC 60050-192:2015, 192-03-11, modified — Note 2 to entry has been added.]
3.1.15
failure data
data characterizing the occurrence of a failure event
Note 1 to entry: See also ISO 14224:2016, Table 6.
[SOURCE: IEC 60050-192:2015, 192-03-17, modified — Note 1 to entry has been added.]
3.1.17
failure rate
conditional probability per unit of time that the item fails between t and t + dt, provided that it has been
working over [0, t]
[SOURCE: ISO/TR 12489:2013, modified — Notes 1 to 4 to entry have been added.]
Note 1 to entry: See ISO 14224:2016, C.3 for further explanation of the failure rate.
Note 2 to entry: This definition applies for the first failure of binary items (3.1.4).
Note 3 to entry: Under the assumptions that the failure rate is constant and that the item is as good as new after
repairs the failure rate can be estimated as the number of failures relative to the corresponding accumulated up
time divided by this accumulated up time. In this case this is the reciprocal of MTTF (3.1.34). In some cases, time
can be replaced by units of use.
Note 4 to entry: The estimation of the failure rate can be based on operating time or calendar time.
3.1.18
fault
<of an item> inability to perform as required, due to an internal state
Note 1 to entry: A fault of an item results from a failure, either of the item itself, or from a deficiency in an earlier
stage of the life cycle, such as specification, design, manufacture or maintenance. See latent fault (ISO 14224:2016,
3.44). The down states of items A, B and S in the figure in 3.1.46 are examples of faults.
Note 2 to entry: An item made of several sub-items (e.g. a system) which continues to perform as required in
presence of faults of one or several sub-items is called fault tolerant.
Note 1 to entry: Discrepancy with intention is considered essential in determining human error; see Reference[81].
Note 2 to entry: The term “human error” is often attributed in hindsight to a human decision, action or inaction
considered to be an initiator or contributory cause of a negative outcome such as loss or harm.
Note 3 to entry: In human reliability assessment, human error is defined as any member of a set of human actions
or activities that exceeds some limit of acceptability, this being an out of tolerance action or failure to act where
the limits of performance are defined by the system (see Reference[78]).
[SOURCE: IEC 60050-192:2015, 192-03-14, modified — Notes 1 through 5 to entry have been added.]
3.1.21
instantaneous availability
A(t)
probability that an item is in a state to perform as required at a given instant
[SOURCE: IEC 60050-192:2015, 192-08-01]
3.1.22
integrity
ability of a barrier to function as required when needed
Note 1 to entry: See 3.1.2 in ISO/TR 12489:2013 for definition of safety integrity.
Note 2 to entry: There are different definitions of integrity: plant, asset, system, pipeline (see DNVGL-ST-F101:
2017), well (see ISO 16530-1:2017, 3.73), mechanical, safety (see ISO/TR 12489:2013, 3.1.2), structural (see
ISO 19900:—, 3.47) and technical.
3.1.23
item
subject being considered
Note 1 to entry: The item can be an individual part, component, device, functional unit, equipment, subsystem,
or system.
Note 2 to entry: The item may consist of hardware, software, people or any combination thereof.
Note 3 to entry: In this document, item can also be plant/unit and installation. See ISO 14224:2016, Figure 3.
3.1.24
logistic delay
delay, excluding administrative delay, incurred for the provision of resources needed for a maintenance
action to proceed or continue
Note 1 to entry: Logistic delays can be due to, for example, travelling to unattended installations, pending
arrival of spare parts, specialists, test equipment and information, and delays due to unsuitable environmental
conditions (e.g. waiting on weather).
[SOURCE: IEC 60050-192:2015, 192-07-13, modified — Notes 1 and 2 to entry have been added.]
3.1.25
lost revenue
total cost of lost or deferred production due to down time
3.1.26
maintainability
<of an item> ability to be retained in, or restored to a state to perform as required, under given
conditions of use and maintenance
Note 1 to entry: Given conditions would include aspects that affect maintainability, such as: location for
maintenance, accessibility, maintenance procedures and maintenance resources.
Note 2 to entry: Maintainability can be quantified using appropriate measures. See IEC 60050-192:2015,
192-07-Maintainability and maintenance support: measures.
3.1.31
maintenance supportability
supportability
<of an item> ability to be supported to sustain the required availability with a defined operational
profile and given logistic and maintenance resources
Note 1 to entry: Supportability of an item results from the inherent maintainability (3.1.26), combined with
factors external to the item that affect the relative ease of providing the required maintenance and logistic
support.
Note 2 to entry: See ISO 14224:2016, Annex C for further details regarding the interpretation of maintainability.
[SOURCE: IEC 60050-192:2015, 192-01-31, modified — Note 2 to entry has been added.]
3.1.32
major accident
hazardous event that results in multiple fatalities or severe injuries; or extensive damage to structure,
installation or plant; or large-scale impact on the environment
Note 1 to entry: Examples of large-scale impact on the environment are persistent and severe environmental
damage that can lead to loss of commercial or recreational use, loss of natural resources over a wide area or
severe environmental damage that will require extensive measures to restore beneficial uses of the environment.
Note 2 to entry: In ISO 17776:2016, a major accident is the realization of a major accident hazard.
3.1.34
mean time to failure
MTTF
expected time before the item fails
Note 1 to entry: See further details in ISO/TR 12489:2013, 3.1.29.
Note 2 to entry: IEC 60050-192:2015 defines MTTF as ”expectation of the operating time to failure”.
[SOURCE: ISO/TR 12489:2013, 3.1.29, modified — Notes 1 through 3 to entry have been added.]
3.1.35
midstream
business category involving the processing, storage and transportation sectors of the petroleum
industry
Note 1 to entry: See ISO 14224:2016, A.1.4 for further details.
3.1.36
modification
combination of all technical and administrative actions intended to change an item
[SOURCE: ISO 14224:2016, 3.67]
3.1.37
multi-state item
item with more than two classes of states
Note 1 to entry: This is an extension of the binary items beyond the concepts of up and down states. This can
characterize single items with degraded states or systems made up of several components.
EXAMPLE An oil production system comprising two wells, A and B, that can be considered as binary items
(see 3.1.3) has four states: S1 (both A and B in up state), S2 (A in up state and B in down state), S3 (A in down state
and B in up state), S4 (both A and B in down state). If, when they are in up state, A produces 200 bpd (barrels
per day) and B produces 100 bpd, then the system has four classes of production 300 bpd, {S1},200 bpd, {S2},
100 bpd, {S3} and 0 bpd, {S4}. With regards to oil production, it is a multi-state item. This is illustrated in the
figure showing production availability behaviour of a multi-state system.
3.1.38
observation period
time period during which production performance and reliability data are recorded
3.1.39
operating state
<of an item> state of performing as required
Note 1 to entry: See also ISO 14224:2016, Table 4.
Note 3 to entry: The state capacities of a multi-state item characterize various levels of operation and
consequently, the definition of the operating state of a multi-state item depends on the situation, for example, if:
— no other requirement is given, any state with a capacity greater than zero is an operating state;
— a minimum capacity is required, it provides the limit to split the states between up and down classes;
— a given capacity is specified, then only the states with this capacity are operating states;
— no other requirement is given, any state with a capacity greater than zero is an operating state (300 bpd,
200 bpd and 100 bpd in the figure in 3.1.37);
— a minimum capacity is required, it provides the limit to split the states between up and down classes (300 bpd,
200 bpd in the figure in 3.1.37, if the minimum allowed production is 200 bpd);
— a given capacity is specified, then only the states with this capacity are operating states (200 bpd in the
figure in 3.1.37 if and only if 200 bpd are required).
[SOURCE: IEC 60050‐192:2015, 192‐02‐04, modified — Notes 1 and 3 to entry have been added.]
3.1.40
operating time
time interval during which an item is in an operating state
Note 1 to entry: The accumulated times of various disjunct operating times interrupted by e.g. unplanned or
planned down time is also called operating time.
Note 2 to entry: Sometimes the term “running time” is used instead of “operating time”. Often the running time
describes the active part of the operating time, see Table 4 in ISO 14224:2016. Whether rundown or start-up
period is included depends on equipment, but hot-standby time is not included even though some equipment
functions can be active to minimize start-up time in e.g. redundant configuration (“hot standby”).
Note 3 to entry: Running hours during testing is also called running hours, even though this is at test conditions.
[SOURCE: IEC 60050-192:2015, 192-02-05, modified — Notes 1 to 3 to entry have been added.]
3.1.41
performance objective
indicative level for the desired performance
Note 1 to entry: Objectives are expressed in qualitative or quantitative terms. Objectives are not absolute
requirements and may be modified based on cost or technical constraints. See further details in Annex F.
3.1.42
performance requirement
required minimum level for the performance of a system
Note 1 to entry: Requirements are normally quantitative, but may also be qualitative.
3.1.43
petrochemical
business category producing the chemicals derived from petroleum and used as feedstock for the
manufacture of a variety of plastics and other related products
Note 1 to entry: See ISO 14224:2016, A.1.4 for further details.
3.1.45
production assurance
activities implemented to achieve and maintain a performance that is at its optimum in terms of the
overall economy and at the same time consistent with applicable framework conditions
Note 1 to entry: Production assurance in this document is not only limited to cover production of oil and gas, but
can also be other activities such as drilling operations, downhole well intervention, subsea intervention, offshore
loading operations, for which production assurance activities and reliability management are needed.
Note 2 to entry: Production assurance activities relate closely to the integrity management of the installations.
See definition of integrity in 3.1.22.
3.1.46
production availability
ratio of production to planned production, or any other reference level, over a specified period of time
Note 1 to entry: Production availability is an extension of the mean availability (3.1.33) to deal with multi-state
items. It is the ratio between the accumulated production delivered over a given interval of time and a reference
production level defined for this interval. For example, in 3.1.4, the figure shows the production availability of the
system over the interval [0, t 3] is equal to [300 · (δ1 + δ3 + δ5 + δ9) + 200 · (δ2 + δ6) + 100 · (δ4 + δ8)] / (300 · t 3)
provided the reference production level is 300 bpd at any time. By dividing by 300, this formula can also be
written [(δ1 + δ3 + δ5 + δ9) + 66,6 % · (δ2 + δ6) + 33,3 % · (δ4 + δ8)] / t 3.
Note 2 to entry: This measure is used in conjunction with analysis of delimited systems without compensating
elements such as substitution from other producers and downstream buffer storage. Battery limits need to be
defined in each case.
Note 3 to entry: See G.1 and Figure G.1 for further information. Examples of production loss categories (or time
loss categories) are shown in Tables G.1 to G.6.
Note 4 to entry: Production efficiency (PE) is a term often used by operators for historic production availability
in the operating phase and is a reported measure, but is in principle the same measure as predicted production
availability that is a modelled measure. This document uses the term production availability.
Note 5 to entry: For offshore and onshore loading systems, some special performance measures exist when
undertaking loading performance analyses, see further details of such metrics in I.26.
3.1.47
production performance
capacity of a system to meet demand for deliveries or performance
Note 1 to entry: Production availability, deliverability or other appropriate measures can be used to express
production performance.
Note 2 to entry: The use of production performance terms should specify whether it represents a predicted or
historic production performance.
3.1.48
production performance analysis
systematic evaluations and calculations carried out to assess the production performance of a system
Note 1 to entry: The term should be used primarily for analysis of whole systems, but may also be used for
analysis of production unavailability of sub-systems. Annex D provides guidance for planning and reporting such
analysis, and parts of Annex D can also be useful for loading performance analysis.
Note 2 to entry: Loading performance analysis is a particular type of production performance analysis focussing
on offshore and onshore loading operations, which e.g. use metocean data to analyse weather impact on such
operations (see I.25 and I.26).
3.1.49
redundancy
existence of more than one means for performing a required function of an item
Note 1 to entry: See ISO 14224:2016, C.1.2 for further details, where passive (cold), active (hot) standby and
mixed redundancy are described.
Note 3 to entry: IEC 60050-192:2015, 192-10-02 defines redundancy as “provision of more than one means for
performing a function”.
Note 2 to entry: In the figure below reliability is illustrated for a system S comprising two redundant components
A and B. The system is reliable all over the interval [0, t1] but has had a failure during [0, t 2]. See also Figure G.1
for further information.
[SOURCE: ISO 14224:2016, 3.81, modified — Notes 1 and 2 to entry have been added.]
3.1.51
reliability data
data for reliability, maintainability and maintenance support performance
[SOURCE: ISO 14224:2016, 3.84]
3.1.52
reliability management
activities undertaken to achieve reliability related performance objectives and requirements
Note 1 to entry: Reliability management reflects production assurance activities on equipment and system level.
In project/product–development and design phases this is often called “reliability engineering”.
Note 2 to entry: A reliability management programme (RMP) can be used to describe such activities, see A.1.
3.1.53
required function
function, or combination of functions, of an item that is considered necessary to provide a given service
[SOURCE: ISO 14224:2016, 3.83]
3.1.54
risk
combination of the probability of an event and the consequences of the event
Note 1 to entry: This definition is based on ISO/IEC Guide 51:2014, 3.9 that defines risk as combination of the
probability of occurrence of harm and the severity of that harm, where the probability of occurrence includes
the exposure to a hazardous situation, the occurrence of a hazardous event and the possibility to avoid or limit
the harm. “Harm" has been replaced by "event" in the definition to cope with production assurance purpose.
It is also similar to the definition of the “level of risk” given in ISO Guide 73:2009, 3.6.1.8 (i.e. “combination of
consequences and their likelihood”).
Note 2 to entry: Events leading to production losses are considered within the production assurance field.
3.1.55
risk register
record of information about identified risks
[SOURCE: ISO Guide 73:2009, 3.8.2.4]
3.1.56
state capacity
state efficiency
processing ability of an item state
Note 1 to entry: The capacity of an item state is related to the amount of production the item is able to produce or
process in this state. For example, in the figure in 3.1.33, component A has a capacity of 200 bpd.
EXAMPLE 1 A single oil production well with two states (binary item, 3.1.4) has, for example, a capacity of
100 barrels per day (bpd) when it is in up state and of 0 bpd when it is in down state (see component B in the
figure in 3.1.33).
EXAMPLE 2 An oil production system made up of two wells, A and B, producing respectively 200 bpd and
100 bpd, has four states. This is illustrated in the figure in 3.1.37:
When the reference value is non-ambiguous, the capacity can be given in percentage. For example, with regard
to the maximum capacity of the producing system, the capacity of S1 is 100 %, the capacity of S2 is 66,6 %, the
capacity of S3 is 33,3 % and the capacity of S4 is 0 %.
3.1.57
survival probability
reliability <measure>
R(t)
likelihood of the continued functioning of an item
Note 1 to entry: This likelihood is calculated by using Formula (1):
where Pr(T > t) is the probability that the time to failure of an item, T, is greater than t, a time equal to or
greater than 0.
Note 2 to entry: Reliability is illustrated in the figure in 3.1.50 for a system S comprising two redundant
components A and B. As T > t1 the system is reliable all over the interval [0, t 1] and is surviving at time t1. As
T < t 2 the system is not surviving at time t 2.
3.1.58
technology qualification
process of providing evidence that the technology will perform as required for the specified
application area
Note 1 to entry: The requirements include both functional (technical and operational) and associated reliability
requirements for its design life. Application area refers to the operating conditions, environment or purpose for
which the technology will be used.
3.1.59
up state
available state
<of an item> state of being able to perform as required
Note 1 to entry: This concept is related to a binary item (3.1.4), which can have several up states forming the
up state class of the item. All the states in the up state class are considered to be equivalent with regard to the
availability of the considered item.
Note 2 to entry: Up state relates to the availability performance (3.1.2) of the item.
EXAMPLE In the figure in 3.1.4, the up state class of the system S comprises three states {S1, S2, S3} and the
system is in up state at time t1.
[SOURCE: IEC 60050‐192:2015, 192‐02‐01, modified — Notes 1 to 3 to entry have been added.]
3.1.60
up time
time interval during which an item is in an up state
Note 1 to entry: See also ISO/TR 12489:2013, Figure 3.
3.1.62
variability
variations in performance measures for different time periods under defined framework conditions
Note 1 to entry: The variations can be a result of the down time pattern for equipment and systems or operating
factors, such as wind, waves and access to certain repair resources.
3.2 Abbreviations
FN flow network
PE production efficiency
PM preventive maintenance
PN Petri nets
QA quality assurance
used in the operational phase unless provisions have been made for it in the design phase. In addition,
there are dependencies between many of the listed measures.
Figure 1 — Typical technical and operational measures that affect production performance
e) Apply the optimization process as illustrated in Figure 3. Be aware that the execution of the
optimization process requires that the production assurance and reliability function should be
addressed by qualified team members.
f) If required, the process can be iterative, where the selected alternative is further refined and
alternative solutions are identified. The iterative process is typical for “gated” or threshold project-
execution phases.
g) Sensitivity analyses may be performed to take account of uncertainty in important input
parameters.
a Typical project constraints include HSE requirements; technical feasibility; compliance with acts, rules and
regulations; economical constraints; schedule constraints.
4.4.1 Objectives
A production assurance programme (PAP) shall serve as a management tool in the process of complying
with this document. The PAP may be either established for the various life cycle phases of a new asset
It is necessary to define the level of effort to invest in a PAP to meet the business objectives for each
life cycle phase. In practice, the production assurance effort required is closely related to the level of
technical risk in a project. Therefore, it is recommended that one of the first tasks to be performed is an
initial categorization of the technical risks in a project. This enables project managers to make a general
assessment of the level of investment in reliability resources that may have to be made in a project.
The project risk categorization typically varies depending on a number of factors such as financial
situation, risk attitude, etc. Hence, specific risk categorization schemes may be established. However, to
provide some guidance on the process, a simple risk categorization scheme is outlined in this subclause.
Projects can be divided into three risk classes:
— high risk;
— medium risk;
— low risk.
The features that describe the three risk classes are further outlined in Table 1. Typically, there is a
gradual transition from one risk class to another. Hence, a certain degree of subjective assessment is
required. However, the justification for the selected risk class for a project should be included in the
PAP issued during the feasibility or concept phase.
The risk categorization should not be detached from the overall project’s risk management process (see
Clause C.2).
The project risk categorization (i.e. high, medium and low) is further applied in Table 2 to indicate
what processes should be performed for the different project categories. The risk categorization
from different users perspective can require different approaches, but the importance is that risk
categorization should be in place in the user company to prioritize the production assurance activities.
Production assurance activities should be carried out in all phases of the life cycle of facilities to provide
input to decisions regarding feasibility, concept, design, manufacturing, construction, installation,
operation, maintenance and modification. Processes and activities shall be initiated only if they are
considered to contribute to added value of the project.
The production assurance activities specified in the PAP shall be defined in view of the actual needs,
available personnel resources, budget framework, interfaces, milestones and access to data and general
information. This is necessary to reach a sound balance between the cost and benefit of the activity.
Production assurance should consider organizational and human factors as well as technical aspects
(see Clause I.10).
Important tasks of production assurance are to monitor the overall performance level, manage reliability
and the continuous identification of the need for production assurance activities. A further objective of
production assurance is to contribute technical, operational or organizational recommendations.
The processes and activities specified in the PAP shall focus on the main technical risk items initially
identified through a top-down screening process (see 4.4.2). A risk-classification activity can assist in
identifying performance-critical systems that should be subject to more detailed analysis and follow-up.
The emphasis of the production assurance activities changes for the various life cycle phases. Early
activities should focus on optimization of the overall configuration, while attention to critical detail
increases in the later phases. The production assurance activities may also interact with integrity
management activities.
Production assurance activities relate closely to the integrity management of the installations and
system engineering activities, and the PAP should show such relationships.
In the feasibility and concept phases, the field layout configuration should be identified. This also
includes defining the degree of redundancy (fault tolerance), overcapacity and flexibility, on a system
level. This requires establishing the CAPEX, OPEX, LOSTREV, expected cost or benefit of risks and
revenue for each alternative.
These financial values are, in turn, fed back into the operators’ profitability tools, for evaluation
of profitability and selection of the alternative that best fits with the attitude towards risk. Optimal
production availability for field layouts requires that overemphasis on CAPEX is avoided, and it is
recommended that this is achieved through long-term partnering between suppliers and operators,
as well as between suppliers and their sub-suppliers. Such long-term relationships ensure mutual
confidence and maturing of the technology. Early direct involvement of the above parties with focus
on the overall revenue in a life cycle perspective is advised. This means, for example, implementing the
resulting recommendations as specifications in the invitations to tender.
The production assurance principles outlined in this document require use of the reliability data
methodology based on ISO 14224:2016, see the process “performance data tracking and analysis”
(Process 9). See also guidance in Annex E. In addition, Annex G provides a framework for performance
measures for production availability.
An overview of the production assurance processes is given in Table 2 and Clause 5, while descriptions
of the recommended activities for the processes are given in Annex B and Annex C.
Table 2 provides recommendations (indicated by an “X”) on which processes should be performed
as a function of the project risk categorization (see 4.4.2). The table also provides recommendations
(indicated by an “X”) as to when the processes should be applied (in what life cycle phase). Production
assurance requirements (process 1) can be used to illustrate the interpretation of the table. This process,
which is further described in Annex B, should be implemented for medium- and high-risk projects, and
performed in the feasibility, concept design, engineering and procurement life cycle phases.
Table 2 — Overview of production assurance processes versus risk levels and life cycle phases
Production assurance processes for asset development Life cycle phasee, f
Instal-
Fabri-
lation
Medium- High- Con- Engi- Pro- cation/
Low-risk Feasi- and Opera-
risk pro risk pro Process name and numberc ceptual neer- cure- Assem-
projects bility com- tion
jects jects designa ingb ment g bly/
mis-
Testing
sioning
1. Production assurance
— X X X X X X — — —
requirements
X X X 2. Production assurance planning X X X X X X X
3. Design and manufacture for
— X X — X X X X X X
production assurance
X X X 4. Production assurance X X X X X X X
— X X 5. Risk and reliability analysisd X X X — — — X
X X X 6. Verification and validation X X X — — — —
X X X 7. Project risk management X X X X X X X
— — X 8. Qualification and testing — X X X X — —
9. Performance data tracking
X X X — — — — X X X
and analysis
— — X 10. Supply chain management — — — X — — X
X X X 11. Management of change — X X X X X X
X X X 12. Organizational learning X X X X X X X
a Including front-end engineering and design (FEED).
b Including pre-engineering and detailed engineering.
c The following production assurance processes are within the main scope of work for this document: 1, 2, 3, 4, 5, 6 and 9.
d This process is primarily meant to cover production availability, system/equipment availability and component reliability analyses, but in this context
may also include availability/reliability of safety systems, see B.5 and G.1. The relations to risk analysis are described in Annex H and Clause I.20. Annex D
provides guidance for planning, execution and reporting of production performance analyses.
e Technology development projects can use similar production assurance processes.
f Lifetime extension projects can use similar production assurance processes. See Clause I.24.
g The procurement activities relate to various life cycle phases.
NOTE A process can be applicable for a certain risk class or life cycle phase although no “X” is indicated in this table. Likewise, if it can be argued that a
certain process does not add value to a project, it can be omitted.
Annex A
(informative)
A.1 General
This document describes the concept of production assurance (see Clause 4) and provides processes and
activities that culminate in a production assurance programme (PAP) (see 4.4.1). This annex suggests a
model for that PAP. A PAP (see 4.4) should cover the topics covered in Clauses A.2 to A.8.
The PAP is generally used for the entire asset or project by the operator, but can also apply for the
engineering contractor or a supplier/manufacturer for their scope of work in a project. The latter may
then be named “reliability management programme” (RMP), but entails the same guidance as described
in this annex. It may also apply for a technology development project for an operator or supplier, or in
a product portfolio (e.g. specific equipment categories or equipment classes; see ISO 14224:2016) to
ensure reliability management.
A.2 Title
Production assurance programme (PAP) for [insert the description of the project].
A.8 References
References are made to, and revised as appropriate when updating the PAP:
— key project documentation;
— relevant corporate or company requirements;
— relevant international, industry or national standards;
— list of production assurance deliverables (documentation).
Annex B
(informative)
Further requirements for the PAP are described in 4.4 and in Annex A. Production assurance process 2
is described in Table B.2.
— reference to the risk register (see Clause C.2); all mitigating actions arising from the
production assurance programme should be transferred to the risk register for follow-up and
close-out.
Input to process 4 — Production assurance
NOTE A close-out report for production assurance activities upon completion of a project can be
useful, also in organizational learning (see Process 12).
Production assurance process 3 is about implementing improvements (“feed forward loop”) during
design, manufacture, test and installation and operational phases to drive production assurance, and is
described in Table B.3. See also Figure B.1.
— flexibility, e.g.
the possibility
for alternative
routings,
reconfigurations
and future
expansions;
— maintainability,
e.g. minimizing
the amount
of down time
required for
maintenance.
The purpose of the The purpose of the The purpose of the
equipment reliability equipment reliability equipment reliability
analysis is to screen analysis is to screen analysis is to screen
the delivery project the delivery project the delivery project
to identify the critical to identify the critical to identify the critical
parts, which are then parts, which are then parts, which are then
studied in more detail studied in more detail studied in more detail
to identify possible to identify possible to identify possible
improvements. improvements. improvements.
A reliability analysis A reliability analysis A reliability analysis
technique may be se- technique may be se- technique may be se-
lected (see Annex I) lected (see Annex I) lected (see Annex I)
Failures occurring on equipment during fabrication and detected during acceptance testing (e.g. FAT,
SAT and SIT) is also important to assess, see also ISO 14224:2016, 5.2.
Production assurance activities (e.g. production performance analysis) can reveal the need for
reliability data that can require data collection in accordance with ISO 14224:2016.
Collection and analysis of performance data is further described in Annex E. Furthermore, Annex G
provides examples of performance measures that can be tracked and analysed.
NOTE Data qualification is part of process 5: Risk and reliability analysis.
Annex C
(informative)
C.1 General
The interacting processes described in this annex are not included in the responsibility of the
production assurance discipline. However, these interacting processes are required in order to achieve
the required production performance.
Reliability testing techniques are used in qualification testing and various techniques (e.g. accelerated
life testing) are further described in Clause I.9.
Annex D
(informative)
D.1 General
Production performance analyses should be planned, executed, used and updated in a controlled and
organized manner.
Production performance analyses should provide a basis for decisions concerning the choice of
solutions and measures to achieve an optimum economy within the given constraints. This implies that
the analysis should be performed at a point in time when sufficient details are available to provide
sustainable results. However, results should be presented in time for input to the decision process.
Production performance analyses should be consistent and assumptions and reliability data traceable.
Suitable analysis tools, calculation models, data and computer codes that are acceptable to the involved
parties should be chosen. Be aware that analysis tools and calculation models are under constant
development.
Recommendations given in this annex apply to the production performance analyses of complete
installations, but can also apply to reliability and availability analyses of components/systems with
obvious modifications.
Reporting of production performance analysis results should be at the relevant taxonomy level as
outlined in the Figure D.1. Production availability for entire production facility is typically reported
at taxonomic levels 3 - 4, whilst production unavailability can be reported to reflect production loss
impact from items on underlying taxonomic levels 5-9 when the analysis has a smaller scale focus. See
further description in this annex.
Figure D.2 illustrates the framework for this document. Collection of equipment reliability and
maintenance data is an important basis for production performance analysis. This topic is addressed in
ISO 14224:2016. These data are further treated and analysed to establish failure rates, repair time, etc.,
typically on equipment, subunit or component level (taxonomy levels 6 - 8, see Figure D.1). Smaller scale
availability assessments may be done at equipment level, e.g. as a function of component reliability
and maintainability. In this case, only (time-based) availability is assessed without considering the
(volumetric) production. It is also common to perform such availability analyses at system, plant or
installation level (taxonomy levels 3 - 5, see Figure D.1). If production volume is not considered, such
analyses may be performed with traditional reliability block diagrams (RBD) or fault tree analyses
(FTA). More information about these techniques is included in Clauses I.3 and I.4. ISO/TR 12489:2013
addresses reliability modelling and calculation of safety systems also applicable for non-safety systems,
but does generally not focus on production assurance.
D.2 Planning
D.2.1 Objectives
The objectives of the analyses should be clearly stated prior to any analysis. Preferably, objectives can
be stated in a production assurance activity plan as a part of the PAP structure. Objectives can be to:
— verify production assurance objectives or requirements;
— identify operational conditions or equipment units critical to production assurance;
— predict production availability, deliverability, availability, reliability, etc.;
— identify technical and operational measures for performance improvement;
— compare alternatives with respect to different production assurance aspects;
— enable selection of facilities, systems, equipment, configuration and capacities based on economic
optimization assessments;
— provide input to other activities, such as risk analyses or maintenance and spare-parts planning.
Operating modes for inclusion in the analysis should be defined. Examples of relevant operating modes
are start-up, normal operation, operation with partial load and run-down. Depending on the objective
of the analysis, it can also be relevant to consider testing, maintenance and emergency situations. The
operating phase or the period of time for analysis should also be defined.
The performance measures predicted should be defined. In production availability and deliverability
predictions, a reference level that provides the desired basis for decision-making should be selected.
It should also be decided whether to include the production performance effect from turnarounds (see
ISO 14224:2016, 3.94), as well as those major accident type of events normally identified and assessed
with respect to safety in risk analyses.
The analysis methodology for use should be decided on the basis of study objectives and the predicted
performance measures.
D.3 Procedure
D.3.1 Preparation
A review of available technical documentation should be performed as the initial activity, as well as
establishing liaison with relevant disciplines. Site visits can be performed and are recommended in
some cases.
All input documentation should be reviewed, liaison with relevant disciplines should be established,
and sites should be visited, if necessary.
Various performance measures may be used to evaluate the performance of the object subject to
analysis; see Clause G.1. Production availability and deliverability (whenever relevant) are the most
frequently used measures. Depending on the objectives of the production performance analysis, the
project phase and the framework conditions for the project, the following additional performance
measures may be used:
— proportion of time or number of times production (delivery) is equal to or above demand (demand
availability);
— proportion of time or number of times production (delivery) is above zero (on-stream availability);
— proportion of time or number of times the production (delivery) is below demand;
— proportion of time or number of times the production (delivery) is below a specified level for a
certain period of time;
— proportion of time production (delivery) is below planned production (production unavailability)
for specified increments of time;
— number of days with a certain production loss;
— resource consumption for repairs;
— availability of systems/subsystems;
— technical availability or operational availability.
As a predictor for the performance measure, the expected (mean) value should be used. The uncertainty
related to this prediction should be discussed and, if possible, quantified (see D.3.7).
Annex G provides a guide on the elements for inclusion in the performance measure for predictions and
for historical performance reporting.
In addition to the performance measure, a list of critical elements (e.g. equipment, systems, operational
conditions and compensatory means) should be established. This list assists in identifying systems/
equipment that should be considered for production assurance and reliability improvement.
There is a large number of component importance measures in literature on reliability theory, which
can be used to establish such lists. Many of these are developed to measure importance in safety
systems but can work in any traditional system availability analysis, typically performed by RBD or
FTA. When production is considered, most of the common importance measures in reliability theory are
not suited, but for several of these, it is possible to make only small adjustments in order to adapt them
to production performance analysis. For instance, the Birnbaum measure, as described in Reference[76]
can be interpreted as the difference in system state when the component is functioning and when it is
not functioning. By considering the difference in production when the component is functioning or not,
a slight variant of the Birnbaum measure can be used in production performance analyses.
Software tools for production performance analysis usually include some kind of importance measure
algorithm. Such algorithms can be linked to importance measures in reliability theory, but have a more
practical approach. A common concept is the evaluation of the component’s contribution to production
loss or unavailability. Some pitfalls related to this measure and its interpretation is addressed in
Reference[78] which also gives an interpretation of the covariance between system and component
suited for production systems. There are also other importance measures developed more specifically
for multi-state production systems, e.g. in Reference[75].
The main factors causing variability (and hence uncertainty in the predictions) in the performance
measure should be identified and discussed. Also, factors contributing to the uncertainty as a result of
the way the system performance is modelled should be covered.
Importance and sensitivity analyses can be carried out to describe the sensitivity of the input data used
and the assumptions made (see e.g. Reference[83]).
Where stochastic random sampling techniques are applied in production availability analyses,
corresponding mean values along with mean (p50), upper and lower bound (p10 and p90) values should
be reported. See also I.5.2.
Annex E
(informative)
reliability data management and coordination are needed to ensure reliability data collection for
selected equipment and consistent use of reliability data in the various analyses.
Selection of data should be based on the following principles.
— Data should originate from the same type of equipment and, if possible, originate from identical
equipment models.
— Data should originate from equipment using similar technology.
— Data should originate from periods of stable operation, although early-life or start-up problems
should be given due consideration. This also includes data from testing and inspection during the
operation (see Figure 6 in ISO 14224:2016).
— Data should, if possible, originate from equipment that has been exposed to comparable operating
and maintenance conditions.
— Data from laboratory testing, e.g. accelerated lifetime testing and reliability testing during
technology development and technology qualification. In addition, data from performance testing
prior to operation (e.g. FAT and SIT). Such pre-operational data should normally be entitled ‘pre-
operational/ test reliability data’, as opposed to actual field operating experience.
— The basis for the data used should be sufficiently extensive.
— The number of inventories and failure events used to estimate or predict reliability parameters
should be sufficiently large to avoid bias resulting from “outliers”.
— The repair time and down time data should reflect site specific conditions.
— The equipment boundary for the originating data source and analysis element should match as far
as possible (study assumptions should otherwise be given).
— Population data (e.g. accumulated operating time, observation period) should be indicated to reflect
the statistical significance (uncertainty related to estimates and predictions) and the “technology
window”.
— Data sources should be quoted.
Data from event databases (compliant with ISO 14224:2016) provide a relevant basis for meeting these
recommendations. In case of scarce data, it is necessary to use engineering judgement and to do a
sensitivity analysis of input data.
Annex F
(informative)
F.1 General
The specification of production assurance objectives and requirements can be considered for system
design, engineering and purchase of equipment, as well as for operations in defined life cycle periods. In
this respect, IEC 60300-3-4:2007 should also be considered.
In addition, provisions of this annex can be applied when specifying production assurance objectives in
documentation for scope of work, invitation to tender (ITT), etc.
b) Quantitative requirements may be expressed on the basis of performance measures, such as the
following:
— production availability (or production unavailability);
— system availability (or system unavailability);
— technical availability (or technical unavailability);
— operational availability (or operational unavailability);
— reliability (survival probability at time t of an item);
— time to failure;
— active repair time;
— preparation and/or delay (e.g. mobilization time for spare parts);
— repair workshop cycle time.
c) Qualitative requirements may be expressed in terms of any of the following:
— design criteria for the product;
— system configuration;
— inherent safety (acceptable consequence of a failure);
— production assurance activities to be performed.
Suppliers meet a variety of reliability requirements ranging from component level, all the way to the
overall field level as presented in Figure F.1. The reliability requirements can be a mix of operator/
project specific requirements combined with references to applicable standards and practices. This
situation can result in ambiguity and lack of consistency and standardisation, and can potentially
reduce the value of the production assurance activities, and should therefore be properly managed by
applying the principles in this document.
Annex G
(informative)
G.1 General
Performance measures for production availability are used in analyses for prediction or planning, as
well as for the reporting of historical performance in the operational phase. The performance measures
include the effect of down time caused by a number of different events. It is imperative to specify in
detail the different type of events and whether they should be included or excluded when calculating
the performance measure. This annex provides a guide to this subject in order to achieve a common
format for performance predictions and reporting among field operators.
Various detailed production-reporting systems exist, but the one selected should enable comparable/
exchangeable field reporting as indicated below.
For a typical hydrocarbon production facility, the following measures can be of interest for predictions
as well as for historical reporting:
a) Production (un)availability of oil for storage or for export, measured at the exit of the process
facility.
b) (Un)availability (time-based) or production (un)availability (volume-based) of water injection.
One can, in addition, estimate the production (un)availability of the production system, taking into
account the production unavailability of water injection.
c) (Un)availability (time-based) or production (un)availability (volume-based) of gas injection. One
can, in addition, estimate the production (un)availability of the production system, taking into
account the production unavailability of gas injection.
d) (Un)availability (time-based) or production (un)availability (volume-based) of utility systems.
One can, in addition, estimate the production (un)availability of the production system, taking into
account the production unavailability of the utility systems.
e) Production (un)availability of gas for export, measured at the exit of the process facility.
f) Production (un)availability of gas for export according to contractual requirements (e.g.
variable contractual nomination) and evaluation of penalties due to failure to fulfil contractual
requirements.
g) Deliverability of gas export, measured at the delivery point and including the effect of compensating
measures.
h) Production (un)availability of the subsea installation in isolation without considering downstream
elements.
i) Loading availability; measured offshore or onshore.
j) On-stream (production) availability; fraction of time the flow out of the system exceeds zero.
k) Demand availability; fraction of time the flow out of the system satisfies demand.
l) (Un)availability of the process facilities in isolation.
m) (Un)availability of gathering or exporting hydrocarbon/petrochemical network (volume-based).
VP (T1 ,T2 )
PA (T1 ,T2 ) = (G.1)
VR (T1 ,T2 )
where
phase, the latter is typically production efficiency (see 3.1.46, Note 4 to entry). Some alternative
reference measures are given in G.2.2 to G.2.6.
If the reference measures vary throughout the time (see cases G.2.2, G.2.5 and G.2.6), then:
— The estimated produced volume cannot be greater than the reference profile on average over
the time interval basis (daily, monthly, yearly, etc.). In practice, this means that the calculation
assumes that it is not possible to recover the production lost over the plateau period by using the
overcapacity of the system under study when the reference production volume declines. However,
regaining some production loss may be allowed given design capacity is not exceeded by producing
above planned production levels for a limited time period, and the use of production availability
estimation techniques can then reflect this.
— For a profile given on a yearly basis over 20 years, for example, the average of the estimated yearly
production availability values (approach A-yearly average) is not equal to the overall production
availability value estimated over the entire calculation period (approach B-lifetime average).
Indeed, the “arithmetic” average of the estimated yearly values considers that each yearly value
has the same weight within the final average whatever the reference production volume; whereas
the “volumetric” production availability over the entire calculation period gives more weight to the
years with a high reference production volume (i.e. the plateau period). Because both approaches
are correct, it is very important to clearly define from the start which final production availability
value has to be assessed, and to document the selected approach in the results.
— A third approach C can be to predict production availability for only one typical year in the lifetime,
e.g. in plateau period, to be representative for the analysis.
When presenting results of production availability analyses, it is recommended that the mean value
be presented together with the probabilistic distribution values to indicate the potential up- and
downside range.
Tu is the mean up time (MUT), estimated by using the actual up time observed in the field;
Td is the mean down time (MDT), estimated by using the actual down times observed in the field.
This down time includes planned and unplanned down time.
This operational availability corresponds to KPI number 7 in ISO 14224:2016, Table E.3. The technical
availability is given as KPI number 8 in ISO 14224:2016, Table E.3.
The taxonomic level (see ISO 14224:2016, Figure 3) where this measure is used is preferably on system
level or equipment item level, but can be used at plant or installation level.
The advantage of using operational availability as a time-based performance measure is that up time
and down time is easy to establish compared to the reference level of the volume-based measures. On
the other hand, the disadvantage is that this measure is not well suited to handle partial shutdowns. In
some cases, the measure can be modified by defining up time and time in operation as well-years.
These other parameters can include or exclude specific sources of loss of production or provide
information about how the losses are expected to occur. In some cases, this can be of equal or greater
importance than the overall production availability figure, for example the interruption frequency can
be a key element of a gas-supply system.
Whatever measures are used for an analysis, it is necessary to state explicitly the basis on which they
are calculated.
Tables G.1, G.4, G.5 and G.6 provide guidance on the events that should be included in production
availability predictions and the reporting of historical production availability for a production system
(i.e. volume-based performance measures). The production loss is a volume associated with an activity
or an event that results in reduced utilization of production potential within a period of time. Production
loss is registered according to the loss categories. The production loss volume being reported depends
on the facility and can be oil, gas, condensate, etc. Time-based availability predictions or statistics can
apply to the same event categorization (see Tables G.2 and G.3). Event categorization for other specific
operations (e.g. pipe laying) and its associated system/equipment typically have another format, which
is necessary to specify as required. Battery limits for the facilities, as well as any third-party processing,
tie-ins, subsea installations, etc., should be clearly defined.
Tables G.1 to G.6 refer to one of the following business categories: upstream (Tables G.1 to G.3),
midstream (Table G.4), downstream (Table G.5) and petrochemical (Table G.6). Examples of installations
or plants/units for each of these business categories are given in ISO 14224:2016, Tables A.1 and A.2.
It is important to distinguish between the production (or time) loss categorization shown in Tables G.1
to G.6 and the equipment failure and maintenance data requirements shown in ISO 14224:2016. This
relationship is also addressed in Annex E.
Annex H
(informative)
H.1 General
Risk analysis (e.g. QRA) and emergency preparedness analysis (see Clause I.20) will include serious
and infrequent hazardous events that can cause long-term shutdown of production or operation, and
that can imply major loss or even zero production or plant/system operation over a long time. These
events fall within the category of major accidents (see 3.1.32) and should be distinguished from other
infrequent events that are considered in the analyses of production availability and deliverability. The
major accidents should be treated separately in production performance analyses (see D.3.6).
The main purpose of this annex is to advise that there is a need in production performance analysis
to highlight this in the analysis work, and rather refer to the risk analysis (e.g. QRA) and emergency
preparedness analysis where such events are analysed. This means that the total production
unavailability or total system unavailability can be lower than what is predicted in the production
performance analysis. The project risk management for asset or facility would need to manage this total
risk picture, as some of the events causing major accidents can also benefit from using this document
(e.g. risk-reduction by equipment reliability management).
Typical major offshore accidents can be caused by hazards and accident conditions, such as the following
(ref. also NORSOK Z-013:2010):
1) process accidents (unignited and ignited leaks, fires and explosions);
2) risers/landfall and pipeline accidents;
3) storage accidents (liquid and gas);
4) loading/offloading accidents;
5) blowouts and well releases;
6) accidents in utility systems (leaks of chemicals, fires, explosion of transformers etc.);
7) accidents caused by external impact and environmental loads, e.g. collision, falling/swinging loads,
helicopter crash, earthquake, waves;
8) structural failure (including gross errors);
9) loss of stability and/or buoyancy (including failure of marine systems).
In addition, security issues can be considered in the context of risk analysis. The term threat should only
be used for such security considerations, and not be mixed with other equipment failure characteristics,
as threat is related to an intended action. See ISO 22300:2012.
Important factors in the analysis of major accidents are considered in more detail in the remainder of
this annex.
The purpose of the availability analyses is to predict the actual production availability, A, for the
installation for the time period considered. This quantity is uncertain (unknown) when the analysis is
carried out and it is necessary to predict it. The uncertainty related to the value of A can be expressed by
a probability distribution H(a), with mean or expected value, Ᾱ being the predictor of A. A Monte-Carlo
study of the production availability is generally performed by generating a sequence of independent,
identically distributed quantities, for example A1, A2, ..., An, from the probability distribution, H(a). The
distribution can be estimated from the sample A1, A2, ..., An.
In theory and as far as the uncertainty distribution H(a) is concerned, there is no problem in including
major accidents in this analysis. If a major accident results in a production loss, z, and its associated
probability equals p, this can be reflected in the distribution, H. However, using the “full distribution”
makes it difficult to predict A using the expected value. In this case, the spread around the mean would
be very large and the probability density can have a bimodal form very different from the typical
Gaussian distribution. The case is that the expected value of the contribution from the major accident
is normally a rather small quantity, namely p·z, which is an unrepresentative contribution to the
production loss. If the major accident type of event occurs, the actual loss would be z and this can mean
a dramatic reduction in the production availability, A.
If the time period considered is long, then the probability that a major accident will occur can be quite
large, and consequently the contribution p·z significant. Hence, in such cases, the inclusion of major
accidents is more meaningful.
Annex I
(informative)
Outline of techniques
I.1 General
Production performance analyses, such as reliability and availability analyses, are systematic
evaluations and calculations that are carried out to assess the performance of a system. The system
can in this context be at different taxonomic levels (see ISO 14224:2016, Annex A), for example, it can
be an overall production or transportation system, a compression train, a process shutdown system,
a drilling and well system or, it can be a pump or a valve. These analyses are part of a production
assurance programme (PAP).
It is useful to apply the following as a guide:
— Production performance analysis considers the production from facilities with several production
levels, e.g. offshore or onshore production systems, installation(s) or operation(s).
— Availability analysis considers the up times of two states (running/not running) of items
(components, equipment, units and systems).
— Reliability analysis considers the first failure of two states of items (components, equipment, units
and systems).
Reliability is important for safety and production performance. In the context of a PAP, it can be used to
evaluate the probability that the first failure occurs after a given period of time.
Availability is mainly focused on the time during which an item is running correctly. In the context of a
PAP, it can be appropriate for single components or for production trains made of component in series.
It may also be used to perform “availability allocations” in order to establish the requirements for the
providers of such components.
This annex briefly describes the following analysis methods and techniques:
— failure modes and effects analysis (I.2);
— fault tree analysis (I.3);
— reliability block diagram (I.4);
— Monte-Carlo simulation (I.5.2);
— behavioural modelling (I.5.3);
— flow network analysis (I.5.4);
— Petri net analysis (I.5.5);
— design reviews (I.6);
— hazard and operability study (I.7);
— performance and operability review (I.8);
— reliability testing (I.9);
— human factors (I.10);
Table I.1 — Failure modes and effects analysis (FMEA) and failure mode, effect and criticality
analysis (FMECA)
Analysis elements Summary
Analysis description Two bottom-up techniques for analysing and establishing systematically the effects of
potential failure modes.
Objective of analysis FMEA is a systematic technique for establishing the effects of potential failure modes
within a system. The analysis can be performed at any level of assembly. This can be done
with a criticality analysis, in which case it is called an FMECA.
FMECA is a semi-quantitative analysis, where the failure probability and the consequence
data are used to assess the criticality of each failure mode. It is a systematic methodol-
ogy to increase the inherent reliability of a system or product. It is an iterative process of
identifying failure modes, assessing their probabilities of occurrence and their effects on
the system, isolating the causes, and determining corrective actions or preventive meas-
ures. When the analysis is done from a functional standpoint, it is usually performed at
a plant or unit level, whereas if the focus is on the hardware, it usually descends down to
the maintainable-item level. The amount of data required is different depending on the
focus (see Tables I.2 to I.4 for details).
While it is most often used in the early stages of the design process to improve the inherent
reliability, the FMECA technique is equally useful in addressing system safety, availability,
maintainability, or logistics support.
Reference to existing MIL-STD-1629A:1998
standards
IEC 60812:2006
Overall need for The analysis is an inductive and systematic process in which individual failures at com-
information ponent level are generalized into potential failure modes at system level. The structured
method consists of the following steps:
a) system definition (both from functional and hardware standpoints);
b) identification of failure modes (it is necessary that it includes the operational and
environmental conditions present when failure occurs);
e) identification of detection means (to verify that suitable detection means exist for
all critical failure modes);
— evaluate various importance factors in order to assess the impact of the failures of
the individual components;
— evaluate the impact of the individual input uncertainties over the result(s).
Reference to existing IEC 61025:2006
standards
Overall need for A fault tree represents a Boolean process, which is used to calculate the probability of
information the corresponding overall event from the individual probabilities of the basic events
appearing in the formula. Therefore, the inputs used are the pure probabilities of fail-
ures, for which it is necessary to evaluate from the reliability parameters of the related
components:
— probability of failure;
— failure frequency;
A POR group is established consisting of production assurance analysts and disciplines like process,
operation and maintenance. During POR sessions, failure scenarios of each sub-part or stage of the
model are evaluated through a systematic review. Total down time estimates are established by
achieving time estimates for all down time phases.
An illustration of production down time associated with a critical failure event occurring on an
operating item belonging to an operating production facility is shown in Figure I.1, where a partial
production loss occurs. An illustration of production down time on an operating production facility
associated with a critical failure of a standby item is shown in Figure I.2. The items B and C given in the
figures are for example compressor trains (in 2 × 50 % or 2 × 100 % configuration).
Key
O operating T0 operating
RD run down T1 run down
RU ramp up T2 preparation and/or delay
F faulty T3 active repair
SB standby T4 preparation for production; start-up
PF production facility T5 ramp up
A, B, C, D are operating items belonging to the production T6 operating
facility
Figure I.1 — Illustration of production down time due to a critical failure of an operating item
belonging to an operating production facility
Key
O operating T0 standby
RD run down T1 restoration of C before demand
RU ramp up T2 restoration of C after demand
F faulty T3 ramp up
SB standby T4 operating
PF production facility A, B, C, D are operating items belonging to the production
facility
C is standby item belonging to the production facility
Figure I.2 — Illustration of production down time due to a critical failure of a standby item
belonging to an operating production facility
The figures in Clause 3 and Figures I.1 and I.2 in this document should be understood in conjunction
with Figure 4 in ISO 14224:2016 and Figures 5 to 7 in ISO/TR 12489:2013, to distinguish different
performance measures and their application.
Reliability testing is a generic term covering a variety of different test categories, but usually falls into
one of the following categories:
— Reliability life testing
Reliability life testing covers specific types of physical life tests performed to predict the reliability
characteristics during the expected life of the system. Typically, reliability tests seek to determine
reliability over calendar or operating time, but can also examine reliability to start-up on demand,
number of cycles to failures, or other appropriate operating life accelerating factor. The results from
the failure data from such tests are then recorded and analysed using relevant life data analysis
techniques (see I.13). See information in IEC 61123:1991.
— Accelerated life testing
Reliability tests can require extended time periods to be satisfactorily carried out and are often
unachievable given project schedules and deadlines. Accelerated life testing (ALT) programs can
shorten the length of the test program for a critical failure mode. ALT is similar to reliability life
testing, except that ALT decreases test times by typically overstressing components by increasing
or accelerating one or more test parameters (e.g. temperature, pressure or vibration levels). Related
types of testing are highly accelerated life testing (HALT) and highly accelerated stress screening
(HASS), which can provoke different or alternative failure modes or degradation mechanisms.
ALT, HALT and HASS are extremely challenging activities, and projects are encouraged to seek
specialized assistance. See further information in References [71] and [81] and in IEC 61163-1:2006
and IEC 61163-2:1998.
Careful consideration is needed when developing reliability testing programs as the requirements are
often misunderstood and can be difficult to plan and implement. Additional sub-categories of reliability
testing include:
— reliability growth testing (see also IEC 61164:2004);
— development-reliability tests to determine functional capabilities;
— demonstration-reliability tests to verify formally that operational requirements are being met at
the system level;
— qualification tests to verify conformance to the requirements of a specification (see IEC 60050:2015,
192-09-04);
— production-reliability assurance testing;
— in-service reliability demonstration testing.
As described above, several types of reliability testing can be performed in order to predict reliability
of components. BS 5760-2:1994 gives further information about such reliability tests.
With reliability testing, it is not meant normal manufacturing quality control (such as factory
acceptance test (FAT) and system integration test (SIT)) applicable for most components, sub-systems
and systems in the petroleum, petrochemical and natural gas industries. Accelerated lifetime testing
involves overstressing in terms of environmental and operational conditions, which provokes different
or alternative failure modes and degradation mechanisms compared to normal operating conditions. It
has proven extremely challenging to reproduce normal lifetime degradation from accelerated lifetime
testing.
The production availability model may be used to perform sensitivity studies in order to detect for
which components a better knowledge of their reliability parameters is necessary, or what reliability is
necessary to demonstrate for given components to reach the scheduled targets.
Reference can be made to API Publication 770:2001, ISO/TR 12489:2013, ISO 14224:2016,
IEC 62508:2010 and Reference[69].
NOTE 2 Further description of life data analysis can be found in ISO 14224:2016, Annex C.
— reorder point;
— replenishment quantity;
emergency preparedness analysis in accordance with ISO 15544:2000 are examples of such analyses.
Specifically, the interfaces to a risk and emergency preparedness analysis are as follows:
— Input to a risk analysis and an emergency preparedness analysis in terms of reliability of safety
systems (fire water system, fire and gas detection system, ESD system); such individual system
analyses can be a part of the overall production performance analysis.
— Risk and emergency preparedness analyses can impose reliability requirements on certain
equipment, typically safety systems. See also Clause F.4.
— Risk and emergency preparedness analyses can impose requirements to equipment configuration
that affect production assurance.
— Production can be made unavailable due to major accidents (see D.3.6 and Annex H).
EXAMPLE Manning levels, logistics and equipment test strategies.
— Coordination of study assumptions and data in risk and emergency preparedness analyses and
production performance analyses is recommended.
— Barrier issues (see ISO 17776:2016, 3.1.1 and Reference[70]).
In addition, as part of maintenance, plans for periodic test (ISO 14224:2016, 3.74 and Figure 6) should
be established to ensure that there are no hidden failures which would prevent a system needed for
emergency response achieving the essential functions and reliability targets given in the functional
requirements. On some installations, a risk-based approach is used to determine the inspection
and testing requirements. In this case, the inspection and testing frequencies will be developed for
individual installations.
ISO 35103:2017 addresses environmental monitoring for Arctic operations and can be relevant when
undertaking quality and reliability considerations of performance of environmental monitoring
systems.
Table I.10 — Technology maturity and typical qualification need for different technology
novelty categories
Category Technology statusa Description Way forward TRL
1 Proven for use No new technical elements, and No qualification required. The 7
known area of application. The tech- technology is ready for broad ap-
nology has reached TRL 7 through plication in company.
application in the company.
2a New - Qualified by No new technical elements, and No qualification required. The 4
company known area of application for com- technology is ready for first ap-
pany. Technology has been qualified plication in company.
for actual application by company.
Technology can have been applied
by others, but not by the company.
2b New - Qualified by Some new uncertainties due to If TRL 4 is verified, then no quali- 4 or
others new application area for the com- fication required. The technology <4
pany. Application area is known to, is then ready for first application
and technology has been qualified in company.
by others.
2c New - Unqualified Some new uncertainties due to Technology qualification will be <4
either technical elements with required. Low-to-medium risk to
limited industry history, or new reach TRL 4.
application area for the industry.
3 New - Unqualified Significant new uncertainties due Technology qualification will be re- <4
to either new technical elements, quired. Medium risk to reach TRL 4.
or new application area.
4 New - Unqualified Large new uncertainties due to Technology qualification will be <4
new technical elements, and new required. High risk. Demanding
application area. qualification expected, should have
focus and attention.
a New technology: Technology that is not proven in use, i.e. technology that has not reached TRL 7 through application in
the company.
— identify the shortest and/or the most probable sequences of event (scenarios)
starting from the perfect state and leading to the fully failed state;
— compute the expected requirement for spare parts and repair resources during the
system’s lifetime.
Reference to existing IEC 61165:2006
standards
Overall need for A Markov diagram represents a set of linear differential equations allowing the calcula-
information tion of the probability that the system is in a given state at a given time. The inputs are
data defining the transition rates and the relationships among the various states, such as:
— failure rates, repair rates of individual components;
— fault trees (FT driven Markov processes) where small Markov processes are used to
model the leaves of the FT, and where the FT provides the logic linking of the leaves
(see ISO/TR 12489:2013).
NOTE 5 This allows to build Markov models for large systems and prevent the combinato-
rial explosion of the number of states.
— NORSOK U-009:2011
f) Berthing operations: The time required for jetty configuration to become available for the loading of
specified vessels and cargo. This time also includes berthing delays due to operational limitations,
e.g. adverse wind or sea states.
Bibliography
[24] ISO/TR 12489:2013, Petroleum, petrochemical and natural gas industries — Reliability modelling
and calculation of safety systems
[25] IEC 31010:2009, Risk management — Risk assessment techniques
[26] IEC 60050-192:2015, International electrotechnical vocabulary — Part 192: Dependability
[27] IEC 60300-1:2014, Dependability management — Part 1: Guidance for management and application
[28] IEC 60300-3:2017, Dependability management — Part 3 (all sections): Application guide
[29] IEC 60300-3-2:2004, Dependability management — Part 3-2: Application guide — Collection of
dependability data from the field
[30] IEC 60300-3-3:2017, Dependability management — Part 3-3: Application guide — Life cycle costing
[31] IEC 60300-3-4:2007, Dependability management — Part 3-4: Application guide — Guide to the
specification of dependability requirements
[32] IEC 60300-3-11:2009, Application guide — Reliability centred maintenance
[33] IEC 60300-3-14:2004, Dependability management — Part 3-14: Application guide — Maintenance
and maintenance support
[34] IEC 60300-3-15:2009, Dependability management — Part 3-15: Application guide — Engineering
of system dependability
[35] IEC 60812:2006, Analysis techniques for system reliability — Procedure for failure mode and effects
analysis (FMEA)
[36] IEC 61025:2006, Fault tree analysis (FTA)
[37] IEC 61078:2016, Reliability block diagrams
[38] IEC 61123:1991, Reliability testing — Compliance test plans for success ratio
[39] IEC 61160:2015, Design review
[40] IEC 61163-1:2006, Reliability stress screening — Part 1: Repairable assemblies manufactured in lots
[41] IEC 61163-2:1998, Reliability stress screening — Part 2: Electronic Components
[42] IEC 61164:2004, Reliability growth — Statistical test and estimation methods
[43] IEC 61165:2006, Application of Markov techniques
[44] IEC 61508:2010, (all parts), Functional safety of electrical/electronic/programmable electronic
safety-related systems
[45] IEC 61511-1:2016, Functional safety — Safety instrumented systems for the process industry sector
— Part 1: Framework, definitions, system, hardware and application programming requirements
[46] IEC 61882:2016, Hazard and operability studies (HAZOP studies) — Application guide
[47] IEC 62402:2007, Obsolescence management — Application guide
[48] IEC 62508:2010, Guidance on human dependability
[49] IEC 62550:2017, Spare parts provisioning
[50] IEC 62551:2012, Analysis techniques for dependability — Petri net techniques
[51] IEC 62628:2012, Guidance on software aspects of dependability
[52] Publication API 770, A Manager’s Guide to Reducing Human Errors Improving Human
Performance in the Process Industries, March 2001
[53] API RP 17N, Subsea Production System Reliability, Technical Risk, and Integrity Management,
Second Edition, June 2017
[54] API RP 580, Risk-Based Inspection, Third Edition, February 2016
[55] BS 5760-2:1994, Reliability of systems, equipment and components — Guide to the assessment of
reliability
[56] EN 13306:2017, Maintenance — Maintenance terminology
[57] NORSOK N-003, Actions and action effects, January 2017
[58] NORSOK U-009, Life Extension for Subsea Systems, March 2011
[59] NORSOK Y-002, Life Extension for Transportation Systems, December 2010
[60] NORSOK Z-008, Risk based maintenance and consequence classification, Rev. 4, December 2017
[61] NORSOK Z-013, Risk and emergency preparedness assessment, Rev. 3, October 2010
[62] DNVGL-RP-0002. Integrity management of subsea production systems, November 2016
[63] DNVGL-RP-A203. Technology qualification, June 2017
[64] DNVGL-RP-F116. Integrity management of submarine pipeline systems, May 2017
[65] DNVGL-RP-F206. Riser integrity management, April 2008
[66] DNVGL-RP-G101. Risk based inspection of offshore topsides static mechanical equipment,
August 2017
[67] DNVGL-ST-F101. Submarine pipeline systems, October 2017
[68] MIL-STD-1629A, Military standard: Procedures for Performing a Failure Mode and Effect Analysis,
August 1998
[69] International Association of Oil and Gas Producers. Report No. 434 – 5: OGP Risk
Assessment Data Directory, Human Factors in QRA, March 2010
[70] International Association of Oil and Gas Producers. Report No. 544: Standardisation of
barrier definitions – Supplement to Report 415, April 2016
[71] Institute of Environmental Sciences and Technology. IEST-RP-PR003:2012: HALT AND HASS,
September 2012
[72] Norwegian Oil and Gas Association. Guideline 070: Application of IEC 61508 and IEC 61511 in
the Norwegian petroleum industry (recommended SIL requirements), June 2018
[73] Norwegian Oil and Gas Association. 122 - Norwegian Oil and Gas recommended guidelines for
the management of life extension, August 2017
[74] Oil & Gas UK. HS073: Guidance on the Management of Ageing and Life Extension for UKCS Oil
and Gas Installations, Issue 1, April 2012
[75] AVEN. T., ØSTEBØ, R., 1986. Two new component importance measures for a flow network
system. Reliability Engineering 14, pp.75-80
[76] BIRNBAUM. Z.W. 1969. On the importance of different components in a multicomponent system.
Krishnaiah, P.R. (Ed.), Multivariate analysis II, Academic Press, New York ( 1969), pp. 581-592
[77] COMER. P.J., ØSTEBØ, R. 1986. A Drillers’ HAZOP Method, SPE 15867, SPE European Petroleum
Conference, London, October 1986
[78] ISAKSEN. S. 2009. New insight into measures of component importance, In: Martorell, S.,
Guedes Soares, C. & Barnett, J. (eds.), Safety, Reliability and Risk Analysis: Theory, Methods and
Applications. CRC Press. Volume 2, pp. 1891 – 1899
[79] KIRWAN. B. A guide to practical human reliability assessment. Taylor & Francis, UK, 1994
[80 KLETZ. T.A. Hazop and Hazan, Fourth Edition, IchemE, UK, 1999
[81] NELSON. W. Accelerated Testing: Statistical Models, Test Plans, and Data Analysis, John Wiley &
Sons, Inc., Hoboken, NJ, USA, 1990
[82] REASON, J. Human Error. Cambridge University Press, UK, 1990
[83] SELVIK. J.T., AVEN, T. 2011. A framework for reliability and risk centered maintenance. Reliab.
Eng. Syst. Saf. 2011, 96 (2) pp. 324–331
ICS 75.180.01; 75.200
Price based on 99 pages