SPE-165374-Global Model For Failure Prediction For Rod Pump Artificial Lift Systems
SPE-165374-Global Model For Failure Prediction For Rod Pump Artificial Lift Systems
SPE-165374-Global Model For Failure Prediction For Rod Pump Artificial Lift Systems
Global Model for Failure Prediction for Rod Pump Artificial Lift Systems
Yintao Liu, Ke-Thia Yao, USC Information Sciences Institute; Cauligi S.Raghavenda, Anqi Wu, Dong Guo,
Jingwen Zheng, University of Southern California; Lanre Olabinjo, Oluwafemi Balogun, Chevron; Iraj
Ershaghi,University of Southern California
Copyright 2013, Society of Petroleum Engineers
This paper was prepared for presentation at the SPE Western Regional & AAPG Pacific Section Meeting, 2013 Joint Technical Conference held in Monterey, California, USA, 1925 April 2013.
This paper was selected for presentation by an SPE program committee following review of information contained in an abstract submitted by the author(s). Contents of the paper have not been
reviewed by the Society of Petroleum Engineers and are subject to correction by the author(s). The material does not necessarily reflect any position of the Society of Petroleum Engineers, its
officers, or members. Electronic reproduction, distribution, or storage of any part of this paper without the written consent of the Society of Petroleum Engineers is prohibited. Permission to
reproduce in print is restricted to an abstract of not more than 300 words; illustrations may not be copied. The abstract must contain conspicuous acknowledgment of SPE copyright.
Abstract
This paper presents a new generalized global model approach for failure prediction for rod pumps. By embedding
domain knowledge into an Expectation Maximization clustering algorithm, the proposed global model is able to
statistically recognize pre-failure and failure patterns from normal patterns during the training stage. Compared
with previous field-specific models, the enriched training set for the global model learns from much larger scale of
normal and failure examples from all fields with which a generalized Support Vector Machine (SVM) is trained and
is able to predict failures for all fields. The data set for this paper is taken from five real-world assets using rod
pump artificial lift systems which contain nearly 2,000 rod pumps. The results show that the global model of failure
prediction is capable of capturing future rod pump and tubing failures that produce acceptable precision and
recall. The resulting global model is scalable and can be used for predicting failures for proactive maintenance to
reduce lost oil production. Results from our case studies with multiple fields data show that precision and recall
are better than 65% with this global model.
Our prior work [1] used machine learning techniques to generate high quality failure prediction models with good
accuracy. However, these efforts suffer from two major drawbacks. First, this used machine learning techniques
that require labeled datasets for training the model. Generating these labeled datasets is human-intensive and
time-consuming. Second, this model is field-specific which is only applicable to the specific field from which the
labeled dataset is derived. These field-specific models generally perform poorly on other fields because of the
differences in the data characteristic caused by field geology, operational procedure, etc. Moreover, these models
have to be maintained independently, which accordingly raises nontrivial maintenance costs.
Keywords: failure prediction, global model, rod pump, artificial lift, semi-supervised learning
Introduction
Artificial lift systems are widely used in oil industry to enhance production from reservoirs with levels too low to
directly lift fluids to the surface. Among various artificial lift techniques in the industry (such as Gas Lift, Hydraulic
Pumping Units, Electric Submersible Pump, Progressive Cavity Pump and Rod Pump techniques), the Sucker
Rod Pump technique is the most commonly used. In a typical oil field, there are hundreds of wells, and there are
many fields operated by the same organization that may have various geological formations. Well failures in oil
field assets lead to production loss and can greatly increase the operational expense. Experts with rich
experience are capable of identifying anomalies via combining different types of information such as the wells
recent performance, its events log and its neighboring wells performance. Such anomalies, once identified, have
high probability to be followed by a failure in the future. Such anomalies might have already been causing
economical losses. Thus either proactive maintenance or repair workover needs to be scheduled to reduce
economic losses. However, with limited number of trained personnel and resources for managing large fields,
such proactive operation is impossible to be at full speed. Therefore, automating the monitoring and operating
such fields will be important to achieve higher economic benefits. These fields are instrumented with volumes of
2 SPE 165374
data being collected including historical event log and time series parametric data. Such field data, along with
experts knowledge, are very useful to data mining methodologies to predict well failures. Successful failure
predictions can dramatically improve production performance, such as by adjusting operating parameters to
forestall failures or by scheduling maintenance to reduce unplanned repairs and to minimize downtime. In a Smart
Oil Field, the decision support center uses measurements from fields for efficient management and operation.
The reasons for rod pump failures can be broadly classified into two main categories: mechanical and chemical.
Mechanical failures are caused by improper design improper manufacturing or from wear and tear during
operations. Well conditions may contribute to excessive wear and tear, such as sand intrusions, gas pounding,
rod cutting and asphalting. Chemical failures are caused by the corrosive nature of the fluid being pumped
through the systems. For example, the fluid may contain H
2
S or bacteria. For rod pumps one of the major
mechanical failures is called Tubing Failure, where its tubing is leaking because of accumulated mechanical
frictions and cutting events. A tubing leak does not cause a rod pump to shut down. It is happening down-hole
which makes it difficult for a field specialist to identify its anomalous status via visual or sound inspection. If not
discovered in time, the leaking causes continuous loss of production and reduces the rod pumps efficiency
significantly.
The main focus of this paper is in predicting down-hole tubing leaks and pump failures of sucker rod pump
production wells on a daily basis across heterogeneous fields using data mining and machine learning
approaches. By prediction we mean to detect the early signals such as mechanical frictions, prod events that
potentially lead to a tubing leak, and once the leak happens, we are also able to detect it in time. This problem
can be categorized as anomaly detection problem. Various techniques have been applied on similar problems. [2,
3] uses hard drives S.M.A.R.T. (Self Monitoring and Reporting Technology) log is the major source of predicting
disk drive failures. The S.M.A.R.T. log is comprised of drive aging, drive power cycles, errors corrected by inner
ECC code, mechanical shock, and so on. A Nave Bayes classifier is constructed to learn the past failures so that
it can estimate the failure probability for other disks. [4, 5] uses on OTC (over the counter) drug sales, as well as
customers information such as gender, age, seasonal information such as weather, temperature to construct a
complex Bayesian network to probabilistically infer disease outbreaks.
Our prior work [1] used machine learning techniques to generate high quality failure prediction models with good
accuracy. However, it suffers from two major drawbacks. First, it uses traditional machine learning techniques
that require labeled datasets for training the models. Generating these labeled datasets is human-intensive and
time-consuming. Second, this model is field-specific which is only applicable to the specific field from which the
labeled dataset is derived. Field-specific models generally perform poorly on other fields because of the
differences in the data characteristic caused by field geology, operational procedure, etc. Moreover, these models
have to be maintained independently, which accordingly raises nontrivial maintenance costs.
In this paper, we present a generalized global model for failure prediction that works across multiple rod pumps
located across multiple fields. Machine learning based labeling approach that involves clustering and rule-based
filtering is used to automate the labeling process. Integration of training sets across multiple fields showed that
this labeling approach is effective. Our experimental results show that the precision and recall for failure
predictions are very good. Furthermore, a single global model can be employed for predicting failures for rod
pumps in multiple fields while reducing maintenance cost.
Problem Description
All the wells data in this study are collected by Pump Off Controllers (POCs). These POCs gather and record
periodic well sensor measurements indicating production and well status through load cells, motor sensors,
pressure transducers and relays. Some attributes recorded by these sensors are card area, peak surface load,
minimum surface load, strokes per minute, surface stroke length, flow line pressure, pump fillage, priorday cycles,
and daily run time. From that one could calculate GB torque, polished rod HP, and net DH pump efficiency.These
attributes are measured daily, sent over wireless network and recorded in a LOWISdatabase. LOWIS stands for
Life of Well Information Software and saves the historical data about well information. In the LOWIS database
these attribute values are indexed by a well identifier and a date.
Based on our previous study we discovered that the trends of the 4 most reliable attributes are the key indicators
for a potential anomaly, plus two calculated features:
Existing attributes from LOWIS for trends extraction: card area, peak surface load, minimum surface load,
daily runtime;
Calculated attributes:
SPE 165374 3
o card unchanged days: If all card measures (area, peak and min load) reject to change, this
integer field keeps increasing with missing dates considered as unchanged.
o daily runtime ratio: percentage of runtime with respect to a full 24 hour running.
Combining the trends and calculated attributes, we can formulate failure prediction as a machine learning problem
that given a new features of a new date from a well, the output is whether the well is staying normal, having
potential failure, or is failing.
Failure Prediction Algorithms for Rod Pumps
In order to formulate it as a machine learning problem, more specifically, classification problem, we know that we
have to separate the task into three steps after preprocessing: feature extraction, semi-supervised learning and
evaluation. In this chapter, we will describe these steps in our approache one by one.
Feature Extraction
As was previously studied in [1], we extract both long-term and short-term trends for existing attributes for each
time sample by the dual moving median process as is illustrated in Figure 1.
Figure 1 Moving median feature extraction process
For each attribute, three medians are calculated for trends extraction:
Global median: long-term performance, e.g. past 3 months;
Median 1: short-term performance, e.g. most recent week;
Median 2: current performance, shorter than the short-term window, e.g. recent 3 days.
The actual features are then calculated by dividing Median 1 and 2 with global median. A trivial value o is added
to avoid computational outliers.
We discover that there is correlation between unchanging features with potential failures. The observation is
based on seeing many sudden failures after episodes of unchanging features. It is natural for daily data to
fluctuate because of real-world uncertainties that govern complex systems. According to reliability studies of many
fields [6], because models of complex systems are usually not accurate enough to predict reliably where critical
thresholds may occur, it would be useful to build a set of reliable statistical procedure to test whether the
environment is in a stable state, or at a critical tipping point in the future. This statistical procedure for our
problem, that exhibits the statistics for measuring system reliability, is the number of days that reliable critical
DualMoving
Median Process
4 SPE 165374
attributes do not change (CardArea, PeakSurfLoad, MinSurfLoad). Here, by unchanging we mean values not
changing by a single digit. Figure 2 is an example of this situation. The system has been shown experiencing
unreliable states when the CardUnchangeDays accumulates between March, 2012 and April, 2012. This is
followed by a critical downfall of CardArea which ultimately leads to a Tubing Failure. In many cases, this
unchanged days marks either POC problems that fail to read the parameters correctly or the actual situation
when the rod pump is problematic.
Figure 2 CardUnchangedDays correlates to sudden failures
RunTimeRatio is another parameter that we added for prediction. In our previous version, trends of DailyRunTime
are used. However the ratio of the runtime with regards to 24 hours/day is also relevant to a failure. From Figure
3, because of the sliding window process, we can observe that the trend DailyRunTime2 has not reached the
tipping point around early May, 2012 when the system has already almost stopped functioning. RunTimeRatio
captures this information in its earliest stage. So by including this feature we are expecting our prediction can be
slightly earlier compared with previous predictions.
Figure 3 Impact of RunTimeRatio
SPE 165374 5
Clustering Labeling with Rule Enhancement
Labeling for training data is one of the key components of failure predictions. In statistical learning, the
assumption is that there is one or more generative distributions that generates each type of failure and normal
examples. The data reveals only the portion of the observations.
Table 1 Rule-enhanced clustering labeling algorithm for failure prediction
Input: failure vs. normal rate r, number of failure training example n.
Output: labeled training set S with
+1
prIors(I)
, Clustei
IaIIurc
= iux(n)
3. If f
typc
= Tubing Failuie then
1.
CardArca
=
1
count(Idx=CIustcr
IaIue
)
X
I
(CaiuAiea2)
I,Idx(I)=CIustcr
IaIue
2. If
CardArca
> z then // Card is not shrinking enough
1. Goto Step 7
3. Endif
4. Endif
5. If Clustei
normaI
= Clustei
IaIIurc
and Piiois(iux
normaI
) > u.S then
1. Clustei
prc-IaIIurc
iux that iemain assigneu
2. S = S {(X
, f
typc
)| iux(i) = Clustei
IaIIurc
|
4. S = S {(X
, piefail
typc
)| iux(i) = Clustei
prc-IaIIurc
]
6. Endif
6. Endif
7. J = J - {P]
8. Repeat Step 3 until failure training reaches n or J = .
9. Collect all normal wells signatures from all available FMTs in a sampling pool Q
10. Randomly sample Q, = {X
t
]
t=1
n
, where X
n
is the most recent non-missing signature before
of the wells valid training range
11. If n < then
1. Goto Step 16 // Not enough signatures for labeling
12. Else
1. |piiois, iux] = EN(P, numbei of clusteis = 2) # Expectation maximization of GMM
2. Clustei
normaI
= iux
max
prIors(I)
3. If Piiois(iux
normaI
) > u.S then // Major cluster
1. S = S {(X
Failure 54 47
53.5 88.5
Normal 7 392
2
Failure 73 18
80.2 69.5
Normal 32 190
3
Failure 72 25
74.2 69.9
Normal 31 271
4
Failure 39 15
72.2 60.0
Normal 26 193
5
Failure 40 40
50.0 47.1
Normal 45 337