Lithogeochemical Prospecting: Signals Processing Applied To Segmentation of Geochemical Borehole Profiles
Lithogeochemical Prospecting: Signals Processing Applied To Segmentation of Geochemical Borehole Profiles
Introduction
The interpretation of geochemical blem of computing these descriptions as
borehole profiles is an exercise in signal follows:
understanding, and hardly any signal
'A great deal of effort has been ex-
understanding task can be performed using pended to obtain this kind of primi-
the raw numerical signal values directly: tive qualitative description, and the
problem has proved extremely difficult.
some description of the signal must first The problem of scale has emerged consis-
2 tently as a fundamental source of
be obtained (Nii ). Such descriptions
difficulty, because the events we per-
should be as compact as possible, and ceive and find meaningful vary enor-
their elements should correspond as mously in size and extent. The problem
is not so much to eliminate fine-scale
closely as possible to meaningful objects noise, as to separate events at diffe-
or events in the signal-forming entity. rent scales arising from distinct phy-
sical processes. It is possible to
A discussion of the abstract aspects to introduce a parameter of scale by
computing these descriptions and their im- smoothing the signal with a mask of
variable size, but with the introduc-
portance to interpretation of geochemical tion of scale-dependenoe comes ambi-
profiles is presented below. Andrews P. guity: every setting of the scale
parameter yields a different descrip-
Witkin 3 has described the practical pro- tion; new external points may appear,
signals resulting from the chemical ana- how objects are identified (eg: fourth
lysis of borehole samples. sample from top, or, sample ABC-24), and
what knowledge is available to the execu-
Formal and abstract aspects of borehole tive object regarding generation of attri-
profiles bute values.
In the domain of geological inference A borehole segment is an information
and reasoning (by geologist or computer) object representing any sampled section of
one can distinguish between physical ob- a borehole.
jects (minerals, boreholes), information For the geologist, provided the name of
objects (borehole logs, data files) and the borehole from which the segment origi-
executive objects (the computer, the geo- nates is known, and provided the borehole
logist). Executive objects are active, data base is available, the most obvious
while physical and information objects are primary attributes of a segment are its
passive. They are active because they can starting depth, ending depth and name, if
manipulate either, or both, of physical it has one. Any other attributes are
objects (sample them, name them, destroy secondary, as they can be canputed or
[fiX) dx]
0..
llb-a)
pattern definition (ie: during interval
parsing). It proves expedient, however,
to express them only implicitly (as re-
where: lationships between signal values or seg-
x = depth ments) in segmented representations of raw
a = depth at which sample starts
b = depth at which sample ends signal data, as is made clear below.
f(x) = chemical attribute value at The above characterisation of borehole
depth x
signal values helps develop methods of
We shall call such a signal value an transforming and describing the signals in
'integrated signal value'. Signal values a manner most sensitive to meaningful
of this type are in a different class from entities they may reflect, by highlighting
signals measured discontinuously, at a signal attributes which may assist segmen-
succession of regularly (or irregularly) tation.
spaced points, which we shall call 'sJ;Xlt
signal values'. Most digitally acquired Use of signal attributes for segmentation
borehole magnetometer logs are examples of One interesting attribute, in terms of
For the geologist, only the first three a: include infonnation related
items above are truly primary segment to signal values on either
attributes. side of the marginal signal
value (ie: the discontin-
Segments are named uniquely within each uity occurs within the
sample). This cannot arise
signal description. A signal description for spot signal values.
(in fact an interval) has only one name
b: include information related
pertaining to the multiple-scale nested only to one of its adjacent
list which describes it. Different levels signal values (ie: a single
discontinuity occurs at one
of that description are accessed proce- margin of the sample).
dural 1y, not by name.
c: include infonnation related
to neither of its adjacent
Segmentation: Phase one signal values (ie: there
are discontinuities on both
During Phase One of its operation, the margins of the sample) •
Segmentation Process applies the knowledge
In practice, application of this
that, in the majority of cases, along a knowledqe is distributed between
Phase One and Phase Two segmen-
sequence of borehole samples, when there tation, because it can require
is a change in the direction of change of consideration of relative signal
value magnitudes (considered in
an attribute value, that change reflects a Phase Two), rather than simply
change in the sampled world, and may be their directions of change.
HiH DISCONTINUITY
LOCATIONS
SIGNAL
i i
MAGNITUDE SIGNAL
MAGNITUDE
DEPTH DEPTH
III IIDD DD I I I I D I I I I
C DD DD D I I
INCREASING!DECREASING TRACE (L TO R)
INCREASING! DECREASING TRACE (R TO L)
I ID
G D D D DID DDD)
INCREASING! DECREASING TRACE (L TO R)
INCREASING! DECREASING TRACE (R TO L)
'-r-'
SAMPLE ASSIGNED TO DISCONTINUITY t
(INCLUDED IN BOTH SEGMENTS) JUNCTION ASSIGNED TO DISCONTINUITY
FIGURE lea). Type (1) direction-change pattern. Finite FIGURE l(b). Type (2) direction-change pattern. Finite
automaton searches for XXYY pattern in L to automaton searches for XXYXX pattern in L
R direction, where X = I and Y = D, or to R direction, where X = I and Y = D, or
X = D and Y = I X = D and Y = I.
SIGNAL
MAGNITUDE
I DEPTH
SIGNAL
MAGNITUDE
I DEPTH
SIGNAL
MAGNITUDE
r
DEPTH
annotated in Figures l(a), l(b) and l(c), The above theoretically developed
and described below. approach to segmentation is implemented in
three steps:
Pattern (1): In patterns of Type (1) in
Figure 1, on the basis of direction of
signal value change a16ne, there is no (i) A dichotomized trace (in one di-
evidence for the exact location of a rection) of the relevant bore-
discontinuity, if, indeed, one exists, at hole profile is produced, re-
all. Further, depending on the direction flecting each signal value's de-
in which the value sequence is evaluated, creasing or increasing relation-
different values become the first values ship to the sample before it.
of a new sequence . Consequently, because
conditions (ii) and (iv) a above apply, the (ii) This trace serves as input to a
marginal signal value is included in finite automaton subroutine,
segments to either side of it - as the which uses it to recognise and
first in one, and the last in the other. output Phase One segments (as
defined by their first and last
Pattern (2): In patterns of Type (2) in samples) •
Figure 1, on the basis of direction of
signal value change alone, there is (iii) Segment attributes are calcula-
evidence for a discontinuity, and for its ted, and stored in segment 14-
unequivocal location - although this tuples, and accumulated into a
becomes evident only after the direction list of segments which consti-
of change of the second sample after the tutes the interval that repre-
discontinuity has been checked. The sents the entire length of the
pattern recognition is not direction- borehole.
sensitive, and condition (iv)a or (iv)b
above applies, allowing segmentmarginal In practice, all these functions are
samples to be included in only one seg- carried out by a Pascal program
ment each. (The discontinuity itself may
occur in either of the two samples, or (PROLSEG4) , which is called from a Core
exactly between them.) program. The only operations for which
Pattern (3): Patterns of Type (3) may be the Core program itself is responsible,
regarded as special cases of Type (1) or are the following:
Type (2), and may be managed as either,
depending on whether condition (iv)a, (i) Receipt of the user's command to
(iv)b or (iv)c is thought to apply. If initiate the activity, and his
condition (iv)c applies, management as for selection of attribute (element)
Type (2) suffices. A fourth alternative and data set (borehole).
is to regard the entire extent of such
sequences as one segment. This policy was (ii) Initiation of the PROLSEG4
adopted initially, because such patterns program.
arise cornn:DI11y in geochemistry, when the
only difference between successive values (iii) Receipt from the PROLSEG4
is signal noise, or because of ~npling program of all of the output
effects resulting from under-sized segments, their structuring into
samples. It has the effect of minimising a nested list, and naming of the
the generation of many single sample, or resulting interval.
two sample segments. However, resolution
of signal structure can be severely Segments resulting from execution of
compromised, and this policy was dropped Phase One Segmentation of the Barium pro-
in favour of the Type (2) approach. This
causes Type (3) patterns to produce a file from Borehole 46gr are shown in
succession of single sample segments. Figure 2, alongside a plot of the original
Clearly, during data interpretation,
condition (iv) a, (iv)b and (iv)c may need data.
to be considered, and management of this Figure 3 explains the structure of the
task at higher levels in the system is
facilitated by single sample segments, nested lists used to represent intervals
rather than by the storing of two-sample during this and the following stage of
PHASE I
PHASE 2
PHASE 3
PHASE 4
ie:
FIGURE 3, Nested list structure designed to represent both split and merge components of an interval.
(It has major problems in representing overlap,)
r
SIGNAL
MAGNITUDE
DEPTH
..
ONE TWO
DISCONTINUITY DISCONTINUITIES CURVILINEAR
1 ! j
0
1
r X X
SIGNAL
MAGNITUDE I 1
B 1
DEPTH
)(
)( x METRIC =------___________ _
METRIC = METRIC =---------- -_______ _
((ot-b)/2) mo)( «otb)/2); (d)) mo)(((otb)/2);
((ctdte)/3))
PHASE
ONE
SEGMENTATION +
PHASE
TWO
SEGMENTATION
+
REDUNDANT
SEGMENT
FIGURE 6. The development of redundant degenerate segments, by a combination of phase one and phase
two segmentation
FIGURE 7. Nested list structure designed to represent successively merged components of an interval during
segmentation
representations of the original signal are them all their finer scale detail, which
needed for optimal feature recognition. may readily be accessed when necessary.
Phase Four Segmentation transforms re- Transformation of finer scale signal
latively finer scale symbolic representa- representations to coarser scale repre-
tions of geochemical profile signals to sentations is essentially the controlled
relatively coarser scale representations. merging of segments.
As an activity, it may be executed repeat- Phase Four Segmentation uses an 'Adja-
edly (with different parameters) on the cent Segment Area Difference' metric
same interval, to give ever more coarse- (ASAD) to control segment merging. If
scale representations of the same signal. this metric, as evaluated for both di-
Coarser scale representations carry with rections across a boundary between two
DEPTH DEPTH
DEPTH DEPTH
FIGURE 8. Graphical illustration of the evalution of the 'adjacent segment area difference' metric,
emphasising its direction-dependency
FIGURE 9(a). Phase four segmentation without the discontinuity preserving (DP) metric (ASAD thresholds
for 9(b»
FIGURE 9(b). Phase four segmentation with the discontinuity preserving (DP) metric set to 0,35.
Profile xbal is derived from prifile b46gr ba 2 of Figure 2 with an ASAD threshold of 1000,
xba2 from xbal with a threshold of 2000; likewise with thresholds of 4000, 6000, and 900
up to profile xbz9
.0 .0
t!) t!)
"0 "0
Cl> Cl>
Ba
o 4000 0 4000 0 4000 -3004000 -229 39988
o
(5
o
ot!)
o
.0
FIGURE 11. Segmentation phase one, two and four as applied to samples of variable length over the full
extent of a borehole. Profile (2) is the output from phase one segmentation, profile (3) the
output from phase two segmentation (the effect of which is well illustrated in the top three
samples of the profile). Profile (4) and (5) result from phase four segmentation at ASAD
thresholds of 2000 and 5000 respectively (DP metric was held constant at 0,35). Although
the linear regression approximation to the long segment near the centre of profile (5) is
obviously inappropriate (and produces a negative result at one extremity), the technique
provides a good approximation to most of the profile. The profile plotted in position (6)
is the same as that plotted in position (5), but is plotted at a different scale. It should be
compared with the second profile of Figure 10