John Ferguson (Auth.) - Mathematics in Geology-Springer Netherlands (1988)
John Ferguson (Auth.) - Mathematics in Geology-Springer Netherlands (1988)
John Ferguson (Auth.) - Mathematics in Geology-Springer Netherlands (1988)
Mathematics
in Geology
Mathematics
in Geology
TITLES OF RELATED INTEREST
Ferguson, John
Mathematics in geology.
1. Geology--Mathematics
I. Title
510'.24553 QE33.2.M3
ISBN 978-94-015-4011-7 ISBN 978-94-015-4009-4 (eBook)
DOI 10.1007/978-94-015-4009-4
Ferguson, John.
Mathematics in geology.
Bibliography: p.
Includes index.
!. Geology - Mathematics. I. Title.
QE33.2.M3F47 1987 510'.24553 87-14336
ISBN 978-94-015-4011-7
Preface and acknowledgements
List of tables XI
Introduction
1.1 Solution of geological problems - are mathematical
methods necessary? I
1.2 Mathematical formulation of a geological problem 3
1.3 Computational aids 7
1.4 Speed and accuracy II
1.5 Computer literature 12
2.1 Algebra 13
2.2 Geometry and trigonometry 43
2.3 Probability theory 51
3.1 Definitions 56
3.2 Some common series 59
3.3 Geological applications 63
References 285
Index 289
List of tables
Y= f(A, B, .. .)
Y= f(A, B, C, D, D', E, F)
(a) A schematic map showing the subdivision of the delta into 100
sedimentation zones. For purposes of the simulation, the co-
ordinates of the corners of each zone are used as data to enable
the correct sedimentation sequence to be assigned, for each
depositional episode.
6 INTRODUCTION
K= dz/z
dcr
where the thickness passes from z to (z- dz), i.e. from ~Z to ~Z' in
Figure 1.1.
These two equations can be related back to that given by Elliott
(1985), through the pore index p, which for original peaty material
COMPUTATIONAL AIDS 7
_________
-rr-
hs hl
~~~~ _jl_
I..I. .Ill
I........... ..,"z.----____,.J.h;-
... .. t ~
'../.z.i.i.i.
........ . ,_z.i. ..::I._Jf__
Speed and accuracy are two problems which, although they may
appear peripheral to the majority of matters discussed in this book,
turn out to be important in practice. Needless to say, they affect
some algorithms more than others. Later, and particularly in the
context of numerical analysis, some of the detail will be discussed
more fully. At this point a few general comments can be made. Since
it is fairly safe to say that these are usually not factors of concern to
the average mainframe-computer user, the comments are particu-
larly directed at users of micros. Some of the comments, especially
those concerned with accuracy, also apply when using hand-held
calculators.
Problems related to speed and accuracy arise in many ways, but
most importantly they relate to the word-length and the speed of the
machine being used. In some applications the efficiency of the
program itself will play an important part, particularly with regard
to the speed of execution. Since accuracy is related to word-length,
i.e. the number of significant figures which can be carried as a
number, it is fairly easy to check. It is possible to write a program to
evaluate a number such as 1/3, which cannot be expressed as a
rational number, and then successively to multiply the result by the
fractions 1/10, 1/100, etc., for say 50 increments. Performing such a
calculation on the BBC micro using the BASIC compiler, it is found
that the dynamic range of the exponent is down to 10- 39 , while the
12 INTRODUCTION
2.1 Algebra
y=ax
1.0
30
E
E
I
f- 20 0
l.?
z
L.U
a= 49 o
_J 10
10 20 30 1.0
WIDTH mm
Figure 2.1 Plot of length against width for the measurements given in Table 2.la, of
the Jurassic brachiopod Epithyris oxonica (Buckman). Length = length of pedicle
valve; width = maximum width of shell at right angles to length.
particularly if the two axes are not equally scaled), the vertical rise
per unit length increase along the horizontal axis can be calculated.
Whichever method is used, the sign for the slope is determined by its
direction, being positive if the slope is upwards from left to right,
and negative if the slope is upwards from right to left. In this
example the measured angle e = 49 and tan e = 1.15.
Alternatively, we can measure; thus for a rise of 36 units on the
(vertical) y-axis, we have 31 units on the (horizontal) x-axis, so
ALGEBRA 15
36/31 = 1.16
Plotting the data relating to Turitella sp. from Table 2.1, which is
illustrated in Figure 2.2, we find that the best-fit line no longer passes
through the origin of the graph, but has an intercept on they-axis, at
-8.5 mm. Any equation describing the line must take this factor
into account, so the general equation is
y=ax+b
0=79, tan0=5.145
or
39/8 = 4.875
40
E 30
E
I
f- 20
lJ
zw
_J
10
9= 79
0 1+----,-1'c-o-~2o
Figure 2.2 Plot of length against dia-
DIAMETER mm meter for the measurements given in
-1 0 Table 2.1 b, of the Eocene gastropod
Turite//a sp. Diameter = diameter of last
preserved whorl; length = length of pre-
- 20 served spire.
16 USEFUL MATHEMATICAL IDEAS
0.0 5.5
1.5 5.1
3.0 5.375
7.0 4.75
21.5 3.75
32.0 2.5
36.0 1.75
:r
E
a.
a.
,---, g_ 6"
+
N
0'
6
10 20 30
TIME h
Figure 2.3 Plot of the concentration of Mg2+ in aqueous solution against time,
recorded during an experimental study of the phase change aragonite-<:alcite in the
presence of magnesium (after Bischoff 1968).
ALGEBRA 17
The first thing which is apparent is that the slope of the line is in
the opposite sense to the previous examples. If we follow the rules we
have already used, we find
remember, since the line slopes from left to right the sign of the angle
e is reversed; alternatively
8 = (1.75- 5.5)/32
= -0.117
y= ax+ b
Particle diameter
Size class Cumulative weight
percentage
mm <p units
2.00 -1.00 9
very coarse 1.40 -0.49 18
1.00 0.00 27
coarse 0.71 0.49 36
0.50 1.00 45
Sand medium 0.355 1.49 54
0.25 2.00 63
fine 0.18 2.47 72
0.125 3.00 81
very fine 0.09 3.47 90
0.063 3.97 100
q> = -logz d
logzx = lnxjln z
(a) If the data are plotted on ordinary graph paper a straight line is
produced, i.e. the plot is linear.
(b) The calculation of various statistical parameters used in sedi-
ment grain-size analysis is much simplified.
'""
-.../
v'<
'&'""'
2
V1
r-
z
:J
-B-
1:L
:L
sa 100
CUMULATIVE%
where gu> =the growth rate, kg= Boltzmann's constant for the
reaction and t = time. This equation simply means that the rate of
calcite precipitation will be dependent on the surface area available
for nucleation of the crystals. Clearly, since the plot of t 2 is linear,
and not t 3 as expected, then the model described by the equation is
not satisfactory. Thus, the authors were able to conclude (Bischoff &
Fyfe 1968, p. 74) that the growth rate of calcite, under the conditions
of the experiment, is independent of the surface area available for
nucleation.
These last two examples have shown that the relationship between
two measured variables x and y is not necessarily linear, although
there are many areas of geological interest, where simple measure-
ments can be made, which follow the linear equation. It is suggested
that the student spends some time trying out a few ideas of his or her
own, measuring and plotting pairs of variables of geological interest,
to see what sorts of plots are obtained. In order to assist in this
exercise, Program 2.1 is a listing of a graph-plotting program, which
will plot a graph on standard (22 x 28 em) listing paper. It should be
noted that it is intended solely for printer output, and the plot which
appears on the VDU during the running of the program will not be
easily interpreted.
So far in our discussion of linear equations we have considered
them as a means of expressing the relationship between two mea-
sured variables, one of which is, or is arbitrarily designated as, the
dependent variable, the other being the independent variable. Going
back to the earlier example, illustrating the growth of Turitella sp.,
the fitted line had the equation
y = 4.875x- 8.5
ALGEBRA 21
where x =the diameter of the last chamber (the independent vari-
able) andy= the length of the spire (the dependent variable). In this
example the coefficient of the x-value is defined as being a measure
of the slope of the line, the constant - 8.5, is the intercept on they-
axis, and the coefficient of they-value is unity. We can rewrite this
more conveniently, with the unknowns x and y on one side of the
equation (rounding-off in the process) as
5x- y = 8.5
ax+ by= c
5x- y = 8.5
y = 16.5 (when x = 5)
5x- y = 0
x= 2 (when y = 10)
y
20 (a)
2
10
-10
"r ,.1y
10
X
5 10
-10
Figure 2.5 Plot of families of functions related to the linear equation 5x- y = 8.5.
(a) Curves produced by altering the coefficient of they-value
I. 5x-y = 8.5 2. 5x- 3y = 8.5
3. 5x-5y=8.5 4. 5x-10y=8.5
(b) curves produced by altering the intercept or constant value.
I. 5x-y=O 2. 5x-y=8.5
3. 5x-y= 17
ALGEBRA 23
or, if
5x- y = 17
These results are the same as those obtained by solving the corres-
ponding pairs of simultaneous equations. For example,
5x- y = 8.5
X =5
5x- y = 8.5
- 5x =25
-y=-16.5
5x- y = 8.5
9y=O
Quadratic
y = a:il + bx + c
Cubic
y = ax? + b:il + ex + d
y
(a) 40
X
-5
y
(b) 80
X
- 5 5
Quartic
y = ax4 + bx3 + ex? + dx + e
which can have three points of inflexion; either one maximum and
two minima, or two maxima and one minimum. The example
illustrated (Fig. 2.8) shows only one minimum over the range
plotted.
y
(a) 80
X
-5
-20
y
(b) 40
X
-5
-10000
Figure 2.8 Example of the graph of a quartic equation: y = O.OOlx" - 0.08x' +
0.5x2 - x + 15.
y= ax?+ bx+ c
ax? + bx + c = 0
y = 2:x? - 3x + l
(2x ) (x )
(2x I) (x 1)
(2x- 1) (x- 1) = 0
therefore x = 1 or x = l.
The rules for determining the sign of the factors of the constant
are:
(a) if both the x-term and the constant term are positive, then the
factors of the constant term will also be positive;
(b) if the x-term is negative and the constant term is positive, then
the factors of the constant will both be negative;
(c) if the x-term is positive and the constant term is negative, then
the factors of the constant term will be positive and negative.
2x-l=-x
X
x-l=-2x +
-3x
the arrows indicate multiplying and the two products should add up
to the middle term of the original expression, the sign will also match
if the factorization is correct.
Not all examples are so simple that they can be factored in this
way, and a formula which allows solution has to be used. If a and b
are the coefficients of the r-
and x-terms, respectively, and cis the
constant, then
- b -J(b 2 - 4ac)
X= _ _---..:.,o------'-
2a
x = 3 or -2/3
x = (1 + 2z) or (1 - 2i)
x = -bj2a
140 208 40 57
100 180 55 78
76 127 44 61
100 180 26 36
86 150 50 69
ll2 195 60 80
IIO 180 52 65
150 174 64 82
90 162 55 70
77 140
30 USEFUL MATHEMATICAL IDEAS
200
Adults
1 50
E
E
I
f-
lJ
z
L.U
....J
100
..
L.U
>
a:
:::>
u
50 Juveniles
50 100 150
WIDTH mm
Figure 2.9 Plot of measurements given in Table 2.4, relating to the Carboniferous
brachiopod Gigantoproductus injlatus (Sarycheva). The curved length is measured
around the perimeter of the pedicle valve starting at the tip of the umbo, at right
angles to the hinge; the width is measured along the hinge line.
ALGEBRA 31
different form of curve. That for the juveniles is a straight line, while
that for the adults is a curve. The form of the adult data plot is
typical of this genus (Prentice 1956; Ferguson 1978) and illustrates
the principle of allometric growth, i.e. the relationship between the
size of one part of an organism and the size of another is not linear
but follows the general equation
CL = 5.25 W0 75
2.15 2.32
2.00 2.26
1.88 2.10
2.00 2.26
1.94 2.18
2.05 2.29
2.04 2.26
2.00 2.24
1.95 2.21
1.85 2.15
32 USEFUL MATHEMATICAL IDEAS
2.5
....J
u
"" 2.25
0
2.0 2.2
log W
Figure 2.10 Plot of the logarithms of measurements of adult members of the
population of Gigantoproductus inflatus (Sarycheva) (data from Table 2.5).
or as
C =Co e -kt
or
InC= In C0 - kt
ALGEBRA 33
where C = concentration at a time t during the reaction, C0 = con-
centration at the start of the reaction, k = the rate constant and
t =time.
One method of finding k for reactions is to determine experimen-
tally the changes of concentration of a reactant with time and to
solve the problem graphically (Lasaga 1981, pp. 13ff.). Table 2.6
gives data from experimental cementation of aragonite ooids, show-
ing the rate at which strontium is released from the aragonite during
its conversion into calcite. A plot of this data, Figure 2.lla, shows
the characteristic exponential curve. Replotting the data using
logarithms, as in Figure 2.11 b, reduces the curve to two straight
lines, which show a break after about 20 days. This result is
interpreted as indicating a major change in reaction rate- a slowing
down - after the time in question. Based on other evidence, this is
thought to be the time of onset of cementation when rim cements
start forming around the ooids (Ferguson et a/. 1984). The rate
constants deduced from the graph are 1.017 and 0.5, indicating a
drop in rate of just over one-half.
Table 2.6 Data relating to the increase of strontium in sea water during the
experimental cementation of aragonite ooids at 200"C and I 500 lb in.-'
0 7.7 2.04
7 1.95 30 3.4
12 2.49 93 4.53
15 2.71 125 4.83
19 2.95 153 5.03
27 3.30 156 5.05
51 3.93 230 5.44
63 4.14 310 5.74
78 4.36 300 5.70
2 00
E
a.
a.
.---.
+
N
'-
(f)
'--'
100
2 4 b
In TIME days
Figure 2.11 Plot of experimental data showing the release of Sr2 from aragonite
during its conversion into calcite, plotted against time. (a) Plot of raw data; (b) plot of
logarithms of data. (Data as in Table 2.6.)
ALGEBRA 35
shorthand notation which is extensively used in mathematics, and it
is now convenient to explain its meaning and use.
The original meaning of the word 'function' in mathematics was
that some quantity, denoted by the symbol y, is related to another
quantity, denoted by x, by a simple formula or rule. A function is
written symbolically as
Y = f(x)
Y = f(x)
S = f(Q,R,D,M)
subsidence, the rate of dispersal and the nature of the material. Also,
although the nature of the relationship, i.e. the function, is not
defined, then the concept has allowed the development of a simple
computer simulation model (Harbaugh & Bonham-Carter I970, pp.
373ff.).
x2-2x+5=0
Using the equation it was found that the roots are given by:
2 4.J- I
2
x = (l + 2z) or (I - 2z)
16 <X< 30
10<y<l5
25x + 20y= z
relating profit from each grade of ore to the overall profit from the
operation should be maximized. This is known as the objective
function.
38 USEFUL MATHEMATICAL IDEAS
X= 16; X= 30
y= 10; y= 15
and
25x+ l0y=750
Notice that the inequality signs have now been replaced by equals
signs. Working through each, it should be noted that:
(a) x must be greater or equal to 16, therefore the area to the left of
x = 16 cannot be considered;
(b) x must be less or equal to 30, therefore the area to the right of
x = 30 cannot be considered;
(c) y must be greater or equal to 10, therefore the area below y = 10
cannot be considered;
y
X= 16 X =30
30
25x 10y" 750
20
1 6,1 5 24,15
y =15
0 ::::::::::.c
I :::::::::::.
10 i:------------,1'"""'6-,,1c-:~:-r-::::...:::..::..:.::..;::::...:::..::..:.:.:..:::=r=~:;;6-:;;,1o;-t Y =10
X
10 20 30
Figure 2.12 Graphical solution of first mine scheduling problem. The objective
function is maximized at x = 24 andy= 15.
ALGEBRA 39
(d) y must be less or equal to 15, therefore the area above y = 15
cannot be considered; and
(e) since the distance travelled cannot be greater than 750 tonne-km
day- I, then the area to the right of the line 25x + lOy= 750
cannot be considered.
16, 15 700
16, 10 600
26, 10 850
24, 15 900
16 <X IO<y
x+ y< 45
20
X
10 20 30 40
Figure 2.13 Graphical solution of second mine scheduling problem. The objective
function is maximized at x = 20 and y = 25.
fireplace A 10 2 30
fireplace B 3 8 50
fireplace C 6 4 25
maximum rock
available
per week 200 150
=200
= 150
x 1 = 15.54, x 2 = 14.86, x 3 = 0
x 1 = - 3.56, x 2 = 0, x 3 = 39.28
x 1 = 0, x2 = 2.78, x 3 = 31.94
Mineral I Mineral2
(%) (%)
rock A 10 10
rock B 30 15
rock C 25 45
From Figure 2.14 we can see that the distances (in geometrical
terms) between the minerals are given by:
44 USEFUL MATHEMATICAL IDEAS
so
#.
B
_j
4:: 2 s
a::
UJ
-z
:r:
A
2S so
MINERAL 2 %
Figure 2.14 Plot of the percentage of two minerals for three rock samples A, B and
C, showing their relationship using the Pythagorean distance coefficient (data as in
Table 2.7).
Thus, for example, the distance between the two rocks A and Busing
this is
= ~425
= 20.62
Similarly,
DA.C = 38.08
Da.c = 30.4
sin 9 = AB/AC
cose = BC/AC
tane = AB/BC
When the triangle does not have a right angle the above ratios do
not hold, and neither does Pythagoras' Theorem. Thus, in the
triangle ABC shown in Figure 2.15b which does not have a right
angle, then to find the unknowns we use the sine rule:
A c
(a)
B [ 8 c.
A
Figure 2.15 The relationship between the length of the sides and the angles of a
triangle. (a) A right-angled triangle; (b) an obtuse-angled triangle.
46 USEFUL MATHEMATICAL IDEAS
This rule is normally used when one side and two angles are known.
If we are given two sides and an angle or three sides, then the cosine
rule should be used:
cr = b 2 +2 - 2bc cos A
The most useful applicativns of these ratios and rules in geology are
in solving problems related to geological mapping and section
construction. To illustrate their use in this important field, we will
consider three simple, but frequently occurring, problems.
(a) Find the true thickness of a stratigraphic unit given its apparent
thickness at outcrop, its dip and the angle of slope on which the
apparent thickness was measured. In Figure 2.16 CA is the
apparent (or outcrop) thickness, b is the true dip and a is the
slope of the surface. The true thickness AB is given by
AB =AS sin b
D = Mtanb
tanb = VjH
Table 2.8 The ranges of values of the three common trigonometric functions for the
four quadrants of the circle. Note that whereas the sine and cosine distributions are
continuous, that for the tangent is not.
Circular
Angle sin cos tan measure
(degrees) (radians)
0 to 90 0 to + 1 + 1 to 0 0 to+ oo 0 to 1t/2
90 to 180 + 1 to 0 0 to -1 - oo to 0 ~t/2 to 1t
180 to 270 0 to -1 -1 to 0 0 to+ oo 1t to 3~t/2
270 to 360 -1 to 0 0 to + 1 - oo to 0 37t/2 to 21t
48 USEFUL MATIIEMA TICAL IDEAS
(1.) . 2A 2 . A
cos
A 2 tan A
Sin = Sin = 1 + tan 2A
( .) t 3A _ 3 tan A - tan 3A
VI an - 1 - 3 tan 2A
(1.) . A _ 2 tan tA
sm - 1 + tan 2tA
( ... ) A_ 2 tan tA
m tan - 1 - tan2tA
we can express the biaxial stress equations in terms of the angle 29,
which is the angle between the shear planes (Fig. 2.17a)
(a) (b)
la1 a, '-. //
j,)-A
'/l;..O"
/
/
/
-----<> <>--- 0"3-----<> / 0"3
0"3 0"3
l
/
/ '
/
o-,
jcr,
Figure 2.17 (a) The biaxial stress field acting on a small block of rock, producing
shear failure. (b) The biaxial stress field showing the relationship between the normal
stress (cr) and the shear stress ('t).
PROBABILITY THEORY 51
2.3 Probability theory
0 ~p(E) ~ 1
In general,
In general,
Note the use here, as elsewhere in this book, of the symbol fl, which
is the conventional shorthand for multiplication. The indices indi-
cate the range of values to be multiplied together.
p(6) = l/6
20
15
>-
u
z
w
::> 10
aw
a:.
u..
2 4 6 7 8 9 10 11 12
TOTAL VALUE
Figure 2.18 Histograms showing the frequency distributions of values obtained
from a pair of dice. , Distribution obtained after throwing a pair of
dice 60 times; , the theoretical distribution for throwing a pair of dice 60
times.
54 USEFUL MATHEMATICAL IDEAS
The result of getting two sixes gives a total of 12. On the other hand,
if we wish to calculate the probability of getting the total value five,
then there are four ways from which this may arise:
= 5)
I + I + I + I (th e a dd.1t10n
= 36 . axiOm
. )
P(total 36 36 36
3.1 Definitions
Thus, as the value of n increases, the value of the function gets closer
to 1.0, although it never quite reaches it. We can describe this by
saying that f(n) tends to l, as n tends to infinity, i.e. f(n)-+ I as
n-+ oo. Alternatively, the limit of f(n) is I as n tends to infinity,
symbolically
limf(n) =I
n-+oo
where n tends to infinity (i.e. increases without limit) and, if the sum
58 SEQUENCES AND SERIES
1-1+3-3+5-5+ ...
then the sum s2n = 0 and the sum s2n+ I = 2n + 1. Thus, when n--+ 00'
then S2n --+ 0 and S2n +I --+ 00. This SerieS OSCillates infinitely.
If u1 + u2 + u3 + ... is a convergent series, then un--+ 0 as n--+ oo,
and the value of any term is given by
Also,
lim un = lim sn - lim sn-1
n--+ oo n--+ oo n--+ oo
=0
lim L\x-. 0 as n -+ oo
b = t(a+ c)
a + aR + aR 2 + aR 3 + ...
where a = the first term and R = the common ratio term - to find R
divide any term by the previous one. The series can be either
convergent or divergent. The nth term un is given by
y = .J(xz)
andy is called the geometric mean of x and z.
In general, the geometric mean is given by
a a
a+ a+ ((n)- 1) +a+ ((n)- 1) +
where a= the first term and (n) =the position of a term in the series.
The series can be either convergent or divergent.
The harmonic mean is given by
n
nf L (lfx;)
i=l
(a+ x) 1 = la+ lx
n Pascal's triangle
1
2 1 1 2
3 3 1
3
4 4 6 4 1
5 5 10 10 5
6 etc.
62 SEQUENCES AND SERIES
e.g. n =I I I
v I
n=2 I 2 I
yy
n=3 3 3
l }2 p }4 }9
e = e1 = I + TI + 2! + 3! + 4! + + 9!
y = eX or x = In y
Table 3.1 The relationship between the arithmetic and geometric means for the
grain-size data of Table 2.3.
mm equivalent q~ mm equivalent q~
Arithmetic mean
q~ equivalent mm
q~-scale 1.49 0.355
64 SEQUENCES AND SERIES
From the above it is clear that we need some rule which will
enable us to decide which mean is appropriate in any particular
situation. Distinguishing between the arithmetic and geometric
means is relatively straightforward, since things which grow expo-
nentially (such as the mm-scale in our grain-size problem) require
the use of the geometric mean. In this context we should also notice
that the geometric mean of the lognormal distribution is equal to the
median of the distribution. Also, if the natural logarithmic mean
Jl 1 {y) (throughout this text the Greek symbol Jl is used to signify the
mean of a data set; unqualified this implies the arithmetic mean, a
subscript will signify the mean other than the arithmetic mean, but
in this case the subscript l indicates the logarithmic mean) of a set of
lognormally distributed data is given by
n
Jl 1 {y) = 1/n L In Y;
i=l
y=x
and
exp 3.67 = 39.25 ppm
GEOLOGICAL APPLICATIONS 65
10
>-
u
z
w
:::>
as
w
a::
u.
Figure 3.1 Frequency distribution
of the data used in the example
calculation relating the geometric
and arithmetic means. The data are
25 65 105 145 taken from experiments carried out
45 85 125 165 simulating the early diagenesis of
CLASS MID-POINT carbonate ooids, and represent the
[Sr 2 +] ppm concentrations of Sr' + in the brines.
For these data there is a close agreement between the value of the
geometric mean of the raw data and the exponential of the arith-
metic mean of the transformed set. The difference between the
median and the geometric mean probably reflects the nature of the
data distribution. For further discussion of these relationships see
Rendu (1981, pp. 7ff.).
The use of the harmonic mean is not so clear-cut in geological
studies. It may be that in some applications of stratigraphic data
analysis the harmonic mean would be more suitable than the
generally used arithmetic mean. One important application of the
harmonic mean is in the estimation of the tectonic strain ratio. Lisle
(1977) was able to show that the strain ratio estimated from the
mean shape of deformed elliptical markers varied according to the
mean used (Lisle 1977, p. 144, table 1), and the best estimate (when
the results were compared with another method of analysis) is given
by the harmonic mean. To facilitate calculation of the various
means, a computer program, Program 3.1, is listed.
1/~in =I Pi Ki
i=l
Mineral Percentage K
meal/em sec oc
pyroxene 34 9.13
plagioclase 46 3.65
hornblende 4.5 6.07
ilmenite 8 5.25
quartz 5.5 18.37
calcite 2 8.58
GEOLOGICAL APPLICATIONS 67
~ax= (0.34 X 9.13) + (0.46 X 3.65) + (0.045 X 6.07) +
(0.08 X 5.25) + (0.055 X 18.37) + (0.02 X 8.58)
= 6.6583
events can occur and no others, from the basic concepts of prob-
ability theory,
p+q=1
p(p in 2 trials) = P2
p(q in 2 trials) = q2
The remaining alternative is that with our two trials we will pick a
piece of mineral and a piece of gangue. Now, from the addition
axiom, the probability that either p or q will happen twice is
(p + q)2 = [l + 2pq + q2
If, instead of picking two pieces of rock (two trials), we pick a
number of subsamples of n pieces (n trials), then by extending the
above argument it is possible to show that the probability of getting
0, 1, 2, 3, ... , n pieces of mineral in our subsamples will be given by
the successive terms of the expansion of
reading from left to right (note the order of p and q in this last
expression).
Returning to our example, we may wish to test the hypothesis that
the mineral and gangue are mixed in equal proportions. To test this
we take 50 subsamples of four pieces each, counting the number of
pieces of mineral in each subsample. The data can be tabulated as in
Table 3.3.
0 2
I 11
2 22
3 12
4 3
which are
70 SEQUENCES AND SERIES
p=q=0.5
Substituting the values for p and q into each term of the expansion
and multiplying the resulting probability values by 50 (the number
of subsamples selected), we get theoretical probabilities, and the
equivalent expected number of subsamples as given in Table 3.4.
P[O] = 0.0625 3
P[l] = 0.25 12
1'(2] = 0.375 19
1'(3] = 0.25 12
1'(4] = 0.0625 3
.
Figure 3.2 Contiguous grid sam-
.
. pling scheme for the binomial dis-
tribution. 4 x 3 layout correspond-
ing to the hypothetical example
discussed in the text.
q(non-occurrence) = 1 - P(oa;urrence)
Observed Calculated
0 3
I 4 5
2 6 7
3 4 6
4 2 2
5 2 0
p = 2.I9/5 = 0.44
q = I - 0.44 = 0.56
and if a = e, x 1 = z and x 2 = - z
or
'7" -z
~[n]
n!
0 67 0.646 62
1 22 0.282 27
2 4 0.062 6
3 1 8.99x I0- 3 1
4 9.82 X J0- 4 0
5 8.58 X JO-l 0
z = 42/96 = 0.437
and
i.e.
Substituting the values of z and e-z, into each term of the expansion,
the expected probabilities can be calculated, and from these the
expected number of quadrats.
As can be seen from the results in Table 3.6, there is a close match
between the observed and expected frequencies of occurrence (as
with the binomial distribution, a statistical test of significance
should be made, to test the degree of similarity). Used in this way,
the Poisson distribution is a useful method which is often used to
describe events in a continuum, on the basis of a number of discrete
samples. It is normally recommended that this distribution be used
when the event is relatively rare, while the binomial distribution
should be used when the event is common. Again, the reader is
referred to the geological literature for further examples (see Davis
1973, pp. 305-7).
GEOLOGICAL APPLICATIONS 75
Before leaving the discussion of the use of probability distribution
models, based on simple mathematical series, it should be mentioned
that several others have been used by ecologists to describe relation-
ships within a community, particularly relating the number of
individuals of a species to the number of species present in the
community. One particularly important model is the so-called
'Fisher's a distribution', which is a special case of the Poisson
distribution where the zero term is not included. This distribution
may have important applications in palaeoecology, and is discussed
in Section 7.4. I, where an example of its use is also given.
4 Rates of change
differentiation
4.1 Functions: some further definitions
4.1.1 Rationa/functions
A rational function is a function of the form
P(x)
Q(x)
P(x). f .
Q(x) IS a proper raction
P(x) . . fi .
Q(x) IS an Improper raction
f(x} = x+ 2
40
4 20
-1.0 -0.5
X
0.5 1.0
-20
-40
~---L----~2----~--~4--~S x -60
Figure 4.1 Graphs of f(x). (a) The function is continuous, f(x) = x + 2; (b) the
function is discontinuous,f(x) = 1/x.
f(x) = 1/x
More formally:
4.2 Differentiation
4.2.1 Definitions
(a) Differentiation is the process of finding the derivative of a
function. The derivative is called the derived function or the
differential coefficient. The derivative of y with respect to x is
usually written as
dy
dx
f'(x) or d[f(x)]
dx
(b) The gradient of a curve at any point is the gradient of the tangent
to the curve at that point, and is given by the derivative dyjdx at
that point.
grad PQ = oy
bx
.. (8y)
Itmtt - dy
8x 8x=O- dx
y + 8y = (x + 8x)2
8y = .~ + 2x8x + (Jx? - x2
80 RATES OF CHANGE - DIFFERENTIATION
(4) Simplifying,
oy = 2x+ ox
Ox
(5) We have already noted that as Ox-+0, the chord becomes the
tangent at the point on the curve for the value of x, i.e.
which is
Thus,
dy = 2x when y = x2
dx
We can now consider the general case where the exponent can be any
value n. Thus, we find the derivative whenf(x) =X'.
y + oy = (x + oxY
n(n- 1)
(X+ OX)n =X'+ nx!'- 10X + l X l X'- 20.x2 + ...
so that
But
.
dy= l'Imtt
-
dx
Therefore
30
.
,., '
/
/
.
20
,
/
E / '
E
Vl
Vl 6.y
UJ
z ,
:><: , /
,,
10
u ,
I
f-
10 20 30
shows that throughout the life-cycle of the animal there are changes
in the rate of growth relating the two measurements concerned. If we
join the two ends of the curve as indicated by the broken line we can
estimate the overall or average rate of growth by finding the tangent
of the angle e, which is
e = 4o
tan e = 0.839
i.e. the average rate of growth of the thickness of the shell relative to
its length is 0.839.
If we wish to know the rate of change for the stage of growth
represented by any particular thickness, then we can estimate the
value by constructing a chord to the curve near the point by taking a
small increment on either side of it and finding the corresponding
increment on the other axis. We can then calculate the slope of the
chord using these values or, more directly, measure its slope and find
the tangent of the angle, using tables. This is also illustrated in
Figure 4.3, for the length measurement 20 mm. The results for
lengths 10, 20 and 30 mm are given in Table 4.1, which show that
during the life-cycle of the animal there is an increase in the rate of
change of length relative to thickness.
Even a casual inspection of Figure 4.3 shows the limitations of
this approach, particularly since the chord is only an approximation
to the tangent to the curve, leading to inaccuracies in the estimated
growth values. On the other hand, if an equation could be fitted to
the raw data in the form of
thickness= /(length)
Table 4.1 Data relating to the rate of change of growth in the brachiopod
Homoeorhynchia acuta. Estimated rate of change was found from Figure 4.3, while
the calculated rate of change was found as described in the text.
Estimated Calculated
Length (mm) rate of change rate of change
Angle Tangent
(degrees)
5 29 0.554 0.503
10 34 0.675 0.726
15 44 0.966 0.900
DIFFERENTIATION 83
and if the derivative is then found, then the exact value of the rate of
change for any given length could be calculated.
Using a statistical method, the constants a and Bof the allometric
growth equation (Section 2.1.4) were estimated from the raw data.
The equation is
= 0.2142 L 053
By putting the values 10, 20 and 30 mm for length into the equation,
we can calculate the exact rate of change. The results are given in
Table 4.1, and we can make a comparison between those found
analytically and the estimates arrived at using the chord method.
The differences in the values obtained by the two methods result
from difficulties involved in constructing a chord so that it is close to
the tangent, and then using the slope of that chord to estimate the
rate of change.
Simple functions can be differentiated from first principles, but
unfortunately the majority of functions normally met are too
complex for such treatment. To facilitate the differentiation of
complex functions, two methods are available: (a) the use of rules
and (b) the use of standard forms. The application of these two
methods is described, with simple examples, in Appendix 1. Readers
are reminded that reference should be made to mathematical texts
for a more detailed treatment. For example, the books by Hogben
(1967) or Graham eta/. (1984) provide good simple introductions to
the topic. More-advanced texts which are useful include those by
Gow (1960), Sherlock et a/. (1982) or Stephenson (1973). Also, since
familiarity with the techniques can only be achieved by practice,
examples which are to be found in these books should be worked
through.
d(dyjdx) d2
denoted by d.:;
dx
dy = l2x 2 + 4x + 1
dx
dJy = 24
dxJ
oy = 2x- 3
ox
DIFFERENTIATION 85
Differentiate with respect to z, with x constant:
ow
-= 2ax+ 2by
ox
Differentiate with respect toy, with x constant:
ow 2bx+ 2cy
oy
-=
by definition, ).1 = tan a, where a is the angle of sliding friction and crn
is the normal stress. This equation can be rewritten in terms of the
angle of sliding friction and the normal stress:
The shear stress -r has also been defined in terms of the angle
between the shear planes, therefore we can write
(-cr -2-cr
1- 3)
sin 29 = C0 +tan a [ (cr- -+
1
2-
cr 3) J - [ ( -cr -crJ)
1
2-
Jcos29
Now, differentiating with respect to 9,
sin 29
I =ta n a --
cos29
or
Therefore,
29 = 90- a
9 = 45 - 15 = 3o
and 29 = 60
MAXIMA AND MINIMA 87
When the angle 29 is measured in rock (see Fig. 2.16a), it is usually
found to be close to 60. Thus, we have demonstrated analytically a
result which is in agreement with that observed in rock, showing the
power of mathematical analysis applied to this type of problem.
Figure 4.4 is a sketch of the curve of a cubic function. The curve has
two turning points at p and q, p being a maximum and q a minimum.
We wish to find, analytically, the values of x at these turning points,
and which is a maximum and which a minimum.
To find the position of the turning points p and q, we differentiate
and put
dy =0
dx
the solution of which gives the turning points required. In order to
find which is a maximum and which is a minimum, we need to
differentiate with respect to x a second time, i.e. find the second
differential coefficient of the original equation. In order to find
which is which, we substitute the values of x for the maxima and the
minima in this equation and use the rule:
q
Figure 4.4 Sketch of the curve generated by a
cubic function, showing the turning points.
88 RATES OF CHANGE- DIFFERENTIATION
dy = 9r- I8x- 27
dx
0 = (9x + 9) (x- 3)
x= -1 or 3
Substituting,
y = 25 and -71
-1 0 0
Figure 4.5 Sketch of the cubic function y = 3x3 - 9x2 - 27x + 10. The first and
second differentials are also shown. Note that the turning points of y = f(x) have
values of x equal to the values of the points where the curve y = f'(x) cuts the x-axis.
Also, when y = j(x) is negative the turning point of y = f(x) is a maximum, and
when y = j(x) is positive the turning point is a minimum.
(2~) (-b)
X= 47.9 17.26i
60
E
E
I
.....
0
~
30
20 40 60
CURVED LENGTH mm
Figure 4.6 Graphical plot of the growth data measured from a single individual of
the productid brachiopod Semip/anus latissimus (J. Sowerby). The best-fit curve has
been eyeballed in.
where y = the height and x = the distance from the origin, both
measured in yards x 10- 1
It should be noted that the measurements were in feet (height) and
yards (distance), and both were converted to the scale indicated for
convenience of calculation.
w valley floor E
~ :::r~
=::-L-------::-'--:--'~---:-'~
600 1200 1800 2"100
DISTANCE yards
Figure 4.7 Cross section constructed from contour data across Kingsdale Beck, Nr
Ingleton, Yorkshire. Note the flat valley floor, which is probably formed from flat-
lying sediments, deposited at the bottom of a glacial lake.
MAXIMA AND MINIMA 93
The solution of the quadratic is
x = 95.2 or 207.6
~= -8.93 x to- 3
Thus, we have:
and
distance= 952.3 yd
height = 828.1 ft
Since the measured level at the base of the valley from the map is
850ft, then the thickness of infill is 21.9 ft.
Using the data given by Wilson (1980), the base of the old river
channel at the northern end of the plug of glacial till is at approxima-
tely 775ft O.D. The level of the valley infill at Kingsdale Beck is at
850ft O.D., therefore assuming no change of level in the old valley
floor upstream, the maximum thickness of sediment is 75ft. If we
allow for changes in the valley upstream based on changes down-
stream, we estimate a change in level of 50 ft to give a base level for
the old valley at 825ft O.D. Thus, a minimum figure for the infill is
25 ft. These maximum and minimum figures should be compared
with the calculated value of 21.9 ft.
94 RATES OF CHANGE - DIFFERENTIATION
C= 5A 2 + 8B2 - 3AB
subject to (production)
A= 100-B
B = 40.6 x l 03 tonnes
A+ B- 100 = 0
/...(A+ B- 100) = 0
Now, if the constraint is satisfied, then the last term in the Lagran-
gian cost function will be zero. Therefore, whatever values of A and
B minimize the cost C, the same values will minimize c~... However,
c~.. contains three unknowns: A, Band A.. We can solve the system by
partial differentiation and setting the results equal to zero:
A.= -472.2
The values for production from the two mines are as we calculated
using the substitution method, but we also have an additional value
A.. The value of A. is significant, and represents the reduction in the
total operating cost if only 99 x 103 tonnes were produced instead of
100 x 103 tonnes per week. Thus, the 472.20 is known in economics
as the marginal cost of the last unit of production. It should be noted
that, having set ac,joh = 0, we automatically ensure that the con-
straint equation is embodied in the system. Additional constraints
are handled simply by multiplying each by a different Lagrange
multiplier, and including each in the system. For further information
on this method students are referred to economics texts such as
Glass (1980).
y=ax+b
16 16a+ b 18 16a + b- 18
17 17a+ b 19 17a+b-19
19 19a+ b 20 19a + b- 20
19 19a+b 21.5 19a+b-21.5
23.5 23.5a + b 26.5 23.5a + b- 26.5
25 25a+ b 29 25a+ b- 29
29 29a+ b 33.5 29a + b- 33.5
31 31a + b 36 3la+b-36
First, we set out the data showing how the pair of measurements
are related, using the equation for the straight line, as in Table 4.2.
The deviations can be squared as follows:
or, in terms of b,
S= px2 + qx + c (4.3)
dS =O
da
dS
-=2px+ q=O
da
x= -qj2p (4.4)
In Eqn 4.1, and by analogy with Eqn 4.3, we find that the values for
p and q are
(359b- 9669.5)
a= - 8492.5
a= 1.229 b = -2.129
a= 0.809 b = 1.859
y=ax+b
which are
ti -a ab b2 - b constant
r -2xy 2x
40
E
E
I
f-
CJ 20
z
w
Figure 4.8 Graphical represen- ...J
tation of the data from Table 2.la
relating to the Jurassic brachio-
pod Epithyris oxonica (Buckman), /
and
that simplify to
(I:x)a + (N)b = Ey
Note that the equations are written slightly differently from those
for the linear case; this is simply a matter of convenience, and the
pattern can be extended to enable higher-order equations to be fitted
as required.
102 RATES OF CHANGE - DIFFERENTIATION
(a) To find the best-fit surface assuming that the differences between
the observed and calculated values are due to a random compo-
nent such as measurement error.
(b) The isolation of components within the data leading to the
separation of the regional and local trends.
In general, the surfaces fitted will obey the least-squares criteria for a
best fit, where
z=A +Bx+ Cy
given the constants A, B and C, we can for any given point (x,y)
calculate
DIFFERENTIATION AND LEAST-SQUARES METHODS 103
zcaic = A + Bx + Cy
and the deviation between zobs and zcalc is given by
deviation = zobs - A - Bx - Cy
oF
oA = 1:2(z- A- Bx- Cy)( -1) = 0
-
oF
- = 1:2(z- A - Bx- Cy)(- x) = 0
oB
oF
oc = 1:2(z- A - Bx- Cy)(- y) = 0
-
where n = the number of data points and the summation signs imply
summing from 1 to n. The solution of these equations is achieved by
rewriting them in matrix form as
J
1:x
1:y
1:xy
1:xy 1:yl
104 RATES OF CHANGE - DIFFERENTIATION
5.1 Integration
b
A= ff(x)dx = [F(x)]~ = F(b)- F(a)
a
n
A = lim
Llx-0
L1f(xk)Axk
k~
n-co
B'
each of width Llx, and summing the area of theN strips (the smaller
Llx is, then the more accurate the answer will be).
b
~ Jf(x)dx
a
i.e. the sum of the areas of the rectangles approximates the area
ABCD, as illustrated in Figure 5.2.
For practical purposes we can replace the rectangles with trape-
zoids, as in Figure 5.3. By this method we calculate
~ ""'y=flx)
Obviously, the smaller .dx is, then the greater the number of strips
and the more accurate the final answer. In general,
n-1
area= .dx{[HYo + Yn)] + L Y1}
i= l
Yo = axl + bx + c
v ~--
/
~ y= f (x)
Yo Y1 Y2 Y3 Yt. Ys
Figure 5.3 The area under the curve
y =fix), approximated by trapezoids of
A B width L1x and ordinates y 0 , y,, etc.
108 AREAS AND VOLUMES: INTEGRATION
If x 1 - x = h, then
Yo = a>! + bx + c
Y 1 = a(x + W+ b(x +h)+ c
A = hf3(y0 + 4y 1 + y:J
Thus,
b
L ~ ff(x)dx
a
provided that
I= 10.994
2.5
J
A= dxjx = (1/12)(I0.994)
I
I- 0 I
h=--=-
4 4
)=23.19
I
A= Jln (4x + 5)dx = (1/12)(23.19)
0
Note that the weights or multipliers in the tables come from the
general equation for the sum, ie, the first and last strips have a
weight of 1 while odd numbered strips have a weight of 2 and even
numbered a weight of 4.
Program 5.1 is a computer program for evaluating Simpson's
Rule.
y =ax"
Now,
dy = nax"-1
dx
_ na .x;<n-1)+ 1
y- (n- I)+ I
y =ax"
y = 3x3 + 4
then
ANALYTICAL METHODS 111
Integrating,
Jdy = J(9r)dx
y 9 _ _x(2 +I}
= __
(2 + 1)
Using this approach the constant is lost, and we must take this into
account. It should be remembered that the constant must always be
provided for, so it is normal to write
y = 3x3 + constant
y= 3x+ C
40 = 3(2)3 + c
C= 16
y = 3~ + 16
Thus, it is possible to show that two functions which have the same
derivative differ only by a constant. In general, there are three
methods of integrating:
ff(x)dx = F(x) + C
Further, if the interval of integration is between a andb, then we can
find:
frdxJri
It will have been noticed that the constant of integration was not
used in this example. This follows since it automatically drops out in
the calculation. Thus, given
+ CJ:
b
ffix)dx = F(x)
a
ANALYTICAL METHODS 113
However,
= f(x)J~
f(x) 5 12 21 32 45 60 77 96
SJ(x)dx 1.33 10.66 27.0 53.33 91.66 144.0 212.33 298.66
114 AREAS AND VOLUMES: INTEGRATION
y
300
200
y= Jt(x) dx A
.....
10'0
. .
.:::::
........
..........
................... ,
........................
B I:::::::::::::::::::::::::::
:::::::::::::::::::::::::::I
1 a a a a a a a a a a a a a a a a a a al
~ X
2 4 6 8
Figure 5.4 Geometric representation of the definite integral. The area outlined in
dots, under the curve y = f(x), is given by the difference in height of the ordinates to
the curve y = Jf(x)dx, labelled A and B.
be noted that if the curve cuts the x-axis between the limits, then the
areas above and below the axis should be calculated separately.
If this expression tends to a limit as n-+ oo (or as Jx-+ 0), then the
limit is
For example, given y = r, find the mean value of yin the interval
O~x~4.
4
J
area = x 2 dx = tx3]~
0
= 21.3
~ = ( 4~ 0) (21.3) = 5.325
= ssrc(3r) dx
4
2
s
= frc9x 4 dx
4
= rc9x5
5
J 5
= -18 5909
-rc cu b"tc umts
.
JJ z dx dy = SJJ(x,y) dx dy
R R
J =- 0.0024 A.=0.0013
118 AREAS AND VOLUMES: INTEGRATION
which gives
z = a + Px + yy + r5x 2 + A.y2
II 20
V= J J (a+ Px + yy + r>r + A.y)dxdy
y= I x=5
II
= J [ax+ Pr/2 + yxy + r5x /3 + A.lxU dy
y=l
3 0
II
= J [15a + {P/2)(375) + yy(15) + (r5/3)(7875) + A.y(15)]dy
y-1
= 188.465
and, after converting scales, we find that the volume of material in
place is equal to 1 884 650m 3
The following results were obtained for cells of different sizes,
using a numerical approximation method. For this the thickness z
was found at the centre of each cell, and the cell volume calculated.
Summing the volumes for all of the cells gave the total material in
place:
Thus, for a continuous function, the area under the curve can be
found by either integration or summation. Functions which are not
continuous, on the other hand, are not integrable and the summation
method is used.
In statistics there are a number of fundamental distributions
which are used to describe the probability of the occurrence of
events of different types. These distributions can be continuous or
discrete. The two which were introduced in Section 3.3.3, the
binomial and Poisson Distributions, belong to the latter group,
whereas the extremely important Normal Distribution, for example,
is a continuous function. Implicit in statistical theory is the relation-
ship between area and probability; thus, since the probabilities sum
to unity, the area under the curve or histogram described by the
distribution will also be unity. From the above discussion it is clear
that there will be differences in the way in which properties, such as
the mean or standard deviation for the two types of distribution, are
derived. To illustrate the differences let us consider the derivation of
the arithmetic mean. We define the mean as follows.
120 AREAS AND VOLUMES: INTEGRATION
(a)
(b)
J xfix)dx
<Xl
f.l.x =
-oo
Jfix)dx = 1
xl
Let us now consider the derivation of the mean for the uniform
distribution. This distribution, sometimes termed the random
distribution, can be either discrete or continuous. For the discrete
uniform distribution, the mean is given by
n .1 N+l
f.l.x= L J - = - -
j=l N 2
where x takes all integer values in the range 1 toN, inclusive. For
the continuous uniform distribution, the mean is given by
b 1
f.1.
X
= Jx--dx
a b- a
b 2 -li a+b
2(b- a) --2-
a "' [
z = 2o ."'5;. 1 a. cos (mr.x)
L . (mr.x)
+b. sm L J
The angular relationships are expressed in radians, and:
z = f(x)
In this situation the coefficients of the sine (b,) and cosine (a,) terms
can be approximated by
z z L
+_k + k-I zcos
2 o_
a = __ (nnLx,.) n = 0, 1, 2, .. . ,k/2
n k 2 i=l I
PS = ~(ti, + b~)
If we wish to retrace our steps we can do so, using the calculated
coefficients a. and b., and calculating the values of our original
series, at each sampling point, using
where z; = the observed value at the ith sample point, F(x;) = the
approximating function and N = the maximum degree of the series.
Now this is extremely useful, particularly should we wish to filter our
data, since we can use the equation to combine harmonics of our
choice. Thus, we can use only those which, from inspection of the
power spectrum values, are thought to be important, and can
construct a smoothed version of the original sequence. One of the
chief advantages of applying this method is that, unlike moving-
averages and related methods, points are not lost at the beginning
and end of the series. Before looking at a practical application, it is
convenient to start with some artificially constructed series, made up
of data of known amplitude and wavelength.
The series used were constructed using sine tables, the actual
angular value in the range 0-360 being used to represent distance in
metres and the sine value for each 5-m increment being used to
calculate the amplitude of the wave at the corresponding depth. In
its simplest form this gives a wavelength of 360m and, by adding
unity to each sine value, gives an amplitude of I. Changes in
wavelength were achieved by sampling the amplitude values already
generated at constant intervals according to the wavelength chosen.
For example, to generate a wavelength of 180m the series is sampled
at 10-m intervals. The amplitude, on the other hand, can be altered
by multiplying each value by a constant. In this way three series were
generated of 360-, 180- and 45-m wavelengths, the first of these
having an amplitude of 1m, the remainder an amplitude of 10m.
The series generated in this way are shown in Figures 5.5a and band
Figure 5.6b. By combining these fundamental series, two other
sequences were formed as illustrated in Figures 5.5c and 5.6c.
The numerical data relating to these series (using 19 points) were
analysed using the computer program given as Program 5.2, which
metres
7 0.0
52.5
35.0
17.5
10 20 20 30
AMPLITUDE
Figure 5.5 Artificially generated oscillating data series. (a) Wavelength 180m; (b)
wavelength 45 m; (c) combined 180 + 45 m. Amplitude constant 10m.
metres
(a)
52. 5
(b)
3s 0
1 7. 5
10 20 10 20 40
AMPLITUDE
Figure 5.6 Artificially generated oscillating data series. (a) Wavelength 73 m,
amplitude I m; (b) wavelength 36.5 m, amplitude 20m; (c) combined.
FOURIER AND TIME SERIES 125
Table 5.2 Power spectrum values for two artificial series of wave-
length 360 and 180m, and the combined values. The series are
illustrated in Figure 5.5.
Power spectrum
0 0 0 0
I 1.003 0 0.996
2 0 10.014 10.024
3 0.004 0 0.014
4 0 0.020 0.029
5 0.001 0 0.010
6 0 0.021 O.o28
7 0.001 0 0.011
8 0 0.01 0.018
9 0 0 0.010
has been written using the equations already discussed. The results
of the analysis of this data are summarized in Tables 5.2 and 5.3,
which give the power spectrum values related to each harmonic.
Remember that since we started with 19 data points, the series will
give rise to ( 19 - 1)/2 harmonics, plus the zeroth harmonic. Since the
zeroth harmonic has no sine term, it does not have a corresponding
power spectrum value.
Examination of the data in Table 5.2 shows that for the 360-m
wavelength the dominant harmonic is the first, that for the 180-m
Table 5.3 Power spectrum values for two artificial series of wave-
length 180 and 45 m, and combined values. The series are illustrated in
Figure 5.6.
Power spectrum
0 0 0 0
I 0 0.044 0
2 10.014 O.o38 10.040
3 0 0.044 0
4 0.020 0.045 0.029
5 0 0.044 0
6 0.021 0.023 0.056
7 0 0.044 0
8 0.010 10.030 10.004
9 0 0.044 0
126 AREAS AND VOLUMES: INTEGRATION
wavelength the dominant harmonic is the second, and that the value
of the power spectrum is equal to the amplitude of the series being
analysed. The result of combining the two series shows that, as
expected, the dominant harmonics are I and 2. Similarly, in Table
5.3 the dominant harmonic for the 45-m wavelength is 8, and in the
combined data the second and eighth harmonics dominate. Finally,
combining all three wavelengths gives the results shown in Table 5.4,
where the dominant harmonics are I, 2 and 8. In all of these
examples the values for the other harmonics have values less than
1.0 and result from noise introduced by the inaccuracy of calculating
the amplitude values, where only two significant figures were used.
Table 5.4 Power spectrum values for a com-
bined series of 360-, 180- and 45-m wavelengths.
0 0
I 0.908
2 10.137
3 0.111
4 0.129
5 0.111
6 0.131
7 0.110
8 10.100
9 0.111
Power spectrum
Wavelength
Harmonic Temperature Pressure L1h (min)
0 0 0 0
0.22 0.09 3.84 60
2 0.22 0.10 1.00 30
3 0.17 0.04 1.75 20
4 0.05 0.14 3.03 15
5 0.15 0.30 0.82 12
6 0.3 0.27 3.18 10
7 2.17 1.70 2.18 8.5
8 0.23 0.15 !.85 7.5
9 0.12 0.25 3.13 6.7
10 0.09 0.10 6.95 6
ll 0.13 0.11 2.06 5.45
12 0.11 0.09 2.19 5
13 0.17 0.26 5.40 4.6
14 0.40 0.10 4.96 4.25
15 0.05 0.22 4.38 4
60
E
UJ 30
I:
I-
Sandstone
Coal
Shale nan marine
Sandstone
Shale nan-marine
llllk;Shale marine
10m
Shale marine
Limestone
Shale nan-marine
shale (m) 0 2 0 0 0
shale (nm) 0 0 4 2 0
sandstone 5 0 0
limestone 2 0 0 0 0
coal 0 0 0 0
shale (m) 3 2 0 0 0
shale (nm) 0 27 2 2 0
sandstone 3 36 0 0
limestone 2 0 0 2 0
coal 0 0 0 0
Symbol Meaning
A any matrix
a any vector (row or column)
A. any scalar
an element of matrix A: the subscripts i and j give row and column
position, respectively
the identity matrix
the transpose of matrix X
the determinant of matrix M
the inverse of matrix A
A + B c
[
5 11 +
3 2j
134 NUMBER ARRAYS, TilE ALGEBRA OF MATRICES
6.2.2 Transposition
Writing the rows of a matrix as columns and columns as rows:
M M'
[-: :] [~
- l
-3 3
6.2.3 Multiplication
(a) BY A SCALAR
To multiply a matrix by a scalar, each element is multiplied by the
scalar. For example, we might need to convert a matrix of measure-
ments in one scale to another, in which case we simply multiply each
element of the matrix by the required constant:
A . A. B
. 6
121
l8J
(b) BY A VECfOR
If we consider the algebraic equations
y=Ax
this simply means that the vector of y-values, is equal to the matrix
of the coefficients of the x-values (denoted by a 1, etc.), multiplied by
the vector of the x-values. Thus, ifwe are given values for a 1, , a4
BASIC MATRIX OPERATIONS 135
and x 1 and x 2, we can calculate y 1 and y 2 simply by following the
rules of number algebra. Translated into matrix form, this is the
multiplication of the first row of the matrix by the values of the
vector x and summation to give the first element of the unknown
vector y, followed by multiplication of the second row of the matrix
by the vector of x and summation to give the second element of the
vector y. This is the row-by-eolumn rule for matrix multiplication:
which is
(a)
(b)
(c)
(d)
(b) (c)
/ 7
[: :] /'- ---y
2.4/~
0 7 0 10
7
[0-11 56
-6\,5 -----1--~----------------
2 .47/~:~
_____ -, (d)
''
' ', .' ,'
-u -- --- - - - - - - - - - - - - - - - - - -~.
-6 0 6
12 r(1-:_:-:: ~ :] (e)
o- '
32
Figure 6.2 Graphical representation of some simple transformations.
(e)
[~ -~]
d1
Da,
'1
2
1>,
X
-2 2 4 6
Figure 6.3 Transformation of the unit square abed. Showing a reflection in the y-
axis, a,b,c 1d, and shear, a,b2c2tf,.
Then
X=
[-: :l andy=
[:]
[- 2 X 2) + (2 X 4)
z= (Q
(1
X
X
2) + (4
2) + (3
X
X
4)
4)
=
[::]
(c) BY ANOTHER MATRIX
This operation also follows the row-by-column rule as above, where
138 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
C= [(3
(6
X
X
1) + (4
I)+ (8
X
X
4)
4)
(3
(6
X
X
2) + (4
2) + (8
X
X
6)]
6)
[3819 6030]
In general, we can say that a matrix of order p x q can be multiplied
by a matrix of order r x s only if q = r, and the resulting matrix will
be of order p x s.
In matrix algebra the product AB does not equal the product BA,
if either of these products exist- i.e. multiplication is not commuta-
tive as in number algebra. For this reason post-multiplication and
pre-multiplication are distinguished. Thus, for the product AB, A is
post-multiplied by B, and for the product BA, A is pre-multiplied by
B. A computer program is listed as Program 6.1, which will multiply
a matrix by a scalar, a vector or another matrix (within the rules of
the algebra). The general case is given by
AI=A
Also, the following result holds:
AI=IA=A
A~ [: ~J
then
B= [(3 X 3) + (4 X 5) (3 X 4) + (4 X 1) l
(4x3)+(lx5) (4x4)+(lxl)j
From the rules, it follows that only square matrices can be treated in
this fashion.
6.2.5 Determinants
The determinant of a 2 x 2 square matrix is calculated as the
product of the NW and SW elements minus the product of the other
two:
A IAI
[: :]
I (4 X 5) - (2 X 6) I = 8
5 4 2
6 4 3
0 2 5
140 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
4 3
5 = 5(20 - 6) = 70
2 5
6 3
- 4 = - 4(30- 0) = - 120
0 5
6 4
2 = 2(12- 0) = 24
0 2
I D 3 1 = 70 - 120 + 24 = - 26
or
5 4 2
ID 3 1 = 6 4 3
0 2 5
4 3 6 3 6 4
=5 -4 +2
2 5 0 5 0 2
= 70 - 120 + 24 = - 26
BASIC MATRIX OPERATIONS I4I
The determinant of a 4 x 4 matrix:
+ +
a b c d
e f g h
j k I m
n p q r
f g h e g h e f h e f g
ID4 1=a k I m - b j I m +c j k m - d j k I
p q r n q r n p r n p q
a= b= c= d=
142 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
r- 21 14 l
d
Rb= =bl Sb= = bz
L LsJ
and
Rc=
r:J =cl Sc= 1lJ
6
L6
= Cz
Rd=
[- ~] =dl Sd= [ :] ~d,
and a, a 1 and a2, etc., are as plotted in Figure 6.3. As noted earlier,
the multiplication of the co-ordinates for each point by the matrix R
represents a rotation and reflection in the y-axis, but without a
change in area; while multiplication by the matrix S involves a shear
to the right and a stretch, resulting in an increase in area. Measuring
the area shows that there has been an increase, by a factor of 3, from
the original. Thus, the ratio of the new area to the old is equal to the
value of the determinant of the matrix used in the transformation. If
a singular matrix had been used, then the transformation would be
to a straight line; in other words, we would lose one dimension. This
is not reversible, whereas transformations brought about by non-
singular matrices are. These ideas can be extended to matrices of
higher order. For example, in the case of a third-order matrix the
vectors will represent the comers of a solid, and after multiplication
by a singular matrix, the transformation will be from a volume to an
area, i.e. again one dimension is lost.
Determinants are the basis of Cramer's method for the solution of
simultaneous linear equations. We can write the equations in matrix
form as demonstrated in Section 6.2.3:
BASIC MATRIX OPERATIONS 143
[A][x] = [b]
xn = Z/IAI
For a 2 x 2 matrix:
Note that if the original matrix (of any order) is singular, i.e. has a
determinant of zero, then there is no inverse. Let us now consider an
example:
A A-'
la bl
I
I
I d -b
I i iAT TAT
dj
I'
Lc
I
I!
I
-c a
LiAT IAI
144 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
If
A= and IAI = 8
5 - l
8 4 1o.625 - o.25l
-3
4
l
2 l- 0.75 0.5
[: :] [l OJ
0.625
[ - 0.25]
-0.75 0.5 0 l
[01 OJ
1
[
-;
2- t]
t_
Thus, we have arrived at the same answer as before. We can, by
using the transformation rules relating to determinants, calculate the
determinant of the original matrix. The rules invoked are:
IA1=4X2=8
5 4 2
6 4 3
0 2 5
5 4 2 0 0
6 4 3 0 0
0 2 5 0 0
0.8 0.4 0 0
6 4 3
10: 0
I
0 2 5 L0 0
row 2 6 4 3 0 0
which gives
:6J
0 -0.8 - 1.2 0
0 2 0 0 0
BASIC MATRIX OPERATIONS 147
Divide row 2 through by - 0.8, to give 1 at a 22 :
0 2 5 0 0 0
subtract 0 - 1 0.64 0
which gives
0 - 1 0.64 0
0 2 5 0 0
0 - 1 0.64 0
0 0.4 0 0 0.2
which gives
0 - l 0.64 0
0 0.4 0 0 0.2
- l 0.64 0
0 0 0.2
which gives
ll 0 ll - l 0.64 0
l:
l.l5 -0.6 0.115
row l 0 - l 0.64 0
l: 0
0 1.15
-0.46
-0.6
0.24
0.154
IA I = 5 X (- 0.8) X 5 X 1.3 = - 26
A/S = s- 1A
It is also worth noting that, since we can only find the inverse of
square matrices, it is only possible to perform this particular
operation if the divisor is a square matrix with the same number of
rows as the other has columns.
[A][x] = l[x]
[A][x] = [b]
except that
[b] =l[x]
[x]=~
IAI
This is Cramer's Rule, where the values of x are given as the ratio of
two determinants; however, the determinant of the numerator is
zero, since it has one row made up of zeros. We can rewrite the
equation as
EIGENVALUES AND EIGENVECTORS 151
IAI[x]=O
Now, if the vector [x] is not equal to zero, then it follows that the
determinant I A I must be zero:
I[A]- .l.[I]I = =0
The equation
IA-.l.ll =0
Multiplying out,
A,-;.
B2-).
A 1B2 - B2.l.
- A,.l. + ).2
- A I A + A I B2 - B2). + A2
which is
A.2 + aA. + fJ = 0
Finding the roots of this equation will give the values of .l., the
eigenvalues of the matrix.
The use of a simple example will help to clarify the calculation
procedure. Consider the equation set
152 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
4x+ 2y= a
6x+ 5y=b
[A]=
[ 46 25]
[
4-A. 2 l =0
6 5-A.j
Multiplying out,
20- 9A. + A. 2 - 12 = 0
Collecting terms,
A. 2 - 9A. +8= 0
Factorizing,
(A.- 8)(A. - 1) = 0
(A- A.l)x = 0
Now, if the eigenvalues A. have been found, we can solve for x 1 and
x 2, which are the eigenvectors associated with the particular eigen-
value substituted in the equation. There will be one pair of eigenvec-
tors for each eigenvalue. Thus, for the example which we have been
using:
for A.= 8
2 l
5-8 J
which is analogous to the simultaneous linear equation set
whence x 1 = I and x 2 = 2
for A.= I
2 l
5- I _I
J54 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
which is
whence x 1 = 2 and x 2 = - 3.
Mg K
Mg 1.0 0.45]
[
K 0.45 1.0
r' I - )c
0.45 l
(A]- )c[I] =
~45 I- )c.JI
EIGENVALUES AND EIGENVECTORS 155
Equating the determinant to zero,
A2 - 2A + 0.7975 = 0
Using the formula for the solution of a quadratic,
A= 1.45 or 0.55
l'- ~::
I.
0.45l
1 - 1.45 J
which gives
- 0.45x 1 + 0.45x2 = 0
0.45x 1 - 0.45x2 = 0
0.45x 1 + 0.45x2 = 0
0.45x 1 + 0.45x 2 = 0
M
1.0 g
0.55
the same scale as the axes, which are of unit length), corresponds to
half the length of the major axis of the ellipse and has a slope of 45,
while the vector represented by the eigenvalue 0.55 is half the length
of the minor axis and has a slope of 135 (both slopes are measured
from the vertical axis of the plot). The values of the correlation
coefficients for the two metals represent the tangents of the angles
which two vectors make with the appropriate axis, and can be
represented by plotting on the perimeter of the ellipse as shown. The
origin of the axes for this type of plot is always at the centre of the
ellipse. To summarize, we have taken a matrix representing the
correlation between two variables, and have reduced it to a pair of
vectors whose magnitude and direction are given by the eigenvalues
and eigenvectors of the correlation matrix. With more-complex data
sets, where many variables are involved, the method can be
extended. It is an extremely useful technique, and is the basis of a
number of methods in multivariate statistics. Another application,
as will be seen in Section 6.4.4, is in structural geology.
During the consideration of the problem of determining the
eigenvalues and eigenvectors, the characteristic equation for a
square matrix of order n has been defined, and a mathematical
GEOLOGICAL APPLICATIONS AND EXAMPLES 157
technique has been described for its solution. The method outlined
can be used to solve square matrices of any order, but it proves to be
too cumbersome for the solution of examples where the order is > 3.
Several alternative methods of solution have been developed to
facilitate the solution of higher-order matrices. One method, specifi-
cally designed to find the eigenvalues and eigenvectors for real
symmetrical matrices, known as the Jacobi method, is described by
Cheeney (1983, pp. ll9ff.). This method is used as the basis of
Program 6.2, which is a development of a Fortran program given by
Davis (1973, pp. 168-9).
Table 6.4 Transition matrix for a sequence measured in the Jurassic, Blisworth
Limestone. Sample interval 20 em. Lithologies simplified.
limestone 38 4 8 so
marl 4 9 2 15
clay 7 2 6 15
158 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
Table 6.5 Transition probability matrix for the data of Table 6.4.
=
P
{i.j) s
TU.J)
(1)
m~"' m"l~:::
0.60 X 0.60:0.360
060
0.13 X 0.60:0.078
0
~limestone 0.47 X 0.13:0.0611
:0.4283
:0.3985
:0.1732
Figure 6.5 Tree diagram for the three-state transition probability matrix of Table
6.5, to show the successive probabilities of reaching a given lithology after two steps,
starting with marl.
or
p (marl-+ clay) = 0.13 (step l)
MMM = MJ = p<step 3)
limestone-+ marl
Table 6.7 Transition probability and cumulative probability matrix for the se-
quence illustrated in Figure 6.6a. Sample interval 20 ern.
Clay
Blisworth Cloy
Limestone
Limestone - Cross Bedded
Marl
Limestone
Morl- Epdllyris Bed
Clay
Limestone Marl
Cloy
Morl Cloy
Limestone
Clay
Limestone- fossiliferous
Limestone
Marl
Clay
Limestone Marl
Cloy Clay
Marl Limestone
Clay Clay
Marl
Clay Limestone
Limestone- fossiliferous-oolitic
Clay
E . = RSUMi CSUMj
'' TSUM
where EiJ is the expected frequency for each element iJ, RSUMj is
the row sum of the jth row of the observed matrix, CSUMi is the
column sum of the ith column of the observed matrix, TSUM is the
sum of all the elements in the observed matrix, and i and j range
from 1 to n, n being the number of different lithologies in the
sequence. In other words, if we wish to find the expected frequency
for any element iJ we multiply the row total for row j by the column
total for column i and divide by the total number of transitions.
Having obtained the expected random matrix, this is then subtracted
from the original to give the difference matrix. These calculations are
shown in Table 6.8, which is based on a simple example, recording
Table 6.8 Calculation of an expected random matrix and difference matrix, based
on a transition matrix relating to a sequence in the Lower Carboniferous (see text for
details).
Element 2,2 is the sum of the squared deviations from the mean for
variable 2
which is
D=MM'
Table 6.9 Raw data matrix R, relating to trace element analysis of ten samples of
flint, from Grimes Graves, Norfolk.
Sample no.
2 3 4 5 21 22 23 24 25 Mean
aluminium (ppm) 636 616 605 586 495 668 648 512 635 503 590.4
magnesium (ppm) 29 24 26 26 90 21 21 19 28 18 30.2
potassium (ppm) 240 247 220 247 202 217 250 203 234 172 223.2
where xi.j is the ith sample of the jth variable, aj is the standard
Table 6.10 Calculation of the dispersion matrix, using the flint mine data of Table
6.9.
(a) The data expressed as the deviation from the mean for each element, matrix M
aluminium 45.6 25.6 14.6 - 4.4 - 95.4 77.6 57.6 - 78.4 44.6 - 87.4
(ppm)
magnesium - 1.2 - 6.2 -4.2 - 4.2 59.8 - 9.2 - 9.2 - 11.2 - 2.2 - 12.2
(ppm)
potassium 16.8 23.8 - 3.2 23.8 - 21.2 - 6.2 26.8 - 20.2 10.8 - 51.2
(ppm)
AI Mg K
deviation of the jth variable and xj is the mean of the jth variable.
The correlation matrix R is given by
R = (l/n)(NN')
where N is the normalized data matrix, N' is its transpose and n is the
number of samples. The matrix R will be square and of order m,
where m is the number of variables. Details of this calculation, using
the data set given in Table 6.11, are given in Table 6.12 as an
example.
In both of the example calculations given, only three variables
have been used. However, the method can be extended to take into
account any number of variables. For details of the use of these
matrices in multivariate statistics, the reader is referred to a statisti-
cal text such as that of Davis (1973). Program 6.4 is a computer
program which allows the multiplication of a matrix by its transpose
and, if required, will also calculate either the dispersion matrix D or
the correlation matrix R.
u = x'- x v= y'- y
(a) The normalized data matrix N (each value has a mean of zero and standard
deviation of unity)
Sample no.
2 3 4 21
AI Mg K
:l
i
AI 1.00 0.48 - 0.55
x.y x~y'
L __ _ _ _ _ _L __ _ _ _ ~ X
X
Figure 6.7 Simple shear showing the transformation of the point (x,y) to the new
position (x' ,y') and the angular shear strain 'I'.
u = - yy v= 0
where y =tan 'fl. 1f1 is the angular shear strain, and is negative.
The co-ordinate transform equations are
x' = x- yy y'=y
or
-y
[:]
Strain equations can be written either relating the final position to
the original or vice versa. Thus, for body translation we have the
following.
in matrix form
[: y
GEOLOGICAL APPLICATIONS AND EXAMPLES 171
(b) Eulerian equation
in matrix form
If the values for the coefficients of the matrix for the Lagrangian
specification are known, then the values of the coefficients of the
Eulerian form can be calculated as follows:
A = df(ad- be)
B= - bf(ad- be)
C=- ef(ad- be)
D = af(ad- be)
Then, applying the rules of matrix algebra and remembering that the
above equations represent the inverse of a 2 x 2 matrix (Section
r; :J [: :r
6.2.6),
v=y'- y=ex+(d-l)y
which is
[u] v
[a-
e
1 b
d- I
J IlyXJ
L
172 NUMBER ARRAYS, THE ALGEBRA OF MATRICES
[: :] [:J [:]
and the lengths of the major and minor semi-axes of the ellipse are
given by the eigenvalues of the shear matrix, and the principal finite
extensions e 1 and e2 , by the lengths of the eigenvalues minus one.
That is, since the lengths of the major and minor semi-axes are given
by (1 + e1) and (1 + eJ, respectively (Ramsay & Huber 1983, p. 31),
then e 1 = (largest eigenvalue- 1.0) and e2 = (smallest eigen-
value- 1.0). Also, since the eigenvectors associated with each eigen-
value give the vector direction, we can use them to define the
orientation of the strain ellipse. Any area change or dilatation is
given by the determinant of the matrix and the ellipticity by the
square root of the ratio of the largest to smallest eigenvalues. It
should be noted that the strain ellipse concept is a Lagrangian
concept. Students wishing more information on this application of
matrix algebra are referred to the textbook Strain analysis applied to
structural geology by Ramsay & Huber (1983).
Before leaving this topic, it is worthwhile considering two simple
examples. Table 6.13 lists co-ordinate pairs defining the simplified
outline of the Palaeozoic brachiopod genus Spirifer and the resulting
pairs after shear represented by the two matrices
and
[: :J
new co-ordinate pairs being calculated as outlined in Section 6.2.3.
For the first of these matrices, which represents simple shear, the
angular shear strain is given by
tan- 1 1.5 ~ 56
GEOLOGICAL APPLICATIONS AND EXAMPLES 173
Table 6.13 Co-ordinates (x,y) representing points on the perimeter of a simplified
outline of a spiriferid brachiopod, and the corresponding pairs (x;,y; and x;,y;) after
shear represented by the two matrices given in the text. These data are plotted in
Figure 6.8.
X 3 5 6 7 9 II 3 9 6 5.5 6.5
y 4 4 4 4.5 4 4 4 2 2 1.5 I
x; 7 9 II 12.75 13 15 17 6 12 8.25 7 8
y; 4 4 4 4.5 4 4 4 2 2 1.5 I I
x; 6 10 14 16.5 18 22 26 8 20 13.5 12 14
y; 9 II 13 15 15 17 19 7 13 9 7.5 8.5
We can verify this by reference to Figure 6.8, where a line joining the
point representing the umbo of the shell (a) and the sulcus (b) is
parallel to they-axis in the original undeformed shape. However, in
the deformed shape (outline at lower right on figure) a line joining
the same two points in their transformed positions (a 1,b 1) makes an
angle of about 55 with the y-axis. Also, since the determinant is
unity there is no area change. This can also be verified by reference
to the figure.
The second matrix represents homogeneous shear, and it follows
y
20
10
X
10 20 30
that the changes will be more complex for this transformation. First
we calculate the determinant of the matrix, which is
2
= I(2 X 2)- (1 X 1) I = 3
2
Expanding,
. F-4..1.+ 3 = o
(..1.- 3)(..1.- 1) = 0
. 1. = 3 or 1
2-..l.x+ y=O
x+ 2-..l.y=O
-x+y=O
x-y=O
whence x = I and y = 1.
GEOLOGICAL APPLICATIONS AND EXAMPLES 17 5
For the eigenvalue A.= 1 we get
x+y=O
x+y=O
whence x = 1 and y = - 1.
The eigenvectors show that the principal axis of the strain ellipse
for this matrix is at 45 to the x-axis, whereas the minor axis is at
right angles to this. Returning to Figure 6.8 we can verify this by
observing that lines joining corresponding points, for example a,a 2
or b,b 2 will also be at 45" to the x-axis, i.e. parallel to the principal
axis of the strain ellipse.
So far we have only considered simple examples of the shear
matrix order 2, dealing with changes in two dimensions; we can,
however, take into account the third dimension, and more-complex
shear matrices of order 3 and vectors of the same order locating
points in three dimensions can be used. The mathematical pro-
cedures are the same as those outlined above, only the interpretation
will be slightly different. The determinant will give the change in
volume (if any), while the eigenvalues (3) of the shear matrix will
give the lengths of the principal axes of the ellipsoid. Students should
also be aware of the existence and uses of other matrices in structural
geology, often referred to as tensors. These matrices provide a
convenient method of specifying completely the three-dimensional
state of stress at a point. In the context of structural geology, simple
tensors are square matrices of order 3, with the elements represent-
ing the nine stress vectors acting across the positive faces of a small
cube whose edges are parallel to the co-ordinate axis as:
The rules of tensor algebra are the same as those of matrix algebra,
and for their application to problems in structural geology students
are referred to texts such as that of Means (1976).
7 Introduction to numerical
methods
aX 2 +bX+ c= 0
Fraction Decimal
1/1 1.0
3/2 1.5
7/5 1.4
17/12 1.416666
41/29 1.413793
99/70 1.414285
239/169 1.414201
5/7/408 1.414215
1393/985 1.414213
3363/2378 1.414213
32 10 7 8 7l 3.5 0
-1
23 7 5 6 5 -3.25 0
R= I
33 8 6 10 9 2.25 -0.251
31 7 5 9 10 0.25 0
I
J
These residual values do not appear too bad, but it is easy to verify
that
X = 0.2:i'- + 0.4
x= 0 and x= 10
XI= 20.4
x 2 = 83.6
x 3 = 1398.2
etc.
Step 5. Make another guess at the value of the second root: say
x = 5, then put this value on the right-hand side of the
equation:
x 2 = 6.232
x 3 = 8.1676
etc.
In this example, any choice of value for x which is just lower than the
value of the largest root, will give answers which will converge on the
lowest root. A choice of x which is higher than the largest root will
always diverge. This illustrates one of the fundamental problems of
iteration, that of convergence.
(b) Find the positive root of the equation (correct to 4 decimal
places),
lnx=l+1/x
x 0 = 3.0
x= f(x)
which is
x = exp[l + (1/x)]
and set up the recurrence relationship
(c) Testing for convergence. In the first example above, using the
method of simple iteration, we could only find one of the two roots
for the equation. For one root successive estimates converged to give
the correct answer, while for the other, the estimates diverged,
ITERATIVE METHODS 185
successive estimates becoming larger (i.e. x, tends to infinity as r
increases). From a practical standpoint it is important that conver-
gence should occur, and that it should happen as rapidly as possible.
To test for convergence, it can be shown that an equation will
converge if the following result holds:
In other words, if, after differentiating the equation and putting into
the differential the initial guess for the unknown root, the result is
less than unity, then the iteration will converge, giving a real root of
the equation. Returning to our two examples:
f'(x) = 0.4x
f(x) = 0
e-x- sin x =0
f'(x)= -(e-x+cosx)
e-xn- sin X
X
n+ I
=x
n
+e Xn + COS Xnn
If x 0 = 0.5, then
f(x) = r- a
f'(x) = 2x
Choosing x 0 = 3.0,
when
188 INTRODUCTION TO NUMERICAL METIIODS
2x 1 - x 2 + 3x3 - x 4 = 7 (1)
x 1 - x 2 + 4x 3 - 2x4 = 5 (2)
m x, x2 X3 x4 b
2 - I 3 -I 7 (I)
-t - I 4 -2 5 (2)
-.l
2 3 2 4 31 (3)
2
4
4 -3 3 -3 -3 (4)
XI x2 X3 x4 b
2 -1 3 -I 7 (1)
-t
- 1 t -.l
2
~
2
7
2
add -I 4 -2 5 (2)
-t 5
2
-.l
2
3
2 (6)
XI Xz X3 x4 b
2 - 1 3 - 1 7 (5)
-t 5
2
-.l
2 2
3
(6)
7 -22 II 41
2 2 2 (7)
-1 -3 - I - 19 (8)
SIMULTANEOUS LINEAR EQUATIONS 191
Note that we have recorded the pivotal equation (now labelled Eqn
5), unchanged after these operations. The next step is to eliminate
the x 2 coefficients from Eqns 5, 7 and 8. In this process the x 2 value of
- t in Eqn 6 is used; this is the 2nd pivot and Eqn 6 is known as the
2nd pivotal equation. The multipliers are calculated using - t as the
divisor:
m x, xl X3 x4 b
-2 2 - 1 3 -I 7 (5)
-t 5
2
-1.
2
3
2 (6)
7 7 II 41
7 2 2 T T (7)
-2 - 1 -3 -I - 19 (8)
and, performing the operations as before, using the new set of
multipliers, we get
x, xl X3 x4 b
2 -2 3 6 (9)
5
-l
2 2
3
2
3
2 (10)
14 -5 31 ( 11)
-8 2 -22 (12)
m x, x2 XJ x4 b
I
7 2 -2 3 4 (9)
5 5 3 3
2
-l
2 2 2 2 (10)
14 -5 31 ( 11)
4
7 -8 2 -22 (12)
192 INTRODUCTION TO NUMERICAL METHODS
So we have
x, x2 xJ x4 b
2 7
9 59
7 (13)
-t -!l
28
-ill
28 (14)
14 -5 31 (15)
-.2.
7
-N7 (16)
For this method, ifn is the number of unknowns, then the number of
operations N is given by
2x 1 - x 2 + 3x3 - X4 = 7 (I)
x 1 - x 2 + 4x 3 - 2x4 = 5 (2)
XI x2 x3 x4 b
2 - 1 3 - 1 7 (5)
-t T
5 -1.
2 T
3 (6)
_]_ II
T
7
2 T
41
T (7)
-1 -3 -1 - 19 (8)
Next we eliminate x 2 from Eqns 7 and 8 only, leaving the 1st pivotal
equation unchanged:
m XI x2 x3 x4 b
2 -1 3 -1 7 (5)
-t T
5 -1.
2 T
3 (6)
7 T
7 _]_
2 T
II 41
T (7)
-2 -1 -3 - 1 - 19 (8)
to give
XI x2 x3 x4 b
2 -1 3 - 1 7 (9)
-t 5
T
_]_
2
3
2 (10)
14 -5 31 ( 11)
-8 2 -22 (12)
m x, Xz xJ x4 b
2 - 1 3 - 1 7 (9)
-t 2
5 _]_
2
3
2 (10)
14 -5 31 ( 11)
8
T4 -8 2 -22 (12)
to leave
x, x2 x3 x4 b
2 - 1 3 - 1 7 (13)
_]_ 3
-.l
2 2
5
2 2 (14)
14 -5 31 (15)
--2- -30
(16)
7 T
x, = 1 x2= 2 x 3 = 4 x 4 = 5
stage I X X X X X X X X
X X X X X X X X
X X X X X X X X
X X X X X X X X
stage 2 X X X X X X X X
X X X X X X
X X X X X X
X X X X X X
stage 3 X X X X X X X
X X X X X X
X X X X
X X X X
stage 4 X X X X X X
X X X X X
X X X X
X X
stage 5 X
X
X
X
m XI Xz x3 b L
2.4759 1.6235 4.6231 0.0647 (l)
Notice that we now have one extra column in our layout, the check-
sum column, labelled L
this will contain the sum of the numerical
values for each equation:
.L = x, + x 2+ ... xn + b
We form our multipliers as before, dividing the coefficient that we
are eliminating by the pivot, and reversing the sign. We perform this
operation to at least one more decimal place than the original
coefficients:
m x, x2 X3 b .L
2.4759 1.6235 4.6231 0.0647 8.7872 (1)
which gives
m x, x2 X3 b .L
2.4759 1.6235 4.6231 0.0647 8.7872 (4)
Notice that we have two new check-sum values for Eqns 5 and 6. We
can now check our calculation as follows.
Take the check-sum value for the 1st pivotal equation, multiply by
one of the row multipliers as in (a) below. Add this to the check sum
for the row chosen as in (b) below. This value should agree with the
new check-sum value for that row, i.e. subtraction should give a zero
value, as in (c) below.
For example, to check Eqns 2 and 5:
m b
to give
XI x2 XJ b I
2.4759 1.6235 4.6231 0.0647 8.7872 (7)
Back-substitution leads to
r=b-d
where b is the vector containing the original values from the right-
hand sides of the equations and dis the vector containing calculated
values for the right-hand sides of the equations, calculated as:
198 INTRODUCTION TO NUMERICAL METHODS
b d r
The size of the third residual is probably due to the loss of significant
figures during the formation of the 2nd pivot and its use in
subsequent calculations. This suggests that the equations may be ill-
conditioned, and it may be that we would only be justified in quoting
our answer correct to 2 decimal places.
In the methods described we have been systematic in our choice of
pivots, and have also used the values in the order given and without
any attempt at scaling. However, there are variants of the methods
discussed which involve scaling where it is practicable, and involve
choosing pivots according to their numerical value. This last method
has the advantage that, if we use the numerically largest element in
each row as the pivot, then inaccuracies in calculation due to using
excessively small pivots, as in the last example, can be avoided.
A suitable computer program for the solution of simultaneous
linear equations is listed as Program 7.1. It is based on the Gaussian
elimination method as described in the text, except that the pivots
are chosen on the basis of the largest element in each row. Results
from the program include the value of the determinant of the
coefficient matrix and the values of the pivots used in the calculation.
SIMULTANEOUS LINEAR EQUATIONS 199
7.3.2 Methods of solution based on an iterative technique
Algebraic methods of the solution of simultaneous linear equations
and those based on matrix algebra are direct methods, whereas the
iterative method, known as the Gauss-Seidel method, is an indirect
method.
For example, we are asked to solve the equations
XI = (2 - (2)(3)]/3 = - 1.33
XI=- 0.81
x 2 = 1.87
Iteration number
2 3 4 5 6 7
X 1 =- 0.40
If there are more than two unknowns, then we can proceed by one of
several different methods, only two of which we need to consider
here.
In 1 t i a I
guess
- c2y;l()
3
+ z v\
We can rewrite as
and
provided that
IAI =ad- cb
SIMULTANEOUS LINEAR EQUATIONS 203
where
A~ [: :]
z, b, a, z,
z2 b2 a2 7'
~2
X=
I
X2=
a, b, a, b,
a2 b2 a2 b2
provided that
3x+y+4z=l2
2x + 4y + 2z = 8
y + 3z = 16
204 INTRODUCTION TO NUMERICAL METHODS
3 1 4
4 2 2 2 2 4
IDI= 2 4 2 = 1 - 1 +4
1 3 0 3 0
0 3
= 32
12 4
x=.!..
32 8 4 2 = 3~6 =- 3
116 3
3 12 4
y=fi 2 8 2 =#=1
0 16 3
3 12
z=.!..
32 2 4 8 ='3~=5
0 16
A=
X1l -Z 1 l
x=
~21 b= z2
I
~"J
I
Zm J
An exact analytical solution is given provided that IA I #- 0, using
X= A- 1b
and
12 -3-1
[A -I] 8 l I
I
16 5J
206 INTRODUCTION TO NUMERICAL METHODS
(a) If one equation is nearly a multiple of the other, then the system
of equations will be ill-conditioned and problems of round-off
error will become very important. Small changes in the coeffi-
cients lead to large changes in the roots of the equations (cf.
example in general discussion at start of Section 7.3). Test by
calculating the determinant which should be small compared
with the expansion of the terms that make up the determinant.
Also note that if the determinant is small in the sense just
described: it is likely that an iterative procedure will take a long
time to converge. Thus, the rate of convergence can be taken as
a rough guide to condition.
(b) If the determinant is zero, then the equations are exact multiples
and plot graphically as parallel lines.
(c) Estimates of roots which differ appreciably from the true roots
can produce small residuals (cf. earlier comment on small
residuals not guaranteeing good solutions, Section 7.3.lc).
(d) If individual terms of the inverse of the coefficient matrix are
large compared with the roots of the equation.
(e) If the ratio of the largest to the smallest eigenvalue is large.
(a)
89 1~4.011= -0.45
155 89
-6
(b)
- 3
- 5
Figure 7.3 Examples of ill-conditioned equations. (a) Graph of the equations
89x 1 + (144 + a)x, = 233 and 55x, + 89x2 = 144, where a is a small increment. The
two equations plot almost exactly on top of one another. (b) Graph of the pair of
equations 3x, + 2x, = 2 and 6x, + 4x2 = 10, which plot as two parallel lines.
208 INTRODUCTION TO NUMERICAL METHODS
S= a In ( 1 + ~) (1)
x= N/(N+ a) (2)
11
o Observed
Calculated
9
7
Vl
LU
u
LU
~ 5
Vl
u..
0
a:: 3 0
LU
co
L:
::J
z
2 4 6 7 8 10
NUMBER OF INDIVIDUALS
Figure 7.4 Plot of palaeoecological data showing the number of individuals repre-
senting a species against the number of species represented by that number of
individuals; (0) observed; (e) calculated. For a community collected from the
Lower Carboniferous of Weardale, Co. Durham. Fisher's a= 8.802; no. of spe-
cies = 33; no. of individuals = 304. See text for details.
GEOLOGICAL APPLICATIONS 211
(b) CALCULATION OF THE DISSOLUTION RATE OF ANHYDRITE
The problem of the dissolution of gypsum and anhydrite in the
foundations of dams and other large hydraulic structures is ex-
tremely important. A study by James & Lupton (1978) has shown
that, although both materials are relatively insoluble, significantly
large amounts of them can be removed by dissolution in periods of
between 50 and I00 years. Such a process, if unchecked, can lead to
structural failure, perhaps of catastrophic proportions. The mathe-
matical solution of one particular equation involved in the calcula-
tion provides a good example of the use of simple iteration. If we
consider the dissolution of anhydrite, then the amount dissolved is
governed by a second-order kinetic reaction, where
dM
-=K(S-c)
dt
KAt_ c
V- S(S-c)
x = c/S(S- c)
Iz = 10.54
c = 0.0105928
B = 0.0835258
A = 0.4633817
R= 0.03072
0.03888
Z(calc) = A + Bx + Cy
we can calculate values for z(caic) for any pair of co-ordinate values x
andy. The results are presented as a linear trend map in Figure 7.5.
GEOLOGICAL APPLICATIONS 215
5
61 46
(55.3)
(64.5)
78
(71 .9)
4
76
64 S4 87 (88.0)
(55.2) I
(69.4)
(76.9)
3 4 7:
95
I
(55. 3)
_J I (75.2)
<
u
f-
a:: 50/o 80/o 7 1
I.U (84.4)
>
80
(74.6) 78
(82.7)
2
HORIZONTAL
Figure 7.5 Linear trend surface map, calculated using the multiple regression
method discussed in Section 4.5.2. The map represents a small section of a quarry
face, showing a contoured plot of percentage methane (data from Table 7.4). Each
marked sample location gives the original data value and the calculated trend value.
The trend value is in parentheses. , Linear-trend contour line, values
are percentage methane; - - - - - , mineralized joint plane.
From this map it can be seen that although there are discrepancies
between the original data and the surface, the linear trend is almost
parallel to the mineralized joint plane, which fits in with the
hypothesis that there may be a relationship between the mineraliza-
tion and the occurrence of light hydrocarbons in these samples.
8 Geological applications of
differential equations
d y + dy = f(x)
2
is a second-order differential equation
dr dx
dy = 3x
dx
3
fl.x) = J3xdx = 2.r + C
DEFINITIONS AND EXAMPLES 217
Note that here we include the constant of integration, which leads to
the general solution. Thus, the equation gives rise to a family of
curves, which will have the same slope for a particular value of x, as
shown in Figure 8.1. If a particular solution is required, then an x-
and corresponding y-value must be given. These are referred to as
boundary conditions, and enable the constant of integration to be
calculated. Thus, in the example we have just considered, if x = 0
when y = 2, then
3
y=-r+2
2
dy =f(x)
dx
y = .fj(x)dx
y
30
-10
Figure 8.1 The family of curves of the differential equation dyjdx = 3x, i.e.
f(x) = tx' + c for c = - I, 3 and I 0.
218 GEOLOGICAL APPLICATIONS
dx = f(y)
dy
_l_dy =I
f(y)dx
I
J-dy= Jdx
'f(y)
dy _f(x)
dx- g(x)
g(y) dy = f(x)
dx
fg(y)dy = .[t(x)dx
dy =ky
dx
_!_dy =k
ydx
I
f-dy = fkdx
y
NUMERICAL SOLUTION: FINITE DIFFERENCES 219
which leads to
lny= kx+ C
or
y = exp(kx +C)
= expkxexpC
(d
dr
y)
2 ~ (dy/dx) 0
~
-
lx
(dy/dx)p
0
Thus,
NUMERICAL SOLUTION: FINITE DIFFERENCES 221
We can derive the finite-difference quotients for higher-order deriva-
tives similarly.
To use the finite-difference approximation we can drop the
y = f{x) notation and use values of y themselves. The finite-differ-
ence approximations with the y = j{x) notation replaced by the
corresponding y-values, for differential equations up to order four,
are
Yo- 2y 1 + Y2 = 0.000625
Y1 - 2y 2 + Y3 = 0.00125
Y8 - 2y9 + Yw = 0.005625
- 2y 1 + y 2 = 0.000625
Y1 - 2y 2 + y 3 = 0.00125
Y2 - 2y 3 + Y4 = 0.001875
=- 9.994375
z = J(x,y)
oz oz
ox and oy
224 GEOLOGICAL APPLICATIONS
OZ = Z;+ l . j - Z;-l,j
ox 2L\.x
and
OZ= z.l,[' + I - z.l,J-
- 1
oy 2Ay
o
_ z=
2 z.+ 1 .-2z.+z
I l,}
,}l - 1,}
.
of Ax2
and
-
o2z = z .+ 1 - 2z . . + z. 1
oy AI
l,) '' 1,{-
It should be noted that there are now two indices in the equations;
when we hold y constant the index j remains constant and, con-
versely, when xis constant i remains constant.
Now consider the partial differential
z.+
1
I .-
,)
2z.1.) + z. 1-
I .
,) + z . .+I
1,)
- 2z1,). + z.1,)-
. I = c
A.'~? Ay2
If we use a regular grid where Ax= Ay, then this equation becomes
z.+
l 1,). +z , - 1.j. + .. + 1 + z1,)-
zl,) .. 1- 4z l,). = c(Ax) 2
Inspection of Figure 8.2 shows that there are 25 cells involved, nine
of which are internal (i.e. have no external boundary), four have
external corners and 12 contain an external edge.
NUMERICAL SOLUTION: FINITE DIFFERENCES 225
i. j 1
I 0
I e e 1
L
z !j) i, j -1 : I :
t- - - - -- - - - -.
I o
e II e e '
I
- -- - _,_ - - - -'
I I I
I 0 I
x (o)
Figure 8.2 Method of indexing points for the finite-difference quotients, with two
independent variables. Points are at the centre of a regular grid of .1.x = Ay.
dN
-=rN
dt
dN
- = -rN
dt
GEOLOGICAL APPLICATIONS 227
where N =the population size, t =time and r =constant. Both
equations can be solved analytically, and lead to
and
InN,= lnN0 + rt
and
lnN, = lnN0 - rt
dC = -kC
dt
InC= In C0 - kt
or
dC = k(C- C)
dt s
228 GEOLOGICAL APPLICATIONS
In c -c)
(C,- C 0
= - kt
dA=kS
dt
InS= - kt + In I 00
dM
--=aM
dt
=Mo-Me-a'
(N- l)!:J..t
The mass deposited during any time increment is Md. The mass
remaining today M,, from sediments deposited at time i, is
Thus, given values for M, a, N and lit, we can calculate the mass
remaining for values of i, where i goes from 1 to N. A Fortran
program based on both of the Garrels and Mackenzie models is
given by Harbaugh & Bonham-Carter (1970, pp. 367tf.). They also
include a comprehensive discussion of the mathematics of the two
models.
n- n0 = P(uyz cos a- uzx sin a)+ Q(2u_.Y cos 2a + ul'l sin 2a)
where P = 2NDmhl/-r and Q = 2NDml 2 j-r. P and Q are constants,
and depend on the physical parameters of the instrument,
oc
at =DVC
which simply states that the change of concentration C with time tat
any given point is equal to the Laplacian of the concentration, VC,
times the coefficient of diffusion, D. To solve the differential
equation, so that values for the concentration at a given depth z at
time t could be found, boundary conditions are set using the
following assumptions.
timet= 0
depth z = h
can be used.
C(O,z) = 0
This does not, however, satisfy the first boundary condition. There-
fore Stegna (1961, p. 448), assumed that the concentration at timet
and depth z consisted of two parts:
C"(O,z) = - cO,z/h
C"(t.hl = 0
C"(I,O) = 0
which will satisfy the third of the above boundary conditions, but
not the second, unless
a=nrc/h
234 GEOLOGICAL APPLICATIONS
n~l
co
An Stn h
(mt) Z = - C0
(z)fz
This leads to an equation which allows the calculation of the gas
concentration Cat a depth z and at time t, which is
C (t.z) = Co (:.
h
+ ~ L..
~ (- 1)"
e
-(n21tz/irl)Dt
sm h
n7tZ)
1tn=l n
where a;= the thickness of the ith lithology and D; = the diffusion
coefficient of the ith lithology.
It is interesting to note that Stegna concluded that in some
situations sufficient time could not have elapsed from the formation
of the reservoir to the present for steady-state diffusion to occur.
Also, he argued that since the time required to reach steady-state
diffusion varies with the square of the depth 'the geochemical
method is a structure-prospecting rather than a direct oil-prospect-
ing method' (Stegna 1961, p. 450).
This example is only one of many ways of solving the differential
equation for Fick's Law. Harbaugh & Bonham-Carter (1970,
pp. 238-58) look at several examples with different objectives, and
hence different boundary conditions. Thus, if the material through
which diffusion takes place is anisotropic then the diffusion coeffi-
cients kx, kY and k=, for each direction x, y and z is
OC = k o2 c + k o2 c + k iJ2c
ot x8X2 yay =a?
which can be solved by finite differences.
GEOLOGICAL APPLICATIONS 235
8.3.4 Formation of crenulation cleavage
This example follows closely the arguments and discussion of
Cosgrove (1976), with some reference to the earlier work of Cobbold
eta/. (1971). There have been two proposed origins for crenulation
cleavage (Cosgrove 1976, p. 157): either it is the result ofmicrofault-
ing or the result of microfolding. Both types are well represented
geologically, along with a third type - crenulation cleavage in
conjugate sets. This last type has been considered as a consequence
of microfaulting. Cosgrove (1976) demonstrated that these three
types of crenulation cleavage are different expressions of buckling
instabilities developed in anisotropic materials.
Certain assumptions were made:
Figure 8.3 The relationship between foliation and the co-ordinate axes x and y
chosen for the mathematical analysis of the formation of crenulation cleavage (after
Cosgrove 1976, fig. 4).
236 GEOLOGICAL APPLICATIONS
u= o<pxy v= o<pxy
ay ox
where P =the differential stress (cr 1 - cr 3), Q =a measure of the
resistance to shear and N = a measure of the resistance to compres-
sion in the direction of the maximum compressive stress cr 1
The shear and compressive moduli M and L for any stress
condition can be given in terms of Q and N, as
M= N+ P/4 L= Q+P/2
type I 2M>L
P>L
type 2 2M<L
type 3 2M<L
P>L
From these it is easy to see that the type of buckling which develops
depends on the material properties, M and L, and the size of the
differential stress, P. It should be noted that:
which will have either one or three real roots, it follows that there
will be either one or three equilibrium positions for any given pair of
values a, b. If for any position in the control space there is only one
equilibrium position which is stable, there will be an energy mini-
mum. Where there are three positions, the upper and lower are
minima and the intermediate will be a maxima. The maxima will
give rise to an unstable equilibrium.
The state of equilibrium is maintained on a topological surface
either by smooth changes or by catastrophic jumps. The theory can
be used to model and predict features of stable and unstable
equilibrium. Zeeman (1971) has suggested that the catastrophe
model could be appropriate if the system exhibits one or more of the
following:
(a) bimodality;
(b) divergence;
(c) catastrophic jumps; or
(d) hysteresis delays.
BASE z
0
f-
<(f-
uw
a:::Vl
:J
Figure 9.2 The cusp catastrophe model for Zeeman's catastrophe machine.
whereas inside the bifurcation set two modes are possible. These
correspond to the two stable positions of the disc with two local
minima (one on the top sheet and one on the bottom), even though
the behaviour surface at this point is overfolded and has three
sheets. This follows since the middle sheet, corresponding to a local
maximum, is made up of points representing the least probable
behaviour of the system. These factors can be more easily under-
stood by plotting the energy function V(x) for values of u < 0 for
different values of v (the position of the control point). A set of plots
of the energy state are shown in Figure 9.3. The position of the ball
in the diagram shows, in a pictorial manner, how the catastrophe
occurs.
Returning to our discussion of Figure 9.2, the inclusion of the
middle sheet ensures that the behaviour surface is smooth and
continuous; however, it should be noted that the control point is
never represented on the middle sheet. Whenever the control point
crosses the fold, either at A or at B, the point jumps between the top
and bottom sheets catastrophically, since the middle sheet is inac-
cessible. The fold curve, marking the boundary between sheets,
represents the inflection points. This sort of behaviour is characteris-
tic of physical systems in which friction is important, as it allows
them to assume a state of minimum energy. In the case of Zeeman's
catastrophe machine, the energy is the potential energy stored in the
rubber bands, so the disc will rotate until the tension of the rubber
bands is minimized. Unless energy is added to the system, the
machine will remain in a position of stable equilibrium. The process
that keeps it there is called the dynamic.
CATASTROPHE THEORY 243
u
lv \:
v
~
X
V
J I c) I d I \ l \. I
V W ~1um~
e f \J.
Figure 9.3 Energy states across the bifurcation set for Zeeman's catastrophe
machine (u < 0 is fixed, values of v vary: a, b, c, etc.).
The behaviour associated with the cusp farthest away from the
disc can be explained in a similar fashion, except that the energy is
inverted, i.e. there are two points with maximum energy and only
one where there is a minimum. As before, the dynamic compels the
machine to remain in the single stable position of minimum energy.
Also, on the behaviour surface the positions of maxima and minima
are reversed and, as the maxima are unstable, the behaviour point
can only lie on the middle sheet. This phenomena is known as a
dual-cusp catastrophe.
Features of this system, which are common to all systems that
follow the cusp model, are:
ressure
are a of
catastrophic
evolution
Figure 9.4 (a) The cusp catastrophe model applied to fault dynamics (after Henley
1976). (b) The cusp catastrophe model applied to evolution. The model relates
variability of the organism to environmental pressure.
fossilization collection
living organism fossil interpretation
i i
noise noise
mathematical model
log 2 2N = n bits
n
H= -K L P; log2P;
i=l
H=O
H=0.8
R=l-RE
or, as a percentage,
R% = (1 - RE) X 100
Then
In other words, the original His equal to the weighted sum of the Hs
for the successive choices (Shannon 1948, p. 393, Guiasu 1977,
pp. 57ff.).
Sr = [S- D- N] + R + Sc (1)
Output
Sr + Dd at te
Figure 9.5 Summary of the fossil communication system (after Tasch 1965).
H = I; = - I: P; In P;
coeffi ctent o f e11mmat10n
= [1 + (r.;=,,p;log 2 p;)J x 100
1. 585
The proposal was based on the idea that the generation of kerogen
started with an hypothetical organic compound which is a mixture
of equal proportions of the three elements, and which would have a
coefficient of elimination of 0 per cent. This hypothetical compound
would then undergo evolution due to geological processes which
could eliminate oxygen and hydrogen, leaving graphite, i.e. 100 per
cent carbon, with a coefficient of elimination of 100 per cent. In real
life most organic compounds which result from biogenic reactions
have values for the coefficient in the range 10-28 per cent, whereas
products resulting from the action of geological processes gave
considerably higher values. Indeed, it was suggested that kerogens
resulting from diagenesis had values in the range 28-48 per cent,
those resulting from catagenesis in the range 48-64 per cent and
metagenesis> 64 per cent.
where H<x> = the information content of the original data, H<Y> = the
information content of the regional trend component and H<=> = the
information content of the local trend component.
The calculation of the information content of the three compo-
nents is relatively simple. The range of the data values is subdivided
to give a number of suitable class intervals (the same for each
component), the width of the class interval in most applications
being the contour interval at which the final maps will be displayed.
The number of points which fall into each class interval is counted,
and the probability of points falling into each class interval is
obtained. Then the information content of the three sets can be
calculated:
From the resulting Hs we can calculate the entropy ratio for the data
sets
dy = nx"-1
dx
It follows from this that the derivative of a variable with respect to itself is
unity, i.e. if y = x, then
dy =I
dx
dy = (3 X 4)_x(l-l) + (3 X J)_x(l-1)
<ix-
= 12x2 + 3
y= ex
Write as
Then
=ex
256 APPENDIX 1
(b) If c is any numerical constant, its derivative is zero; i.e. the rate of
change of a constant is zero.
General form y = u + c, where u = f(x).
Example: y = 5x 2 + 6
dy =lOx
dx
(c) If cis any numerical constant and the equation is of the form y = cu,
where u = f(x), then
d(cy) _ dy
---c-
dx dx
Example: y = 4(5x' + 6)
dy = 4(l0x)
dx
d(uv) dv du
--=u-=v-
dx dx dx
= 5x4 + 12x2 - 2x
INTRODUCTION TO DIFFERENTIATION 257
(e) Quotient rule: y = u/v, where u and v = f(x)
du dv
v--u-
d(ujv) _ dx dx
~- y2
Example: y = (x 2 + 4)/(x 3 - 1)
dy _ (x 3 _ 1 )d(x~;4)_(x2 + 4)d(x::-l)
dx- (x3 - 1)2
-X'- 12x2 - 2x
.x6- 2x 3 +I
(f) Function of a function: y = f(v), e.g. y = v/(1 - v2) and v = f(x), e.g.
v = 1 - x 2 Then
dy _ dy dv
dx- dvdx
Example: y = v3 + 2, v = 2x
dy = 3v2 dv = 2
dv dx
Now,
dy _ dy dv
dx- dv dx
= 3v2 x 2
But v= 2x, so
dy = 3(2x)2 x 2 = 24x 2
dx
dx dy I
dy = I/ dx or dyjdx
258 APPENDIX 1
(h) The derivative of a variable function with a constant exponent:
y = (u)", where u = f(x)
dy = n(u)"- 'd(u)
dx dx
d(u) _ d(2x) =2
dx- dx
dy _ 3(2r) x 2 = 24x2
dx-
A list of some of the more commonly used standard forms is given in Table
A 1.1. Many others can be found in mathematics textbooks dealing with
calculus, to which the reader is advised to refer for a more comprehensive
list.
Function Derivative
x" nx"-'
a' a' Ina
lnx 1/x
,jx tx-1;2
e-' e-'
sinx cosx
cosx - sinx
tanx (1 +tan' x) or
sec' x
The following simple examples show how the standard forms are used in
conjunction with the rules given in the previous section.
INTRODUCTION TO DIFFERENTIATION 259
(a) Differentiate y =(cos x sin x). Using the product rule
Using the standard forms for the derivatives of sin x and cos x, we get
(b) Differentiate y =tan x 3 Using the rule for a variable function with a
constant exponent and one of the standard forms for the derivative of tan x,
we get
In y = In x + 3 In( I + x 2)
dy _I+ 6x
dx- x
(I+ x 2)
Either form of the equation can be used for differentiation; however, the
explicit form would present more difficulties than the implicit. For the
solution of the implicit equation we use the function of a function rule: i.e.
find dyjdx when 3x2 + y = 12:
260 APPENDIX 1
d(3x 2) d(Y) _ d(l2)
--+-----
dx dx dx
6x+ 2ydy = 0
dx
dy _- 6x _- 3x
dx-2_Y--y-
Thus, as a general rule it is better to use the implicit form of the equation
and differentiate using the function of a function rule.
Appendix 2 An introduction to the
methods of integration
(a) IKydx=Kiydx
For example, we are asked to integrate 4x3 Thus
4x3+ 1
I 4x3 dx = 4Ix3 dx = - - + C = x' + C
3+ 1
5_x615
J1.)xdx= Ix 115 dx=-6-+ C
1 + 5x
J--dx = J(l!x + 5)dx =In x + 5x + C
X
Another example:
I 2xdx_ 3 = (l/2)ln(2x- 3) + C
4x' 6x 3
=---+4x+ C
4 3
= x' - 2x 3 + 4x + C
262 APPENDIX 2
(e) Juvdx=ufvdx- Jz(dujdx)dx, where z= Jvdx. Thus, the integral
sin X COS X is
Derivative Integral
X"+ I
X"
n+ I+ C
e' e'+C
eh (1/k)e'-' + c
a' ~+C
Ina
1/x lnx+ C
sinx -cosx+C
COSX sinx+ C
tanx ln(secx) + C
sinkC - (l(k)cos kx + C
coskC (1/k)sin kx + C
I+ tan'x tanx+ C
sin-'x+ C
'-"CI - x')
I
I+ x 2 tan-'x+C
A2.3 Substitution
Then
dz
dx =2x
= t + Cz3!2
x =sin B
Then
dx
-=cosO
dB
= HO + cos 2B)dB
By rule (b)
By rule (d)
y = HB dB + Hcos 28 dB
y = !B + *sin 28 + C
264 APPENDIX 2
Since
Then
The programs in this appendix have been developed to enable some of the
basic mathematical techniques to be explored further. It is suggested that if
a program is to be used, then the data from the appropriate example in the
text should be used as test data, to ensure that the program is working
correctly. All of the programs are specifically written for the BBC Micro
(Model B), and have been extensively tested. Students wishing to use the
programs on other systems should check some of the functions used. For
example, VDU statements are specific to the version of BASIC in which the
programs were written. As far as possible the programs have been kept
straightforward, and hopefully, with the REM statements, should be
relatively easy to follow. Also, prompts are given by the program for any
input required. They are presented here in the same order as they appear in
the text.
Program 2.1 Program to draw a graph of x against y, using a BBC Micro and an
Epson printer. The axes are scaled automatically to fit the printer paper size
(22 x 28 em). Not to be used as VDU output in its present form.
Program 2.2 Program to solve for the roots of a quadratic equation. Requires
coefficients of xs and the constant, as input.
Program 4.1 Program to solve problems of maxima and minima for a third-degree
polynomial. Input values required are the coefficients of the equation and the
constant.
Program 5.1 Program to find the area under a curve between given limits, using
Simpson's Rule. The function to be evaluated is entered in the form of a BASIC
statement.
Program 6.2 Program to find the eigenvalues and eigenvectors of a real symmetrical
matrix up to size 10 x 10. Based on the Jacobi method.
Program 6.3 Program to raise a matrix to any given power. Handles matrices up to
25 x 25 in its present form.
COMPUTER PROGRAMS 279
10 REM PROGRAM TO MULTIPLY A MATRIX BY ITS TRANSPOSE
20 REM THIS PROGRAM CALLED MULTR
30 REM ALSO CALCULATES THE DISPERSION OR CORRELATION MATRH'ES
35 REM MULTIPLE DATA SETS PROCESSED
40 DIM X(50, 10),XMT(10, 10),XMEAN(10),XDEV(10)
50 REM SET PARAMETERS FOR PRINTING OUT RESULTS
60 @\=&0102020A
70 REM INPUT MATRIX SIZE, TITLE AND DATA
80 INPUT"TYPE NUMBER OF ROWS= NUMBER OF VARIABLES"' "VALUE GREATER
THAN 50 TERMINATES EXECUTION",N'.
90 IF N\>50 GOTO 610
100 INPUT"TYPE NUMBER OF COLUMNS NUMBER OF SAMPLES",C\
110 INPUT"TYPE TITLE"T$
120 PRINT"INPUT MATRIX ROW WISE"
130 NS=C\
140 FOR I\=1 TO N\
150 FOR J\=1 TO C\
160 INPUT"ELEMENT VALUE",X(I\,J\)
170 NEXT J\
180 NEXT I\
190 REM CHOOSE TYPE OF CALCULATION
200 REM 1 = USE RAW DATA ONLY
210 REM 2 = CALC. EITHER DISPERSION OR CORRELATION MATRICES
220 INPUT"DO YOU WISH TO USE RAW DATA ONLY?"'"IF YES TYPE 1, IF NO
TYPE 2",M
230 IF M=1 GOTO 270
240 PROCTRSF
250 GOTO 360
260 REM SWITCH ON PRINTER AND PRINT OUT INPUT DATA
270 VDU2
280 PRINT' '"TITLE:",T$
290 PRINT' '"INPUT MATRIX"' '
300 FOR I\=1 TO N\
310 PRINT'
320 FOR J\=1 TO C\
330 PRINTX(I\,J\);
340 NEXT J\
350 NEXT I\
360 REM MULTIPLY MATRIX BY ITS TRANSPOSE
370 FOR I\=1 TO N\
380 FOR J\=1 TO N\
390 XMT(I\,J\)=0
400 NEXT J\
410 NEXT I \
420 FOR I\=1 TO N\
430 FOR J\=1 TO N\
440 FOR K\=1 TO C\
450 XMT(I\,J\)=XMT(I\,J\)
+X(I\,K\)*X(J\,K\)
460 NEXT K\
470 NEXT J'.
480 NEXT I\
490 PRINT' '"RESULTANT MATRIX"' '
500 FOR I\=1 TO N\
510 PRINT'
520 FOR J\=1 TO N\
530 PRINT XMT(I\,J\);
540 NEXT J\
550 NEXT I\
560 PRINT'' "This data set finnished"
570 REM SWITCH OFF PRINTER AND CLEAR SCREEN
580 VDU3:CLS
590 PRINT"RETURNING TO BEGINING OF PROGRAM"'"FOLLOW INSTRUCTIONS TO
TERMINATE THE EXECUTION"
600 GOTO 80
610 END
620 REM
630 REM
640 REM
650 DEF PROCTRSF
660 REM PROCEDURE TO NORMALIZE RAW DATA MATRIX USING MEAN OR MEAN
AND STANDARD DEVIATION
670 REM SWITCH ON PRINTER AND PRINT OUT ORIGINAL DATA
280 APPENDIX 3
680 VDU2
690 PRINT' '"TITLE,",T$
700 PRINT' '"INPUT MATRIX"''
710 FOR I\=1 TO N\
720 PRINT'
730 FOR J\=1 TO C\
740 PRINTX(I\,J\);
750 NEXT J\,NEXT I\
760 VDU3
770 REM CALC MEAN FOR EACH VARIABLE
780 FOR I\=1 TO N\
790 XSUM~O.O
800 FOR J\~1 TO C\
810 XSUM~XSUM+X(I\,J\)
820 NEXT J\
830 XMEAN(I\)~XSUM/NS
840 NEXT I\
850 REM SUBTRACT MEAN FROM EACH DATA VALUE
860 REM DESTROYS ORIGINAL DATA MATRIX
870 FOR I\~1 TO N\
880 FOR J\~1 TO C\
890 X(I\,J\)~X(I\,J\)-XMEAN(I\)
900 NEXT J\:NEXT I\
910 REM CLEAR SCREEN AND SWITCH OFF PRINTER
920 CLS 'VDU3
925 PRINT"PROGRAM WILL NOW CALCULATE EITHER"'"THE DISPERSION MATRIX
OR"'"THE CORRELATION MATRIX, YOUR CHOICE"
930 INPUT"DO YOU WISH TO CALCULATE THE DISPERSION MATRIX?"' "IF YES
TYPE 2, IF NO {IMPLIES CORR. MATRIX), TYPE 1",R
940 IF R=1 GOTO 1050
950 REM SWITCH ON PRINTER
960 VDU2
970 PRINT' '"DISPERSION MATRIX HAS BEEN CHOSEN"'" MATRIX WHICH
FOLLOWS HAS BEEN NORMALIZED, WITH A MEAN OF ZERO FOR EACH VARIABLE"
980 FOR I\~1 TO N\
990 PRINT '
1000 FOR J%~1 TO C\
1010 PRINT X(I\,J\);
1020 NEXT J\
1030 NEXT I\
1040 GOTO 1260
1050 REM CALCULATE STANDARD DEVIATION
1060 FOR I\~1 TO N\
1070 XSUM~O.O
1080 FOR J\=1 TO C\
1090 XSUM=XSUM+X(I\,J\)*X(I\,J\)
1100 NEXT J\
1110 XDEV(I\)~SQR(XSUM/NS)
1120 NEXT I\
1130 REM DIVIDE EACH VALUE BY THE CORRESPONDING STD. DEV.
1140 FOR I\~1 TON\
1150 FOR J\~1 TO C\
1160 X(I\,J\)~X(I\,J\)/XDEV(I\)
1170 NEXT J\, NEXTI\
1180 REM SWITCH ON PRINTER
1190 VDU2
1200 PRINT' '"CORRELATION MATRIX HAS BEEN CHOSEN"'" MATRIX WHICH
FOLLOWS HAS BEEN NORMALIZED, WITH A MEAN OF ZERO"'"AND STANDARD
DEVIATION OF UNITY FOR EACH VARIABLE"''
1210 FOR I\=1 TO N\
1220 PRINT'
1230 FOR J\=1 TO C\
1240 PRINTX{I\,J\);
1250 NEXT J\, NEXT I\
1260 ENDPROC
Program 8.1 Program to solve the second-order differential equation d'y/dx' = 5x.
See text for details.
References
Barnes, J. W. 1981. Basic geological mapping. Milton Keynes: The Open University
Press.
Beerbower, J. R. & D. Jordan 1960. Application of information theory to palaeonto-
logical problems: taxonomic diversity. Journal of Palaeontology 43, 1184-98.
Bischoff, J. L. 1968. Kinetics of calcite nucleation: magnesium ion inhibition and
ionic strength catalysis. Journal of Geophysical Research 73, 3315-22.
Bischoff, J. L. & W. S. Fyfe 1968. Catalysis, inhibition and the calcite-aragonite
problem. (I) The aragonite-<:alcite transformation. American Journal of Science 266,
65-79.
Blackman, R. B. & J. W. Tukey 1959. The measurement of power spectra. New York:
Dover.
Buntebarth, G. 1984. Geothermics- an introduction. Berlin: Springer-Verlag.
Camina, A. R. & G. J. Janacek 1984. Mathematics for seismic data processing and
interpretation. London: Graham and Trotman.
Cheeney, R. F. 1983. Statistical methods in geology. London: Allen and Unwin.
Cobbold, P. R., J. W. Cosgrove & J. M. Summers 1971. Development of internal
structures in deformed anisotropic rocks. Tectonophysics 12, 23-53.
Coli, J. 1982. BBC microcomputer user guide. London: British Broadcasting Corpora-
tion.
Cosgrove, J. W. 1976. The formation of crenulation cleavage. Quarterly Journal of the
Geological Society of London 132, 155-78.
Cubitt, J. M. & B. Shaw 1976. The geological implications of steady-state mechan-
isms in catastrophe theory. Mathematical Geology 8, 657-{)2.
Davis, J. C. 1973. Statistics and data analysis in geology. New York: Wiley.
Doornkamp, J. C. & C. A.M. King 1971. Numerical analysis in geomorphology- an
introduction. London: Edward Arnold.
Falini, F. 1965. On the formation of coal deposits of lacustrine origin. Bulletin of the
Geological Society of America 76, 1317-46.
Ferguson, J. 1978. Some aspects of the ecology and growth of the Carboniferous
gigantoprodutids. Proceedings of the Yorkshire Geological Society 42,41-54.
Ferguson, J. 1980. Application of information theory in geological data processing.
Quarterly Journal of the Geological Society of London 137, 107-8.
Ferguson, J. 1982. The application of information theory to trend surface analysis.
Computer applications in geology iii. Geological Society of London Miscellaneous
Paper No. 15, 3--44.
286 REFERENCES
Ferguson, J. 1983. The analysis ofC-H-0 data in kerogen studies using a coefficient
of elimination. Journal of Petroleum Geology 5, 400--8.
Ferguson, J., P. R. Bush & B. A. Clarke 1984. The role of organic matter in the early
diagenesis of carbonate ooids- an experimental study. Journal of Petroleum Geology
7, 245-66.
Fox, W. T. & R. A. Davis, Jr 1971. Computer simulation model of coastal processes in
eastern Lake Michigan. Technical Report No. 5, O.N.R. Task No. 388-092/10-18-
68(414), Contract NOO 14-69--c-0151. Williamstown, Mass.: Williams College.
Friedman, G. M. & J. E. Sanders 1978. Principles ofsedimentology. New York: Wiley.
Garrels, R.N. & F. T. Mackenzie, Jr 1971. Evolution ofsedimentary rocks. New York:
Norton.
Glass, J. C. 1980. An introduction to mathematical methods in economics. New York:
McGraw-Hill.
Gould, P. 1967. On the geographic interpretation of eigenvalues: an initial explora-
tion. Transactions of the Institute of British Geographers 42, 53-86.
Gow, M. M. 1960. A course in pure mathematics. London: Unibooks, English
University Press.
Graham, D., C. Graham & A. Whitcombe 1984. A-level mathematics course com-
panion. London: Charles Letts.
Gray, J. R. 1967. Probability. Edinburgh: Oliver and Boyd.
Guiasu, S. 1977. Information theory with applications. New York: McGraw-Hill.
Nettleton, L. L. 1976. Gravity and magnetics in oil prospecting. New York: McGraw-
Hill.
Noble, B. 1964. Numerical methods: I. Iteration programming. Edinburgh: Oliver and
Boyd.