Surface Wave Methods: Acquisition, Processing and Inversion
Surface Wave Methods: Acquisition, Processing and Inversion
POLITECNICO DI TORINO
Acknowledgements
I would like to thank all the people that supported, inspired, corrected this work,
the people with whom I have worked,
and the people with whom, without working, I have spent my time and shared my space.
Prof. Luigi Sambuelli, my supervisor, continuously guided and supported my work, mitigating and
tolerating my anarchist way of working. Gigi stimulated this work with his always high, unifying
and open point of view, with his extraordinary ability in doubting and searching.
Dr. Valentina Socco suggested the subject of this dissertation, and introduced me to geophysics, in
front of a sliding door. Then Valentina has been a necessary source of corrections, suggestions,
ideas, and she reviewed all this work with attention, care and patience, putting order in my chaos.
Dr. Sebastiano Foti has been an indispensable guide, with his rigorous approach to the problems,
his suggestions and ideas. Moreover I have to thank Sebastiano for all what he gave me to read, and
for his enthusiasm.
Dr. Alberto Godio gave me the chance to work with him and shared his experience in different
fields. Alberto’s irony reached his maximum in our discussion about inverse problems.
Dr. Isabel Lopes read this work when it was a draft, and her suggestions and questions strongly
influenced this final text.
The other colleagues with whom I have worked, Gianluca, Andrea, Stefano, Cesare,
and the all the students that helped in the acquisitions.
My family, all my friends.
1. INTRODUCTION
2.1 INTRODUCTION 16
2.1.1 Body waves and surface waves 16
2.1.2 Body waves in seismic exploration 18
2.1.3 The use of surface wave for geotechnical characterization 20
i
Table of Contents
3. ACQUISITION
3.1 INTRODUCTION 82
ii
Table of Contents
4. PROCESSING
iii
Table of Contents
5. INVERSION
iv
Table of Contents
v
CHAPTER 1
Introduction
This chapter is an introduction to the dissertation and to the surface wave methods, it is a
schematic description of all the main theory and operating steps involved, and is completely
qualitative. The following chapters(2-6) will deal in detail with the theory of the propagation, the
acquisition, the processing, the inversion, and will finally present a selection of experimental
1
results. The main aim of the chapter 1 is hence to explain how the subjects that are developed later
are related each other, and why they are necessary.
In this chapter
The qualitative theory of the surface wave propagation: the geometric dispersion, the modal
superposition, the apparent dispersion curve. The possibility of simulating the surface wave
propagation, and obtaining a synthetic dispersion curve for a given stratigraphy.
The acquisition of vibrations and the processing of these data to get an experimental dispersion
curve at a site.
The inversion of the experimental dispersion curve to find the layered earth model.
The three steps of the method: acquisition, processing and inversion.
How each step can increase the accuracy and reliability of the result: information, variance,
parameterisation.
The propagation of seismic body waves has been widely used for the characterization of the subsoil
for different purposes, with a great number of different techniques. Compressional and shear waves
are employed in seismic reflection and refraction, in vertical seismic profiling, in up-hole, down-
hole, cross-hole tests, in seismic tomography, sonic log and so on.
Surface waves, often called the ground roll, have usually been considered as a coherent noise
masking useful signals; something to be filtered out.
The use of surface waves for the characterization of the shallow subsurface has recently, in the last
decades, become of growing interest among geotechnical engineers and geophysicist. Nevertheless,
the test are often performed adapting equipment, acquisition techniques, modelling and inversion
algorithms conceived for other applications. The different approaches that have been developed will
be described in the following chapters.
This dissertation has a geophysical point of view, and focuses on the test itself, with the aim of
studying the best procedure to get a more accurate and reliable estimate of the soil properties with a
reasonable effort. The final aim is hence the optimisation of the test.
In the different steps of the surface wave test the specific requirements and limitations are
emphasized, different approaches are presented and compared, in order to select the most suitable.
A series of algorithms have been implemented, for the data processing, for the simulation of the test
and for the inversion. They are included in a software package, PoliSurf, result of the work of the
surface wave group at Politecnico di Torino, which allow an interactive automated interpretation of
surface wave data.
Many field tests have been performed at different sites: the effectiveness of both the method and
the procedures have been checked, the limits have been confirmed. The comparison with other
geophysical seismic techniques is also supported by a wide set of experimental data.
An introductory overview of the surface wave method is necessary to understand the motivations of
the different chapters, each one dealing with a specific step of the test (acquisition, processing,
modelling, inversion). The final aim of the study is of course the most accurate and reliable estimate
of the subsoil properties, and this introduction would explain how the different step of the test affect
the result, and how they influence each other.
The general description provided in this first chapter is on purpose schematic and qualitative,
without the mathematics that will be necessary later to explain and proof the details of the aspects
that are here just outlined: also the references will be given in the following chapters.
Figure 1.1. The result of a SWM test is the vertical profile of the shear wave velocity (or of the stiffness G0).
The use of Rayleigh waves allow a simpler acquisition and is widely more diffused than that of
Love Waves.
Figure 1.2. Energy propagating in the horizontal direction, as a function of the depth normalized to the wavelength
(homogeneous half space). The energy does not propagate in the vertical direction.
Figure 1.3. In a homogeneous medium (left) all the wavelengths sample the same material, and the phase velocity is
constant. When the properties changes with the depth (right) the phase velocity depends on the wavelength.
So the surface wave does not have a single velocity, but a phase velocity that is function of the
frequency: the relation between frequency and phase velocity is called dispersion curve (Figure1.4),
and depends on the stratigraphy. At high frequency the phase velocity is the Rayleigh velocity of
the uppermost layer: at low frequency the effect of deeper layers become important, and the phase
velocity tends asymptotically to the Rayleigh velocity of the deepest material, as if it extends
infinitely in depth (the half space).
Figure 1.4. The dispersion curve describes the dispersion of surface waves:
usually is depicted as phase velocity versus frequency.
The dispersion curve can be experimentally measured and hence it is possible to interpret this curve
in order to estimate the subsoil properties. It is a worthwhile effort to understand first which
parameters actually influence the dispersion curve that can be measured.
The Rayleigh wave propagation in vertically heterogeneous media, is actually a multi-modal
phenomenon: for a given stratigraphy, at each frequency different wavelength can exist (Figure
1.5).
Figure 1.5. Different modes of vibration means different wavelength, and so different velocities, for the same
frequency.
Hence different phase velocities are possible at each frequency, each of them corresponding to a
mode of propagation, and the different modes can exhibit simultaneously (Figure 1.6).
4th mode
3rd mode
phase velocity
2nd mode
1st mode
Figure 1.6. In a usual frequency range for engineering testing, many modes can exist.
Higher modes exist only above their cut-off frequencies.
The different modes, except the first one, exist only above their cut-off frequency, which is for each
mode the lower limit frequency at which the mode can exist: with a finite number of layers, in a
finite frequency range, the number of modes is limited. At very low frequency, below the cut-off
frequency of the second mode, only the first mode, known as the fundamental Rayleigh mode,
exists.
Modes are not just theory or just mathematical possible solutions; they are often observed in
experimental data, also in the frequency ranges of interest for a test for engineering purposes.
The energy associated to the different modes depends on many factors, the stratigraphy at first, but
also the depth and the kind of source.
The first mode is sometimes dominant in wide frequency ranges, but in many common situations
higher modes play important roles and are actually dominant, and so they cannot be neglected.
The different modes have different phase velocity, and so they separate at great distance from the
source: otherwise they superimpose on to one another, and the mode identification can be
impossible.
At the engineering scale the modal superposition is critical: the effective Rayleigh phase velocity
deriving from the modal superposition is only an apparent velocity, also depending on the
observation layout, on the source orientation and position (Figure 1.7).
Figure 1.7. The apparent dispersion curve due to modal superposition can be influenced by different modes: it also
depends on the observation layout. They grey lines are modes, the black line is the apparent velocity.
Figure 1.8-a. The parameterisation of the subsoil. For each layer the thickness H, the shear
velocity Vs, the Poisson Ratio, and the mass density
Figure 1.8-b. The modelling allows the prediction of the surface wave
propagation properties for a given layered model.
Figure 1.9. The processing of acquired seismic data can estimate the experimental dispersion curve at a site.
statement of an associated inverse problem: the unknown model parameters are determined from
measured quantities, assuming a model that relates the two.
The inversion consist of estimating the parameters of a postulated model of the subsoil from a set of
observation, trying to fit a schematic response to a finite set of actual observations. The quantitative
description of complex systems needs a scheme or model that simplifies the reality, and allows a
quantitative finite description: the approach cannot be complete, the geometry and the physics have
to be simplified and often coupled physical aspects have to be uncoupled.
a)
b)
Figure 1.10. The forward problem (a) and the inverse problem(b) for the surface waves method.
Figure 1.11. Scheme of the three steps of the method, with the corresponding result.
In order to infer the value of the phase velocity at different frequencies, different approaches can be
followed: either monochromatic signals are produced and recorded, or transient signals are recorded
and then decomposed, in the processing step, into their monochromatic components.
Figure 1.12. Forward and inverse problems. (a) The system is firstly parameterised. The forward problem (b) consist of
predicting data using a model when model parameters are known. The inverse problem (c) consist in identifying model
parameters, when data are known.
This chapter illustrates the properties of the surface wave propagation and the algorithms that
allow the simulation of the test and that will be used to solve the forward problem in the
2
inversion. The dispersion of surface waves in vertical heterogeneous media is the physical principle
on which the Surface Waves Methods are based: in this chapter the characteristics of the
propagation of such waves in layered media are shown. The first aim is discussing the details of
such propagation phenomenon, with a particular attention to the modal superposition at the small
scale of engineering problems. The final objective and result of the chapter is a series of algorithms
that allow the simulation of the test in a layered medium.
In this chapter
Some notes on the use of seismic waves for the characterisation.
The theory of surface wave propagation in homogeneous half space: this introduces the notation,
the approach with potentials, and allow the description of the properties of surface waves in a very
simple case.
The theory of Rayleigh wave propagation in layered media: the model of layered soil is described,
the problem is stated, and the solution in matrix form is presented. The dissipative phenomena are
considered with a linear visco-elastic model.
The solution of the Rayleigh problem is developed with the aim of simulating the test: the
computational problem of the matrix formulation are presented, then the computation of the modal
curves is obtained, the properties of the modal curves are described, then the stress and
displacements are computed, and the displacements at the free surface. The effects of the modal
superposition are presented, and the simulation of a real test can be performed.
2.1 INTRODUCTION
This short introduction contains some general ideas about seismic techniques based on body waves and surface
waves. The differences between the two phenomena are just outlined, and the principles of body wave
exploration are briefly mentioned. A general description of the surface wave methods closes this section,
explaining why there is the need of a reliable and efficient modelling algorithm that simulate the Rayleigh wave
propagation in layered media.
α = VP =
( K + 4 / 3µ ) =
(λ + 2µ ) (2.1)
ρ ρ
where ρ is the mass density, K is the bulk modulus, and µ the shear modulus, λ is the Lame’s
parameter. The relations between the elastic parameter and the corresponding expression for the
seismic velocities are given in appendix A.
S-waves are shear waves (also called transverse waves, or distorsional waves, or equivoluminal
waves) inducing a particle motion perpendicular to the direction of propagation; in an elastic
medium the propagation velocity of S-waves is
µ
β = VS = (2.2)
ρ
S-waves can be polarized, in such a way that the particle motion is along a definite direction
perpendicular to the direction of propagation: SV-waves are polarized in the vertical direction, SH-
waves are polarized in the horizontal direction. It is obvious from (2.1 and (2.2 that the shear wave
velocity is lower than the compressional wave velocity. As it can be seen from appendix A, the two
body wave velocities are a pair of parameters that completely describes the behaviour of linear
elastic medium.
When the medium is bounded by a free surface, also another kind of phenomena occurs: surface
waves are generated and propagate near the surface (Figure 2.2). They induce a different particle
motion, and their velocity is a more complicated function of the moduli, as it will be shown below.
With a point source they have cylindrical wavefronts, and they do not irradiate energy toward the
earth’s interior,
Rayleigh waves induce a particle motion in the vertical plane that contains the direction of
propagation, and in a homogeneous body is, at the free surface, a retrograde elliptic orbit (Figure
2.2, left). R-wave velocity in a homogeneous medium is less than the shear wave velocity. If there
are variations of the elastic properties with depth, R-waves become dispersive: the different
wavelength propagate with different velocities.
Love waves are surface waves existing only in special stratigraphic conditions, when the shear
velocity in the upper layer is lower than that in the substratum; the particle motion is transverse to
the direction of propagation and parallel to the free surface (Figure 2.2, right). They cannot exist in
homogeneous media, and in layered media Love waves are dispersive: the velocity tends to the
shear velocity of the upper layer at high frequency, and tends to the shear velocity of the substratum
at low frequency.
The Stoneley waves are interfacial waves existing under special conditions at the interface between
two media.
Figure 2.2. Surface waves, Rayleigh waves and Love waves, propagate parallel to the body surface
without spreading energy towards the interior.
A compressional source at the free surface imparts most of the energy in surface waves and
moreover the geometric spreading of surface waves is lower than that of body waves. Therefore in
active seismic techniques, Rayleigh waves actually dominate at great distance. Considering only the
geometric attenuation, the energy is distributed on increasing surfaces, and so it decreases following
the shape of the wavefront: with spherical wavefronts the energy decrease with the square of the
distance, with cylindrical wavefront it decreases linearly with the distance. The energy is
proportional to the square of the displacements, thus body waves have a decrease of amplitudes
proportional to the distance from the source and Rayleigh waves proportional to the square root of
the distance. Nevertheless, as shown by Richart et al. (1970), at the free surface, because of leaking
of energy into the free space for body waves the displacement attenuation goes with the square of
distance.
Figure 2.3. The refraction and reflection of body waves are the physical principle used for seismic prospecting of the
subsoil with measures taken at the surface.
P waves are widely used for the seismic prospecting: the use of S waves, in particular SH waves,
for the geotechnical characterization is gaining interest because of a series of advantages. A greater
resolution can be achieved due to the shorter wavelengths, and also a higher reflectivity and hence
information especially in soft soil, and in particular in saturated media. Moreover the use of SH
waves allows the same processing techniques and theoretical background with a different
acquisition. The following figure (2.4) shows a seismogram of SH waves acquired with a single
shot of a polarized source, with sensors Swyphones (Sambuelli et al., 2001) : a refraction is clearly
visible from the 11th trace
Figure 2.4. A raw seismogram for SH waves shows a quite evident refraction.
In seismic reflection surveys, one of the main source of noise is the ground roll, a low velocity
event that can mask late reflections. Ground roll is mainly composed by surface waves, and depends
on the shallow structure at sites: it is a low frequency and high energy event created by the
energization, and cannot be reduced by a stacking procedure being a coherent noise. To avoid its
superposition to signals adequate acquisition geometry are used, and to enhance reflections many
filtering techniques have been developed.
On the other hand surface waves can be successfully used to infer the properties of the first tens of
meters: in seismic reflection processing this can be useful to compute the static corrections (Mari,
1984). Moreover, for the geotechnical characterisation, the properties of this seismic event are very
interesting and surface waves can be considered an useful signal, and no more a noise: the same
Extract from PhDThesis - Claudio Strobbia. 19
Chapter 2.1 INTRODUCTION
processing techniques that have been developed to filter out the ground roll can be used to extract,
enhance and analyse it.
The main aim of this chapter is to obtain a forward modelling algorithm and finally a computer
program able to simulate the Rayleigh wave propagation in layered media. A preliminary step is the
description of the properties of Rayleigh waves in a homogeneous medium: propagation velocity,
particle motion, energy, attenuation. Then the theory and the equations for a layered medium are
presented. Finally the solution in layered media is discussed, the properties of Rayleigh waves in
layered media are illustrated, and in the end the simulation of the real test is obtained.
2.2.1 Introduction
Surface waves are waves propagating in directions parallel to earth's surface: their amplitude
decreases exponentially with depth and the most of the energy is confined near the earth's surface,
so the propagation is influenced mostly by shallow materials.
The amplitude distribution is stationary with horizontal coordinates, apart from a geometrical
spreading factor accounting for the increase of the surface of the wavefront. The energy in this kind
of propagation spreads only in the horizontal direction, so that the geometrical spreading factor is
smaller than for body waves, which spread energy both horizontally and vertically down into the
earth interior.
Here we will describe the basic properties of surface waves in a homogeneous medium bounded by
a free surface: this is useful to introduce the more interesting case of layered media.
The analytical approach followed here leads to discuss the properties of pure surface Rayleigh
waves: obviously in a real propagation phenomenon they are superposed to body waves with direct,
refracted and reflected paths.
∂u x ∂u y ∂u z
ε xx = ε yy = ε zz = (2.3)
∂x ∂y ∂z
∂u x ∂u y ∂u z ∂u x ∂u y ∂u z
ε xy = + ε zx = + ε yz = +
∂y ∂x ∂x ∂z ∂z ∂y
Forces distributed through the body and varying in time and space produce strains: in a linear
elastic material we can express stresses acting on an elementary volume as a linear combination of
strains (Hooke's law).
In general we can state σ ij = a ijkl ε kl where a is a 4th order tensor:
considering the symmetry of stress and strain tensor, and with the hypothesis of isotropy we need
only two elastic moduli as constants of proportionality. Different couples of parameters can be
chosen (appendix A). Young's modulus and Poisson's ratio are often used, here we will use Lamè’s
coefficients λ and µ to write the constitutive laws
p xx = (λ + 2 µ )ε xx + λε yy + λε zz (2.4)
p yy = λε xx + (λ + 2 µ )ε yy + λε zz
pzz = λε xx + λε yy + (λ + 2µ )ε zz
p xy = µε xy pxz = µε xz p yz = µε yz
where p are the stress components
The acceleration of the element can be obtained from unbalanced forces, including linearised stress
variations between nearby points: from the equilibrium of an elementary volume, in the X direction
we get
∂p ∂p xy ∂p
p xx + xx ∆x − p xx ∆y∆z + p xy + ∆y − p xy ∆x∆z + p xz + xz ∆z − p xz ∆x∆y
∂x ∂y ∂x
(2.5)
∂p ∂p xy ∂p xz
= xx + + ∆x∆y∆z
∂x ∂y ∂z
Assuming the body force B acting throughout the elementary volume (gravity or other fields) and
introducing acceleration and mass ρ∆x∆y∆z , we get the three components
∂ 2u x ∂ 2u x ∂ 2u x ∂ 2u y ∂ 2u z
= ρ ∂ ux
2
(λ + 2µ ) + µ 2 + + (λ + µ ) +
∂x∂y ∂x∂z
(2.7)
∂x 2 ∂y ∂z 2 ∂t 2
∂ 2u y ∂ 2u y ∂ 2u y ∂ 2u x ∂ 2u z ∂ 2u y
(λ + 2µ ) 2 + µ 2 + 2 + (λ + µ )
+ = ρ
∂y ∂x ∂z ∂x∂y ∂y∂z ∂t 2
2 ∂ 2u ∂ 2u
(λ + 2µ ) ∂ u2z + µ ∂ u2z + ∂ u2z + (λ + µ ) x + y = ρ ∂ u2z
2 2 2
∂z ∂x ∂y ∂x∂z ∂y∂z ∂t
A = ∫ L ( x, x ' , t ) (2.9)
t1
where L is the Lagrangian which, in a conservative system, is equal to the kinetic energy minus the
potential energy
L = T −U (2.10)
The dependence of x on t is not fixed, so x(t) is unknown and the exact path of integration is not
known.
The Hamilton's principle states that the motion equation of a particle is equivalent to the
minimization of the action
This implies the Euler-Lagrange equation
∂L ∂d ∂L
− =0 (2.11)
∂x ∂t ∂x'
For a linear elastic material the potential energy is the elastic strain energy, and so the Lagrangian
density is
L(u&i , ui , j ) =
1 1
ρu&i u&i − σ ij ε ij (2.12)
2 2
where ui i=1,2,3 are the displacements, σ ij and ε ij are the component of stress and strain, ρ is
the mass density assumed constant with time.
Considering the Hooke's law compactly written as
σ ij = λε kkδ ij + 2µε ij (2.13)
where λ and µ are Lame's moduli (in general function of the coordinates) and δ is the Kronecker's
symbol the Lagrangian density is then
L(u&i , ui , j ) =
1 1
ρu&i u&i − λε kk2 + µε ij ε ij (2.14)
2 2
and the Euler - Lagrange equation yield, in vector notation,
∂ 2u
µ∇ u + (λ + µ )grad (divu ) = ρ 2
2
(2.15)
∂t
that is the Navier’s equation of motion for the indefinite medium.
(λ + 2µ ) ∂ u2x ∂ 2u x
2
=ρ (2.16)
∂x ∂t 2
and a simple solution is
u x = f (t − x / α ) + g (t + x / α ) (2.17)
Let’s consider now the second term of 2.22, and the second equation of 2.23.
A solution of equation of motion (2.15) is found also introducing a vector potential Ψ with the
three components Ψx , Ψy and Ψz such that
∂Ψz ∂Ψy
ux = − (2.29)
∂y ∂z
∂Ψx ∂Ψz
uy = −
∂z ∂x
∂Ψy ∂Ψx
uz = −
∂x ∂y
This displacements satisfy the equation of motion if the vector potential from which they are
obtained satisfies the second vector equation 2.23 (remembering the definition of β 2.21)
∂ 2 Ψx ∂ 2 Ψx ∂ 2 Ψx 1 ∂ 2 Ψx
+ + = (2.30)
∂x 2 ∂y 2 ∂z 2 β 2 ∂t 2
∂ 2 Ψy ∂ 2 Ψy ∂ 2 Ψy 1 ∂ Ψy
2
+ + =
∂x 2 ∂y 2 ∂z 2 β 2 ∂t 2
∂ 2 Ψz ∂ 2 Ψz ∂ 2 Ψz 1 ∂ 2 Ψz
+ + = 2
∂x 2 ∂y 2 ∂z 2 β ∂t 2
But this is not sufficient, since the components of the vector Ψ are non independent. To ensure that
they are compatible with the request ∇ ⋅ Ψ = 0 we can derive them from two scalar functions χ
and γ so that
∂γ
Ψx = (2.31)
∂y
∂γ
Ψy = −
∂x
Ψz = χ
The simple shear wave is obtained if Ψx , Ψy are taken equal to zero and Ψz is independent from y
and z: the solution for equation (2.15) is a plane shear wave travelling in the positive x direction and
inducing particle displacement along y
x
Ψz = f t − (2.32)
β
with the scalar functions satisfying
∂ 2γ ∂ 2γ ∂ 2γ 1 ∂ 2γ
+ + = (2.33)
∂x 2 ∂y 2 ∂z 2 β 2 ∂t 2
The result is that the three displacements u x , u y and u z (scalar functions) can be described by a
combination of scalar and vector potentials which need only three scalar potentials Φ , χ and γ
These potentials allows us to specify plane waves, shear or compressional, with any direction and
any waveform: a combination of such solution is still a solution. This will be useful when we shall
have to select a combination of plane waves to satisfy a boundary condition.
Figure 2.5. The geometry of the problem and the coordinate system.
In order to simplify the discussion the solution are considered in the form of plane waves, rather
than considering the waves diverging from a point source: this involves no loss of generality, since
the point source solution may be developed by integration of plane wave solutions. For the case of
Rayleigh waves it can be shown (Aki and Richards, 1980) that the z-dependence of plane wave and
cylindrical waves is the same.
For simplicity let us consider a 2D geometry of a stress free plane XZ normal to the Y axis (Figure
2.5), lying in an elastic medium. Any compressional wave travels in the XZ plane, and the particle
motion coincides with the direction of propagation, , so u y = 0 . Reflections of P waves can generate
shear waves: anyway shear waves will lie in the same plane, and also their associated particle
motion. So we can consider solutions independent on y and for which u y = 0 . This will include
shear waves with motion in the XZ plane (called SV if the plane is bounded by the horizontal free
surface of earth)
The equations written above for the potentials can be combined to give the wave equation to be
satisfied by the potentials and to express stresses that must vanish at the free surface.
We have two wave equations for the two scalar potential,
∂ 2Φ ∂ 2Φ 1 ∂ 2Φ
+ = (2.35)
∂x 2 ∂z 2 α 2 ∂t 2
∂ 2γ ∂ 2γ 1 ∂ 2γ
+ = (2.36)
∂x 2 ∂z 2 β 2 ∂t 2
and the relation between the vector potential and the scalar potential γ ,
∂γ
Ψx = 0 Ψy = − Ψz = 0 (2.37)
∂x
When the potentials have been found, the displacements can be then obtained summing the two
displacements (from Φ and from Ψ ),
∂Φ ∂Ψy
ux = − (2.38)
∂x ∂z
uy = 0
∂Φ ∂Ψy
uz = +
∂z ∂x
and the same holds for stresses:
∂ 2Φ 2 ∂ Φ
2
2 ∂ Ψy
2
p xx = ρ 2 − 2 β − 2β (2.39)
∂t ∂z 2
∂x∂z
pxy = 0
2 ∂ 2Φ 2 ∂ Ψy
2
∂ 2 Ψy
p zx = ρ 2β − 2β +
2
∂x∂z ∂z 2
∂ t
This set of equations can be used to generate solutions imposing the different boundary conditions.
2.2.6 General solution of the wave equation for the half space
The dependence on x, z, t must be exponential for both the scalar potentials. From 2.37 it can be
noticed that the dependence of Ψy has to be an exponential function of x, z, t, so it can be written
Φ = A0e Kx e Mz e Ωt (2.41)
Ψy = B0e Kx e Nz e Ωt (2.42)
The coefficient M, N, K and Ω are in general complex, and from equations 2.35, 2.36 and 2.37
they have to satisfy the two relationships
Ω2
K2 +M2 = (2.43)
α2
Ω2
K2 + N2 = (2.44)
β2
The time is allowed to range between − ∞ and + ∞ , and the same is for the elastic half-space in the
x direction: to obtain finite potentials in the whole domain, Ω and K has to be pure imaginary, and
hence we can write
Ω = iω (2.45)
K = ik (2.46)
with ω and k real quantities ranging over positive and negative values.
With K and Ω pure imaginary M and N has to be either real or pure imaginary since their square
M 2 and N 2 are the sum of real values (2.43 and 2.44), so they are real positive or negative.
M 2 = −(ik ) + (iω ) / α 2 = k 2 − ω 2 / α 2
2 2
(2.47)
N 2 = −(ik ) + (iω ) / β 2 = k 2 − ω 2 / β 2
2 2
(2.48)
Their square roots have to be either real or pure imaginary depending on the sign of the above
differences
If k 2 < ω2 /α 2 < ω2 / β 2 M and N are both imaginary
b = k 2 − ω2 / β 2
The potentials satisfying the wave equations, being also solution of the equation of motion, may be
written as:
(
Φ = A1e Mz + A2e − Mz eikx eiωt ) (2.52)
Ψ = (B e 1
Nz
+ B2e − Nz )e ikx iωt
e (2.53)
∫ [A (k ,ω )e ]
1
Φ ( x, z , t ) = + A2 (k , ω )e − Mz eikx eiωt dkdω
(2π )2 −∫∞
Mz
1 (2.54)
−∞
∞ ∞
Ψ y ( x, z , t ) =
1
[B (k , ω )e ]
+ B2 (k , ω )e − Nz e ikx e iωt dkdω
(2π ) ∫ ∫
Nz
2 1 (2.55)
−∞ −∞
In the expressions above, the bracketed quantities are the Fourier transform of the functions of x, z
and t which constitute the desired solution, and can be written as
ϕ = A1 (k , ω )e Mz + A2 (k , ω )e − Mz (2.56)
ψ = B1 (k , ω )e Nz + B2 (k , ω )e − Nz (2.57)
In equation 2.54 and 2.55 the variables x and t appear in the exponentials only, multiplied by k and
ω respectively : this combination of variables describe a travelling wave with apparent velocity of
c =ω/k (2.58)
Substituting c = ω / k we get
eikx eiωt = eiω (t − x / c ) (2.59)
N 2 = k 2 − ω 2 / β 2 = k 2 (1 − c 2 / β 2 ) (2.61)
The sign of M 2 and N 2 , depending on k and ω , depends on the ratio between c and α and c and
β . So, as we said before, M and N can be real or imaginary depending on the relation between c,
α and β . Different possible solutions can be found depending on the ratio between the different
velocities.
2.2.7 Rayleigh waves as solution of the wave equation for the half space
Considering the definitions of M2 and N2 rearranged in 2.60 and 2.61, there are three cases,
β < α < c , β < c < α and c < β < α . The third case, the only one giving real values of M and
N, occurs when the propagation velocity is lower than the P and the S velocities and leads to
Rayleigh surface waves
If the phase velocity have to be less than the shear velocity, both N and M are real and can be
expressed (2.49, 2.50) as
M = m and N = n
and the solution for potentials can be written
∫ [A (k ,ω )e ]
∞ ∞
1
Φ ( x, z , t ) = + A2 (k , ω )e − mz eiωx / c eiωt dkdω
(2π )2 −∫∞
mz
1 (2.62)
−∞
∫ [B (k ,ω )e ]
∞ ∞
1
Ψy ( x, z , t ) = + B2 (k , ω )e − nz eiωx / c eiωt dkdω
(2π )2 −∫∞
nz
1 (2.63)
−∞
where
1/ 2 1/ 2
1 1 1 1
m = ω 2 − 2 n = ω 2 − 2 (2.64)
c α c β
In order to determine the coefficients A and B we can impose the null potentials at infinity and the
zero stresses at the free surface. The coefficient A1 and B1 must be zero to avoid exponential growth
with depth and infinite value of Φ and Ψ . Then from each equation imposing vanishing stresses at
the surface, a ratio between the two others coefficients A2 and B2 (equation 7) is obtained:
B2
=−
(
2 − c 2 / β 2 i sgn ω
= −i sgn ω ⋅ L
) (2.65)
A2 2 1− c2 / β 2 ( 1/ 2
)
B2
=−
(
2 1 − c 2 / α 2 i sgn ω
= −i sgn ω ⋅ L
) 1/ 2
( )
(2.66)
A2 2 − c2 / β 2
where
L=
(2 − c / β ) 2 2
=
(
2 1− c2 / β 2 )1/ 2
2(1 − c / β ) 2 2 1/ 2 (
2 − c2 / β 2 ) (2.67)
From the above equation (2.67) the equation (2.68) for the phase velocity can be derived
2 1/ 2
c2 c2 c2
2 − 2 − 41 − 2 1 − 2 = 0 (2.68)
β α β
A real root of this equation gives the speed of Rayleigh wave cR , first described by Lord Rayleigh
in 1885 .
2.2.8.1 Velocity
For the case of elastic, isotropic, homogeneous body, the velocity doesn't depend on ω , i.e. there is
no dispersion. If we define p and q as
8 16β 2 272 80 β 2
p= − q= − 2 (2.69)
3 α2 27 α
we get the velocity as
[ ] [ ]
1/ 2
cR
(
= − q / 2 + q 2 / 4 + p 3 / 27
β
1/ 2 1/ 3
)
+ − q / 2 − q 2 / 4 + p 3 / 27
1/ 2 1/ 3
(
+ 8 / 3 ) (2.70)
q2 p2
when − > 0
4 27
and as
( )
1/ 2
cR π − cos −1 − 27 q 2 / 4 p 3
1/ 2
= − 2(− p / 3) cos
1/ 2
+ 8 / 3 (2.71)
β 3
q2 p2
when − < 0
4 27
Both p and q are functions of the ratio α / β ( α = VP and β = VS ), that can be expressed as a
function of the Poisson ratio υ
1/ 2
α 1−υ
= (2.72)
β 1/ 2 − υ
Rayleigh wave velocity depends on a β and υ : it mainly depends on the shear velocity β , being
about 0.9 times β , and it is much less sensitive to the variations of υ .
The ratio between the P-wave velocity and the S-wave velocity, and between the Rayleigh wave
velocity and S-wave velocity as a function of the Poisson ratio are illustrated in figure 2.6.
Figure 2.6. The ratio VP/VS and VR/VS as a function of the Poisson ratio.
The minimum ratio between VP and VS is 2 , and the ratio tends to infinity when the Poisson ratio
tends to 0.5: this because in such condition the shear velocity should be zero.
The exact range of variation of the Rayleigh velocity is
VR
0.87 < < 0.96
VS
(υ = 0) (υ = 0.5)
2.2.8.2 Displacements
To obtain the displacements produced by Rayleigh waves let's discuss the steady-state oscillation at
the angular frequency ω0 in a homogeneous half space.
which means that we only have amplitude A for the angular frequency ω0 . The potentials resulting
from the integrals are
Φ = Ae − mz cos ω0 (t − x / cR ) (2.74)
Ψ y = Le − nz sin ω 0 (t − x / c R ) (2.75)
where L is defined in 2.67, the corresponding displacements are, following equations 2.38
[ ]
u z = − me − mz + (Lω 0 / c R )e − nz A cos ω 0 (t − x / c R ) (2.76)
ux = [(ω 0 / c R )e − mz − (L ⋅ n )e ]A sin ω (t − x / c
− nz
0 R ) (2.77)
With horizontal free surface, u z is vertical and the direction of propagation x is horizontal: the
particle motion for a fixed x and y is a retrograde ellipse, since the two components have a phase
difference of π / 2 exactly, and the vertical component is greater than the horizontal one.
The amplitude decreases with depth exponentially, but with different rates for the two components:
u z remains positive, u x becomes negative (fig 2.7), and so the ellipse becomes prograde below a
certain depth (about 1 / 2π times the wavelength) where the displacement trajectory collapse to a
vertical segment.
The particle motion decreases exponentially with depth and becomes quickly negligible (Figure
2.7), so the wave propagation affects only the shallowest materials, and it is influenced only by
mechanical properties of such layers. We can plot the dimensionless displacements against the
dimensionless depth (Figure 2.7)
U z = (u z cR / ω0 A) = u z 2πλ / A U x = (u x cR / ω0 A) = u x 2πλ / A z = z/λ (2.78)
2.2.8.3 Energy
The energy of Rayleigh waves is also concentrated near the surface: in the following the
expressions give intensity, kinetic-energy density and potential-energy density
I z = − p zz v z − p xz v x (2.79)
I x = − p xz v z − p xx v x (2.80)
(
KE = ρ vx2 + vz2 / 2 ) (2.81)
[ (
PE = ρα 2 (ε xx + ε zz ) + ρβ 2 ε zx2 − 4ε xxε zz / 2
2
)] (2.82)
E = PE + KE (2.83)
With the symbols defined in 2.3, 2.4, and being v the velocity
Using the above obtained displacements (2.76 and 2.77) we find I z proportional to
sin ω0t cos ω0t ,
so the vertical energy flow has a frequency which is twice the frequency, and its average is zero:
there is no energy flow towards the earth's interior.
The flow I x is proportional to sin 2 ω0t cos 2 ω0t and so its time average is not zero.
the average flow in the vertical z direction and in the horizontal x direction can be derived.
Iz = 0 (2.84)
{( ) [ ] ( )
I x = I 0 1 + 4r 2 β 2 / cR2 e −2 r z − (r / s ) (r + s ) 1 + 2(1 + rs )β 2 / cR2 e − (r + s ) z + (r / s ) − 3 + 4 β 2 / cR2 e −2 s z
1/ 2
}
(2.85)
with
(
r = 1 − cR2 / α 2 )1/ 2
(
s = 1 − cR2 / β 2 )
1/ 2
I 0 = ρω 4 A2 / 2cR z = 2πz / λ
The horizontally travelling energy as a function of the depth and of the Poisson’s ratio is plotted in
figure 2.8
Figure 2.8. Horizontal average intensity Ix/Io against normalized depth, for different
poisson ratio nu. Shear velocity is constant.
2.2.9 Conclusions
Rayleigh wave equation in a homogeneous half-space have been derived to compute their velocity,
displacement and energy.
The Rayleigh wave velocity in a homogeneous isotropic half space is not dependent on the
frequency, i.e. there is no dispersion, and it is mostly influenced by the shear-wave velocity.
The velocity of R waves is lower than the shear-wave velocity, so, for a given frequency, surface
waves have the shorter wavelength.
The displacements, the stresses and the energy have an exponential decay with depth, and so the
surface waves affect only a shallow zone.
The influenced layer is limited to a depth roughly equal to one wavelength: this means that the
wave propagation is conditioned by the mechanical properties of this zone, then for a given
wavelength, deeper materials are not influent. As a consequence, the half-space has not to be strictly
infinite to allow the application of the above theory. If a shallow homogeneous layer is thick
enough for a given wavelength, the latter will propagate such as in an infinite half-space.
So, the half space has not to be strictly infinite, in order to have such phenomenon.
The Rayleigh waves do not spread energy into the body, and so, not considering the intrinsic
attenuation, it is clear that being the geometrical spreading smaller, the surface waves are
preponderant on body waves at a great distance from the source.
2.3.1 Introduction
If in a homogeneous half space the Rayleigh wave problem has a quite simple solution, for
heterogeneous media it becomes very complex: the solution exists for transverse isotropic media
with the free surface parallel to the isotropy plane.
In a homogeneous body there is a unique velocity for all frequencies, and so for all wavelength. In a
layered medium the different wavelength “sample” different portions of soil, and so they are
influenced by different materials, and the geometric dispersion arises. This dispersion is called
geometric because it is due to geometric variations of elastic properties, and it is not intrinsic
(material dispersion).
As it is easy to see, the greater wavelength, corresponding to the lower frequencies, are influenced
also by deeper layers (Figure 2.9), while shorter wavelengths are influenced only by shallow layers.
The solution must satisfy the free surface condition at z = 0, the vanishing of amplitude at infinity,
the continuity at each boundary. A nontrivial solution exists, for each frequency, for special values
of the wavenumber, and so for fixed phase velocities: given a frequency more than one solution can
exist. Many modes of propagation can be contemporarily present.
The dispersion also allows the existence of a group velocity different from the phase velocity. Each
mode has its relationship between frequency and phase velocity: the velocity of a wavetrain,
composed by several single frequency signal, is the group velocity, while the phase velocities are
related to each frequency.
This is a plane wave in the x direction u2 = 0 , but the z dependence is the same for a cylindrical
wave (Aki and Richards, 1980)
We can state
τ zr = r3 ( z , k , ω )e i (ωt − kx ) (2.90)
and stating
f ( z ) = [r1 , r2 , r3 , r4 ]
T
(2.92)
we get
df
= A( z ) f ( z ) (2.93)
dz
This is a linear differential eigenvalue problem with displacements eigenfunctions
r1 ( z , k , ω ) and r2 ( z , k , ω )
and stress eigenfunctions
r3 ( z , k , ω ) and r4 ( z , k , ω )
For a given frequency ω , non trivial solutions exist only for special values of the wavenumber k,
k1 (ω ), k2 (ω ),..., k n (ω ) , the eigenvalues. The corresponding functions r1, r2 , r3 , r4 solutions of the
equation are the eigenfunctions.
[ ]
FR λ ( z ), G ( z ), ρ ( z ), k j , ω = 0 (2.94)
known as the Rayleigh dispersion equation, or the Rayleigh secular function: in vertically
heterogeneous media the velocity of propagation is a multi-valued function of frequency. Each
function
k j , ri ( z , k j , ω )
Figure 2.10. The modal curves are the possible non trivial solution of the problem.
Once the eigenvectors are found (the roots of the dispersion equation) the eigenfunctions for each
mode of propagation can be computed. The Rayleigh eigenfunctions are the stresses and
displacements as a function of the depth, of frequency and of the mode.
efficient iterative algorithm for computing the Rayleigh dispersion equation, and explicitly models
the constructive interference that leads to the formation of surface wave modes.
Figure 2.11. Geometry of the layered medium with α, β, ρ for each layer.
Using again the Helmoltz’s decomposition, we can say that the displacements in the n th layer can be
obtained by the dilatational and shear potential Φ n and Ψn of the nth layer
n
n ∂Φ n ∂Ψy
ux = − (2.95)
∂x ∂z
n
n ∂Φ n ∂Ψy
uz = + (2.96)
∂z ∂x
and the equation of motion in each layer is satisfied if Φ n and Ψyn are solutions of the appropriate
wave equation (eq. 2. 23)
∂ 2Φ ∂ 2Φ ∂ 2Φ 1 ∂ 2Φ
+ + =
∂x 2 ∂y 2 ∂z 2 α 2 ∂t 2
∂ 2 Ψy ∂ 2 Ψy ∂ 2 Ψy 1 ∂ Ψy
2
+ + = 2
∂x 2 ∂y 2 ∂z 2 β ∂t 2
As it has been illustrated above (section 2.2.6), the equations can be solved by separation of
variables, and in each layer we have (eq. 2.52 and 2.53)
(
Φ n = A1e Mz + A2e − Mz eikx eiωt ) (2.97)
Ψyn = (B e
1
Nz
+ B2e − Nz )e ikx iωt
e (2.98)
the Fourier transformed potentials (section 2.2.7), when c < β < α (Rayleigh surface waves
condition) can be written as :
∫ [A (k ,ω )e ]
∞ ∞
1
Φ n ( x, z , t ) = + A2 (k , ω )e − mz eiωx / c eiωt dkdω
(2π )2 −∫∞
mz
1 (2.99)
−∞
∫ ∫ [B (k , ω )e ]
∞ ∞
1
Ψ y ( x, z , t ) =
n nz
+ B2 (k , ω )e −nz eiωx / c eiωt dkdω (2.100)
(2π )2 −∞ −∞
1
If the phase velocity is lower than shear wave velocity, both N and M are real and can be expressed
(15,16) as M = m and N = n and the solution for potentials give
1/ 2
1 1
m = ω 2 + 2 (2.101)
c αn
1/ 2
1 1
n = ω 2 + 2 (2.102)
c βn
So, the Fourier Transform can be written as the sum of an up-going and down-going term as
ϕ n = An e mn ( z − z n−1 ) + Bn e − mn ( z − z n−1 ) ≡ ϕ n + ϕ n
+ −
(2.103)
ψ n = Cn e nn ( z − z n−1 ) + Dn e − nn ( z − z n−1 ) ≡ ψ n + ψ n
+ −
(2.104)
where
mn2 = k 2 − ω 2 / α n2 (2.105)
nn2 = k 2 − ω 2 / β n2 (2.106)
[
Φ n ( z ) ≡ ϕ n ( z ),ψ n ( z ),ψ n ( z ),ϕ n ( z )
+ + − −
] (2.107)
containing the displacements and the stresses in the same layer nth ( p zzn and p zxn are the two stress
components in the nth layer).
All components are functions of the transformed variables k , ω
The displacements and the stresses can be obtained from the potentials (2.38 and 2.39) as
∂Φ ∂ψ y
ux = − = ikϕ n+ + ikϕ n− − nnψ n+ + nnψ n− (2.109)
∂x ∂z
∂ϕ ∂ψ y
uz = − = mϕ n+ − mϕ n− + ikψ n+ + ikψ n− (2.110)
∂z ∂x
∂ 2Φ 2 ∂ Φ
2
2 ∂ Ψy
2
p xx = ρ 2 − 2 β − 2β (2.111)
∂t ∂z 2
∂x∂z
∂ 2Φ ∂ 2 Ψy ∂ 2 Ψy
pzx = ρ 2β 2 − 2β 2 +
2
(2.112)
∂x ∂z ∂ z 2
∂ t
At any point in the nth layer we can compactly write the above relations between potentials and
stresses and displacements as:
S n ( z ) = Tn Φ n ( z ) (2.113)
with a = 2k 2 + ω 2 / β n2
Within a layer the z-dependence of the potentials can be summarized and compactly written
between the two interfaces as
Φ n ( zn ) = En Φ n ( zn −1 ) (2.116)
Considering that for each individual potential there is an exponential relation such as, for instance
ϕ + ( z + d n ) = e mn d n ϕ + ( z )
S n ( zn ) = S n +1 ( zn ) n=1,2,…,L-1
+ +
ϕ L =ψ L = 0
Thomson and Haskell define a layer matrix G that continues the solution from a layer to the
following
S n +1 ( zn ) = Gn S n ( zn −1 ) (2.119)
This layer matrix can be obtained from the following equations starting from the stress and strain in
the n+1th layer, at the interface zn with the upper nth layer :
a) S n ( z n ) = S n+1 ( z n )
Starting from the layer n+1, the continuity of stress and strain at the interface n allow to get the
stress and strain at the same interface, but in the upper nth layer
b) Tn Φ n ( z n ) = S n+1 ( zn )
The relation between potential and stress-strain in the nth layer gives the potentials at depth zn in
the layer nth
c) Tn En Φ n ( zn −1 ) = S n +1 ( zn )
The z dependence of the potentials in the nth layer allow the continuation of the potentials from
zn up to zn-1
d) Φ n ( zn −1 ) = Tn−1S n (zn −1 )
The relation between potential and stress-strain in the nth layer, already used for the step b),
gives stress and strain at the depth zn-1 in the layer nth, which is the final result of equation 2.119
Summarizing the four steps the result is
Tn EnTn−1S n ( zn −1 ) = S n +1 ( zn ) , that gives the G matrix of equation 2.119
Gn = Tn EnTn−1 (2.120)
The matrix Gn continues the stress displacement solution downward through a layer and across an
interface. S is converted to Φ , then continued by E from the top to the bottom, then converted to S
and equated to the S of the layer below.
Considering the disturbance generated at the top surface, the solution is continued from the top
surface to the half space by
Φ1 ( z L −1 ) = TL−1GL −1 LG2G1 S0 = R ⋅ S 0 (2.121)
and if we let
R11 represent the transpose of the cofactor matrix of R11 then
[u , w ] = − detR R
0 0
11
[
R12 σ 0 ,τ 0 ] (2.125)
11
The expression det R11 is the function whose roots give rise both to leaking and propagating modes:
det R11 = 0 must be true for the characteristic oscillations for the problem in which the upper
[ ]
surface is free by setting σ 0 ,τ 0 = 0 . A solution can exist for pairs of (k , ω ) satisfying
The solution can exist only for those couples of (k , ω ) : they are the eigenvalues, and on the (k , ω )
are the modal curves.
It is interesting to notice that some of the more commonly encountered modal equations are
expressible in terms of subdeterminants of the matrix just defined: for the homogeneous half-
space we have
(det T )
−1
1 11 =0 (2.127)
where:
Gn ( z − z n −1 ) means that d n is replaced by z − zn −1 in Gn .
Figure 2.12. Hysteretic loop of stress strain at very low strain, after Foti (2000).
The behaviour of soil below this limit of very small strain is called linear viscoelastic, and the value
is called the linear cyclic threshold shear strain. The linear viscoelastic model is necessary to
describe the dissipation at very low strain, which is the case of seismic methods . This model can be
schematised with an elastic spring and a viscous dashpot (Maxwell model, Kelvin-Voigt model),
and the stress strain relation involves complex moduli. A complex modulus has an imaginary part
that makes the stress and strain out of phase (from which the elliptic hysteresis).
For the wave propagation this leads to complex velocities.
*
α * = VP =
(K *
+ 4 / 3µ * ) (2.129)
ρ
* µ*
β * = VS = (2.130)
ρ
The meaning of the complex velocities in terms of spatial attenuation can be easily illustrated: if a
complex velocity is used, a complex wavenumber is introduced in the wave equation (Schwab and
Knopoff, 1971). The solution has then to consider so an attenuation factor accounting for spatial
decay of a wave propagating through a dissipative medium.
The simple case of one dimensional harmonic S-wave propagation can be used as example
∂ 2u y 1 ∂ uy
2
if the wavenumber k * is complex, it can be written as k * = Re(k ) + i Im(k ) the solution can be
written as
u ( x, t ) = Ae i [(Re( k ) +i Im( k ) )x +ωt ] = Ae i [Re( k ) x +ωt ]e −i Im( k ) x = Ae i [kx +ωt ]e −αx (2.131)
Finally, to completely describe the wave propagation in dissipative media, assuming a linear
viscoelastic model, a complex velocity can be used, or alternatively a real phase velocity and an
attenuation coefficient.
The material damping ratio D and the quality factor Q, Q = 1/2D, are the parameters usually used
as a measure of energy dissipation, is the material damping ratio D, and the quality factor. The
damping ratio can be defined, from the hysteretic loop, as a ratio between the energy dissipated
during the cycle and the maximum energy reached during the cycle (figure 2.12).
∆W (ω )
D(ω ) = (2.132)
4πW (ω )
The quality factor used by seismologists is simply (Aki and Richards, 1980)
1
Q= (2.133)
2D
The solution for weakly dissipative media can be obtained directly from the solution of the linear
elastic eigenvalue problem substituting in the relevant expressions the complex velocities. In other
cases, when no assumption is made on the damping factor, a coupled computation is needed (Lai
and Rix, 2002).
The attenuation is quite easy to be described in linear viscoelastic soils, using complex moduli and
complex velocities, when body waves are considered. Surface waves, as combination of P and SV
waves, have an attenuation due both to shear wave attenuation and longitudinal wave attenuation.
Viktorov (1967) shows that the Rayleigh wave attenuation is mainly due to the shear wave
attenuation: the attenuation of Rayleigh waves can be obtained as a weighted average of the
attenuation of shear and compressional waves. The weights for the shear-wave attenuation is A, the
weight for the compressional wave velocity is (1-A). The value of A only depends on the Poisson
ratio. (Figure 2.13)
Figure 2.13. Weights for the shear wave attenuation (A) and the compressional wave attenuation (1-A) as a function of
the Poisson ratio, after Viktorov (1967).
The use of complex moduli and velocities makes the phase difference between the two
displacements at the free surface other than π/2. This means that the elliptic trajectory of the particle
at the free surface has no vertical and horizontal axes, but is inclined.
In this section the computational steps for the simulation of the test are described, and the main properties of the
Rayleigh wave propagation in layered media are discussed. The first step is computing the eigenvalues, i.e the
possible solutions for each frequency: the eigenvalues are the modal curves. Then the displacements and stresses
for each frequency, at each depth, and each mode are computed, and finally with the Green’s function the
displacements at the free surface are computed for a point source. The synthetic seismograms are obtained, in
frequency-offset domain and in the frequency wavenumber domain. Hence, given a testing layout, the
displacements induced at the free surface at known distance from the source are predicted.
For P equal to the product P = A1 A2 L An , being A the individual layer matrices in the Dunkin’s
notation
we have
ij
{ } {a }
p kl = a1
ij
mn
2 mn
op { } {a }
L a n−1
st
uv
n uv
kl (2.135)
m
∫
2π Γm
(2.136)
being Γm the region of the real ξ axis for which the mth pole is included and
M R σ
S n (z ) = 11 11 R12 0
m
(2.137)
M 21 f s τ 0
r − r12 −1 − r
R11 = 22 = −r11 12 [r21 − r11 ] (2.138)
r12 r11 r11
So we can write
−1 M − r 1 σ
S nm ( z ) = − r11 11 12 [r21 − r11 ]R12 0 (2.139)
M 21 r11 f s τ 0
or, considering the receiver (R) and the source (W) function
m
S n = RnmW0m (2.140)
transforms of the displacements and stresses in the nth layer associated with the mth mode,
normalized by the vertical displacement at the free surface.
Considering the secular function 2.126 in terms of subdeterminant
F (k , ω ) = det R11 = r 12 = 0
12
we can write
F (k , ω ) ≡ det R11 = r 12 = t l
12 12 ab cd pq
⋅ g l −1 ⋅ g l −2 ⋅L ⋅ g l −2 (2.141)
ab cd ef 12
expression which is linear in the second order determinants belonging to any Gn : this expression
contain no exponential terms.
The source term is
[
W0m = (r21r13 − r11r23 )σ 0 − (r14r21 − r11r24 )τ 0 f s ] −1
[ 12
W0m = r 31σ 0 + r 41τ 0 f s
12
] −1
{ } ⋅ {g }
W0m = t L
12
ab
L −1 ab
cd
⋅L ⋅ g 2 { } ⋅ {g }
rs
tu
1 tu
31
σ 0 + g1 { } tu
41 0
τ f s
−1
(2.142)
F (k , ω ) ≡ det R11 = r 12 = t l
12 12 ab cd pq
⋅ g l −1 ⋅ g l −2 ⋅L ⋅ g l −2
ab cd ef 12
{ } ⋅ {g }
W0m = t L
12
ab
L −1 ab
cd
⋅L ⋅ g 2 { } ⋅ {g }
rs
tu
1 tu
31
σ 0 + g1 { } tu
41 0
τ f s
−1
Receiver term
21
f (k ,ω ) ≡ det R11 = r 12 = t l
12 12 ab cd pq
⋅ g l −1 ⋅ g l −2 ⋅L ⋅ g l −2
ab cd ef 12
where
ab
gn
cd
are the subdeterminants of the matrix TL−1 , of the Lth layer, which is the half space. The matrix TL−1
is simply the inverse of the T matrix (equation 2.115) of the half space.
2iµ n kmn nn µ n an nn mn nn iknn
−µ a m 2iµ n kmn nn ikmn − mn nn
−1 β n2 n n n
Tn =
2µ n mn nnω 2 − 2iµ n an − µ n an nn mn nn − iknn
µ n an mn 2iµ n kµ n nn − ikmn − mn nn
where we have defined
mn2 = k 2 + ω 2 / α n2 nn2 = k 2 + ω 2 / β n2 an = 2k 2 + ω 2 / β n2
It can be stated, as proposed by Dunkin,
λ =ω /k γ n = 2k 2 / β n2 / ω 2
~ = k −1 k 2 + ω 2 / α 2
m n~n = k −1 k 2 + ω 2 / β n2
n n
~ −1 sinh km
SM = m ~ d SN = n~n−1 sinh kn~n d n
n n n
~ d
CM = cosh km CN = cosh kn~n d n
n n
= g = i (ρ λ k ) (CM − CN )
n n 2 −1
g13 24 n
= (ρ λ k ) (− SM + n~ SN )
−1
g14n n
2 2
n
n
g 21 = g = i[γ m
n
43
~ SM − (γ + 1)SN ]
n
2
n n
n
g 22 = g 33
n
= (γ n + 1)CM − γ nCN
g n = ρ λ2 k m
23 ( −1 ~ 2
n )(
SM − SN n )
g = g = iρ n λ kγ n (γ n + 1)(CM − CN )
n
31
n
42
2
32 [
g n = ρ λ2 k (γ + 1) SM − γ 2 n~ 2 SN
n
2
n n n ]
= ρ λ k [− γ m
~ SM − (γ + 1) SN ]
n 2 2 2 2
g 41 n n n n
The terms that are needed for the computation of the secular function are the second order
subdeterminants listed in the following:
12 34
(
g 12 = g 34 = −2γ n (γ n + 1) + 2γ 2 n + 2γ n + 1 CMCN − (γ n + 1) + γ n m
2 2 ~ 2~2
n nn SNSM ) [ ]
12 24
g 13 = g 34 = ρ nλ2 k() (− CMSN + m~ SMCN ) −1 2
n
= g = (ρ λ k ) (SMCN − n~ CMSN )
12 13 2 −1 2
g 24 34 n n
Figure2.14. Secular function a the same frequency for the same stratigraphy, two expressions (real and complex).
When a reliable and efficient technique allows the computation of the secular function, its roots
have to be found to get the modal curves. However the secular function is a rapidly oscillating
function, and so the root searching techniques may fail. The following figure (2.15) shows the
values of the secular function, at fixed frequency, as a function of the wavenumber in the range 0-9
rad/m. The number of zeros is not known, and roots can be very close each other.
Figure2.15. The secular function for the different wavenumber at fixed frequency. The behaviour can be rapidly
oscillating, and the root finding is not simple. Two zeros can be close each other.
In this range, between the two lines of slope 2π / cR min and 2π / c S max the zeros of the secular
function constitute the modal curves. The roots are found with a bracketing-bisection technique
(Press, 1992)
The example below shows the position of the zeros the in frequency-wavenumber domain, and the
value of the secular function for two frequencies.
Figure 2.16-a. Modes in frequency wavenumber domain. They are found searching
at each frequency the zeros in the admissible range. Model: 400m/s, 200m/s, 500m/s; 1m, 2m.
Figure2.16b. The value of the secular function at the frequency of 40 Hz (left) and at the frequency of 280 Hz.
Figure2.17. Secular function, lines of zeros, region of acceptable values between the
two straight diverging lines (2 layers : 100 m/s, 250 m/s; 2m).
The computation is hence heavy: at each frequency the secular function has to be computed to find
its zeros. A bracketing-bisection procedure (Press et al., 1992) has been implemented: the more
critic step is the bracketing, that may fail due to the rapidly oscillating behaviour of the function. An
adaptive bracketing based on the intervals between zeros at higher frequency has been
implemented. The computation starts from the highest frequency in the range of interest, where the
number of modes and zeros is maximum. A very fine discretization is adopted for the bracketing at
that frequency, and the position of the roots is used to predict the roots at the following step at
lower frequency. The discretization is coarser where roots are far from each other, and it is fine
where they are close: the following figure (2.18) shows the secular function with two roots very
close each other: the coarse discretisation (left) with step of 0.001 rad/m in this case fail the
identification.
Figure 2.18. Bracketing of the zeros of the secular function. The horizontal line of zero is not crossed but the samples
of the coarse discretization (left), while two roots are identified with the finer discretization (right).
Figure 2.19. Modal curves of the example above, in frequency wavenumber and in frequency-phase velocity
(2 layers : 100m/s, 250 m/s; 2m). They are discrete curves computed numerically.
increase of phase velocity with increasing frequency is associated to a velocity inversion, i.e. a layer
with low velocity underlying a layer with higher velocity (Figure 2.20).
It is intuitive that at high frequencies, when f tends to infinity, the surface wave should have a
infinitesimal wavelength, and then it should propagate only in the first layer, having indeed a phase
velocity equal to the Rayleigh wave velocity of the first layer. Nevertheless this refers to the
macroscopic behaviour, that could be due to a sequence of different modes, and not to a single
modal curve.
The relative excitation of different modes depends on the nature and on the depths of the source:
these aspects will be discussed later in 2.4.6. In general we can say that in order to separate the
different modes we have to record at a great distance from the source, where they arrive separate in
time due to their different propagation velocities.
The first mode, often called the fundamental mode, is the one having the lowest phase velocity, and
sometimes it is referred to as mode 0. The modal curve can have a quite strange behaviour (Figure
2.21), and can show apparent intersection, that make the computation particularly difficult.
Figure 2.21. The behaviour of modal curves can be quite strange, with apparent intersection.
Here a strong velocity inversion (200m/s 300m/s 1000m/s 300m/S 1100m/S).
A series of example is given hereafter to discuss some properties of the modal curves. When it is
not differently specified, in the examples below the velocities of the layers are shear wave
velocities, and the Poisson ratio is always 0.33.
2.4.3.1 Example 1
A simple 2 layer normally dispersive model is considered at first: the first layer is 20 m thick and
has a shear velocity of 200 m/s, the half space has a shear velocity of 300m/s. (Figure 2.22)
Figure 2.22. Modal curves of a two layer model (20m at 200 m/s, halfspace 300 m/s). 14 modes exist at 90 Hz. The first
mode tends to the Rayleigh Wave velocity of the first layer, the others to the shear velocity of the first layer.
The first mode tends to the Rayleigh wave velocity of the first layer, which is 186.5 m/s, while all
the others tends to the shear wave velocity of the first layer (200 m/s). The cut-off frequencies are
evenly spaced with a spacing of 6.70 Hz, from equation 2.145, and the limit phase velocity equals
the shear velocity of the half space (300 m/s). At high frequency, where only the first layer rule the
propagation, the first mode is dominant.
2.4.3.2 Example 2
The spacing of cut-off frequencies is regular only for simple cases. In the following example
(Figure 2.23), where 3 layers are present (200 m/s 300 m/s 400 m/s, 20 m, 20m, half space ), the
cut-off frequencies are not even-spaced. Furthermore three asymptotic velocities are possible: for
the first mode the Rayleigh wave velocity of the first layer (186.5 m/s), for the other modes the
shear wave velocity of the first layer (200 m/s). Also the shear wave velocity of the intermediate
layer (300 m/s) appears an alignment as made up of different modes that flatten at that phase
velocity.
At high frequency the Rayleigh wave velocity of the first layer is given by the first mode.
Figure 2.23. Modal curves of a three layer model (20m at 200 m/s, 20 m at 300 m/s then 400 m/s).
2.4.3.3 Example 3
An inversely dispersive model is here presented: two layers ( the first with shear velocity of 250 m/s
and 10 m thick, the second with shear velocity of 150 m/s and 10 m thick) lay on a half space at 300
m/s of shear wave velocity (Figure 2.24)
Figure2.24. An other example of modal curves. All modes tend at high frequency to the minimum shear velocity.
At high frequency higher modes have an horizontal branch at the Rayleigh velocity of the first layer. The cut-off
frequencies are associated to the shear velocity of the half-space. Model: 250 m/s, 10m; 150 m/s, 10m; half-space 300
m/s.
In the inversely dispersive model of the figure above, the modal curves tend at high frequency to
the shear velocity of the intermediate layer (lower velocity in the model): it can be noticed that the
modal curves flatten, at high frequency, at the Rayleigh velocity of the uppermost layer, 233 m/s,
producing a continuous line that allows the existence of a physically acceptable solution at high
frequency. The surface wave should be confined, at high frequency, in the uppermost layer: the
phase velocity has to be equal to the Rayleigh wave velocity of the uppermost layer for the shortest
wavelengths. For instance above 40Hz, also the maximum possible wavelength (300 m/s /40 Hz =
7.5 m) is almost confined in the first layer. The phase velocity hence should tend to 233 m/s for all
the wavelengths greater than this value, and this behaviour is possible if the energy at different
frequencies follows different modes.
2.4.3.4 Example 4
In the following example a small variation in the shear velocity of one layer change the asymptotic
behaviour of the first modal curve (Figure 2.25). In this earth model, the velocities are expressed in
km/s and the thickness in km.
Layer Thickness Shear velocity Compressional density
[km] [km/s] velocity [km/s]
1 1 3.8 6.5 2.7
2 1 3.5 (left) 6.0 2.5
3.3 (right)
3 1 4.4 7.5 3.1
4 1 4.1 7.0 2.9
Half space ∞ 4.7 8.0 3.3
It can be noticed that for both the models the first mode at low frequency has the Rayleigh velocity
of the half space (4.31 km/s), then the phase velocity decreases for the influence of the low velocity
Extract from PhDThesis - Claudio Strobbia. 60
Chapter 2.4 SIMULATION OF THE RAYLEIGH WAVE PROPAGATION IN LAYERD MEDIA
layer, and increases again because of the uppermost layer. At high frequency the surface wave
should be confined in the first layer and have a phase velocity of 3.49 km/s, which is the Rayleigh
velocity of the first layer Vr1: this can be obtained following one or more of the modal curves.
In the figure it can be easily noticed that with the lower velocity inversion (left) the first mode at
high frequency tends to Vr1: with the higher velocity inversion, on the other hand , it tends to the
Vs2, and the other modes tends to the Vr1.
Figure 2.25. A little variation in the model can change the asymptotic behaviour of modal curves.
2.4.4.1 Introduction
The modal curves depend on the velocities and the thickness of the layers, the relation being quite
complicated and computationally heavy with the complex models that are often needed to describe
the very shallow subsurface. If the curves of one model could be obtained simply from the curves
computed for other models, a great save in computing time would be obtained: when the procedure
is used in the forward modelling for an inversion, a fast simplified computation could be very
useful. The scaling properties of the modal curves are discussed here: when the velocities or the
thickness of all layers change of the same factor, the modal curves simply scale. The applications of
these properties will be presented in chapter 5.
The other terms in which the frequency or the velocities are present are factors that multiply the
expression: for instance µ (related to β by β = µ / ρ ) can appear as a factor with exponent –1,
0,+1. In the term λ the frequency is isolated λ = ω / k , but this term again can only be a factor.
Fixing the thickness and the densities, fixing the ratio between the velocities β and α , α = C ⋅ β
we can express the secular function as a function of the ratio between velocity and frequency
f (ω , k , β ) = const(ω , β ) ⋅ f (ω / β , k )
This express the scaling property of the secular function with layer velocities. The secular function
has exactly the same value, when the velocities are multiplied by a factor F, provided that also the
frequencies are multiplied by the same factor F.
In figure 2.26 the secular functions calculated, using the Dunkin’s normalized expression are
shown, for two normal dispersive six layers model, the second with all the velocities being a half of
the first (200m/s 500m/s 1000m/s 2000m/s 3000m/s; 2m 2m 2m 2m ):
This scaling of the frequency has a simple effect on the phase velocity, obtained as a function of
frequency as
2πf ω
v= =
k k
The scaling of the frequency becomes directly a scaling of the phase velocity of Rayleigh waves.
Finally, if two model have the same thickness and densities, multiplying the layer velocities by a
factor F, the Rayleigh phase velocities and the frequencies are multiplied by the same factor.
With simple algebraic passages one can get the modal curves. In the following figure (2.27) the
modal curves for 5 layers model. (400m/s 600m/s 800m/s 1200m/s; 1m-1m-1m: Æ 200m/s
300m/s 400m/s 600m/s; 1m-1m-1m)
Figure 2.27. Scaling of Modal curves: the figure below refers to a model with doubled velocities
compared to the one of the figure above. Both the velocity and the frequencies are stretched.
Figure2.28. Secular function for the two scaled models. In the second model (bottom) the secular function has been
stretched both in the wavenumber and in the frequency axis.
The point f =100 k=3of the top image is stretched to the point f=50 k=1.5 of the second image.
Figure 2.29. Modal curves for the scaled model. The bottom figure is obtained
from a model with doubled layer thickness.
The two scaling properties can of course be considered simultaneously, and then from a starting
model, we can easily obtain all the model with the same ratio between the velocities and the
thickness: a normalization is possible.
In logarithmic axis, the scaling become a translation, and all modal curves, for a ratio of d and v can
be obtained with a translation from a normalized model
Figure2.30. Modal curve depicted in logarithmic axis the scaling becomes a translation. Grey line (100 200 100 400
m/s: 2m,3m,2m) . Black line: velocities multiplied by 1.4, thickness divided by 1.1.
eigenfunction. The first part of the computation consists of calculating the Rayleigh dispersion
equation and finding its roots. For each frequency we search for the zeros, and we find the Rayleigh
wavenumbers k j = k j ( f ) , which are the eigenvalues of the problem. Each curve corresponds to a
Rayleigh mode (Figure 2.32).
Figure 2.32. Modal curve are the eigenvalues in f- k plane, only possible solutions.
We can see different modal curve, lower modes with higher values of the wavenumber, and so
lower modes with lower velocity of propagation. The number of modes increases with increasing
frequency, and each mode, except the first, exist only for f > f Cj being f Cj the cut-off frequency of
the jth mode: at the cut-off frequency the propagation velocity equals the maximum shear-wave
velocity, and the ratio between k and f is 2πf / k = max (VS ) , in this case 400 m/s.
It is easy to transform the modal curves in term of frequency and phase velocity simply calculating
the phase velocity from each wavenumber (Figure 2.33).
Then, with the found eigenvalues, we can calculate the eigenfunctions, which express, for each
frequency, for each mode, displacements and stresses with depth.
Figure 2.34 shows the vertical and horizontal displacements eigenfunction for a homogeneous half-
space. For a layered medium the decay of amplitude with the depth is still exponential, with
different rate in the different layers, and with the imposed continuity at the interfaces. In figure 2.33
vertical and horizontal displacements associated to the first mode are plotted for the frequency of 20
Hz. It is easy to recognise the presence of the interfaces a depth of 6m.
Figure2.34. Vertical and horizontal displacement induced by Rayleigh waves in a layered medium.
If we plot the same eigenfunctions of the first mode for different frequencies we see that the
amplitude decay with depth depends on frequency, so it depends on wavelength λ .
But we notice also that the amplitudes depends on frequency and for certain frequencies amplitudes
are much greater (Figure 2.35).
Figure 2.35. Vertical and horizontal displacement for the first mode at different frequencies.
If we plot the values of displacement at the free surface for different frequencies we can recognise
the presence of maxima.
We can also see the depth at which the horizontal displacement is zero, and so the elliptic trajectory
of the particle collapse on a vertical trajectory.
It is evident that the energy distribution with depth is strongly influenced by the elastic properties of
the layers, and the theoretical assumption of the half-space are no more valid.
If we analyse eigenfunctions of other eigenvalues we can compare amplitudes of motion, rate of
decay and so on for the different modes. For a frequency of 100 Hz, greater than the cut-off
frequency of the 6th mode, we can plot for all modes horizontal and vertical displacements with
depth (Figure 2.36).
Figure 2.36. Displacement with depth for the first six modes.
j =1
(2.147)
[ ]
M
( )
u z (r , z, ω ) = ∑ Ay (r , z, ω ) . j e
i ωt − k j r +π / 4
j =1
(2.148)
where Ar , Az and k j are the amplitudes and the wavenumber associated to the jth mode.
The horizontal coordinate here is r, the radial distance from the source. It can be shown (Aki and
Richards, 1980) that these expressions hold for the plane waves described above.
The actual particle displacement is obtained by taking either the real or imaginary part of the
expression above: with trigonometric identities the equation of the wave front can be obtained, and
an analytic expression of the effective phase velocity can be derived (Lai, 1998).
To compute the particle displacements due to a harmonic unit point source the displacement
Green’s function has to be computed. The vertical displacement, considering the mode
superposition, is given in the expression below. More details can be found in Lai (1998).
uˆ z (r , z , ω ) = U z (r , z , ω ) ⋅ ei [ωt −Ψz (r , z ,ω )]
(2.149)
[ ]
1 M M r2 (ki , z)r2 (k j , z)r2 (ki ,z s )r2 (k j , zs )cosr(ki − k j )
0.5
where V, U and k are the phase velocity, group velocity, and wavenumber, I is the first energy
integral. It has to be noticed that because of the modal superposition, the spatial variation of the
displacement is no longer harmonic, even with a harmonic source. When the source function is
known ( Fz ⋅ eiωt ), the real displacement can be easily computed as
u (r , z , ω ) = Fz ⋅ uˆ (r , z , ω )
(2.152)
Figure2.37. Modal curves for a simple model (200m/s 300 m/s; 2m).
The only possible velocity at 14 Hz is 265 m/s of the first mode: at 74 Hz two phase velocities,
associated to the two existing modes, are present with the phase velocities of 191 m/s and 284 m/s.
The amplitude at 14 Hz has a decay proportional to the square root of the distance from the source,
simply due to the geometric spreading (Figure 2.38, left). At the frequency of 74 Hz two modes are
superimposed, and the amplitude has a different behaviour (Figure 2.38, right)
Figure 2.38. Overall amplitudes with the distance, for a single mode and for the superposition of two modes. Left, at 14
Hz, where only one mode exists. Right, amplitudes at 74 Hz, where two modes exist and are superimposed.
Same model 200m/s 300 m/s; 2m.
At the frequency of 74 Hz two modes exists with different velocities: at the source the two modes
are in phase, increasing the distance their phase difference (time delay) increases as a consequence
of their different velocities.
The wavenumbers are the phase rotation with the distance, and they are 2.435 rad/m and 1.637
rad/m, (obtained as k = 2π ⋅ f / v ) which corresponds to a difference of 0.798 rad/m. The sum is
maximum with a zero phase difference and minimum with a π phase difference: the sum is periodic
with period 2π/0.798 = 7.87 m (Figure 2.39)
Figure 2.39. Amplitude as a function of the distance, after the geometrical spreading correction. Same model.
When two modes have the same amplitude at a certain frequency, the resulting sum is zero at
certain distances from the source. An example with real data is given in chapter 3 .
When more than two modes are present, the shape is more complicated (Figure 2.40), but the
principle is the same.
Figure2.40. Amplitude as a function of the distance when many modes are superimposed.
The displacement due to a unit source (Fz = 1) can be obtained from the expressions above in space-
frequency domain: then, at each distance, the complex spectrum can be obtained.
Figure2.41. Amplitude and phase of a trace at 10 m from the source, with unit source.
Figure2.42. Trace amplitude as a function of the offset and the frequency. The cut-off frequency of the second mode can
be clearly identified: above 48 Hz two modes superimpose producing oscillation of the amplitude, at a fixed frequency,
as a function of the offset. Same model: 200m/s, 300 m/s; 2m.
The complex spectrum has been computed, even if only the amplitude spectrum has been depicted:
an inverse transform of the complex displacement, transforming the frequency into time, could give
the seismograms. On the other hand, from these displacements in f-x, with a 1D Fourier transform
in the space dimension, the f-x domain is transformed into f-k domain (Figure 2.43). Of course, if
the f-k spectrum is to be compared with experimental data, all the padding and windowing
parameters have to be the same.
Figure 2.43. The f-k spectrum is simply computed Fourier transforming the f-x. Same model: 200m/s, 300 m/s; 2m.
This synthetic spectrum is computed considering also all the acquisition parameters and layout of a
real test: the relative maxima at each frequency and the absolute maxima are searched, and the
whole modelling procedure is consistent with the test. The aspect that is not considered in this kind
of modelling is the material attenuation.
It is important to notice that the effect of higher modes are not only important at inversely
dispersive site: it can be easily seen that higher modes can become important, even at low
frequency, and even at normally dispersive sites without strong contrasts of soil properties. The
example below (Figure 2.47) shows a normally dispersive site where the second mode is dominant
at low frequency. This feature is very important, because below the cut-off frequency of the second
mode less energy is propagated, and often experimentally the curve is obtained only above that limit
value. In this contest, the high velocity of the apparent curve, due to the second mode, can produce
a significant error in the model estimate if it is interpreted as a first mode curve. These aspects will
be discussed in chapter 5.
Figure2.47. Spectrum, modal curve and apparent curve for the model 100m/s, 120m/s, 250m/s; 1m, 1m. The second
mode has a relevant energy approaching to his cut-off frequency and strongly condition the apparent phase velocity,
which is however due to the modal superposition. (array 24 m long).
The zoom of the low frequency portion of the amplitude spectrum is depicted in the following
figure 2.50
Figure 2.50. Amplitude spectrum of the hammer: zoom on the low frequencies.
The coupling of the source with the soil is not considered here, so the signal that is actually
transmitted into the earth from the source is supposed to be exactly identical to the force signal
recorded on the source.
The following figure 2.51 shows the modal curve, the f-k spectrum, and the synthetic seismograms
for a simple 3 layers model with the source described above.
Figure 2.51. Modal curves, f-k spectrum, and synthetic seismogram for a simple
3 layer model 80m/s, 120m/s, 500 m/s; 2m, 2m.
The algorithm can be tested in different situations: for instance the following figure 2.52 shows the
differences between a homogeneous medium and a layered dispersive medium.
The homogeneous medium is not dispersive, and every frequency component of the signal
propagate with the same phase velocity. The wavelet propagate without being deformed: a simple
translation in time offset is obtained (Figure 2.52, left).
In a layered medium (a first layer with a shear wave velocity of 200 m/s on a half-space of 400 m/s)
surface waves are dispersive, and the different frequencies propagate with different phase velocities,
so at a certain distance from the source they arrive at different times. This produce the broadening
of the impulse with the distance from the source that can be easily noticed in figure 2.52 (right).
Figure 2.52. Synthetic seismograms for a homogeneous medium (Vs=200 m/s) (left) and a 2 layer model (200 m/s, 400
m/s; 2m)(right). The dispersion in the layered model produces the broadening of the impulse.
∂VR ρVS
zj
dr1
2
dr2
∂V S
Lj
=
2 k 2
UI ∫
1 z j −1
kr2 −
dz
− 4 kr1
dz
dz
z 2
∂VR ρVP
j
dr
= ∫ kr1 − 2 dz
∂VP L j 2k UI1 z j −1
2
dz
Acquisition
This chapter describes the acquisition of seismic data for the surface wave methods; particular
attention is paid to the design of the acquisition in such a way to obtain the maximum of
information and the minimum of uncertainty, stressing the intrinsic limitations of the sampling
in time and space. The different sources of noise are discussed, considering the procedures to
reduce their effect, then the intrinsic limits of the acquisition are presented. This allows to give
practical indications on the acquisition parameters and on the characteristics of the equipment, and
gives warning about the artefacts that the processing will have to face.
3
In this chapter
The physical noise in records: the random noise and the coherent noise. Different kind of coherent
noise: body waves, scattered surface waves, Lamb waves, air blast, near field and far field effects,
lateral variations.
The intrinsic limits of the acquisition: the effects of the sampling on the acquired data. The effects
of the spatial sampling, wavenumber resolution, maximum wavenumber and spatial aliasing, spatial
windowing. Effects upon the frequency ranges, on the mode separation, on the modes identification
accuracy.
The design of the acquisition parameters: the layout geometry - receiver spacing, array length, first
receiver offset - and the strategies to face the limitations – multi array, unequal spaced array, multi
array simulation.
The characteristics of the equipment: the source in general, impulsive sources and vibrating sources,
the sensors, the digital recorder. Finally the description of the equipment that has been used for the
experimental part.
3.1 INTRODUCTION
Many acquisition techniques have been used in surface waves surveying, depending on the different
applications, depth of investigation, and scale of acquisition. The elements to be discussed are the
kind of source that generate the surface waves, the receiver that detect the vibration or pressure
field, and the scale of the observation. The sources and the instruments can be very different, and
the scale of observation can vary from thousands of kilometres to some centimetres.
In earthquake seismology surface waves have long been used to infer shear-velocity models of the
crust and upper mantle (Ewing et al, 1957; Dorman et al, 1960; Dorman and Ewing, 1962; Bullen,
1963; Knopoff 1972; Kovach 1978): these surface waves propagating in the crust and in the upper
mantle, known as long period oscillations, involves periods between 5-10 s and 300-500 s,
sometimes up to 800 s (Keilis-Borok et al., 1989). They are of course observed and acquired with
wide arrays of seismometers at almost continental scales.
Surface waves have also been used for the determination of the “shallow structure”, in the
seismological meaning referring to the first hundreds of meters of the crust. Examples of
acquisitions with arrays of tens of kilometres length are given by Mokhart et al. (1988), Al-Eqabi
and Herrmann (1993), Herrmann and Al-Eqabi (1991). The surface wave data consist in general of
some tens of traces ( with time window of tens of seconds) within array length of some tens of
kilometres, gathered with standard low-frequency vertical geophones (less than 2 Hz ) and
explosive as source: wavelengths of some kilometres are acquired.
The engineering scale is smaller: some hundreds of meters or even some tens of meters. The
equipment and technology that is used has to be light, portable, and cheap: the receivers are
geophones, the sources are sledgehammers, weight drop, small vibrators, and also noise can be
used. The one and two station acquisition pioneered by Jones (1958, 1962) and Ballard (1964)
developed to the SASW test (Nazarian and Stokoe 1984; Stokoe et al., 1994) has widely been used
for the characterisation of pavement systems and soil deposits (Abbiss, 1981, Hiltunen and Woods,
1988, Gukunski and Woods, 1991, Rix and Lai, 1998; Tokimatsu et al., 1998a). The use of two
components receivers for the acquisition of both the component of the Rayleigh wave displacement
have also been proposed (Tokimatsu et al., 1998). Active multi-station techniques use arrays of
geophones (12-48 and more) and different sources, without substantial difference from the shallow
crust acquisition (Gabriels et al., 1987; Tselentis and Delis 1991; Schtievelman 1999; Park et al.,
1999; Foti 2000). Passive techniques usually observe vertical motion of microtremors using a two-
dimensional array of sensors distributed on the ground surface, in the attempt to go deeper using the
great wavelengths of microtremors (for instance Kanai and Tanaka 1961, Horike, 1985, Tokimatsu
et al., 1998b): some techniques use the power spectra and other the phase velocity of microtremors.
Multi-station applications have also been preformed in marine environment, with the use of
hydrophones to detect the pressure field related to the surface wave propagation on the sea bottom,
and explosive or airguns as sources (McMechan and Yedlin, 1981; Shtievelmann 1999)
Going to the smallest scale, the use of ultrasonic surface waves and the techniques to generate and
acquire surface waves are described for instance in the book of Viktorov (1967).
The seismic acquisition is the observation of the effects of the propagation of seismic waves in the
time and in the space. The complex vibration field produced by a seismic source can be detected by
different kind of transducers, placed on the surface or in boreholes and each acquisition should try
to enhance the phenomenon that has to be observed, reducing the other seismic events.
Surface wave methods are based upon the inversion of dispersive characteristics of such waves,
obtained by the processing of full-waveform data. The acquisition step hence has to record the
effects of the Rayleigh wave propagation in the most complete way, reducing the presence of other
phenomena in raw data.
In general, the acquisition is the observation of the surface wave within a full wave field
propagation: as it has been mentioned, the source, the receiver type, number and positions, the
recording instrument can be very different. All these aspects influence the result: the data quality
depends of course on the acquisition, but the adopted layout and equipment also influence the
information that can be extracted by raw data.
The acquisition step is here discussed in the view of the data quality and of the effects of the
acquisition parameters on the information contained in the field data.
The acquisition of good quality data is one of the main aspects of all geophysical techniques, and
directly influences the final result. It is a rather diffused opinion that the acquisition of SWM is
quite simple, since surface waves are high energy, dominant events in records, and this is actually
quite true. However the analysis is performed in frequency domain, and the signal to noise ratio has
to be high in the whole frequency band where the information is needed, and this requirement is not
simple to be satisfied. Surface waves actually usually dominate in records, but only in certain
frequency ranges, mostly depending on the site, which sometimes are not sufficient to extract the
desired information.
Furthermore, when an inversion has to be performed, it is fundamental that the forward problem
considers all the factors that influence the inverted data. The acquisition parameters have then to be
considered carefully, in order to take into account all their effects and to properly design the test so
that the inverted data contain the maximum of coherent information and the minimum of coherent
noise.
The aim of the acquisition is observing and recording surface waves in time and space. The limits of
the acquisition are due to physical noise and to intrinsic observation limitations.
The former depends on the fact that the measured particle vertical velocity is supposed to be due
only to the source-generated propagating surface wave: actually other source-generated phenomena
are present, and moreover other types of noise (seismic noise non-source generated, electrical and
electronic noise) affect records.
For the latter, the observation of the propagation within a limited window in time and space and in a
limited number of discrete points induces other limits, and produces loss of information.
The acquisition should try to reduce the presence of external events in records, by an adequate
design and control of the experiment. The noise in records can be classified in random noise and
coherent noise, according to its origin and repeatability.
Figure 3.1. The amplitude spectrum of a noise with dominating frequency at 30 Hz. Other isolated peaks are present.
Frequently peak are at 50Hz and its multiples (60Hz in the USA) because of the electrical net.
Figure 3.3. The amplitude of the sum when it’s synchronized and with a certain phase difference.
Often the different shot gathers are summed by the instrument and recorded in a single file: the
preview of traces in time domain allows to roughly evaluate the data quality and to select a
sufficient number of stacks.
It has to be mentioned that to improve the signal quality by stacking it is strictly necessary to have a
perfect synchronisation of acquisition, and that the sum of time shifted data can even deteriorate the
signal quality: the trigger performance is hence very important in stacking.
A constant time shift does not affect neither the amplitude of spectra nor the phases; if all the
records are shifted by the same quantity respect to the shot instant, the result is identical: a pre-trig
can be used without negative effects, except an increase in noise energy (see 3.6.3).
On the contrary, if the trigger is not reliable and does not assure a perfect synchronization of
records, i.e. the delay is not constant, it is better to save the data in different files. In such a way,
data can be pre-processed, for instance cross-correlated to compute and correct the trigger shift
between the different records, and then they can be summed after being synchronized.
The S/N does not increase summing shifted traces (Figure 3.4): the amplitude of the f-k spectrum is
not affected by the phases of the traces, and the effect of such a stacking is simply an overall scaling
of the spectrum, like multiplying a single seismogram, the noise and the signal, by a constant value.
In an analogous way, if the stacking is performed in f-k domain, the complex spectra have to be
used: for the linearity of the Fourier Transform, the stacking in time-offset or in frequency-
wavenumber domain give exactly the same result.
Figure 3.4. Synthetic seismograms: the right seismogram is obtained summing time shifted traces: the time shift affects
only the phase, the amplitude of the f-k spectrum of the two is identical, except a scaling factor.
3.2.1.3 Use of the coherence function to evaluate the signal to noise ratio
Another reason to record separately the different acquisitions instead of stacking them directly in
the field, is the possibility to evaluate the signal to noise ratio comparing statistically the different
acquisitions.
One possibility, usually adopted in the 2-station processing, is the use of the coherence function.
The coherence function is a spectral quantity that allows a quantitative assessment of the signal
quality, and evaluates the presence of random noise in the signals. It is discussed in chapter 4.
Other approaches, such as the statistical analysis of the phase distribution (chapter 4), allow the
direct assessment of the signal quality as a function of the frequency and the offset, and requires
separate acquisitions of the different shots.
Figure 3.5. Examples of coherent noise/model errors. left) The best linear model has a great misfit with quadratic data.
right) Experimental data (dots) actually correspond to the second mode(solid black line). If they are fitted by a first
mode (grey dashed) there is great error even with a little misfit.
In SWM the subsoil scheme is a stack of homogeneous elastic layers: the model parameters are the
geometrical and mechanical parameters of each layer, and the modelling, with the algorithm
illustrated in chapter 2, allows to simulate the multi-modal surface wave propagation as horizontally
travelling plane wave. So, no other soil characteristics is accounted for , and no other phenomenon
is considered.
Actually scattered surface waves can have a great energy with great reflection coefficient, and
higher modes can be dominant, even in normal dispersive sites and even without abrupt variations
of soil properties. Moreover it has to be considered that surface waves should dominate record at
each frequency: this hypothesis is less true, and P-waves can, for instance, dominate a certain band
at high frequency (see figure 3.7). The separation of coherent noise can be therefore important.
The identification of other phenomena can be done, when multi offset data are gathered, by
coherency pattern, arrival time, velocity, attenuation.
In the following an overview of different types of coherent noise is given, showing experimental
data and discussing their properties.
Figure 3.6. Data acquired at a very soft site (a deposit of soft organic clay). The 24 traces seismogram (left) and the
last trace (right). The body waves can be clearly seen at early times. (P wave velocity due to the saturating water).
Sledgehammer, vertical geophones.
The other aspect refers to the attenuation. The geometric attenuation is different for the two
phenomena (square root of the distance for surface waves and square of the distance for the body
waves at the surface), and should make R-waves dominant in the far offset. This is the usual
situation, however the material attenuation can play a strong role in the P/R amplitude ratio.
In the example above the amplitudes of body waves are not important in comparison to Surface
waves amplitudes (seismograms with true amplitude, normalization of traces). In a similar situation,
the ratio between the amplitude of the different phenomena can be very different, and so can not
respect the usual assumptions.
Body wave damping is influenced by the saturating water, while the Rayleigh wave attenuation, in
such a material, is completely due to the shear wave damping (Viktorov, 1967): moreover P-wave
wavelength are much higher, and this reduces the attenuation.
The intrinsic attenuation of Rayleigh waves is hence much higher, and the dominance of body
waves with the distance is obtained. Fortunately, body waves can be here easily recognized and
filtered out, in all domains where data are analysed. The following example (Figure 3.7) shows data
gathered at a site with soft saturated organic clay, where P waves become dominant in far offset.
Figure 3.7. Field data. Seismograms with a evident body waves and then S waves and R waves. Saturated Organic
clays, hammer.
The P-wave velocity is 1600 m/s, the shear wave velocity, estimated by the inversion of the
Rayleigh wave dispersion, is less than 35 m/s. The Poisson ratio, consequently, can be estimated as
2 2
1 VP − 2VS
ν= = 0.4998
2 VP 2 − VS 2
and hence the Rayleigh wave attenuation is almost equal to the shear wave attenuation (Viktorov
1967; see section 2.1.1).
By the analysis of the experimental decay of the spectral amplitudes as a function of the offset, the
coefficient α of the expression e −α ( f )⋅x can be experimentally estimated, for the frequencies between
5 and 12 Hz, between 0.06 and 0.12, and the damping ratio varies in this range between 0.05 and
0.055: this values refers to a material that cannot be considered weakly dissipative (Lai and Rix,
2002).
It is possible to see that in this situation, the intrinsic attenuation compensate the different geometric
attenuation and make the P-wave dominant in the far offset.
It is possible sometimes to filter this source of noise before the processing procedure. In the this
example above, body waves can be recognised both in time-offset and frequency-wavenumber
domains. The following figure (Figure 3.8) shows the time-frequency analysis of the 10th trace of
the seismogram depicted above (Figure 3.7). The P-waves can be recognised for both the early
arrival time and for the higher frequency content respect to the R-waves. In figure 3.8b the trace is
depicted as energy density as a function of the time and of the frequency (see chapter 4.2.9.)
P-waves
a)
R-waves
P-waves
b)
R-waves
Figure 3.8. The time-frequency analysis (b) of the 10th trace(a) shows that the differences in time and in frequency
between R and P waves are present (P-wave: early times higher frequencies). R waves are dominant, but not at high
frequency (more than 30 Hz). (10th trace, 20 m from the source).
The presence of body waves can be important even at sites where the velocities are not so different.
The following example (Figure 3.9) refers to a site of sands and gravels.
P-waves
R-waves
Figure 3.9. An other example of a seismogram with strong body waves before the surface wave. The frequencies here
are quite similar, and the amplitude not so different.
Back scattered
R-waves
propagation gives two groups of waves, each one satisfying the equation of motion and the
boundary conditions, that can propagate independently one from the another (Viktorov, 1967). They
are the symmetric and antisymmetric modes, for which respectively the motion is symmetric or
antisymmetric respect to the mid plane of the plate (Figure 3.11).
Figure 3.11. Antisymmetric Lamb waves (up) and symmetric Lamb waves.
Both symmetric and antisymmetric Lamb waves have many modes of propagation, and they both
are dispersive; the first symmetric and antisymmetric modes exist at low frequency, and they tends
to the Rayleigh wave velocity at high frequency.
The excitation of the different modes depends on the spectrum of the source, on its position, and on
the excitation function of the plate itself: often the first antisymmetric mode dominates in a wide
range.
The Lamb waves are formed by a combination of multiple reflected, reinforcing SV waves when
VP>VL>VS , SV and P when VL>VP>VS (Ewing et al, 1957)
The example below shows the experimental seismogram (Figure 3.12a) and the dispersion curve
(Figure 3.12b) of Lamb waves in a concrete floor with void below: probably only the first
antisymmetric mode is involved
a)
b)
Figure 3.12. Seismogram (a) and dispersion curve (b) experimentally obtained on a concrete floor with void below
When a stiff top layer lays on a soft material, the boundary conditions can be close to that of Lamb
waves: nevertheless, the lower boundary is not free, and the model used for the Rayleigh wave
propagation in layered media rest adequate to describe the system, but also Lamb waves can exist.
In the example below, a stiff top layer (0.30 m thick) lays on a very soft soil (Figure 3.13): the
Rayleigh wave modal curves are depicted with the energy distribution (Figure 3.14).
Figure3.13.: A stiff top layer lays on a very soft layer (VS = 600m/s and VP= 1200 m/s on VS = 60 m/s and VP = 80 m/s)
Figure 3.14. The f-k spectrum with modal curves in f-k domain (left) and modal curve with apparent dispersion curve in
f-v(right).The Rayleigh wave solution exist for such a stratigraphy
The modal curves shown in figure 3.14.left with the energy distribution show that the energy is
concentrated at few frequencies. The array for which the spectrum has been computed is 48 m long,
and allows a good mode separation, as it can be seen in figure 3.14.right
The displacements associated to modes are illustrated in figure 3.15. The particle motion in the soft
layer is not negligible, while the half space is almost undisturbed.
Figure 3.15. The displacements as a function of the depth for the first three Rayleigh modes of the model of figure 3.13
For the same model, the Lamb solution for the first layer view as a free plate gives the Lamb modal
curves shown in figure 3.16 up to 2000 Hz. The Lamb wave solution has been obtained
implementing in Matlab the equations given by Ewing et al. (1957)
Figure 3.16. Lamb waves modal curves for a free plate 600m/s 1200m/s 0.3 m. The first two symmetric modes and
antisymmetric mode exist below 2000 Hz.
In most of cases, especially with pavements, the cut off frequencies of higher modes are high
respect to SWM acquisition, and so only the first symmetric mode and the first antisymmetric mode
can exist.
The first symmetric mode, at low frequency, tends to the velocity of longitudinal waves in two
dimensional medium, that is function of the Poisson’s ratio and is smaller than VP; at high
frequency tends to the Rayleigh wave velocity of the plate.
The first antisymmetric mode tends, at low frequency, to zero, and always increases monotonically
asymptotically to the Rayleigh velocity of the plate.
Lamb waves can be generated in SWM testing, when a velocity inversion is present, and are
identical to pure Lamb waves for wavelength greater than 7 times the rigid plate thickness. The
inverse dispersion associated to the first antisymmetric mode is easily recognized, and eventually
filtered.
The possibility of existence of Lamb waves, however, does not affect the presence of Rayleigh
waves: the two phenomena can superimpose, each one with its own energy.
Air blast
Figure3.17. A seismogram with evident air blast with a sledgehammer shot.(diagonal, from 0m-0s to 35m-0.1s).
The amplitude is comparable with P waves, the frequency is higher
The near field and far offset effects can be reduced respecting some empirical rules in acquisition,
or by a processing that considers these limitations (see chapter 4.3, 4.4).
Figure 3.19. The two opposite dispersion curves with significant differences (Field data)
The lateral variations can be identified also by comparison of the information extracted from
different portions of the array. When the overall f-k spectrum is computed as the sum of elementary
2 traces spectra (chapter 4.5.3), the degree of similarity among them can indicate the presence of
2D effects.
Another tool that allows the detection of lateral variations is the processing of local phase
differences between traces (chapter 4.4). The following examples (Figures 3.20, 3.21) show the
phase velocity for a fixed frequency in the different portions of the array.
Figure 3.20. Field data. Phase difference as a function of the offset for a single frequency. The good linearity of the
phase as a function of the offset indicate the absence of lateral variations.
Figure 3.21. Other field data. The phase difference as a function of the offset don’t have a good linearity. The variance
of the phases does not justify the variations, so lateral changes of the properties are suspected.
Figure 3.22. Field data. Seismograms (sledgehammer as source, 24 geophones), and its f-k spectrum.
Two modes are clearly recognized in the spectrum
The case here presented is quite interesting, since the two modes have almost the same amplitude in
a wide frequency range. In particular, the section at 27.5Hz, depicted in figure 3.23, shows the
presence of two modes with the same amplitude: the wavenumbers of the two modes can be
estimated from the position of the two main maxima of the spectrum.
Figure 3.23: Field data. A section of the f-k spectrum at 27.5 Hz shows the presence of two lobes, corresponding to two
modes of similar energy.
At this site, with the used array length, and with the adopted processing technique, the higher mode
is not a coherent noise, but on the contrary can increase or confirm the information carried by the
first mode.
At the same site, with the same receiver array, a different processing or acquisition can lead to a
different conclusion: the higher mode could be not identified and could even corrupt the
information of the first mode.
Considering the same frequency of figure 3.23 (27.5Hz), if a monochromatic signal is recorded, the
presence of the two modes is no more easily identified. The seismogram acquired with a vibrating
source (Figure 3.24, left) shows abrupt variations around 8 m and 16 m: the amplitude has an
oscillating behaviour, and the phase has jumps of π radians at the same offsets (Figure 3.24, right).
Figure 3.24: Field data. Left) Monochromatic seismogram, Right ) The spectral amplitude and the phase vs. the offset
It is quite easy to show that this is the correct behaviour due to the modal superposition. With a
spectrum like the one showed in figure 3.22, the energy travels with two phase velocities, and two
wavelengths. The two superimpose and the distance from the source influences the amplitude and
the phase: a scheme is presented in figure 3.25.
a)
b)
Figure 3.25. Scheme of the modal superposition with two modes at a single frequency. The phase difference between
the two modes increases with the distance from the source (a). The resultant (b) has a phase and an amplitude that are
strongly influenced by the superposition.
If we consider a simplified spectrum with two spikes at the two wavenumber (0.7606 and 1.4477
rad/m), it is easy to see that the phase difference between the two modes varies with the distance.
The wavenumber is the phase difference occurring in 1m, and so the amplitude has a periodic phase
difference that increases of k1-k2 radians per meter, i.e. 0.6871 rad/m. Hence the amplitude is
periodic with period 2π / ∆k = 9.15m , and the phase rotates every 9.15m.
As a test synthetic data are generated, without intrinsic attenuation and corrected for the geometric
spreading, and they are depicted in figure 3.26.
Figure 3.26. Synthetic monochromatic seismogram (left) and trace amplitude and phase as a function of the offset
(right) . The data are obtained with two identical modes with different wavenumbers.
It is evident that if the presence of higher modes is not recognized and only few traces are collected
and processed, the errors induced by higher modes can be dramatic in the phase velocity and
attenuation estimate. This is one of reasons to prefer a multi station approach.
The importance of higher modes for a correct interpretation of the SWM is discussed in detail in
chapter 5.
3.2.3 Filtering
A pre-processing phase of filtering can help in noise removal.
The filtering is the separation of data in its components, using the differences in frequency, velocity,
wavenumbers: dealing with a two variable process, the differences between the two variables can be
used to enhance the signal relative to the noise. The two dimensional impulse response function of
some theoretical filters in 2D (see for instance Buttkus, 2000) are shown in figure 3.27.
The pass band can be, for 1D filters, in frequency, in wavenumber or in velocity, and for 2D filters
in limited region of f-v, f-k, v-k spaces
Figure 3.27. Frequency- wavenumber response functions for different types of filter: (a) frequency filter (b)wavenumber
filter (c) velocity filter (d) f-v filter (e) f-k filter (f) v-k filter. (adapted after Buttkus,2000)
These filters can be effective when the event to remove or to enhance is separated in one of the
mentioned domains.
Frequency filtering can be used to filter out noise, when it affects only a limited range of
frequencies. Nevertheless, since the analysis and the processing are conduced in frequency domain,
the filtering operation has a reduced utility; since it is possible to discard directly the information of
noisy frequencies.
Velocity filtering can be useful to remove coherent events of velocity different from the surface
wave velocity, but the random noise often has apparent velocities in a wide range, and hence it is
not possible to remove it all. If there is not a single source of noise the velocities can be positive and
negative and very different.
The example below shows an application of a 2D filter in f-k domain. The same data shown above
(Figure 3.22) are processed to remove the higher mode. The original spectrum is shown in figure
3.28 left: the portion of the spectrum corresponding to the higher mode is muted with a smoothed
edges filter, then the obtained spectrum is inverse-transformed to give the filtered seismogram of
figure 3.29 right.
Figure 3.28. Left) original f-k spectrum right) Filtered spectrum, the higher mode has been blanked
Figure 3.29. Field data. Left) Seismogram Right) filtered seismogram obtained by the spectrum of figure 3.28right
The energy associated to each mode depends on the site but also on the type and depth of the
seismic source. The modal velocities are a characteristic of the mechanical layered system and
modal curves are continuous and regular functions of model parameters.
The wavefield associated to surface wave propagation, in a layered medium, can be written in terms
of mode superposition as (Aki and Richards, 1980)
∞
1
s ( x, t ) =
2π ∫ ∑S
−∞ m
m (ω , x)ei (ωt − k m (ω ) x ) dω
This introduce a series of limitations that have important consequences on the inferred dispersion
curves. The following section about the spatial sampling discuss the effect of the parameters on the
dispersion curve, and is useful to give indications for the design of the acquisition.
These sampling issues can lead to a quite irregular and misleading behaviour of the experimental
dispersion curve if they are not correctly assessed in the design of the testing array.
Moreover, this limitations have to be taken into account also in the inversion. Only the modal
curves should be considered a characteristic of the site: the experimental dispersion curve is
strongly affected by mode superposition effects and depends also on the acquisition parameters, it
can then be considered as an apparent curve. Since the dispersion characteristics are not an intrinsic
property, to correctly compare synthetic and experimental data within an inversion algorithm, it is
necessary to use a modelling that reproduces closely the testing procedure, taking in account the
acquisition parameters (chapter 2).
To evaluate the ideal total information carried by surface waves one should analyse modal curves in
a certain frequency range: Rayleigh waves can exist only for the velocities of modal curves.
Time sampling (in t) is a minor concern since the sampling frequency with all instruments can be
set high enough for surface waves usual frequencies and window length can be easily increased
(particularly when using vibrating sources the actual signal length can be increased too).
On the other hand spatial sampling (in x) can pose some problems mainly due to: the wavenumber
resolution, the maximum achievable wavenumber and the effects of spatial windowing.
A A
k1 k2 k3 k k1 k2 k3 k
Figure 3.32. Global effect of sampling on f-k spectrum. An ideal section for a fixed frequency (left) and the real
situation (right).
-6
x 10
0
0.2 0.4 0.6 0.8 1 1.2 1.4 1.6
wavenumber [rad/m]
Figure 3.33: Spectra calculated with 3 array length (24, 48 , 96 m). Two picks are present but they both are retrieved
only by large arrays. (blue: 24m, magenta: 48m, red: 96m).
Two aspects have to be discussed separately: the actual wavenumber resolution depends on the
array length, which conditions the shape of the spectrum of the window, and the discretization of
the spectrum which depends on the numerical impossibility of a infinite zero padding. It can be
demonstrated that the f-k transform gives a correct position of maxima only when it is computed
with an infinite zero padding: all finite number of samples give a discretization of the spectrum.
The interpolation does not change the actual modal separation capability of the record, while great
importance has the macroscopic effect of the actual array length, as shown in figure 3.34. Two
spectra have been computed for different array lengths for a stratigraphy in which higher modes
play an important role. The obtained synthetic dispersion curves show that modal superposition
leads to a dispersion curve not corresponding to any real modal curve, because of insufficient
resolution. The absolute maximum of the spectrum is due to the superposition of the contributions
of two modes, and hence the resulting dispersion curve shows a smooth passage to higher modes
with increasing frequency (Figure 3.34, right). When the array length is sufficient to separate
modes, the maximum follows with jumps the mode with the highest energy at each frequency
(Figure 3.34, left)
Figure 3.34 : Two dispersion curves are computed for the same stratigraphy using different array length: 48m (left),
12m (right), with the same zero padding (2048 traces).
It is hence easily shown that, consequently, the dispersion curve, computed by picking maxima of
the f-k spectrum, also depends on sampling parameters and is not only a characteristic of the system
itself: it is an “apparent dispersion curve”. This misleading identification of the true dispersion
characteristics is due to the finiteness of the spatial window. This problem is well established in the
literature concerning 2-station SASW test (Sanchez-Salinero, 1987; Gucunski and Woods, 1992,
Tokimatsu, 1995).
Indeed in that configuration, a single estimate of the phase velocity is obtained and it is affected by
mode superposition if higher modes are relevant in the propagation phenomenon. As a matter of
fact the limitations imposed by the 2-station technique can be interpreted from the above
considerations regarding multi-station methods. Indeed it can be shown that the processing adopted
for the 2-station method, based on the phase difference between the two signals, is formally
equivalent to an f-k analysis with infinite zero padding .
The real spectral resolution and the mode separation strongly change in the different frequency band
of the 2 receiver dispersion curve: the distance is reduced increasing the frequency, and so the
modal superposition occurs specially at high frequency.
Long arrays seem to be suitable, since increasing the spatial window, i.e. the array length, simplifies
the mode identification. On the other hand with a short array there are less problems with lateral
variations, S/N ratio, high frequency attenuation and, being fixed the number of channels, with
spatial aliasing.
In end off gathers, all the coherent energy travels in the positive direction and is associated to
positive wavenumbers. In the negative quadrant only noise and aliased events are present, so it is
possible to recover the aliased information laying between –kNyq and 0 with a 2kNyq horizontal
unwrapping without introducing additional noise (Figure 3.37).
The spatial aliasing produce indeed apparent negative velocities, that can be identified also in raw
data (Figure 3.36). In the example below (Figure 3.36), a monochromatic signal is produced by a
source in x = 0, but the low velocity produce aliasing and an apparent negative velocity
In the computation of the f-k spectrum, it is then possible to shift the negative quadrant and increase
the maximum wavenumber (Figure 3.37). This computational aspect shows that is possible to go
beyond the pure spectral limitations
Figure 3.37. The f-k spectrum computed from real traces (3m spacing, 24 geophones, weight drop). Spatial aliasing
recovering with a 2kNyq translation of negative quadrant.
3.5
2.5
1.5
0.5
0
0.4 0.6 0.8 1 1.2 1.4
wavenumber [rad/m]
Figure 3.38. Section of the f-k spectrum at a fixed frequency (red). The effects of zero padding consists of a better
maximum positioning (blue).
Figure 3.39 shows the dispersion curve obtained using the same synthetic traces and different
options for zero padding to compare the accuracy of maxima positioning. Note that when no zero
padding is applied it is not possible to obtain a reasonable estimate of the dispersion curve to be
used in an inversion process. This is due to the lack of accuracy in the detection of maxima
location: for a certain frequency range the same location of the maximum is found, hence the
estimated velocity (ω/k|max) is a linear function of frequency in that range.
Figure 3.39. Synthetic dispersion curves obtained with different zero padding applied to the same traces: 2048 traces
(continuous blue line), 512 traces (green dots), 24 traces, that is no padding, (red asterisks).
The same considerations can be made about field data (Figure 3.40): The following figure shows
the f-k spectra of the same real seismogram, computed without zero-padding(left) and with a 2048
zero-padding
Figure 3.40: On real data the effects of zero padding in space to increase the interpolated resolution in wavenumber
and secondary lobes. Spatial windowing introduces a fictitious spreading of energy in the fk
spectrum due to the main lobe of the window spectrum and also the secondary lobes often prevent
higher mode to be identified, hiding the energy maxima associated to them (Linville and Laster,
1966).
This effect can be reduced using an appropriate windowing in the space domain, for example a
Hamming window. In Figure 3.41 the spectrum on the left is computed using a box window, the
one on the right using a Hamming window. A generalized Gaussian window, without ripples, help
in discriminating secondary maxima due to higher modes and those due to ripples.
Data are acquired obviously without any special windowing, this aspect refers more to processing:
the details of windowing will be discussed in chapter 4.
Figure 3.41. The f-k spectra computed from synthetic seismograms. Using a box window (left) evident ripples are
produced. Using a Hamming window (right) the ripples have lower amplitude, but the main lobe is larger.
As it has been mentioned above, many techniques have been proposed for the acquisition of surface
wave data. The parameters that characterise the acquisition are: the number and position of the
receivers; the array length and the spacing; the offset of the source; the kind of receivers and their
response; the kind of source and the signal that can be produced; the parameters of the time
sampling. In the following the different parameters are discussed, and the equipment that can be
used.
Nevertheless the real limit is often due to the attenuation of the high frequencies.
Figure 3:45. With two traces, given a distance, the velocity is computed from the time shift, and hence no limits on the
maximum wavelength are imposed.
The time shift can be computed for instance by means of the cross correlation between the two
traces, and the smaller identifiable time shift is equal to one time sample. For a fixed frequency
f * , if the spacing between the two traces is ∆X and the sampling rate is ∆t , the maximum
velocity is
∆X
VMAX =
∆t
and the maximum wavelength is
∆X
λ MAX = VMAX / f * =
f * ⋅ ∆t
There is no upper limit on the wavelength imposed by the array length, and it is worthwhile to
mention that the maximum wavelength depends more on the site conditions, and the frequencies
that can propagate, given also a source spectrum.
These considerations don’t want to suggest in general the use of short arrays: on the contrary, as it
will be shown in chapter 4, short arrays produce higher uncertainties in the estimate. Intuitively, the
greater is the array length, the greater is the phase difference given a frequency and its phase
velocity: the estimate will be less sensitive to the errors on the phases. However, for a given
receiver layout, it is possible to perform the analysis considering also wavelength greater than the
array length without violating any sampling theorem: this would not be possible trying to guess the
wavelength from the analysis of the velocities at different offsets, simultaneously in a single time
sample.
As an example a real data set (Figure 3.46) and its processing are shown. By processing both the
whole 24 traces seismogram (46 m long) and a shorter subset of 6 traces (10 m long), one gets
similar spectra: the search of the maxima of the spectra gives the two dispersion curve depicted in
figure 3.46 (right), which reaches similar wavelengths. In particular, with the short array, a
wavelength more than 4 times greater that the array length can be analysed. The uncertainties of the
wavenumber, however, are 20 times greater (see chapter 4)
Figure 3.46. On the left the whole seismogram and (grey coloured) a subset of six traces; on the right the
corresponding dispersion curves in wavelength-phase velocity (solid line: whole seismogram. Dots: subset of 6 traces)
Figure 3.47. Frequency attenuation, in dB, with distance from the source. To record the high frequencies short
distances from the source have to be used
The example below shows two data sets acquired at the same site, with the same receiver layout,
changing only the shot position. The source offset equals the receiver spacing, that is 2m, in the
first case (figure 3.48, left), and is 12 m in the second case (Figure 3.48, right). In this way half of
the seismogram can be overlaid, to correct eventual shifts.
A series of differences are present between the two acquisitions. The shorter source offset introduce
in the seismograms, and also in the f-k spectra, more high frequencies.
Part of this high frequency energy is carried by body waves, and actually introduces noise in the
spectrum: but it can also be noticed that first mode can be followed up to higher frequencies (Figure
3.49, left). The two dispersion curves are almost identical in the whole range, apart a greater noise
in the high frequencies for the acquisition with the shorter source offset.
Figure 3.48. At the same site, with the same receiver layout, two shot positions are acquired.
The left seismogram has a 1m source offset, the right a 12 m source offset.
Figure 3.49. The two f-k spectra are depicted (normalized amplitudes).
The left correspond to 1 m source offset, and has higher frequencies.
Figure 3.50 Different strategies for the acquisition: a) use of different spacings b) moving the source c) moving the
receivers d) non even spaced array
Figure 3.51. The two seismograms of figure 3.48 are re-attached. The second one is shifted 12m, and a 36 trace
seismogram is obtained. The traces from 13 to 24 are present in the two records
Figure 3.52. Example of nonuniform array. From this 24 channel record 3 set of uniformly spaced traces are extracted.
Figure 3.53 . Three sets of traces can be extracted (left), and the corresponding f-k spectra are computed(right)
The processing of non uniformly spaced array can be performed even without extracting subset of
uniformly spaced traces, with the algorithm illustrated in chapter 4.
Figure 3.54. A single trace with 1.9s of pre-trig window. Before 1.9 s only noise is present, and this pre-trig seems
quite silent: the overall signal quality is quite good.
Figure 3.55. Time frequency analysis of the trace of figure 3.54. The three figures depict the
energy density as a function of the time for three frequencies. It can be seen that at
low frequency the signal level above the noise is not very high.
After the acquisition parameters have been designed, the test is performed. The site should be flat,
free of obstacles, and the geology one-dimensional. In active measurements the surface wave is
generated with an active source, the vibrations are detected by sensors and acquired with a digital
recorder (Figure 3.56). The technical characteristics of the three main elements of the recording
equipment affect the quality of data and the range in which the dispersion curve can be evaluated.
a bedrock (450 m/s of shear-wave velocity). The low frequencies have very little energy, and the
propagation is almost confined in the upper layers that acts as a waveguide. The figure 3.57 (left)
shows the site response in dB: it can be noticed that at 8 Hz the response is 100 times lower than at
15 Hz. The f-k spectrum shown on the right evidences the lack of energy in the low frequencies
Figure 3.57. Site response( left) for a site with a high contrast of velocity (100m/s 120 m/s, 450 m/s; 2m, 2m). The
synthetic f-k spectrum(right) shows the lack of energy in the low frequencies.
The field data example below (Figure 3.58) shows the overall spectrum of two acquisitions at
different sites with the same source (sledgehammer on plate, 20 stacks to average the differences)
Figure 3.58. Spectrum of the signal produced by the same source at two different sites (normalized amplitudes)
Nevertheless the use of an adequate source is necessary, in combination with the design of the array
length.
It is important to mention that the low frequency content of the signal itself is not sufficient to have
a great penetration depth: for instance, we can consider the two cases depicted in the figure 3.58.
In the case where low frequency are present, the velocities are low and the wavelength is relatively
small (20m); in the other case, to the greater velocities correspond grater wavelengths (80m).
In general the signal to noise ratio in SWM acquisition is less critical than in other seismic methods,
since the energy associated to Rayleigh waves is greater. It is possible to gather good quality data
even at very noisy sites as urban areas or industrial plants (Xia et al., 2002).
Figure 3.59. Spectra of the signals recorded at the same site with 8 kg sledgehammer, 10 stacks (solid blue line) and
drop weigth 130 kg 3m (red dashed line)
Other impulsive sources use bullets propelled by small charges, shot in a metal pipe inserted in a
small hole in the ground (minibang). The energy is greater than the one of a sledgehammer and the
frequency content is similar.
Explosives located in holes can produce enough energy in one shot: there are often problems for
authorizations and the need of licensed explosive-handlers. An other important aspect about the use
of explosive in holes is the depth of the source, that has to be considered in shallow surveying.
The following example shows the seismograms and the spectra acquired at the same site, with the
same receiver array, using two different sources. Figure 3.60 (left) shows the seismogram acquired
with a sledgehammer, figure 3.60 (right) the seismogram acquired with 100g of explosive in hole
(1m deep). The spectra are depicted in figure 3.61
Figure 3. 61: Spectra of the two signals (black explosive, grey sledgehammer)
Figure 3.62. Synthetic seismograms Source at the free surface(left) and at 1m depth(right).
Same model (200m/s,500m/s; 2m), same array layout, identical frequency content
of the source(flat band in 5-100 Hz): only Rayleigh waves are simulated
The modal curve are of course the same, being a characteristics of the site stratigraphy: if the
positions of the maxima of energy are plotted in frequency-phase velocity domain together with the
modal curves, the different energy distribution can be noticed (Figure 3.63). In the highest
frequency band with the buried source, the apparent phase velocity corresponding to the maximum
of energy follow the second mode (Figure 3.63 right), while with the source at the surface it follows
the first mode (Figure 3.63 left).
This little difference can be very important, because the apparent curve often is the only
characteristics that can be extracted from experimental data, and so the influence of the source
position should be taken into account.
Figure 3.63.The modal curves and the apparent curves, with source at the free surface and source 1m deep
Also the overall energy distribution as a function of the frequency is affected by the position of the
source: the amplitude spectra at 10 m from the source for the two source depths are depicted in
figure 3.64right, the source spectrum which is flat below 80 Hz is also shown (Figure 3.64, left).
3.5.3.1 Sweep
A sweep signal is a non stationary function that can have the general form
A = A(t )sin (2πf (t ) + ϕ )
A linear sweep has a linear increase of the frequency versus time, that is
f (t ) = c ⋅ t
With such a frequency function, the different frequencies are differently sampled, i.e. a different
number of periods is recorded for each of them. With a discrete sweep, if a constant number of
period is recorded for each frequency, the interval related to each frequency is the inverse of the
frequency, and so a logarithmic series of intervals is used (Figure 3.65).
Figure 3.65. A discrete sweep in time frequency domain: two period for each frequency are recorded
To better sample the low frequencies, a logarithmic or quadratic sweep are preferable.
A logarithmic sweep has the following function of frequency with time
f (t ) = f 0 + 10 b⋅t
a quadratic sweep has the following function of frequency with time
f (t ) = f 0 + b ⋅ t 2
A sweep can be used to record a wide frequency range in a fast way: the amplitude at the different
frequencies can be quite different, depending on the site response and on many factors.
It is interesting to notice that when geophones are used, the vibration velocity is recorded: the
response curve of receivers, and so the flat band correspond to the velocity: the flat band of the
source often refers to force, and hence acceleration.
When recorded signals are depicted in time domain, even at small distance from the source, and
even in the theoretically flat band of the source, very different amplitudes are observed: a linear
sweep of force signal at constant amplitude produces a velocity signal which has not a constant
amplitude, but an amplitude which decreases with the frequency. To have a constant velocity a
linear increase of the acceleration amplitude has to be used (Figure 3.66)
Figure 3.66. Synthetic signals: linear sweep in acceleration (left) and corresponding velocity(right).
A constant amplitude velocity needs a linear increase of amplitude in acceleration
Figure 3.67. A recorded signal with a sweep. The source spectrum, the site and
the receivers are responsible for the amplitudes.
Sweep Signal
Noise
Figure 3.68. Time frequency analysis of a single trace, with a linear sweep of 15s between 5 and 40 Hz
Figure 3.69. Raw data with monochromatic source (30Hz)(left), and narrow band-pass filtered (right).
Note the jump on the tenth trace, due to the presence of two modes.
Different monochromatic signals can be stacked on the same file: in time domain the signal will
appear blurred, as in the stacking procedure the phase is random, but in frequency domain the
energy will emerge clearly (Figure 3.70). This procedure, nevertheless, deteriorates the signal to
noise ratio of individual records. The signal at a specific frequency is present in one file only, while
the noise at that frequency affects the whole set of stacked files.
Figure 3.70. Spectrum of a trace made up stacking 5 monochromatic signals (20, 25, 30, 35, 40 Hz).
The amplitude refers to the velocity of vibration from a geophone.
The main advantage on the sweep is that the time duration of the signal and the noise is the same,
and so the signal energy at each frequency is not diluted in a larger window. In the frequency band
where the data quality is critical this advantage may be very important.
With a given stationary instantaneous SNR, the SNR of the recorded signal increases with the
record length: if the noise sources are random, the length of the signal act as a sort of stacking.
Hence long record have a better signal quality.
A signal enhancing technique that can be used to increase the signal to noise ratio is an internal
stacking applied to each file. A single trace is divided into portions of the same length
corresponding to an integer number of periods of the frequency that want to be enhanced: this
guarantees the synchronization in the sum of the different portions. In the example below (figure
3.71), with a 4 Hz signal, even if the initial phase is not known, the partitioning into 0.5 s windows
produces 8 synchronised signals.
Figure 3.71. A coherent stationary signal is divided in sub-window containing a fixed number of periods.
The other frequencies, unless the multiples, are not enhanced: neither the noise is enhanced, if it is
not stationary (Figure 3.72). Moreover this allow the selection of the portions of the time windows
where there is less noise: this selection of the sub-windows to be stacked allow to discard impulsive
noise events.
The figure shows a single interval of the original seismogram (Figure 3.72, left), and the result of
the stacking (Figure 3.72, right).
Figure 3.72. A monochromatic signal is split into sub-windows which are stacked, after removal of noisy sub-
windows.. The signal quality depends on the time duration of the signal
3.5.4 Sensors
Figure 3.73. Theoretical response amplitude and phase response for different damping factor
The damping strongly influences the curve, and it is recommended to set it such as the response is
as much as possible flat in the frequency band of interest. The damping of a geophone can easily be
modified by inserting a shunt resistor across the coil terminals: the current circulating in the coil,
depending on the shunt resistance, produces a magnetic field opposing to the coil movement and
damps it. Hence the shunt resistor is chosen such that the damping factor is about equal to 0.6, to
flatten the amplitude above the natural frequency within a large band. The loss of signal output that
results is easily made up by additional gain of the amplifier.
The real response curve that can be measured on a shaking table is different, and measures also all
the aspects that make a real geophone different from an ideal forced damped oscillator. If we look at
an experimentally measured response curve (Figure 3.74), at higher frequencies, the geophone
records can be contaminated with spurious resonance noise (due to undamped motions of the
geophone’s coil normal to the principle axis or rotation or noise in the spring).
Figure 3.74. Spurious frequencies for a 100Hz geophones tested on a shaker (thanks to L.Sambuelli et al)
The receiver response is important because it influences directly the recorded signal srec. In
frequency domain the recorded signal is the product of the complex spectrum of the true signal Strue
by the receiver complex spectrum R.
S rec ( f ) = S true ( f ) ⋅ R( f )
that is, considering the amplitude A and the phase ϕ ,
Arec ( f ) = Atrue ⋅ AR
ϕ rec ( f ) = ϕ true + ϕ R
The spectra of the different receivers affect hence for both the amplitude and the phase of the
recorded signals. The amplitude are attenuated following the receiver response curve, but this does
not influences the inferred phase velocity, which is estimated from the phases. Geophones can be
used also out of their flat band, and below their natural frequency, getting attenuated signals.
The phase of the receiver does not have a strong effect if the receivers are all identical. Otherwise it
introduces an error in the recorded phases differences, and an error in the estimated phase velocity.
If we examine two monochromatic traces, if two receivers are used, the real phase lag is modified
of a quantity equal to the difference of the phases of the receivers at that frequency
∆ϕ1, 2:rec = ∆ϕ1, 2:true − (ϕ R1 − ϕ R 2 )
The difference of the phases of the receivers, if they are the same receiver type, is zero if the
theoretical curves (Figure 3.73) are considered: when the receiver are tested, it can be seen that the
phase difference can be also important. The following figure 3.75 shows the phase difference as a
function of the frequency for two geophones tested on a shaking table (Experimental data: courtesy
of Luigi Sambuelli).
Figure 3.75. Phase difference between two 100 Hz tested geophones. Note the increase of
phase lag close to spurious frequencies
The additional phase lag due to the receivers affect the phase velocity, which can be computed as
2π ⋅ ∆X ⋅ f
V = , where f is the frequency, ∆ϕ is the phase difference, ∆X the distance
∆ϕ
and the uncertainty on V can be estimated as
2π ⋅ ∆X ⋅ f
σV = ⋅ σ ∆ϕ
∆ϕ 2
It can be seen that the uncertainty can be high for little phase differences, and so increasing the
spacing D is necessary with great wavelength: increasing ∆X increases also the true ∆ϕ , but it
appears with power 2. (the true ∆ϕ indeed can be obtained as ∆ϕ = k ⋅ ∆X )
For example, velocity of 600 m/s at 150 Hz and 1m spacing would give ϕ = π 2 . If the error is 10°,
i.e. π 18 , the obtained velocity is 675 m/s. This effect is strongly reduced when a multi offset
record is acquired, and the single phases are considered simultaneously (chapter 4).
3.5.4.2 Coupling
The coupling between geophones and soil is important for signal quality: removing the uppermost
layer to improve coupling is necessary for some applications. In SWM acquisitions the high energy
of the signal allows a fast positioning of receivers.
On pavements and hard ground the use of steel plate allows to simply lay the geophones on the
ground without deteriorating the signal quality too much. A series of test, comparing the signals
from geophones planted whit spikes and laid on the ground with base plate, showed very similar
signals
When other connecting systems are used, the risk of introducing spurious frequencies is high. A
strong resonance is evident in data of figure 3.76, acquired with geophones coupled to the wall of a
marble pillar with an L-shaped metallic support.
Figure 3.76 Experimental seismogram acquired with geophones fastened on L shaped steel support.
The resonance of the support introduces a ringing in every signals.
Figure 3.77. Velocity composition with a vibrating source(left) and an impulsive source(sledgehammer)(right)
Processing
This chapter describes the basic principles of the processing of surface wave data, presents and
compares the different approaches and processing techniques, and finally gives the details of the
‘f-k’ processing and of the ‘Multi Offset Phase Inversion’. The basic common principles of the
processing techniques are presented. Both the two receivers and multi-offset techniques are
described to give an overview on different processing techniques. A comparison shows that many
4
of them are equivalent, and is based on the sensitivity to different kind of noise and uncertainty of
data. The main differences depend on the acquisition parameters. Then two different processing are
presented in detail: first the multi offset phase inversion, which allows a robust estimation of the
phase velocity and attenuation considering also the different uncertainties. Then the details of the f-
k processing are discussed, and the algorithms are presented: this is the processing technique that
have been finally selected.
In this chapter
Basic Principles of the processing of Surface Wave Data
An overview of the different existing approaches and techniques: the concepts and the main
relations of 2 receiver and multi offset approaches are presented
The comparison between the different processing techniques: sensitivity to different kind of noise,
mode separation possibility, coherency with a simple model for the inversion, logistic
The Multi Offset Phase Inversion: statistical evaluation of the uncertainty of the experimental
phases, weighted estimate of the wavenumber, test of the model.
The details of the f-k processing: computation of the spectrum, muting, padding and different
windowing, search for maxima, spectrum deconvolution for the separation of superimposed modes.
The f-x filtering before the computation of the f-k spectrum.
Figure 4.1. The processing estimates the dispersion curve from raw seismic data
The data that are processed are full-wave records: the phase velocity of Rayleigh waves at each
frequency has hence to be estimated from data that contain also the effects of other seismic events.
Two operations are usually needed in the processing of full-wave form data: the selection of the
events to analyse, and the identification of their properties (velocity and attenuation as a function of
the frequency).
The usual assumption is that the main events, at each frequency, are related to the propagation of
the Rayleigh waves, which are then supposed to be dominant. Under this hypothesis, the phase
velocity at each frequency can be obtained measuring the distances and the travel times, the latter
obtained for instance by searching the position of the energy maxima in data.
Some processing techniques transform the whole data set from the time-offset domain where it has
been acquired into domains where the energy is mapped as a function of other parameters, such as
frequency, slowness, wavenumber. Local maxima of energy density can hence be identified in the
new domain, and they correspond to different seismic events that are present in the record. The
coordinates of the energy density maxima, in the new domain, give the properties of the
propagation of the corresponding event.
Some methods of processing allow a direct evaluation of the signal quality, in order to select the
more reliable information. Some processing techniques allow a weighting of traces and frequency
ranges according either to their energy or to a statistical evaluation of the signal quality. Sometimes
this is implicit, i.e. the traces with higher energy have a greater weight in the overall result,
sometimes, as in the phase inversion, each trace is evaluated, at each frequency, to compute the
variance to be used as weight.
The choice of the processing algorithm affects in some way the result even if it depends mostly on
the acquisition parameters. On synthetic data without any noise the result of all the processing
techniques should be the same. Important differences can be in the sensitivity to random and
coherent noise, in modal separation possibility (and consequently amount of extracted information),
in the possibility of a simple procedure for a modelling coherent with the test.
4.2.1 Overview
Surface waves have been used for the characterisation at very different scales: from the ultrasonic
tests (Viktorov 1967) to the continental studies of the earth crust and upper mantle (Ewing et al,
1957; Dorman et al, 1960; Dorman and Ewing, 1962; Bullen, 1963; Knopoff 1972; Kovach 1978).
Different processing techniques have been used to measure the phase and group velocity from data,
and to infer the attenuation. Several parameters affecting the processing depend on the application
that is considered: the required resolution, the amount and quality of data, the frequencies that have
been acquired, the complexity of the model that has to be reconstructed and so on.
Multiple filter analysis, introduced by Dziewonski et al. (1969) has been used for the determination
of the group velocity as a function of the frequency from a dispersed wavefront. Herrmann (1973)
studied the use of multiple filter analysis to estimate the spectral amplitudes of various models. The
use of the cross-power spectrum of two-station data (Dziewonski and Hales, 1972) has been
adopted by many authors working with the SASW approach. Nolet and Panza (1976) presented the
f-k wave field transform for an unambiguous investigation of higher modes, McMechan and Yedlin
discussed the use of another wavefield transform, the ω-p obtained from a slant stack. The
frequency time analysis (for instance Keilis-Borok et al., 1989), the correlation (Park et al., 1999),
and other approaches, also for multicomponents data (Glangeaud et al., 1999) have been used for
the processing of surface wave data.
In the following some of the techniques that can be used for the processing of surface wave data for
engineering purposes are outlined: the main aim of the following overview on processing methods
is discussing the different tools in the view of the comparison that will be carried on in section 4.3.
After the comparison, two processing technique will be presented in detail: the f-k processing and
the multioffset phase difference inversion.
The different algorithms that are discussed below have been implemented in Matlab codes: this
made possible a series of tests on synthetic data and real data. In the computer program that have
been implemented for the processing of surface wave data, many of the algorithms have been
included, even if the approach that has been finally selected is the f-k processing.
is moved away from the source along a radial direction: points with zero phase difference from the
source are searched.
The distance between two points in which the recorded signals are in phase is assumed to be equal
to the main wavelength at the energising frequency, and hence to the wavelength associated to the
relative harmonic of surface waves (Figure 4.2). Different positions are measured, and either the
mean distance is considered or a regression on the sequence is performed (Figure 4.3).
From the estimated wavelength value, the velocity is computed as
V =λ⋅ f (4.1)
Repeating the process for different frequencies the dispersion curve over a certain frequency range
could be obtained. This technique joins acquisition and processing in the field work, because
directly estimates the phase velocity.
Figure 4.2. In the Steady state Rayleigh method the wavelength is directly measured in the field
the two signals observed at known measured distance. The phase difference between two points is
used to compute the wavenumber, and the phase velocity. The kind of processing that is performed
on the pairs of traces that are analysed strongly condition the acquisition.
The acquisition is performed with two receivers only, with spacing equal to the source offset:
several acquisition are performed for each geometrical configuration, to improve the S/N by
stacking. With a given configuration, only a limited range of frequencies can be obtained, mainly
due to spectral limitations, but also to attenuation and near field effects. Short spacing and high
frequency energising are used to infer the dispersion at higher frequencies, i.e. for shorter
wavelengths, great spacing and low frequency energising are used to infer dispersion at lower
frequencies, i.e. greater wavelengths. The receivers are then moved, to sample the whole dispersion
curve, according to different schemes: common receivers midpoint or common source (Figure 4.4).
It has to be mentioned that the investigated portions of soil depend on the position of the receivers
and on the distance between them.
Figure 4.4. The common source array(left) and the common receiver mid point array(right) for the 2station sasw test
The source position can of course be reversed, and this should reduce the effects of the phase
distortion of the measuring system (see chapter 3).
The phase difference can be computed directly from the two complex spectra
y1 (t ) →
F
Y1 ( f ) = A1 ( f )e iϕ1 ( f ) (4.2)
y 2 (t ) →
F
Y2 ( f ) = A2 ( f )e iϕ 2 ( f ) (4.3)
∆ϕ = ϕ2 - ϕ1 (4.4)
and it is, actually, the phase of the cross-power spectrum
Y12 ( f ) = Y1 ( f ) ⋅ Y2 ( f ) = A1 A2 e i (ϕ 2 −ϕ1 ) (4.5)
This latter was frequently used in two-receiver SASW since the adopted instruments gave it directly
as result of the acquisition-processing
Estimate of the velocity
Given two receivers at a known distance ∆X , the measured phase difference is 2π when the
propagating wave has a wavelength that equals the distance ∆X : this is the principle of the SSRM.
More in general, the ratio between the phase difference and 2π equals the ratio between the
distance and the wavelength.
∆ϕ / 2π = ∆X / λ (4.6)
If the distance between the receivers and the phase difference between the two corresponding
signals have been measured, the wavelength can be computed as
λ = ∆X ⋅ 2π / ∆ϕ (4.7)
Another way to show the same result, is to consider that the wavenumber is simply
∆ϕ
k= and the velocity is (4.9)
∆X
2πf
V = (4.10)
k
The velocity is then computed from the phase difference between the signals at the two receivers. If
the wavelength is shorter than the receiver spacing, the phase difference is greater than 2π , and
cannot be obtained directly from the analysis of data. All the points at a distance of Nλ + ∆X have
the same phase difference (Figure 4.6)
Figure 4.6. To a single phase difference ∆ϕ may correspond the distance ∆x or any Nλ + ∆x
When analysing a certain frequency range, either one has to be sure that the wavelength is greater
than the spacing, or the number of entire additional wavelength has to be known. Sometimes this
can be done unwrapping the phase on a wide frequency range (Figure 4.7). With a single
geometrical configuration, if the signal quality is good and the phase difference has a continuous
behaviour, the increase of the phase difference with the frequency can be followed up to values
greater than 2π . The phase of the cross power spectrum, from a modulo 2π can be unwrapped to a
full phase representation (Poggiagliolmi et al, 1982): each jump of the phase corresponds to a
wavelength. In the unwrapped phase difference, the low frequencies, with greater wavelengths,
have a small phase difference, while the high frequencies, with shorter wavelengths, have a great
phase difference.
Figure 4.7. By unwrapping the phase difference (phase of the cross power spectrum) also the wavelengths shorter than
the receiver spacing can be identified, after Foti (2000)
Figure 4.8. Example of spectral quantities of a couple of traces: (a) Phase of the cross-power spectrum (CPS); (b)
Coherence function; (c) Auto-power spectrum(APS) (first receiver); (c) Auto-power spectrum (second receiver).
(after Foti, 2000)
The coherence function evaluates for a couple of traces the ratio between the random component
and the coherent component, but it does not allow to identify the nature of the coherent part of the
signal.
Given two traces y1 and y2, Y1 and Y2 are the Fourier transformed signal, and
G11 = Y1 ( f )Y 1 ( f ) the auto power spectrum of the first trace (4.11)
Y122 =
∑Y 1n Y2 n ⋅ ∑ Y1 n Y2 n
(4.16)
∑Y 1n Y1 n ⋅ ∑ Y2 n Y2 n
If only one record is considered, the coherence function loses significance because it simplifies to 1.
If y1 and y2 are uncorrelated noise the coherence function, calculated with n pairs of signals,
converge to 0 as 1 / n . So the value of the coherence function decreases increasing the number of
summed shots. When the signal repeats itself identically the coherence function remain equal to 1
for any number of repetitions.
In real cases a value close to unity is an index of good correlation and hence of good data quality. In
a certain frequency range, the coherence function gives the idea of the contribution of coherent
signal and uncorrelated noise in the overall signal. The difference in the coherence function value
between good and poor quality frequency bands increases with the number of repetitions: the value
decreases slightly in the formers and strongly in the latter. The limit value, index of absence of
correlation, for n shots, is 1 / n .
It is interesting to notice that the coherence function evaluate the system, i.e. gives an idea of the
coherence of the system that relates the couple of traces: the coherence can be high even when the
signal are different, if the system response is linear.
If for example y2 is obtained from y1 via a LTI filtering operation
y 2 (t ) = h(t ) * y1 (t ) ⇒ Y2 ( f ) = H ( f ) ⋅ Y1 ( f ) (4.17)
Y122 =
∑Y 1n Y2 n ⋅ ∑ Y1 n Y2 n
=
∑Y1n H ⋅ Y1 n ⋅ ∑ Y1 n H ⋅ Y1 n
=
∑Y
1n H Y1 n ⋅ ∑ Y1 n H ⋅ Y1 n
=
H2
(4.18)
∑Y 1n Y1 n ⋅ ∑ Y2 n Y2 n ∑Y1n Y1 n ⋅ ∑ H ⋅ Y1 n H ⋅ Y1 n ∑Y
1n Y1 n ⋅ ∑ H ⋅ Y1 n H Y1 n H2
and does not depend on the series of acquired Y1 and Y2, but only on their relation: the signals can
be very different. The coherence function, so, does not judge the relations between the signals, that
can be completely independent from each other: if they repeat themselves with low variations the
coherence will be high.
where τ is the delay time and p is the ray parameter. It is a stack of the wavefield for each couple of
τ and p. In general the ray parameter p is defined, in a medium with vertical velocity v(z), as
sin(ϕ )
p= (4.20)
v( z )
where ϕ is the angle of incidence of the wavefront that change along the raypath. According to the
Snell law, the ray parameter p is constant along the ray. At the source it is
sin(ϕ 0 )
p= (4.21)
v(0)
On recorded signals, it can be determined from traveltime data of adjacent geophones if the velocity
at the surface is known.
s = v(0)∆t = ∆x sin ϕ 0 (4.22)
sin(ϕ 0 ) ∆t
p= = (4.23)
v(0) ∆x
So p is the reciprocal of the apparent velocity at the Earth’s surface. The delay time τ can be seen
as the intercept time rearranging t = px + τ and obtaining a straight line with slope p and intercept
time τ (Figure 4.9).
and so p can be obtained differentiating the traveltime curve, p = dt/dx. Hence the t-p transform of a
record in time offset domain, stacks the wavefield along a straight line of slope p, for each value of t
(Figure 4.10).
Figure 4.10. The τp transform sum the amplitudes of the signal over lines of slope p and intercept τ
A straight line in time offset has constant τ and p. Each linear event in the x-t domain with a slope p
and intercept τ maps to a single point in the τp plane. An inverse transform exists, it is hence
possible to go back into the initial t-x domain. In seismic prospecting the slant stack is widely used:
for the analysis of seismic velocities and filtering, for improvement of SNR and suppression of
multiples, for migration, inversion, and interpolation of traces increasing the spatial sampling rate in
the inverse transform. (see Yilmaz 1987, Buttkus 2000)
The separation and the analysis of the different types of waves is the most important application of
the τ−p transform in SWM.
For the computation, the minimum and maximum value of p (pmin and pmax), and the ∆p, have to
determined. pmin and pmax are easily estimated, ∆p depend on the frequency content of the signal: to
avoid aliasing the following sampling condition has to be satisfied
1
∆p ≤ being fmax and xmax respectively the maximum frequency and the length of the
2πf max x max
transformed data in the x direction. It is important to observe that the τ-p transform can be easily
computed using the two-dimensional Fourier transform, i.e. the f-k spectrum. (Buttkus, 2000)
∞ ∞
∫ ∫ u (t , x)e
−i 2π ( ft − kx )
U ( f , k) = dxdt (4.25)
−∞ −∞
∫ ∫ u( f , k )e
i 2π ( ft − kx )
U (t , x) = dkdf (4.26)
−∞ −∞
if U ( f , k ) is computed along a straight line k = fp for the fixed slowness value p, then
∞ ∞
∫ ∫ u(t , x)e
−i 2πf ( t − px )
U ( f , fp ) = dxdt (4.27)
−∞ −∞
we have
∞
U ( f , fp ) = ∫ u (τ , p )e − i 2πfτ dτ (4.30)
−∞
showing finally that the t-p transform can be determined by first transforming the wavefield u(x,t)
into the f-k domain and then calculating for each p-value the 1D inverse Fourier transform along a
straight line k = fp. The details of the algorithm for direct and inverse tp transform can be found in
Buttkus (2000)
In figure 4.11 it is shown a synthetic seismogram obtained with a simple wavelet propagating at a
constant velocity of 100 m/s. The slowness is 0.01 s/m, and also the time delay is depicted in the τ-
p panel.
The synthetic data of figure 4.12 show that the ω−p is an image of the propagation that, even with a
spectral smearing, show the position of seismic events in the new domain. The maximum of the ω-p
spectrum, in all the frequency range where energy is present, has a slowness of 0.01 s/m, that
correspond to the velocity of 100 m/s of the original data.
It can be demonstrated (McMechan and Yedlin, 1981) that the maxima of the new domain
correspond to the modal curves when they are the only events. If the wavefield is described, in
frequency-wavenumber domain by a relation
N (k , ω )
V ( x, t ) = ∫ dk ∫ dωe i ( kx −ωt ) (4.32)
D(k , ω )
where x is the offset, t the traveltime, k is the wavenumber, N(k,ω) is a function of the source and
D(k,ω) is the dispersion relation. Applying a slant stack to both sides we get
N (k , ω )
U ( p, τ ) = ∫ V ( x, τ + px)dx = ∫ ∫ dk ∫ dωe i ( kx −ω (τ + px )) dx (4.33)
D(k , ω )
Performing the x integration and the k integration we obtain
N (ωp, ω )
U ( p, τ ) = ∫ dωe −iωτ (4.34)
D(ωp, ω )
and finally, Fourier transforming τ,
N (ωp, ω )
U ( p, ω ) = (4.35)
D(ωp, ω )
In the transformed wavefield U ( p, ω ) , the points that satisfy the dispersion relation [D(ωp, ω ) = 0]
send U to infinity; we have maxima of U in those points, and so the dispersion curve can be found
searching these maxima.
The demonstration given by McMechan and Yedlin (1981) assumes a wavefield described by
equation 4.32, that is a wavefield only containing the desired dispersive event.
It can also be noticed that the transform has to be computed with an infinite zero padding when the
wavefield is recorded within a limited window.
This transform decomposes the original wavefield in its components at constant frequency and
wavenumber: the seismic data are transformed into an image of the energy density as a function of
the frequency and of the wavenumber.
This transform is used in seismic processing because it allows to separate and filter events having
different frequencies, wavenumbers and apparent velocities (Figure 4.13).
Figure 4.13. The f-k transform allows to identify and to filter different seismic events in records.
One of the main applications of the f-k transform is the f-k dip filtering. Different wave phenomena
are separated and can be muted and suppressed: in particular, in seismic processing, the ground roll
is viewed as high energy coherent noise, and its removal is an important step.
If data are analysed in f-k domain and maxima are picked, the filtering is not necessary, since the
surface wave is usually the main event.
Figure 4.14. The same synthetic seismogram shown above is here f-k transformed
The main limitation imposed on acquisition is that traces have to be uniformly spaced to use a
conventional spectral estimate: this may be overcome using other techniques (Duijndam, 1999).
where
e −α m (ω )x
S m (ω , x) = I (ω )Pm (ω )Rm (ω ) (4.40)
x
m is the mode number, I (ω ) the instrument response, Pm (ω ) the source spectrum, Rm (ω ) the path
response, α m (ω ) the attenuation coefficient.
If we have N aligned geophones with spacing ∆x , we can write the phase difference for the mode
m as k m (ω )∆x
The relation between the wavenumber, the phase velocity c and the slowness p, for the mode m is
ω
k m (ω ) = = ωp m (ω ) (4.41)
c m (ω )
If we perform the slant stack on the N traces
N N ∞
1 M
S (τ , p ) = ∑ s ( x n ,τ + px n ) = ∑ ∫ ∑ S m (ω , x n )e i (ω (τ + pxn )− km (ω ) xn ) dω =
n =1 n =1 2π − ∞ m =1
(4.42)
∞ N M
1
∫ ∑ ∑ S (ω , x )e
iωτ i (ωpxn − k m (ω ) xn )
= m n e dω
2π − ∞ n =1 m =1
Expanding S
e −α m (ω )xn
N ∞ N M
1
S (τ , p ) = ∑ s( x n ,τ + px n ) = ∫∑ ∑ I m (ω )Pm (ω )Rm (ω ) e iωτ e i (ωpxn − k m (ω ) xn ) dω (4.43)
n =1 2π − ∞ n =1 m =1 x
and compensating for attenuation one gets S independent from x
The influence of the geometric attenuation can be easily removed by multiplying each contribution
by the square root of the source-receiver distance and it will be therefore neglected in the following,
assuming that such correction is applied on the original data. As the material attenuation is
concerned, its contribute can be taken out from the expression of S, so that this last quantity
becomes a function of frequency only.
M ∞ N
1
S (τ , p ) = ∑ ∫ S m (ω )∑ e
i (ωτ +ωpxn − k m (ω ) xn ) −α m (ω ) x
e dω =
m =1 2π −∞ n =1
(4.44)
M ∞ N
1
=∑ ∫ S (ω )∑ e e i (kxn − km (ω )xn )e −α m (ω )x dω
iωτ
m
m =1 2π −∞ n =1
It is then easy to recognize in the above equation the integral of the fourier transform
∞
1
∫ S (ω )e
iωτ
S (τ ) = dω (4.45)
2π −∞
If we neglect the material attenuation contribution, it is evident that differentiating the quantity in
the square bracket in 4.46 with respect to k and setting the results equal to zero, the maximum of
the energy spectra is obtained for
k=km; (4.48)
Furthermore it can be shown that also if the above differentiation is performed without neglecting
the material attenuation the conclusion is the same, i.e. the accuracy is not conditioned by material
attenuation (Tselentis and Delis, 1998).
The details of the computation are shown in section 4.4, where the f-k procedure for the processing
of surface wave data is illustrated.
4.2.6 Crosscorrelation
The identification of the shift between signals is the main step of the processing: it can easily be
done by means of the crosscorrelation. The cross correlation is an operation that allows to determine
the relations between signals and to compare events occurring at different times. It is based on the
cross-covariance function, defined as the time average of the product of a function at time t (after
subtracting its mean value) and another function at t + τ .
The autocovariance is a statistical measure of the relationship between the different parts of a
random process, the crosscovariance function describes statistically the interdependence of two
processes (Buttkus, 2000)
The correlation is used for the determination of differences in signal arrival times, to evaluate for
instance the travel times of different waves. The vibroseis method also is based on this procedure: a
sweep signal is generated with a vibrator, thus many reflections and events can be superimposed in
recorded signals. The recording is correlated with the emitted signal to compute the travel times of
different events, being the autocovariance function of a sweep very short (Figure 4.15). More
details about the operator and its applications the can be found in Buttkus (2000).
Figure 4.15. The crosscorrelation easily identify the time shift even in noisy data. A sweep is depicted with its shifted
noisy copy: the crosscorrelation produce an evident peak.
The simpler application of correlation can be the determination of the time shift between two
monochromatic or narrow bandpass filtered signals recorded at different distances.
In correlation study, if the velocity is known to be positive, the time shift can be followed up to
more than a half a period, even if the maximum becomes negative.
Figure 4.16 Two time shifted sinusoids and their discrete crosscorrelation
where FZ is the dynamic force transmitted by the vibrator, and G (r , ω ) the geometric attenuation
law. The term Ψ (r , ω ) is the complex phase angle depending on the complex wavenumber, on the
velocity and on the intrinsic attenuation. The displacement transfer function can be computed
dividing the measured displacement by the measured force of the vibrator
U Z (r , ω )
T (r , ω ) = = G (r , ω )e −iΨ (r ,ω )
FZ (ω )e iωt (4.51)
When a single mode is assumed to be present, the dependence of the complex phase angle from the
offset become linear, and may be written as
T (r , ω ) = G (r , ω )e −iK (ω )⋅r
(4.52)
and the complex wavenumber K (ω ) may be estimated from the data (Foti et al., 2001; Foti 2000)
This approach is a powerful tool for a coupled estimate of the phase velocity and the attenuation,
but it needs monitoring the source, whose force depends also on its coupling to ground (Ven der
Veen et al., 1999)
where k m (ω ) is the wavenumber of the mth mode as a function of the frequency, Am is the
amplitude, considering the source and the path.
The expression can be rearranged separating the space dependent and frequency dependent part of
the amplitude as
e −α m (ω )x
s (ω , x ) = ∑ I (ω )Rm (ω ) e i (ωt − k m (ω )x +ϕ I (ω ))
m x (4.54)
The amplitude has a dependence on the input source I, on the response of the site R, on the
geometric attenuation and on the intrinsic attenuation.
This complex expression, for each mode, has amplitude
e −α (ω ) x
I (ω )R(ω ) (4.55)
x
and phase
− k (ω ) ⋅ x + ϕ I (ω ) (4.56)
and then, considering a single mode, for a single frequency, the amplitude and the phase have the
simple behaviours of figure 4.17
figure 4.17. Theoretical behaviour of the phase and the amplitude for one frequency as a function of the offset
When experimental data of amplitude and phase as function of the offset and the frequency have
been gathered, the two expressions can be used to estimate the unknown parameters of the
propagation. The source component of the amplitude can be unknown: anyway, even with
controlled sources, the ground coupling of the source can play a dominant role in the spectral
energy actually transmitted to the ground, and it is difficult to be evaluated.
The experimental phases and amplitudes that are needed to estimate the wavenumber (from
equation 4.56) and the attenuation coefficient (from equation 4.55) can be easily obtained by a
single Fourier transform of multi-offset data.
For a single frequency, with n receivers at the offsets
X = ( x1 , x 2 ,..., x n )
we will have two vectors of data: the amplitudes
A = (a1 , a 2 ,..., a n ) (4.57)
and the phases
Φ = (ϕ1 , ϕ 2 ,..., ϕ n ) (4.58)
An example of experimental amplitude and phases is given in the following figure 4.18
Figure 4.18. Experimental amplitude and phase, for the frequency of 20 Hz, with a 24 receiver array, sledgehammer
To get this result, different shots are collected separately, in order to allow a statistical comparison
of them in frequency domain, and the data can be filtered in frequency offset domain. Then the
phases at each frequency are considered: the phase difference is computed for each frequency and
each shot, and the statistical distribution of the phases is assessed. The experimental phases are
finally unwrapped to a full-phase representation, such as that of figure 4.18 (right).
For each frequency, the phase as a function of the offset (Figure 4.19) is then analysed: a weighted
least squares inversion is performed, to compute the velocity considering directly the data quality of
each trace. The weights have to be computed considering that high values of the phase variance are
related to lack of information.
Figure 4.19. Experimental phases vs. the offset, for six different frequencies. The higher frequencies, with greater
slopes, show higher uncertainties in far offset
Figure 4.20. The less reliable points should have lower weights in the estimate of the slope
This approach has the main advantage that data quality is considered and evaluated in a statistical
way: local variations can moreover be identified, to give warning about the presence of lateral
variations.
The main limit is that only the properties of the most energetic event are estimated, and a
preliminary filtering is necessary when several modes are present.
Figure 4.21 The time frequency analysis is based on the Short Time Fourier Transform, i.e. the computation of the FT
in short moving windows. A 1D signal(in time) gives a 2D spectrum (spectral density in time and frequency)
The moving window can overlap: long windows produce higher frequency resolution but lower
time resolution. The Short Time Fourier Transform can be used to see in a signal the time
depending behaviour of the different frequencies.
A simple example in figure 4.22 shows the meaning of the STFT. On the left the time domain
representation of a signal s1, a signal s2, and their sum. On the right the same representation in
time-frequency domain. The Time Frequency analysis, hence, shows the presence of different
frequencies and also shows when they are present in the signal. A signal is represented as energy
density as a function of the time and of the frequency.
A transient signals can be processed without loosing the time information, as it would happen with
a simple Fourier transform of the whole signal. The problem is the well known reciprocity
principle: a high resolution in time correspond to a low resolution in frequency (Oppenheim and
Shaefer, 1975; Buttkus, 2000)
When different signals are recorded at different distances, they can be analysed in time frequency
domain in order to compute the time delay at each frequency. The time delay can be obtained
analysing the energy distribution in time at each frequency, for the recorded signals: the barycentre,
in time, of the energy distribution for each frequency can be used (Audisio et al., 1999).
In the figure below (figure 4.23) are depicted two traces in time domain and in time-frequency
domain. The position of barycentre of the energy at the frequency of 20 Hz is shown in the TF
spectra.
Figure 4.23. The time-frequency analysis of two traces obtained with an impulsive source. If we cut the two TF spectra
at constant frequencies, the travel times can be extracted.
For the surface waves analysis, this tool can be used to extract the travel times of the different
frequency components of the signals at different offsets.
Similar approaches can be obtained with the time-scale analysis with the wavelets, that are able to
infer the local properties of signals.
4.3.1 Differences
The different processing techniques are different mathematical tools to compute the phase velocity
at each frequency from field data in time-offset domain. Actually they are not so different from
each other: they all are able to identify the properties of the propagation, and when applied to ideal
synthetic data they all should give the same result. Moreover most of the different techniques are
mathematically identical. The conventional SASW processing is completely identical to a f-k
processing of two traces, the different wavefield transforms are related by linear invertible
transforms (appendix C). The vertical flexibility coefficient at the top boundary of the surface layer
used by Al-Hunaidi (1994, 1998) is the f-k spectrum at a single frequency of the two traces
seismogram.
The different processing techniques, hence, have to be compared with regards to the way in which
they face the problems of real data. Different aspects have to be considered:
a) real data do not contain pure Rayleigh waves, but also other coherent phenomena
b) real data are noisy
c) real data are limited in time and space
Two operations are usually needed in the processing of full-wave form data: the selection of the
events to analyse, and the identification of their properties (velocity and attenuation as a function of
the frequency). One important difference among the processing techniques is the possibility of
selecting at each frequency different events, and not only the most energetic: this allows the
analysis also of secondary events. It has to be remarked that sometimes the secondary events are the
most interesting, and anyway this possibility increases the information that is extracted.
The noise is always present in field data and cannot be disregarded: the processing should reduce
the effect of noise on the inferred propagation properties. A better processing technique produce a
dispersion curve that is less contaminated by noise. However the main differences are in the number
of data that are processed. The accuracy of the result, for a given signal quality, mainly depends on
the number of traces that is processed and on the array length, but also on the possibility of
identifying and discarding the low-quality information. Some techniques actually allow the control
of the data quality, the selection of the information to process, and also a weighting of the
information based on its reliability.
The limited extent of the observation can be critic at the small scale of the engineering applications:
the main aspect is that the several modes of the Rayleigh wave propagation do not separate, but
often completely superimpose within the observation window. As a consequence only apparent
properties can often be obtained: an apparent phase velocity, and also an apparent attenuation. Some
processing techniques better face this problem, and allow a better mode separation than others.
In this section the different processing techniques are compared with regard to their sensitivity to
random noise and to coherent noise, and to their mode separation capability.
Figure 4.24. The phase rotation ∆φ induced by the noise on the signal can be shown in the complex plane. For the same
phase difference between S and N, the phase rotation depend on the Signal to Noise ratio
From a statistical point of view, the distribution of the resulting phase from the sum of a zero phase
signal and a random phase signal, can be obtained considering that the noise is
N ⋅ e iϕ
with random ϕ uniformly distributed in (0,2π ) and that the signal is simply
S
The resulting signal (S+N) has hence a real part
N cos ϕ + S (4.59)
S/N=2 S/N=1
S/N=0.9 S/N=0.5
Figure 4.25. The theoretical distribution of the phase for different signal to noise ratio (the phase of the noise is a
random variable uniformly distributed in –π +π)
This kind of distribution is obtained when the S/N is constant: if this is not true, a more complicated
distribution is obtained. It can be obtained intuitively summing different distribution corresponding
to different S/N.
In the expression
N sin ϕ
tan −1
N cos ϕ + S
the phase is distributed uniformly in (0,2π ) , the amplitude of the noise N can vary. If N has a
normal distribution with mean µ N and variance σ N , the signal to noise ratio is distributed as
− (1 / x − µ N )
1 2σ N2
e (4.62)
σ N 2π
If for instance the mean S/N is 3 and the variance of the noise is the 20% of the mean value of the
noise itself the resulting distribution for the S/N is not symmetric and is depicted in figure 4.26
Figure 4.26. The distribution of the Signal to Noise Ratio, with constant signal amplitude and normal noise amplitude
For the distribution of the phases, with the same mean S/N and with different variances of the noise
(ranging from 0 to 0.5 the mean value of the noise) we get the distributions in figure 4.27
Figure 4.27. The distributions of the phase, with a fixed mean Signal to Noise Ratio and different variances of the SNR
In real acquisition also the source is not perfectly constant, even with controlled sources. A
statistical prediction of the phases is then quite difficult, but of course little variations around the
average value mean good quality, while high dispersion means poor quality. It has been
experimentally observed, in a series of experiments performed during this work, that the phase
distribution, in real acquisitions with a great number of shots, tends to be a normal distribution for
high values of the S/N (Figure 4.30).
When real data are gathered averaging different acquisitions, if the phase of each trace at each
frequency is analysed, the data quality can be inferred. The trace depicted in frequency domain and
time domain (Figure 4.28) is obtained averaging 9 different acquisitions. The amplitude spectrum
shows that the highest signal level is in the band 40-70 Hz: in this range the S/N is also the highest,
also because in this case the main frequencies of the noise are lower. The single phases of the 9
acquisitions are shown (Figure 4.28) for the two frequencies of 5 Hz (low S/N) and 50 Hz (high
S/N).
Figure 4.28. A single trace obtained averaging 9 acquisitions, in time domain(top) and frequency domain (bottom). A
high energy content is within 35 Hz and 80 Hz
Figure 4.29. The 9 experimental phases, at the frequency of 5Hz (left), where the signal is weak,
and at 50 Hz, where the signal has the maximum of energy.
The obtained distributions of the experimental phases are in good agreement with the S/N: at 50Hz
the phase distribution as a much lower variance than at 5Hz.
A greater number of repetitions allow a better evaluation of the phase distribution. The following
figure 4.30 is obtained from the analysis of 100 acquisitions: the histograms have 15 classes
between the minimum and maximum values. In a frequency band where the signal is low, the
phases are distributed in a quite uniform way in all the range. Where the signal level is high
compared with the noise, the phases are concentrated around a single value, with a little variance.
Figure 4.30. Phase distribution for two frequency value (5Hz top, 60 Hz bottom). 100 shots.
4.3.2.2 Effects of the Array length and the number of receivers on the uncertainty of
the wavenumber
The acquisition is responsible for the uncertainty on each single trace, and the reduction of these
uncertainties is one of the main task of a careful, properly designed and performed acquisition. In
the design of the array the effects depending on the horizontal dimension (near field, far offset,
etc.) described in chapter 3 have to be taken into account.
Apart this, the choice of the array is mainly due to the need of reducing the propagation of the
single uncertainties of the phases. The uncertainty on the estimated wavenumber depends on the
uncertainty on data, of course, but also on their distribution, that is the number and the position of
recorded signals.
In particular let’s consider the phase of the traces at different distances. Considering a single
frequency component, it is evident that the length of the array and the number of receivers have a
great influence on the estimated wavenumber (and hence the phase velocity), and it is intuitive that
a multi offset estimate can average the errors of the single traces and give a more reliable estimate,
even with the same array length. The example below (Figure 4.31) shows that if only two signals
are recorded, the wavenumber is directly obtained as the slope of the line through the two points. If
more signals are recorded, the line can be obtained with a more robust estimate, reducing the effect
of single errors on data: this is a general concept not depending on the adopted processing.
Figure 4.31. The number of receivers influence the uncertainty on the estimated phase velocity, even with the same
array length. With two traces (left), the wavenumber is directly the slope of the line through the two points (in phase-
offset). With several traces (right) single uncertainties are averaged and reduced.
The first step, so, is to evaluate the error amplification due to the array dimension and to the number
of receivers.
A possible approach to evaluate the propagation of the uncertainty of the phases of the single traces
on the final estimate of the velocity can be done by the unit covariance matrix (Menke, 1989).
If the physical model is one dimensional, the phase is a linear function of the offset: considering
also the possibility of near field effects, or ignoring the exact position of the source, the linear slope
holds only in a certain range of offsets, in which the relation is
ϕ i = ϕ 0 + k ⋅ xi (4.63)
where ϕ i is the phase measured at the offset xi , k is the wavenumber, and ϕ 0 the intercept phase
(the details are given in section 4.4).
The wavenumber k is estimated considering n measured signals. The system of n equations 4.63
can be compactly written in matrix form as
Φ = G ⋅ [ϕ 0 k]
T
(4.64)
where Φ is the vector of the phases and G the kernel matrix, depending on the number of points
and on their distribution, which can be written as
1 x1
1 x 2
G= (4.65)
M M
1 xn
The matrix G contains the information about the data distribution, and not on their measured values:
it can be evaluated a priori, before the acquisition of data, and allows to compute the amplification
of uncertainty of data. As it can be noticed in the schematic example below, with the same
uncertainty of data, the final uncertainty depends on their position and number (Figure 4.32). The
uncertainty of the slope increases reducing the number of points and the length of the array.
.
Figure 4.32. The array length and the number of receivers influence the uncertainty of the estimated phase velocity
Figure 4.33. Uncertainty on the wavenumber for unit uncertainty on the measured phases. It is evident that for the
same array length a great number of receivers reduces the error
For each array length, a greater number of receivers reduces the amplification of the uncertainty: the
use of 24 receivers reduces the uncertainty of 4 times and the use of 48 receivers of more than 8
times in comparison to the use of 2 receivers. Given a certain ratio between the number of receivers,
for instance 2 and 24, the ratio between their amplification does not change with the array length.
For each number of receivers, the use of longer array reduces the amplification of the uncertainty: a
20 m array reduce the uncertainty of more than 4 times respect to a 10 m array, and a 40 m array of
more than 18 times, for any number of receivers. However, the increase in the array length is not
always possible: moreover, with a longer array the data quality is probably lower, due to the greater
attenuation. Increasing the array length also the uncertainty of the phase can not be supposed
constant; furthermore the risk of lateral variations is increased, increasing the array length.
The most interesting aspect hence is that the same amplification can be obtained with a shorter
array, if more receivers are used. This fact has a series of advantages, as shown in chapter 3, and
moreover allow a better data quality: there will be the same theoretical amplification, but a shorter
array will have lower uncertainties to be amplified.
Figure 4.34. With the same array length, the use of a different number of receivers change the accuracy of the result
The following figure 4.35 shows the comparison between 2 receivers and 24 receivers arrays: half
the length can be used with 24 receivers, and the source offset can exclude the risk of near field
effects.
Figure 4.35. The amplification for different array length for a 2receiver array and for a 24 receivers array.
A two-receiver array needs a double array length compared to a 24 receivers to have the same variance amplification.
Of course the choice of the array is also conditioned by the wavelengths that we want to sample: the
maximum spacing is conditioned by the spatial aliasing. This is, however, an other advantage of a
multi-offset, that can be used to sample a wide range of wavelength without changing the
configuration. These considerations on the averaging effect of a multi-offset array hold also if a
coherent noise is considered, particularly when it affect a single portion of the array (as for instance
the near field).
Figure 4.36. If weights are introduced, only an additional windowing effect is obtained. Left, weights for the traces.
Right: spectrum of the weighting function. The resulting spectrum is simply the original spectrum convolved with the
window spectrum
When these weights are applied to real data, the modification of the modal superposition appears in
the spectra. In figure 4.37 a seismogram is depicted (normalization of each trace) with its spectrum.
In figure 4.38 the spectrum of the original seismogram is compared with the spectrum of the
weighted seismogram: the bandwidth (in wavenumber) of the main lobe of the window is increased,
and between 20 and 30 Hz the superposition of the two modes is stronger.
The two dispersion curves obtained by searching the maximum at each frequency are shown in
figure 4.39: the main differences are due to the change of modal superposition caused weighting
window spectrum, and are in the frequency band 20-30 Hz.
Figure 4.37. Real seismogram and its f-k spectrum. Two modes are easily identified
Figure 4.38. The traces are weighted, and the two spectra are compared
Fgure 4.39. The two dispersion curves are compared here: the distortion is produced by the shape of the window
spectrum that affect the modal superposition. Black: no weighting. Grey: weighting.
Figure 4.40. On synthetic monochromatic data, the phase of a single trace is modified, to evaluate the effect of the
position of this error on the estimated wavenumber
The error on the estimated wave number depends on the distance from the centre of the distribution:
the first traces and the last traces have a greater effect on the estimated wavenumber, either using
the f-k processing, or all of with all the equivalent techniques, or the phase difference inversion.
The figure 4.41 shows an example with 24 traces: a phase rotation of a single trace is produced is
synthetic data: the error of the estimated wavenumber is plot vs. the position of the modified trace.
Figure 4.41. The same phase rotation has different effect depending on the position of the trace .The greater the
distance from the centre of the array, the greater the effect on the estimated wavenumber
It is interesting to notice that the amplitude of the effects of a phase shift on a trace depends on the
array length and on the position: the far traces produce a greater change on the estimated velocity,
even if their energy is little. Furthermore, for their little energy, they can be easily influenced by
noise: in seismic data with end-off shot, the far traces have a lower SNR. Their phases can be easier
distorted by noise, and the effect of this uncertainty is amplified by the distance. It is important,
hence, to reduce the weights of low SNR traces in the final estimate. In particular, with many
processing techniques, the weights are difficulty controlled, hence the corrupted frequencies of each
trace should be discarded and excluded from the analysis.
It is evident that the same uncertainty on the wavenumber produces very different uncertainties on
the phase velocity.
In the f-k plane, the same variation of velocity correspond to very different intervals for the
wavenumber (Figure 4.43): this means that the energy density of the same event is different in the
two domains, since the spreading around the maximum is different: the energy density, as it is
discussed later, can be considered as a probability density.
Figure 4.43. The relations between the f-k and fv planes are not linear. Equal areas in f-v correspond to very different
areas in f-k domain. The energy spread is uniform in f-k
The non-linear relation between the wavenumber and the phase velocity and the amplification of the
uncertainty of the phase velocity have to be carefully considered. This effect can be also analysed
considering a theoretical wave-field written (as in eq. 4.39)
∞
1
s ( x, t ) =
2π ∫ ∑S
−∞ m
m (ω , x)ei (ωt − k m (ω ) x ) dω
and the acquired wave field, observed within a limited time and space window, that has a spreading
of the energy of the different modes due to these acquisition windows.
In the f-k domain, the ideal data have the energy that is concentrated in the mode positions, while
the energy of acquired data follows the window spectrum. Considering the energy density as a
probability density function, real data have a probability density depending on the window
spectrum. The probability density function of the wavenumber, hence, is the same at each
frequency.
In f-v domain, on the contrary, the probability density of the velocity depends on the frequency.
The following figure 4.44 shows the frequency wavenumber (f-k) and the frequency-slowness (f-p)
representation of the same synthetic seismogram (a non dispersive event at 100 m/s in a frequency
band of 0-90 Hz): the energy spreading, hence the uncertainty, only depends on the windowing.
Figure 4.44. A non dispersive event 100 m/s in 0-90 Hz has a uniform spreading in f-k and not in f-p domain
This is not a wrong result: the probability density of the slowness has a greater variance than that of
the wavenumber. However this can not be neglected, and the uncertainty of the phase slowness has
always to be associated to its mean value, and has to be used in the inversion process that estimates
the subsoil properties.
A further aspect is that the phase velocity distribution, obtained from the slowness distribution, is
no more symmetric (Figure 4.45): the event associated to the highest probability is not the median
of the distribution. The use of the Gaussian distribution is no more possible.
Figure 4.45. Distribution of the phase slowness and of the phase velocity. The latter is not symmetric
Figure 4.46. The experimental phases are depicted for three frequencies, with the associated wavelength. There is no
evidence of some slope change between near field and far field
Analysing the lower of the three frequencies of figure 4.46, hence the greater wavelength, it can be
seen that only the first receiver seems to be out of the trend (Figure 4.47). This means that near field
effects are not so strong to modify the phase velocity of the main event
Figure 4.47. A zoom on the greater wavelength (40m) shows that the experimental phases have a good linearity in all
the offsets, apart the first trace(2m from the source)
Figure 4.48. Monochromatic seismogram: the phase rotations are due to the destructive interference between two mode
with similar energy (see chapter 3, figures 3.22-3.24).
On the contrary, the analysis of the whole set of traces could indicate the presence of two distinct
events, whose properties can be identified: the example of the f-k processing of this single
frequency, already given in chapter 3, is reported also in the following figure 4.49.
Figure 4.49. Energy density as a function of the wavenumber k. Two separate event are identified
The mode separation possibility can be an important parameter for assessing the performance of a
processing technique, because the amount of information extracted and the filtering possibility
depend on it.
The processing techniques can be classified in two classes considering the mode separation: some
of them transform data and show the distribution of energy as a function of other parameters
(velocity and frequency) allowing also the identification of local maxima; the others extract directly
the main event, i.e. the absolute maximum of energy, and give its velocity as a function of
frequency.
Considering the processing techniques allowing the analysis of several modes, the maximum
number of maxima and the mode separation possibility depend on the acquisition parameters. With
the f-k processing, for instance, there is a maximum number of modes that are identifiable with a
fixed number of receivers, as it has been illustrated in chapter 3.
4.3.5.2 The forward model for SWM and the test parameters
In the forward model used for SWM inversion (chapter 2), the modal curves are produced first
(eigenvalues); then the displacement and stresses as a function of depth and frequency
(eigenfunctions). Finally the Green function allows to compute the displacements due to modal
superposition, at the free surface at every distance from the source. In such a way each testing
configuration can be simulated: even-spaced arrays, non uniformly spaced arrays, SASW data with
different configurations. In all these cases, the synthetic data are processed with the same procedure
of experimental displacement measured in the field.
For instance, in SASW test, the mode separation increases increasing the spacing: at high
frequency, with little spacing, the modal superposition is usually strong (Figure 4.50).
Figure 4.50. If different arrays are used to get the whole desired range, and different frequency ranges are estimated
with different array length, the modal superposition will change with the frequency
In the spectral decomposition of the seismograms the information is the amplitude and the phase of
each trace at each frequency, and this information is treated differently from the different
processing techniques. The signal quality evaluation is performed only to select the frequency
ranges where the information is supposed to be reliable, and some empirical rules are used during
acquisition to avoid undesired effects depending on the near offset or on the far offset.
A direct evaluation of the quality of this elementary information, i.e. the amplitude and the phase of
each spectral component at each offset, is performed: the reduction of the uncertainty of the final
estimate of the velocity and attenuation needs a filtering and weighting of the data in frequency –
offset domain. The estimate of the propagation properties is made considering the local properties,
and so the lateral variations can be identified.
This procedure (Strobbia and Foti, 2003) can be considered an extension of the 2 receivers phase
difference analysis (Dziewonski and Hales, 1972) and is based on the simple relationship of the
phase and the amplitude as a function of the offset. As it has been mentioned above (equation 4.54),
the wavefield of Rayleigh waves can be written in terms of modal amplitude and phase as a
function of the offset and the frequency
The amplitude is written as
e −α (ω )x
A(ω , x ) = I (ω )R(ω ) (4.63)
x
and this relation allows the estimate of the attenuation coefficient α from the experimental
amplitude and offsets, for each frequency.
The linear relation between phase and wavenumber
ϕ (ω , x ) = −k (ω ) ⋅ x + ϕ I (ω ) . (4.64)
can be used to estimate the unknown wavenumber at each frequency, from the experimental
phases and offsets.
The processing can be described as follows:
• different records of different acquisitions are transformed into frequency domain
• the statistical distribution of the phase and of the amplitude is evaluated, at each offset and
at each frequency
the phases and the amplitude are processed separately
• the phase is unwrapped to a full-phase representation
• the wavenumber is estimated with a weighted least squares inversion of the experimental
phases: the weights are computed from the statistical distribution of the phases
• the relation between phases and offset is used to test the hypothesis of one-dimensional
medium: the residuals of the inversion can be used for a statistical test
Extract from PhDThesis - Claudio Strobbia. 185
Chapter 4.4 THE MULTI OFFSET PHASE INVERSION
• the attenuation coefficient is estimated with a weighted least squares inversion of the
experimental amplitudes
Figure 4.51. The raw seismograms (time- offset) and their amplitude spectra (frequency- offset)
The complex spectra are then analysed: at each frequency and at each offset the amplitude and the
phase are considered. Both for the amplitude and the phase the mean and the standard deviation are
computed.
This task is almost straightforward for the amplitude data, even if a normalization considering the
source variability has to be performed: on the other hand the periodic behaviour of the phase leads
to some complications.
The experimental phase is obtained in a modulo-2π representation from which an unwrapped phase
vs. offset function has to be obtained. If the unwrapping is applied before the statistical analysis to
the different records, the resulting distribution of the unwrapped phase can be strongly influenced
by the unwrapping algorithm, which can amplify some little differences. On the other hand, to
process phase data its periodicity has to be considered, especially with low-variance data: otherwise
the real distribution can be misinterpreted. For instance, the two distributions of figure 4.52
apparently have the same mean value if the periodic behaviour of the phase is not accounted for.
Figure 4.52. The phase has to be analysed considering its periodic behaviour: the risk is, for cases such as in the right
one, to compute a wrong mean and overestimate the variance.
The positive and negative values of the phase can be analysed separately, and the parameters of the
two distribution are compared with those of the overall set of data. If the two variances are
significantly lower than the overall variance, the distribution is corrected by shifting the positive or
the negative values.
The standard deviation of the raw phases can be directly plotted and analysed: the resulting image
(Figure 4.53) shows the region of the f-x domain where the phase information is reliable. High
values of the standard deviation of the phases correspond to poor information, and low reliability.
In particular, a uniform distribution of the phase in the interval (-π, +π) , corresponding to a
complete lack of coherence and random behaviour of the phase, would give a variance of π 2 / 3 .
Values of the variance close to this limit threshold indicate a poor information of the traces in that
frequency range.
Figure 4.53. Variance of the phase. The attenuation of the high frequencies in far offset produce a
poor signal quality, and a high phase uncertainty
The mean value of the phases cannot be plotted as it is, since an unwrapping is necessary. The
following figure (4.54) shows the measured phases between 17 and 23 Hz for the 24 traces. It is
clear that the oscillating behaviour of the phase does not allow a clear identification of the velocities
Figure 4.54. Experimental phases as a function of the frequency and the offset.
Figure 4.55. Experimental raw phase (left) and unwrapped phase (right) at the
frequency of 20Hz of the spectrum of figure 4.54
The phase difference between two adjacent receivers depends on the wavenumber and on the
spacing:
∆ϕ = k ⋅ ∆x
and can be greater than π radians when
k ⋅ ∆x > π
π
Hence k > correspond to wavenumbers greater than the critical Nyquist wavenumber
∆x
corresponding to a spatial sampling of ∆x (see chapter 3).
If the wavenumber is smaller than this critical value, the unwrapping procedure is applied
straightforwardly. In cases such as the one depicted in figure 4.56, the unwrapping must be applied
with the constraint of positive velocity (and wavenumber).
This allows to reach k MAX = 2k NYQUIST , which is the same result obtained before (chapter 3),
recovering the quadrant of the negative wavenumbers of the f-k spectrum.
Figure 4.56. The wavenumber at this frequency is slightly greater than π/2. The phase difference between adjacent
receivers is slightly greater than π
Figure 4.57. The experimental phases are inverted using a linear model
x1 1
x 1
G= 2 (4.67)
M M
xN 1
is the data kernel matrix (Menke, 1989).
The model parameters can be estimated in the least squares sense by the pseudo-inverse G − g
M = G −g ⋅ Φ
(
G −g = GT ⋅ G )−1
⋅ GT .
This simple procedure is very similar to other processing techniques used for surface wave analysis.
When only two traces are processed, the pseudo-inverse is simply the inverse matrix, and the
estimated wavenumber is directly obtained as
∆ϕ
k=
∆X
that is the classical 2-station phase difference approach (Dziewonski and Hales, 1972) traditionally
used in the SASW method. The relation of this procedure with the f-k processing is shown in
appendix C.
The great advantage of the proposed procedures the possibility of reducing the effects of low
quality traces on the estimate of the phase velocity: far traces, in the high frequency ranges, do not
have any coherent signal, and so do not contribute to the information, but degrade the information.
As it can be seen in figure 4.57, far traces have higher uncertainties, because of the lower energy
and lower S/N: the amplitudes of the traces do not affect directly the estimate, and so weights have
to be introduced to control the variance.
Figure 4.58. The phases as a function of the offset for 6 different frequencies. The high frequencies with greater slopes
shows higher uncertainties in the far offset
The sum of the squared errors that is minimized does not consider that the distance from an
experimental point has different meanings depending on the uncertainty on that point. The
uncertainty on the estimate is reduced, as it can be intuitively understood, if the weight of unreliable
points is lower than the weight of reliable points. This avoids that unreliable points, lying out of the
trend, pull the interpolating line in their direction. This is not, of course, only the case of the
outliers: the use of weights, from a statistical point of view, is the way to get the best estimate of
model parameters. When the distribution of the phase values assumed by each experimental point is
normal, it can be demonstrated (Menke, 1989) that the best estimate is obtained using as weights the
inverse of the variances of the experimental phase values measured at each point.
In the case of a distribution such as that of the phase, a simple definition of weights is not possible:
when the variance is π 2 / 3 radiances, the point has zero reliability.
A preliminary filtering can be made, in order to assign a zero weight to the unreliable portions of
the f-x phase spectrum. Referring again to the example shown before, the following figure (4.59)
shows a example of muting: the processing algorithm actually automatically excludes the traces
that, at each frequency, has a standard deviation greater than a fixed threshold .
Figure 4.59. Experimental variances of the phase, and manual filtering of unreliable portions
Another possible approach is the use of the spectral amplitude above the noise level, that has a good
correlation with the variance of the phase.
After the muting procedure, the weight are introduced into the computation, and the pseudo-inverse
GW− g can be obtained as (Tarantola, 1987)
(
GW− g = G T ⋅ W T ⋅ W ⋅ G )−1
⋅ G T ⋅W T ⋅W (4.68)
implies, under the assumption of normally distributed and independent data, that
−g
σ M2 = G 2W ⋅ σ Φ2 . (4.70)
where σ M2 and σ Φ2 are respectively the covariance matrices of the model parameters and of the
data, and G 2W− g is the matrix containing the squares of the elements of GW− g (Santamarina and
Fratta, 1998).
Adopting the above procedure, the mean value of the wavenumber and the associated uncertainty
canbe estimated at each frequency of interest (figure 4.60)
Figure 4.60. The wavenumbers and their uncertainty with the frequency
It has to be remarked that the above variances do not consider the effects of coherent noise in the
raw seismic data, and hence the uncertainties can be underestimated.
The wavenumber is then used to compute the phase velocity V, and whose uncertainties can be
estimated with a first order approximation
2π ⋅ f
V = (4.71)
k
2π ⋅ f
σV = σK (4.72)
k2
or more accurately considering the probability density of the phase velocity corresponding to a
normal probability density of the wavenumber (Figure 4.45).
The following figure 4.61 shows a dispersion curve with the associated uncertainties, obtained by
direct propagation of the variances of the experimental phases. The standard deviation, however,
does not describe the shaper of the probability density function of the phase velocity.
Figure 4.61. The dispersion curve is obtained from the best estimate and the experimental uncertainties are statistically
estimated
Figure 4.62. Experimental data showing an abrupt change of the slope of the phases in the middle of the array
More rigorous statistical test can be implemented. If each of the N individual prediction error is a
Gaussian independent random variable x, each with zero mean and unit standard deviation, the sum
of their squares is a random variable with chi-squared probability density with N degrees of
freedom.
N
χ 2 = ∑ xi2
i =1
If the different error have different variances, the overall misfit can be predicted in a different way,
weighting with the covariance matrix C and including the vector of the means µ , to give a random
variable with the same probability density
χ 2 = (x − µ ) C (x − µ )
T
Too large values of the experimental misfit, with respect to the probability density defined above,
may suggest the violation of some hypothesis.
(1993) showed the effectiveness of their reconstruction generating synthetic seismograms for the
obtained laterally varying earth model, under the assumption that no mode conversion occurs at the
segment boundaries (Keilis-Borok et al., 1989).
The approach here proposed, from a theoretical point of view, allows the partitioning among
segment of uniform wavenumber: the segments can be of different extents depending on the
wavelength. A kind of phase velocity pseudo-section could be obtained: the phase velocity can be
plot as a function of the wavelength and of the coordinate along the array, as it is sketched in figure
4.63
Figure 4.63. Pseudosection of phase velocity. If lateral variations are present, it is possible to partition the array in
segments of homogeneous properties. A kind of pseudo-section can be obtained.
As it has been mentioned, the interpretation (or inversion) of these data cannot be performed with a
one-dimensional forward modelling. But if lateral variation are present, the global processing of
data can give wrong results, as it will be shown in chapter 5. When there are sharp lateral changes
the global information obtained by the processing of the whole array may yield two dispersion
curves (Mockart et al, 1988). If the changes are gradual, the processing will give a dispersion curve
not clearly defined, that is an average dispersion curve.
Figure 4.64. The mean (left) and percent standard deviation (right) of the amplitude as a function of frequency and
offset. It is evident the attenuation of the high frequencies in far offset
The amplitude are then inverted assuming as model the exponential law
e −α (ω )( x − x1 )
A(ω , x ) / A(ω , x1 ) =
x − x1
and estimating the attenuation coefficient α at each frequency. The procedure is quite similar to
that describe above for the phase, but an iterative inversion has be performed. The noise level has
moreover to be included, and the additive nature of the noise amplitude has to be considered.
The f-k processing has been introduced in section 4.2 and the limitations of the acquisition have
been presented in chapter 3 referring to an f-k processing; here some further details are discussed,
and the parameters used for the computation are presented. Then an alternative approach for the
computation as sum of elementary contributions is presented.
The flow chart of the f-k processing is presented in figure 4.65.
DATA (t-x)
N records
Comparison of
different records
Selection of
records to process
DATA (t-x) FFT DATA (f-x) FILTERING f-x DATA FFT DATA (f-k)
1 stack record (tÆ f) 1 stack record f-x FILTERED (xÆ k) 1 stack record
WINDOWING
1,2, …, m
Figure 4.65. The flow chart of the f-k processing of surface waves
4.5.1.1 time
The time is transformed into frequency: this first transform has the aim of separating the different
spectral component of the signal, to compute then the properties of each of them.
The zero padding increases the frequency resolution of the spectrum, when the signal length is not
sufficient, and this can be useful specially at low frequency.
A frequency resolution of 0.25 Hz, for instance, can be obtained with a 4 s record: padding can be
necessary with shorter records. When the time length of the record is not sufficient, the number of
samples in the padded signal can be computed as
1
N pad =
∆f ⋅ ∆t
where ∆f is the desiredfrequency resolution and ∆t the sampling interval.
The number of samples can be very high with little ∆t , the dimension of the spectrum increases,
and the computation becomes heavy. An under-sampling of traces, or decimation, can often be
performed because the very high frequencies are usually not necessary.
If the signal is not monochromatic, the spectrum has a frequency content which does not shows
spikes: the observation on a limited time window does not introduce strong distortion of the actual
behaviour. The windowing in time domain often is not strictly necessary, if the signal is not
artificially truncated. The figure 4.66 shows a signal recorded at 6 m from the source, a
sledgehammer, and its amplitude spectrum computed without any windowing.
..
Figure 4.66. A trace in time domain and in frequency domain, transformed without any windowing
4.5.1.2 offset
The transform of the offset has the aim of identifying the wavenumber with which the energy
propagate at each frequency. This transform can be quite problematic, since the theoretical
behaviour has spikes (because the propagation admits only few modal wavenumber at each
frequency), and needs long arrays to be observed. The effect of the limited window has been
discussed in chapter 3, and here the aspects that can be useful in the processing are discussed in
detail.
From a theoretical point of view, in a laterally homogeneous medium, the spectrum should have M
spikes corresponding to M modes: if a window of length L has been used, the convolution of the
real spectrum with the window spectrum can be observed. The mode separation possibility depends
on the actual array length, that influences the spread of the window spectrum.
However the computation requires a zero padding, even if this does not increase the resolution. The
f-k spectrum can be evaluated numerically only in a limited number of points, that is in a limited
number of wavenumber: the computation with a finite zero padding is a discretization of the
theoretical spectrum obtained with an infinite zero padding (chapter 3.4.3, figures 3.41 and 3.42 ).
The computation in 2048 values is usually enough to get a continuous behaviour of the dispersion
curve.
4.5.1.3 Windowing
If data are simply transformed without any other preliminary operation, as it has been mentioned
before, a boxcar window is introduced. To recognize these effects, different kind of window can
further be applied, to compare the position of secondary maxima. The results can be compared to
assess if secondary events are due to the window.
Two windows that reduce the amplitudes of the ripples are the Hanning window and the Hamming
window.
The Hanning window of n samples is defined by
k
w[k + 1] = 0.51 − cos 2π k = 0,1 ,..., n − 1
n −1
The Hamming window of n samples is defined by
k
w[k + 1] = 0.54 − 0.46 ⋅ cos 2π k = 0,1, ... , n − 1
n −1
Figure 4.67 show three windows in time domain and in frequency domain. Many other windows
have been developed and can be used (see for instance Oppenheim and Schaefer, 1975).
Figure 4.67. Different windows, in time domain (top) and in frequency domain (bottom)
Hamming and Hanning windows are quite similar in time domain, but the former has lower
amplitude ripples. In order to evaluate the width of the main lobe and the amplitude of the
secondary lobes, it is better to plot the spectra of the windows in dB (Figure 4.68)
Figure 4.69. Three modes (spikes) and the spectrum obtained with a 24 m array
However even the first maximum is moved from its correct position, and the shape of the window is
distorted by the contribution of the second maximum, as it can be noticed from the zoom of the
previous figure (Figure 4.70)
Figure 4.70. Zoom of the previous figure: even if modes are separated,
the position is modified by the modal superposition
The effect is of course stronger if the distance between the two modes is smaller, and stronger
interferences between the windows ripples and main lobes occur: the modal superposition, as it has
been illustrated occurs when the sum of the windows has a maximum which is different from the
mode positions (Figure 4.71).
Figure 4.71. With the same array the modal superposition is stronger if the distance between the mode is smaller
Figure 4.72. The same array length, with different windows, has a different mode resolving power. A Boxcar window
has a narrower main lobe, and separate the two modes. The position of the maxima is not correct
If, on the other hand, relative maxima are searched, a window with lower ripples has to be used: this
procedure can be effective when modes are well separated and the main problem is the
identification of ripples.
The following example of processing shows that a simple search of relative maxima can be
effective in a case when several modes are present. The data have been acquired with 24 vertical
geophones, 2 m spaced, with 2m of source offset: the source is a sledgehammer.
The seismogram is plotted in figure 4.73: different events are present and separate within the
limited array length, with a broadening of the signal that requires 0.6 s to be completely recorded at
the furthest receiver
Figure 4.73. Example of f-k processing. The raw seismogram, 24 receivers 2m spaced
The f-k spectrum has been computed with a 2048 traces zero padding, and recovering the quadrant
of the negative wavenumbers (chapter 3.4): the maximum wavenumber is hence π radiant. (figure
4.74)
Several coherent events are present in the spectrum, corresponding to several modes, and the
absolute maximum at each frequency, does not follows a single mode. A first jump can be noticed
at around 30 Hz, and a second jump at around 47 Hz.
Figure 4.74. Example of f-k processing. The f-k spectrum with the position of the absolute maximum for each frequency
The search of the relative maxima gives the curves that are depicted in frequency-velocity domain
in figure 4.75. It can be noticed that the absolute maximum lays on different modes in the different
frequency bands.
Figure 4.75. Example of f-k processing. The position of the absolute maximum (blue asterisks) and of relative
maxima(black dots)
Figure 4.76. Principle of the deconvolution: the window spectrum shape is known, and allow to identify superimposed
windows
A simple spectral deconvolution can be applied: the spectrum and the window are transformed,
then they are divided, and finally inverse-transformed. With synthetic data this procedure is
efficient
The principle is that the real unknown spectrum is convolved with the window spectrum to give the
observed experimental spectrum, and this operation becomes a multiplication when dealing with
transformed quantities: the unknown real spectrum can be obtained with a division between the
transformed quantities. Referring to figure 4.77, the experimental spectrum A and the window
spectrum B are transformed to give SA and SB .These are spectra of the spectra. The spectral ratio
SA / SB is computed, and this latter function is inverse transformed to give the real spectrum.
The window has to take into account all the effects influencing real data: geometric and intrinsic
attenuation at first. With real data some other steps are necessary (whitening for instance), but the
first results that have been obtained in the processing of field data encouraging.
a)
b)
Figure 4.78. (a) Two-traces seismograms: offset (m) and time (s).
(b) Elementary spectra. Wavenumber(rad/m) and frequency (Hz)
The elementary seismograms and elementary spectra can be analysed by means of the coherence
function, in order to asses the signal quality of pairs of receivers: for each couple of traces, the
coherence function is computed, and gives indications about the frequency ranges that have to be
excluded. Another possibility, as shown above, is the evaluation of the spectral amplitude as a
function of the frequency and of the offset: this procedure indicate the regions where the signal
quality is high.
This approach of selecting couples of traces is completely equivalent to a muting performed in f-x
domain before the f-k transform.
The example below shows the effectiveness of the procedure in increasing the quality of the
estimate at high frequency. The seismogram is depicted in figure 4.79, left: the acquisition has been
performed with 24 vertical geophones, 0.5 m spaced, 1m of source offset. The seismogram is
transformed into f-x domain (figure 4.79, right), where the energy is depicted in dB and clearly
show the attenuation of the high frequencies in the far offset.
Figure 4.79. Seismogram in time offset and frequency offset domain. The poor signal regions are easily identified
In the new domain, the data can be filtered, so that only coherent event are retained. In the high
frequency band, of course, the far offset traces are discarded. The second transform is then applied,
and a filtered f-k spectrum is obtained. The seismogram here depicted is processed with the overall
spectrum and with the filtered spectrum, and the two resulting dispersion curves are depicted in
figure 4.80. In this case the main advantage is in the high frequencies, which are mainly influenced
by shallow layers: here with the filtered spectrum a stiff top layer (pavement) can be well identified,
while the uncorrelated noise does not allow the identification of its properties when all the traces are
considered. Even if the pavement is not the target of the test, a reliable estimate of its properties is
necessary also for the inversion of the low frequency portion of the dispersion curve.
Figure 4.80. The dispersion curve is estimated from the whole seismogram (left) and from the f-x filtered
seismogram(right). The filter has muted the high frequencies in the far offset.
Figure 4.81 The dispersion curve of the averaged seismogram is depicted together with the individual dispersion curve
of each single shot.
It is interesting to notice that in the low frequency band, the experimental distribution of the phase
velocity seems to be asymmetric (Figure 4.80): this is in good agreement with the theoretical
distribution of the phase velocity, as discussed in paragraph 4.3.2.5, and as shown in figure 4.44.
The phase velocity of the average seismogram is different from the average of the phase velocities.
The program allows the processing of seismic data to extract the dispersion characteristics and the
attenuation. The different steps of the processing are visualised and saved as Matlab variables, and
can be exported in numeric and graphic format.
• Raw seismogram are imported and converted into matrices (each trace a column), in the
time-offset domain. Records can be analysed by crosscorrelation to compute and correct the
trigger time shift if any.
• Single traces and the whole seismogram can be transformed into frequency domain, time-
frequency domain, frequency-wavenumber domain. The slant-stack can be performed, and
the ω-p transform is available too. Different windows can be applied before the
transformation.
• One-dimensional and two-dimensional filtering can be performed: manual muting in the
different domains is possible, manual picking.
• Statistical analysis of the different records, to evaluate the distribution of different spectral
quantities (amplitude and phase as a function of the offset and the frequency).
• A chosen processing procedure can be applied to an averaged seismogram, and the same
procedure is automatically applied to the single acquisition to evaluate the uncertainty.
• Automated search of absolute and relative maxima within assigned boundaries can be
performed in the different domains.
• A complete f-k processing is easily performed: data are transformed into f-x, analysed to
identify good quality regions, filtered, different windowing are applied and the f-k transform
is completed, then different search of maxima are used to extract the maximum of
information about modal curves.
• The multi offset phase inversion is implemented, following the steps described in 4.3.
Inversion
This chapter, after introducing the basic concepts of the inverse problem theory, discusses the
inversion of surface waves data, the reliability of the results, and the effects of possible errors.
The inverse problem theory is presented focusing the attention manly on the uncertainty and
5
information content of data, the resolution and variance of the result, the parameterisation of the
model, the data error and model error. Then, for the surface wave data inversion, some
considerations about the data to be inverted and the model are presented: which set of data is the
better for a certain target, and given a set of experimental data which parameters can be reliably
estimated. Some possible dramatic errors coming from the misinterpretation of some data feature,
particularly due to the multi mode nature of surface waves, are shown.
In this chapter
The basic concepts about the inversion: solution techniques for inverse problems; uncertainty and
information in data; evaluation of the information content; resolution of data; resolution of model,
and variance of the model parameters; parameterisation of the model and simplicity, data error and
modelisation error; testing the model.
The inversion of surface wave data: inversion techniques for the surface wave data; the factors
influencing the solution; considerations on the data to invert to increase the information and reduce
the uncertainty, without replicating the same information, considerations on the model parameters,
to find an optimal set of parameters to estimate with the maximum reliability. The errors deriving
from the misinterpretation of data, especially due to the multi mode nature of surface waves
The use of constraints and the joint inversion of surface waves with other data
5.1 INTRODUCTION
Figure 5.1. Scheme of the inversion of SWM: the soil properties are inferred from
the dispersive properties of Rayleigh waves
The unknown system is parameterised and is assumed to be completely described by a finite set of
M model parameters: the model allows to predict the values of N measurements, that are called, in
the following, data (Menke, 1984). The model is the set of relations describing the observed
phenomenon: the model parameters are the value that allow a quantitative description of the object
that is studied.
The unknown model parameters are the geometric and mechanical properties of the layers
constituting the subsoil: for each layer i , the shear velocity, the Poisson ratio, the mass density, and
the thickness, VSi ,ν i , ρ i , hi .
The number of unknown model parameters, when L layers are considered, is M = 4 L − 1 , because
the last layer is the half space and its thickness is assumed a priori as infinite.
The measured data are the Rayleigh wave velocities for N discrete values of frequency,
d = VR 1 , VR 2 , VR 3 ,..., V R N (5.2)
The model parameters and the data are functionally related as illustrated in chapter 2, and for each
frequency a function of the model parameters gives the Rayleigh velocity. The forward problem is
solved with the series of algorithms allowing the simulation of the Rayleigh wave propagation that
has been described in chapter2.
Figure 5.2. Model parameters and data for the surface wave inverse problem
The set of model parameters described above completely represents the layered model: the
velocities can be in general complex, thus allowing to account for material attenuation and
viscoelastic behaviour. All these parameters have to be specified to solve the forward problem:
however, the solution is more sensitive to some of them, in particular the shear velocity and the
layer thickness. Therefore in the inversion, mass density and Poisson ratio are often assumed a
priori on the basis of other information The influence of earth parameters on the dispersion curve is
treated for instance by Aki and Richard (1980). The relations of the SWM are not linear: the
problem is hence a non-linear inversion.
The simpler interpretation, which is not an inversion, of the dispersion curve is the λ / 3 or λ / 2
interpretation: it simply consists of assuming that the shear velocity is the 110% of the Rayleigh
velocity, and the depth is equal to one third or one half of the wavelength (for instance Abbiss,
1981). An other approach that has been proposed (Stokoe et al. 1984), based on a proper forward
modelling, is the trial-and-error inversion. Starting from an initial guess, the theoretical dispersion
curve is compared to the experimental dispersion curve, and the profile is modified until a good
fitting is reached. Automated iterative inversions have been used with the least squares criterion by
many authors, with different updating criteria, damping and weighting (Herrmann, 1994; Lai 1998;
Xia et al, 1999; Schtievelman 1999). Genetic algorithms (Al-Hunaidi, 1998), joint inversion with
other techniques (Hering et al, 1995; Comina et al., 2002), and coupled inversion of phase velocity
and attenuation (Lai, 1998) have been applied.
Often only the first mode is considered, but interesting examples of inversion of several modes
(Gabriels et al., 1987) and apparent curve have been discussed.
Considering an inversion procedure able to produce the best model, considering the data, their
uncertainty, the eventual constraints and the a priori information, the questions to be answered are
the following:
• Which data allow the best estimate of model parameters?
• For a certain desired target at a site, which frequency range is needed? Within a frequency
range, which is the minimum number of frequency points that carries the whole information,
without replicating it? Which is the best sampling of the data space?
When data have been collected, and their uncertainty has been estimated, the main question refers
to the uncertainty of the model parameters. Different aspects should be considered.
• Which is the uncertainty of thickness and shear velocity?
• Which is the investigation depth that has actually been obtained?
• Some model parameters are often estimated a priori, for instance the density and the
Poisson ratio: is this approach always correct, and which can be the consequences of errors
on the estimate of these parameters?
• The number of layers that has been chosen influences the result, and different models can be
equivalent. Which is then the best parameterisation, given a set of data?
Before discussing the case of the surface wave data inversion, some of the basic concepts of the
inversion will be briefly presented, to introduce the tools that will be used later.
where G is the data kernel matrix, d is known and is the column vector of the N data. It is possible
to solve this matrix equation to find the vector of the M model parameters m and, for this simple
case, the solutions can be obtained with different approaches from different viewpoints: the length
method, the generalized inverse, the maximum likelihood method (Menke, 1984).
“This ‘solution’ may be chosen in order to optimise the resolution provided by the data, the error in
the solution, the fit to the data, the proximity of the solution to an arbitrary function, or any
combination of the above” (Jackson, 1972).
The matrix G is very important both to find the solution by “inverting” it and to assess the
information content, the resolution, the variance, as it will be shown later.
where ∆d is the vector of the increments of data, ∆m is the vector of the increments of the model
parameters, and J is the Jacobian matrix. This equation correspond to equation 5.5 of linear
problems.
This allows to write in matrix linear form the relation between an increment of the model
parameters and an increment of data, and the same technique of linear problems can be used to get
the pseudo inverse of this relation. Most of the considerations that are possible for linear inverse
problems can apply to this linearization.
When a set of data has to be fitted to infer a set of model parameters, a first guess of the model m0 is
generated, and the forward problem is solved to find a first theoretical data set d0. This data are
compared with the experimental ones, and a vector of prediction errors ∆d 0 = d exp − d 0 is obtained;
inverting equation 5.7, the correction ∆m0 to be applied to the initial estimate of m is obtained. This
procedure can be repeated iteratively until the vector of the prediction ∆d i error is small enough and
hence a good fitting is obtained.
d = G ⋅m
where d is the vector of N experimental data, m is the vector of the M unknown model parameters,
and G is the data kernel matrix containing the coefficients Gij of the linear relations (equation 5.4).
If N data have been observed, we can define the prediction error e as
e = d obs − d pred = d obs − Gm est
where d obs is the vector of the observed values of the data, d pred is the vector of the predicted data,
obtained using the forward model G with the estimated model parameters m est
The different solution techniques are based on the minimisation of the vector e: the sum of the
squared errors can be simply obtained with the expression eTe, and the least squares criterion is
simply
[
min(e T e) = min (d obs − Gm est ) T ⋅ (d obs − Gm est ) ]
which leads to the generalised inverse (Moore-Penrose inverse; Penrose, 1955),
(
G −g = GT G )−1
GT
that can invert the relation 5.5 giving the estimate of the model parameters from the observed data
m est = G − g d obs
A weighted measure of length considers the a priori information about the uncertainty on single
data. The application of probability theory, assuming a multivariate Gaussian distribution, allows to
get the Maximum Likelihood Solution of the linear inverse problem as a minimization of a
weighted measure of the prediction length, corresponding to the weighted least squares solution.
e e e
eW = 1 , 2 ,..., N
σ1 σ 2 σN
On the other hand this implies that is data are not normally distributed, then another measure of the
prediction error may be more appropriate. (Menke, 1989).
Another approach focuses mainly on the operator, and not on the estimate: the linear matrix
equation is inverted by means of a generalized inverse, whose properties are discussed to get the
best solution, in terms of resolution and variance. The data resolution matrix and the model
resolution matrix can be introduced to evaluate the properties of the solution and to design the best
pseudo-inverse. The solution technique depends mainly on the information that data provide about
the unknown model parameters and on the uncertainty of data: as it will be seen, also in linear
problems, some of the model parameters can lack in information, and a global error minimization
can produce a solution with high amplification of the variance of data. So, regularisation, weighting,
and damping will be used to better control the inversion.
In the simples least-squares or “Gauss Newton” approach, the cumulative squared error is
minimised. In order to reduce the difficulties and the amplification of the variance, a constraint of
maximum energy of the parameter change vector can be introduced: this solution is known as
damped least squares, or Marquardt-Levenberg approach, because it and was introduced by
Levenberg (1944 ) and described by Marquardt (1963). The quantity that is minimized is
min(e T e + η 2 m T m)
that gives the pseudo-inverse
(G T
G +η 2 I )
−1
GT
the damping factor η bound the length of the vector of the model parameters: in iterative linearised
techniques this limits the correction to the model parameters.
The details can be found for instance in Lines and Treitel (1984), Tarantola (1987), Menke (1989)
and Santamarina and Fratta (1998).
These solution techniques give different results when the problems are actually mix-determined:
when the model parameters are well constrained by the data, the solutions are very close.
I. GLOBAL SEARCH
The trial and error procedure consists of: generating a solution from a first estimate; comparing the
solution with experimental data; using different criteria and norms; updating the model on the basis
of the residuals distribution. Of course this kind of approach can be used when a limited number of
model parameters has to be estimated, and the updating criteria depend on the experience of the
analysts.
If the forward model has some special properties, such as scaling properties, this can be very useful
in the updating of the model, because a reduction of the unknown can be performed at some steps.
As it has been mentioned, the way by which the solution is found does not affect the possibility of
assessing the properties of the solution by means of the analysis of the linearized formulation
around the minimum.
The Monte Carlo approach, see for instance Tarantola (1987), search the solution with a random
exploration of the model space. A large number of solutions is randomly generated, respecting
bounds for each model parameter: these solutions are compared with experimental data to find the
best model. This is a completely general approach, for linear and non-linear problems, for any
distribution, that is not trapped in local minima, but can be expensive in terms of computation time.
From the field of artificial intelligence, Artificial Neural Networks and Genetic Algorithms are
other way to solve this kind of problems (Santamarina and Fratta, 1998)
The problem may be linearised around an initial estimate, and the solution with a Newton-type
algorithm iteratively inverts the residuals. This local search algorithms try to minimize the misfit
following the local decrease of the misfit function
d = F ⋅m
∆d = J ⋅ ∆m
∆m = J − g ⋅ ∆d
The iterative procedure, starting from a initial guess, m0 , computes the final model with a simple
expression
[
mi +1 − mi = J − g d exp − d isynt ]
mi +1 − mi = J − g [d exp
− F ⋅ mi ]
The pseudoinverse J − g can be computed with different approaches: the Gauss-Newton approach
minimises the cumulative squared prediction error, the Marquardt-Levenberg approach also
constrains the sum of squares of the elements of the model parameter change vector. The
Marquardt-Levenberg exploits the good properties of the Gauss-Newton, suitable with small errors,
and the steepest descent method, more suitable with big errors (Smith and Shanno, 1971).
It is not possible to evaluate a priori the uniqueness of the solution, and the convergence is
controlled by the surface of residuals.
The example below shows the “surface of residuals” for a simple case that can be easily
represented. A two layer model is considered. The velocity of the first layer is known (as it could be
obtained from the high frequency range of the dispersion curve) and hence only two parameters are
unknown: the velocity and the depth of the bedrock. Figure 5.3 shows the right model and its
dispersion curve, that is its response, with a solid black line: it also shows the boundaries of the
domain of variation of the model, and the corresponding curves. The curves corresponding to
models with the bedrock depth ranging from 1m to 6 m, and the velocity of the bedrock ranging
from 210 m/s to 400 m/s, are depicted in figure 5.3, right.
Figure 5.3. Local search for a two parameters problem (The velocity and depth of the bedrock). In the left the right
model and the domain of variation of the parameters, in the right the obtained set of data and the one corresponding to
the true model
The distance ( in the sense of L2 norm) between the right curve and the curves corresponding to the
other models is computed and depicted in figure 5.3. The figure represent the surface of the
residuals, and has a minimum for V=300m/s and H=3m, where it is zero.
This figure suggests the iterative approach to find the right model starting from an initial guess of
the solution: following the local descent of the residuals, that is the misfit function, one can go
toward a local minimum (Figure 5.4). If the initial guess is close enough to the right model, the
procedure converge to the right model.
Figure 5.4. The surface of the residuals for the case of figure 5.3. Dark colours correspond to the highest residuals.
The path of the evolving solution during the iterative procedure depends on the used algorithm: the
path of the Steepest-Descent, the Gauss-Newton, Marquardt-Levenberg are different, and in general
the methods can lead to different solutions (see for instance Lines and Treitel, 1984).
Iterative techniques of local search use different approaches to correct the model parameters, either
reducing the residuals or promoting a unit ratio: ART (Algebraic Reconstruction Techniques) SIRT
(Sequential Image Reconstruction Techniques), MART (Multiplicative ART), are described by
Herman (1980). The different model parameters can be updated simultaneously
Some of the updating techniques can be more suitable for SWM, where the model parameters
referred to shallow layers are well determined from a limited range of the dispersion curve.
In general the solution of non-linear problems with iterative algorithm can find difficulties due to
local minima: the surface of residuals may have several minima for non linear transformations, and
convergence may be trapped in a local minimum.
The initial model is very important for all local search algorithms: the data pre-processing may
permit the identification of the salient characteristics of the solution without formal inversion.
Moreover the convergence to the right model is simpler if a way of generating reliable first model is
available.
Figure 5.5. A even-determined inverse problem: fitting a straight line through two points
When N > M, the problem is over-determined: this is the case of a straight line through 3 or more
points. In general, if points are not aligned perfectly, this single straight line does not exist. Some
criterion should be used to choose among the infinite possible straight line that don’t pass through
the points (figure 5.6).
Figure 5.6. An over-determined inverse problem: fitting a straight line through 8 points
It is usual to assume that the right model parameters are those that produce the better estimate of
acquired data: so the correct model is the one that minimize the prediction error, i.e. the overall
distance between the predicted data and the observed experimental data.
[
e = d1obs − d1pred , d 2obs − d 2pred ,..., d Nobs − d Npred ]
If the L2 norm of this vector is minimized, this means that the quantity
∑ (d ) =e T ⋅ e = (d − Gm ) ⋅ (d − Gm )
2 T
i
obs
− d ipred
i
is minimized with respect to each mi , which means that (see for instance Menke, 1989)
G T ⋅ Gm − G T d = 0
[
that is, if G T G ]
−1
exists
(
m est = G T G )−1
GT d
This is the least squares solution of the overdetermined problem d = G ⋅ m , the matrix G T G ( )
−1
GT
is called the Moore-Penrose generalized inverse (Penrose, 1955)
When each datum d i has its estimated variance, it is better to weight the single prediction error
considering its uncertainty: the same prediction error has different meanings if the experimental
point is accurate or not.
d obs − d1pred d 2obs − d 2pred d obs − d Npred
e= 1 , ,..., N
σ1 σ2 σN
Figure 5.7 shows a set of experimental points with their uncertainties, and the two straight lines
estimated with a weighted least squares and a conventional least squares solution.
Figure 5.7. Weighted least squares to consider the different reliability of the experimental data
From a statistical point of view the best estimate with Gaussian data is the weighted least squares
(Menke, 1989): the use of the inverse of the variances as weights minimize the uncertainty of model
parameters. Discussing the information content, it can be noticed that the very inaccurate points,
actually, do not contribute to the information.
If N < M the problem is underdetermined: it is such as searching the straight line that passes
through a single point (Figure 5.8). Infinite straight lines pass through that point, and a criterion to
chose the best one has to be assumed. A priori information should be incorporated.
Figure 5.8. A under determined problem: fitting a straight line through only one point
If we have more than 3 points, a best plane in the least squares sense can be found, but only if
[ ]
−1
points are not aligned (Figure 5.10). The least squares solution exists if the inverse G T G exist,
and this happens if the rank of G is maximum (Santamarina e Fratta,1998). The rank is maximum in
this case if there are 3 linearly independent rows, and so the matrix has rank 3.
If the points on the contrary are perfectly aligned, only two rows are linearly independent and the
rank is not maximum, so the least squares solution does not exist (Figure 5.11). In this case data are
not independent; even with a great number of points there is not enough information to constrain the
plane, i.e. there is not enough information to determine the model parameters.
Moreover, the lack of information may not affect a single model parameter, but a linear
combination of the model parameters: this is a mix-determined problem. Part of the model is over-
determined, part is under-determined.
Figure 5.11. The problem is apparently over determined, but the data are not actually independent
The problem is in some way underdetermined, since the equation contains information only about
some model parameters: the combination of model parameters that can be found lies in a subspace
of the model parameters space Sp(m). The rest of the space is called the null space, and no
information is provided by the data about these combinations of model parameters. No model
parameter can span the whole space d: only a subspace Sp(d) can be satisfied by model parameters.
But, if points are almost aligned, the rank is still 3, and the solution, in the mathematical sense,
exists. But data are not actually independent, since this case is not so different from the one
illustrated above: the mathematical existence of the solution is not sufficient to make it significant.
In the case on the left side of figure 5.12, the points are perfectly aligned, and the solution does not
exist: it is only possible to determine the slope of the plane in the direction of alignment. In the case
depicted on the right side, the points are not perfectly aligned, but only almost aligned: the solution
for the best plane exist, but is evident that the information carried by the points is mainly about the
slope in the direction in which they are “almost” aligned.
x1 y1 1
x y2 1
G= 2
M
xN yN 1
and is decomposed with the SVD. The eigenvectors of the matrix V represent the combination of
the model parameters a, b and c
Four cases are presented in the following example (figure 5.13): for all of them 6 experimental
points are used to infer the 3 parameters of the plane. The four distributions of data differently
constrain the plane, and the assessment of the information content is performed with the SVD.
I. CASE1
In the first case the data are well distributed, and contain information about the whole set of model
parameters: the three eigenvalues are 6.068, 3.190 and 2.243.
II. CASE 2
In the second case the points are aligned, they all have the same y, and so it is not possible to
determine uniquely a plane. Nevertheless in the direction x there is enough information, and so the
parameter a could be identified. The lack of information affect the second parameter, i.e. the
coefficient b. There is indeed a null eigenvalue, which correspond to the eigenvector (0, 1, 0), that
means that the combination of the model parameters 0a+1b+0c is completely under-determined.
III. CASE 3
In the third case the points are again perfectly aligned, in the direction y = x. There will be a
combination of the model parameters that cannot be determined. The null eigenvalue correspond to
the eigenvector (0.707, –0.707, 0), which means that the combination 0.707 a- 0.707b is without
information: in that direction, indeed, points gives no information at all.
IV. CASE 4
The fourth case is the most interesting, since points are almost aligned along the same direction as
the third case. In such a more realistic situation eigenvalues are not zero, but are small. The
eigenvector corresponding to the small eigenvalues says the direction where there is little
information.
eigenvalues
6.068 0 0 4.434 0 0 6.209 0 0 6.425 0 0
0 3.19 0 0 2.311 0 0 2.334 0 0 2.312 0
0 0 2.243 0 0 0 0 0 0 0 0 0.11
0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 0
eigenvectors
V1 V2 V3 V1 V2 V3 V1 V2 V3 V1 V2 V3
0.597 -0.721 0.353 0.977 -0.215 0 0.701 -0.091 0.707 0.675 -0.192 0.713
0.798 0.579 -0.167 0 0 1 0.701 -0.091 -0.707 0.724 -0.0161 -0.690
0.084 -0.382 -0.921 0.215 0.977 0 0.129 0.992 0.000 0.144 0.981 0.128
Figure 5.13. Example of the information assessment by means of singular value decomposition
These concepts about the amount of information in data are very important because they allow the
assessment of the reliability and uncertainty of the result. They will be applied to the SWM
inversion to discuss the properties of the solution and the requirements of the data to invert. The
model parameters, for the SWM inversion, will be the properties of the different layers, and this
kind of analysis will allow to understand which layers can be resolved.
Figure 5.14. Example of the singular value for a six layer model with unknown velocities.
They decrease but are not zero
It is evident that some model parameter, or some combination of them, has few information: this
result is very useful, since it says which model parameters will be difficult to determine, or which
combination of them.
The reason why few information means difficulty in estimation depends mainly on the variance
amplification. When the information regarding a model parameter is small, even a little uncertainty
on data, results in an unacceptably large variance on the model parameter. The critical aspect is that
a single parameter, or combination of parameters, without enough information can corrupt all the
others.
For the kth model parameter, the variance can be estimated, referring to the SVD formalism, as
(Jackson, 1972)
2
Vkj
m
var mk = ∑
j =1 λ j
where V are the eigenvectors and λ the singular values.
The relation above shows that little singular values can deteriorate the result producing an uncertain
estimate of all model parameters: a different kind of solution should be used to extract al least the
reliable information. In the case of the plane, if the points are almost aligned, we will find a little
singular value relative to the direction perpendicular to the alignment. But the plane will have a
degree of freedom, and the resulting estimate will show a great variance on both the model
parameters a and b (Figure 5.15).
.
Figure 5.15. The lack of information can affect a combination of model parameters, not only single model parameters
It can then sometimes be useful to damp the little singular value (as mentioned in table 5.1) to
reduce the variance: this damping produces, however, a decrease of resolution.
The condition number, for square symmetric matrices such as G T G , can be easily estimated as the
ratio between the maximum and the minimum singular values (Santamarina and Fratta, 1998)
max(λi )
c=
min (λi )
A matrix is singular and cannot be inverted if k = ∞ , and is said to be ill-conditioned when the
condition number is very large (>1012): numerical inaccuracies occurs in the solution and errors are
magnified.
(
G −g = GT G )
−1
GT
But in general, a good inverse should satisfy the following criteria (Jackson, 1972)
• The data resolution matrix should be close to the identity matrix, i.e. G ⋅ G − g ≈ I N
• The model resolution matrix should be close to the identity matrix, i.e. G − g ⋅ G ≈ I M
• the unit covariance should be small
The data resolution matrix indicates how well data can be predicted, or how they are independent
(Wiggins, 1972). When the model has been estimated with mest = G − g d obs we can predict data with
the forward equation d = G ⋅ m getting d pred = G ⋅ m est = G ⋅ G − g ⋅ d obs .
This means that the data resolution matrix S = G ⋅ G − g (N x N) is the identity matrix it data are
perfectly fitted; if S is not the identity matrix the prediction error is not zero. In real cases, the data
resolution matrix is not the identity matrix, and each datum can be obtained as a weighted average
of neighbouring real data.
The model resolution matrix indicates how well model parameters can be predicted (Backus and
Gilbert, 1968). If a true set of model parameters exists, observed data can be obtained as
d obs = G ⋅ m true , and the estimated model parameters obtained with the generalized inverse are
m est = G − g d obs , which means
m est = G − g G ⋅ m true
Hence the estimate of model parameters can be thought as obtained from the real model parameters
by means of a model resolution matrix R = G − g ⋅ G (M x M) . If R is the identity matrix each
model parameter is uniquely determined. In real cases R shows the resolution with which model
parameters can be determined, or resolved, as weighted average of neighbouring model parameters.
Each row i of the model resolution matrix contains the weights of the weighted average of the M
real model parameters giving the estimated model parameter i. An example is reported in the figure
5.33
The unit covariance matrix indicates how the covariance on data is mapped to model parameters:
this propagation is function of the data kernel G and of the pseudoinverse G − g , not of the data
themselves. If data are supposed to be uncorrelated and to have equal variance σ 2 it can be written
[cov u m] = σ −2 G − g [cov d ]G − gT = G − g G − gT
The unit covariance matrix is independent on the actual values and variances of data.
The goodness of a generalized inverse can be measured in terms of spread of the data resolution and
of the model resolution matrices, and in terms of size of the unit covariance matrix; the optimisation
of the generalized inverse gives (Menke, 1989)
[
G −g = GT G +η 2 I ⋅ GT ]
which is the damped least squares relation, being ε the damping factor, that minimize a weighted
combination of data resolution spread and covariance size.
The spread is defined as norm of the difference of the resolution matrices and the identity matrix:
the spread is a measure of the off-diagonal elements.
Other considerations, for underdetermined, overdetermined, purely determined cases are presented
by Backus and Gilbert (1968)
There is, however, a Trade-off between model resolution spread and variance size: one decreases
increasing the other. In the minimization of the two, the weights are α and 1 − α , (Menke, 1989)
Figure 5.16. The damping factor rules the trade off between resolution and variance (after Menke, 1989)
Figure 5.17. The same data set can be fitted by models of different complexity. Reducing the misfit does not always
means to increase the reliability of the result (modified, after Santamarina and Fratta, 1998)
Simple models filter data noise and extract the useful (and sometimes the only achievable)
information contained in the measurements. More complicated models fit even better data points:
reducing the misfit do not guarantee a higher significance of the result.
This criterion is general: in the case of SWM the parameterisation is essentially the number of
layers, the depth of the half-space, the number of parameters for each layer. These aspects will be
discussed in section 5.3.
Figure 5.18. When model errors are present, often data cannot be fitted
“Reasonable” fitting means that it is possible to test the model, when the uncertainties of data are
known: if, with a given set of experimental data and a given model, the misfit is too big compared
with the uncertainties, then the assumed model is not correct, as shown below.
A reliable estimate of the uncertainty is needed: in the example below (Figure 5.19), with little
uncertainty the linear model is not probable (left), with greater uncertainties it passes the test. This
kind of test (briefly described below) gives indications on the possibility of increasing the
complexity of the model, considering the uncertainty of data.
Figure 5.19. The same experimental points are fitted by the same model: the value of the misfit is the same, but the
probability of correctness of the model is different. The right situation with higher uncertainties on data corresponds to
a higher probability for the model (of course: in the night (the darkness of errors) all the cats (models) are black (true).
For instance, considering the SWM inversion, if a three layer situation is fitted with a two layers
model, significant errors will be present: anyway the possibility of increasing the complexity of the
model to reduce the misfit in data has to be carefully evaluated.
The risk of model errors due to the multi-mode nature of the surface wave propagation and the
modal superposition is discussed in par. 5.3.5: if a branch of dispersion curve due to a second mode
is inverted as a first mode, a high error is introduced, and the velocities are highly overestimated.
N
χ 2 = ∑ ei2
i =1
(χ )
N
−1
( ) ( )
2 2
pdf χ 2 ; N = N
exp − χ 2 2
2 Γ(N 2 )
2
If the different errors have different variances, the overall misfit can be computed in a different
way. In fact, weighting the ei’s with the covariance matrix C and including the vector of the means
µ , a random variable with zero mean and unit standar deviation is obtained; it then follows the
same probability density
χ 2 = ( x − µ ) C −1 ( x − µ ) .
T
The experimental misfit can be compared with a theoretically predicted misfit, computed
considering the estimated uncertainties of the experimental data: too large values of the
experimental misfit with respect to the probability density define above (for instance above the 95%
of probability), may suggest the risk of violation of some hypothesis.
while the data are the Rayleigh–wave velocity as a function of the frequency.
d = VR 1 , VR 2 , VR 3 ,..., V R N
but the data could also include other propagation properties, such as for instance the power spectra
or the distribution of the energy as a function of the frequency along a mode
The attenuation and the velocity of the layers are physically coupled, and may be uncoupled in
weakly dissipative media. In the following we consider an uncoupled inversion of experimental data
to estimate the shear velocities and the thickness of the layers.
Usually the propagation velocity is considered, and the energy distribution is not used. The
dispersion causes the propagation velocity to be a function of the frequency, and so a phase velocity
and group velocity exist (chapter 2).
dω dV dV
U= =V + f =V −λ
dk (ω ) df dλ
From the above equation it is easily seen that if phase velocity V increases with frequency, the
group velocity U is greater than V. If V decreases with increasing frequency, as it happens at
normally dispersive sites, V is greater than U. The simple relation between wavenumber and group
velocity shows the relation between the data space of the wavenumber and the data space of the
group velocity.
Figure 5.20. Scheme of the relations between the model space and the data space
The sensitivity of the phase velocities modal curves is an aspect that can help to understand which
data contain the information about the different model parameters.
The following figure 5.21 shows the partial derivatives of the first three modal phase velocities with
respect to each layer shear velocity. The model is a three layer model; two layers, 1m thick each, on
a half space, and with shear wave velocity being 100 m/s, 200 m/s, 400 m/s. Considering for
instance the first mode, it is evident that the phase velocity shows sensitivities to the different layers
that are a function of the frequency: at high frequency the first layer, at low frequency the half-
space, and in the middle the second layer. Higher modes have behaviours more difficult to be
interpreted.
Figure 5.21. Partial derivatives of the modal phase velocities respect to the layer shear velocities
The sensitivity can be seen as the way in which the variation of the properties of a layer influences a
mode in a certain frequency band.
It is possible to see directly how the different layers influence the different modal phase velocities:
the effects of the same variation of the shear-wave velocity of the first and the third layer are shown
in the following figure 5.22. The right side of figure 5.22 confirms that the high frequencies are not
sensitive to the deep layers, and they could be inverted separately. (same model of Figure 5.21)
Figure 5.22 Variation s of the modal curve corresponding to the same increase of the shear velocity respectively of the
first layer (left) and the third layer (right)
This approach could be used for the assessment of the properties of the solution and the design of
experiments: however it has to be considered that this simple approach has a limit due to the energy
distribution on modes, that can make some modes not detectable, and can influence the modal
superposition. The modal curves are only possible solutions, and from an experimental point of
view they can be not detectable: it can happen either that modes are well separated but some of
them are not identified, or that modes are superimposed in a average behaviour, depending on the
modal phase velocities but also on the modal energies.
Figure 5.23. Modal curve (lines) and apparent curve (asterisks) corresponding
to the maximum of energy for each frequency
The modal superposition can play a strong role, both in normal dispersive and inversely dispersive
sites. The array length influence the modal superposition in experimental data, and have to be
considered in the inversion.
The apparent dispersion curve can be the result of the superposition of different modes in an
experimental configuration. It depends on the site properties and on the array dimension. Of course
all the influencing factors have to be considered in the inversion, and so a multi-modal modelling
with modal superposition has to be used. These aspects about the simulation of the test, accounting
for the modal superposition and the array length, have been discussed in chapter 2. In chapter 3 and
4 the effects of the modal superposition have been considered for both the acquisition and the
processing.
The inversion of such a function, anyway, poses a series of problem due to its discontinuities: the
jumps between different modes produce high variations of the data even for small variations of the
model parameters. If branches of modes can be isolated, this allow a multi-modal inversion of
different modes. However it is not possible to identify the mode number: when different modal
curves are obtained in experimental data, their identification is not straightforward. Sometimes, for
instance, only branches of the first and the third modes are visible and there are no evidences
helping to assign the right mode number to the branches
In many cases, a trial and error inversion is the only way to get a stable result: it can be used, of
course, as a first estimate for a local-search refinement. An example is given in chapter 6.
Several modes can exist, and they all can be dominant in different frequency ranges. Moreover,
along a single modal curve, even with a flat frequency band source and not considering the
attenuation, the energy distribution can be very inhomogeneous. When strong acoustic impedance
changes occurs, resonant frequencies in the displacements at the free surface are easily identified.
The figure below (Figure 5.25, 5.26) shows the synthetic spectrum obtained simulating a test at a
site with a layer 2m thick, with 100 m/s of shear-wave velocity, over a bedrock with shear-wave
velocity of 400 m/s, with vertical source at the free surface. A resonant frequency is identified
around 22 Hz with an input source at the free surface that has a unit amplitude spectrum between 5
and 50 Hz.
Figure 5.26. Modal curves and f-k spectrum for a two layer stratigraphy 2m 100m/s, bedrock 400 m/s of shear velocity:
it can be noticed that the response is amplified around 22 Hz. (synthetic data)
As for the resonance frequency of layers for a simple SH – wave with vertical incidence on a stack
of layers, the greater is the impedance contrasts, the greater the is amplitude of the resonance
(Kramer, 1996): the amplification can be easily simulated in elastic and viscoelastic media, and is
one of the results of the modelling described in chapter 2.
The experimental observation of resonant frequencies, at sites where high impedance contrast
exists, is common, but a quantitative analysis requires a higher control of the amplitudes in
throughout acquisition and processing. Many factors in the equipment and at the site can modify the
amplitude distribution as a function of the frequency (source, receivers, attenuation…), and it is not
simple to consider them all for a quantitative analysis of the amplifications: anyway the position of
energy maxima increases the information.
If the experiment is designed and performed controlling the attenuation and the frequency response
of the whole, these latter factors can be included in the modelling and also the energy distribution
can be used in the inversion procedure.
Figure 5.27. Modal curve (dots experimental, solid line numerical) and the corresponding model.
The modal curve is uniformly sampled with 114 points.
The Jacobian matrix of the final model is used to compute the data resolution matrix is depicted in
figure 5.28: the data set was a vector of 114 values of phase velocity at 114 different frequencies
(from 4 to 33 Hz), and so the data resolution matrix is a square matrix 114 x 114. As it has been
discussed in 5.2.5, independent data correspond to a diagonal matrix, and the relation between
diagonal and off-diagonal element is a measure of the independence of data (the spread defined
above).
It is evident that in the low frequency band the diagonal elements are much higher than the others,
while in the high frequency ranges this is not true. This means that in the low frequency range the
points are more independent, so each of them increases actually the information content. In the high
frequency range, on the contrary, points are mutually dependent, and actually replicate the same
information.
Figure 5.28. The data resolution matrix is depicted in the left and three cuts are shown in the right
The consequence is that in the high frequency range less points are needed. The same effect can be
seen if the experimental data are analysed in wavelength vs. phase velocity: even if it is not
straightforward to describe the relations between data space and model space, to adequately sample
the different depths in the model space it is necessary to adequately sample the different
wavelengths in the data space (Figure 5.29).
Figure 5.29. The same experimental dispersion curve in frequency – phase velocity(left) and wavelength – phase
velocity(right). A uniform sampling in frequency is not uniform in wavelength.
If the dispersion curve in the frequency – phase velocity domain were sampled in such a way to
have an experimental point each meter of wavelength between the minimum and maximum values,
the resulting graph would be like the one in figure 5.30.
Figure 5.30. The sampling of the dispersion curve which correspond to a uniform sampling of the wavelength
5.3.4.2 Layers
As far as the layer is concerned, two aspects have to be considered: the number of layers and the
depth of the half-space when the velocities only are inverted. These aspects refers respectively to
the discretization and the boundaries of the investigated domain. If the layers are considered as a
discretization of the studied domain, the thickness of the layers as a function of the depth has to be
discussed: the loss of resolution and sensitivity with the depth has to be considered, and the
thickness should then be increased as the depth increases.
The example below shows the effects of the number of layers on the result of the inversion of an
experimental dispersion curve. The site presents a quite homogeneous sandy formation laying on
gravels.
I. NUMBER OF LAYERS
The first inversion is performed with a model of 6 layer with constant thickness: the thickness of the
layers has been obtained simply dividing the investigation depth, estimated from the maximum
wavelength, by the chosen number of layers (Figure 5.31).
This model cannot fit well the high frequency portion of the dispersion curve for any number of
iterations because of the limited number of layers in the shallow part. This could be considered a
model error: no value of the parameters of the first two layers could fit the high frequency range of
the experimental dispersion curve, and the number of layers has to be increased. The obtained
stratigraphy has also a velocity inversion, that hence has to be considered an inversion artefact.
Indeed it depends on the high frequency range that is not properly fitted. The limited number of
layers, moreover, produces quite high impedance contrasts between the “layers”, that are unlikely
for that site, characterized by a quite homogeneous formation in the shallow part. When the number
of layers is increased, the site characteristics are better identified. There is not a geological layering
with sharp boundaries, but a continuous variation of the mechanical properties of the soil.
Figure 5.31. Final Model (left) and fitting of the curve (right) for the best 6 layer model
The inversion with 10 layers (Figure 5.32) shows a better fitting in the high frequency portion, and
a model with a quite smooth increase of shear velocity with depth, with a half space characterised
by a higher velocity. The fitting is excellent and the misfit is very small if compared with the
possible uncertainties of the dispersion curve. The thickness of the layers could be optimised, but
with the chosen approach this result is satisfactory.
Figure5.32. Obtained model and modal curve fitting with a 10 layers model. The fitting is good and the model
reasonably compatible with the a priori information. The reliability of the different layers is not constant.
With a further increasing of the number of layers the model follows each oscillation of the
dispersion curve, but there is the risk of fitting the noise (as mentioned in 5.2.6).
When the number of layers becomes high (Figure 5.33), of course decreases the reliability of the
single shear velocities. A trend is identified, and it is better to invert for the coefficient of a velocity
law, for instance a polynomial law, which is assumed to velocity variation with the depth. In
particular the velocity inversion that is obtained in the uppermost part of the profile is simply due to
the small change of the experimental phase velocity, that decreases more slowly above 27 Hz.
Figure 5.33. Obtained stratigraphy and modal curve fitting with a 35 layers model. The single shear velocities are not
reliable, and inversion artefacts are introduced (shallow velocity inversion)
Considering the final model of the inversion with the 35 equally thick layer model, it can be shown
that in data simply there is not the information to resolve each layer. The model resolution matrix
shows that the estimated shear velocities are obtained as weighted averages of the real velocities,
and the number of layers that are weighted increases with depth. Using a 35 parameter model, the
model resolution matrix is the square 35 x 35 matrix depicted in figure 5.34, left. In the upper part
of the model there is a higher resolution, in the deeper part a lower resolution.
Figure 5.34. The model resolution matrix and the plot of the diagonal elements for the 35 layer model
The resolution strongly decreases with depth as the sensitivity, and it is interesting to notice that the
deepest layers are not resolved at all.
The singular value decomposition shows clearly this aspect.
The plot of the singular values (Figure 5.35, left) shows that there isn’t a clear threshold between
big and small singular values, but a strong decrease that makes the fifth singular value to be one
tenth of the first singular value.
Each singular value indicate a certain amount of information, and the associated eigenvector (V)
describe which is the combinations of model parameters carrying that amount of information. The
coefficient of the first 4 eigenvectors are plot in figure 5.35, right: it can easily be seen that they
refers to weighted averages of layers.
In particular the first singular value has the following coefficients
0.1892 0.5981 0.6059 0.3843 0.2341 0.1454 0.0913 0.0603 0.0413 0.0296 …
As it has been mentioned in section 5.2.4.3, these combination means that the first singular value
refers to the combination
0.1892VS1+0.5981VS2+0.6059VS3+0.3843VS4+0.2341VS4+0.1454VS5+0.0913 VS6+0.0603VS7+0.0413VS8+0.0296VS9…
This, roughly speaking, means that the maximum of information that is present allows the estimate
of the mean of the second and third layers, with also the contribution of the third the fourth, and the
first, and other coefficients that are actually negligible. The coefficient of the first four singular
values are depicted in figure 5.35, right: each line represent an eigenvector, with the coefficient
associated to the 35 layer.
Figure 5.35. The singular value (left) and the first 4 eigenvectors, as coefficient of the different model parameters. The
first eigenvector is the almost mean of the second and third layer
The figure 5.35 also shows that the information and the resolution decreases with depth.
The singular value decomposition, showing the decrease of the resolution and information in the
deepest part of the model, states a kind of “suppression principle”: if the singular value is
associated to an eigenvector that average two layers, these ones may be substituted by a single
average layer.
A complicated stratigraphy can be simplified to find an equivalence with an averaging stratigraphy:
figure 5.36 shows two models (left), one with a great number of layers and the other with a small
number of layers, and the corresponding responses (right). The two responses are in practice
identical, and hence the two models are equivalent.
Figure 5.36. Two different models and the corresponding dispersion curve.
They are equivalent, in the sense that there is not enough resolution for the fine discretisation.
5.3.5.1 Example 1
The first example refers to a simple normal dispersive stratigraphy of three layers (Figure 5.37).
The first two modal curves are quite close between 13 and 20 Hz: in wavenumber domain the
difference between the two curves is less than 0.05 rad/m (Figure 5.38), and the second mode has
more energy than the first. The two modes can be clearly separated only with an array hundreds of
meters long, otherwise their energies will be superimposed (Figure 5.39).
If an more realistic array is used, the two modes are superimposed. The apparent curve lays between
the two and tends to the second mode approaching its cut-off frequency. Below the cut-off
frequency of the second mode, of course the first mode only can exist, but the associated energy is
small and probably difficult to be detected experimentally (5.39). A test performed at this site
would probably give only the continuous branch of the apparent curve above 14 Hz: a similar case
from the real world is given in chapter 6.
Figure 5.39. The synthetic f-k spectrum and the apparent curve. Very low energy is present below the cut-off frequency
of the second mode
If the continuous branch of the dispersion curve is considered and is misinterpreted as a first mode,
a great error can be introduced. The mode weighted damped least square inversion considered a
pure first mode gives a model that fit well the data (Figure 5.40)
Figure 5.40. The model that fit, with its first mode, the synthetic main branch of the dispersion curve of figure 5.38
Comparing the model that has been used to generate the data and the model that is obtained by the
inversion (Figure 5.41), it can be noticed that in the upper part, above 7m, they are in good
agreement: this portion of the model corresponds to the frequency band of the dispersion curve
which was actually due to the first mode. Going deeper, the data are influenced by the second mode,
(which has a higher velocity than the first) but as they have been interpreted as a first mode: over-
estimated velocities are obtained.
Figure 5.41 The difference between the true model (grey) and the one that come from the fitting of figure 5.39(black)
It must be stressed that it is not possible, from the experimental data alone, to recognize if the
obtained dispersion pattern is due to a modal superposition or to a single mode and, moreover, if a
single mode is the first one or a higher mode. It is so very important to evaluate the obtained result,
because such an error can be quite dramatic.
A possible procedure to check the result is to use the final model as input of a multi-modal forward
modelling, to asses the relative importance of the different modes.
If this procedure is applied to the result of the inversion above (Figure 5.40), it could be noticed that
the first mode (which well fits experimental data) is not dominant in all the considered range: if the
test was performed at such a site, the modal superposition would have been observed. The synthetic
characteristic to be compared with the experimental, hence, cannot be the first mode.
Figure 5.42. The multi-modal modelling of the obtained stratigraphy is necessary when a first mode inversion is
performed. This allows at least to see if the first mode is dominant in the resulting model
5.3.5.2 Example 2
An other example of possible errors due to the modal superposition considers the presence of a
higher mode which is close to the first mode in a central frequency band.
The effects described in chapter 4 about the modal superposition have to be taken into account: the
presence of a mode close to another can locally move the position of the maximum that is identified
by the processing (Figure 5.43).
Figure 5.43. The modal superposition can move the maximum from one mode to a average position between two modes
If a higher mode is present close to the first one, with a high energy content, it can move the energy
maximum to higher velocities. Below the cut-off frequency of the second mode, or simply out of
the frequency band where this higher mode has a great energy, the maximum will go back to its
correct position, on the first mode, reducing its phase velocity. This local increase of phase velocity,
if the presence of the higher mode is not recognised, can be misinterpreted. The example below
(Figure 5.44) shows an example of such a dispersion curve: the shape of the curve can be
interpreted as a local increase around 20 Hz, or a decrease between 15 and 20 Hz.
Figure 5.44. The decrease of the phase velocity going towards the low frequencies has to be well verified, because it
correspond to a velocity inversion. It has to be verified that it is not a modal superposition effect.
That small variation in the dispersion curve has a great importance in the model estimation: if it is a
real feature it implies a velocity inversion. The example below (Figure 5.45) shows the models and
the modal curves (first modes) for a simple case where a layer 2 m thick produces an effect on the
curve that can be compared with the one seen above (Figure 5.44).
Figure 5.45. The local increase of the phase velocity in the dispersion curve has a great effect on the stratigraphy
5.3.5.3 Example 3
The same kind of effect can be produced by lateral variations. If the test is performed at a site where
a homogeneous bedrock is overlaid by a layer of sediments showing an abrupt lateral variation
(Figure 5.46), as in the example below, the behaviour of the dispersion curve is confused. When a
sharp lateral variation is present, two dispersion curves could be obtained: if the variations are
gradual or less strong, a confused dispersion curve is obtained (Mockart et al., 1988).
Figure 5.46. Scheme of a abrupt lateral variation in the sediments on a homogeneous bedrock
Figure 5.47 The two f-k spectra that may be obtained from two separate acquisitions
Figure 5.48 The spectrum that is obtained from a single measure and the corresponding curve
The different branches obtained by figure 5.48 could be incorrectly interpreted as different modes.
where G 2 − g is the matrix containing the squares of the elements of G − g (Santamarina and Fratta,
1998).
In general, if the solution is obtained with a linearised inversion, and only the shear velocity of the
layers is considered, it can be written (Lai, 1998):
[(
cov[VS ] = J ST ⋅ J S )
−1
J ST ]
last #
[(
cov[VR ] ⋅ J ST ⋅ J S )
−1
J ST ]
T
last #
The application of this last equation to the inversion of the dispersion curve depicted in the
following figure 5.48 (left)gives the uncertainties of the shear velocities of the model of figure 5.49
(right).
Figure 5.49 Uncertainties of the estimated model parameters. Left) experimental dispersion curve with standard
deviations, right) shear velocity model with standard deviations of the layer velocities
Figure 5.49. Dispersion curves of ten separate shots (points) and the curve obtained stacking the ten record. At each
frequency we have 10 experimental phase velocities, showing a kind of distribution of the phase velocity.
The program allow the inversion of the modal dispersion curve to find the shear velocities of layers
of assumed thickness, or to find thickness and shear velocities of layered model. The multi-modal
forward modelling of the final model allows to assess the importance of higher modes in the
obtained model.
The trial and error inversion of apparent curves due to the modal superposition is possible in a
interactive manner, using the scaling properties of the Rayleigh waves.
• A branch of a modal curve can be inverted, assuming the mode number, with a weighted
damped least squares algorithm.
• Several branches of different modes can be simultaneously inverted
• A trial and error procedure is possible, comparing the experimental curve and spectrum with
the simulated ones. The applied global stretch of the numerical curve is used to compute the
scaling of thickness and velocities.
REFERENCES
• Abbiss C.P., 1981, Shear wave measurements of the elasticity of the ground,
Geotechnique, 31, 91-104
• Aki, K., Richards, P.G., 1980, Quantitative seismology. Theory and methods, W.H.
Freeman and Company
• Al-Eqabi G.I., Herrmann R.B., 1993, Ground Roll: A potential tool for constraining
shallow shear wave structure, Geophysics, 58, 713-719
• Al-Hunaidi O., 1994, Analysis of dispersed multi-mode signals of the SASW method
using the multiple filter crosscorrelation technique, Soil dynamics and Earthquake
engineering, 13, 13-24
• Al-Hunaidi O., 1998, Evolution based genetic algorithm for the analysis of non-
destructive surface wave test on pavements, NDT&F, 31, 273-280
• Audisio A., Bonani S., Foti S., 1999, Advances in spectral analysis for SASW test, in:
Jamiolkowsky M., Lancellotta R., LoPresti D. (eds), Pre-failure deformation
Characteristics of Geomaterials, Balkema, Rotterdam, 395-402
• Backus, G., Gilbert F., 1970, Uniqueness in the inversion of inaccurate gross earth data,
Phil. Trans. R. Soc. Lond., 266A, 123-192
• Backus, G., Gilbert, F, 1968, The resolving power of gross earth data, Geophys. J. Roy.
Astr. Soc, 16, 169-205
• Ballard R.F., 1964, Determination of soil shear moduli at depth by in situ vibratory
techniques, Waterways Experiment Station, Miscellaneous Paper, No.4–691, December
• Blake R.J., Bond L.J., 1990, Rayleigh wave scattering from surface features: wedges and
down steps, Ultrasonics, 28, 214-228
• Blake R.J., Bond L.J., 1992, Rayleigh wave scattering from surface features: up steps and
toughts, Ultrasonics, 30, 255-265
• Bloch S., Hales A.L., 1968, New techniques for the determination of surface wave phase
velocities, Bulletin of the Seismological Society of America, 58 (3), 1021-1034
• Budaev B.V., Bogy D.B., 1995, Rayleigh waves scattering by a wedge I, Wave Motion,
22, 239-257
• Budaev B.V., Bogy D.B., 1996, Rayleigh waves scattering by a wedge II, Wave Motion,
24, 307-314
• Budaev B.V., Bogy D.B., 2001, Scattering of Rayleigh and Stoneley waves by two
adhering elastic wedges, Wave Motion, 33, 321-337
• Bullen, K.E., 1963, An introduction to the theory of seismology, Cambridge University
Press
references
• Buttkus B., 2000, Spectral analysis and filter theory in applied geophysics, Springer,
Berlin, 667 pp
• Chapman, N.R., 1991. Estimation of geoacoustic properties by inversion of acoustic field
data, in: Hovem, J.M., Richardson,M.D., and Stoll,R.D. (Eds.), Shear Waves in marine
sediments, Kluwer Academic Press, Dordrecht: 511-520
• Claerbout, J.F., 1985, Fundamentals of geophysical data processing, Blackwell Scientific
Publication, California, 274 pp
• Comina C., Foti S., Sambuelli L., Socco L.V., Strobbia C., 2002, Joint inversion of VES
and surface wave data, proc. SAGEEP, LasVegas
• Dorman J., Ewing M., 1962, Numerical Inversion of seismic surface waves dispersion
data and crust-mantle structure in the New York-Pennsylvania area, J Geophysical
Research, 67, 5227-5241
• Dorman J., Ewing M., Olivier J., 1960, Study of the shear-velocity distribution in the
upper mantle by mantle Rayleigh waves, Bulletin of the Seismological Society of
America, 50, 87-115
• Duijndam A.J.W., Schoneville M.A., 1999, Nonuniform Fast Fourier Transform,
Geophysics, 64(2), 539-551
• Duijndam A.W.J., 1988, Bayesian estimation in seismic inversion. Part I: Principles,
Geophysical Prospecting, 36, 878-898
• Duijndam A.W.J., 1988, Bayesian estimation in seismic inversion. Part II: Uncertainty
analysis, Geophysical Prospecting, 36, 899-918
• Dunkin, J.W., 1965, Computation of modal solutions in layered, elastic media at high
frequencies, Bulletin of Seismological Society of America, 55, 335-358
• Dziewonski A., Bloch S., Landisman M., 1969, A technique for the analysis of transient
seismic signals, Bulletin of the Seismological Society of America, 59, 427-444
• Dziewonski A.M., Hales A.L., 1972, Numerical analysis of dispersive seismic waves, in:
B.A. Bolt (ed), Methods in computational physics, , Vol.11, 271-295, Academic Press
• Ewing, W.M., Jardetzky, W.S., Press, F., 1957, Elastic waves in layered media, McGraw
Hill Book Co. Inc., NewYork
• Foti S., 2000, Multi-Station Methods for Geotechnical Characterisation Using Surface
Waves, PhD Diss., Politecnico di Torino, 229 pp
• Foti S., Lai C.G., Lancellotta R., 2002, Porosity of Fluid-Saturated Porous Media from
Measured Seismic Wave Velocities, Geotechnique, 52 , 359-373
• Foti S., Lancellotta R., Sambuelli L., Socco L.V., 2000, Application of fk Analysis of
Surface Waves for Geotechnical Characterization, Annali di Geofisica 43, N.6 pp.1199-
1209.
• Foti S., Sambuelli L., Socco L.V., Strobbia C., 2001 Fast SWM for Site Characterisation
Devoted to Trenchless Technology. Atti del 7th Meeting Environmental and Engineering
Geophysics Society European Section, 2 – 6 settembre, Birmingham, 270 – 271.
references
• Foti S., Sambuelli L., Socco L.V., Strobbia C., 2002. Spatial Sampling Issues in fk
Analysis of Surface Waves Atti del SAGEEP 2002, Las Vegas, Feb. 2002 CDROM.
• Foti S., Strobbia C., 2002, Some notes on model parameters for surface wave data
inversion, proc. SAGEEP, LasVegas
• Gabriels P., Snieder R., Nolet G., 1987, In situ measurement of shear wave velocity in
sediments with higher-mode rayleigh waves, Geophysical prospecting, 35, 187-196
• Gautesen A.K., 2002a, Scattering of a Rayleigh wave by an elastic quarter space –
revisited, Wave Motion, 35, 91-98
• Gautesen A.K., 2002b, Scattering of a Rayleigh wave by an elastic three quarter space,
Wave Motion, 35, 99-106
• Gilbert F., Backus G.E., 1966, Propagator matrices in elastic wave and vibration
problems, Geophysics, 31(2), 326-332
• Glangeaud F., Mari J.L., Lacoume J.L., Mars J., Nardin M., 1999, Dispersive seismic
waves in geophysics, Eur. J. Environ. Eng. Geophys., 3, 265-306
• Gucunski N., Woods R.D., 1991, Inversion of Rayleigh wave dispersion curve for SASW
test, Proc. 5th Conf. On Soil Dyn. And Earthquake Engineering, Karlsuhe, 127-138
• Gucunski, N., Ganji, V., Maher, M.H., 1996, Effects of obstacles on Rayleigh wave
dispersion obtained from SASW test, Soil Dynamics and Eathquake Engineering, 15,
223-231
• Gukunski N., Woods R.D., 1992, Numerical simulation of SASW test, Soil Dynamics
and Earthquake Engineering, 11(4), Elsevier, 213-227
• Hardage, B.A., 1983, Vertical Seismic Profiling: Part A: principles, Geophysical Press,
552pp
• Haskell, N.A., 1953, The dispersion of surface waves on Multilayered Media, Bulletin of
the Seismological Society of America, 43, 17-34
• Hering A., Misiek R., Gyulai A., Ormos T., Dobroka M., Dresen L., 1995, A joint
inversion algorithm to process geoelectric and surface wave seismic data. Part I: basic
ideas, Geophysical Prospecting, 43, 135-156
• Herman, G.T., 1980, Image reconstruction from projections, the fundamentals of
computerized tomography, Academic Press, New York
• Herrmann R.B., 1973, Some aspects of band-pass filtering of surface waves, Bulletin of
the Seismological Society of America, 63, 663-671
• Herrmann R.B., 1994, SURF: surface wave inversion program, R.B.Herrmann editor
• Herrmann R.B., Al-Eqabi G.I., 1991, Surface wave inversion for shear velocity, in:
J.M.Hoven et al. (eds), Shear waves in marine sediments, Kluwer Academic Publisher,
545-556
• Herrmann, R.B., 1996, Computer programs in seismology: an overview on synthetic
seismogram computation, User’s manual, StLouis University, Missouri
• Herrmann, R.B., Wang, C.Y., 1985, A comparison of synthetic seismograms, Bulletin of
Seismological Society of America, 75: 41-56
references
• Hiltunen D.R., Woods R.D., 1988, SASW and crosshole test results compared,
Earthquake Engineering,
• Hisada Y., 1994, An efficient method for computing Green’s function for a layered half
space with sources and receivers at close depth, Bulletin of the Seismological Society of
America, 84, 1456-1472
• Hisada Y., 1995, An efficient method for computing Green’s function for a layered half
space with sources and receivers at close depth (part 2), Bulletin of the Seismological
Society of America, 85, 1080-1093
• Horike M., 1985, Inversion of phase velocity of long-period microtremors to the s-wave
velocity structure down to the basement in urbanized areas, J.Phys.Earth, 33, 59-96
• Jackson, D.D., 1972, Interpretation of Inaccurate, Insufficient and Inconsistent Data,
Geophys. J. Roy. Astr. Soc, 28, 97-109
• Jefferys W.H., Berger J.O., 1992, Ockham’s razor and bayesian analysis, American
Scientist, 80, 64-72
• Jones R.B., 1958, In-situ measurements of the dynamic properties of soil by vibration
method, Geotechnique, 8, 1-21
• Jones R.B., 1962, Surface waves techniques for measuring the elastic properties and
thickness of roads: theoretical development, British Journal of Applied physics, 13, 21-29
• Jongmans D., Demanet D., 1993, The importance of surface vibration study and the use
of Rayleigh waves for estimating the dynbamic chacteristics of soils, Eng. Geol., 34, 105-
113
• Kanai, K., Tanaka, T., 1961, On microtremors, VIII, Bull. Earthq.Res.Inst., 39, 97-114
• Kausel E., Roesset J.M., 1981, Stiffness matrices for Layered Soils, Bulletin of the
Seismological Society of America, 71, 1743-1761
• Keilis-Borok V.I., Levshin A.L., Yanovskaya T.B., Lander A.V., Bukchin B.G., Barmin
M.P., Ratnikova L.I., Its E.N., 1989, Seismic surface waves in laterally inhomogeneous
earth, Kluwer Academic Publishers, 293 pp
• Kennet B.L.N., 1974, Reflections, Rays and Reverberation, Bulletin of the Seismological
Society of America, 64, 1685-1696
• Knopoff L., 1964, A matrix method foe elastic wave problems, Bulletin of the
seismological society of America, 54, 431-438
• Knopoff L., 1972, Observation and Inversion of Surface wave dispersion,
Tectonophysics, 13,497-519
• Kovach R.L., 1978, Seismic surface waves and crust and upper-mantle structure, Rev.
Geophys and Space Phys., 16, 1-13
• Kramer S.L., 1996, Geotechnical Earthquake Engineering, Prentice-Hall, New Yersey
• Lai and Rix, 2002, BSSA
• Lai C.G., 1998, Simultaneous Inversion of Rayleigh Phase Velocity and Attenuation for
Near-Surface Site Characterization, PhD Diss., Georgia Inst. Of Techn., Atlanta
(Georgia, USA) , 370 pp
references
• Lai C.G., Rix G.J., 1999, Inversion of multi-mode effective dispersion curves, in Pre-
failure deformation Characteristics of Geomaterials, Jamiolkowsky M., Lancellotta R.,
LoPresti D. editors, Balkema, Rotterdam, 411-418
• Lamb, H., 1904, On the propagation of tremors over the surface of an elastic solid,
Philos. Trans., vol.CCIII, pp. 1-42
• Lanczos C., 1961, Linear Differential operators, D.Van Nostrand Co., London
• Levenberg K., 1944, A method for the solution of certain nonlinear problems in least
squares, Quarterly of Applied Mathematics 2, 164-168
• Lines R.L., Treitel S., 1984, A review of least squares inversion and its application to
geophysical problems, Geophysical prospecting, 32, 159-186
• Love A.E.H., 1911, Some Problems of Geodynamics, Cambridge University Press
• Mari, J.L., 1984, Estimation of static corrections for shear wave profiling using dispersive
properties of Love waves, Geophysics, 49, 1169-1179
• Marquardt D.W., 1963, An algorithm for least squares estimation of nonlinear
parameters, Journal of the Society of Industrial Applied Mathematics, 2, 431-441
• Martinez M.D., Lana X., Olarte J., Badal J., Canas J.A., 2000, Inversion of Rayleigh
wave phase and group velocity by simulated annealing, Physics of the Earth and
planetary interiors, 122, 3-17
• McMechan G.A., Yedlin M.J., 1981, Analysis of dispersive wave by wave field
transformation, Geophysics, 46, 869-874
• Menke W., 1989, Geophysical Data Analysis: Discrete Inverse theory, International
Geophysics Series. San Diego: Academic Press. 289 pp
• Misiek R., Liebig A., Gyulai A., Ormos T., Dobroka M., Drese L., 1995, A joint
inversion algorithm to process geoelectric and surface wave seismic data. Part II:
applications, Geophysical Prospecting, 45, 65-85
• Mockart T.A., Herrmann, R.B., Russel R.D., 1988, Seismic velocity and Q model for the
shallow structure of the Arabian shield from short-period Rayleigh waves, Geophysics,
54, 1379-1387
• Nazarian S., Stokoe K.H. II, 1986, Use of surface waves in pavement evaluation, Transp.
Res. Rec, 1070, 132-144
• Nazarian S., Stokoes II K.H., 1984, In situ shear wave velocity from spectral analysis of
surface waves, Proc 8th Conference on Earthquake engineering- St Francisco, vol. 3,
Prentice Hall, pp.31-38
• Nolet G., Panza G.F., 1976, Array analysis of seismic surface waves: limits and
possibilities, Pure and Applied geophysics, 114, 776-790
• Nolet, G., 1987, Seismic tomography, D.Reidel Publishing Co., Dordrecht, Holland,
386pp
• Oppenheim A.V., Schaefer R.W., 1975, Digital Signal Processing, Prentice-Hall,
Inglewood Cliffs
references
• Park C.B., Miller R.D., Xia J., 1999, Multichannel analysis of surface waves,
Geophysics, 64(3), 800-808
• Park C.B., Miller R.D., Xia J., 2001, Offset and resolution of dispersion curve in
multichannel analysis of surface waves (MASW), proc. SAGEEP 2001
• Pecorari C., 2001, Scattering of a Rayleigh wave by a surface breaking crack with faces
in partial contact, Wave Motion, 33, 259-270
• Penrose R., 1955, A generalized inverse for matrices, Proc.Camb. Phil. Soc., 51, 406-413
• Poggiagliolmi E., Berkhout A.J., Boone M.M., 1982, Phase unwrapping, possibilities and
limitations, Geophysical Prospecting, 30, 281-291
• Press F., 1968, Earth models obtained by Monte Carlo inversion, Journal of Geophysical
Research, 73, 5223-5234
• Press F., Harkrider D., Seafeldt C.A., 1961, A fast convenient program for computation
of surface wave dispersion curves in multi-layered media, Bulletin of the seismological
society of America, 51, 495-502
• Press W.H., Teukolsky S., Vetterling W., Flannery B, 1992, Numerical Recipes – The art
of technical computing, Cambridge University Press, Cambridge, 2nd edition
• Randall M.J., 1967, Fast programs for layered half-space problems, Bulletin of the
Seismological Society of America, 57, 1299-1316
• Rayleigh J.W.S., 1985, On waves propagated along the plane surface of an elastic solid,
Proc. Lond. Math. Soc., 17, 4-11
• Richart F.E. Jr., Woods R.D., Hall J.R., 1970, Vibration of soil and foundations, Prentice-
Hall, New Jersey
• Rix G.J., Lai C.G., Foti S., 2001, Simultaneous Measurement of Surface wave dispersion
and attenuation curves, Geotechnical Testing Journal, ASTM, 24, 350-358
• Roesset J.M., Chang D.W., Stokoe K.H. II, 1991, Comparison of 2D and 3D models for
analysis of surface wave tests, Proc. 5th Int. Conf. On Soil Dyn. And Earthq. Eng.,
Karlsruhe, vol.1, pp 111-126
• Sambuelli L, Deidda G.P., Albis G., Giorcelli E., Tristano G., 2001, Comparison of
standard horizontal geophones and newly designed horizontal detectors, Geophysics, 66,
1827-1837
• Sanchez-Salinero I., 1987, Analytical investigation of seismic methods used for
engineering applications, PhD Dissertation, University of Texas at Austin
• Santamarina J.C., Fratta D., 1998, Introduction to discrete signals and inverse problems
in civil engineering, ASCE Press, New York, 327 pp
• Schwab F, Knopoff L., 1970, Surface wave dispersion computation, Bulletin of the
seimological society of America, 60, 321-344
• Schwab F., Knopoff L., 1971, Surface waves on multilayered anelastic media, Bulletin of
the seismological society of America, 61, 893-912
• Sheriff R.E., Geldart L.P., 1995, Exploration Seismology, University Press, Cambridge
references
• Shtivelman V., 1999, Using surface waves for estimating the shear-wave velocities in the
shallow subsurface onshore and offshore Israel, European journal of environmental and
engineering geophysics, 4, 17-36
• Smith F.B., Shanno D.F., 1971, An improved Marquardt procedure for nonlinear
regression, Technometrics, 13, 63-75
• Socco L.V., Strobbia C. (2003) Extensive Modelling To Study Surface Wave Resolution
Accettato per gli Atti del SAGEEP 2003, San Antonio, Apr. 2003 CDROM.
• Socco L.V., Strobbia C., Foti S., 2002, Multimodal Interpretation of Surface Wave Data
Atti del 8th Meeting Environmental and Engineering Geophysics Society European
Section, 8 – 14 settembre, Aveiro.
• Stokoe K.H. II, Nazarian S., 1985, Use of Rayleigh wave in liquefaction studies, Proc. Of
the Measurement and Use of shear wave velocity for evaluating dynamic soil properties,
ASCE, N.Y., 1-17
• Stokoe K.H., Wright S.G., Bay J., Roesset J.M., 1994, Characterization of geotechnical
sites by SASW method, in Geophysical characterization of sites, (ISSMFE TC#10) by
R.D. Woods (ed), Oxford & IBH Publ., pp. 15-25
• Stokoes K.H. II, Nazarian S., Rix G.J., Sanchez-Salinero I., Sheu J., Mok Y., 1988, In
situ seismic testing of hard-to-sample soils by surface wave method, Eartquake
Engineering and Soil Dynamics II- Recent advances in ground-motion evaluation-Park
city, ASCE, pp-264-277
• Strobbia C., 2002, Weighted f-k for the surface wave analysis, proc. EEGS-ES meeting,
Aveiro, Portugal
• Strobbia C., Foti S., 2003, Statistical Regression of Phase difference in surface wave data,
Atti SAGEEP, SanAntonio, Texas, USA
• Szelwis R., Behle A., 1984, Shallow shear wave velocity estimation from multi-modal
Rayleigh waves, in: Shear Wave exploration, S.H. DanBom and S.N. Domenico
(editors), Geophysical Development Series, vol 1, SEG, 214-226
• Tarantola, A., 1987, Inverse Problem Theory : Methods for data fitting and model
parameter estimation, Elsevier Science Publisher, Amsterdam, 614 pp
• Thomson, W.T., 1950, Transmission of Elastic Waves through a Stratified Solid
Medium, Journal of Applied Physics, 21, 89-93,
• Thrower E.N., 1965, The computation of the dispersion of elastic waves in layered
media, J.Sound.Vib., 2, 210-266
• Tokimatsu K., 1995, Geotechnical Site characterisation using surface waves, Proc. 1st
Int.Conf. on Eartq. Geotechn. Eng., IS-Tokio, Balkema, 1333-1368
• Tokimatsu K., Shinzawa K., Kuwayama S., 1992b, Use of short period microtremors for
Vs profiling, Journal of Geotechnical Engineering, 118 (10), 1544-1558
• Tokimatsu K., Tamura S, Kojima H., 1992a, Effects of multiple mode on Rayleigh wave
dispersion characteristics, Journal of Geotechnical Engineering, American Society of
Civil Engineering, 118 (10) 1529-1543
references
• Tselentis G.A., Delis G., 1998, Rapid assessment of S-wave profiles from the inversion
of multichannel surface wave dispersion data, Annali di Geofisica, 41, 1-15
• Van der Veen M., Brouwer J., Helbig K, 1999, Weighted sum method for calculating
ground force: an evaluation by using a portable vibrator system, Geophysical
Prospecting, 47, 251-267
• Viktorov, I.A., 1967, Rayleigh and Lamb waves: physical theory and applications,
Plenum Press, NewYork, 154 pp
• Vucetic, M., 1994, Cyclic threshold shear strains in soils, J.Geotechnical Eng., 120,
ASCE, 2208-2228
• Wang, C.Y., Herrmann, R.B., 1980, A numerical study of P-, SV-, SH- wave generation
in a plane layered medium, Bulletin of Seismological Society of America, 70 1015-1036
• Watson T.H., 1970, A note on fast computation of Rayleigh wave dispersion in the
multilayerd elastic half-space, Bulletin of the Seismological Society of America, 60, 161-
166
• White J.E., 1983, Underground Sound. Application of seismic waves, Elsevier Science
Publisher, 253 pp.
• Wiggins R., 1972, The general linear inverse problem: implications of surface waves and
free oscillation of the earth structure, Rev. Geophys. Space Phys., 10
• Xia J., Miller R.D., Park C.B., 1999, Estimation of near-surface shear-wave velocity by
inversion of Rayleigh waves, Geophysics, 64, 691-700
• Yilmaz O., 1987, Seismic data processing, Society of Exploration Geophysics, Tulsa
• Zwicky D., Rix G.J., 1999, Frequency wavenumber analysis of passive surface waves,
Proc. Symp. On the Appl. Of Geophysics to Environ. And Eng. Problems, Oakland, 75-
84
references
Books
Aki and Richards, 1980; Buttkus, 2000; Claerbout, 1985; Ewing et al., 1957; Hardage, 1983; Keilis-
Borok et al., 1989; Menke, 1989; Nolet, ; Oppenheim and Shaefer, 1975; Richart et al., 1970;
Santamarina and Fratta, 1998; Tarantola, 1987; Viktotov, 1967; White, 1983; Yilmaz, 1987;
317