Lecture Notes On Mathematical Modelling
Lecture Notes On Mathematical Modelling
Lecture Notes On Mathematical Modelling
net/publication/277986444
CITATIONS READS
2 21,685
3 authors:
Marcello Delitala
Politecnico di Torino
68 PUBLICATIONS 2,242 CITATIONS
SEE PROFILE
All content following this page was uploaded by Nicola Bellomo on 25 February 2016.
Vol. 8 - 2010
ISBN-A: 10.978.88905708/72
Licensed under
Attribution-Non-Commercial-No Derivative Works
Published by:
SIMAI - Società Italiana di Matematica Applicata e Industriale
Via dei Taurini, 19 c/o IAC/CNR
00185, ROMA (ITALY)
Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Preface
The Lecture Notes collected in this book refer to a university course delivered at the Politecnico
of Torino to students of the Master Graduation in Mathematical Engineering. Ph.D. students
attending programs in engineering sciences have been attending the same lectures.
The monograph corresponds to the first part of the course devoted to modelling issues to
show how the application of models to describe real world phenomena generates mathematical
problems to be solved by appropriate mathematical methods. Mathematical models are quite
simple being proposed with tutorial aims.
The contents are developed through four chapters. The first one proposes an introduction to
the science of mathematical modelling focusing on the three representation scales of physical
reality: microscopic, macroscopic and statistical over the microscopic states. The three chapters
which follow deal with the derivation and applications of models related to each of the afore-
mentioned scales. Different mathematical structures correspond to each scale. Specifically
models at the microscopic scale are generally stated in terms of ordinary differential equations,
while models at the macroscopic scale are stated in terms of partial differential equations.
Models of the mathematical kinetic theory, presented in Chapter 4, are stated in terms of
integro-differential equations.
The above different structures generate a variety of analytic and computational problems.
The contents are devoted to understand how computational methods can be developed starting
from an appropriate discretization of the dependent variables.
The Lecture Notes look at applications focussing on modelling and computational issues,
while the pertinent literature on analytic methods is brought to the attention of the interested
reader for additional education.
After the above introduction to the contents and aims of the Lecture Notes, a few remarks
are stated to make a little more precise the guidelines followed by the authors.
• All real systems can be observed and represented at different scales by mathematical equations.
The selection of a scale with respect to others belong, on one side, to the strategy of the scientists
vi
in charge of deriving mathematical models, and on the other hand to the specific application
of the model.
• Systems of the real world are generally nonlinear. Linearity has to be regarded either as
a very special case, or as an approximation of physical reality. Then methods of nonlinear
analysis need to be developed to deal with the application of models. Computational methods
are necessary to solve mathematical problems generated by the application of models to the
analysis and interpretation of systems of real world.
• Computational methods can be developed only after a deep analysis of the qualitative prop-
erties of a model and of the related mathematical problems. Different methods may correspond
to different models and problems.
• Modelling is a science which needs creative ability linked to a deep knowledge of the whole
variety of methods offered by applied mathematics. Indeed, the design of a model has to be
precisely related to the methods to be used to deal with the mathematical problems generated
by the application of the model.
• Modeling systems of the inert matter can generally take advantage of first principles related
to well defined physical theories. On the other hand, models of the living matter need looking
at the complexity of living entities and at their ability to express specific strategies.
These Lectures Notes attempt to provide an introduction to the above issues and will exploit
the use of electronic diffusion to update periodically the contents also on the basis of interactions
with students. The authors aim at taking advantage of suggestions, generally useful, from
students involved in the master graduation in mathematics for engineering sciences. This edition
is a revisiting of the Lecture Notes published by SIMAI with the slightly different title. The main
novelty consists in enlarging the horizons of applied mathematics to modeling and simulation
of living, and hence complex, living systems.
Chapter 1
The analysis of systems of applied and natural sciences, for instance technology, economy,
biology etc., needs a constantly growing use of methods of mathematics and computer sciences.
In fact, once a physical system has been observed and phenomenologically analyzed, it is often
useful to use mathematical models to describe its evolution in time and space. Indeed, the
interpretation of systems and phenomena, which occasionally show complex features, is gener-
ally developed on the basis of methods which organize their interpretation toward simulation.
When simulations related to the behavior of the real system are available and reliable, it may
be possible, in most cases, to reduce time devoted to observation and experiments.
Bearing in mind the above reasoning, one can state that there exists a strong link between
applied sciences and mathematics represented by mathematical models designed and applied,
with the aid of computer sciences and devices, to the simulation of systems of real world. The
term mathematical sciences refers to various aspects of mathematics, specifically analytic and
computational methods, which both cooperate to the design of models and to the development
of simulations.
Before going on with specific technical aspects, let us pose some preliminary questions:
1 • What is the aim of mathematical modelling, and what is a mathematical model ?
2 • There exists a link between models and mathematical structures ?
3 • There exists a correlation between models and mathematical methods ?
4 • Which is the relation between models and computer sciences ?
Moreover:
5 • Can mathematical models contribute to a deeper understanding of physical reality ?
6 • Is it possible to reason about a science of mathematical modelling ?
2
7 • Can education in mathematics take some advantage of the above mentioned science of
mathematical modelling ?
8 • Is it possible designing models of living systems by an approach that retains the complexity
of the living matter ?
Additional questions may be posed. However, it is reasonable to stop here considering that
specific tools and methods are needed to answer precisely to the above questions. A deeper
understanding of the above topics will be achieved going through the chapters of these Lecture
Notes also taking advantage of the methods which will be developed in the next chapters.
Nevertheless an intuitive reasoning can be developed and some preliminary answers can be
given:
• Mathematical models are designed to describe physical systems by equations or, more in
general, by logical and computational structures.
• The above issue indicates that mathematical modelling operates as a science by means of
methods and mathematical structures with well defined objectives.
• Intuitively, it can be stated that education in mathematics may take advantage of the sci-
ence of mathematical modelling. Indeed, linking mathematical structures and methods to the
interpretation and simulation of real physical systems is already a strong motivation related to
an inner feature of mathematics, otherwise too much abstract. Still, one has to understand if
modelling provides a method for reasoning about mathematics.
• At this preliminary stage, it is difficult to reason about the possibility that mathematical
models may contribute to a deeper understanding of physical reality. At present, we simply
trust that this idea will be clarified all along the contents of these Lectures Notes. The goal
consists in depicting, by mathematical models, behaviors that have not yet been observed in
reality.
• Far more difficult is a reply, although preliminary, to the last question 8. We simply observe
that deterministic causality principles are lost in the case of living system, as observed by May
(2004) and Reed (2004). This delicate matter will be treated more deeply in the last chapter
of this monograph.
This chapter has to be regarded as an introduction to the science of mathematical modelling
which will be developed through these Lecture Notes with reference to well defined mathematical
structures and with the help of several applications intended to clarify the above concepts.
Specifically it deals with general introduction to mathematical modelling, and is organized into
six more sections which follow this introduction:
– Section 1.2 deals with the presentation of some simple examples of mathematical models which
act as a preliminary reference for the various concepts introduced in the following sections.
Then, the definition of mathematical model is given as an equation suitable to define the
evolution in time and space of the variable charged to describe (at each specific scale) the
physical state of the real system.
3
– Section 1.3 deals with a preliminary aspect of the modelling process, that is the identification
of the representation scales, microscopic, macroscopic and statistical, needed to observe and
represent a real system. The above concepts are related to a variety of examples of models
at each one of the above scales. Simple examples are chosen with tutorial aims, while more
sophisticated models are treated in the next chapters.
– Section 1.4 deals with the dimensional analysis of mathematical models. It is shown how writ-
ing the model in terms of dimensionless variables is useful towards computational analysis and
allows to extract suitable scaling parameters which can be properly used towards a qualitative
understanding of the properties of the model.
– Section 1.5 analyzes the various concepts proposed in the preceding sections by means of
models of vehicular traffic flow. Such a system can be described by different models and scales,
all of them are analyzed with reference to the above mentioned definitions and scaling methods.
– Section 1.6 deals with a classification of models and mathematical problems still referring to
the various aspects of the modelling process dealt with in the preceding sections.
– Section 1.7 provides a description and critical analysis of the contents of this chapter with
special attention to complexity problems. This section introduces the contents of the last
chapter, devoted to the modeling of living systems.
This section deals with the description of three simple examples of mathematical models which
will be a technical reference for the definitions given in the following sections. The models are
derived by an intuitive approach, while well defined modelling methods will be developed in
the chapters which follow and applied to the design of relatively more sophisticated models.
The first example describes linear oscillations of a mass constrained to move along a line,
while the second one refers to modelling heat diffusion phenomena. The third example is
a generalization of the second one to a nonlinear case. As already mentioned, very simple
examples are selected to reason, according to tutorials aims, with classification of models and
mathematical structures.
Example 1.2.1
Linear Elastic Wire-Mass System
Consider, with reference to Figure 1.2.1, a mechanical system constituted by a mass m con-
strained to translate along an horizontal line, say the x-axis. The location of the mass is iden-
tified by the coordinate of its center of mass P , which is attached to an elastic wire stretched
with ends in A and P . The assumptions defining the mechanical model are the following:
• The system behaves as a point mass with localization identified by the variable x.
• The action of the wire is a force directed toward the point A with module: T = k x.
4
• Friction forces are negligible with respect to the action of the wire.
Application of Newton’s principles of classical mechanics yields:
d2 x
m = −kx . (1.2.1)
dt2
The mathematical model is an evolution equation for the following vector variable:
µ ¶
dx
u = u1 = x , u2 = · (1.2.2)
dt
Using the above variables, the second order ordinary differential equation (1.2.1) reads:
du1
= u2 ,
dt
(1.2.3)
du2
k
= − u1 ,
dt m
which is a linear equation.
Example 1.2.2
Linear Heat Diffusion Model
Consider the one-dimensional linear heat diffusion model in a rod. The assumptions defining
the mechanical model are the following:
• The state of the system is described by the temperature u = u(t, x) along the axis of the rod
identified by the variable x ∈ [0, 1]. Variations orthogonal to the axis of the rod are neglected
as the walls of the rod are perfectly isolated.
• The heat flow q per unit area is proportional to the temperature gradient:
∂u ,
q = −h0 (1.2.4)
∂x
where h0 is the heat conduction coefficient.
5
• The material properties of the conductor are identified by the heat conduction coefficient
h0 and heat capacity c0 .
The mathematical model can be obtained equating the net heat flux in a volume element to
the rate of increase of the heat capacity in the volume. Let q + and q − be, respectively, the
ingoing and outgoing heat fluxes for unit area, see Figure 1.2.2 The above balance writes:
∂u ∂q ,
c0 A dx = −A(q + − q − ) = −A dx (1.2.5)
∂t ∂x
d2 u
k0 = 0, (1.2.7)
dx2
which can also be written as a system of two coupled equations in normal form:
du
= v,
dx
(1.2.8)
dv = 0 .
dx
6
Example 1.2.3
Nonlinear Heat Diffusion Model
Nonlinearity may be related to the modelling of the heat flux phenomenon. For instance, if
the heat flux coefficient depends on the temperature, say h = h(u), the same balance equation
generates the following model:
· ¸
∂u ∂ ∂u h(u)
= k(u) , k(u) = . (1.2.9)
∂t ∂x ∂x c0
The reader with a basic knowledge of elementary theory of differential equations will be soon
aware that the above two simple models generate interesting mathematical problems. In fact,
Model 1.2.1 needs initial conditions for t = t0 both for u1 = u1 (t) and u2 = u2 (t), while Models
1.2.2 and 1.2.3 need initial conditions at t = t0 and boundary conditions at x = 0 and x = 1
for u = u(t, x).
The solution of the above mathematical problems ends up with simulations which visualize
the behavior of the real system according to the description of the mathematical model.
After the above examples, a definition of mathematical model can be introduced. This
concept needs some preliminary definitions referring to:
• State variables which are the dependent variables, that take values corresponding to
the independent variables.
• Mathematical model, that is a set of equations which define the evolution of the state
variable over the dependent variables.
The general idea is to observe the phenomenology of a real system in order to extract its
main features and to provide a model suitable to describe the evolution in time and space of
its relevant aspects. Bearing this in mind, the following definitions are proposed:
Independent variables
The evolution of the real system is referred to the independent variables which,
unless differently specified, are time t, defined in an interval (t ∈ [t0 , T ]), which refers
the observation period; and space x, related to the volume V, (x ∈ V) which contains
the system.
7
State variable
The state variable is the finite dimensional vector variable
Mathematical model
A mathematical model of a real physical system is an evolution equation suitable
to define the evolution of the state variable u in charge to describe the physical state
of the system itself.
In order to handle properly a mathematical model, the number of equations and the dimension
of the state variable must be the same. In this case the model is defined consistent:
Consistency
The mathematical model is said to be consistent if the number of unknown depen-
dent variables is equal to the number of independent equations.
This means that one has to verify whether an equation belonging to the model can be obtained
combining the remaining ones. If this is the case, that equation must be eliminated.
The space variable may be referred to a suitable system of orthogonal axes, 0(x, y, z) with
unit vectors i, j, k, so that a point P is identified by its coordinates
P = P(x) = x i + y j + z k . (1.2.11)
The real physical system may be interacting with the outer environment or may be isolated.
In the first case the interactions has to be modelled.
The above definitions can be applied to real systems in all fields of applied sciences: engi-
neering, natural sciences, economy, and so on. Actually, almost all systems have a continuous
distribution in space. Therefore, their discretization, that amounts to the fact that u is a finite
dimensional vector, can be regarded as an approximation of physical reality.
In principle, one can always hope to develop a model which can reproduce exactly physical
reality. On the other hand, this idealistic program cannot be practically obtained considering
that real systems are characterized by an enormous number of physical variables. This reasoning
applies to Example 1.2.1, where it is plain that translational dynamics in absence of frictional
forces is only a crude approximation of reality. The observation of the real behavior of the
8
system will definitively bring to identify a gap between the observed values of u1 and u2 and
those predicted by the model.
Uncertainty may be related also to the mathematical problem. Referring again to the above
example, it was shown that the statement of mathematical problems need u10 and u20 , i.e. the
initial position and velocity of P , respectively. Their measurements are affected by errors so
that their knowledge may be uncertain.
In some cases this aspect can be dealt with by using in the model and/or in the mathematical
problems randomness modelled by suitable stochastic variables. The solution of the problem
will also be represented by random variables, and methods of probability theory will have to
be used.
As we have seen, mathematical models are stated in terms of evolution equations. Examples
have been given for ordinary and partial differential equations. The above equations cannot be
solved without complementing them with suitable information on the behavior of the system
corresponding to some values of the independent variables. In other words the solution refers to
the mathematical problem obtained linking the model to the above mentioned conditions. Once
a problem is stated suitable mathematical methods have to be developed to obtain solutions
and simulations, which are the prediction provided by the model.
The analysis of the above crucial problems, which is a fundamental step of applied mathe-
matics, will be dealt with in the next chapters with reference to specific classes of equations.
As we have seen by the examples and definitions proposed in Section 1.2, the design of
a mathematical model consists in deriving an evolution equation for the dependent variable,
which may be called state variable, which describes the physical state of the real system, that
is the object of the modelling process.
The selection of the state variable and the derivation of the evolution equation starts from
the phenomenological and experimental observation of the real system. This means that the
first stage of the whole modelling method is the selection of the observation scale. For instance
one may look at the system by distinguishing all its microscopic components, or averaging
locally the dynamics of all microscopic components, or even looking at the system as a whole
by averaging their dynamics in the whole space occupied by the system.
For instance, if the system is a gas of particles inside a container, one may either model the
dynamics of each single particle, or consider some macroscopic quantities, such as mass density,
momentum and energy, obtained averaging locally (in a small volume to be properly defined:
possibly an infinitesimal volume) the behavior of the particles. Moreover, one may average the
physical variables related to the microscopic state of the particles and/or the local macroscopic
variables over the whole domain of the container thus obtaining gross quantities which represent
the system as a whole.
9
Specifically, let us concentrate the attention to the energy and let us assume that energy may
be related to temperature. In the first case one has to study the dynamics of the particles
and then obtain the temperature by a suitable averaging locally or globally. On the other
hand, in the other two cases the averaging is developed before deriving a model, then the
model should provide the evolution of already averaged quantities. It is plain that the above
different way of observing the system generates different models corresponding to different
choices of the state variable. Discussing the validity of one approach with respect to the other
is definitively a difficult, however crucial, problem to deal with. The above approaches will be
called, respectively, microscopic modelling and macroscopic modelling.
As an alternative, one may consider the microscopic state of each microscopic component
and then model the evolution of the statistical distribution over each microscopic description.
Then one deals with the kinetic type (mesoscopic) modelling which will be introduced in
this chapter and then properly dealt with later in Chapter 4. Modelling by methods of the
mathematical kinetic theory requires a detailed analysis of microscopic models for the dynamics
of the interacting components of the system, while macroscopic quantities are obtained, as we
shall see, by suitable moments weighted by the above distribution function.
This section deals with a preliminary derivation of mathematical framework related to the
scaling process which has been described above. This process will ends up with a classification
both of state variables and mathematical equations. Simple examples will be given for each
class of observation scales and models. The whole topic will be specialized in the following
chapters with the aim of a deeper understanding on the aforementioned structures.
Both observation and simulation of system of real world need the definition of suitable ob-
servation and modelling scales. Different models and descriptions may correspond to different
scales. For instance, if the motion of a fluid in a duct is observed at a microscopic scale, each
particle is singularly observed. Consequently the motion can be described within the frame-
work of Newtonian mechanics, namely by ordinary differential equations which relate the force
applied to each particle to its mass times acceleration. Applied forces are generated by the
external field and by interactions with the other particles.
On the other hand, the same system can be observed and described at a larger scale consid-
ering suitable averages of the mechanical quantities linked to a large number of particles, and
the model refers to macroscopic quantities such as mass density and velocity of the fluid. A
similar definition can be given for the mass velocity, namely the ratio between the momentum
of the particles in the reference volume and their mass. Both quantities can be measured by
suitable experimental devices operating at a scale of a greater order than the one of the single
particle. This class of models is generally stated by partial differential equations.
Actually, the definition of small or large scale has a meaning which has to be related to the
size of the object and of the volume containing them. For instance, a planet observed as a rigid
homogeneous whole is a single object which is small with respect to the galaxy containing the
planet, but large with respect to the particles constituting its matter. So that the galaxy can
be regarded as a system of a large number of planets, or as a fluid where distances between
10
planets are neglected with respect to the size of the galaxy. Bearing all above in mind, the
following definitions are given:
Microscopic scale
A real system can be observed, measured, and modelled at the microscopic scale if
all single objects composing the system are individually considered, each as a whole.
Macroscopic scale
A real system can be observed, measured, and modelled at the macroscopic scale
if suitable averaged quantities related to the physical state of the objects composing
the system are considered.
Mesoscopic scale
A real system can be observed, measured, and modelled at the mesoscopic (kinetic)
scale if it is composed by a large number of interacting objects and the macroscopic
observable quantities related to the system can be recovered from moments weighted
by the distribution function of the state of the system.
As already mentioned, microscopic models are generally stated in terms of ordinary differ-
ential equations, while macroscopic models are generally stated in terms of partial differential
equations. This is the case of the first two examples proposed in the section which follows.
The contents will generally be developed, unless otherwise specified, within the framework of
deterministic causality principles. This means that once a cause is given, the effect is deter-
ministically identified, however, even in the case of deterministic behavior, the measurement of
quantities needed to assess the model or the mathematical problem may be affected by errors
and uncertainty.
The above reasoning and definitions can be referred to some simple examples of models, this
also anticipating a few additional concepts which will be dealt in a relatively deeper way in the
chapters which follow.
Example 1.3.1
Elastic Wire-Mass System with Friction
Following Example 1.2.1, let us consider a mechanical system constituted by a mass m con-
strained to translate along a horizontal line, say the x-axis. The location of the mass is identified
by the coordinate of its center of mass P , which is attached to an elastic wire stretched with
ends in A and P . The following assumption needs to be added to those of Model 1.2.1 defining
the mechanical model:
• Friction forces depend on the p-th power of the velocity and are direct in opposition with
it.
11
The mathematical model, according to the definitions proposed in Section 1.2, is an evolution
equation for the variable u defined as follows:
µ ¶
dx
u = u1 = x , u2 = · (1.3.2)
dt
Using the above variables, the second order ordinary differential equation (1.3.1) can be
written as a system of two first order equations:
du1
= u2 ,
dt
(1.3.3)
du2 = − k u1 − c up2 ·
dt m m
The above example has shown a simple model that can be represented by an ordinary differ-
ential equation, Eq. (1.3.3), which is nonlinear for values of p different from zero or one.
Observing Eq. (1.3.3), one may state that the model is consistent, namely there are two
independent equations corresponding to the two components of the state variable. The physical
system is observed singularly, i.e. at a microscopic scale, while it can be observed that the model
is stated in terms of ordinary differential equations.
Linearity of the model is obtained if c = 0. On the other hand, if k is not a constant, but
depends on the elongation of the wire, say k = k0 xq a nonlinear model is obtained:
du1
= u2 ,
dt
(1.3.4)
du2 = − k0 uq+1
·
dt m 1
Independently of linearity properties, which will be properly discussed in the next Chapter
2, the system is isolated, namely it is a closed system. One should add, in the case of open
systems, to the second equation the action of the outer environment over the inner system. A
simple example is the following:
du1
= u2 ,
dt
(1.3.5)
du2
k0 1
= − uq+1 + F (t) ,
dt m 1 m
where F = F (t) models the above mentioned action.
12
The above models, both linear and nonlinear, have been obtained linking a general back-
ground model valid for large variety of mechanical systems, that is the fundamental principles
of Newtonian mechanics, to a phenomenological model suitable to describe, by simple analytic
expressions, the elastic behavior of the wire. Such models can be refined for each particular
system by relatively more precise empirical data obtained by experiments.
The example which follows is developed at the macroscopic scale and it is related to the heat
diffusion model we have seen in Section 1.2. Here, we consider a mathematical model suitable
to describe the diffusion of a pollutant of a fluid in one space dimension.
As we shall see, an evolution equation analogous to the one of Example 1.2.2 will be obtained.
First the linear case is dealt with, then some generalizations, i.e. non linear models and diffusion
in more than one space dimensions, are described.
Example 1.3.2
Linear Pollutant Diffusion Model
Consider a duct filled with a fluid at rest and a pollutant diffusing in the duct in the direction x
of the axis of the duct. The assumptions which define the mechanical model are the following:
• The physical quantity which defines the state of the system is the concentration of pollutant:
variations of c along coordinates orthogonal to the x-axis are negligible. The mass per unit
volume of the pollutant is indicated by ρ0 and is assumed to be constant.
• There is no dispersion or immersion of pollutant at the walls.
• The fluid is steady, while the velocity of diffusion of the pollutant is described by a phe-
nomenological model which states that the diffusion velocity is directly proportional to the
gradient of c and inversely proportional to c.
The evolution model, i.e. an evolution equation for c, can be obtained exploiting mass con-
servation equation. In order to derive such equation let consider, with reference to Figure 1.2.2,
the flux q = q(t, x) along the duct and let q + and q − be the inlet and outlet fluxes, respectively.
Under suitable regularity conditions, which are certainly consistent with the physical system
we are dealing with, the relation between the above fluxes is given by:
∂q
q+ = q− + dx . (1.3.7)
∂x
A balance equation can be written equating the net flux rate to the increase of mass in the
volume element A dx, where A is the section of the duct. The following equation is obtained:
∂c ∂(cv)
ρ0 A dx + A dx = 0 , (1.3.8)
∂t ∂x
13
where v is the diffusion velocity which, according to the above assumptions, can be written as
follows:
h0 ∂c
v=− , (1.3.9)
c ∂x
and h0 is the diffusion coefficient.
Substituting the above equation into (1.3.8) yields
∂c ∂2c h0
= k0 2 , k0 = , (1.3.10)
∂t ∂x ρ0
Nonlinearity related to the above model may occur when the diffusion coefficient depends on
the concentration. This phenomenon generates the nonlinear model described in the following
example.
Example 1.3.3
Nonlinear Pollutant Diffusion Model
Consider the same phenomenological model where, however, the diffusion velocity depends on
the concentration according to the following phenomenological model:
h(c) ∂c
v = −h0 , (1.3.11)
c ∂x
where h(c) describes the behavior of the diffusion coefficient with c. The model writes as follows:
µ ¶
∂u ∂ ∂c h(c)
= k(c) , k(c) = . (1.3.12)
∂t ∂x ∂x ρ0
The above diffusion model can be written in several space dimensions. For instance, technical
calculations generate the following linear model:
Example 1.3.4
Linear Pollutant Diffusion in Space
Let us consider the linear diffusion model related to Example 1.3.2, and assume that diffusion
is isotropic in all space dimensions, and that the diffusion coefficient does not depend on c. In
this particular case, simple technical calculations yield:
µ ¶
∂c ∂2 ∂2 ∂2
= k0 + + c = k0 ∆c . (1.3.16)
∂t ∂x2 ∂y 2 ∂z 2
The steady model is obtained equating to zero the right-hand side of (1.3.16):
µ ¶
∂2 ∂2 ∂2
2
+ 2+ 2 c = k0 ∆c = 0 . (1.3.17)
∂x ∂y ∂z
The above (simple) examples have given an idea of the microscopic and macroscopic mod-
elling. A simple model based on the mesoscopic description will be now given and critically
analyzed. Specifically, we consider an example of modelling social behaviors such that the mi-
croscopic state is defined by the social state of a certain population, while the model describes
the evolution of the probability density distribution over such a state. The above distribution
is modified by binary interactions between individuals.
Example 1.3.5
Population Dynamics with Stochastic Interaction
Consider a population constituted by interacting individuals, such that:
• The microscopic state of each individual is described by a real variable u ∈ [0, 1], that is
a variable describing its main physical properties and/or social behaviors. As examples, in the
case of a population of tumor cells this state may have the meaning of maturation or progression
stage, for a population of immune cells we may consider the state u as their level of activation.
• The statistical description of the system is described by the number density functions
N = N (t, u) , (1.3.18)
which is such that N (t, u) du denotes the number of cells per unit volume whose state is, at
time t, in the interval [u, u + du].
15
1
f = f (t, u) = N (t, u) . (1.3.19)
n0
Z 1
n(t) = f (t, u) du . (1.3.20)
0
The evolution model refers to f (t, u) and is determined by the interactions between pairs of
individuals, which modify the probability distribution over the state variable and/or the size of
the population. The above ideas can be stated in the following framework:
• Interactions between pairs of individuals are homogeneous in space and instantaneous, i.e.
without space structure and delay time. They may change the state of the individuals as well
as the population size by shifting individuals into another state or by destroying or creating
individuals. Only binary encounters are significant for the evolution of the system.
• The rate of interactions between individuals of the population is modelled by the encounter
rate which may depend on the state of the interacting individuals
η = η(v, w) , (1.3.21)
which describes the rate of interaction between pairs of individuals. It is the number of encoun-
ters per unit time of individuals with state v with individuals with state w.
• The interaction-transition probability density
gives the probability density distribution of the transition, due to binary encounters, of the
individuals which have state v with the individuals having state w that, after the interaction,
manufacture individuals with state u.
The product between η and A is the transition rate
• The evolution equations for the density f can be derived by balance equation which equates
the time derivative of f to the difference between the gain and the loss terms. The gain term
models the rate of increase of the distribution function due to individuals which fall into the
state u due to uncorrelated pair interactions. The loss term models the rate of loss in the
distribution function of u-individuals due to transition to another state or due to death.
16
The above example, as simple as it may appear, gives a preliminary idea of the way a kinetic
type modelling can be derived. This topic will be properly revisited in Chapter 4. At present
we limit our analysis to observing that a crucial role is defined by the modelling of interactions
at the microscopic scale which allows the application of suitable balance equation to obtain the
evolution of the probability distribution. At present, we can observe that since A(v → u |v, w)
is a probability density, it satisfies
Z 1
A(v → u |v, w) du = 1 , ∀ v, w ∈ [0, 1] . (1.3.25)
0
for p = 1, 2, . . .
Examples 1.2.1 and 1.3.2 can be properly rewritten using dimensionless variables. This pro-
cedure should be generally, may be always, applied. In fact, it is always useful, and in some
cases necessary, to write models with all independent and dependent variables written in a
dimensionless form by referring them to suitable reference variables. These should be properly
chosen in a way that the new variables take value in the domains [0, 1] or [−1, 1].
The above reference variables can be selected by geometrical and/or physical arguments
related to the particular system which is modelled. Technically, let wv be a certain variable
17
(either independent or dependent), and suppose that the smallest and largest value of wv ,
respectively wm and wM , are identified by geometrical or physical measurements; then the
dimensionless variable is obtained as follows:
wv − wm ,
w= w ∈ [0, 1] . (1.4.1)
wM − wm
For instance, if wv represents the temperature in a solid material, then one can assume
wm = 0, and wM = wc , where wc is the melting temperature for the solid.
In principle, the description of the model should define the evolution within the domain [0, 1].
When this does not occur, then the model should be critically analyzed.
If wv corresponds to one of the independent space variables, say it corresponds to xv , yv , and
zv for a system with finite dimension, then the said variable can be referred to the smallest and
to the largest values of each variable, respectively, xm , ym , zm , and xM , yM , and zM .
In some cases, it may be useful referring all variables with respect to only one space variable,
generally the largest one. For instance, suppose that xm = ym = zm = 0, and that yM = axM ,
and zM = bxM , with a, b < 1, one has
xv , yv , zv ,
x= y= z= (1.4.2)
xM xM xM
tv − t0 ,
t= t ≥ 0, (1.4.3)
Tc − t0
where generally one may have t0 = 0. The choice of Tc has to be related to the actual analytic
structure of the model trying to bring to the same order the cause and the effect as both of
them are identified in the model. For instance, looking at models in Example 1.2.1, the cause
is identified by the right-hand side term, while the effect is the left-hand term. The model
should be referred to the observation time during which the system should be observed. This
time should be compared with Tc .
Bearing all above in mind let us apply the above concepts to the statement in terms of
dimensionless variables of the two models described in Examples 1.2.1 and 1.3.2.
Example 1.4.1
Dimensionless Linear Elastic Wire-Mass System
Let us consider the model described in Example 1.2.1, with the addition of the following as-
sumption:
• A constant force F is directed along the x-axis.
18
d2 xv
m = F − kxv . (1.4.4)
dt2v
It is natural assuming ` = F/k, t = tv /Tc , and x = xv /`. Then the model writes:
m d2 x
= 1 − x. (1.4.5)
kTc 2 dt2
Assuming:
m m
=1 ⇒ Tc 2 = , (1.4.6)
kTc 2 k
yields
d2 x
= 1 − x, (1.4.7)
dt2
which is a second order model.
The evolution can be analyzed in terms of unit of Tc .
Example 1.4.2
Linear Dimensionless Pollutant Diffusion Model
Consider the model described in Example 1.3.2 which, in terms of real variables, can be written
as follows:
∂cv d2 cv h0
= k0 , k0 = , (1.4.8)
∂t ∂x2v ρ0
It is natural assuming u = cv /cM , t = tv /Tc , and x = xv /`. Then, the model writes:
1 ∂u k0 ∂ 2 u
= 2 , (1.4.9)
Tc ∂t ` ∂x2
Moreover, taking:
k0 Tc `2
=1 ⇒ Tc =
`2 k0
yields
∂u ∂2u
= . (1.4.10)
∂t ∂x2
In particular Eq. (1.4.10) shows that the same model is obtained, after scaling, to describe
linear diffusion phenomena in different media. Indeed, only Tc changes according to the material
19
properties of the media. This means that the evolution is qualitatively the same, but it evolves
in time with different speeds scaled with respect to Tc .
Writing a model in terms of dimensionless variables is useful for various reasons, both from
analytical point of view and from the computational one, which will be examined in details in
the chapters which follow. One of the above motivations consists in the fact that the procedure
may introduce a small dimensionless parameter which characterizes specific features of the
evolution equation, e.g. nonlinear terms, time scaling, etc. As we shall see in Chapter 2, the
above parameters allow the development of perturbation techniques such that the solution can
be sought by suitable power expansion of the small parameter.
Various models have been proposed in the preceding sections corresponding to the micro-
scopic, macroscopic, and kinetic representation. This section will show how the same physical
system can be represented by different models according to the selection of different observation
scales.
Let us consider the one dimensional flow of vehicles along a road with length `. First the inde-
pendent and dependent variables which, in a suitable dimension form, can represent the relevant
phenomena related to traffic flow are defined, then some specific models will be described.
In order to define dimensionless quantities, one has to identify characteristic time T and
length `, as well as maximum density nM and maximum mean velocity vM . Specifically:
nM is the maximum density of vehicles corresponding to bumper-to-bumper traffic jam;
vM is the maximum admissible mean velocity which may be reached by vehicles in the empty
road.
It is spontaneous to assume vM T = `, that means that T is the time necessary to cover the
whole road length ` at the maximum mean velocity vM . After the above preliminaries, we can
now define dimensionless independent and dependent variables.
The dimensionless independent variables are:
• t = tr /T , the dimensionless time variable referred to the characteristic time T , where tr is
the real time;
• x = xr /`, the dimensionless space variable referred to the characteristic length of the road `,
where xr is the real dimensional space.
The dimensionless dependent variables are:
• ρ = n/nM , the dimensionless density referred to the maximum density nM of vehicles;
• V = VR /vM , the dimensionless velocity referred to the maximum mean velocity vM , where
VR is the real velocity of the single vehicle;
• v = vR /vM , the dimensionless mean velocity referred to the maximum mean velocity vM ,
where vR is the mean velocity of the vehicles;
20
• q, the dimensionless linear mean flux referred to the maximum admissible mean flux qM .
Of course a fast isolated vehicle can reach velocities larger that vM . In particular a limit
velocity can be defined
such that no vehicle can reach a velocity larger than V` . Both vM and µ may depend on the
characteristics of the lane, say a country lane or a highway, as well as to the type of vehicles,
say a slow car, a fast car, a lorry, etc.
The above variables can assume different characterization according to the modeling scales
which can be adopted for the observation and modeling. In particular, one may consider,
according to the indications given in the previous sections, the following types of descriptions:
• Microscopic description: the state of each vehicle is defined by position and velocity as
dependent variables of time.
• Kinetic (statistical) description: the state of the system is still identified by position and
velocity of the vehicles however their identification refers to a suitable probability distribution
and not to each variable.
• Macroscopic description: the state is described by locally averaged quantities, i.e. density,
mass velocity and energy, regarded as dependent variables of time and space
In detail, in the microscopic representation all vehicles are individually identified. The
state of the whole system is defined by dimensionless position and velocity of the vehicles. They
can be regarded, neglecting their dimensions, as single points
f = f (t, x, V ) , (1.5.3)
where f dx dV is the number of vehicles which at the time t are in the phase domain [x, x +
dx] × [V, V + dV ].
Finally, the macroscopic description refers to averaged quantities regarded as dependent
variables with respect to time and space. Mathematical models are stated in terms of evolution
equations for the above variables. If one deals with density and mean velocity, then models will
be obtained by conservation equations corresponding to mass and linear momentum:
Before describing a specific model for each of the above scales, it is necessary to show how the
information recovered at the microscopic scale and by the kinetic representation can provide,
by suitable averaging processes, gross quantities such as density and mass velocity.
In the microscopic presentation, one can average the physical quantities in (1.5.2) either at
fixed time over a certain space domain or at fixed space over a certain time range. For instance
the number density u(t, x) is given by the number of vehicles N (t) which at the time t are in
[x − h, x + h], say
N (t)
u(t, x) ∼
= · (1.5.5)
2hnM
A similar reasoning can be applied to the mean velocity
N (t)
1 X
v(t, x) ∼
= Vi (t) , (1.5.6)
N (t)vM i=1
where, of course, the choice of the space interval is a critical problem and fluctuations may be
generated by different choices.
In the kinetic representation, macroscopic observable quantities can be obtained, under suit-
able integrability assumptions, as momenta of the distribution f , normalized with respect to
the maximum density nM so that all variables are given in a dimensionless form. Specifically,
the dimensionless local density is given by
Z 1+µ
ρ(t, x) = f (t, x, V ) dV , (1.5.7)
0
After the above preliminaries, we can now describe some specific models. Actually very simple
ones will be reported in what follows, essentially with tutorials aims. The first model is based
on the assumption that the dynamics of each test vehicle is determined by the nearest field
vehicle.
Example 1.5.1
Follow the Leader Microscopic Model
d2 x i
The basic idea of this model, see Klar et al. (1996), is that the acceleration (t + T ) of the
dt2
i − th vehicle at time t + T depends on the following quantities:
• The speed Vi (t) of the vehicle at time t,
• The relative speed of the vehicle and of its leading vehicle at time t: Vi−1 (t) − Vi (t),
• The distance between the vehicle and its leading vehicle at time t: xi−1 (t) − xi (t).
22
Hence the ordinary differential equation which describes the model is as follows:
where T is the reaction time of the driver, and a, m, c, are parameters to be fitted to specifical
situations.
The model which will be described in what follows was proposed by Prigogine and Hermann
(1971), and is based on the assumption that each driver, whatever its speed, has a program
in terms of a desired velocity which can be computed by suitable experiments. Specifically
fd = fd (V ) denotes, in what follows, the desired-velocity distribution function, meaning
that fd (V )dx dV gives the number of vehicles that, at time t and position x ∈ [x, x + dx], desire
to reach a velocity between V and V + dV .
Example 1.5.2
Prigogine Kinetic Model
Prigogine’s model describes the traffic flow according to the scheme:
∂f ∂f
+V = JP [f ] , (1.5.10)
∂t ∂x
JP [f ] = Jr [f ] + Ji [f ] , (1.5.11)
1
Jr [f ](t, x, V ) = (fd (V ) − f (t, x, V )) , (1.5.12)
Tr [f ]
with
u(t, x) ,
Tr [f ](t, x) = τ (1.5.13)
1 − u(t, x)
23
where τ is a constant. The relaxation time is smaller the smaller is the density; instead for
density approaching to the bumper condition, u → 1, this term grows indefinitely.
The term Ji is due to the interaction between a test (trailing) vehicle and its (field) heading
vehicle. It takes into account the changes of f (t, x, V ) caused by a braking of the test vehicle due
to an interaction with the heading vehicle: it contains a gain term, when the test vehicle has
velocity W > V , and a loss term, when the heading vehicle has velocity W < V . Moreover, Ji is
proportional to the probability P that a fast car passes a slower one; of course this probability
depends on the traffic conditions and so on the normalized density. Taking the above probability
defined by the local density yields:
Z 1+µ
Ji [f ](t, x, V ) = u(t, x)f (t, x, V ) (W − V )f (t, x, W ) dW . (1.5.14)
0
∂f ∂f 1 − u(t, x)
+V = (fd (V ) − f (t, x, V )) + u(t, x)
∂t ∂x τ u(t, x)
Z 1+µ
× f (t, x, V ) (W − V )f (t, x, W ) dW . (1.5.15)
0
Example 1.5.3
Macroscopic Models
The macroscopic model which follows is based on conservation of mass and linear momentum,
see Bellomo et al. (2002), which can be written as follows:
∂u ∂
+ (uv) = 0 ,
∂t ∂x
(1.5.16)
∂v + v ∂v = g[u, v] ,
∂t ∂x
where g defines the average acceleration referred to each particle. Square brackets are used
to indicate that the model of g may be a functional of the arguments. In practice it may
be not simply a function of the variables, but also of their first order derivatives. The word
acceleration is used, when dealing with traffic flow models, to avoid the use of the term force
for a system where the mass cannot be properly defined.
It is worth stressing that the above framework simply refers to mass and momentum conser-
vation, while energy is not taken into account. This choice is practically necessary for a system
where the individual behavior plays an important role on the overall behavior of the system.
24
As we have seen, models can be obtained by Eq. (1.5.16) with the addition of a phenomeno-
logical relation describing the psycho-mechanic action g = g[u, v] on the vehicles. For instance,
if one assumes (in dimensionless variables):
1 ∂p
g[u, v] ≡ g[u] = − , (1.5.17)
u ∂x
then, the following:
∂u ∂
+ (uv) = 0
∂t ∂x
(1.5.18)
∂v + v ∂v = − 1 ∂p ,
∂t ∂x u ∂x
is obtained, where p is the pressure. A possible relation is the equation of ideal gases: p = c uγ ,
where c is a constant (in isothermal conditions).
If, instead of (1.5.17), we assume the following:
1 ∂p 2 ∂2v ,
g[u, v] = − + (1.5.19)
u ∂x 3uRe ∂x2
we have
∂u ∂
∂t + ∂x (uv) = 0 ,
(1.5.20)
2
∂v + v ∂v = − 1 ∂p + 2 ∂ v .
∂t ∂x u ∂x 3uRe ∂x2
Here Re = V0 Lu0 /µ with V0 , L and u0 reference speed, length and density respectively, and
µ > 0 a material parameter called viscosity. Re is a positive constant called Reynolds number,
that gives a dimensionless measure of the (inverse of) viscosity. System (1.5.20) defines the
equations of motion of a viscous, compressible fluid in one spatial dimension. As it is obvious,
the presence or the absence of the viscous term leads to a change in the mathematical structure
of the equations, with consequences in the properties of the model.
Let us stress again that the above models have to be regarded as simple examples proposed
to show how different representation scales generate different models corresponding to the same
physical system.
The above sections have shown that the observation and representation at the microscopic
scale generates a class of models stated in terms of ordinary differential equations, while the
macroscopic representation generates a class of models stated in terms of partial differential
equations. In details, the following definitions can be given:
25
Example 1.6.1
Static Configurations of an Elastic Wire-Mass System
Consider, with reference to Figure 1.2.1, the mechanical system described in Example 1.2.1.
Equating to zero the left hand side term, i.e. the time derivatives, yields, in the linear case, the
following model:
u2 = 0 ,
(1.6.1)
− k u = 0·
1
m
26
u2 = 0 ,
(1.6.2)
− k1 u − c up = 0 .
1
m m 2
Example 1.6.2
Static Configurations of a Population Dynamics Model
The static configuration of the stochastic population dynamics model described in Example
1.3.5 are obtained equating to zero the left side term of Eq. (1.3.24)
Z 1 Z 1 Z 1
η(v, w)A(v → u |v, w)f (t, v)f (t, w) dv dw = f (t, u) η(u, v)f (t, v) dv . (1.6.3)
0 0 0
This chapter has been proposed as an introduction to modelling, classification and organi-
zation of mathematical models and equations. It has been stated that a deeper insight into
mathematical aspects can be effectively developed only if a well defined class of models (and
equations) is effectively specialized. Therefore, the above relatively deeper analysis is postponed
to the chapters which follow.
This section simply anticipates some topics and concepts which will be analyzed properly in
the chapters which follow. Specifically, we anticipate some ideas concerning model validation
and complexity problems in modelling.
Referring to model validation one can state, in general, that a model can be regarded valid
if it is able to provide information on the evolution of a real system sufficiently near to those
obtained by experiments on the real system. So far a conceivable modelling procedure needs
the development of the following steps:
• The real system is modelled by suitable evolution equations able to describe the evolution of
the dependent variables with respect to the independent ones.
• Mathematical problems are generated by linking to the model all conditions necessary for
its solution. These conditions should be generated by experimental measurements on the real
system.
27
• The above problems can be possibly solved and the output of the simulations is compared
with the experimental observations.
• If the distance (according to a concept to be properly defined in mathematical terms) between
the above simulations and experiments is less than a critical value fixed a priori, then the model
can be regarded valid, otherwise revisions and improvements are necessary.
Unfortunately, the concept of validity is not universal, but it refers to the circumstances
related to the above comparisons. Indeed, a model which is valid to describe certain phenomena,
may loose validity with reference to different phenomena. Therefore development of models
and their application needs a constant critical analysis which can go on following a systematic
analysis and improvements of each model.
Referring now to complexity problems in modelling, it is worth stating that this concept
can be applied to the real system, as well as to the model and to the mathematical problems. In
principle all systems of the real world are complex, considering that the number of real variables
suitable to describe each system may be extremely large, if not infinite. Once applied mathe-
maticians try to constrain the real system into a mathematical model, i.e. into a mathematical
equation, then a selection of the variables suitable to describe the state of the real system is
done.
In other words, every model reduces the complexity of the real system through a simplified
description by a finite number of variables. Enlarging the number of variables makes the
model virtually closer to the real system. On the other hand this enlargement may cause
complexity in modelling. In fact a large number of variables may need experiments to identify
the phenomenological models related to the material behavior of the system, which may require
high costs to be realized, and, in some cases, may be impossible.
However, suppose that the applied mathematician is able to design a model by a large number
of variables, then the related mathematical problem may become too difficult to be dealt with.
Technically, it may happen that the computational time to obtain a careful solution increases
exponentially with the number of variables. In some cases, mathematics may not even be able to
solve the above problems. The above concepts refer to complexity related to mathematical
problems. Once more, this is a critical aspects of modelling which involves a continuous
intellectual effort of applied mathematicians.
It is plain that the attempt to reduce complexity may fall in contrast with the needs posed
by validation. Let us anticipate some concepts related to validation of models. Essentially,
a validation process consists in the comparison between the prediction delivered by the model
and some experimental data available upon observation and measurement of the real system.
If this distance is “small”, then one may say that the model is valid. Otherwise it is not.
The above distance can be computed by a suitable norm of the difference between the variable
which defines the state of the model and the measurement obtained on the real system related
to the same variable. Of course, different norms have to be used according to the different
classes of models in connection to the different representation scales.
28
Let us critically focus on some aspects of the validation problems and their interplay with
complexity problems:
• The validation of a model is related to a certain experiment. Hence a validity statement holds
only in the case of the phenomena related to the experiment. In different physical conditions,
the model may become not valid.
• The evaluation of the distance between theoretical prediction and measurements needs the
selection of a certain norm which needs to be consistent not only with the analytic structure of
the model, but also with the data available by the measurements.
• The concept of small and large related to the evaluation of the deviations of the theoretical
prediction from the experimental data has to be related both to the size (in a suitable norm)
of the data, and to the type of approximation needed by the application of the model to the
analysis of real phenomena.
• Improving the accuracy (validity) of a model may be contrasted by the complexity problems
concerning both modelling and simulations. In some cases accuracy may be completely lost due
to errors related to complex computational problems
Mathematical modelling constantly supports the development of applied sciences with the
essential contribution of mathematical methods. In the past centuries, a systematic use of mod-
elling methods have generated classical equations of mathematical physics, namely equations
describing hydrodynamics, elasticity, electromagnetic phenomena etc. Nowadays, modelling
refers to complex systems and phenomena to contribute to the development of technological
sciences.
Mathematical models already contribute, and in perspective will be used more and more,
to the development of sciences directly related to quality of life, say, among others, biology,
medicine, earth sciences.
Modelling processes are developed through well defined methods so that it is correct to talk
about the science of mathematical modelling. The first stage of this complex process is
the observation of the physical system which has to be modelled. Observation also means orga-
nization of experiments suitable to provide quantitative information on the real system. Then
a mathematical model is generated by proper methods to deal with mathematical methods.
Generally a mathematical model is an evolution equation which can potentially describe the
evolution of some selected aspects of the real system. The description is obtained solving
mathematical problems generated by the application of the model to the description of real
physical behaviors. After simulations it is necessary to go back to experiments to validate the
model. As we shall see, problems are obtained linking the evolution equation to the so-called
initial and/or boundary conditions. Indeed, the simplest differential model cannot predict
the future if its behavior in the past and on the boundaries of the system are not defined.
The above procedure will be revisited all along these Lecture Notes and it will be particu-
larized with reference to specific models or class of models. However, simply with the aim of
introducing the reader to some aspects of the statement of problems and development of math-
29
ematical methods, the previously described Examples 1.3.1 and 1.2.2 will be revisited with
special attention to statement of problems and simulations.
Example 1.7.1
Simulations for the Elastic Wire-Mass Model
Consider the model proposed in Example 1.3.1 as it is described, in the nonlinear case, by
Eq. (1.3.3). Simulations should provide the evolution in time of the variables u1 = u1 (t) and
u2 = u2 (t).
It is plain that the above evolution can be determined from the initial state of the system:
In other words, different behaviors correspond to different initial states. A very simple way
(actually, as we shall discuss in Chapter 2, too simple) to obtain the above simulation consists
in developing a finite difference scheme organized as follows:
i) Consider the discretization of the time variable:
ii) Given the initial state (1.7.1), compute, with reference to Eq. (1.3.3), the state u11 = u1 (t1 )
and u21 = u2 (t1 ) by the following scheme:
u11 = u10 + u20 h ,
µ ¶ (1.7.3)
k c p
u21 = u20 + h − u10 − u .
m m 20
iii) Continue the above scheme at the step (i + 1) of the discretization by the following scheme:
u1(i+1) = u1i + u2i h ,
µ ¶ (1.7.4)
k c p
u2(i+1) = u2i + h − u1i − u2i ,
m m
Example 1.7.2
Simulations for Linear Heat Transfer Model
Consider the mathematical model proposed in Example 1.2.2 as it is described by Eq. (1.2.6).
Similarly to Example 1.7.1, simulations should provide the evolution in time and space of the
variable u = u(t, x). Also in this case, the above simulations can be developed by appropriate
mathematical methods, if additional information is given on the behavior of u at the initial
time and at the boundaries of the space domain.
Specifically, let us assume it is known the initial state of the system:
Different behaviors correspond to different initial and boundary states. A very simple (actu-
ally too simple as we shall discuss in Chapter 3) way to obtain the above simulation consists in
developing a finite difference scheme organized as follows:
i) Consider the discretization (1.7.2) of the time variable, and the following discretization for
the space variable:
ii) Compute, with reference to (1.2.4), the fluxes at the boundary of each tract (finite volume)
[xj , xj+1 ] by their approximate values qj+1 = qj+1 (t):
h0
qj+1 = − (uj+1 − uj ) , (1.7.8)
d
where uj = uj (t) = u(t, xj ) , ∀ j = 1, · · · n − 1
iii) Apply the following scheme at each time and for each volume corresponding to the space
discretization:
duj+1 ∼ 1 k0
=− (qj+1 − qj ) = 2 (uj+1 − 2uj + uj−1 ) . (1.7.9)
dt c0 d d
d uj+1
iv) Apply the time discretization to as in Example 1.7.1.
dt
The mathematical problem is stated linking conditions (1.7.5)–(1.7.6) to the mathematical
model, while the related mathematical method is developed in items i) – iv).
The reader should be aware that the above examples have been dealt with at an intuitive
and, may be, naive level and that the above topics have to be revisited in a deeper framework.
31
Then, at this stage, some simple remarks are proposed with the aim of pointing out some crucial
features of the modelling process, which will be specifically discussed in the various chapters
which follow with direct reference to particular models.
• A mathematical model, although approximating the physical reality, should not hide
relevant features. In particular, it should not hide nonlinear behaviors or nonlinear features of
the phenomena which is modelled.
• Analysis of mathematical models essentially means solution of mathematical problems
obtained by providing suitable initial and/or boundary conditions to the state equation. This
type of analysis needs the development of mathematical methods that can be organized for
classes of models and which may differ for each class. Mathematical methods, according to
what we have said in the preceding item, should be those of nonlinear analysis. Linearity
should be regarded as a particular situation.
• Generally, mathematical problems are not as mathematicians wish. In other words, real
situations are not such that existence, uniqueness, and regularity of the solution can be proved.
Often, mathematical problems are imposed by physical reality. In fact, it may often happen that
although some information on the solution is given, some features of the model (the parameters)
or of the mathematical problem (initial or boundary conditions) cannot be measured. Inverse
problems are almost always ill posed. On the other hand, it is plain that the solution of
inverse-type problems is of relevant importance in the construction of mathematical models.
• Physical systems sometimes show stochastic behaviors. In some situations, even if the
mathematical model is of a deterministic type, the related mathematical problem may be of
a stochastic type. In fact, initial or boundary conditions cannot be measured precisely and
this type of information may be affected by some stochastic noise. Stochastic behaviors in
mathematical models may be an unavoidable feature, and, consequently, suitable mathematical
methods need to be developed in order to deal with stochastic problems.
• The modelling process may be regarded as a sort of loop that might be interrupted when
there is a satisfactory agreement between simulation and observation of the phenomena.
• Modelling not only leads to a simulation of physical reality, but can also contribute to
a deeper understanding of physical systems. Indeed, after the simulation, the experimental
observation can be revisited and hopefully improved.
Part of the contents of a letter, Bellomo (1998), appeared on the review journal of the
American Society of Mechanical Engineers is reported as it summarizes some of the concepts
which have been reported above.
It is worth pointing out that modelling is a creative science, which requires observation, initia-
tive, and invention. Modelling motivates applied mathematics which, on the other hand, needs
to support modelling and contributes to address the invention along mathematically reasonable
paths. Further mathematical models can often contribute to a deeper understanding of physical
reality. Indeed, the construction of a mathematical model contributes to discover the organized
32
structures of physical systems. Moreover, the simulation can point out behaviors which have
not been, or even cannot, be observed.
Then one may state that mathematical modelling constantly supports the development of
technological and natural sciences by providing the essential contribution of the mathematical
methods.
An additional complexity problem is related to scaling. It may happen, in the case of systems
constituted by a large number of interacting elements that, although the dynamics of each
element is well understood, the collective behavior of the whole system is not properly described
by the sum of the dynamics of each element. The complexity source is that collective behaviors
follow a dynamics totally different from that of the behavior of a few entities. It is not simply
a matter of selection of the proper scale, while models should take into account the fact that
in large systems, the various elements do not behave in the same way and individual behaviors
can play a relevant role in the overall evolution of the whole system.
This observation introduces the concept of complexity of living systems, and hence the
challenging problem of modeling living systems. This topic is dealt with in Chapter 4, however
some preliminary observations can be anticipated for large systems of interacting individuals.
• Living systems have the ability to express specific strategies which depends on that of the
other interacting individuals.
• This ability is heterogeneously distributed among individuals, while interactions are generally
not additive.
• Living systems operate out of equilibrium and have the ability to extract energy from the
surrounding environment for their own benefit.
Although the above remarks do not cover the whole variety of complexity features of living
systems, a preliminary analysis can be referred to the example of models we have presented in
this chapter.
• All models of this chapter involve, with exception of Models 1.3.5 and 1.5.2, a description of
the state of the system by deterministic variables.
• All models have been derived by a causality principle, namely by a relation between “cause”
and “effect”. This relation is deterministic for most models besides, again, 1.3.5 and 1.5.2.
• Focusing on deterministic models, it can be observed that the causality principle is based
on rigorous relations either conservation or equilibrium equations. On the other hand, the
derivation of the model needs phenomenological models concerning the material behavior of
the system under consideration.
• The afore–said characteristics are generally related to experiments which are valid in steady
conditions, while the model is supposed to operate far from these special conditions.
This latter remark focus on one of the weakness of the modeling approach, which occurs
whenever the closure of equilibrium and conservation equations is obtained by heuristic mate-
rial models that are no supported by a robust theory. Due to this motivation, the definition
mathematical models is used. On the other hand, the use of models of mathematical
33
physics can be used when conservation and/or equilibrium equations are closed by a relation
derived by a theory of mathematical physics.
Very different appears to be the panorama offered by models of living systems, where the
causality principle involves random variables. Moreover, the phenomenological model used to
close the equilibrium and/or conservation equations may, in some cases, be stated in probability.
In fact, the heterogeneous distribution of the ability to express a certain strategy requires the
use of random variables and stochastic models for the interactions.
This specific characteristics and the modeling difficulty suggest to design explorative models
rather than predictive ones. More precisely:
- Predictive models are supposed to depict the evolution of systems once the parameters of the
model have been properly identified.
- Explorative models are supposed to depict the scenario of all conceivable evolutions corre-
sponding to all admissible values of the parameters.
The latter class of models is very important in the case of living systems, where one may be
interested to understand the behavior of the system corresponding to different interaction rules
involving the entities composing the system itself.
However, the two examples that have been presented in this Section have to be considered very
simple examples that need to be criticized. For instance, by the fact that interactions are linearly
additive, while living systems generally undergo non-linear interactions; as an example, focusing
on the vehicle traffic model, the passing probability does not depend by binary interactions only,
but by the distribution of the vehicles in the whole interaction domains.
Therefore, the challenging topic we are talking about has to be treated within a well defined
mathematical approach, possibly a mathematical theory, different from that widely used in this
chapter. Accordingly, the modeling approach to living systems will be widely treated in the
last chapter of this monograph.
34
35
Chapter 2
2.1 Introduction
This chapter deals with modeling and related mathematical problems within the framework
of the microscopic observation scale. Modeling at the microscopic scale means, according to the
classification of models and equations proposed in Chapter 1, deriving an evolution equation
for each element constituting the system where each element is treated as a whole. Generally,
the model is stated in terms of a system of ordinary differential equations.
More precisely, if m is the dimension of the state variable which defines the state of each
element and n the number of elements, then the dimension of the system is n × m. Equations
are generally linked by the mutual interactions of the elements belonging to the system and by
external actions on the elements of the system.
Microscopic modeling can be developed when the number of elements is somehow small
(in some sense to be properly specified). Otherwise, when their number becomes large, the
computational complexity to handle a large system of equations may involve difficulties which
cannot be technically dealt with. Actually, technical difficulties do not refer to the derivation of
the model, but to the computational treatment of a system of equations too large to be handled:
specifically the computational time may grow exponentially with the number of equations. In
this case a different way of modeling can be developed in order to reduce complexity.
This chapter deals with models and problems which do not yet generate the above type
of computational complexity. This delicate matter cannot be hidden, being aware that
reducing complexity is one of the challenging aspects of the science of mathematical modeling.
However the real complexity problem is the modelling of nonlinear interactions rather than the
approach based on assumption of linear superposition already discussed in Chapter 1.
After the above preliminaries, the contents and organization of this chapter can be described.
Specifically, after this introduction:
– Section 2.2 deals with modeling methods at the microscopic scale. As we shall see, although
36
it is not possible to assess a unique procedure to derive a model, still some general rules can be
identified. Some specific examples will show how models can be derived within the mathematical
framework of ordinary differential equations.
– Section 2.3 presents a classification of ordinary differential equations, and the mathematical
statement of the problems, which is obtained by linking the evolution equation to the conditions
that are necessary to its solution. If these conditions are given corresponding to the initial value
of the independent variable, say t = t0 , one has an initial-value problem, also called Cauchy
problem. Otherwise, if these quantities are given at both ends of the range of the independent
variable, say t = t0 and t = t1 , one has a boundary-value problem.
– Section 2.4 deals with the representation of solutions to mathematical problems described in
Section 2.3. Then an introduction to discretization methods is proposed to obtain quantitative
results, that are the simulations related to the solution of the above problems. This topic is
related both to initial and boundary value problems. The contents have to be regarded as an
introduction to computational methods. The reader will be introduced to topics which need to
be properly revisited by means of the essential contribution of the pertinent literature.
– Section 2.5 is concerned with the qualitative analysis of dynamical systems; in particular with
stability definitions followed by methods to study linear and nonlinear stability.
– Section 2.6 deals with regular and singular perturbation methods developed to analyze the
qualitative and quantitative behavior of solutions to initial value problems for finite models.
– Section 2.7 provides a brief introduction to the problem of bifurcation, analyzing the de-
pendence of the equilibrium points and of their stability properties on the parameters of the
model.
– Section 2.8 develops a critical analysis on the contents of this chapter focused, as already men-
tioned, on modelling and computational problems, looking ahead to modelling and simulation
of living systems.
In general, it is not reasonable to assess only one method to derive models at the micro-
scopic scale. However some common lines can be identified. Therefore a conceivable modeling
procedure will be here described and applied in the derivation of some specific models.
Let us consider the following sequential steps which may be followed for the derivation of
finite mathematical models:
Step 1. Phenomenological observation of the physical system which needs to be modeled.
Step 2. Identification of the various elements composing the system and of the dependent
(state) variables suitable to describe the state of each element, and hence of the whole system.
Step 3. Modeling the causes which generate the dynamics and identification of the cause-
effect relationships, as well as of the conservation equations (when they exist) of quantities
related to the state of the system.
37
Step 4. Derivation of the mathematical model, i.e. an evolution equation for the dependent
variables, by exploiting the above relationships.
When the system is a mechanical one, within the framework of classical mechanics, then the
derivation of the model follows methods of Newtonian and Lagrangian mechanics, as docu-
mented in the book Bellomo et al. (2000). The examples which follow will show how the above
steps can be practically developed in the modeling systems of real world. Specifically, the first
system is related to technology, the second one to economical sciences and marketing, and the
third one to populations dynamics. The four aforesaid Steps 1–4 will be explicitly related to
the derivation of each model.
Once the model has been derived, the analysis and the application of the model follows by
stating and solving mathematical problems generated by the applications.
Example 2.2.1
Linear Electric Circuits
Consider the standard circuit with resistor R, inductance L, and capacitor C, in series, as shown
in Figure 2.2.2, where it is also shown that the circuit is subject to the voltage E. The model
is expected to describe the flow of the current and its derivation can be developed through the
steps which follow:
Step 1. The phenomenological observation of the system indicates that it is an electric
circuit: the electromotive force E = E(t) may vary in time, while R, L, and C are constant
parameters.
Step 2. The system is composed of one only element: the circuit, the state of the system
may be described either by the charge q (to be measured in Coulombs), or by the intensity of
the current i = dq/dt (to be measured in Ampère).
Step 3. The relationship between cause and effect can be described by Kirchoff’s law (ac-
tually a phenomenological model) which states that the sum of the tension drops in each
element equals the electromotive force E(t). Moreover, it is assumed that Ohm’s model is valid:
the tension drop in the resistor is V (t) = R i(t), with R constant.
Step 4. The evolution model is obtained exploiting the above relationship, which, considering
that all parameters are constant, writes:
di q
L + Ri + = E(t) . (2.2.1)
dt C
38
Considering that i is the derivative with respect to time of the charge q, the following second
order model is derived:
d2 q R dq q E(t) ,
2
+ + = (2.2.2)
dt L dt LC L
which can be written as a system of first order equations using the following variables:
dq du1 ,
u1 = q , u2 = =
dt dt
1 , R, 1
α=− β=− γ= ·
LC L L
Remark 2.2.1. The derivation of the model (2.2.3), is obtained by particularizing the general
procedure described at the beginning of this section and visualized in Fig. 2.2.1.
Example 2.2.2
Modeling Marketing Dynamics
Consider the interaction within an economical system constituted by potential buyers with
the users of brand names in a closed marketing environment. Modeling the above system
should provide the evolution in time of the numbers of the above interacting subjects. The
mathematical model can be derived through the following steps:
Step 1. The system is constituted by potential buyers with the users of brand names in a
closed marketing environment. Interactions modify their number.
Step 2. The state of system is supposed to be described by the number u1 of potential
buyers and the number u2 of users of brand names.
Step 3. The relationship between cause and effect can be described by a simple phenomeno-
logical model such that u1 grows with time at a constant rate, namely the growth is linearly
depending on u2 . On the other hand such a growth is contrasted by the interaction between u1
with up2 , with p > 1, due to saturation phenomena. Similarly u2 grows due to the interaction
between u1 with uq2 , with q > 1. On the other hand such a growth is contrasted by u2 due to
saturation of offer in the market.
Step 4. The model is obtained exploiting the above modeling of the cause and equating the
cause to the effect, that is the time derivative of u1 and u2 . The results is as follows:
du1
= α − βu1 up2 + γu2 ,
dt
(2.2.4)
du2 = βu1 uq2 − δu2 ,
dt
where α defines the natural rate of growth of buyers, β refers to the decrease of buyers due to
their contact with brand names, while brand names increase the rate of growth of buyers as
ruled by the parameter γ. The parameter β is again used for their increase due to the contact
with buyers, while δ refers to decay of brand names.
Example 2.2.3
Population Dynamics
Consider a system of n interacting populations fully isolated with respect to the outer envi-
ronment. Modeling the above system means obtaining the evolution in time of the number
of individuals of the above interacting populations. The mathematical model can be derived
through the following steps:
Step 1. The system is fully isolated, i.e., it is assumed that there is not interaction with the
outer environment. Moreover, one assumes that the state of the system is simply described by
the number ui of individuals of each ith population.
40
Step 2. The cause–effect relationship is assessed by two phenomenological aspects. The first
aspect states that the population growth of the ith species linearly depends upon the number
of the individuals belonging to the said species. A second aspect states that the interaction
between individuals of different species produces a decrease proportional to the product of
individuals of the interacting species.
Step 3. The mathematical model is simply obtained equating the cause:
n
X
ai ui − ui bij uj , (2.2.5)
j=1
to the effect:
Xn
dui
= ai ui − ui bij uj · (2.2.6)
dt j=1
The above three simple models in Examples 2.2.1–2.2.3 show some technical differences. The
first model refers to one element only, the second one to a system of two elements, while the
number of elements in the third one may be arbitrarily large. All models need to be rewritten
in terms of dimensionless variables after a proper selection of reference quantities as indicated
in Chapter 1. The derivation of the first model is based on principles of physics, more precisely
models of material behaviour of circuits subject to electrical potential. The second and third
model are simply related to a phenomenological interpretation of real systems. Surely the
derivation is based on naive ideas that can be critically analyzed in the last section of this
chapter.
More precisely, one may observe that the first Example 2.2.1 describes the interactions be-
tween two objects, while the model 2.2.3 is concerned with the interactions among n popula-
tions. Actually, the model is meaningful if n is small. Indeed, when the number of populations
becomes large, interactions between pair of individuals may be affected by the presence of a
third population. In other words, the coefficients are no longer constant, but may depend on
the state of the system. This is definitely an origin of complexity to be properly analyzed
Section 2.2 has shown that mathematical models can be derived at the microscopic scale in
order to describe the time evolution of a finite dimensional state variable: u = u(t), where
u is the set of dependent variables u = (u1 , . . . , ui , . . . , un ) defined over the domain [t0 , t1 ] of
the independent variable t.
This section deals with the classification of microscopic models which are structured in terms
of ordinary differential equations, which will be assumed to be always written in dimensionless
41
form. The classification refers to models where the independent variable is the time t. However,
it holds for any type of independent variable and it is based on the structure of the mathematical
equations.
Specifically, let us consider first systems of equations written in the commonly called normal
form:
du1
= f1 (t, u1 , . . . , un ) ,
dt
.. (2.3.1)
.
dun = fn (t, u1 , . . . , un ) ,
dt
which can be written, with obvious meaning of notation, in the compact vector form:
du
= f (t, u) . (2.3.2)
dt
Another interesting class of models is the one which refers to higher-order scalar equations:
µ ¶
dn u du , , dn−1 u
=f t, u, ··· · (2.3.3)
dtn dt dtn−1
The above equation can be rewritten in terms of a normal system of n equations in n unknowns
by the change of variables:
du dn−1 u
u1 = u , u2 = , . . . , un = n−1 , (2.3.4)
dt dt
du1
= u2 ,
dt
du2
= u3 ,
dt (2.3.5)
..
.
du
n = f (t, u1 , . . . , un ) .
dt
This means, referring specifically to the class of equations represented in Eq. (2.3.1), that the
following classification can be given.
42
Linear Models
A system of ordinary differential equations (the model) is considered linear if it can
be written as follows:
du1
= a11 (t)u1 + · · · + a1n (t)un + b1 (t) ,
dt
.. (2.3.6)
.
dun = an1 (t)u1 + · · · + ann (t)un + bn (t) ,
dt
or in a compact form:
du
= A(t)u + b(t) , (2.3.7)
dt
where A(t) is the n × n matrix of the coefficients aij (t), and b(t) is a column vector
with n components.
du
= A(t)u . (2.3.8)
dt
Nonlinear Models
A system of ordinary differential equations (the model) is considered nonlinear if
the terms fi are nonlinear functions of the dependent variables u.
43
du
= A(t)u + b(t) + ε f (t, u) , (2.3.10)
dt
A further aspect which plays an important role in modeling is the presence of time.
Autonomous Models
A system of ordinary differential equations (the model) is considered autonomous
if the time t does not explicitly appear as an argument of f . Otherwise, the model is
called nonautonomous.
For instance, in Eq. (2.3.9) A and b are, respectively, a constant matrix and a constant
vector, so the system is autonomous. On the other hand, mathematical models presenting time
dependent forcing terms are nonautonomous.
Remark 2.3.2. Linear homogeneous systems can be written in the following form:
du
= L(u) , (2.3.11)
dt
where L(u) is a linear operator, i.e.
where λ is a real constant. On the other hand, nonlinear systems can be written as follow:
du
= N (u) , (2.3.13)
dt
where N does not satisfy condition (2.3.12).
In some very general cases models may have even a structure of the type of the type:
µ ¶
du , , dn u
f t, u, ··· = 0· (2.3.14)
dt dtn
In this case, writing the model in normal form as indicated in Eq. (2.3.1) may be hard or
even impossible. Actually, the above very particular cases are not dealt with in these Lecture
Notes.
Let us now consider mathematical problems which are generated by the analysis of real
systems. These problems are stated by joining the evolution equation with the conditions
necessary for its solution. Conditions for the solutions of problems are called initial and limit
conditions which may be defined as follows:
44
Initial conditions for all components of u define the state of the system before the evolution
starts, while limit conditions define constraint on the final configuration that is assumed by the
system.
Linking the evolution equation to initial and/or limit conditions generates the following math-
ematical problems:
Initial-value problem
The initial-value problem for Eq. (2.3.1) is obtained by coupling the system of n
evolution equations with n initial conditions ui0 , for i = 1, . . . , n:
du = f (t, u) ,
dt (2.3.17)
u(t0 ) = u0 .
One may look at the initial-value problem as presented by the input–output system shown
in Figure 2.3.1. The output is the dynamical response, while the mechanical model is
contained in the box. If the model is subject to an external field, the block representation also
includes the term b as one of the inputs, as shown in Figure 2.3.2, while the mechanical model
is still contained in the box. The forcing term is related to the physical situation characterizing
the interactions between the inner and outer system.
45
It should be noted that a system of n ordinary differential equations needs n initial conditions
– one per state variable – in order to be transformed into an initial-value problem. If some of the
initial conditions are replaced with limit conditions, then one has a boundary-value problem. In
this case, it is not necessary to have a condition per state variable. Indeed, the problem can be
stated with two types of conditions, namely both initial and limit conditions on a component,
and no condition at all on the other one. This situation is described in the following definition:
Boundary-value problem
The boundary-value problem for Eqs. (2.3.1) is obtained by linking the system of
n evolution equations to p < n initial conditions:
The above conditions can be replaced by linear or nonlinear combinations of the boundary
conditions.
The formal application of the above concepts to the statement of problems for some of the
models described in Section 2.2 is dealt with in the following examples.
Example 2.3.1
Initial Value Problems for Marketing Models
Consider the mathematical model described through Example 2.2.2. The initial value problem
is obtained linking the values that the variables u1 and u2 attain at the initial time t = t0 . The
46
problems is as follows:
du
1
= α − βu1 up2 + γu2 ,
dt
du2
= βu1 uq2 − δu2 ,
dt (2.3.20)
u1 (t0 ) = u10 ,
u2 (t0 ) = u20 ,
where u10 and u20 are given constants.
Example 2.3.2
Boundary Value Problems for a Population Model
Consider the mathematical model described through Example 2.2.3 for a two population dy-
namics. A boundary value problem is obtained for example linking the values that the variable
u1 attain at the initial time t = t0 and the one that u2 attain at the final time t = t1 . The
problem is as follows:
du
1
= a1 u1 − u1 (b11 u1 + b12 u2 ) ,
dt
du2
= a2 u2 − u2 (b21 u1 + b22 u2 ) ,
dt (2.3.21)
u1 (t0 ) = u10 ,
u2 (t1 ) = u21 .
Section 2.3 has shown in that once a model has been properly derived, its practical application
to the analysis of the behavior of real system needs the statement of a mathematical problem and
the related technical solution. It follows that mathematical methods for ordinary differential
equations should be developed to obtain quantitative simulations.
The development of analytic methods does not generally give sufficiently operative tools to
deal with mathematical problems. In fact, the presence of nonlinearity in the model often
(or almost always) requires the development of computational schemes and possibly scientific
software for the application of solution algorithms.
Several books are available in the literature which deal with analytic and/or computational
methods. For instance, and among others, one may consider the book Marasco and Romano
47
(2001), into which, after the definition of the proper analytic background concerning existence
of solutions, stability properties, bifurcation analysis, various computational schemes are devel-
oped with the aid of the Software Mathematica° c . A similar line is followed in the previously
cited book Bellomo et al. (2000), which is mainly oriented on the analysis of models of classical
mechanics. The interested reader is addressed to the above literature to obtain a relatively
more complete background on analytic and computational methods.
This section provides a concise description of solution schemes by analytic and discretization
methods. Completeness is not claimed, the aim being simply to show how it is possible to
obtain quantitative results from simulation of a large variety of class of problems. In details,
this section is organized through four subsections which deal with the following topics:
i) existence of solutions to initial value problems;
ii) analytic methods for linear initial value problems;
iii) discretization schemes for initial value problems;
iv) boundary value problems.
Lipschitz condition
A vector function f (t, u) satisfies a Lipschitz condition in a region D of the (t, u)-
space if there exists a constant L (called Lipschitz constant), such that, for any
(t, u) and (t, v) in D one has:
° °
°f (t, u) − f (t, v)° ≤ L ku − vk . (2.4.2)
where both terms on the left-hand side of (2.4.2) are referred to the same instant of
time.
The Lipschitz condition is a property related both to continuity and differentiability. Indeed,
it can be proved that if f (t, u) is defined in a bounded, closed, and convex domain D, and if
the partial derivatives of f with respect to ui exist with:
¯ ¯
¯ ∂fi ¯
max sup ¯¯ ¯≤M, (2.4.3)
i,j=1,...,n (t,u)∈D∂uj ¯
√
3
f (u) = u2
is continuous, however it does not satisfies the Lipschitz condition in any region containing the
origin. On the other hand, if f (t, u) is differentiable to respect to u, for all times, then it satisfies
the Lipschitz condition, but the opposite it is not true, as it is shown by |u| that satisfies the
Lipschitz condition with constant L = 1 though it is not differentiable in u = 0.
Functions like:
do not satisfy the Lipschitz condition in IR. In fact, roughly speaking, their derivative grows
without bound as u goes to infinity. However, they satisfy the Lipschitz condition in any closed
bounded interval with Lipschitz constant equal to the maximum of the absolute value of the
derivative in the interval.
Let us now consider, after the above preliminary definitions, the qualitative analysis of the
initial-value problem with initial time denoted by t0 . Existence and uniqueness of solutions is
stated by the following two theorems:
49
© ª
R = (t, u) : ku − u0 k ≤ K , |t − t0 | ≤ T , (2.4.4)
then there exists at least one solution to the initial-value problem (2.3.17) and it is of
class C 1 for |t − t0 | ≤ Tb, where:
½ ¾
K °
b
T = min T, and M = max kf (t, u)° .
M (t,u)∈R
ku(t) − u0 k ≤ M Tb . (2.4.5)
Notice that the Lipschitz condition is not needed to assure the existence of a solution of the
initial-value problem. Instead, it is essential in the uniqueness proof. Actually, Theorem 2.1
can be slightly improved, mainly by specializing the proofs to particular cases. For instance,
existence results can be obtained for f (t, u) with a limited number of finite discontinuities.
Consider now the problem of the continuous dependence on the initial data, and let u b
and ue be the two solutions of the two initial-value problem (2.3.17) with initial data u(t0 ) = u
b0
e
and u(t0 ) = u0 , respectively.
° ° ° °
°u e (t)° ≤ °u
b (t) − u e 0 °eL|t−t0 | ,
b0 − u (2.4.6)
du = f (t, u) ,
du = f ∗ (t, u) ,
dt and dt (2.4.7)
u(t0 ) = u0 , u(t0 ) = u∗0 ,
with f and f ∗ defined and continuous in a common domain D.
50
° °
°f (t, u) − f ∗ (t, u)° ≤ ε , ∀ (t, u) ∈ D , (2.4.8)
then
° ° ° ° h i
°u(t) − u∗ (t)° ≤ °u0 − u∗0 °eL|t−t0 | + ε eL|t−t0 | − 1 , (2.4.9)
L
where u(t) and u∗ (t) are the solutions of the two initial-value problems defined in (2.4.7).
Although the above analysis answers the questions posed at the beginning of this section
and can be applied to most of the initial-value problems occurring in nature, an important
question still remains unanswered: how large is the domain of the solution? The existence
results shown above assure existence and uniqueness of the solution which may, however, lead
to useless estimates on the existence interval.
Of course, the domain might be larger than the estimate given by the theorems, possibly
extending to all t ≥ t0 . Then one needs criteria to determine the largest possible domain of
existence. This question gives rise to the class of what is known as the extension theorems.
Namely, extensions of the solution can be developed if one can prove that the norm of the
solution does not grow too fast. This type of analysis is often based on a deep understanding
of the qualitative properties of the system. For instance boundness properties can be recovered
by a qualitative analysis of energy conservation. Of course, one needs to prove that the a priori
estimates hold true in the same function space where the local existence theorem has been
stated. This type of analysis can be recovered in the classical literature on ordinary differential
equations, see for example Pontriaguine (1975).
In general, analytic solutions can be sought for in the case of linear systems of ordinary
differential equations. The qualitative analysis of linear systems is widely dealt with in the
literature and can be approached in several ways. Some general results can be summarized
here:
du
= A(t)u + b(t) , (2.4.10)
dt
with continuous coefficients on a time interval [0, T ] satisfies the Lipschitz condition with
n
X
L= max |Aij (t)| . (2.4.11)
t∈I
i,j=1
51
with Aij (t) and bi (t) defined and continuous for |t−t0 | ≤ T , has a unique solution for |t−t0 | ≤ T ,
by Theorems 2.1–2.2.
Remark 2.4.3. If u1 (t), . . . , um (t) are m solutions of the homogeneous linear system:
du
= A(t)u , u ∈ IRn , (2.4.13)
dt
m
X
Ci ui (t) , C1 , . . . , Cm ∈ IR (2.4.14)
i=1
n
X
u(t) = bi ui (t) ,
C (2.4.15)
i=1
b1 , . . . , C
for suitable C bn ∈ IR.
is solved by imposing the n initial conditions to u(t), given by (2.4.15), to find the values of the
constants Cb1 , . . . , C
bn .
A set of independent solutions for the homogeneous problem is called a fundamental set
and it can be ordered into a matrix:
u11 (t) ··· u1n (t)
.. .. ..
U(t) = . . . (2.4.17)
un1 (t) ··· unn (t)
52
n
X
u(t) = e (t) ,
Ci ui (t) + u (2.4.18)
i=1
for suitable coefficients C1 , . . . , Cn ∈ IR, where {u1 (t), . . . , un (t)} is a fundamental set
for the homogeneous system (2.4.13) associated to (2.4.10), and u e is a solution of
Eq. (2.4.10).
Summarizing the results stated in the above Theorems, the solution of a linear initial value
problem (2.4.12) is obtained according to the following steps:
Step 1. Find a fundamental set {u1 (t), . . . , un (t)} for the homogeneous system (2.4.13) asso-
ciated to (2.4.10).
where U is the fundamental matrix defined by the functions obtained in Step 1, which amounts
to choosing a constant of integration.
Step 3. Impose the n initial conditions to (2.4.18) to find the values of the coefficients
C1 , . . . , C n .
du
= Au , u ∈ IRn , (2.4.20)
dt
it is possible to give a general procedure to complete Step 1, and Step 2 can be developed in a
simplified way. In fact, one can look for solutions in the form:
(A − λI)v = 0 . (2.4.22)
53
Hence, they have to be an eigenvalue and the corresponding eigenvector of the matrix A. De-
tailed calculations differ technically according to the fact that the number of distinct eigenvalues
differs from n or is the same. In particular
• If A has n linearly independent eigenvectors v1 , . . . , vn corresponding to the eigenvalues
λ1 , . . . , λn (which need not all be distinct and may be complex), then the solution of (2.4.20)
is:
A complex conjugate pair of eigenvalues λr ± iλi and eigenvectors vr ± ivi gives rise to a
solution:
u = (Cr + iCi )(vr + ivi )e(λr +iλi )t + (Cr − iCi )(vr − ivi )e(λr −iλi )t
h¡ ¢ ¡ ¢ i
= 2eλr t Cr vr − Ci vi cos(λi t) − Cr vi + Ci vr sin(λi t) . (2.4.24)
This matter has to be dealt with differently if there are eigenvalues with multiplicity r > 1
and p < r corresponding eigenvectors. Without entering into details, we consider just the
following particular case:
• If A has n − 2 distinct eigenvalues λ1 , . . . , λn−2 with corresponding distinct eigenvectors
v1 , . . . , vn−2 and a double eigenvalue λn−1 = λn = λ corresponding to a single eigenvector v,
the solution of (2.4.20) is as follows:
h i
u = C1 v1 eλ1 t + · · · + Cn−2 vn−2 eλn−2 t + Cn−1 v + Cn (w + vt) eλt , (2.4.25)
dn u du
an + · · · + a1 + a0 u = 0 (2.4.26)
dtn dt
is somewhat simpler since the procedure can be reduced to the solution of what is commonly
called the characteristic equation:
an λn + · · · + a1 λ + a0 = 0 . (2.4.27)
A real root λ corresponds to the solution (c0 + · · · + cµ tµ )eλt , where µ is the multiplicity of
λ. A complex conjugate pair, λ+ = λr + iλi , and λ− = λr − iλi , corresponds to the solution:
h i
eλr t (c0 + . . . + cµ tµ ) cos λi t + (d0 + . . . + dµ tµ ) sin λi t , (2.4.28)
It = {t0 , . . . , ti , . . . , tn = T } , (2.4.29)
and by approximating the solution in the points of the discretization by suitable algorithms.
This section provides a brief survey to deal with the above discretization scheme. The analysis
is in two steps. We first introduce the concepts of accuracy and stability.
In order to understand these concepts, consider the simplest conceivable method for solving
initial-value problems, the forward Euler method. If one knows the state variable at time ti and
wants to compute its value at time ti+1 = ti + h, the simplest idea consists in approximating
the solution of:
du = f (t, u) ,
dt (2.4.30)
u(ti ) = ui .
with:
Since:
¡ ¢ du
f ti , u(ti ) = (ti ) , (2.4.32)
dt
the above discretization corresponds to a Taylor expansion of the solution u stopped at first
order, which corresponds somewhat to following the direction of the tangent to the solution in
ti , as shown in a scalar case in Figure 2.4.1.
u
ui+1
ui
h
ti ti+1 t
Truncation Error
The truncation error Terr is defined as the norm of the difference between the
solution to the differential equation and the numerical solution divided by the time
step used in the numerical scheme.
In the case of forward Euler method it can be proved, under suitable regularity assumptions,
that for each component:
h d2 u
Terr ∼
=− (ξ) , ξ ∈ (ti , ti+1 ) , (2.4.33)
2 dt2
Accuracy
If the truncation error goes like hp , the method is pth order, which means that
halving the step divides the error by 2p . This is what is generally meant by accuracy
of the method.
From (2.4.33), for the forward Euler method the error goes like h and one has a first order
method.
Another important requirement of a numerical scheme is that the time step has to be chosen
so that the scheme is absolutely stable:
Absolute Stability
A numerical scheme is absolutely stable in a point ah of the complex plane if a
sequence {ui } generated by the method applied to the model:
du
= au , (2.4.34)
dt
Stability Region
The stability region is the set of points ah in the complex plane for which the
method is absolutely stable
It can be proved that each scheme is characterized by a stability region, which can be used
to determine a condition on the time step to be used in the integration. The procedure to be
used is the following:
Step 1. Starting from a nonlinear model, consider its linearized form:
du
= Au , (2.4.35)
dt
µ ¶
rd h
3 order : ui+1 = ui + K1 + 4K2 + K3 , (2.4.38)
6
where
K1 = f (ti , ui ) ,
µ ¶
h h
K2 = f ti + , ui + K1 , (2.4.39)
2 2
¡ ¢
K3 = f ti + h, ui + h(2K2 − K1 ) .
µ ¶
th h
4 order : ui+1 = ui + K1 + 2K2 + 2K3 + K4 , (2.4.40)
6
where
K1 = f (ti , ui ) ,
µ ¶
h h
K2 = f t i + , u i + K1 ,
2 2
µ ¶ (2.4.41)
h h
K3 = f t i + , u i + K2 ,
2 2
K4 = f (ti + h, ui + hK3 ) .
57
Recalling that the initial-value problem with initial condition u(ti ) = ui can be written in
integral form as:
Z t
¡ ¢
u(t) = ui + f s, u(s) ds , (2.4.42)
ti
and, in particular,
Z ti+1 ¡ ¢
u(ti+1 ) = uti + f s, u(s) ds , (2.4.43)
ti
it can be observed that both the third-order and the fourth-order methods are closely related
to Simpson’s rule applied to the integrals in (2.4.43).
The stability regions of the second-order method has an ellipsoidal shape, while that of the
third-order and fourth-order methods are bean-like shaped.
Third-order and fourth-order Runge–Kutta methods are recommended in dealing with prob-
lems for which the spectrum is not available, since they include both a part of the imaginary
axis and a part of the negative real axis. The disadvantage of these schemes is that several
evaluations of f have to be performed per time step and therefore if the computation of f
is very heavy, this method may demand an excessive amount of labor per step and becomes
inconvenient.
Runge–Kutta methods have the property of being self-starting as well as adaptive. In other
words, the time step can be changed at any moment according to an estimate of the local
error. Unfortunately, it is not trivial to get the above estimate of time step. In this case, to
be practical, the time step has then to be decreased if the difference between the two values
(over h) is larger than a specified maximum tolerance and can be increased if this difference is
smaller than the minimum required tolerance. Of course, this checking is time consuming.
where
Remark 2.4.5. When the existence of the solution of a linear boundary value problem is
assured (for example from Theorem 2.7), we can apply to the problem the results of Theorems
2.5–2.6. Then, imposing the initial and/or boundary conditions we get the solution of the
problem.
In order to clarify the importance of the existence and uniqueness issues related to boundary-
value problems, consider the following example.
Example 2.4.1
Nonuniqueness and Buckling of a Rod
Consider a simple mass-spring system governed by the ordinary differential equation:
d2 u
+ Ω2 u = 0 , (2.4.46)
dt2
du
u(0) = u0 and (0) = u00 , (2.4.48)
dt
uT − u0 cos(ΩT )
u(t) = u0 cos(Ωt) + sin(Ωt) . (2.4.51)
sin(ΩT )
if uT = (−1)h u0 .
The existence theorem considered above assures existence and uniqueness only if:
√
2 2 π
T < < . (2.4.53)
Ω Ω
Using the following argument this result can be used to explain the buckling of a rod. Consider
a rod of length T subject at its ends to a compressive load F , as shown in Figure 2.4.2. The
linear theory of elasticity tells us that for small deformations the angle of deflection u of the rod
can be found by solving (2.4.46) with Ω2 = F/(EI), where E is Young’s modulus of elasticity
for the rod, and I is the moment of inertia of the cross section of the rod about its axis.
Figure 2.4.2 — Buckling of a rod: (a) small loads, (b) large loads
The previous discussion can then be reprocessed in the following terms: if T < π/Ω, that is
if F < EIπ 2 /T 2 , then there is no deflection, but as soon as F reaches the critical compressive
load (first buckling load) F = EIπ 2 /T 2 , the solution is not unique and no longer small, which
also indicates a failure of the linear theory of elasticity to model the situation.
60
Equilibrium points
Consider the autonomous system of ordinary differential equations:
du
= f (u) . (2.5.1)
dt
f (ue ) = 0 (2.5.2)
are called the equilibrium points. If u(t0 ) = ue , then u(t) = ue for all times.
It is trivial to verify that if an equilibrium point is occupied initially, it is maintained for all
times.
The next problem is to decide whether an initially perturbed system gradually returns back to
the equilibrium point, or at least remains next to it, or wanders away. The following definitions
are given:
° °
°u(0) − ue ° < δ(ε) , (2.5.3)
° °
lim °u(t) − ue ° = 0 . (2.5.5)
t→+∞
The largest possible De is called the basin of attraction of ue and is denoted in the
following by Be .
The definition of stability has a local meaning. Indeed, if we want that the system remains,
within a given tolerance, near ue , the initial condition has to be sufficiently close to ue . In
general an equilibrium position can be stable to small perturbations, but unstable to large ones.
The aim of this subsection is to provide suitable methods for linear stability analysis of equi-
librium points. The simplest approach to establish the stability properties of an equilibrium
configuration is to investigate what happens if the perturbation is very small. The stability
condition we will provide is often referred to as the linear stability criterion.
Consider then the autonomous system (2.5.1) and the expansion of f in a Taylor series about
the equilibrium state, which can be performed under suitable regularity assumptions on f . In
this way it is possible to approximate each component of fi (u), i = 1, . . . , n as follows:
³ ´
fi (u) ≈ fi (ue ) + (u − ue ) · ∇fi (ue ) + o |u − ue | , (2.5.6)
where by the definition of equilibrium point fi (ue ) = 0. Therefore for infinitesimal perturbations
v = u − ue about the equilibrium state, one can neglect the other terms in the expansion, and
the differential system (2.5.1) can be approximated by its linearized form:
dv
= J(ue )v , (2.5.7)
dt
62
Equation (2.5.7) is a linear differential system with constant coefficients which can be solved
using the analytic methods presented in Section 2.4.
It is possible to relate these reasoning on the linearized system (2.5.7) to the behavior of the
nonlinear system via the following classical linearized stability criterion.
It is crucial to remark that this theorem guarantees the existence of a sufficiently small
neighborhood Ie of ue such that if u(t0 ) ∈ Be , then u(t) tends to ue , but does not give
an algorithm for the actual computation of the basin of attraction De . For this reason, this
criterion is also named stability with respect to infinitesimal perturbations. This is
certainly a limit to the utility of the above theorem for the applications. However, it is useful for
instability analysis. Indeed, if ue is linearly unstable, i.e., unstable with respect to infinitesimal
perturbations, then it is also unstable to larger perturbations.
If ue is asymptotically stable, then (2.5.7) can be solved to describe the evolution of the
system for suitably small initial values.
The above concepts can be specialized in the case of two-dimensional dynamic systems such
that the Jacobian defined in (2.5.8) is a 2×2 square matrix. In this case some particular dynamic
responses can be classified and their qualitative behavior can be visualized. In particular, we
consider the following two cases:
• If λ1 and λ2 are both real eigenvalues, then, if λ1 6= λ2 the solution of (2.5.7) can be written
as Eq. (2.4.23), which reduces to:
where C1 and C2 are integration constants, and v1 and v2 are the eigenvectors of J.
63
Figure 2.5.1 — Qualitative behavior of the dynamic response: (a) stable, (b) unstable node.
and this equilibrium point is called a stable node. On the other hand, if one of the eigenvalue
is positive, it follows that:
The equilibrium point is called an unstable node if both eigenvalues are positive and a
saddle point otherwise. In this latter case, two of the trajectories meet at the equilibrium
point and two depart from it. They are therefore called stable and unstable manifolds.
Figure 2.5.2 — Qualitative behavior of the dynamic response: (a) stable and (b) unstable focus.
• If the eigenvalues are complex conjugate, i.e., λ1 = λ + iω, λ2 = λ − iω, the solution to the
64
initial-value problem (2.5.7) can be written as in Eq. (2.4.24) by a linear combination of sine
and cosine functions.
In particular, if:
λ = <e(λ1 ) = <e(λ2 ) = 0 ,
then v(t) stays bounded and the equilibrium state is a center point.
The qualitative behavior of the solution in the case of stable and unstable node is represented
in Figure 2.5.1. The qualitative behavior of the solution in the case of stable and unstable focus
is represented in Figure 2.5.2.
Figure 2.5.3 — Qualitative behavior of the dynamic response: (a) saddle and (b) center point.
Finally, the qualitative behavior of the solution in the case of saddle and center point is
represented in Figure 2.5.3.
If λ1 = λ2 , the discussion gives the same results though (2.5.9) cannot be the solution of
(2.5.7) as shown in (2.4.23).
Note that for n = 2 the eigenvalues of J are:
r
tr J (tr J)2
λ= ± − det J , (2.5.14)
2 4
where tr J = J11 + J22 , and have both negative real part if and only if:
Example 2.5.1
Stability for the van der Pol Model
As an example, consider the van der Pol model which can for instance model the behavior of
some nonlinear electric circuits. The mathematical model can be written as follows:
di
= v,
dt
(2.5.16)
dv = (α − βi2 )v − γi ,
dt
(
v = 0,
(2.5.17)
(α − βi2 )v + γi = 0 ,
that is i = v = 0, which, according to what has just been stated, is linearly stable if α is
negative, as the Jacobian:
µ ¶
0 1
J(i = v = 0) = (2.5.18)
−γ α
has always positive determinant and trace equal to α. Hence, the equilibrium point is stable if
α < 0. More in details, the eigenvalues of the Jacobian are:
1h p i
λ= α ± α2 − 4γ , (2.5.19)
2
√ √
which are complex conjugate if |α| < 2 γ, and real if |α| ≥ 2 γ.
The stability results can be summarized in the following table:
Figure 2.5.4 — Stability analysis for the van der Pol Model
66
Liapunov function
du
Let ue be an isolated equilibrium of the system of equation = f (u) in some open
dt
neighborhood De of ue . A function V = V (u) is called a Liapunov function if it
satisfies the following properties:
i) It is continuous in De and differentiable in De − {ue };
ii) It has a local minimum in ue , i.e., for instance:
n n
dV ¡ ¢ X ∂V dui X ∂V ¡ ¢
u(t) = = fi u(t) ≤ 0 . (2.5.20)
dt i=1
∂ui dt i=1
∂ui
instance, in the case of systems of two equations, let consider the quadratic function:
with a > b2 , that is always nonnegative. Then one has to check that the sign of:
1 dV ¡ ¢ ¡ ¢
= av1 + bv2 f1 (v1 , v2 ) + bv1 + v2 f2 (v1 , v2 ) (2.5.22)
2 dt
be negative in some region near v1 = v2 = 0 for some values of a and b with a > b2 .
Another method is to look for V as a quadratic form V = v · Av with A such that:
JT A + AJ = −I , (2.5.23)
Example 2.5.2
Stability through Liapunov Method
Consider the system:
du1
= −u1 + u2 ,
dt
(2.5.24)
du2 = u2 + 2u1 u2 ,
dt
which has u1 = u2 = 0 and u1 = u2 = −1/2 as equilibrium points. In order to check the
nonlinear stability of u1 = u2 = −1/2, it is useful to introduce the perturbation (v1 , v2 ) defined
by:
1 1
u1 = − + v1 , u 2 = − + v2 . (2.5.25)
2 2
Eq. (2.5.24) then rewrites in terms of the new variables as:
dv1
= −v1 + v2 ,
dt
(2.5.26)
dv2
= −v1 + 2v1 v2 .
dt
68
1
1 −
2
A= (2.5.28)
1 3
−
2 2
has both positive eigenvalues.
The computation of dV /dt yields:
dV dv1 dv2
= (2v1 − v2 ) + (3v2 − v1 ) = −v12 − v22 + 6v1 v22 − 2v12 v2 . (2.5.29)
dt dt dt
As we have already seen, mathematical models must always be written in terms of suitable di-
mensionless variables. This technical manipulation may generate dimensionless numbers which
are small with respect to unity, while the independent and dependent variables are of the order
of unity. Almost always, these parameters have a well defined physical meaning which should
be related to the solution of mathematical problems (or vice versa).
This section provides a brief introduction to regular and singular perturbation methods by
means of simple examples where the dependent variable is a scalar or a two dimension vector.
We first define a perturbed model. Let ε be a positive number which is small with respect
to unity.
69
is called regularly perturbed if f (t, u, ε) and u0 (ε) depend smoothly on their argu-
ments.
The above definitions are given for a scalar problem. Their generalization to systems of
equations requires technical additional calculations..
As it is shown in the examples which follow, the parameter ε (in a regularly perturbed
problem) is linked to the presence in the model of nearly negligible physical phenomena, while
in a singularly perturbed problem the parameter ε is generally related to the scaling and to the
dimensional analysis of the variables.
Different methods have to be developed for each of the above classes of models, as it will
be shown in the next subsections. The starting point is the idea of looking for an asymptotic
expansion, in the sense of Poincaré, for ε → 0, uniformly valid in an interval [t0 , T ], for the
solution u(t, ε) of the perturbed problem, and we write:
∞
X
u(t, ε) ∼ ui (t)εi as ε → 0. (2.6.3)
i=0
This means that there exists a positive constant ε0 and that ∀ N ≥ 0, there exist functions
{ui (t)}N
i=0 defined in [t0 , T ], and a positive constant KN such that:
¯ u(t, ε) − PN u (t)εi ¯
¯ i=0 i ¯
¯ ¯ < KN , (2.6.4)
εN +1
N
X
u(t, ε) = ui (t)εi + O(εN +1 ) as ε → 0, (2.6.5)
i=0
70
uniformly valid for t ∈ [t0 , T ]. The series on the right-hand side of (2.6.3) is called asymptot-
ically convergent, uniformly in [t0 , T ]. The concepts of convergent series and asymptotically
convergent series are quite different: we refer to De Jager and Furu (1996) for a discussion
about convergence versus asymptotic convergence.
Starting from the above preliminaries it is shown in the next subsection how to deal with the
relatively simpler case of regular perturbations. Subsequently the case of singular perturbed
problems is considered.
Due to the smoothness of f (t, u, ε) and u0 (ε) with respect to their arguments, we espect that
the solution u(t, ε) of (2.6.1) depends smoothly on t and ε and that it tends to the solution
u(0) (t) of the limiting problem (2.6.6), as ε → 0, uniformly with respect to t ∈ [t0 , T ].
Following De Jager and Furu (1996), let us assume that f (t, u, ε), defined on IR2 × [0, 1], has
the following expansion:
∞
X
f (t, u, ε) ∼ f (i) (t, u)εi as ε → 0, (2.6.7)
i=0
uniformly valid in any bounded domain of IR2 , with the functions f (i) (t, u) infinitely differen-
tiable in IR2 , and assume that:
∞
X
u0 (ε) ∼ u0i εi . (2.6.8)
i=0
Then, let us look for an asymptotic expansion for the solution of problem (2.6.1):
∞
X
u(t, ε) ∼ u(i) (t)εi , (2.6.9)
i=0
∞
X ³ X ∞ ´
f (t, u(t, ε), ε) ∼ εi f (i) t, u(j) (t)εj
i=0 j=0
∞
X n ∂f (i) X∞
∼ εi f (i) (t, u(0) (t)) + (t, u(0) (t)) u(j) (t)εj
i=0
∂u j=1
71
1 ∂ 2 f (i) ³X
∞ ´2 o
(0) (j) j
+ (t, u (t)) u (t)ε + · · ·
2 ∂u2 j=1
n ∂f (0)
=f (0) (t, u(0) (t)) + ε (t, u(0) (t))u(1) (t)
∂u
o n ∂f (0)
+ f (1) (t, u(0) (t)) + ε2 (t, u(0) (t))u(2) (t)
∂u
1 ∂ 2 f (0) (0) (1) 2 ∂f (1)
+ (t, u (t))(u (t)) + (t, u(0) (t))u(1) (t)
2 ∂u2 ∂u
o n ∂f (0)
+ f (2) (t, u(0) (t)) + · · · + εp (t, u(0) (t))u(p) (t)
∂u
o
+ f˜(p) (t, u(0) (t), u(1) (t), · · · , u(p−1) (t)) + · · · , (2.6.10)
with f˜(p) are known functions depending only on (t, u(0) , u(1) , · · · , u(p−1) ).
Step 2. Consider from Step 1, that:
X ∞ n ∂f (0)
(0) (0)
f (t, u(t, ε), ε) ∼ f (t, u (t)) + εi (t, u(0) (t))u(i) (t)
i=1
∂u
o
+ f˜(i) (t, u(0) (t), u(1) (t), · · · , u(i−1) (t)) , (2.6.11)
then substituting (2.6.11), (2.6.9) and (2.6.8) into the corresponding terms of problem (2.6.1),
and equating the same powers of ε, the following systems are obtained:
(0)
du = f (0) (t, u(0) ) ,
dt (2.6.12)
(0)
u (t0 ) = u00 ,
and
(i) (0)
du = ∂f (t, u(0) ) u(i) + f˜(i−1) (t, u(0) , u(1) · · · , u(i−1) ) ,
dt ∂u (2.6.13)
(i)
u (t0 ) = u0i ,
for i = 1, 2, . . . .
Step 3. Applying to problem (2.6.12) the existence and uniqueness Theorems 2.1-2.2, we
get the existence of a unique solution u(0) (t) of (2.6.12), for t ∈ [t0 , T ]. Recalling that f (0) (t, u)
is infinitely differentiable, the solution u(0) is C ∞ . Moreover, referring to subsection 2.4.2, the
solutions of the linear differential systems (2.6.13) in the same interval [t0 , T ] is obtained.
Step 4. The procedure has to be completed showing that the formal expansion (2.6.9) for
u(t, ε), where the terms u(i) (t) are the solutions of systems (2.6.12) and (2.6.13) obtained in
72
Step 3, is an asymptotic expansion for u(t, ε). This means that, writing ∀N ≥ 0,
N
X
u(t, ε) = u(i) (t)εi + RN (t, ε) , (2.6.14)
i=0
uniformly in t ∈ [t0 , T ]. The reader is addressed to O’Malley (1991) or De Jager and Furu
(1996) for the details of this proof.
Remark 2.6.1. The precedure described in Step 1 - Step 4 refers to the scalar problem (2.6.1).
In the general case of a system of equations, the method of looking for an asymptotic expansion
for the solution of the system can also be applied, as it is shown in the following example from
Humi and Miller (1988).
Example 2.6.1
∞
X ∞
X
u(t, ε) ∼ u(i) (t)εi , v(t, ε) ∼ v (i) (t)εi . (2.6.17)
i=0 i=0
We may observe that (2.6.16) is symmetric in u, v, as well as (2.6.18) and (2.6.19) are sym-
metric in u(0) , v (0) and u(1) , v (1) respectively. Solving the linear systems (2.6.18) and (2.6.19)
yields:
u(0) (t) = v (0) (t) = e−t , u(1) (t) = v (1) (t) = e−t − e−2t , (2.6.20)
Example 2.6.2
Duffing’s Equation - Part I
Let consider the following system related to the motion of a mass attached to a nonlinear spring
(hard spring):
2
d u 3
dt2 + u + εu = 0 ,
¯ (2.6.22)
du ¯¯
u(0) = a , = 0 ,
dt ¯t=0
where ε ∈ (0, 1), which can be considered as a regular perturbation of
2
d u
dt2 + u = 0 ,
¯ (2.6.23)
du ¯¯
u(0) = a , = 0.
dt ¯t=0
du
= v,
dt
dv (2.6.24)
= −u − εu3 ,
dt
u(0) = a , v(0) = 0 .
The power series expansion method gives at order i = 0 and i = 1 the following systems:
d2 u(0)
+ u(0) = 0 ,
dt2
¯ (2.6.25)
(0)
du(0) ¯¯
u (0) = a , = 0,
dt ¯t=0
74
and
d2 u(1) (1) (0) 3
dt2 + u = −(u ) ,
¯ (2.6.26)
(1) du(1) ¯¯
u (0) = 0 , = 0,
dt ¯t=0
where (2.6.25) is solved by u(0) (t) = a cos t , so that system (2.6.26) for u(1) can be written as
follows:
µ ¶
d2 u(1) (1) 3 1 3
dt2 + u = −a cos 3t + cos t ,
4 4
¯ (2.6.27)
(1)
du(1) ¯¯
u (0) = 0 , = 0.
dt ¯t=0
Its solution is:
· µ ¶ ¸
(1) 3 1 3
u (t) = a cos 3t − cos t − t sin t , (2.6.28)
32 8
so that the solution of (2.6.22) can be written as follows:
· µ ¶ ¸
1 3
u(t, ε) = a cos t + εa3 cos 3t − cos t − t sin t + O(ε2 ) , (2.6.29)
32 8
Remark 2.6.2. It is interesting to remark that we do not get any answer to the question of
how big can be the deviation of the solution of problem (2.6.1) from the solution of problem
(2.6.6) on a large time interval. The following simple examples show that this deviation may
even be not small.
Example 2.6.3
Let consider the scalar system, from Mishchenko and Rozov (1980),
du = (u + ε)2 ,
dt (2.6.30)
u(0) = 0 ,
Example 2.6.4
Lindstedt-Poincaré Algorithms
Coming back to Example 2.6.2, we see that expression (2.6.29) is a reasonable approximation
of the solution for bounded time intervals but not for t ∈ [0, ∞). In fact due to the term
−(3/8) t sin t in (2.6.29), when t = O(1/ε) the second term of the expansion is no longer O(ε)
and when t = O(1/ε2 ) the second term blows up.
Let us notice here that the term −(3/8) t sin t in (2.6.29) is due to the presence of the term
−(3/4)a3 cos t in (2.6.27) (the so–called resonance effect). On the other hand, (2.6.22) has
only bounded (and periodic) solutions, as it describes a conservative system with the energy
integral:
µ ¶2
du ε ε
+ u2 + u4 = a2 + a4 , (2.6.32)
dt 2 2
which implies u and du/dt are bounded for t ∈ [0, ∞).
This suggests to use a different expansion of the solution in which unbounded terms do not
appear: a possible approach is the Lindstedt-Poincaré algorithm.
Let s be a new variable defined as follows:
t = s ω(ε) , (2.6.33)
where
System (2.6.22) can now be written in the unknown u(s) = u(t(s)) as:
2
d u 2 2 3
ds2 + (1 + εω1 + ε ω2 + · · ·) (u + εu ) = 0 ,
¯ (2.6.35)
du ¯¯
u(0) = a , = 0.
ds ¯s=0
The above procedure can be applied to this problem looking for an asymptotic power series
expansions for the solution of the type:
∞
X
u(s, ε) ∼ u(i) (s)εi . (2.6.36)
i=0
The power series expansion method gives, respectively to the order i = 0, 1, 2, the following
systems:
d2 u(0)
+ u(0) = 0 ,
ds2
¯ (2.6.37)
(0)
du(0) ¯¯
u (0) = a , = 0,
ds ¯ s=0
76
d2 u(1) (1) (0) 3 (0)
ds2 + u = −(u ) − 2ω1 u ,
¯ (2.6.38)
(1)
du(1) ¯¯
u (0) = 0 , = 0,
ds ¯s=0
2 (2)
d u
+ u(2) = −3(u(0) )2 u(1) − 2ω1 (u(1) + (u(0) )3 )
dt2
− (ω1 2 + 2ω2 )u(0) , (2.6.39)
¯
du(2) ¯¯
u (0) = 0 ,
(2)
= 0.
ds ¯s=0
The solution of (2.6.37) is u(0) (s) = a cos s, so that system (2.6.38) for u(1) can be rewritten
as follows:
³3 ´
d2 u(1) (1) 1 3 2
+ u = − a cos(3s) − a + 2ω1 a cos s ,
ds2 4 4
¯ (2.6.40)
(1)
du(1) ¯¯
u (0) = 0 , = 0,
ds ¯ s=0
so that, we can choose ω1 such that:
3 2
a + 2ω1 = 0 , (2.6.41)
4
in order to avoid the resonance effect. Therefore, the following bounded solution of (2.6.37) is
obtained:
1 3
u(1) (s) = a (cos 3s − cos s) . (2.6.42)
32
Consequently,
1 3
u(s, ε) = a cos s + a (cos 3s − cos s)ε + O(ε2 ) , (2.6.43)
32
that is uniformly valid in any finite s interval, with the coordinate s given by:
³ 3 ´−1
s = t ω(ε)−1 = t 1 − a2 ε + · · · . (2.6.44)
8
Finally the solution of (2.6.22) can be written as follows:
ε 3
u(t, ε) = a cos(Ωt) + a (cos(3Ωt) − cos(Ωt)) + O(ε2 ) , (2.6.45)
32
where
³ 3 ´−1 3
−1
Ω = ω(ε) = 1 − a2 ε + · · · = 1 + a2 ε + O(ε2 ) . (2.6.46)
8 8
77
Example 2.6.5
Initial Layer Effect - First Order Problem
Let us consider the problem:
ε du + u = 1 ,
dt (2.6.47)
u(0) = u0 ,
which has the unique solution:
t
u(t, ε) = 1 + (u0 − 1)e− ε , ∀t > 0. (2.6.48)
that proves that the convergence of u(t, ε) to 1, for ε → 0+ , is not uniform in any right-
neighbourhood of t = 0.
Remark 2.6.3. The solution (2.6.48) of system (2.6.47) is the sum of two terms: the first one,
the outer or bulk solution, is an asymptotic solution of the differential equation for t > 0,
and the second one, the initial layer solution, is a function of what is called stretched time
variable:
t
τ= · (2.6.50)
ε
As τ → +∞, the initial layer solution is decreasing to zero. The stretched time can be considered
as a rescaling parameter which magnifies the region of non-uniform convergence.
Example 2.6.6
Initial Layer Effect - Second Order Problem
Let us now consider the following nonlinear second order boundary value problem:
2 ³ ´2
ε d u = du ,
dt2 dt (2.6.51)
u(0) = 1 , u(1) = 0 ,
78
which shows that the convergence u(t, ε) → 0 for ε → 0 is uniform to respect to t on each closed
subinterval of (0, 1], but not in [0, 1].
Different solution techniques can be applied to solve and to overcome the problem of non-
uniform convergence both for singularly perturbed initial value problems and for singularly
perturbed boundary value problems. Here we limit ourselves to show in the following example
of singularly perturbed boundary value problem how an asymptotic expansion technique can
be applied.
Example 2.6.8
Friedrichs’ Problem
Consider the following second order linear boundary value problem:
2
ε d u + du + u = 0 ,
dt2 dt (2.6.54)
u(0) = a , u(1) = b ,
∞
X ∞
X
U (t, ε) ∼ U (i) (t)εi , ũ(τ, ε) ∼ ũ(i) (τ )εi . (2.6.57)
i=0 i=0
79
and assume that ũ(i) and its derivatives tend to zero as τ → +∞, providing the desired result
that the correction term ũ has only a significant value in a neighbourhood of t = 0. The outer
solution U has to solve:
2
ε d U + dU + U = 0 ,
dt2 dt (2.6.58)
U (1, ε) = b .
The expansion gives the following first order linear problems defined inductively:
dU (0)
+ U (0) = 0 , U (0) (1) = b , (2.6.59)
dt
dU (i) d2 U (i−1)
+ U (i) = − , U (i) (1) = 0 i = 1, 2, . . . . (2.6.60)
dt dt2
Solving the related problems gives:
which shows that for taking into account the initial condition, the initial layer correction ũ must
be considered. The problem for ũ is the following linear homogeneous one:
d2 ũ dũ
+ + εũ = 0 , ũ(0, ε) = a − U (0, ε) . (2.6.63)
dτ 2 dτ
dũ(0)
+ ũ(0) = 0 , ũ(0) (0) = a − U (0) (0) = a − be . (2.6.64)
dτ
So we have shown that Problem (2.6.54) has the following unique asymptotic solution:
³t´ t
u(t, ε) = U (0) (t) + ũ(0) + O(ε) = b e1−t + (a − be) e− ε + O(ε) . (2.6.66)
ε
1
It is worth stressing that u(0, ε) = a, and for ε > 0 we have u(1, ε) = b + (a − be) e− ε and
u(1, ε) → b as ε → 0+ .
80
The algorithm shown in Example 2.6.8 can also be applied to the initial value problem for
an autonomous system of differential equations in the form:
du
ε = f (u, v) ,
dt
dv (2.6.67)
= g(u, v) ,
dt
u(0) = µ(ε) , v(0) = η(ε) ,
where u and v can be scalar or vector functions of t, and the dimensions of u and v can be
different.
Implementing such an algorithm from a numerical point of view, as reported in Mika and
Palczewski (1991), means to solve a number of equations equal to the order of approximation
plus one. In this paper, the authors propose a different algorithm which involves only one
differential equation for a single function. This algorithm is here briefly illustrated in the
following example, while the reader is addressed to the original paper for the details.
Example 2.6.9
Mika and Palczewski’s Algorithm
Consider the boundary value problem:
2
d u du 2
ε dt2 + dt + u = 0 ,
¯ (2.6.68)
du ¯¯
u(0) = η , = µ,
dt ¯t=0
The algorithm leads, as first order approximation, to solve the following system:
ε dw = −w2 − 2εw2 ,
dt (2.6.70)
w(0) = η + ε(µ + η 2 ) ,
and it is possible to prove that the accuracy obtained in this new algorithm is similar to that
for the first order standard algorithm and much better than the zero order standard algorithm.
81
This section focus on how the equilibrium configurations of a dynamical system usually depend
on the parameters which characterize the model. In particular, it may occur that as one
of the parameters crosses a critical value, the solution will tend toward another equilibrium
configuration. The aim of this Section is to give some hints for the study of the bifurcation of
a dynamical system without claiming any completeness on the treatment of the subject. For a
detailed treatment of the subject, refer for instance to the book Bellomo et al. (2000).
Consider a mathematical model described by a scalar autonomous ordinary differential equa-
tion depending on a parameter α:
du
= f (u; α) . (2.7.1)
dt
The equilibrium configurations are determined by solving the algebraic equation:
f (u; α) = 0 . (2.7.2)
This equation is usually nonlinear and for each value of α it can be solved by a certain number
of values of u, or possibly by no value at all. The number of solutions of (2.7.2) cannot be
determined a priori and generally changes with α. As will be shown later in this section, the
values of α for which the number of solutions of (2.7.2) changes are particularly important in
the stability analysis.
Then, one can represent by dots in the (u, α)-plane, the solutions of (2.7.2) obtained for each
fixed value of α. If α varies continuously, the solution of (2.7.2) defines a certain number of
curves, each of which can be locally described by an equation u = ue (α), implicitly defined by
(2.7.2), i.e., such that f (ue (α); α) = 0.
If one is able to determine the stability property of each equilibrium configuration, then
this piece of information can also be reported on the diagrams using the following classical
convention:
• Stable equilibrium configurations are identified by a heavy solid line;
• Unstable equilibrium configurations are identified by a dashed line.
The figures obtained by drawing these curves are called stability diagrams, (or bifurcation
diagrams, branching diagrams, response diagrams), and each of the curves appearing in the
diagram is called a branch.
The generalization of the above considerations, given for the scalar case, to systems of equa-
tions is technical.
The following definitions are recalled:
Branch points
¡ ¢
The values α, ue (α) where the number of equilibrium solutions changes are called
branch points.
82
Bifurcation points
An equilibrium solution bifurcates from another at α = αb if there are two distinct
branches ûe (α) and ǔe (α) continuous in α, such that ûe (αb ) = ǔe (αb ). The common
¡ ¢ ¡ ¢
value αb , ûe (αb ) = αb , ǔe (αb ) in the (α, u)-space is called a bifurcation point.
Not all branch points are, however, bifurcation points. That is, not all changes in the number
of equilibrium solutions are due to the intersection of curves in the stability diagram. Bi-
furcation points can be further classified, e.g. pitchfork bifurcation, supercritical, subcritical
and transcritical bifurcations and Hopf bifurcations, referring again to the book Bellomo et al.
(2000) for details. The following definitions can be reported:
Pitchfork bifurcation
If one of the two branches intersecting at the bifurcation point (αb , ub ) is one sided
(i.e., defined only for α ≥ αb or for α ≤ αb ), then the bifurcation is of the pitchfork
type.
Transcritical bifurcations
A bifurcation is transcritical if on both sides of the bifurcation point there are
locally two equilibrium solutions.
In many applications it may happen that, still departing from an unstable configuration,
the system will neither go to another stationary configuration nor wander away, but will start
oscillating approaching a periodic orbit. The oscillations described here are also referred to in
mechanics as self-sustained oscillations, to distinguish them from those which are due to
oscillatory forcing terms. One can then state the following definition:
Limit cycle
An orbit u(t) tends to a closed curve Γ called the limit cycle if there exists a period
T > 0 such that the sequence of points u(t + nT ) tends to a point of Γ as the integer
n goes to infinity.
A cycle Γ is asymptotically stable if there exists a neighborhood UΓ of Γ such that if
u(0) ∈ UΓ , then u(t) tends to Γ.
The relevance of limit cycles is basically related to the problem of how time-periodic behaviors
may arise from the bifurcation of a steady state. This is an interesting problem in the case of
systems of equations, i.e. in the case of dimension more than one. In fact, for a scalar problem
(2.7.1), it is impossible to have a nonconstant periodic solution.
83
The behavior of a dynamical system may change, according to the value assumed by a pa-
rameter α and the solution will tend to a time periodic orbit, as a critical value αb is crossed.
In this case, the bifurcation is called the Hopf bifurcation.
The identification of self-sustained oscillations is far more difficult than that of equilibrium
points. As a consequence, recognizing the existence of a Hopf bifurcation can be a hard task.
In this respect, the following theorem, known as the Hopf Theorem, is often very helpful.
λ(αb ) = ± iΩ , (2.7.3a)
and all the other eigenvalues have a negative real part and, furthermore, for α close to
αb :
dµ
λ(α) = µ(α) ± iω(α) , with (α = αb ) > 0 . (2.7.3b)
dα
Therefore for α sufficiently near the critical value αb , there exists a limit cycle with initial
period:
2π
T = · (2.7.3c)
Ω
If, in addition, the equilibrium configuration u(α = αb ) is locally asymptotically stable,
then the limit cycle is stable.
The theorem above uses again the eigenvalues of the Jacobian and the knowledge of the
stability properties which are not too hard to verify. In particular, it pays attention to what
happens to the first eigenvalue which will have positive real part as the bifurcation value is
crossed.
We already know that at criticality. i.e. for α equal to the critical value αb , the real part
of an eigenvalue or of a pair of eigenvalues vanishes and becomes positive as the bifurcation
value is passed by, while that of the others is negative. The theorem states that if also the
imaginary part vanishes, i.e., the eigenvalue is zero, then the bifurcation involves stationary
configurations. If, instead, there is a pair of purely imaginary eigenvalues, then the bifurcation
involves self-sustained oscillations. This situation is represented in Figure 2.7.1. In particular,
in Figure 2.7.1b a pair of eigenvalues of J crosses the imaginary axis from left to right away
from the origin as α, increasing from stable values to unstable ones, crosses criticality.
Generally, the Hopf theorem does not tell whether a stable (supercritical) limit cycle exists for
α > αb or an unstable (subcritical) limit cycle exists for α < αb , inside of which all orbits spiral
toward ue and outside of which the orbits diverge. In order to know whether the limit cycle is
84
Figure 2.7.1 — Behavior of the critical eigenvalues as α increases from stable values to unstable
ones in the case of (a) stationary bifurcation and (b) Hopf bifurcation.
stable or unstable, it must be proved that ue is locally asymptotically stable at criticality. This
information can be achieved by using the Liapunov method. In fact, linear stability criteria
are useless because at criticality the eigenvalues are purely imaginary. This precludes any
conclusion on the character of the nonlinear system. This second part is, however, certainly
not simple.
These concepts are now applied to the following example.
Example 2.7.1
Limit Cycle for the van der Pol Model
Consider the van der Pol equation already encountered in Example 2.5.1:
d2 u du
− (α − βu2 ) + u = 0, (2.7.4)
dt2 dt
√
α± α2 − 4
λ= ·
2
dµ 1
λ(α = 0) = ± i 6= 0 and (α = 0) = > 0 .
dα 2
From Hopf theorem, then, there exists a limit cycle. To establish if the limit cycle is subcritical
(unstable) or supercritical (stable), one has to verify that u = 0 is locally asymptotically stable
for α = 0.
85
µ ¶2
2 du
By using V = u + as a Liapunov function, one can compute for α = 0,
dt
µ ¶2
dV du
= −βu2 , (2.7.5)
dt dt
An introduction to mathematical modelling at the microscopic scale has been given in this
chapter focusing on models stated in terms of ordinary differential equations. It has been
shown that the analysis of nonlinear systems needs to be approached using both analytic and
computational schemes. Analytic methods can provide existence results and some information
on the qualitative behavior of the solution, for instance stability properties, asymptotic be-
havior, bifurcations, while computational schemes integrate the information by visualizing and
completing what found by analytic methods.
Generally, none of the two approaches, namely analytic and computational, is sufficient to
deliver a complete information, while a sinergetic use can be successful in providing a picture,
as complete as possible of the solution patterns. Simulations, based on computational schemes,
have to be focussed on completing, and possible enlarging, what is delivered (and not delivered)
by the analytic qualitative analysis.
Let us analyze some modeling and complexity issues related to the contents of this chapter.
Finite models, i.e. at the microscopic scale, consider each system as a whole. It is worth
reasoning about some implications of the above approximation of physical reality. In fact, this
approximation can be, in some cases, not acceptable when the number of elements constituting
the overall system under consideration is large, namely the number of ordinary differential
equations to be dealt with is too high for the qualitative analysis, as well as for computational
treatment.
Moreover, let us critically consider that all models we have seen in this chapter need pa-
rameters related to the material behavior of the system under consideration. For instance,
elasticity characteristics of spring, coefficients of competitive and/or cooperative models, and
so on. Generally, these parameters are measured by empirical data obtained near equilibrium,
while their values are needed to describe the evolution of the system far from equilibrium.
Finally, let us observe that most of the models presented in this chapter belong to the inert
matter. Therefore they do not take into account the ability of living systems to extract energy
from the outer environment and live out of equilibrium. Moreover, the deterministic causality
principles, which are valid for models of the inert matter, should be replaced by relations where
the effect is related to the cause only in some probabilistic sense.
86
This aspect is even more difficult to treat in the case of large systems where, as in most living
systems, interactions are not linearly additive. This chapter has skipped over all aforesaid
problems which will be treated later in Chapter 4, by mathematical tools that appear to be
appropriate to take into account all topics outlined in this section.
87
Chapter 3
3.1 Introduction
This chapter deals with the derivation and analysis of mathematical models derived at the
macroscopic scale. These models, according to the definitions given in Chapter 1, provide
a description of real systems by suitable evolution equations for locally averaged quantities
which can be called macroscopic observable quantities. Of course, all physical systems
are constituted by several interacting elements. However, the local averaging is an acceptable
approximation when a small volume, in terms to be still precisely defined, contains a sufficiently
large number of elements.
The above approximation can be called continuous matter assumption, which states that
given two points belonging to the system, however close each other, some matter is included
between these points. A typical example is the flow of fluid particles regarded as a continuum
system. Rather than modelling the dynamics of each single particle and then averaging to obtain
gross quantities such as mass density and linear momentum, the macroscopic approach provides
directly, as we shall see, the evolution in time and space of the above macroscopic quantities.
This approach was already analyzed in Chapter 1, where a few introductory examples have
also been proposed. Therefore we simply recall that the derivation of equations in continuum
physics has been a fundamental chapter in the history of mathematical physics and, in general,
in the history of sciences.
Models of continuum physics, e.g. fluid dynamics, elasticity, electromagnetism fields, have
been derived at the macroscopic scale under the continuum matter approximation, and are
nowadays the fundamental background of the mathematical models of natural, applied, and
technological sciences. Moreover, the derivation of several models in technology takes advantage
of the above fundamental models of mathematical physics.
As we have already seen in Chapter 1, macroscopic models are characterized by a state
variable u which depends on time and space. In general this dependence is continuous and one
88
can write
where D ⊆ IR3 is the domain of the space variable and u is an n-dimensional vector. Math-
ematical models are generally stated in terms of a system of partial differential equations for
the variable u.
If the state variable is a scalar and the space variable is in dimension one, then one simply
has
The organization of this chapter is somehow analogous to the one of the previous one. After
the above introduction, the contents are organized as follows:
– Section 3.2 deals first with the description of some general modelling methods; then, these
methods are will be applied to the derivation of some elementary models of mathematical
physics related to simple phenomena, such as wave propagation, thermal diffusion.
– Section 3.3 analyzes the contents of the preceding section and deals with a classification of
models and equations. This means looking at the mathematical structure of the models to
provide a classification based on common properties of the said equations. As we shall see, the
classification, based on mathematical properties, can be related to different classes of physical
phenomena described by the model.
– Section 3.4 deals, on the basis of the above classification, with the statement of mathemat-
ical problems. The above classification is finalized to the correct statement of mathematical
problems technically related to the development of mathematical methods. The mathematical
formulation of problems is, as we shall see, preliminary to their solution and to the development
of simulations.
– Section 3.5 deals with an introduction to analytic methods for linear problems. It is only a
brief introduction considering that these Lecture Notes are mainly devoted to nonlinear models
and problems, a relatively deeper insight can be obtained by the pertinent literature which will
be given in what follows.
– Section 3.6 deals with the discretization of continuous models into a discrete number of finite
models. Using this method, one obtains a set of models which approximate the continuous one
by a system of ordinary differential equations related to the evolution of each finite element.
– Section 3.7 proposes a critical analysis mainly focused on the selection of the macroscopic
scale to represent the system.
89
It is worth mentioning that this chapter does not aim to provide a complete presentation of
the large variety of models of mathematical physics. The aim is to deal with methodological
aspects which may contribute to address the reader to mathematical modelling as a science.
The examples of models proposed in this chapter have to be regarded as relatively simple ones
proposed as a mathematical description of simple phenomena.
This section deals with the analysis of the methodological aspects of the mathematical mod-
elling of continuous systems, which will be then applied to the study of a number of physical
systems chosen as suitable examples.
As usual, the derivation of a mathematical model suitable to describe a certain physical
system requires a preliminary phenomenological observation both of the physical system and
of its connection with the outer environment. Then, the modelling procedure can be developed
along the following steps:
1: Selection of the state variable which describes, by the model, the real system: this variable
can be called state variable;
2: Modelling of the interactions between the inner system under consideration and the outer
environment;
3: Assessment of the equilibrium, conservation, and/or balance equations, related to the above
defined state variable, including the action of the outer environment to the inner system.
4: Modelling the material behavior of the system by means of phenomenological models, which
will be called constitutive or material models;
5: Derivation, using the equations dealt with in Steps 3 and 4, of a mathematical model. Such
model consists of a suitable set of equations describing in time and space the evolution of the
state variable.
The above scheme does not claim to cover all conceivable types of modelling procedures.
However, various examples are proposed to the reader to achieve a general understanding of
the above methodology and of its generalization.
Then, once the mathematical model has been designed, the usual procedure for its analysis
can be organized: formulation of mathematical problems by a proper statement of initial and
boundary conditions, qualitative analysis, solution of mathematical problems, and validation
of the model.
As mentioned in Step 3, several methodological approaches can be followed toward the deriva-
tion of a certain model, while a common feature of the analysis developed in what follows is
that we consider a real system represented with reference to a certain fixed, generally inertial,
frame.
Moreover, models are derived under the assumption that the matter is continuous. This is
certainly an approximation, since intermolecular distances are always positive quantities. On
90
the other hand, when these distances are sufficiently small with respect to the characteristic
dimensions of the body, the continuum hypothesis becomes reasonable and can be regarded as
an acceptable simplification. If this hypothesis is no longer applicable, for instance for a rarefied
gas, then mathematical models have to be derived at a molecular scale, as we shall see in the
next chapter.
Some examples of models and of the related modelling techniques will be now given in what
follows.
Example 3.2.1
Modelling the Vibration of an Elastic String
Let us consider, with reference to Fig. 3.2.1, an elastic string fixed at both its extrema. The
modelling procedure for the dynamics of the system can be based upon the following assump-
tions:
• The string is held in A and B by a strong tension directed along the string. This implies that
gravity can be neglected, and that consequently the string takes at equilibrium a straight
configuration, identified by the axis with unit vector i.
• The string is subjected to small displacements from the equilibrium configuration.
• The motion of the string is localized on a plane, identified by the unit vectors i and j and
every point of the string moves along j, namely perpendicularly to the equilibrium state.
• The state variable is the perpendicular displacement
where the independent variables are time t and location x along the string.
• A fluid-dynamic drag is exerted on the string by the outer environment. A phenomenological
model of the force acting upon the string element ∆x is as follows
¯ ¯n
¯ ∂u ¯ ∂u
F = −h ¯¯ ¯¯ ∆x j , (3.2.2)
∂t ∂t
Referring then to Fig. 3.2.1, if θ is the slope of the string, then the component of the tension
along j acting over the element ∆x of the string is
∆u ∆u ∼ ∂u ,
sin θ ∼
=√ ∼
= = (3.2.4)
2
∆u + ∆x2 ∆x ∂x
is obtained.
Therefore, the element ∆x is subject to the following actions:
∂2u
• T0 ∆x is the action of the internal tension;
∂x2
∂2u
• −ρ 2 ∆x is the inertial force;
∂t
¯ ¯p
¯ ∂u ¯ ∂u
• −h ¯¯ ¯¯ ∆x is the fluid-dynamic drag.
∂t ∂t
In the material reference frame, these forces set the system in equilibrium (Newton’s me-
chanical model or, more properly, d’Alembert’s principle). The application of the equilibrium
equation yields the following continuous dynamic model
¯ ¯p
∂2u 2
2∂ u
¯ ¯
2 ¯ ∂u ¯ ∂u ,
2
= c 2
− k ¯ ∂t ¯ ∂t (3.2.6)
∂t ∂x
p p
where c = T0 /ρ, and k = h/ρ.
Remark 3.2.1. The model is derived under the assumption of small deformations of the string.
Hence its validity is limited to the case of small vibrations. If the model is used to describe
large deformations, then unreliable descriptions follow.
92
Remark 3.2.2. The above model is nonlinear. Neglecting the fluid dynamic drag leads to the
following linear model:
∂2u 2
2∂ u
= c · (3.2.7)
∂t2 ∂x2
The model is linear also if p = 0
∂2u 2
2∂ u ∂u
2
= c 2
− k2 · (3.2.8)
∂t ∂x ∂t
Example 3.2.2
Linear Temperature Diffusion Model
Consider a linear diffusion model homogeneous in the space. The assumptions which generate
the model are the following:
• The physical quantity which defines the state of the system is the temperature in the body:
• The mass per unit volume of the medium is assumed to be constant, as well as the heat
capacity of the medium.
• The diffusion of temperature in a certain direction is described by a phenomenological model
which states that the said velocity is directly proportional to the directional derivative of u
in the above direction.
The evolution model, i.e. the evolution equation for u, can be obtained equating the net heat
flux in a control volume to the increase of heat capacity:
µ ¶
∂u ∂2 ∂2 ∂2
=k + + u = k ∆u , (3.2.9)
∂t ∂x2 ∂y 2 ∂z 2
Example 3.2.3
Equilibrium Temperature Profiles Model
Consider a linear diffusion phenomena homogeneous in the space according to Example 3.2.2.
The temperature profiles can be obtained by Model 3.2.2 simply assuming that the temperature
does not change in time. The following model is obtained:
µ ¶
∂2 ∂2 ∂2
+ + u = ∆u = 0 , (3.2.10)
∂x2 ∂y 2 ∂z 2
Remark 3.2.3. Nonlinear models are obtained if the diffusion coefficient depends on the tem-
perature. This applies to both Models 3.2.2 and 3.2.3.
Example 3.2.4
Hydrodynamic Models
Consider a description of the fluid such that the state variable u is simply defined by density ρ,
and velocity v, regarded as functions of time and space. The derivation of the model is based
on mass and momentum conservation equations. The velocity v is a three dimensional vector:
v = vx i + vy j + vz k . (3.2.11)
Therefore, the model needs four scalar components related to the variable:
u = {ρ , vx , vy , vz } . (3.2.12)
where dΩ is the elementary surface and n is the unit vector orthogonal to the surface conven-
tionally directed towards the outer region.
Gauss Theorem allows to transform the surface integral into a volume integral by replacing
the function to be integrated by its divergence
Z
G=− ∇x (ρv)(t, x, y, z) dx dy dz , (3.2.15)
Ω
where
∂ ∂ ∂
∇x = i+ j+ k. (3.2.16)
∂x ∂y ∂z
The time evolution of the mass in the control volume Ω equals the mass flux given by (3.2.14)
through Ω. Due to the arbitrary choice of Ω, the conservation equation can be written in
differential terms as follows:
∂ρ
+ ∇x (ρv) = 0 , (3.2.17)
∂t
94
∂ρ ∂
+ (ρvx ) = 0 . (3.2.18)
∂t ∂x
∂ρ ∂ ∂ ∂
+ (ρvx ) + (ρvy ) + (ρvz ) = 0 , (3.2.19)
∂t ∂x ∂y ∂z
for a system subject to an external field which may depend on the density and velocity of the
flow: F = F(t, x, y, z, ρ, v), referred to the unit mass at the time t in the point (x, y, z).
The integral conservation relation writes
Z Z Z
∂
ρv dx dy dz + ∇x ((ρv) · v) dx dy dz = ρF(t, ·) dx dy dz . (3.2.21)
∂t Ω Ω Ω
∂
(ρv) + ∇x ((ρv) · v) = ρ F(t, x, y, z, ρ, v) , (3.2.22)
∂t
∂ ∂
(ρvx ) + ((ρvx ) vx ) = ρ Fx (t, x, ρ, v) . (3.2.23)
∂t ∂x
Model (3.2.22) generates three scalar evolution equation which can be explicitly written as
follows:
∂
(ρvx ) + ρvx ∇x v + vx ∇x (ρv) = ρ Fx (t, x, y, z, ρ, v) ,
∂t
∂
(ρvy ) + ρvy ∇x v + vy ∇x (ρv) = ρ Fy (t, x, y, z, ρ, v) , (3.2.24)
∂t
∂ (ρv ) + ρv ∇ v + v ∇ (ρv) = ρ F (t, x, y, z, ρ, v) .
z z x z x z
∂t
System (3.2.24) coupled with (3.2.18) gives the evolution model. However additional micro-
scopic modelling is needed to define the term F generated by the external applied field and by
the internal viscous forces. A simple example of model is given in what follows.
95
Example 3.2.5
A One-dimensional Hydrodynamic Model
Consider, with reference to Model 3.2.4, the one-dimensional flow of a fluid in a duct, and let x
be the coordinate of the axis of the duct. Moreover, assume that the variation of density and
velocity in the direction orthogonal to the axis can be neglected. In this case the expressions
of mass and linear momentum conservation equations simplify as follows:
∂ρ ∂
+ (ρv) = 0 ,
∂t ∂x
(3.2.25)
∂v + v ∂v = F (ρ, v) .
∂t ∂x
The above equation can generate a self-consistent model if the action F , namely the force
acting on the volume element, can be properly modelled. One can distinguish two kind of
forces: those acting on the entire volume from outside, e.g. gravity, and those generated by
the pressure of the liquid present in the near volume elements pushing our volume element
through the sections separating them. The former are generally called body forces and can be
written as Fb = Af dx, where f is the force per unit volume. The latter are generally called
contact forces and give rise to a net force
∂P
Fc = A[−P (t, x + dx) + P (t, x)] = − (t, x)Adx . (3.2.26)
∂x
Therefore, momentum balance writes
µ ¶
∂v ∂v ∂P
ρ + 2v =f− . (3.2.27)
∂t ∂x ∂x
Equation (3.2.27) needs a phenomenological description of the dependence of the body and
contact forces on the state variables ρ and v. For instance, if the duct is vertical and only
gravity acts on the fluid, one can show that f = ρg. On the other hand, the phenomenological
law for the pressure P involves the identification of a constitutive relation which states how the
fluid responds to deformations (in this case compression). It is found, for instance, that certain
gases, called polytropic gases, satisfy the following constitutive law:
Remark 3.2.4. In some cases of unsteady flow conditions, it is used a constitutive law which
holds true at equilibrium. This is certainly an approximation of physical reality and the example
gives an idea of the difficulty of mathematical modelling.
Wave phenomena can be described by higher order equations to include the modelling of
solitary waves. This is the case of the celebrated Korteweg-deVries model.
Example 3.2.6
Korteweg-deVries - Solitary wave models
Solitary wave phenomena can be described by a model derived in 1895 by Korteweg and de Vries.
This model describes the evolution of the long water waves in a channel of rectangular cross-
section and is characterized by an appropriate balance between nonlinearity and dispersion.
The model arises in several fields of fluid mechanics, such as water waves, internal gravity
waves in stratified fluids, and waves in rotating atmosphere and can be regarded as one of the
relevant paradigms of the nonlinear waves and soliton solutions.
In details consider the following evolution equation mixing transport and third order backward
diffusion:
∂u ∂u ∂3u
+ um + µ 3 = 0, (3.2.30)
∂t ∂x ∂x
where u defines the wave depth.
It is useful stressing, in view of development of simulation methods, reporting some analytic
solutions which may be compared with computational ones. Specifically, the solution for a
single soliton is
£ ¤1/m
u(x, t) = A sech2 (κx − ωt − x0 ) , (3.2.31)
where κ and ω are free parameters and
µκ2 4µκ3
A=2 (m + 1)(m + 2) ω= · (3.2.32)
m2 m2
The model has an infinite number of conserved densities that, integrated over the whole range
of the space domain, are independent of time. The first five invariant quantities are
Z +∞
I1 = udx , (3.2.33)
−∞
Z +∞
1 2
I2 = u dx , (3.2.34)
−∞ 2
Z " µ ¶2 #
+∞
3 1 ∂u
I3 = u + dx , (3.2.35)
−∞ 2 ∂x
Z " µ ¶2 µ ¶2 #
+∞
4 ∂u ∂2u
I4 = 5u + 10u + dx , (3.2.36)
−∞ ∂x ∂x2
97
and
Z " µ ¶2 µ ¶2 µ ¶2 #
+∞
5 2 ∂u ∂2u ∂3u
I5 = 21u + 105u + 21u + dx , (3.2.37)
−∞ ∂x ∂x2 ∂x3
where the first three conservation equations correspond to mass, momentum, and energy, re-
spectively, of the wave. These equations can be used to test the application of computational
schemes.
The last model of this section focus on vehicular dynamics, already introduced in Section 1.5
of Chapter One, according to the conservation of mass equation, which writes
∂u ∂(uv)
+ = 0, (3.2.38)
∂t ∂x
where u is the dimensionless number density of vehicles, namely the real density divided by
the maximal value reached in bumper-to-bumper conditions, and v is the dimensionless mean
velocity, that is the real mean velocity divided by the maximal one, for instance that obtained
by speed limits.
The modeling approach takes advantage of the so called fundamental diagram which re-
ports, in steady uniform conditions, the flux q = uv versus the density u ∈ [0, 1]. A typical
model, which approximates empirical data, is the following
q = u v = u (1 − u), v = 1 − u, (3.2.39)
which is valid for a large variety of one–lane road. This phenomenological model is used to
derive a traffic dynamics model although more refined interpretation of physical reality can be
proposed. Still it is consistent with the tutorial reasonings presented in the following.
Example 3.2.7
Macroscopic Vehicular Model
First order models of vehicular traffic can be obtained by closing the mass conservation (3.2.38)
by phenomenological model linking the velocity v to the local density conditions. It is suggested
in De Angelis (1999) that this relation can be offered by using in the expression of v the perceived
density w, rather than the real one, modeled as follows:
· ¸
∂u
w = u 1 + ε (1 − u) . (3.2.40)
∂x
Remark 3.2.5. Model 3.2.39 is a non-linear transport diffusion equation that can be written
as follows:
· ¸
∂u ∂u ∂ ∂u
+ c(u) =ε k(u) , (3.2.42)
∂t ∂x ∂x ∂x
where c(u) = (1 − 2u) is the density dependent transport velocity, and k(u) = u2 (1 − u) is the
density dependent diffusion term.
Remark 3.2.6. The phenomenological model 3.2.38 basically states that the perceived density
w is higher/lower in the presence of postive/negative density gradients. Moreover, this effect
depends on (1 − u), namely it increases with decreasing density.
Once a mathematical model has been derived, its application can be developed in view of the
qualitative and computational analysis of mathematical problems generated by its application
to real world analysis. As we have seen in the examples given in Section 3.2, the governing
equations appears to be relatively more complex than those encountered for discrete systems.
Dealing with the statement of mathematical problems requires, actually it is even necessary,
dealing with a detailed analysis of the qualitative properties of the models.
Indeed the statement of the problems and the qualitative behavior of the solutions depends on
the properties of the models which can be classified according to their mathematical structure.
The application of computational schemes also needs to be related to the above properties.
This topic is dealt with in this section devoted to the classification of mathematical equations
on the basis of some structural properties of the equations themselves.
As we have seen, a large variety of macroscopic models can be written in terms of partial
differential equations. This section deals with the qualitative analysis of a large class of first
and second order equations, related to the models we have seen in Section 3.2. Considering
that the variety of this type of equations is extremely broad, completeness is not claimed.
Bearing all above in mind, consider the following class of second-order partial differential
equation in two dependent variables:
n
X µ ¶
∂2u ∂u ∂u
Aij (x) =F x; u, ,..., , (3.3.1)
i,j=1
∂xi ∂xj ∂x1 ∂xn
can be proposed:
)
∀i λi =
6 0 ∃!λi > 0
∀i λi =6 0 ∃!λi < 0 =⇒ hyperbolic ,
)
det A = 0 =⇒ parabolic ,
(or equivalently ∃λi = 0)
)
∀i λi > 0
∀i λi < 0 =⇒ elliptic .
Analogous classification can be developed for first order models, such as those we have seen
with reference to hydrodynamic models. Specifically, consider quasi-linear systems of n first-
order partial differential equations in two independent variables:
n
X n
∂uj X ∂uj
Aij (x, u) + Bij (x, u) = fi (x, u) , i = 1, . . . , n , (3.3.2)
j=1
∂t j=1
∂x
∂u ∂u
A(x, u) + B(x, u) = f (x, u) . (3.3.3)
∂t ∂x
If the dependence on u drops in Aij and Bij , then the system is said to be almost linear. If
it drops also in fi , then the system of partial differential equations is linear.
Under the assumption that A is nonsingular, the classification is based on the calculations of
the roots of the eigenvalue problem
It reads as follows:
• If Pn (λ) has n real distinct zeros =⇒ hyperbolic;
• If Pn (λ) has n real zeros at least one of which is
repeated and n independent eigenvectors =⇒ hyperbolic;
• If Pn (λ) has n real zeros at least one of which is
repeated and fewer than n independent eigenvectors =⇒ parabolic;
• If Pn (λ) has no real zeros =⇒ elliptic.
This classification, which does not cover all possibilities due to the assumptions on the Ma-
trices A and B, is however useful to deal with a large variety of models. For instance, it can
100
∂u ∂u
+c = f (t, x; u) , c > 0, (3.3.6)
∂t ∂x
is hyperbolic.
As a particular case of Eq. (3.3.1), consider the following class of equations:
µ ¶
∂2u ∂2u ∂2u ∂u ∂u
A(t, x) 2 + 2B(t, x) + C(t, x) 2 = F t, x; u, , . (3.3.7)
∂t ∂t∂x ∂x ∂t ∂x
B 2 − AC > 0 =⇒ hyperbolic,
B 2 − AC = 0 =⇒ parabolic, (3.3.8)
B 2 − AC < 0 =⇒ elliptic .
Considering that A, B, and C are functions of t and x, then the type of equation depends
upon the local values of the coefficients. In this case the model may change of type. For
instance, the abstract model
µ ¶
∂2u ∂2u ∂u ∂u ,
+ C(t, x) =F t, x; u, , (3.3.9)
∂t2 ∂x2 ∂t ∂x
which is such that A = 1 and B = 0. Therefore B 2 − AC = −C(t, x), then the equation
is elliptic, parabolic, or hyperbolic according to the sign of C. In particular, it is elliptic,
hyperbolic, or parabolic when, respectively, C is positive, negative, or equal to zero.
The above classification can be applied to the following specific examples:
Example 3.3.1
Hyperbolic, parabolic, and elliptic
Referring to the models presented in Section 3.2, the following properties holds true:
• Model 3.2.1 (vibrating string) is hyperbolic.
• Model 3.2.2 (diffusion phenomena) is parabolic.
• Model 3.2.3 (Laplace’s equation) is elliptic.
101
Example 3.3.2
Ferrari and Tricomi Model
A model, which changes of type, was proposed by Ferrari and Tricomi, to describe the space
evolution of the velocity potential, in the velocity space, for inviscid flow around the sonic line
for transonic flow around thin airfoils. The model is as follows:
∂2u ∂2u
+ y = 0, (3.3.13)
∂x2 ∂y 2
which is elliptic for y > 0, which corresponds to subsonic flow, parabolic for y = 0, which
corresponds to the sonic line, and hyperbolic for y < 0, which corresponds to supersonic flow.
It can be shown that the qualitative behavior of the solutions depends on the above qual-
itative properties of the model related to the above classification. Referring to the specialized
literature, e.g. Dautray and Lions (1990), the following examples are given:
• Parabolic equations are evolution equations that describe diffusion-like phenomena. In
general, the solution of a parabolic problem is smooth with respect to both space and time even
if the initial data are not continuous. In other words, parabolic systems have a smoothing
action so that singularities do not develop, nor can be maintained. Moreover, even if the initial
data have compact support, at any t > 0 the initial condition is felt everywhere. This is usually
indicated by saying that parabolic systems have an infinite speed of propagation.
• Elliptic equations describe systems in the equilibrium or steady state. They can be seen as
equations describing, for instance, the final state reached by a physical system described by a
parabolic equation after the transient term has died out.
• Hyperbolic equations are evolution equations that describe wave-like phenomena. The
solution of an hyperbolic initial-value problem cannot be smoother than the initial data. On
the other hand, it can actually develop, as time goes by, singularities even from smooth initial
data, which then characterize the whole evolution and are propagated along special curves
called characteristics. In particular, if the solution has initially a compact support, then such
a support expands with a finite speed of propagation of the effect modelled by the hyperbolic
equation.
The above qualitative properties will be analyzed with reference to the examples which follow:
Example 3.3.3
Initial Value Problem for the Linear Transport Model
Consider the linear transport model:
µ ¶
∂ ∂
Lu = +c u, c > 0, (3.3.14)
∂t ∂x
102
2 2
linked to the initial condition: u(t = 0, x) = e−x . Then the solution is u(t, x) = e−(x−ct) ,
which travels unchanged to the right with speed equal to c.
Example 3.3.4
Initial Value Problem for the Linear Diffusion Model
Consider the linear diffusion model:
µ ¶
∂ ∂2
Lu = −h 2 u, h > 0, (3.3.15)
∂t ∂x
linked to the initial condition given in Example 3.3.3. Then the solution is
x2
1 −
u(t, x) = √ e 1 + 4ht , (3.3.16)
1 + 4ht
which shows how the bell shape of the initial condition is preserved, but it broadens out; the
maximum is always at x = 0, but it decreases with time.
Example 3.3.5
Initial Value Problem for the Linear Wave Model
Consider the linear wave model:
µ ¶
∂2 2 ∂
2
Lu = − c u, (3.3.17)
∂t2 ∂x2
linked to the same above initial condition in Example 3.3.3 and to an additional condition
stating that the initial speed is equal to zero
∂u
(t = 0, x) = 0 . (3.3.18)
∂t
Then, the solution is as follows:
1 £ −(x−ct)2 2¤
u(t, x) = e + e−(x+ct) , (3.3.19)
2
which shows that the initial the bell shaped condition is split in two and both parts travel with
speed c, one to the left and the other to the right.
103
Remark 3.3.1. If the solution to the initial value problem related to the above models is
developed for initial conditions with compact support: u(t = 0, x) = 0 for x ∈
/ [a, b], and
u(t = 0, x) = uin (x) > 0 for x ∈ [a, b], then the support at time t is:
• [a + ct, b + ct] for the transport model;
• [a − ct, b − ct] ∪ [a + ct, b + ct] for the wave propagation problem;
• IR for the diffusion problem;
Remark 3.3.2. It can be observed that the solution to the transport model remains constant
on the lines x = x0 + ct. Similarly, the solution to the wave equation is made up of two terms:
the first remains constant on the lines x = x0 +ct, the second on the lines x = x0 −ct. From this
observation it should be clear that these are peculiar curves in the (t, x)-plane which deserve to
be named. They are, in fact, called characteristics and, as we shall see, are very important
both from the physical and the mathematical point of view.
dx
= c(t, x) , (3.3.21)
dt
and evaluate u on them we have that its temporal evolution is given by
d ∂u dx
u(t, x(t)) = + ∇u · · (3.3.22)
dt ∂t dt
Recalling Eq. (3.3.20), yields
d ³ ¡ ¢´
u(t, x(t)) = f t, x(t); u t, x(t) . (3.3.23)
dt
The line
dx = c(t, x) ,
dt (3.3.24)
b,
x( t̂ ) = x
b).
is the so-called characteristic through (t̂, x
The evolution of the solution on it is then given by
³
¡ ¢ ¡ ¢´
d u t, x(t) = f t, x(t); u t, x(t)
dt (3.3.25)
b) = u
u(t̂, x b,
104
where u b).
b is the value of u in (t̂, x
Thus we have reduced the solution of a first-order partial differential equation to the solution
of two coupled first-order ordinary differential equations.
Example 3.3.6
Characteristic lines
The characteristics of the equation
∂u ∂u
+ (αx + βt + γ) =0 (3.3.26)
∂t ∂x
dx
= αx + βt + γ , (3.3.27)
dt
where C is an integration constant. Hence the characteristic through the point (t̂, x
b) is
αγ + β ¡ α(t−t̂) ¢ β ¡ α(t−t̂) ¢
beα(t−t̂) +
x(t) = x 2
e −1 + t̂e −t , (3.3.29)
α α
if α 6= 0 , and
β 2
x(t) = x (t − t̂2 ) ,
b + γ(t − t̂) + (3.3.30)
2
if α = 0. Then the state variable remains constant along the characteristics.
This section deals with the mathematical formulation of problems related to models stated
in terms of partial differential equations. A line similar to that of Chapter 2 will be followed:
after having verified the consistency of a mathematical model, the formulation of mathematical
problems is stated adding to the evolution equation the conditions necessary to find quanti-
tative solutions. Possibly, before dealing with the search of solutions, the well position of the
mathematical problem should be verified.
105
The analysis developed in two steps: first we refer to scalar models in one space dimension
and then various generalizations are dealt with. Let us consider the class of models involving
dimensionless scaled independent and dependent variables:
stated in terms of partial differential equations which can be written, in normal form, as follows:
µ ¶
∂u ∂u , ∂ 2 u
=f t, x; u, . (3.4.2)
∂t ∂x ∂x2
Relatively more general cases can be technically developed starting from the indications
given for the above class of models. For instance higher order problems as well as problems in
unbounded domain, and so on.
Remark 3.4.1. Actually the range of the dependent variable cannot be stated precisely with-
out solving the equation. Therefore, only the physics can contribute to identify the smallest
and largest value of the dependent variable. Therefore, (3.4.1) can be interpreted by saying
that u is of the order of unity.
Remark 3.4.2. (Statement of problems) If both time and space derivatives appear in the
mathematical model, then both initial and boundary conditions usually have to be assigned.
The relative mathematical problem is then called an initial-boundary-value problem. If,
instead, the mathematical model is static, i.e., time independent, then initial conditions are not
needed. In this case, the mathematical problem is defined as a boundary-value problem.
Bearing all above in mind, some mathematical problems, in one space dimension, will be
stated in what follows.
Problem (Dirichlet) 3.4.1. The Dirichlet initial-boundary value problem for Eq. (3.4.2) is
stated with initial condition
where ϕ is a given function of space, while α and β are given continuous functions of time.
Problem (Neumann) 3.4.2. The Neumann initial-boundary value problem for Eq. (3.4.2)
is stated with initial condition (3.4.3) and Neumann boundary conditions
∂u ∂u
(t, 0) = γ(t) and (t, 1) = δ(t) , ∀ t ∈ [0, 1] , (3.4.5)
∂x ∂x
106
Problem (Mixed) 3.4.3. The mixed initial-boundary value problem for Eq. (3.4.2) is stated
with initial condition (3.4.3), to Dirichlet boundary conditions on one side, and Neumann
boundary conditions on the other one:
∂u
u(t, 0) = α(t) and (t, 1) = δ(t) , ∀ t ∈ [0, 1] , (3.4.6)
∂x
or viceversa
∂u
(t, 0) = γ(t) and u(t, 1) = β(t) , ∀ t ∈ [0, 1] . (3.4.7)
∂x
Problem (Robin) 3.4.4. The Robin initial-boundary value problem for Eq. (3.4.2) is stated
with initial condition (3.4.3), to the boundary conditions defined as a linear combination of
Dirichlet and Neumann boundary conditions, say
where c1,2,3,4 and ua,b are given functions of time ∀ t ∈ [0, 1].
Problem (Nonlinear boundary value) 3.4.5. The initial-boundary value problem for
Eq. (3.4.2) with nonlinear boundary conditions is stated with initial condition (3.4.3), to the
boundary conditions defined as a nonlinear combination of Dirichlet and Neumann boundary
conditions, say
∀ t ∈ [0, 1], where ba,b are given functions of time, and g1,2 are suitable functions of their
arguments.
Remark 3.4.3. (Sufficient initial and boundary conditions) The above statement of
problems has been obtained linking to the evolution equations a number of information on the
behavior of the state variable on the boundary of the independent variable equal to the highest
order of the partial derivative with respect to the said variable.
Example 3.4.1
Mathematical Problems for the Linear Wave Model
Consider the linear wave model
µ ¶
∂2 2 ∂
2
Lu = − c u = 0, (3.4.10)
∂t2 ∂x2
Bearing in mind the above definition of statement of problems, the following general definition
can be given:
Well-formulated problems
If the model is implemented with enough initial and boundary conditions to find a
solution, then the relative mathematical problem is said to be well-formulated.
where the boundary of the domain of D ⊆ IR3 of the space variables is denoted by ∂D. The
statement of some related mathematical problems is as follows:
Problem (Dirichlet) 3.4.6. The initial-boundary value problem for the scalar Eq. (3.4.12)
with Dirichlet boundary conditions, is stated with initial condition
Problem (Neumann) 3.4.7. The initial-boundary value problem for the scalar Eq. (3.4.12)
with Dirichlet boundary conditions, is stated with initial condition (3.4.14) and Neumann
boundary conditions
∂u
∀ t ∈ [0, 1], ∀ x ∈ ∂D : = γ ∗ (t; x ∈ ∂D) , (3.4.16)
∂n
given as functions of time and where n denotes the normal to ∂D directed inside D.
Remark 3.4.5. If the model is defined by a system of equations, then initial and boundary
conditions have to be assigned for each equation according to the rules stated above. For prob-
lems in unbounded domain the same reasoning developed for problems in one space dimension
can be technically generalized to problems in more than one space domain.
Example 3.4.2
Mathematical problems for the heat diffusion model
Consider the heat conduction problem
∂u
= k∇2 u + f (t, x; u) , x ∈ D, (3.4.17)
∂t
where ∂DD and ∂DN are a partition of the boundary ∂D and n is the normal to ∂DN .
It is worth stressing again that the above statements of problems do not imply good position.
This matter is dealt in the pertinent literature, e.g. Lions and Magenes (1968), and Dautray
and Lions (1990), with reference to well defined mathematical problems. The above problems
have been stated for bounded time intervals. If time is let to go to infinity, the statement is valid
globally in time. In this case it is useful looking for the trend of the solutions asymptotically
in time.
The statement of mathematical problems we have seen above has used different types of
boundary conditions, always linear ones. However, in some cases of interest for the applications,
non-linear boundary conditions are needed. This happens whenever the quantities that can be
effectively measured at the boundaries are not the classical Dirichlet or Neumann conditions,
or a linear combination of them.
It is not worth stressing that boundary conditions are not an abstract invention, but quanti-
ties, which have to be effectively measured on the real system. Only in such a way the prediction
of the model will correspond to the real system under consideration. The example which follows
refers to the vehicular traffic model presented in the Example 3.2.7 of Section 3.2. The mea-
surements of the boundary conditions refer specifically to the flow. In fact, for simple technical
reasons due to the available devices, the measurement of the flow is remarkably more precise
than that of density. The following example illustrates the above reasonings in a practical case.
Example 3.4.3
Traffic flow Model
Consider the macroscopic vehicular traffic model presented in Section 3.2, which is written
as follows
· ¸
∂u ∂u ∂ ∂u
+ c(u) =ε k(u) , (3.4.20)
∂t ∂x ∂x ∂x
corresponding to a model of the flow of the type
· ¸
∂u
q = 1 − u 1 − ε (1 − u) , (3.4.21)
∂x
∂ u(t, 0) ∂ u(t, 1)
where α(t) = u(t, 0), β(t) = u(t, 1), γ(t) = , δ(t) = .
∂x ∂x
110
It follows that if qα (t) and qβ (t) are given, then boundary conditions are identified as a
non–linear combination of the Dirichlet and Neumann boundary conditions.
The interested reader can find in Section 6.3 of Bellomo et al. (2007), how the above problem
can be technically solved. It is plain that the above problem cannot be solved by solving system
(3.4.22) with respect to α and β or γ and δ. On the other hand, the solution can be linked to
the discretization schema.
Without entering into the details of technical calculations, we can show that it is necessary to
derive the phenomenological relation (3.4.21) with respect to time and to link it to the equation
of conservation of mass. The following model is obtained:
∂u ∂q
∂t = − ∂x ,
· ¸ (3.4.23)
∂q ∂q ∂q ∂2q
= (1 − 2u) + ε (2 − 3u) − ε u(1 − u) 2 .
∂t ∂x ∂x ∂x
The boundary conditions for this problem are precisely qα (t) and qβ (t). Therefore, the problem
can be solved with the appropriate Dirichlet type conditions on x = 0 and x = 1.
The preceding sections have shown various examples of linear and nonlinear models. Similarly
to the case of models stated in terms of ordinary differential equations, linearity has to be
considered a very special case considering that all systems of real world are, at least in principle,
nonlinear. In case, however special, of linear models and problems one can attempt to deal with
analytic solutions. A brief account of mathematical methods to obtain analytic solutions is given
in this section.
Consider first the relatively simple case of scalar linear models in one space dimension. More-
over, suppose that the solution u = u(t, x) of some initial-boundary value problem is known,
then the starting point of the search for analytic solutions is the representation of the solution
in a Hilbert space. Specifically, consider a representation of a function u = u(t, x) given by an
expansion of the type
∞
X
u(t, x) = ci (t)ψi (x) , (3.5.1)
i=0
where the functions ψi belong to a complete space of orthonormal functions defined in a suitable
Hilbert space with weighted (the weight function is denoted by w = w(x)) inner product
Z
hf , giw (t) = f (t, x)g(t, x)w(x) dx . (3.5.2)
111
ci (t) = hu , ψi iw , (3.5.3)
where the calculation of the integrals may need numerical computation. In this case, one may
discretize the space variable and compute the integrals by weighted sums.
For practical applications u is approximated by un corresponding to the collocation Ix , then
a truncated expansion is used
n
X
u(t, x) ∼
= un (t, x) = ci (t)ψi (x) , (3.5.4)
i=0
Example 3.5.1
Tchebyschev Polynomes
Tchebyschev polynomials are characterized by the following features:
• Range: [−1, 1] .
• Polynomials:
• Weight:
2/π 1/π
i = j 6= 0 : , i=j=0: . (3.5.6)
1 − x2 1 − x2
Example 3.5.2
Legendre Polynomials
Legendre polynomials are characterized by the following features:
• Range: [−1, 1] .
• Polynomials:
2m − 1 m−1
L0 = 1 , L1 = x , Lm = xLm−1 − Lm−2 . (3.5.7)
m m
112
• Weight:
2
i = j 6= 0 : . (3.5.8)
2m + 1
Example 3.5.3
Laguerre Polynomials
Laguerre polynomials are characterized by the following features:
• Range: [0, ∞) .
• Polynomials:
and
1 1
Lam = (2m − 1 + a − x)Lam−1 − (m − 1 + a)Lam−2 . (3.5.10)
m m
• Weight:
m!
i = j 6= 0 : xa e−x . (3.5.11)
Γ(a + m + 1)
Example 3.5.4
Hermite Polynomials
Hermite polynomials are characterized by the following features:
• Range: (−∞, ∞) .
• Polynomials:
• Weight:
m!
i = j 6= 0 : √ e−x/2 . (3.5.13)
2π
The solution of mathematical problems consists in solving suitable evolution equations for
the coefficients ci which are obtained replacing the expression (3.5.4) into the partial differential
equation and in implementing, still into the same expression, initial and boundary conditions.
In the case of relatively simpler problems (e.g. linear differential equations with constant
113
coefficients) the solution can be recovered analytically, while in the general cases computational
methods have to be used. Generally, the method leads to systems of ordinary differential
equations of the type we have seen in Chapter 2.
Consider, in order to understand the application of the method, the following class of linear
equations:
∂ ∂ ∂2
u = a(t) u + b(t) 2 u , (3.5.14)
∂t ∂x ∂x
and initial value problems without constraint at the boundaries.
Replacing the expression (3.5.4) in the evolution equation Eq. (3.5.14), yields a system of
linear ordinary differential equations. In fact:
n n n
∂ X ∂ X ∂2 X
cj (t)ψj (x) = a(t) cj (t)ψj (x) + b(t) 2 cj (t)ψj (x) . (3.5.15)
∂t j=0 ∂x j=0 ∂x j=0
By taking the inner product, an evolution equation for the coefficients can be obtained as
follows:
Xn Xn
dci
= a(t) cj (t)hji (t) + b(t) cj (t)kji (t) , (3.5.16)
dt j=0 j=0
where
¿ À
dψj
hji = , ψi , (3.5.17)
dx
and
¿ À
d2 ψj
kji = , ψ i . (3.5.18)
dx2
The above linear equation can be treated with methods of linear ordinary differential equa-
tions. Initial ci0 conditions have to be implemented for t = 0. Then using the same approxi-
mation, one has:
n
X
u(0, x) = u0 (x) ∼
= un0 (x) = cj (t)ψj (x) . (3.5.19)
j=0
A similar reasoning can be applied to boundary conditions. Consider specifically the case of
Dirichlet boundary conditions. Then one has:
n
X
u(t, 0) = α(t) ∼
= un (t, 0) = cj (t)ψj (x = 0) , (3.5.20)
j=0
and
n
X
u(t, 1) = β(t) ∼
= un (t, 1) = cj (t)ψj (x = 1) . (3.5.21)
j=0
The solution of the above algebraic system allows computing the first and last coefficient in
terms of the others, thus reducing the dimension of System (3.5.16). Convergence for n → ∞ is
114
related to the application of classical convergence theorems for orthogonal functions in Hilbert
spaces. Dealing with systems of equations simply means applying the above procedure to all
equations.
In principle, the same method can be implemented for equations in several space dimension.
Formally the procedure is the same, however the method generates technical computational
complexity related to the large dimension of the system.
Continuum models may be viewed as systems with infinite number of degrees of freedom. The
analysis proposed in Section 3.5 has shown how one may look for analytic solutions in terms
of an infinite expansion of functions in a suitable Hilbert space. This method can be unlikely
applied to the solution of nonlinear problems and models, although, at least in principles,
one may attempt to apply the same procedure. On the other hand, the technical application
of the method immediately shows that the system of ordinary differential equations becomes
nonlinear, and the computational complexity is to heavy to be technically tackled.
Then methods of computational mathematics need to be developed to approximate the con-
tinuous system, say:
Lu = N u , (3.6.1)
Lun = N un , (3.6.2)
where n is such a number. In other words, the problem consists in approximating the continuous
model (or problem) by a discrete model (or problem). Then the solution of the finite model can
be obtained by methods reviewed in Chapter 2, while a technical analysis can possibly provide
informations on the distance between the solution of the continuous model and the finite one.
Although these Lecture Notes deal with modelling rather than with computational methods,
this section will however show how the above approximation can be developed for some simple
cases. As we shall see, the analysis is rather simple in the case of problems in one space dimen-
sion only, while computational complexity problems arise when models, and related mathemat-
ical problems are in more than one space dimensions. So far this sections should be regarded as
a brief introduction to the above problems to be properly dealt with referring to the specialized
literature.
Discrete models can be obtained by continuous ones by decomposing the domain D of the
space variable into a certain number of finite volumes Di :
n
D = Ui=1 Di , i = 1, . . . , n , (3.6.3)
and by taking averaged value of the state variable within each volume.
115
It follows that in each finite volume the dependent variable is ui = ui (t), and that the
mathematical model is a finite one consisting in a set of n equations providing the evolution of
the ui . The model is obtained by the same conservation and/or equilibrium equations of the
continuous model. The variables ui can be localized in the center of mass of each volume, while
each finite element is assumed to be subject to the action of the continuous ones as well as from
the boundaries for those elements which are localized on the boundary containing the system
which is modelled.
The above procedure generates a finite model and mathematical problems stated in terms of
a system of ordinary differential equations. The discretization method can be easily applied
in the case of one space dimension. The examples which follow refer to the discretization of a
model of an elastic string subject to Dirichlet boundary conditions, and to a model of diffusion
phenomena with Neumann boundary conditions. Namely we refer to models already presented
in Examples 3.2.1 and 3.2.2. Generalizations - and difficulties - to problems in more than one
space dimension will be subsequently described.
Example 3.6.1
Discrete Models for a Vibrating String
Consider a discrete model corresponding to the vibrating string model described in Example
3.2.1 where, in this case, gravitation is not neglected and the vibration plane is supposed to be
vertical.
Following the above listed modelling procedure, the space variable is discretized by the col-
location
The extrema are fixed; hence two degrees of freedom, corresponding to the movement of the
points on the boundary, are suppressed. A mass m = M/(n + 2) is placed in each collocation
point. It is constrained to move along the vertical direction and is connected to the contiguous
masses by linear springs with elastic constant k, as shown in Figure 3.6.1.
The procedure provides a mechanical model with n degrees of freedom identified by the
Lagrangian variable
where qi is the vertical displacement of the mass located in xi . The mathematical model
corresponding to the above mechanical model can be obtained by the following methods of
classical mechanics. In detail, the kinetic energy is
n µ ¶2
1 X dqi
Ec = m , (3.6.6)
2 i=1 dt
n
X n
k X£ ¤
U= mg qi − (qi+1 − qi )2 + (xi+1 − xi )2 , (3.6.7)
i=1
2 i=0
d2 qi k
=g+ (qi+1 − 2qi + qi−1 ) . (3.6.8)
dt2 m
It is worth remarking on a few connections between the above finite model and the original
continuous model:
i) The derivation of the discrete model required the preliminary construction of a mechanical
model approximating the continuous phenomenological model. Subsequently, modelling
methods for models at the microscopic scale are applied to obtain the finite mathematical
model.
ii) The system of ordinary differential equations is linear as it was the original continuous model.
On the other hand, nonlinear models are needed for large deviations when the elastic term
cannot be anymore assumed to be constant. The material behavior can be simulated assuming
that c depends on the lengths of the stretched elements.
iii) Increasing the number of finite elements may increase accuracy. However, it also increases
the dimension of the system of differential equations.
iv) The model with finite degrees of freedom satisfies the same linearity (or nonlinearity) prop-
erties of the continuous model.
v) The model has been derived assuming that each element moves in a vertical plane. On the
other hand, the same procedure can be technically applied to the case of elements moving
in the plane. This is a simple technical problem, while the continuous model implies severe
technical difficulties.
117
The above model was derived following a finite element scheme. One can technically generalize
them also to finite models derived in finite volume schemes as in the example which follow:
Example 3.6.2
Discrete Models for Diffusion Phenomena
Consider the discrete model corresponding to the one-dimensional case of the heat diffusion
model dealt with in Example 3.2.2. Obtained discretizing the space variable by the collocation
(3.6.4).
A finite element corresponds to each point. Moreover, the heat flux at the extrema are
prescribed, say q1 = q1 (t) and qn = qn (t).
A mechanical model is obtained, which has n degrees of freedom and identified by the variable
where ui is the local temperature in the finite volume localized in xi . The mathematical model
is obtained applying to each finite volume the balance equation:
∂u ∂2u
= k0 2 , (3.6.10)
∂t ∂x
where the flux is modelled by applying Fourier’s phenomenological model:
∂u
q = −h0 (3.6.11)
∂x
The evolution model for an inner element is as follows:
dui k0
= [(ui+1 − 2ui + ui−1 )] , (3.6.12)
dt H
where space derivatives have been replaced by finite differences, and where H is the size of the
volume. This crude approximation can be properly refined. The flux for the boundary elements
is directly given by the boundary conditions.
In the nonlinear case the constant k0 has to be replaced by a function of the local variable u.
The above method is based on the idea of decomposing the continuous system into finite
elements. Then the evolution equation for each element is derived using the same method used
to derive the continuous model. A system of coupled equations is obtained where the coupling
is related to the interactions between elements, in other words, i − 1 and i + 1-elements act on
the i-element. Boundary conditions act on the first and last element.
The application of the method is quite immediate in one space dimension, on the other hand,
various technical complexities occur in more than one space dimension, although the guiding
118
lines for the application of the method is precisely the same. The evolution in each sub-domain
Di is derived by local application of the above mentioned equilibrium and/or conservation
equations, while the actions of the elements on the boundaries of Di acts as boundary conditions.
An analogous result can be obtained by analytic interpolation methods of the variable u =
u(t, x) for x ∈ [0, 1]. In this case we shall talk about discretization methods by collocation-
interpolation. The description of the method refers both to interpolation of functions and to
calculations of local derivatives. Moreover, let us define the collocation
i = 1, . . . , n : Ix = {x1 = 0, . . . , xi , . . . , xn = 1} , (3.6.13)
1
xi = (i − 1)k , k= · (3.6.14)
n−1
or Tchebyschev type collocation with decreasing values of the measure |xi − xj | towards the
borders.
In general, u = u(t, x) can be interpolated and approximated by means of by Lagrange
polynomials as follows
n
X
u(t, x) ∼
= un (t, x) = Li (x)ui (t) , (3.6.15)
i=1
where ui (t) = u(t, xi ) and where Lagrange polinomials are given by the following expressions
The above defined interpolations can be used to approximate the partial derivatives of the
variable u in the nodal points of the discretization
n
X (r)
∂ru dr Lh
(t; xi ) ∼
(r)
r = ahi uh (t) , ahi = (xi ) , (3.6.17)
∂x dxr
h=1
for r = 1, 2, . . ..
The value of the coefficients depends on the number of collocation points and on the type of
collocation. Technical calculations provide, in the case of Lagrange polynomials, the following
result
Q X
(1) (xi ) (1) 1
ahi = Q , aii = , (3.6.18)
(xh − xi ) (xh ) xi − xh
h6=i
where
Y Y Y Y
(xi ) = (xi − xp ) , (xh ) = (xh − xp ) . (3.6.19)
p6=i p6=h
119
Higher order coefficients may be computed exploiting the following recurrence formula
à (r−1)
!
(r+1) (r) a (r+1)
X (r+1)
ahi =r ahi ahi + hi , aii =− ahi . (3.6.20)
xh − xi
h6=i
The same method can be used for time dependent functions in two space variables: u =
u(t, x, y) : [0, 1] × [0, 1] × [0, `] → [−1, 1], such that u(t; x, y) is a one to one map from
[0, 1] × [0, `] into [−1, 1], for every t ∈ [0, 1]. Consider, in addition to the collocation Ix , the
following
j = 1, . . . , m : Iy = {y1 = 0, . . . , yj , . . . , ym = 1} . (3.6.21)
It follows that
n X
X m
u = u(t, x, y) ∼
= unm (t, x, y) = Li (x)Lj (y)uij (t) . (3.6.22)
i=1 j=1
The approximation of the space derivatives in the collocation points is obtained by calculations
analogous to the ones of the one-dimensional case:
n
X (r)
∂ru ∼
(t; xi , y j ) = ahi uhj (t) , (3.6.23)
∂xr
h=1
and
m
X (r)
∂ru
r
(t; xi , yj ) ∼
= akj uik (t) , (3.6.24)
∂y
k=1
XX n m
∂2u
(t; xi , yj ) ∼
= ahi akj uhk (t) , (3.6.25)
∂x∂y
h=1 k=1
where the coefficients a are given by the expressions reported above and depend, as we have
seen, on the type of interpolation and number collocation points.
Both in one and two space dimension the interpolation is, by definition, exactly satisfied in
the nodal points
∗
n X
X m
∗
u = u(t, x, y) ∼
= unm (t, x, y) = Li (x)Lj (y)uij (t) , (3.6.27)
i=1 j=1
120
where the number of collocation points along the y-axis depends on the collocation on the
x-axis.
The above collocation interpolation methods can be applied to obtain finite models approx-
imating the continuous ones, and to solve the related initial-boundary value problems by ap-
proximating them by suitable initial value problems for ordinary differential equations. This
method is explained, at a practical level, in the examples which follow:
Example 3.6.3
From Continuous to Finite Diffusion Models
Consider continuous model for diffusion phenomena corresponding to the following class of
second order partial differential equations
µ ¶2
∂u ∂2u ∂u
= µ(x) 2 + ε(x) , (3.6.28)
∂t ∂x ∂x
n
à n !2
dui X (2)
X (1)
= µ(xi ) aji uj + ε(xi ) aji uj , (3.6.29)
dt j=1 j=1
for i = 1 , . . . , n.
Remark 3.6.1. The above finite model can be used to solve the Dirichlet initial-boundary value
problem simply by substituting the first and last equation by u1 = u(t, 0) and un = u(t, 1). The
differential system can then be solved by means of standard techniques for ordinary differential
equations we have seen in Chapter 2.
Remark 3.6.2. The solution of Neumann problem can be developed analogously. The dif-
ference consists in dealing with the first and last equation. Imposing Neumann boundary
121
conditions yields:
n−1
X (1)
∂u
(t, 0) =a
(1)
u +
(1)
aj1 uj (t) + an1 un ,
∂x 11 1
j=2
(3.6.30)
∂u
n−1
X (1)
(t, 1) =a
(1)
u + ajn uj (t) + a(1)
∂x 1n 1 nn un .
j=2
Then solving the above linear algebraic system with respect to u1 and u2 and substituting
these terms into the first and last equation of the finite model yields the system of ordinary
differential equations approximating the mathematical problem.
Remark 3.6.3. Mixed type problems with Dirichlet boundary conditions on one side and Neu-
mann boundary conditions on the other side can be dealt with following the same method we
have seen above, using for the first or last equations one of the expression given by Eqs. (3.6.30)
related to Neumann boundary conditions, while Dirichlet’s condition is inserted by direct sub-
stitution.
As we have seen, mathematical models derived at the macroscopic scale are generally stated
in terms of partial differential equations. Moreover, models should be written in a suitable
dimensionless form so that the independent variables are scaled in the interval [0, 1], while the
dependent variable is also scaled to be of the order of unity.
Discretization schemes toward computational solutions exploit directly the above formaliza-
tion which refers both to the models and to the mathematical problems. Often, such a formal-
ization is almost necessary to deal with the qualitative and quantitative analysis of models and
problems. Therefore, the class of models, dealt with in this chapter, need a structure differ-
ent from that one of discrete models which have been derived in terms of ordinary differential
equations.
Looking at modelling aspects, one may observe that the various examples we have seen above
have to be regarded as an approximation of physical reality. In fact, the reader can, for instance,
recognize the following:
• All systems seen in this chapter are discrete: continuity assumptions are always an approxi-
mation of physical reality;
• Random features often occur either in the mathematical model or in the statement of the
problem;
• Systems may be constituted by different interconnected systems. In this case one needs
different models for each sub-system and the overall system should be viewed as a system of
122
systems. Therefore the mathematical statement of problems requires the statement of com-
patibility conditions between contiguous systems that may involve both initial and boundary
conditions.
• Mathematical models are generally derived by equilibrium and/or conservation equations
properly closed using phenomenological models describing the material behaviour of the sys-
tem. Generally, these models are valid at equilibrium, while are used to describe dynamical
behaviour far from equilibrium. Any attempt to improve modeling generates various types of
nonlinearities.
Looking at computational aspects, the contents have been devoted simply to the approxima-
tion of continuous models by a suitable set of interconnected discrete models. The discretization
has been obtained in two different ways:
i) Discretization of the volume occupied by the system into several contiguous volumes. Subse-
quently a finite model is derived in each volume supposing that the material is uniform in each
volume. The model is obtained using the same equilibrium and/or conservation equations
used for the derivation of the continuous model.
ii) Identification of a certain number of collocation points in the physical volume occupied by the
system followed by a Lagrange type interpolation of the dependent variable corresponding to
the collocation points. Subsequently partial derivatives, with respect to the space variable,
are approximated by finite sums of the values of the dependent variables in the collocation
points. Finally a system of ordinary differential equations, corresponding to a system of finite
models, approximates the continuous model.
It may be argue that the first method discretizes the continuous model, with an infinite
number of degrees of freedom, by a model with a finite number of degrees of freedom. On the
other hand, the second method obtains an analogous result acting directly on the mathematical
problem generated by the application of the model.
The above approach has been developed in the relatively simpler case of models and problems
in one space dimension. Generalizations to more than one space dimension involve technical
difficulties that are not dealt with in these Lecture Notes. A deeper insight into the application
of collocation methods can be obtained by the book Bellomo et al. (2007), while finite elements
methods for engineering applications are accurately dealt with in the book Kojić and Bathe
(2005).
The reader is addressed to the pertinent literature Bellomo and Preziosi (1995) for alterna-
tive computational methods, for instance finite differences and spectral methods, and related
convergence methods.
Finally, let us remark that models at the macroscopic scale can be suited to model systems of
the inert matter, but not of the living systems. In fact, the averaging process ends up to kill the
ability to express certain strategies. In some cases, the model includes random variables. For
instance, in the vehicular traffic model of Section 3.2, the parameter ε, that attempts to model
the ability of the driver to capture the local density conditions, can be modeled as a random
123
variable suitable to depict the stochastic distribution over the driver–vehicle microsystem.
In some cases, a stochastic process is added to model fluctuations. However, although the
problem becomes very difficult, a deep insight into the ability of the model to capture the
complexity of the interactions of living entities is not achieved. Therefore, a substantial im-
provement of the mathematical approach is necessary, as we shall see in the next chapter.
124
125
Chapter 4
4.1 Introduction
Systems of real world are generally constituted by a large number of interacting elements. An
example, which we have already examined in the preceding chapters, is a fluid of several inter-
acting particles. In principle, one may model this system by microscopic type models related to
the dynamics of each element in interaction with the others. However, Chapter 2 has shown that
modeling systems with a large number of interacting elements generates complexity problems,
due to an excessively large number of equations, which cannot be properly dealt with. This
approach leads to a large number of equations with a computational time which exponentially
increases with the number of equations, so that it may become too computationally complex
or even impossible to deal with.
The macroscopic description, we have seen in Chapter 3, reduces the above complexity by
dealing with quantities which are averaged, at each time, locally in space. The application of
this modeling approach is possible when the number of elements is very large in a way that,
given a small volume (in terms to be mathematically specified), it still contains a sufficiently
large number of elements.
Intuitively one can realize that this approach cannot always be applied. For instance, this is
the case of a diluted fluid, when the mean distance between particles is large with respect to
their dimension and may even become of the same order of the container of the fluid. Therefore
different methods need to be developed.
Methods of kinetic theory represent an alternative to the approach typical of continuum
mechanics. Kinetic theory looks for evolution equations for the statistical distribution over
the variable which describes the microscopic state of each element. Gross quantities (which
are those delivered by macroscopic models) are obtained by suitable moments of the above
statistical distribution.
This chapter gives a brief account to some classical models of the kinetic theory with special
attention to the Boltzmann and Vlasov equations, which model the evolution of large systems
126
suitable generalizations and developments of the above mathematical approach to model large
complex systems in different field of applied sciences. Among others, Prigogine and Hermann
(1971) proposed a mathematical theory of traffic flow by Boltzmann type equations. The same
approach has been applied to model the social behavior of colonies of insects by Jager and
Segel (1992), cell populations with special attention to the immune competition Bellouquid and
Delitala (2006), social systems Bertotti and Delitala (2004). A systematic presentation of the
mathematical kinetic theory for active particles is proposed with various applications in the
book Bellomo (2008). The class of models under consideration have been called, in the book
Arlotti et al. (2003), generalized kinetic (Boltzmann) models. Their common feature is
that the model, similarly to the classical kinetic theory, is an evolution equation for the one
particle distribution function over the microscopic state of the active particles interacting in a
large system.
A physical fluid is an assembly of disordered interacting particles free to move in all directions,
inside a space domain Ω ⊂ IR3 possibly equal to the whole space IR3 . Assuming that the position
of each particle is correctly identified by the coordinates of its center of mass:
xk , k = 1, . . . , N , (4.2.1)
the system may be reduced to a set of point masses in a fixed frame of reference. This is the
case, for instance, if the shape of the particles is spherically symmetric, and hence rotational
degrees of freedom can be ignored.
When the domain Ω is bounded, the particles interact with its walls Ωw . If Ω contains
obstacles, say sub-domains Ω∗ ⊂ Ω which restrict the free motion, then the particles interact
also with the walls of Ω∗ . In some instance, e.g. for flows around space ships, the obstacles are
objects scattered throughout the surrounding space.
It is generally believed in physics, and in statistical mechanics in particular, that understand-
ing the properties of a fluid follows from the detailed knowledge of the state of each of its
atoms or molecules. In most fluids of practical interest, such states evolve according to the
laws of classical mechanics which, for a system of N particles give the following set of ordinary
differential equations:
dx
k
= vk ,
dt
N
X (4.2.2)
dvk
m
k = F k = f k + f 0
k k ,
dt 0 k =1
where Fk is the force referred to the mass mk acting on each particle. Fk may be expressed as
the superposition of an external field fk and of the force fk0 k acting on the k-particle due to the
action of all other particles. In general these forces are regular functions in the phase space,
and fk0 k may be allowed to exhibit pole-discontinuities when the distance between particles is
zero.
This approach prescribes that Eqs.(4.2.2) can be solved, and that the macroscopic properties
of the fluid can be obtained as averages involving the microscopic information contained in
such solution. However, it is very hard, if even possible, to implement such a program, unless
suitable simplifications are introduced. Indeed, unavoidable inaccuracies in our knowledge of
the initial conditions, the large value of N (about 1020 for a gas in normal conditions), and
the mathematical complexity, result in the impossibility of retrieving and manipulating all the
microscopic information provided by (4.2.2–4.2.3) and contained in {xk , vk } for k = 1, . . . , N .
It is true that what is of interest is to extract from (4.2.2–4.2.3) the information sufficient to
compute the time and space evolution of a restricted number of macroscopic observable such
as the number density, mass velocity, temperature, and stress tensor. However, computing the
macroscopic observable quantities using the solutions to Eq. (4.2.2) is an almost impossible
task. This is not only because of the initial difficulty in dealing with a large system of ordinary
differential equations, but also because afterwards one has to compute the averages which
correctly define the macroscopic quantities. For instance, the mean mass density E(ρ) should
be obtained, for a system of identical particles, by examining the ratio:
∆n ,
E(ρ) = m (4.2.4)
∆x
when the volume ∆x tends to zero and the number of particles remains sufficiently large.
Fluctuations cannot be avoided. Additional difficulties are related to the computation of the
other macroscopic variables. Thus, constitutive relations are needed.
As we have seen, a different possible approach is that of fluid dynamics, see Truesdell and Ra-
jagopal (2000). It consists in deriving the evolution equations, related to the above macroscopic
observable quantities, under several strong assumptions, including the hypothesis of continuity
of matter (continuum assumption). This constitutes a good approximation of a real system
only if the mean distance between pairs of particles is small with respect to the characteristic
lengths of the system, e.g. the typical length of Ω or of Ω∗ . Conversely, if the intermolecular
distances are of the same order of such lengths, the continuum assumption is no longer valid,
and a discrepancy is expected between the description of continuum fluid dynamics and that
furnished by Eqs. (4.2.2).
Therefore, considering that it is neither possible to deal with the equations of continuum fluid
dynamics, nor with particle dynamics (4.2.3), a different model is needed. An alternative to
the above descriptions is provided by the Boltzmann equation, the fundamental mathematical
model of kinetic theory, which describes the evolution of a dilute monoatomic gas of a large
number of identical particles undergoing elastic binary collisions.
129
Following Arlotti et al. (2003), we limit to recall the main features of the model, and we
refer to the classical literature, among others Cercignani et al. (1994), for its derivation and its
fundamental properties.
Let us consider a dilute monoatomic gas of particles modeled as mass points identified by
unit mass, position x and velocity v. The Boltzmann equation refers to the time evolution of
the one-particle distribution function:
• Gas particles move freely in the space. The collisions have instant in time and local in space
character. During the collisions the conservation of momentum and energy holds according
to the laws of classical mechanics.
• Collisions involving more than two particles are negligible.
• Collisions between pairs of particles are uncorrelated (molecular chaos hypothesis).
It should be remarked that, according to the first hypothesis, variations of the distribution
function, inside dx, are neglected. Therefore, the distribution function of the field particles
which enter into the action domain of the test particles is actually approximated by the value
of f in the position of the field particle.
Let us here recall that the momentum and energy conservation relations for a collision process
(v, w) → (v0 , w0 ) of two particles of simple gas are:
(
v + w = v0 + w0 ,
(4.2.9)
|v|2 + |w|2 = |v0 |2 + |w0 |2 .
The celebrated Boltzmann equation, which defines the evolution of the distribution function
f = f (t, x, v) reads:
∂f
+ v · ∇x f = J[f, f ] , (4.2.10)
∂t
where the collision operator J is given by:
Z Z
J[f, f ](t, x, v) = B(n, w − v)
IR3 S2+
³ ´
× f (t, x, v0 )f (t, x, w0 ) − f (t, x, v)f (t, x, w) dn dw , (4.2.11)
where we denote by a dot the scalar product, n is the unit vector in the direction of the apse-line
bisecting the angle between v − w and w0 − v0 , and
is the domain of integration of the variable n. Let us notice that Eq.(4.2.12) are reversible to
respect pre and post collision velocities.
The form of the collision kernel B depends upon the particle interaction potential and we
assume that B satisfies the following condition:
Moreover, as usual in the subject, we will assume that B satisfies the Grad’s angular cut-off
potential condition, Grad (1963).
An important example of such a collision kernel B is the one corresponding to the hard sphere
potential:
(
n · v if n · v ≥ 0 ,
B(n, v) = (4.2.15)
0 otherwise.
The above picture on the variation of f , as a result of competition between free streaming
and balance of losses and gains in dx dv, requires the size of the volume element dx dv be large
enough that the number of particles contained in it justifies the use of statistical methods. On
the other hand, this number must be small enough that information contained in it should
have local character. Clearly, these two features are not compatible in general, hence problems
are expected in justifying the whole procedure. Hopefully, in the cases of practical interest,
the molecule size does fall in a range of values which are small when compared to those of
the volume elements dx which, in turn, can be considered as microscopic with respect to the
observations scale.
With reference to the specialized literature, see for example by Cercignani et al. (1994)
and Glassey (1996), we are now interested in reporting some fundamental properties of the
Boltzmann model. In details:
Z
J[f, f ] dv = 0 , (4.2.16)
IR3
Z
vJ[f, f ] dv = 0 , (4.2.17)
IR3
and
Z
|v|2 J[f, f ] dv = 0 , (4.2.18)
IR3
for each f such that the integrals make sense.
Moreover formal computations show that, if Ω is either IR3 or the 3-dimensional torus T 3 ,
any solution to the Boltzmann equation should satisfy the following conditions:
Z Z Z Z
f (t, x, v) dx dv = f (0, x, v) dx dv , (4.2.19)
IR3 Ω IR3 Ω
Z Z Z Z
v f (t, x, v) dx dv = v f (0, x, v) dx dv , (4.2.20)
IR3 Ω IR3 Ω
and
Z Z Z Z
2
|v| f (t, x, v) dx dv = |v|2 f (0, x, v) dx dv . (4.2.21)
IR3 Ω IR3 Ω
Let us now consider the equilibrium solutions of Eq. (4.2.10), i.e. the solutions of the equation:
J[f, f ] = 0 , (4.2.22)
132
we recall that the only solutions of Eq. (4.2.23) are the so-called local Maxwellian distributions
given by:
· ¸
ρ(t, x) |v − u(t, x)|2
M (t, x, v) = exp − , (4.2.23)
(2π T (t, x))3/2 2 T (t, x)
where ρ, T , and u are the macroscopic observable quantities.
An other interesting feature of the Boltzmann equation is described by the H functional:
Z Z
H[f ](t) = f (t, x, v) log f (t, x, v) dx dv , (4.2.24)
IR3 Ω
if the term f log f is integrable for all t > 0. Indeed, the H-Theorem asserts that the H
functional is formally decreasing along the solution:
Z Z
d
H[f ](t) = J[f, f ](t, x, v) log f (t, x, v) dx dv ≤ 0 , (4.2.25)
dt IR3 Ω
The Boltzmann equation is such that the distribution function is modified only by external
actions and short range interactions. On the other hand, various physical systems are such that
also long range interactions may be significant.
To be more specific, consider the vector action P = P(x, v, x∗ , v∗ ) on the subject with
microscopic state x, v due to the subject with microscopic state x∗ , v∗ . The resultant action is:
Z
F[f ](t, x, v) = P(x, v, x∗ , v∗ ) f (t, x∗ , v∗ ) dx∗ dv∗ , (4.2.26)
IR3 ×DΩ
where DΩ is the domain around the test particle where the action of the field particle is effec-
tively felt, in other words the action P decays with the distance between test and field particles
and is equal to zero on the boundary of DΩ . Based on the above assumptions, the mean field
equation writes:
∂f
+ (v · ∇x )f + F · ∇v f + ∇ · (F[f ] f ) = 0 , (4.2.27)
∂t
where F is the positional macroscopic force is acting on the system delivered by Eq. (4.2.26).
133
Classical models of the kinetic theory of gases have been presented in the preceding section.
Following the line of Chapter 2 and 3, we shall now provide an outline of some methods to derive
discrete velocity models also in view of the development of suitable computational schemes.
The discrete models of the Boltzmann equation can be obtained assuming that particles
allowed to move with a finite number of velocities. The model is an evolution equation for the
number densities Ni linked to the admissible velocities vi , for i ∈ L = {1, . . . , n}. The set
N = {Ni }ni=1 corresponds, for certain aspects, to the one-particle distribution function of the
continuous Boltzmann equation.
This model is called discrete Boltzmann equation. The mathematical theory of the dis-
crete kinetic theory was systematically developed in the Lecture Notes by Gatignol (1975),
which provides a detailed analysis of the relevant aspects of the discrete kinetic theory: mod-
eling, analysis of thermodynamic equilibrium and application to fluid-dynamic problems. The
contents mainly refer to a simple monoatomic gas and to the related thermodynamic aspects.
After such a fundamental contribution, several developments have been proposed in order to
deal with more general physical systems: gas mixtures, chemically reacting gases, particles un-
dergoing multiple collisions and so on. Analytic topics, such as the qualitative analysis of the
initial value and of the initial-boundary value problem, have been object of continuous interest
of applied mathematicians. The existing literature is reported in the review papers Platkowski
and Illner (1988) and Bellomo and Gustafsson (1991).
The formal expression of the evolution equation is as follows:
µ ¶
∂
+ vi · ∇x Ni = Ji [N ] , (4.3.1)
∂t
where
and where the collision scheme must be such that momentum and energy are preserved. These
terms are positive constants which, according to the indistinguishability property of the gas
particles and to the reversibility of the collisions, satisfy the following relations:
A = Ahk kh kh
ji = Aij = Aji . (4.3.5)
134
As for the Boltzmann equation, the qualitative analysis of discrete models needs the identifi-
cation of the space of collision invariants and of the Maxwellian state. Specifically, the following
definitions can be used:
Collision Invariant
A vector φ = {φi }i∈L ∈ IRm is defined collision invariant if:
where the inner product is defined in IRm and m is the cardinality of the set L.
Maxwellian
Let Ni > 0 for any i ∈ L, then the vector N is defined Maxwellian if J[N ] = 0.
Moreover, the following propositions characterize the space of the collision invariants and the
Maxwellian state:
Proposition 4.3.1. Let φ ∈ IRm , then the following conditions are equivalent
i) φ ∈ M, i.e. φ is a collision invariant;
ii) hφ , J[N ]i = 0.
Proposition 4.3.2. Let Ni > 0 for any i ∈ L, then the following three conditions are equivalent
i) N is a Maxwellian;
ii) {log Ni }i∈L ∈ M;
iii) J[N ] = 0.
X
H= ci Ni log Ni . (4.3.7)
i∈L
Then, the evolution equation for the H–Boltzmann equation can be derived by multiplying the
discrete Boltzmann equation by 1 + log Ni and taking the sum over i ∈ L. It can be technically
verified that the time derivative of above functional is nonpositive and that the equality holds
if and only if N is a Maxwellian.
Applied mathematicians have attempted in the last decade to design models with arbitrarily
large number of velocities and hence to analyze convergence of discretized models toward the
full Boltzmann equation. However, various technical difficulties have to be tackled as well
documented in the paper Görsch (2002), such as:
135
i) The discretization schemes for each couples of incoming velocities do not assure a pair of
outgoing velocities such that conservation of mass and momentum is preserved;
ii) Specific models may have a number of spurious collision invariants in addition to the classical
ones corresponding to conservation of mass, linear momentum and energy;
iii) The convergence of the solutions of discretized equation to those of the full Boltzmann equa-
tion, when the number of discretization points tends to infinity, under suitable hypotheses
which have to be properly defined.
A technical difficulty in dealing with the above convergence proof consists in obtaining an ex-
istence theorem for the Boltzmann equation in a function space which can be properly exploited
for the development of the computational scheme.
An alternative consists in discretizing the velocity space by a suitable set of velocity modules,
but letting the velocity free to assume all directions in the space. The formal expression of the
evolution model is as follows:
µ ¶
∂
+ vi · ∇x Ni (t, x, Ω) = Ji [f ](t, x, Ω) , (4.3.8)
∂t
where
and where Ω is the unit vector which identifies the direction of the velocity, vi = vi Ω.
Example 4.3.1
Six Velocity Discrete Boltzmann Equation
Consider a one-component discrete velocity gas such that the particles can attain 6 velocities
in the xy–plane. The velocity discretization is defined by:
· ¸
i−1 i−1
vi = c cos( π) i + sin( π) j i = 1, . . . , 6 , (4.3.10)
3 3
that are the only nontrivial (generate a flux in the velocity space).
Detailed calculations provide the expression of the collisions operators. In particular the
binary collision operator is simply obtained by joining the transition rates Ahk ij to the corre-
sponding transition probability densities ahk
ij by A hk
ij = S|v i − v |a
j ij
hk
. The mathematical model
follows from simple calculations.
136
Mathematical problems related to the Boltzmann equation can be classified, as usual, into
initial, initial-boundary and boundary value problems. However the statement of the problems
is somewhat different from those models of continuous mechanics.
Consider first the initial value problem for the Boltzmann equation in absence of an external
force field, in the whole space IR3 , and with given initial conditions:
and
The statement of the initial-boundary value problem requires the modeling of gas surface
interaction phenomena. In particular, two specific problems, among several ones, can be stated:
• The interior domain problem which corresponds to a gas contained in a volume bounded
by a solid surface;
• The exterior domain problem, which corresponds to a gas in the whole space IR3 which
contains an obstacle.
The surface of the solid wall is defined in both cases by Ωw , the normal to the surface directed
toward the gas is ν. Moreover, in order to define the boundary conditions on a solid wall, we
need to define the partial incoming and outgoing traces f + and f − on the boundary Ωw , which,
for continuous f , can be defined as follows:
(
f + (x, v) =f (x, v) , x ∈ Ωw , v · ν(x) > 0
(4.4.5)
f + (x, v) =0 , x ∈ Ωw , v · ν(x) < 0 ,
and
(
f − (x, v) =f (x, v) , x ∈ Ωw , v · ν(x) < 0
(4.4.6)
f − (x, v) =0 , x ∈ Ωw , v · ν(x) > 0 .
Then the boundary condition may be formally defined as follows:
where the operator R, which maps the distribution function of the particles which collide with
the surface into the one of particles leaving the surface is characterized, for a broad range of
physical problems, by the following properties:
f− ≥ 0 ⇒ R f− ≥ 0 . (4.4.8)
2. R preserves mass, i.e. the flux of the incoming particles equals the one of the particles which
leaves the surface.
+ −
3. R preserves local equilibrium at the boundary: ωw = R ωw , where ωw is the Maxwellian
distribution at the wall temperature.
The formulation of the initial-boundary value problem, in the case of the interior domain
problem, consists in linking the evolution equation (4.2.26) to the initial condition (4.4.1)
and to the boundary condition (4.4.7) on the wall Ωw . In the case of the exterior domain
problems, in addition to the boundary conditions on the wall, suitable Maxwellian equilibrium
conditions are assumed at infinity. If one deals with the boundary value problem, the statement
of the boundary conditions must be linked to the steady Boltzmann equation, and the
definitions of solutions are analogous to the ones we have seen for the initial value problem.
The mathematical statements we have just seen refer to the Boltzmann equation in absence of
an external field, and hence the trajectories of the particles are straight lines. When an external
field is acting on the particles, trajectories may be explicitly determined only by solving the
equations of Newtonian dynamics.
Several books, for instance Glassey (1996), are available, for the interested reader, on the
qualitative analysis of mathematical problems related to the application of the Boltzmann
equation. The above cited book provides a detailed survey of existence theory for the Boltzmann
and Vlasov equations.
This section presents, referring to the existing literature, a unified approach to model living
systems by suitable development of the mathematical kinetic theory. The contents are presented
starting from the identification of their main complexity features. Subsequently, a system
approach is proposed by subdividing the overall system into functional subsystems and by
138
representing the state of them by a suitable probability distribution over the microscopic state
of the entities composing each subsystem. Finally a mathematical framework is derived which
may act as background paradigm for the derivation of specific models.
Therefore, this section offers the conceptual background for the derivation of models. The
authors have attempted to unify various approaches that have been offered in the literature.
Moreover, it will be discussed later how far this approach departs from the classical kinetic
theory.
of neighbors can play a prominent role in the development of the strategy and interactions.
Namely, interactions do not involve all entities that are within an interaction domain.
5. Stochastic games: Interactions modify their state according to the strategy they develop.
Living entities play a game at each interaction with an output that is technically related
to their strategy often related to surviving and adaptation ability. This dynamics is also related
to the fact that living systems receive a feedback from their environments, which enables them
to learn from their experiences.
6. Large deviations: Interactions involving entities and those with the outer environment
can lead to large effects, which in turn lead to even larger effects due to a natural trend far
from equilibrium. Deviations can be also related to bifurcation phenomena.
7. Large number of components: Complexity in living systems is induced by a large
number of variables, which are needed to describe their overall state. Therefore, the number
of equations needed for the modeling approach may be too large to be practically treated.
For instance, biological systems are different from the physical systems analyzed by statistical
mechanics, which typically deals with systems containing many copies of a few interacting
components, whereas cells contain from millions to a few copies of each of thousands of different
components, each with specific interactions.
8. Learning ability: Living systems have the ability to learn from past experience. Therefore
their strategic ability and the characteristics of interactions among living entities evolve in time.
In some case some of the system attempts to escape from the other (hiding), while the other
attempts to approach (learning).
9. Multiscale aspects: The study of complex living systems always needs a multiscale
approach. For instance, the dynamics of a cell at the molecular (genetic) level determines the
cellular behaviors. As a matter of fact, the structure of macroscopic tissues depends on such a
dynamics.
10. Darwinian selection: All living systems are evolutionary. For instance birth processes
can generate individuals more fitted to the environment, who in turn generate new individuals
again more fitted to the outer environment. Neglecting this aspect means that the time scale
of observation and modeling of the system itself is not long enough to observe evolutionary
events. Such a time scale can be very short for cellular systems and very long for vertebrates.
Therefore time is a key variable.
Of course, the interested reader can argue that some of the above features can be replaced
by different ones. This is certainly true, however we leave this type of arguing to the per-
sonal initiative to go back to the intrinsic difficulty of the problem under consideration and,
following Bellomo and Carbonaro (2011), we pose three key questions to applied mathemati-
cians, that can possibly contribute to define methodological guidelines to modeling in view of
a mathematical theory.
1. Can mathematics contribute to reduce the complexity of the overall system by splitting it
into suitable subsystems?
140
2. Can mathematics offer suitable tools to describe the five features proposed above as common
to all complex biological systems?
3. How far is the state-of-the-art from the development of a biological-mathematical theory
of living systems and how can an appropriate understanding of multiscale issues contribute to
this ambitious objective?
The modeling approach to complex systems should take into account the afore-said common
complexity features and the three key questions. Then, it can be developed, according to the
authors’ bias, through four steps as follows:
Step 1: Derivation of a mathematical structure suitable to describe the evolution of a complex
system in a way that the structure retains the common complexity features. It needs to be
regarded as a potential ability to be characterized for each specific case study.
Step 2: The second step specifically refers to a well defined real system. It consists in selecting
the specific complexity characteristics of the system under consideration and in adjusting the
afore-said structure to the specific case.
Step 3: The third step consists in characterizing the variables and the parameters that char-
acterize the model referring to the real system that is object of the modeling approach. This
basically means modeling the interaction at the individual (microscopic scale).
Step 4: Validation of the model by comparison of the dynamics predicted by it with empirical
data. More precisely, the model should show quantitative agreement with data that are quan-
titatively known and qualitative agreement with emerging behaviors that are observed in the
real behavior.
Bearing in mind the problem of validation, some models (too many) artificially include em-
pirical data. For instance trend to steady state conditions. This is not correct, prediction of the
model should be a consequence of interactions at the microscopic scale. In general validation
should be related to the ability of models to reproduce empirical data. However, quantitative
empirical data are available only for steady state (may be equilibrium) conditions, while emerg-
ing behaviors are observed in dynamical conditions. These behaviors are observed and repeated
only at a qualitative level as they are very sensitive to environmental conditions.
The validation of models of complex systems needs to be related to their ability to depict
empirical data. In some cases, models may possibly describe emerging behaviors that have not
yet been observed. In this case, specific experiments can be organized to verify these predictions.
4.5.2 Representation
Some preliminary reasonings on the modeling approach should be focused on the selection of
the appropriate representation scale and on the related variable that are supposed to describe,
in the mathematical model, the state of the system under consideration. This selection should
however face the problem of reducing complexity. Some considerations, related to the key
complexity features, are useful to the identify the strategy of the modeling approach.
141
1. The modeling of systems of the inert matter is developed within the framework of deter-
ministic causality principles. Namely for a given cause, the effect is deterministically identified.
This causality principle needs to be weakened, or even suppressed, in the case of the
living matter.
2. The description of the physical state of living systems needs the introduction of ad-
ditional variables suitable to describe the strategy that each element of the system develops.
Therefore, the use of models at the macroscopic scale ends up with killing the presence of such
variable, which is, as previously mentioned, heterogeneously distributed.
3. If system is constituted by a large number of interacting living entities, one may call them
active particles, and their microscopic state should include, in addition to geometrical and
mechanical variables, also an additional variable, called activity. This variable should be
appropriate to model the ability of each entity to express a specific strategy.
4. The ability to express a strategy is heterogeneously distributed over the active particles.
Therefore, a modeling approach can be based on the description of the overall state of the
system by a probability distribution function over such microscopic state.
5. Methods of the mathematical kinetic theory can be developed to model the evolution
in time and space of the distribution function. For instance using a suitable balance in each
elementary volume of the space of the microscopic states, where the inflow and outflow of
particles in the said volume is determined by interactions among particles.
6. Living systems, particularly in biology (e.g. multicellular systems), are constituted differ-
ent types of active particles, so that several different functions are expressed. This is a relevant
source of complexity. Therefore, it is useful reducing it by decomposing the system into
suitable functional subsystems according to well defined rules that will be stated in the
sequel.
7. A functional subsystem is a collection of active particles, which have the ability to express
collectively the same activity regarded as a scalar variable. The whole system is constituted
by several interacting functional subsystems. The link between a functional subsystem and its
activity depends also on the specific phenomena under consideration.
8. The decomposition into functional subsystems is a flexible approach to be adapted
at each particular investigation. Specifically, the identification of each functional sub-system is
related to the activity they express as well as to the specificity of the investigation that is devel-
oped. Namely, given the same system, different decompositions can be developed corresponding
to different specific investigation.
9. Active particles of the same functional subsystem may, however, differ for specific charac-
teristics. For instance in a multicellular system, cells with a different genotype may however
collectively express, with other cell, the same function. Therefore, different decomposition
may correspond, according to the afore-said statements, to different composition
of each functional subsystem.
142
10. Considering that the subsystems composing a system may be linked by networks, the
modeling approach needs treating both interactions within the same functional subsystem in a
node of the network and interactions involving functional subsystems of different nodes. In some
cases nodes of the network may identify different sub systems. Dealing also with interactions
among subsystems may need, in some cases, use of different scales.
Bearing all above in mind, let us consider a large system is constituted by interacting entities,
called active particles organized into n interacting populations labeled by the indexes i =
1, . . . , n. Each population corresponds to a functional subsystem, namely active particles in
each functional subsystem develop collectively a common strategy.
The variable charged to describe the state of each particles is called microscopic state,
which is denoted by the variable w that includes the geometrical and mechanical description
as well as of the activity variable. In the simplest case: w = {x , v , u}, where x ∈ Dx is
position, v ∈ Dv is mechanical state, e.g. linear velocity, and u ∈ Du is the activity.
The description of the overall state of the system is defined by the generalized one-particle
distribution function:
fi = fi (t, x, v, u) , i = 1, . . . , n,
such that fi (t, x, v, u) dx dv du denotes the number of active particles whose state, at time t, is
in the interval [w, w + dw]. where w = x, v, u is an element of the space of the microscopic
states.
Remark 4.5.1. If the number of active particles is constant in time, then the distribution
function can be normalized with respect to such a number and consequently is a probability
density.
If fi is known, moment calculations lead to the computing of macroscopic quantities. For
instance, marginal densities are computed as follows:
Z
fim (t, x, v) = fi (t, x, v, u) du , (4.5.1)
Du
Z
fia (t, x, u) = fi (t, x, v, u) dv (4.5.2) ,
Dv
and
Z
fib (t, u) = fi (t, x, v, u) dx dv . (4.5.3)
Dx ×Dv
while integration over the volume Dx containing the particles gives the total size of the ith
subsystem:
Z
Ni [fi ](t) = ν[fi ](t, x) dx , (4.5.5)
Dx
143
which can depend on time due to the role of proliferative or destructive interactions, as well as
to the flux of particles through the boundaries of the volume.
First order moments provide either linear mechanical macroscopic quantities, or lin-
ear activity macroscopic quantities. More precisely, the flux of particles, at the time t in
the position x, is given by:
Z Z
Q[fi ](t, x) = v fi (t, x, v, u) dv du = fim (t, x, v) dv . (4.5.6)
Dv ×Du Dv
The mass velocity of particles, at the time t in the position x, is given by:
Z
1
U[fi ](t, x) = v fi (t, x, v, u) dv du . (4.5.7)
νi [fi ](t, x) Dv ×Du
Second order moments yield quadratic quantities such as the local quadratic activity:
Z
e[fi ](t, x) = u2 fi (t, x, v, u) dv du , (4.5.10)
Dv ×Du
where the analogy between quadratic quantities and energy needs to be specified with reference
the specific real system under consideration.
Remark 4.5.2. Particular systems in life sciences are such that the microscopic state is
identified only by the activity variable. Namely, space and velocity variables are not
significant to describe the microscopic state of the active particles. In this case, the overall
state of the system is described by the distribution function fi (t, u) over the activity variable
only.
144
Remark 4.5.3. The microscopic state may identified only by the space and activity
variables. Namely, the velocity variable is not significant to describe the microscopic state,
while it is important to know the localization of the active particles. Therefore, the represen-
tation is given by fi (t, x, u) and the definition of vanishing variable is used.
Remark 4.5.4. The space variable can be used, in some cases to identify the regionalization
of active particles, where the activity variable is the same in each region, but the communication
rules, namely the dynamics at the microscopic scale, differ from region to region. In this case
the localization identifies different functional subsystems.
It is useful in view of applications considering also the case of discrete variables such as
discrete velocities defined by the set:
Iv = {v1 , . . . , vk , . . . , vK } , k = 1, . . . , K .
K
X
fi (t, x, v, u) = fik (t, x) δ(v − vk ) , fik (t, x, u) = fi (t, x, u; vk ) . (4.5.12)
k=1
Consider now the case of discrete velocity, activity and space variable where the dis-
cretization of the activity and space is delivered by the sets:
Iu = {u1 , . . . , xs , . . . , uS } , s = 1, . . . , S ,
and
Ix = {x1 , . . . , xr , . . . , xR } , r = 1, . . . , R .
S X
X R X
K
fi (t, x, v, u) = fisrk (t) δ(u − us ) δ(x − xr ) δ(v − vk ) , (4.5.13)
s=1 r=1 k=1
K Z
X S X
X K
ni (t, x) = fik (t, x, u) du , or ni (t, x) = fisk (t, x) , (4.5.14)
k=1 Du s=1 k=1
and so on.
Let us know apply the afore-said concepts, with tutorial aims, to the following examples:
145
Example 4.5.1
Vehicular Traffic
The representation, in view of the modeling approach, of vehicular traffic can be developed
according to the following guidelines:
• Micro-state: The state of the micro-system is defined by position, velocity and activity;
• Functional subsystems: The simplest case consists in assuming one only functional subsys-
tem. Otherwise, the decomposition can distinguish: Trucks, slow cars and fast cars. Multilane
flows can be treated by inserting the lane in the micro-variable of as a functional subsystem.
Introducing various functional subsystems reduces the need of an heterogeneous activity vari-
able to model the activity of each functional subsystem.
Example 4.5.2
Immune Competition
The representation of phenomena of the immune competition can take advantage of the following
guidelines:
• Micro-state: The state of the micro-system is defined by the function (activity) expressed
by cells of each functional subsystem;
• Functional subsystems: The simplest case consists in assuming two functional subsystems.
One for cells with a pathologic state, one for the immune cells.
The modeling approach should include the onset of new functional subsystems related to
Darwinian mutations.
Example 4.5.3
Social Dynamics
Social systems can be represented according to:
• Micro-state: The state of the micro-system is defined by the function (activity) expressed
by individuals of each functional subsystem. It depends very much on the type of system under
consideration.
• Functional subsystems: The simplest case consists in assuming one isolated functional
subsystems. A general case consists in modeling several functional subsystems interacting in a
network.
The localization can be used, for systems in a network, to identify different functional subsys-
tems. The modeling approach should include the onset of new functional subsystems related to
146
aggregation and/or fragmentation events. New subsystems can also be generated by migration
phenomena.
f = f (t, x, v) δ(u − u0 ).
The strategy to derive equations follows the guidelines of the classical kinetic theory. Namely,
by a balance equation for net flow of particles in the elementary volume of the space of the
microscopic state by transport and interactions. The following active particles are involved in
the interactions:
• Test particles with microscopic state, at the time t, defined by the variable (x, v, u), whose
distribution function is f = f (t, x, v, u).
• Field particles with microscopic state, at the time t, defined by the variable (x∗ , v∗ , u∗ ),
whose distribution function is f ∗ = f (t, x∗ , v∗ , u∗ ).
• Candidate particles with microscopic state, at the time t, defined by the variable (x∗ , v∗ , u∗ ),
whose distribution function is f∗ = f (t, x∗ , v∗ , u∗ ).
The rules of interactions are as follows: The candidate particles with microscopic
state defined, at the time t, by (x∗ , v∗ , u∗ ), interacts with field particles with microscopic
state, at the same time t, defined by (x∗ , v∗ , u∗ ) and acquires, in probability the state of the
147
test particles with microscopic state (x, v, u). Test particles interacts with field particles and
may lose their state.
The following interactions can be taken into account:
• Conservative interactions modify the microscopic state of particles;
• Non conservative interactions generate proliferation or destruction of particles in their
microscopic state.
• Onset of new functional subsystems: both type of (above) interactions may generate a
particle in a new functional subsystem.
The mathematical framework refers to the evolution in time and space of the test particle.
The derivation related to the distribution functions fi is based on the following balance of
equations in the elementary volume of the phase space:
µ ¶
dfi
dx dv = Gi [f ] − Li [f ] + Si [f ] dx dv , (4.5.15)
dt
where interactions of candidate and test particles refers to the field particles and f = {fi }ni=1 .
Moreover, for the ith functional subsystem:
Gi [f ] denotes the gain of candidate particles into the state x v, u of the test particle;
Li [f ] models the loss of test particles;
Si [f ] models proliferation/destruction of test particles in their microscopic state.
Let us consider the interactions between candidate or test particles of the hth functional
subsystem and the field particles of the k th functional subsystem.
H.4.1. The candidate and test particles in x, with state v∗ , u∗ and v, u, respectively, interact
with the field particles in x∗ , with state v∗ , u∗ located in its interaction domain Ω, x∗ ∈ Ω.
H.4.2. Interactions are weighted by a suitable term ηhk (v∗ , v∗ , u∗ , u∗ ), that can be interpreted
as an interaction rate, which depends on the local density in the position of the field particles.
H.4.3. The distance and topological distribution of the intensity of the interactions is weighted
by a function phk (x, x∗ ) such that:
Z
phk (x, x∗ ) dx∗ = 1 .
Ω
H.4.4. The candidate particle modifies its state according to the probability density A defined
as follows:
where Ahk denotes the probability density that a candidate particles of the h-subsystems with
state v∗ , u∗ reaches the state v, u after an interaction with the field particles k-subsystems with
state v∗ , u∗ , while the test particle looses its state v and u after interactions with field particles
with velocity v∗ and activity u∗ .
148
H.4.5. The test particle, in x, can proliferate, due to encounters with field particles in x∗ ,
with rate µihk (x, x∗ ), which denotes the proliferation rate into the functional subsystem i, due
the encounter of particles belonging the functional subsystems h and k. Destructive events can
occur only within the same functional subsystem with rate µiik (x, x∗ ).
Conjecture 4.1. Interactions modify the activity variable according to topological stochastic
games, however independently on the distribution of the velocity variable, while modification of
the velocity of the interacting particles depends also on the activity variable.
and
Z
Bhk (u∗ → u, |u∗ , u∗ ) du = 1 , ∀ u∗ , u∗ ,
Du
Z
Chk (v∗ → v |v∗ , v∗ , u∗ , u∗ ) dv = 1 , ∀ v∗ , v∗ u∗ , u∗ .
Dv
Conjecture 4.2. The encounter rate, namely, the frequency of interactions depends on
velocity and activity of the interacting pairs: ηij (v∗ , v∗ , u∗ , u∗ ), ηhk (v, v∗ , u, u∗ ).
Conjecture 4.3. The weight function, which models the intensity of interactions depends on
the localization of the interacting pairs: pij (x, x∗ ), phk (x, x∗ ).
Using the afore-said conjectures yields:
µ ¶ ·X
n µ ¶ n X
X n ¸
∂ i
+ vi · ∇ x fi (t, x, v, u) = Gij [f ] − Lij [f ] + Shk [f ] (t, x, v, u) , (4.5.16)
∂t j=1 h=1 k=1
where
Z
Gij = ηij (v∗ , v∗ , u∗ , u∗ )pij (x, x∗ ) Bij (u∗ → u|u∗ , u∗ ) Cij (v∗ → v |v∗ , v∗ , u∗ , u∗ )
2 ×D 2
Ω×Dv u
Z
Lij = fi (t, x, v) ηij (v, v∗ , u, u∗ )pij (x, x∗ ) fj (t, x∗ , v∗ , u∗ ) dv∗ du∗ dx∗ , (4.5.18)
Ω×Dv ×Du
and
Z
i
Shk = ηhk (v, v∗ , u, u∗ )phk (x, x∗ ) µihk (u, u∗ )
2
Ω×Dv ×Du
Let us now consider the case of open systems. Modeling external actions at the mi-
croscopic scale means modeling the action of functional subsystems generated by the outer
system delivered by the distribution functions:
gr (t, x, w) , r = 1, . . . , m , w ∈ Dw = Du , (4.5.20)
which depends on time, space and on a variable w modeling the activity of the outer functional
subsystem.
Reasonings analogous to those we have seen above lead to the derivation of the mathemati-
cal structures. The relatively simpler spatially homogeneous case correspond to the following
system:
∂fi ∂fi
(t,u) + Fi (t) (t, u) = Ji [f ](t, u) + Qi [f ](t, u)
∂t ∂u
Xn Z
= ηij (u∗ , u∗ )Bij (u∗ → u|u∗ , u∗ )fi (t, u∗ )fj (t, u∗ ) du∗ du∗
j=1 Du ×Du
n Z
X
− fi (t, u) ηij (u, u∗ )fj (t, u∗ ) du∗
j=1 Du
n Z
n X
X
+ ηhk (u∗ , u∗ )µihk (u, u∗ )fh (t, u∗ )fk (t, u∗ ) du∗ du∗
h=1 k=1 Du ×Du
m Z
X
e
+ ηij (u∗ , w∗ )Bij
e
(u∗ → u|u∗ , v ∗ )fi (t, u∗ )gj (t, w∗ )du∗ dw∗
j=1 Du ×Dv
m Z
X
e
− fi (t, u) ηij (u, w∗ )gj (t, w∗ ) dv ∗
j=1 Dv
m Z
r X
X Z
e
+ ηhk (u∗ , w∗ )µehk (i)(u∗ , w∗ ; u)fh (t, u∗ )gk (t, w∗ )du∗ dw∗ . (4.5.21)
h=1 k=1 Du Du
where:
e
ηhk models the encounter rates between the k th external action with state v ∗ and the hth
e
candidate particle with state u∗ , and similarly for ηij .
e
Bij (u∗ → u|u∗ , v ∗ ) denotes the probability density that the candidate particle of the ith func-
tional subsystem with state u∗ , falls into the state u of the same functional subsystem due to
interactions with the j th action with state v ∗ .
µehk (i)(u∗ , v ∗ ; u) models the net proliferation into the ith population, due to interactions, which
occur with rate ηhk , of the candidate particle of the population hth with state u∗ with the k th
action with state v ∗ .
More complex, but technically analogous calculations can be developed when the space vari-
able is included in the microscopic state.
150
It is well understood that interactions involving living entities may be nonlinearly additive.
A brief introduction is given in this subsection. Some sources of nonlinearity are the following:
• Encounter rate depending on the distribution function due to the hiding and learning
dynamics;
• Table of games conditioned by the distribution function;
• Nonlinearity induced by topological distribution of the interacting entities.
The modeling of the learning dynamics refers to the seminal paper by Cucker and Smale
(2002). This paper presents a deep and exhaustive understanding of the approximation of
sparse data, viewed as learning, i.e. the information to be learned.
Here some preliminary ideas are proposed, referring to Bellomo (2010), on the modeling
approach to hiding and learning dynamics of two interacting functional subsystems, where
the first one escapes, while the second one hunts. The mathematical structure, which is a
particular case of (4.5.21), can be used:
X2 Z
∂fi
(t, u) = ηij (u∗ , u∗ ) Bij (u∗ → u|u∗ , u∗ ) fi (t, u∗ ) fj (t, u∗ ) du∗ du∗
∂t j=1 D u ×Du
2 Z
X
− fi (t, u) ηij (u, u∗ ) [1 − µij (u, u∗ )] fj (t, u∗ ) du∗ . (4.6.1)
j=1 Du
where: ηij and µij (u, u∗ ) the encounter rate and the proliferation rate related to the candidate
active particle, with state u∗ , of the hth functional subsystem and the field active particle,
with state u∗ , of the j th functional subsystem. Moreover, Bij is the probability density that a
candidate particle, with state u∗ , of the ith functional subsystem ends up into the state u of
the test particle of the same functional subsystem after the interaction with the field particle,
with state u∗ , of the j th functional subsystem.
The mathematical structure can be specialized into a mathematical model when the inter-
action terms ηij and Bij are properly modeled according to a dynamics of hiding and learning
actions. Specifically, let us consider the encounter rate in the following general form
2 ∗
0
ηij = ηij e−c βij (t|f ,u∗ ,u ) , (4.6.2)
2
where c is a positive constant and βij (t|f ) models the distance between the two interacting
particles. In details, the following cases can be considered:
I – βij is equal to zero: This assumption corresponds to simplest approach that consists
in assuming that the encounter rate is identified only by the interacting pairs, so that it is a
0
constant ηij for each pair of interacting functional subsystems.
151
II – βij is the distance between the microscopic states of the two interacting pairs:
This amounts to suppose that βij is given by the difference between the microscopic state of
the interacting particles, and it can be related to the interaction rate as follows:
p ∗ 2
βij (u∗ , u∗ ) = (u∗ − u∗ )2 , 0
ηij = ηij e−c (u∗ −u )
. (4.6.3)
III – βij is given by the distance between the shapes of the distribution of the
two interacting subsystems: This corresponds to suppose that it is given by the difference
between the shapes of the interacting particles. More precisely distance βij between the ith and
the j th functional subsystem can be defined as follows:
Z
2
βij (t|f ) = (fi − fj )2 (t, u) du, (4.6.4)
Du
hence
R
0 2
−cβij (t|f ) 0 −c (fi −fj )2 (t,u) du
ηij = ηij e = ηij e Du . (4.6.5)
More in general, the distance βij can be given by the norm of the difference between fi and
fj in the appropriate metric space.
The dynamics is given by the following equation:
X2 Z
∂fi 2
0 −c βij (t|f )
(t, u) = ηij e Bij (u∗ → u|u∗ , u∗ ) fi (t, u∗ ) fj (t, u∗ ) du∗ du∗
∂t j=1 Du ×Du
2 Z
X 2
0 −c βij (t|f )
− fi (t, u) ηij e [1 − µij (u, u∗ )] fj (t, u∗ ) du∗ . (4.6.6)
j=1 Du
This structure can be generalized to include the generation of new functional subsys-
tems as follows:
XXp p Z
∂fi 0 2
(t, u) = ηhk e−c βhk (t|f ) Bhk
i
(u∗ → u|u∗ , u∗ ) fh (t, u∗ ) fk (t, u∗ ) du∗ du∗
∂t Du ×Du
h=1 k=1
p Z
X 2
0 −c βik (t|f )
− fi (t, u) ηik e fk (t, u∗ ) du∗
k=1 Du
p Z
p X
X 2
0
+ ηhk e−c βhk (t|f ) µihk (u∗ , u∗ )fh (t, u∗ ) fk (t, u∗ ) du∗ du∗ , (4.6.7)
h=1 k=1 Du ×Du
i
where Bhk is the probability density that a candidate particle of the hth functional subsystem,
with state u∗ ends up into the state u of the ith functional subsystem after the interaction
with the field particle of the k th functional subsystem, with state u∗ . Moreover, µihk is the
proliferative/destructive rate in the ith f.s. of the test particle, due to the encounter between
the hth candidate particle, with state u∗ , of the k th particle (field), with state u∗ .
152
Example 4.6.1
Modeling Hiding Learning Dynamics
A simple approach of modeling the terms Bij , for two functional subsystems, can be obtained
by supposing that the output of the interaction is defined by the most probable value mij and
the variance σij . In general it can be assumed that encounters within the same functional
subsystem do not modify the state of the interacting pair:
• The candidate particle with activity u∗ does not shows any modification of its state related
when it encounters a field particle of the same subsystem: m11 = m22 = u∗ .
• The candidate particle of the first subsystem, with activity u∗ , shows a trend to increase
the distance when it encounters the candidate particle of the second subsystem:
(
m12 =u∗ − ε1 (u∗ − u∗ ), if u∗ > u∗
• The candidate particle, with activity u∗ , of the second subsystem shows a trend to reduce
the distance with respect to the state of the candidate particle of the first subsystem:
(
m21 =u∗ − ε2 (u∗ − u∗ ), if u∗ < u∗
Let us now consider the case of nonlinear interactions where the term Bij , depend on the
distribution function of the interaction particles, for instance by the moments of such distribu-
tion. The evolution equation for functional subsystems undergoing conservative interactions is
as follows:
X n
∂fi
(t, u) = Ji [f ](t, u) = Jij [fi , fj ](t, u)
∂t j=1
n Z
X
= ηij (t|fi , fj ) Bij (u∗ → u|u∗ , u∗ , E p [fi ], E p [fj ]) fi (t, u∗ ) fj (t, u∗ ) du∗ du∗
j=1 Du ×Du
n Z
X
− fi (t, u) ηij (t|fi , fj ) fj (t, u∗ ) du∗ , (4.6.8)
j=1 Du
More precisely, the idea is that the candidate particle interacts with the field particles within
its space interaction domain and feels an action identified by the low order moments of the field
active particles. The action can be also related, whether consistent with the specific system
under consideration, to their most probable value, see Bellomo et al. (2010b). This paper also
shows that the interaction domain of the candidate particle with state u∗ may not be the whole
153
domain Du but a subset Ωu∗ ⊆ Du , which contains the field particles u∗ ∈ Ωu∗ that are able
to interact with candidate particle. Thus interactions only occur if the distance, in the space
of microscopic states of the interacting particles, are sufficiently small. Therefore a positive
function ω(u∗ , u∗ ), normalized with respect to integration over u∗ , is introduced to take into
account such dynamics. This function, which weight the interactions among the active particles,
is assumed to have a compact support in the domain of influence Ωu∗ ⊆ Du of the interactions.
Moreover:
Z Z
∗ ∗
ω(u∗ , u ) du = ω(u∗ , u∗ ) du∗ = 1. (4.6.9)
Du Ωu∗
n
X
∂fi
(t, u) = Qi [f ](t, u) = Qhk [fh , fk ](t, u)
∂t
h,k=1
n Z
n X
X 2
0 −cβhk (t|fh ,fk ) i
= ηhk e Bhk (u∗ → u|u∗ , u∗ , Ewp [fh ], Ewp [fk ])fh (t, u∗ )fk (t, u∗ )du∗ du∗
h=1 k=1 Du ×Du
n Z
X 2
0 −c βik (t|fi ,fk )
− fi (t, u) ηik e fk (t, u∗ ) du∗
k=1 Du
n Z
n X
X 2
0
− ηhk e−c βhk (t|fh ,fk ) µihk (u∗ , u∗ )fh (t, u∗ ) fk (t, u∗ ) du∗ du∗ . (4.6.11)
h=1 k=1 Du ×Du
where the number of particles is not any more constant in time due to the term that models
proliferative and/or destructive events. In details:
• µihk models the proliferative/destructive rate of particles of the hth functional subsystem,
with state u∗ , into the state u of the ith functional subsystem due to the encounter with the
particle (field) of the k th functional subsystem, with state u∗ . In particular, destructive events
occur only within the functional subsystem of the field particles.
i
• Bhk models the probability density that a candidate particle of the hth functional subsystem,
and with state u∗ , ends up into the state u of the ith functional subsystem after the interaction
with the field particle, with state u∗ , of the k th functional subsystem.
The application of the mathematical approach we have seen in the preceding and in this
sections can be practically applied to model vehicular traffic dynamics, namely a class of
complex systems already treated various times in these Lectures Notes. Three aspects will
be considered, and treated in the forthcoming examples; namely, the derivation of the math-
ematical structure suitable to capture the complexity of the system under consideration, the
154
derivation of the table of games modeling interactions at the microscopic scale and the val-
idation by using empirical data. The book by Kerner (2004) offers a valuable collection of
experimental results followed by a sharp interpretation based on the physics of traffic, while
the review papers by Helbing (2001) and Bellomo and Dogbè (2011) provide the mathematical
background for the kinetic theory approach to modeling vehicular traffic phenomena.
Let us recall, repeating some concepts of the preceding chapters, that the modeling should
be developed by using dimensionless quantities. Referring specifically to the maximum density
nM of vehicles corresponding to bumper-to-bumper traffic jam, and to the maximum admissible
mean velocity VM which can be reached, in average, by vehicles running in free flow conditions,
while a fast isolated vehicle can reach velocities larger that VM . More precisely, a limit velocity
can be defined as follows: V` = (1+µ)VM , with µ > 0, such that no vehicle can reach, simply by
mechanical reasons, a velocity larger than V` . Moreover, it is convenient, to identify the critical
time Tc as the ratio between ` and VM . Moreover, t is the dimensionless time variable obtained
referring the real time tr to a suitable critical time Tc = `/VM , while x is the dimensionless
space variable obtained dividing the real space xr by the length ` of the road.
The modeling approach can be based on a strategy defined by the following reasonings:
• Increasing the complexity of the model increases the number of parameters to be identified.
• The system is with finite degrees of freedom. However, microscopic models induce considerable
errors in the computation of macroscopic quantities.
• The flow is not continuous, hence hydrodynamic models should not be used. Moreover, it is
difficult to evaluate the entity of the approximation induced by macro-models.
• The number of individual entities is not large enough to allow the use of continuous distribution
functions within the framework of the mathematical kinetic theory. Moreover, interactions are
not localized, considering that drivers adapt the dynamics of the vehicle to the flow conditions
in the visibility zone.
• Individual entities cannot be regarded as classical particles, but as active particles due to their
ability to modify their dynamics according to specific strategies.
The examples proposed in the following tackle the afore-said specific modeling issues and
refer to the recent paper Bianca and Coscia (2011).
Example 4.6.2
Representation and Mathematical Structure of Vehicular Traffic
Let us consider discrete velocity models, where the velocity variable belongs to the following
set: Iv = {v1 = 0 , . . . , vi , . . . , vn = 1}. The corresponding discrete representation is obtained
by linking the discrete distribution functions to each vi : fi = fi (t, x) : IR+ × [0, 1] → IR+ , for
i = 1, . . . , n.
Macroscopic quantities are obtained by weighted sums. For instance, dimensionless number
155
and
n
X
q(t, x) = vi fi (t, x) , q(t, x) = ξ(t, x)ρ(t, x) .
i=1
The mathematical structure to be used for the derivation of models can be based on the
following assumptions:
• Interactions are distributed over a (dimensionless) characteristic length ξ > 0, which can
be interpreted as the visibility length of the drivers: a vehicle located at a point x ∈ Dx is
supposed to be affected in average by all vehicles comprised within a certain visibility zone.
• Interactions are weighted by means of a proper function w(x, y), and occur with intensity
according to the free space locally available in the visibility zone. Considering that a vehicle is
essentially an anisotropic particle, in the sense that it reacts mainly to frontal stimuli than to
rear ones, it is assumed that the visibility zone does not include any stretch behind the vehicles.
Accordingly, the mathematical structure to generate specific models is as follows:
XX n n Z x+ξ
∂fi ∂fi
+ vi = η[ρ](t, y)Aihk [ρ; α]fh (t, x)fk (t, y)w(x, y) dy
∂t ∂x x
h=1 k=1
n Z
X x+ξ
− fi (t, x) η[ρ](t, y)fh (t, y)w(x, y) dy , (4.6.12)
h=1 x
where
Z x+ξ
w(x, y) ≥ 0, w(x, y) dy = 1 ,
x
while
n
X
Aihk [ρ; α] ≥ 0, Aihk [ρ; α] = 1, ∀ h, k = 1, . . . , n, ρ(t, x) ∈ [0, 1) .
i=1
denotes the probability density that a candidate particle with velocity vh falls into the state
vi after an encounter with a field particle with velocity vk and α is a parametr to be properly
characterized.
156
Example 4.6.3
Modeling Interactions of Vehicles
The modeling of interactions, according to Bianca and Coscia (2011) uses a parameter α ∈ [0, 1]
is related to the quality of the road:
ρc ,
α= α ∈ [0, 1] ,
ρcM
where ρcM is the maximal admissible value corresponding to the best highway in optimal
environmental conditions.
The velocity is discretized by coupling of a steady grid when the density ρ is less than the
critical one ρc and with an adaptive grid when ρ is greater than ρc :
i−1
vi =2 n − 1 if 0 ≤ ρ ≤ ρc ,
vi =(1 + α) (1 − ρ3 )2 i − 1 if ρc < ρ ≤ 1,
n−1
for i ∈ {1, . . . , n}
The encounter rate nhk , for h, k ∈ {1, . . . , n}, is assumed as follows: ηhk = |vh − vk |. The
Table of the Games Aihk can be derived according to the following assumptions:
i) The trend of the candidate particle to change its velocity decreases in probability when the
“distance” |vh − vk | between the velocity of the candidate and field vehicle decreases.
ii) The candidate vehicle h can attain only a velocity vi ∈ {vh , vh−1 , . . . , vk } if it interacts with
a field vehicle with velocity vk such that vh > vk while if the candidate vehicle interacts with
a field vehicle such that vh < vk it can only attain a velocity vi ∈ {vh , vh+1 , . . . , vk }.
iii) If the candidate and field vehicles have the same velocity, the interaction does not imply a
change in velocity.
iv) When a candidate vehicle, with velocity vh , encounters a field vehicle, with velocity vk > vh
(faster vehicle), the candidate vehicle has a trend to increase its velocity from vh to vk .
v) When a candidate vehicle, with velocity vh , encounters a field vehicle, with velocity vk < vh
(slower vehicle), the candidate vehicle has a trend to decrease its velocity from vh to vk .
Detailed calculations reported in the afore stated paper provide the following results:
k < h, i=h: Aihk = 1 − εhk ,
εhk ,
k ≤ i < h : Aihk =
k < h,
|h − k|
εhk ,
k > h, h < i ≤ h : Aihk =
|h − k|
k > h, i=h: Aihk = 1 − εhk ,
157
where
1−ρ
εhk = α ·
|h − k|2
Let us finally consider the problem of the validation of models. Three types of empirical data
can be classified:
Quantitative data, which provide accurate data of macroscopic quantities such as number
density, mean velocity, and/or flow. Models are required to have the ability to reproduce them
quantitatively.
Qualitative data on emerging behaviors, which corresponds to specific conditions, for
instance in vehicular traffic interactions of clusters of fast and slow vehicles, bottlenecks. Model
are required to reproduce qualitatively emerging behaviors.
Data on individual behaviors, which should be used to design models for the table of
games.
A remarkable difficulty is that quantitative and qualitative empirical data provide macro-
scopic quantities, while the dynamics is ruled at the microscopic scale. Moreover:
• The averaging process which leads to macroscopic quantities introduces unavoidable fluctua-
tions due not only to measurement errors, but also to the stochastic and granular essence of the
flow, where deceleration and acceleration of vehicles are observed even in almost steady flow
stop and go dynamics;
• Generally, experimental results refer to steady state conditions, while rarely traffic
reach a steady state;
• Empirical data show results very sensitive to the quality of environmental conditions.
Therefore, it is impossible to identify a unique deterministic representation.
• Kerner (2004) remarks that additional transitions can be observed when the flow is congested,
for instance related to a considerable dispersion of data. Moreover, transitions have to be
interpreted carefully due to their variability with environmental conditions.
• The use of quantitative data as a direct input to derive models is an abuse as, at least
in principles, a careful modeling of microscopic interactions should provide the above result.
The interested reader can obtain a deeper understanding of the topic treated in this section
by Bianca and Coscia (2010), where it is shown that their model remarkably reproduce all
quantitative features of the velocity diagram studied by Kerner (2004). This diagram reports
the mean velocity versus density and shows that there exists a critical density ρc such that for
ρ < ρc the mean velocity maintains its maximum value, while for ρ > ρc a sharp decay indicates
the transition from free to congested flow. This characteristic, and specifically ρc , depends on
the quality of the outer environment. Additional information is obtained by Delitala and Tosin
(2007), where it is shown that various emerging behaviors are reproduced by their model.
158
A concise introduction to some models of the kinetic theory for classical and active particles
has been given in this chapter, while suitable bibliographical indications have been provided
toward a deeper knowledge on the topics treated in this chapter have been given.
The substantial difference between models for classical and active particles refers to mi-
croscopic interactions. Classical particles follow deterministic laws of Newtonian mechanics,
namely if both the input velocities and the impact parameter are assigned, then the output
velocities can be computed. On the other hand, interactions for active particles are stochastic.
Moreover the activity variable can modify the mechanical variables due to the strategy of the
particles to modify their dynamics.
Although this chapter does not report specific applications, some bibliographical indications
can be given. Specifically, the Boltzmann equation has been applied to study a variety of fluid
dynamic problems at large Knudsen number. Various applications are documented in the books
by Kogan (1969). The Vlasov equation has been applied to study various problems in plasma
physics, see for example Schram (1991).
The Kinetic Theory of Active Particles has been used to model some interesting complex
systems in applied sciences which are constituted by a large number of interacting entities
such that their microscopic state includes, in addition to geometrical and mechanical variable,
an additional microscopic state related to their behavior, which will be called characteristic
microscopic variable, or simply: activity. Microscopic interactions do not follow rules of
mechanics (classical or quantum), but are governed by a somehow organized behavior which is
able to modify laws of mechanics.
As already mentioned in Chapter 1, mathematical models of the kinetic theory are used to
overcame complexity problems induced by the need of modeling large systems of interacting in-
dividuals, where the number of equations needed to describe the system is not computationally
tractable. The representation offered by methods of the mathematical kinetic theory is elegant
and efficient. On the other hand the derivation of equations is based on heuristic assumptions
(for instance the factorization of the joint probability density), that cannot be rigorously jus-
tified. Therefore, models of the kinetic theory have to be regarded as an approximation of the
physical reality considering that even the structures, that the models refer to, are based on
mathematical approximations. Of course, various approaches to improve the above mentioned
approximations have been developed. However, we do not discuss this difficult topic in these
Lecture Notes.
Technically, different are the complexity problems related to models of active particles. In
fact, velocities of elements of living systems are always bounded, as documented in all appli-
cations dealt with in the book Bellomo (2008). Therefore, complexity problems are related to
modeling microscopic interactions rather than to the development of computational algorithms.
Discretization can be viewed as an approach to capture the main feature of the activity variable
in view of modeling microscopic interactions.
159
Finally, let us stress again that the various reasonings proposed in this chapter have to be
regarded simply as a brief introduction to modeling systems in applied and life sciences by the
approach of the classical and generalized kinetic theory. This brief introduction has the objective
to motivate the reader to a deeper insight, that may possibly start from the bibliography cited
in this chapter.
160
161
Chapter 5
Bibliography
Arlotti L., Bellomo N., De Angelis E., and Lachowicz M., (2003), General-
ized Kinetic Models in Applied Sciences - Lecture Notes on Mathematical
Problems, World Scientific, London, Singapore.
Arlotti L., Bellomo N., Lachowicz M., Polewczak J., and Waluś W., (1995),
Lecture Notes on the Mathematical Theory of the Boltzmann Equation, World
Scientific, London, Singapore.
Bellomo N., (1998), Guest Editorial: Perspectives on the relationship between math-
ematics and engineering sciences, Appl. Mech. Reviews, 51, iii)–iv).
Bellomo N., Berestycki H., Brezzi F., and Nadal J.P., (2010a), Mathematics
and Complexity in Life and Human Sciences, Math. Models Methods Appl. Sci, 20,
1391–1396.
Bellomo N., Bianca C., and Delitala M., (2009), Complexity analysis and mathe-
matical tools towards the modeling of living systems, Physics of Life Reviews, 6, 144–175.
Bellomo N., Bianca C., and Mongiovi M.S., (2010b), On the modeling of nonlin-
ear interactions in large complex systems, Appl. Math. Lett., 23, 1372–1377.
Bellomo N., and Carbonaro B., (2011), Toward a mathematical theory of living
systems focusing on developmental biology and evolution: A review and perspectives,
Physics of Life Reviews, to appear.
Bellomo N., Coscia V., and Delitala M., (2002), On the mathematical theory of
vehicular traffic flow. I. Fluid dynamic and kinetic modelling, Math. Models Methods
162
Bellomo N. and Dogbè C., (2011), On the modeling of traffic and crowds a survey
of models, speculations, and perspectives, SIAM Rev., to appear.
Bellomo N. and Gustafsson T., (1991), The discrete Boltzmann equation: a review
of the mathematical aspects of the initial and initial-boundary value problems, Rev.
Math. Phys., 3, 137–162.
Bellomo N., Lods B., Revelli R., and Ridolfi L., (2007), Generalized Colloca-
tion Methods - Solutions to Nonlinear Problems, Birkhäuser, Boston.
Bellomo N., Preziosi L., and Romano A., (2000), Mechanics and Dynamic
Systems with Mathematica, Birkhäuser, Boston.
Cercignani C., Illner R., and Pulvirenti M., (1994), The Mathematical The-
ory of Dilute Gases, Springer-Verlag, New York.
Cucker F. and Smale S., (2002), On the mathematical foundations of learning, Bull.
Amer. Math. Soc., 39, 1–49.
Dautray R. and Lions J.L., (1990), Mathematical Analysis and Numerical
Methods for Science and Technology - Vol. I, Springer-Verlag, Berlin.
De Angelis E., (1999), Nonlinear hydrodinamic models of traffic flow modelling and
mathematical problems, Math. Comput. Modelling, 29, 83–95.
De Jager E.M. and Furu J., (1996), The Theory of Singular Perturbations,
North-Holland, Amsterdam.
Glassey R., (1996), The Cauchy Problem in Kinetic Theory, SIAM Publish.,
Philadelphia.
Görsch D., (2002), Generalized discrete velocity models, Math. Models Methods Appl.
Sci., 12, 49–75.
Helbing D., (2001), Traffic and related self-driven many-particle systems, Rev. Modern
Phys., 73, 1067–1141.
163
Humi M. and Miller W., (1988), Second Course in Ordinary Differential Equa-
tions for Scientists and Engineers, Springer-Verlag, New York.
Kerner B., (2004), The Physics of Traffic, Springer, New York, Berlin.
Klar A., Kühne R.D., and Wegener R., (1996), Mathematical models for vehicular
traffic, Surveys Math. Indust., 6, 215–239.
Kogan M.N., (1969), Rarefied Gas Dynamics, Plenum Press, New York.
Kojić M. and Bathe K.J., (2005), Inelastic Analysis of Solids and Structures,
Springer-Verlag, Berlin.
Lions J.L. and Magenes E., (1968), Problèmes aux Limites Non Homogènes
et Application - Vol. I, Dunod, Paris.
May R.M., (2004), Uses and abuses of mathematics in biology. Science, 303, 790–793.
Mika, J.R. and Palczewski A., (1991), Asymptotic analysis of singularly perturbed
systems of ordinary differential equations, Comput. Math. Appl., 21, 13–32.
Mishchenko E.F. and Rozov N.Kh., (1980), Differential Equations with Small
Parameters and Relaxation Oscillations, Plenum Press, New York.
Platkowski T. and Illner R., (1988), Discrete velocity models of the Boltzmann
equation: a survey on the mathematical aspects of the theory, SIAM Rev., 30, 213–255.
Reed R., (2004), Why is mathematical biology so hard? Notices of the American
Mathematical Society, 51, 338–342.
Schram P.P.J.M., (1991), Kinetic Theory of Gases and Plasmas, Kluwer Aca-
demic Publishers Group, Dordrecht.