(Atlantis Series in Dynamical Systems 2) Jaap Eldering (Auth.) - Normally Hyperbolic Invariant Manifolds - The Noncompact Case-Atlantis Press (2013)
(Atlantis Series in Dynamical Systems 2) Jaap Eldering (Auth.) - Normally Hyperbolic Invariant Manifolds - The Noncompact Case-Atlantis Press (2013)
(Atlantis Series in Dynamical Systems 2) Jaap Eldering (Auth.) - Normally Hyperbolic Invariant Manifolds - The Noncompact Case-Atlantis Press (2013)
Jaap Eldering
Normally Hyperbolic
Invariant Manifolds
The Noncompact Case
Atlantis Series in Dynamical Systems
Volume 2
Normally Hyperbolic
Invariant Manifolds
The Noncompact Case
Jaap Eldering
Department of Mathematics
Utrecht University
Utrecht
The Netherlands
ISSN 2213-3526
ISBN 978-94-6239-002-7 ISBN 978-94-6239-003-4 (eBook)
DOI 10.2991/978-94-6239-003-4
Series Editors
Henk Broer
University of Groningen
Johann Bernoulli Institiute for Mathematics and Computer Science
Groningen, The Netherlands
Boris Hasselblatt
Department of Mathematics, Tufts University
Medford, USA
Atlantis Press
29, avenue Laumière
75019 Paris, France
For more information on this series and our other book series, please visit our
website www.atlantis-press.com
Preface
vii
viii Preface
In Chap. 3, we finally prove the main result in the trivial bundle setting. We first
state both this and the general version of the main theorem and discuss these in full
detail. We include a precise comparison with results in the literature, followed by
an outline of the proof. Section 3.3 contains a discussion of the differences to the
compact case and presents detailed examples to illustrate these. Then we start the
actual proof. We first prepare the system: we put it in a suitable form and obtain
estimates for the perturbed system. Then we prove that there exists a unique
persistent invariant manifold and that it is Lipschitz. Second, we set up an elab-
orate scheme in Sect. 3.7 to prove that this manifold is Cr smooth by induction
over the smoothness degree.
In Chap. 4, we discuss how the main result can be extended in a number of
different ways that may specifically be useful for applications. We show how time
and parameter dependence can be added and we present a slightly more general
definition of overflow invariance that might be applicable to systems that are not
overflowing invariant under the standard definition.
Finally, the appendices contain technical and reference material. These are
referenced from the main text where appropriate. Appendix A shows an important
idea that permeates this work: the implicit function theorem allows for explicit
estimates in terms of the input, hence it ‘preserves uniformity estimates’. This can
then directly be applied to dependence of a flow on the vector field. In Appendix
B, the Nemytskii operator is introduced as a technique to prove continuity of post-
composition with a function. This is an essential basic part in the smoothness
proof, together with the results on the exponential growth behavior of higher
derivatives of flows in Appendix C. Here, we also develop a framework to work
with higher derivatives on Riemannian manifolds. The last appendices include the
fiber contraction theorem of Hirsch and Pugh that is used in the smoothness proof,
Alekseev’s nonlinear variation of constants integral defined on manifolds, and a
brief overview of those parts of Riemannian geometry that we use.
The ‘core’ persistence proof itself is based on the Perron method; it can
probably be replaced by the proof of Bates, Lu, and Zeng, that is based on the
graph transform, when taking into account the necessary bounded geometry
technical details from Chap. 2. Our proof uses ideas of Henry and Vander-
bauwhede and Van Gils to extend the Perron method to NHIMs and higher
smoothness. A novel aspect is that we develop these ideas on a trivial bundle with
a bounded geometry manifold as base. This requires a whole framework to be set
up, including representations of higher jets of a flow in Appendix C and formal
tangent bundles of spaces of curves (see Sect. 3.7.4) to study derivatives of the
Perron contraction operator. These ideas might be of interest in other contexts of
dynamics in noncompact differential geometry.
Finally, our Definition 4.4 of a priori overflowing invariance might be new
(although probably not surprising to experts in the field) and could prove useful for
certain applications where the original definition of over- or inflowing invariance
does not hold.
Acknowledgments
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Normally Hyperbolic Invariant Manifolds. . . . . . . . . . . . . . . . . 1
1.1.1 Persistence and (Un)stable Manifolds . . . . . . . . . . . . . . 3
1.1.2 The Relation to Center Manifolds . . . . . . . . . . . . . . . . . 5
1.2 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2.1 The Spectral Gap Condition . . . . . . . . . . . . . . . . . . . . . 6
1.2.2 Motivation for Noncompact NHIMs . . . . . . . . . . . . . . . 10
1.3 Historical Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.4 Comparison of Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
1.4.1 Hadamard’s Graph Transform. . . . . . . . . . . . . . . . . . . . 16
1.4.2 Perron’s Variation of Constants Method . . . . . . . . . . . . 16
1.4.3 Smoothness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.5 Bounded Geometry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
1.6 Problem Statement and Results . . . . . . . . . . . . . . . . . . . . . . . . 20
1.6.1 Non-Autonomous Systems . . . . . . . . . . . . . . . . . . . . . . 24
1.6.2 Immersed Submanifolds. . . . . . . . . . . . . . . . . . . . . . . . 25
1.6.3 Overflowing Invariant Manifolds . . . . . . . . . . . . . . . . . 29
1.7 Induced Topology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
1.8 Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
xi
xii Contents
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187
Chapter 1
Introduction
The basics of the theory of hyperbolic dynamics date back to the beginning of the 20th
century, and the general formulation of the theory of normally hyperbolic systems
was stated around 1970. Since then, many people have extended the theory, and even
more people have applied it to problems in all kinds of areas.
Normally hyperbolic invariant manifolds are important fundamental objects in
dynamical systems theory. They are useful in understanding global structures and
can also be used to simplify the description of the dynamics in, for example, slow-fast
or singularly perturbed systems.
In this work, we are specifically interested in noncompact normally hyperbolic
invariant manifolds. We extend classical results that were previously only formulated
for compact manifolds. However, in many applications the manifold is not compact,
so an extension of the theory to the general noncompact case allows one to attack
these problems in their natural context. The main result of this work is an extension
of the theorem on persistence of normally hyperbolic invariant manifolds to a general
noncompact setting in Riemannian manifolds of bounded geometry type.
We should first point out that the theory of (normally) hyperbolic systems can be
applied to both discrete and continuous dynamical systems. That is, if we have a
dynamical system (T, X, ) with X a smooth manifold and : T × X → X the
evolution function, then the system1 is called discrete if T = Z and continuous if
T = R. In the discrete case, one typically has a diffeomorphism ϕ : X → X and the
full evolution function is defined as (n, x) = ϕ n (x), i.e. iterated application of ϕ.
In the continuous case, the map is called a flow. It is generated by a vector field v ∈
X(X ) and in that case the map t : X → X is again a diffeomorphism for any t ∈ R.
1 For simplicity of presentation we ignore the facts that may have a smaller domain of definition,
or that it is a semi-flow or semi-cascade, only defined on T ≥ 0.
The two cases can be related by fixing a t ∈ R in the continuous case and then
view ϕ = t as generating a discrete system. The statements of definitions and
results are (almost) identical if formulated in terms of the evolution function . The
methods of proof share this similarity and can be translated into each other. We shall
adopt the continuous formulation in this work, and refer to the evolution parameter
t ∈ T = R as time. Even though our system is defined in terms of a vector field v,
we call x ∈ X a fixed point of the system when t (x) = x for any t ∈ R. This is
equivalent to saying that v(x) = 0, i.e. that it is a critical point of v; we adhere to
the former terminology to better preserve the analogy with discrete systems.
Before we proceed to explaining normally hyperbolic invariant manifolds, it
should be pointed out that these are a generalization of hyperbolic fixed points.
Many of the characteristic properties generalize as well, so we first sketch the basic
picture for hyperbolic fixed points. Let x be a fixed point of a vector field, v(x) = 0;
it is called hyperbolic if the derivative Dv(x) has no eigenvalues with zero real
part. This means that the eigenvalue spectrum splits into parts left and right of the
imaginary axis, that is, the stable and unstable eigenvalues, but no neutral ones. The
corresponding stable and unstable eigenspaces E ± are both invariant under the linear
flow of Dv(x) and these spaces are characterized by the fact that solution curves on
them converge exponentially fast towards the fixed point under forward or backward
time evolution respectively. It is a well-known result that there are corresponding
stable and unstable (local) manifolds, denoted Wloc S and W U respectively, which
loc
are the nonlinear versions of these, see Fig. 1.1. This situation can be generalized
to a normally hyperbolic invariant manifold by replacing the single fixed point by a
‘fixed set of points’, that is, a manifold which is, as a whole, invariant.
Let us start with a somewhat informal explanation of the concept of a normally
hyperbolic invariant manifold, which we shall from now on often abbreviate as a
S
W loc
E−
1.1 Normally Hyperbolic Invariant Manifolds 3
There are two important properties that generalize from hyperbolic fixed points to
normally hyperbolic invariant manifolds. These are persistence of the fixed point and
the existence of stable and unstable manifolds. The generalization of these properties
is not a trivial statement nor easily proven in the generalized case of NHIMs, however.
Let us first focus on persistence. In case of a hyperbolic fixed point, this is trivially
stated and proven. If the fixed point x is hyperbolic, then it will persist as a nearby fixed
4 1 Introduction
point under small perturbations of the vector field v and stay hyperbolic. The proof is
a direct application of the implicit function theorem. If Dv(x) has no eigenvalues on
the imaginary axis, then certainly it has no zero eigenvalue, and therefore is a bijective
linear map. So a slightly perturbed vector field ṽ will again have a fixed point x̃ nearby
x and the eigenvalues of Dṽ(x̃) will be close to those of Dv(x) if ṽ −v is small in C 1 -
norm. Hence the eigenvalues are still separated byAZ the imaginary axis. For a NHIM
the situation is similar but technically much more involved due to the fact that there
is no control on the behavior of solution curves in the invariant manifold. A normally
hyperbolic manifold M does persist under C 1 small perturbations and the perturbed
manifold M̃ is again normally hyperbolic and close to M in a precise way. The most
important difference, however, is that M̃ generally has only limited smoothness, even
if M and the system were smooth or analytic.2 This smoothness is dictated by the
spectral gap condition, which is roughly the ratio between the normal exponential
expansion/contraction and the exponential expansion/contraction tangential to M.
This fact already indicates that the proof of persistence of a NHIM cannot be a
straightforward application of the implicit function theorem.
The stable and unstable manifolds generalize as well. That is, a normally hyper-
bolic invariant manifold M has stable and unstable manifolds W S (M) and W U (M)
such that solution curves on these converge exponentially fast towards M in forward
or backward time, respectively. Their intersection is precisely M. But there is actually
more structure: these manifolds—we consider W S (M) but everything is equivalent
for W U (M)—are fibrations of families of stable and unstable fibers to each point
m ∈ M,
W S (M) = W S (m).
m∈M
We should be a bit careful with this last statement, as points m ∈ M are generally not
fixed points. These fibers W S (m) are invariant in the sense that the flow commutes
with the fiber projection π S :
In other words, each fiber is mapped into another single fiber under the flow, namely
the fiber over the flow-out of the base point m. This important fact means that if
we use the fibration for local coordinates, then in these coordinates the horizontal,
base flow decouples from the vertical, fiber flow. This is sometimes also called
an isochronous fibration [Guc75] as all points in a fiber have the same long-term
behavior. Each single fiber is as smooth as the system, but the dependence on the
base point m, and thus the smoothness of the fibrations as a whole, is generally not
better than continuous, see Fenichel [Fen74, Sect. I.G]. We do not investigate these
invariant fibrations in the present work, although the mentioned results should hold
for noncompact NHIMs as well.
2 I do not know whether loss of smoothness is generic for NHIMs. See [Has94, HW99] for the
case of Anosov systems.
1.1 Normally Hyperbolic Invariant Manifolds 5
that its tangent space at the fixed point is the (generalized) eigenspace E 0 of the
eigenvalues with real part zero, that is,
C
Tx Wloc (x) = E 0 . (1.1)
We can extend the definition of center manifold a bit by including all eigenvalues λ
with real part bounded by |Re(λ)| ≤ ρ0 . An associated generalized center manifold
consists of solutions that converge or diverge from x at an exponential rate bounded
by ρ0 . Curves in the strongly3 stable or unstable manifold converge or diverge at
exponential rates larger than ±ρ0 , respectively. These conditions can directly be
compared to the description of NHIMs above, or Definition 1.8 (with ρ0 = ρ M ).
If we take a look at Fig. 1.2 again, then we see that both fixed points (indicated
with a dot) on M have (generalized) center manifolds; M itself is a center manifold
for these, but for the rightmost fixed point we can actually construct the center man-
ifold from any two solution curves converging to that fixed point from the left and
right. For example, the union of the two curves drawn in the figure that converge to it
could be taken as alternative center manifold. This reflects the well-known fact that
center manifolds are generally not unique. This is the main difference with the case
of NHIMs: center manifolds are only defined in terms of growth rates of solution
curves locally with respect to one fixed point, while NHIMs are globally invariant
objects, where the spectral splitting must hold everywhere along the invariant mani-
fold. This difference is effectively the reason that center manifolds are not uniquely
defined, while the perturbations of NHIMs are, see below. If we perturb the system
in Fig. 1.2 a bit, then the persistent NHIM must everywhere be close to the original
invariant manifold M. This enforces uniqueness; in Fig. 1.2 this is clearly visible:
the alternative choice of center manifold to the rightmost fixed point diverges far
from M. See also the example in Section 1.2.1.
There is a subtle question of smoothness both for center manifolds and NHIMs,
related to the spectral gap condition (1.11). Center manifolds are arbitrarily smooth
in a sufficiently small neighborhood of the fixed point x, but they are generally
not C ∞ , even though they satisfy an infinite spectral gap. See Van Strien’s short
note [Str79]. The reason is that the size of the neighborhood may depend on the
degree of differentiability C k . Persistent NHIMs generally have bounded smoothness
due a finite spectral gap; but even if they have an infinite spectral gap, the smoothness
of a persistent NHIM is (generally) not C ∞ for the same reasons. See Remark 1.12
and Example 1.3.
1.2 Examples
We present a few examples. The first detailed example serves to show explicitly that
smoothness of a persistent manifold depends crucially on the spectral gap condition.
The next examples motivate the usefulness of a noncompact version of the theory of
normal hyperbolicity.
4 The definition of normal hyperbolicity in [Mañ78] is a bit more general than the definition in
this paper. That definition only requires a growth ratio r ≥ 1 along solution curves in the invariant
manifold, and not as a ratio of global growth rates ρ X , ρY , see also Remark 1.10.
5 The case r = ∞ would require ρ > 0; when ρ = 0, any finite order r can be obtained, but
X X
only for perturbations sufficiently small depending on r .
1.2 Examples 7
when the flow has exponential growth behavior eρt , then higher order derivatives will
generally have growth behavior ekρt and the interval inclusion [k ρ, ρ] ⊂ (ρY , ρ X ) is
required to show existence and uniqueness of the k–th derivatives via a contraction.
The optimal differentiability degree r can be extended to a real number by viewing
α– Hölder continuity as a fractional differentiability degree. That means that the
perturbed manifold can be shown to be C k,α when r = k + α satisfies the spectral
gap condition and the system is C k,α to start with.
The following example shows that this result is sharp. We construct a very simple
compact, normally hyperbolic invariant manifold, and then show that an arbitrarily
small perturbation yields a unique perturbed C k,α invariant manifold, where r = k+α
satisfies ρY = rρ X . This in fact precisely violates the spectral gap condition, since
that requires a strict inequality. The example could be adapted to obtain a perturbed
λY λY
X
λ− x− χ λ+ x+
Fig. 1.3 An example invariant manifold exhibiting C k,α smoothness under perturbation
8 1 Introduction
for as long as t (m) stays in that neighborhood of (x± , 0) where the vector field is
linear. The transition time between these two neighborhoods is finite as v does not
have zeros and the transition map preserves vertical lines {x} × Y . The latter fact
is because vx is independent of y, that is, we have also found the invariant, foliated
stable manifold of M. Gluing together these Dt maps on the different domains,
we see that the resulting tangent flow splits again into independent horizontal and
vertical parts, which can be estimated by
∀t ≤ 0 : Dt (m)|Tm X ≤ C X eλ− t ,
∀t ≥ 0 : Dt (m)|T Y ≤ CY eλY t ,
m
where the constants C X , CY are determined by the flow t in the domain where v is
nonlinear. For any point m close to (x− , 0) this estimate is sharp, hence we expect
maximal smoothness r = λY /λ− for a generic perturbation.
Next, we add a perturbation term ε χ to the vector field v, so we have a perturbed
vector field ṽ = v + ε χ , where χ ∈ C0∞ is chosen with support on a small ball
intersecting M away from the fixed points and pointing upward. This will ‘lift’ the
invariant manifold as indicated in Fig. 1.3 for any ε > 0. Let M̃ denote this lifted
manifold, that is, M̃ is the image of the two heteroclinic solution curves that run from
(x+ , 0) to (x− , 0) together with these fixed points. The solution curve that runs to
the left is lifted up from the x– axis after entering the region supp χ .
We first investigate two claims: that M̃ is invariant and that it is the unique invariant
manifold that is close to M. The invariance is obvious; to the right of x+ nothing has
changed, so there M̃ = M. To the left of x+ we follow the original unstable manifold,
get pushed up within the domain of support of χ and after leaving that domain and
entering the linear flow around (x− , 0) we follow a standard curve ending at (x− , 0).
This is a solution curve of ṽ ∈ C ∞ , hence invariant and even smooth. Now assume
there exists another invariant manifold M
nearby and let (x
, y
) ∈ M
\ M̃. The
backward orbit of the point (x
, y
) must diverge to |y| 1. If x
= x− , then y
= 0
and this is clear. If x
= x− , then the backward orbit will end up at a point (x, y) with
x close to x+ and y = 0; since we are in the linear domain of (x+ , 0), this orbit will
then diverge (in reverse time) along the stable manifold towards |y| 1. Hence, M
is not close to M.
Next, we show that (for any ε > 0) the perturbed manifold M̃ is not more than
C k,α with k + α = λY /λ− , even though the original and perturbed systems are
1.2 Examples 9
C ∞ -smooth. To the left of (x− , 0), M̃ is given by the graph of the zero function
from X to Y (as the continuation from (x+ , 0) to the right along X = S 1 ). To the
right of (x− , 0), the solution curve is given by (x, y)(t) = (x0 eλ− t , y0 eλY t ), hence
y = C x λY /λ− where C depends on x0 , y0 only. So we can write M̃ as the graph of
the function
0 if x ≤ 0,
h̃ : X → Y : x
→ λ /λ
Cx Y − if x > 0.
This function is exactly C k,α for k +α = r in x = 0. Note that the loss of smoothness
appears at a different place than the perturbation of the vector field. The relevant fact
is that the different solution curves approaching the stable limit point have finite
differentiability with respect to each other, and this depends on the horizontal and
vertical rates of attraction at (x− , 0).
If we had assumed that ρY = ρ X , that is, r = 1, but with a non-strict inequality
ρY ≤ r ρ X , then normal hyperbolicity precisely fails and the invariant manifold
indeed need not persist. By the arguments above it can already be seen that the
persistent manifold can lose differentiability: when r = 1, the graph of the manifold
will be given by
0 if x ≤ 0,
h̃(x) =
C x if x > 0,
6 The ratio r in the spectral gap is defined by a strict inequality, which we ignore here for simplicity
of presentation.
10 1 Introduction
but have smoothness C r where r can be made arbitrarily large by decreasing the
perturbation size.
Most of the literature on normal hyperbolicity and its applications treat com-
pact NHIMs only. This excludes possibly interesting applications. Settings where
a noncompact, general geometric version of normal hyperbolicity may be useful
include chemical reaction dynamics [Uze+02] and problems in classical and celes-
tial mechanics [DLS06].
We describe three examples where noncompactness naturally comes into play.
The first example, a normally attracting cylinder, is set in Euclidean space. This
example could be complicated a bit more by adding normal expanding directions
to get a fully normally hyperbolic system. Such situations show up in Hamiltonian
or reversible systems with invariant tori [Bro+09]. The second and third examples
are set in ambient manifolds with nontrivial topology, thus motivating the need for a
theory of noncompact NHIMs in such a geometric setting. The third example shows
an application to classical mechanics and actually motivated this work.
Let us first treat a simple example.
Example 1.4 (A normally attractive cylinder) Let us consider the infinite cylinder
y 2 + z 2 = 1 in R3 . If we define a very simple dynamics by
(ẋ, ṙ , θ̇ ) = (0, r (1 − r ), 1)
in cylindrical coordinates, then the cylinder is normally attractive and the motion on
the cylinder consists of only periodic orbits, see Fig. 1.5.
1.2 Examples 11
The dynamics on the cylinder is completely neutral, while it attracts in the normal
direction with rate −1. Hence, there exists a unique persistent manifold diffeomorphic
and close to the original cylinder. For any k ≥ 1, the persistent manifold has C k
smoothness if the perturbation is chosen sufficiently small. The perturbed manifold
must be uniformly close to the original cylinder; this rules out Example 3.9 of a
cylinder with exponentially shrinking radius.
The dynamics on the persistent manifold can be perturbed in arbitrary ways. It
could slowly spiral towards x– infinity, or develop attracting and repelling periodic
orbits on the cylinder. If the cylinder were higher dimensional, it could even become
chaotic.
The previous example is set in Euclidean space, even though the invariant manifold
has not completely trivial topology. Let us next consider a case where the ambient
space is a manifold of nontrivial topology.
Example 1.5 (A pendulum with time-dependent perturbation) Consider a classical
pendulum described by its angle θ ∈ S 1 and angular velocity ω. The unstable top
position θ = π is a hyperbolic fixed point, hence a special case of a NHIM.
The phase space TS 1 is noncompact and nontrivial. Without perturbations we
could restrict ourselves to a compact energy surface; if we add a general time-
dependent perturbation, however, then energy is not preserved anymore and the full
phase space must be considered. We should include the noncompact time interval R
as well, leading to the full phase space TS 1 × R. Then the unperturbed unstable top
position corresponds to the one-dimensional NHIM
Remark 1.6 The nontrivial topology of S 1 can easily be undone by modeling the
system on its universal cover R. The example can be changed to a spherical pendu-
lum modeled on S 2 ; this indicates more clearly the usefulness of having the theory
available in spaces with nontrivial topology, although in this case we still only need
to study the problem in a small neighborhood of the top position. It was pointed out
to me by Robert MacKay that TS 2 is still a trivializable tangent bundle, and that
more complex, non-trivializable examples including higher dimensional spheres can
be constructed in the setting of chemical reaction dynamics. ♦
The last example actually motivated this work.
Example 1.7 (Nonholonomic systems as singular perturbation limit) Let a classical
mechanical system be given by a smooth Riemannian manifold (Q, g) as configura-
tion space and a Lagrangian L : TQ → R. The vector field v on TQ is determined
by the Lagrange equations of motion, given in local coordinates by
i d ∂L ∂L
L = − = 0. (1.4)
dt ∂ ẋi ∂ xi
so(3) × S O(3) × R2 ∼
= D ⊂ T S O(3) × R2 .
7 Here, a distribution is meant in the sense of differential geometry as a subbundle of the tangent
can be obtained from uniform Lipschitz continuity of the tangent spaces of the invariant manifold.
I am not sure if this is a significantly weaker hypothesis. See also the discussion in Remark 3.13.
1.3 Historical Overview 15
using the fiber contraction theorem (see Appendix D), proved smoothness of center
manifolds with the Perron method. Although not the same, center manifolds have
many properties in common with NHIMs and Sakamoto [Sak90] has built upon the
works of Henry and Vanderbauwhede and Van Gils to prove persistence and C k−1
smoothness for singularly perturbed systems in a finite-dimensional Rm ×Rn product
space setting. The loss of one degree of smoothness is again due to the construction
of normal bundle coordinates, although this fact is obscured by the explicit Rm × Rn
setting.
Singularly perturbed, or, slow-fast systems are another important class of applica-
tions. These describe systems where the dynamics is governed by multiple, separate
time scales, or when a system can be viewed as an approximation of an idealized,
restricted system. Singularly perturbed systems can be studied using the theory of
normal hyperbolicity by turning them into a regular perturbation problem via a rescal-
ing of time, see foundational work by Fenichel [Fen79] or the more introductory
expositions [Jon95; Kap99; Ver05].
There are two well-known methods for proving the existence and smoothness of
invariant manifolds in hyperbolic-type dynamical systems. The Hadamard graph
transform and the variation of constants method, also known as the (Lyapunov–)
Perron method. Variations of both have been applied in many situations with some
form of hyperbolic dynamics. This ranges from the relatively simple problem of
finding the stable and unstable manifolds of a hyperbolic fixed point, to center man-
ifolds, partially hyperbolic systems, and normally hyperbolic systems. The quote
of Anosov [Ano69, p. 23] that “every five years or so, if not more often, someone
‘discovers’ the theorem of Hadamard and Perron, proving it either by Hadamard’s
method of proof or by Perron’s” is nowadays probably familiar to many researchers
in these areas; it illustrates the pervasiveness of these methods.
In this section, I describe the ideas that are common to both methods, as well as
their differences. I hope to elucidate the merits and weak points of both methods,
especially when applied to normally hyperbolic systems. Basically they seem to be
able to produce the same conclusions, but each method takes a different viewpoint
to the problem.
Let us first identify some basic common ideas. As a sample problem, we consider
finding the invariant unstable manifold W U of a hyperbolic fixed point, positioned at
the origin of Rn . The system is defined by either a diffeomorphism in the discrete
case, or a flow t in the continuous case. Both methods use the splitting of the
tangent space into stable and unstable directions:
T0 Rn ∼
= Rn = U ⊕ S.
16 1 Introduction
The graph transform is due to Hadamard. His paper [Had01] (in French, 4 pages) can
be used as a concise and basic introduction to the graph transform, applied to the stable
and unstable manifolds of a hyperbolic fixed point. He does not prove smoothness
or even continuity of these invariant manifolds, although continuity could easily be
concluded by introducing the Banach space of bounded continuous functions with
supremum norm.
The basic idea of the graph transform is to view the unstable manifold W U as
the graph of a function g : U → S. The graph, as a set, is invariant under (or e.g.
1 in the continuous case). The diffeomorphism can also be interpreted as a map
acting on functions g through its action on their graphs. This induces a mapping
Thus, by definition, any point (x, g(x)) on the graph of g gets mapped to a point
(x
, g̃(x
)) on Graph(g̃). The map T turns out to be well-defined and a contraction
on functions U → S that are sufficiently small in Lipschitz norm. The graph of
the unique fixed point g of T must correspond to the unstable manifold, that is,
W U = Graph(g ).
By considering the invariant sets, this method focuses on the geometry of the
problem. The method uses a diffeomorphism map ; the continuous case can be
studied by considering the flow map t for a fixed time t. The diffeomorphism can
easily be studied locally in charts on a manifold. Therefore this method lends itself
well to the generalized setting of normally hyperbolic invariant manifolds, where
the invariant manifold is intrinsically a global object. Even if this global object is
nontrivial, it can still be studied in local charts.
continuous case.11 We adopt the notation from the graph transform setting. A con-
traction operator T is constructed via a variation of constants integral. The non-
linear part of the vector field is viewed as a perturbation of the linear part. The
integral equation is split into the components along the stable and unstable direc-
tions. Then the integration of the unstable component is switched from the interval
[0, t] to [−∞, t], and only bounded functions are considered. Writing the vector
field v(x) = Dv(0) · x + f (x) in linearized form with nonlinearity f , this leads to
the following contraction operator on curves x = (x + , x − ) ∈ C 0 ([−∞, 0]; Rn ):
0 −
+ −
T : x (t), x (t)
→ x0+ − Dt−τ +
+ (0) f + x (τ ), x (τ ) dτ ,
t
t − +
Dt−τ
− (0) f − x (τ ), x (τ ) dτ . (1.7)
−∞
g : U → S : x0+ → x − (0).
First of all, it must be noted that this method requires f to be small in C 1 -norm. We
can make f small by restricting to a sufficiently small neighborhood of the origin
and cutting off f outside of it. This cut-off does not influence the results: due to
the boundedness condition, curves x stay in the neighborhood. The method can be
generalized to a separation of stable and unstable spectra (i.e. a dichotomy) away
from the imaginary axis,12 and for example be applied to show existence of center
manifolds. In that case, uniqueness is lost as solutions will generally run out of small
neighborhoods. This makes the Perron method not directly applicable to normally
hyperbolic invariant manifolds. The center direction corresponds to the invariant
manifold, but solution curves are global objects that cannot be treated locally.
The Perron method can be extended to overcome this problem. Henry [Hen81,
Chap. 9] linearizes the vector field only in the normal directions of the invariant
manifold. Henry uses a two-step contraction scheme, but this can be reduced to a
single contraction T = T− ◦ T+ that is a composition of two maps. The maps T± are
11 Contrary to the graph transform (which is only intrinsically defined for mappings), the Perron
method can be formulated both for flows and discrete mappings. For the discrete case, the integral
must be replaced by a sum, the mapping must be split into a linear and nonlinear part, and the
linearized flow must be replaced by iterates of the linearized mapping. See for example [APS02;
PS04].
12 This is for the continuous case. The imaginary axis of the spectrum of a vector field corresponds
(via the exponential map) to the unit circle for the spectrum of a diffeomorphism in the discrete
case.
18 1 Introduction
essentially the components of (1.7). Still, the results obtained are not quite as general
as those obtained with the graph transform. For the graph transform, the condition
of normal hyperbolicity can be formulated in terms of the ratio of the normal and
tangential growth rates of the flow along orbits, while for the Perron method it must be
formulated in terms of the ratio of global growth rates. This less general assumption
is required because the contraction operator (1.7) is studied on spaces of solution
curves with a fixed exponential growth behavior, see Definition 1.17.
Explicit time dependence can be added to the Perron method with only trivial
modifications. This allows one to study hyperbolic fixed points in non-autonomous
systems.13 An application is the study of invariant fibrations of, for example, normally
hyperbolic invariant manifolds. These have fibered stable and unstable manifolds.
Points in a single fiber are characterized by the unique orbit on the normally hyper-
bolic invariant manifold they are exponentially attracted to under forward or back-
ward evolution, respectively. Finding these fibers is turned into a non-autonomous
hyperbolic fixed point problem by following a point on the invariant manifold.
1.4.3 Smoothness
In the truly hyperbolic case—when the stable and unstable spectra are separated
by a neighborhood of the imaginary axis—the Perron method allows for a direct
proof of smoothness of the manifolds W U and W S , see [Irw70; Irw72] where this
is formulated for discrete systems. One first verifies that the contraction operator T
is as smooth as the system, still acting on continuous curves x. Then, by an implicit
function theorem argument, the fixed point depends smoothly on the (partial) initial
value parameter x0+ . To the best of my knowledge, there is no similarly simple
approach for the graph transform. The contraction map acts directly on graphs g, so
to obtain smoothness, one must consider the maps g ∈ C k (U ; S). A direct estimate
of contractivity in C k -norm requires higher than k-th order Lipschitz estimates on
the system.
When the spectra are not separated by the imaginary axis—this occurs for example
in normally hyperbolic systems—things become more complicated. The spectral gap
condition defines an intrinsic upper bound for the smoothness that one can generically
expect for a system, as was seen in Example 1.1. Both methods apply induction over
the smoothness degree in their proof. Formal derivatives of the contraction map T
are constructed. These are again contractions, but now on higher derivatives of the
fixed point mapping, while fixing the derivatives below. Finally, the fiber contraction
theorem (see Appendix D) can be used to conclude that these higher order derivatives
converge to a fixed point, jointly with all lower orders.
13 The term ‘fixed point’ in the context of a non-autonomous system is not definable in a coordinate-
free way: any orbit of the system can be made into a fixed point under a suitable time-dependent
coordinate transformation. However, there may be a preferred “time-independent” coordinate sys-
tem. Moreover, the hyperbolicity of an orbit with respect an intrinsic metric is independent of a
choice of coordinates.
1.4 Comparison of Methods 19
D T (x) δx1 , . . . , δxk (t) = Dt−τ (0) · Dk f (x(τ )) δx1 (τ ), . . . , δxk (τ ) dτ.
k
(1.8)
The solution curve x as well as its variations δxi are of growth order eρt , so the
variation of f in the integrand is of growth order ekρt , even if Dk f itself is bounded.
This means that k-th order variations must be considered in spaces of growth order
ekρt and Dk T is only contractive on such spaces if both ρ and k ρ are contained in
the spectral gap.
The main results of this work are formulated in a geometric context on differentiable
manifolds. Already in [Fen72; HPS77] the results are formulated in such a context.
This allows for more general situations than choosing Rn as ambient space. In the
compact case, it does not require a change in the basic proofs (as can be seen from
the approach taken in [Fen72]), but it does bring in some additional formalism. On
the other hand, pre-existing work on noncompact NHIMs [Hen81; Sak90; BLZ08;
BLZ99] explicitly assumes that the ambient space is Euclidean or Banach.
It turns out that if one switches to a noncompact setting in manifolds, then some
fundamental new ingredients must be added. Already in Euclidean space one must
assume uniformity of the vector fields as a replacement for compactness. Similar
additional uniformity assumptions are required for the ambient space—these do not
manifest themselves in Euclidean space. First, a choice of Riemannian metric (or
possibly a weaker form: a Finsler structure) is required since not all metrics are
equivalent anymore on a noncompact manifold, see Example 3.6. As an extension,
Example 3.7 shows that one cannot reduce the noncompact to a compact case by
compactification. Secondly, the ambient manifold and functions on it should satisfy
uniformity criteria that can be captured in terms of ‘bounded geometry’.14 For full
14 We do not claim that bounded geometry is a necessary condition to generalize the theory of
normal hyperbolicity to noncompact ambient spaces, only that it is sufficient. Section 3.3 does
contain some examples, though, that indicate that some form of bounded geometry is necessary.
20 1 Introduction
details see Sect. 3.3 on compactness and uniformity and Chap. 2 on bounded geom-
etry. Let us just give a quick overview here.
A Riemannian manifold has bounded geometry, loosely speaking, if it is globally,
uniformly well-behaved. More precisely, its curvature must be bounded and the
injectivity radius must be bounded away from zero, see Definition 2.1. Then there
exists a preferred set of so-called normal coordinate charts for which coordinate
transition maps are uniformly continuous and bounded, smooth functions. That is,
in k-th order bounded geometry we have a C k uniform atlas. As a consequence,
uniformly continuous and bounded submanifolds, vector fields, and other objects
can be defined and manipulated in a natural way in terms of these coordinates. Note
that Rn and compact manifolds have bounded geometry, see Example 2.3. Together
with corollaries 3.4 and 3.5 of the main theorem, this shows that bounded geometry
provides a natural generalization to the known settings of compact and Euclidean
spaces.
We use bounded geometry to obtain boundedness estimates on holonomy, see
Sect. 2.2. This is a fundamental ingredient in our proof of smoothness of the perturbed
manifold. Finally, we present more technical results in bounded geometry: a uniform
tubular neighborhood, uniform smoothing of submanifolds, and a trivializing embed-
ding of the normal bundle. We use these to reduce the full problem of persistence of
a normally hyperbolic submanifold M in an ambient manifold Q to the trivialized
situation X × Y , where M is represented by the graph of a small function h : X → Y
and Y is a vector space. Uniformity permeates all these constructions in order to
obtain uniform estimates required for the persistence proof in the trivialized setting.
The main problem in this work is the persistence of normally hyperbolic invariant
manifolds under small perturbations of the dynamical system. That is, given a flow
t defined by some vector field v and a normally hyperbolic invariant submanifold
M, we want to show that for any vector field ṽ sufficiently close to v, there exists
a unique manifold M̃ close to M that is invariant under the flow of ṽ; moreover we
would like to show that M̃ is normally hyperbolic again. To make this statement
precise, we need to define a lot of things: first of all, we need to rigorously define
normal hyperbolicity. Secondly, the statements about vector fields and manifolds
being ‘close’ need to be formalized and finally, we need to specify the ambient space
Q on which the system is defined.
We start with a Riemannian manifold (Q, g) as ambient space and a submani-
fold M. For technical reasons this manifold is assumed to be complete and of bounded
geometry (or at least in a δ > 0 neighborhood of M, since the whole analysis can be
restricted to such a neighborhood). Basically, these conditions impose uniformity of
the space, and fit in the principle of replacing compactness by uniform estimates, see
Sect.1.3 and Chap. 2 for more details. Note that Q = Rn with the standard Euclidean
metric is an easy (and typical) special case.
1.6 Problem Statement and Results 21
k,α
Let v ∈ X(Q) be a vector field on Q with v ∈ Cb,u , that is, v up to its k-th
derivative is uniformly continuous and bounded, and α-Hölder continuous if α = 0.
On Rn these statements make immediate sense; on general manifolds Q, results
from Chap. 2 are required, in particular Definition 2.9, to make sense of uniform
boundedness and continuity by means of normal coordinates. Let ṽ be another such
vector field. The closeness of v and ṽ will be measured using supremum norms. The
C 1 -norm is required to be small for the persistence result. Thus, even though we
consider the space of C k,α bounded vector fields, we endow this space with a C 1
topology. See Sect. 1.7 for some more remarks on this topology and a comparison
with standard topologies on noncompact function spaces. If we assume that ṽ − v
is small in C k,α -norm as well, then M̃ will be C k,α -close15 to M. These C 1 and C k
norm requirements and results are direct analogues of those in the implicit function
theorem.
Finally, we define normal hyperbolicity of a submanifold M with respect to a
continuous dynamical system (R, Q, ). The flow t should have a domain of
definition containing at least a neighborhood of the invariant manifold M. This
definition is easily adapted to the discrete case of a diffeomorphism : Q → Q;
simply replace t ∈ R by t ∈ Z as iterated powers of .
Definition 1.8 (Normally hyperbolic invariant manifold) Let (Q, g) be a smooth
Riemannian manifold, t ∈ C r ≥1 a flow on Q, and let M ∈ C r ≥1 be a submanifold
of Q. Then M is called a normally hyperbolic invariant manifold of the system
(Q, t ) if all of the following conditions hold true:
1. M is invariant, i.e. ∀ t ∈ R : t (M) = M;
2. there exists a continuous splitting
T M Q = TM ⊕ E + ⊕ E − (1.9)
15 We actually only obtain C k closeness for integer k ≤ r − 1 where r is the ratio in the spectral
gap condition (1.11). This is probably an artifact of the techniques we used, while C k,α closeness
with k + α = r should be obtainable.
22 1 Introduction
These exponential estimates imply that the tangent flow Dt must contract at a rate
of at least ρ− along the stable complementary bundle E − , expand16 as eρ+ t along
the unstable bundle E + , and may not expand or contract at a rate faster than ± ρ M ,
respectively, tangent along TM.
Remark 1.9 We added the condition that the projections π M , π+ , π− are globally
bounded. This is a natural extension to the noncompact case, and is automatically
satisfied in case M is compact.
Remark 1.10 This definition of normal hyperbolicity is not as general as could be.
Fenichel [Fen72, pp. 200–204] defines normal hyperbolicity in terms of ‘general-
ized Lyapunov type numbers’. It follows from his uniformity lemma that these are
essentially exponentiated versions of our Lyapunov exponents ρ. For example, his
ν is equivalent to our e−ρ+ . But Fenichel defines σ in terms of the ratio ρ M /ρ+
along orbits in M. His definition allows the expansion rate along TM to be large,
for example, as long as the expansion rate along E + is large enough to keep the
ratio σ (m) bounded, along the orbit through m. The definitions in [HPS77, Mañ78,
BLZ08] are equivalent in the compact context to the one in [Fen72]. Mañé’s work
shows that this definition is as general as possible, see below. ♦
When M is compact, normal hyperbolicity is a sufficient condition for the exis-
tence of a persistent manifold M̃ for a system generated by ṽ if ṽ − v1 is sufficiently
small. Conversely, Mañé [Mañ78] has proved that normal hyperbolicity (in the sense
of e.g. Fenichel’s definition) is also necessary: if a compact invariant manifold M is
persistent under any C 1 small perturbation, then M is normally hyperbolic (see also
Example 1.2 and the clear exposition in the introduction of [Fen72]). Definition 1.8,
however, only guarantees C 1 smoothness for the perturbed manifold M̃. To obtain
higher order smoothness, a more stringent condition of r -normal hyperbolicity must
be satisfied.
Definition 1.11 (r -normally hyperbolic invariant manifold) A manifold M is
called r -normally hyperbolic with r ≥ 1 a real number, if it satisfies M ∈ C r
and the conditions in Definition 1.8, but with the stronger inequalities
This means that the normal expansion and contraction must not just dominate the
tangential ones, but do so by a factor r . For r = 1 we recover the original definition,
while the generalized inequality (1.11) is called the spectral gap condition. If M is
r -normally hyperbolic and v and the perturbation ṽ are C r as well, then the persistent
manifold M̃ is C r smooth again. The example in Sect. 1.2.1 shows that this spectral
gap condition is sharp: even when everything is C ∞ , the perturbed manifold M̃ in
16 Note that expansion along E + could also be formulated as Dt (m) x ≥ C+ eρ+ t x for
+
t ≥ 0 and (m, x) ∈ E . This is equivalent to the condition as stated, which says that there is
contraction for t ≤ 0, that is, in backward time. This latter formulation is preferable because it is
the form required in estimates.
1.6 Problem Statement and Results 23
that example is only C r when no more than r -normal hyperbolicity holds. Note that
r can be interpreted as a ‘fractional differentiability degree’ when writing r = k + α
with integer k ≥ 1 the normal degree of differentiability and 0 ≤ α ≤ 1 an additional
Hölder continuity exponent.
Remark 1.12 We explicitly exclude the case r = ∞ from Definition 1.11, even
though the spectral gap condition (1.11) could hold for r = ∞, if ρ M = 0. The reason
is that one can generally not expect to obtain a persistent manifold M̃ ∈ C ∞ in this
case. Even though for any order r < ∞ there exist persistent manifolds M̃ ∈ C r for
sufficiently small perturbations, the maximum perturbation size generally depends
on r and may shrink to zero when r → ∞. See Example 1.3 and the example
in [Str79] for the closely related case of center manifolds.
On the other hand, it is shown in [HPS77] that there is forced smoothness. If
M ∈ C 1 is an r -NHIM and the system is C r , then M must be C r . This also holds in
our noncompact setting, see Remark 3.3, 10 for a sketch of its proof. ♦
With these preliminary definitions in place, we are now ready state our main
theorem; it is restated in Chap. 3. We should point out that M is not required to
be an embedded submanifold; immersions are allowed as well, see Sect. 1.6.2. For
k,α
the details of the smoothness notation Cb,u on manifolds we refer to definitions 2.9
and 2.21.
Theorem 1.1 (Persistence of noncompact NHIMs in bounded geometry) Let
k ≥ 2, α ∈ [0, 1] and r = k + α. Let (Q, g) be a smooth Riemannian manifold of
k,α k,α
bounded geometry and v ∈ Cb,u a vector field on Q. Let M ∈ Cb,u be a connected,
complete submanifold of Q that is r -normally hyperbolic for the flow defined by v,
with empty unstable bundle, i.e. rank(E + ) = 0.
Then for each sufficiently small η > 0 there exists a δ > 0 such that for any
k,α
vector field ṽ ∈ Cb,u with ṽ − v1 < δ, there is a unique submanifold M̃ in the
η-neighborhood of M, such that M̃ is diffeomorphic to M and invariant under the
k,α
flow defined by ṽ. Moreover, M̃ is Cb,u and the distance between M̃ and M can be
made arbitrarily small in C -norm by choosing ṽ − vk−1 sufficiently small.
k−1
This result generalizes the well-known results in [Fen72, HPS77] to the case of
noncompact submanifolds of Riemannian manifolds. Again, our definition of normal
hyperbolicity is slightly less general than the definitions used in these works. We also
assumed that only the stable bundle E − is present, see also Sect. 4.4; note that we
thus only have the spectral gap condition ρ− < −r ρ M with ρ M ≥ 0. See also the
restatement of this theorem on page 76 and the list of remarks 3.3 for more details.
We borrow the idea to generalize the Perron method to NHIMs from Henry
[Hen81], and use the techniques of Vanderbauwhede and Van Gils [VG87] (see
[Van89] for a clear presentation) for proving higher order smoothness. This is sim-
ilar, but developed independently from Sakamoto’s work [Sak90] in which he used
the same ideas to study singular perturbation problems. We improve these results in
a couple of ways. First of all, we simplify the basics of the proof by reducing the
24 1 Introduction
17 If the perturbation is time-dependent, but in an (almost) periodic way, then this can still be treated
in the compact setting. One can extend the configuration space with the circle S 1 (or an n-torus in
the almost periodic case).
1.6 Problem Statement and Results 25
make any C 1 small perturbation, and obtain a persistent manifold M̃ in the extended
configuration space. The perturbation is allowed to be generally time-dependent, as
long as it is uniform in time, including derivatives. The resulting manifold M̃ will
still be invariant and close to the original M, although it will depend on time. That
is, if we assume local coordinates (x, y) ∈ Rn × Rm for Q such that M = Rn × {0}
locally, then we can write M̃ = Graph(h) for a function
h : Rn × R → Rm , y = h(x, t).
In other words, M̃ can be viewed as a graph over M (i.e. a section of the normal
bundle), but this graph now additionally depends on time. The manifold M̃ itself
is again normally hyperbolic when viewed in the extended space Q × R, see also
Sect. 4.1.
Such time-dependent invariant manifolds are called ‘integral manifolds’. These
have been studied as non-autonomous generalizations of stable and unstable man-
ifolds of hyperbolic fixed points [Pal75], but also as generalizations of compact
NHIMs [Hal61, Yi93]. The theory of noncompact NHIMs allows one to treat all
such integral manifolds in the same way as the autonomous case. One can, for exam-
ple, also start with an integral manifold that is normally hyperbolic: it will persist
just as well.
In the main Theorem 1.1, we intentionally do not precisely state in what sense M is
a submanifold of Q. The implicit assumption that M is an embedded submanifold
can be weakened to M being an immersion, see also [HPS77, Sect. 6] and [BLZ99].
That is, M can be viewed as an abstract manifold together with an immersion map
ι : M → Q that need not be injective. This does not affect the theory as long as ι is
still locally injective: ι(M) including a neighborhood modeled on its normal bundle
N can be pulled back via the immersion ι to the abstract M. All local properties
are preserved, so we can study the system via this ‘covering’. We may not always
make a clear distinction between the abstract manifold M and its immersed image
ι(M) ⊂ Q; the discussion below shows that this distinction is not really necessary,
as long as we do not consider perturbations.
For a generic immersion one could expect a picture as in Fig. 1.6, where the
immersed manifold intersects itself transversely. Such situations cannot occur if M
is a NHIM. This follows from the exponential growth rates along tangent and normal
bundles of M. Let m ∈ ι(M) be an intersection point of two preimages m 1 , m 2 ∈ M.
If the tangent spaces along M at m
1 and m 2 are embedded
differently into Tm Q,
then one could find x ∈ Im Dι(m 1 ) \ Im Dι(m 2 ) . This would imply that x has a
component in Nm 2 and give contradictory growth rates for Dt (m) · x depending
on whether we view m as image of m 1 or m 2 , as the orbit of m ∈ ι(M) is uniquely
defined. Hence, at each point m ∈ ι(M) the tangent spaces Dι(m i ) of all preimages
26 1 Introduction
Dι (m2 )
m i ∈ ι−1 (m) must coincide, see Fig. 1.7. Stated more abstractly, M must have contact
of order one with itself. More generally it holds that an immersed k-NHIM has contact
of order k with itself,18 see [HPS77, p. 68].
Next, each maximal set of ι(M) with constant number of preimages19 p ∈ N ∪
{∞},
M p = {m ∈ ι(M)|# ι−1 (m) = p}, (1.12)
is an invariant subset of ι(M). This is again due to uniqueness of the flow. If an orbit
would cross into a set of different preimage number, then a least one of the ‘lifts’ of
this orbit from ι(M) to the ‘cover’ M would have to enter or leave M. This cannot
happen as M itself is invariant. Hence, the conclusion is that self-intersections of
ι(M) must be invariant.
Immersed NHIMs may occur on themselves, or appear as a persistent manifold
under perturbation from an embedded manifold. An example of an embedded non-
compact NHIM that collapses under a small perturbation into an immersed manifold
can be found in Sect. 3.3. The same can happen with an immersed manifold with
compact image. The following example is taken from [HPS77, p. 130] and shows
that the injection map is relevant for how the NHIM persists. Note that this example
exhibits a Shil’nikov bifurcation, see Remark 1.15 below.
Example 1.13 (Perturbation of a compact non-injectively immersed NHIM) We con-
sider on R3 the vector field
18 The order of contact is defined as the degree up to and including which the Taylor expansions
of the objects agree.
19 The number of preimages must be countable if M is assumed to be second-countable.
1.6 Problem Statement and Results 27
ẋ = arctan(x 2 ) + ε,
ẏ = y,
ż = −z
and smoothly modify it outside the cylinder y 2 + z 2 = 1 such that it flows in the
negative x-direction and connects the basin of repulsion of the origin intersected with
x > 0 to the basin of attraction intersected with x < 0. The perturbation parameter
ε is initially set to zero.
Note that the x-axis is a NHIM (the arctangent is there to keep the vector field and
tangential growth rate bounded). Due to the modification, the two loops in Fig. 1.8
are also NHIMs of this system, both separately and their union. They start from the
origin along the positive x-axis, then diverge from it in opposite directions in the
x y-plane; once outside the cylinder y 2 + z 2 = 1 they start moving into the negative
x direction and finally return to the origin approximately along the x z-plane.
We can parametrize their joint image with an injection ι1 mapping M = {0, 1}×S 1
separately onto the two loops, but we can also parametrize with ι2 that maps M = S 1
onto the full figure eight image. If we perturb to ε > 0, then ι1 will result in Fig. 1.9
where the two loops are separated, while ι2 will result in Fig. 1.10 which has one
loop, but the middle of the figure eight does not intersect anymore. Figure 1.11 shows
how the two orbits from the separate loops closely pass the x–axis along hyperbolic
trajectories. The single orbit of ι2 follows hyperbolic trajectories through the other
two quadrants.
Remark 1.14 Note that these different persistent NHIMs do not contradict the
uniqueness property of persistence, since the (abstract) manifolds M were differ-
ent to begin with. Formulated differently, if we consider the universal cover of the
tubular neighborhood of ι1 (M) (deduplicating the origin as image point), then Fig. 1.9
shows the unique invariant manifold that stays in this tubular neighborhood cover.
We obtain a different persistent NHIM for any prescribed (possibly infinite) sequence
of concatenating the two loops of the original figure eight into an immersion from
S 1 (or R if the sequence is infinite).
Remark 1.15 This example shows a Shil’nikov bifurcation of a saddle-saddle node
and provides an alternative proof of the result of Shil’nikov [Šil69, Thm. 3] that for
every bi-infinite encoding of the homoclinic loops there exists a unique orbit close
to the original homoclinics. We encode the bi-infinite sequence in the immersion ι of
a NHIM. For each ι there exists a unique persistent manifold, and these correspond
to separate orbits after perturbation. In particular, the periodic orbits correspond to
1.6 Problem Statement and Results 29
those bi-infinite encodings that are actually periodic and there are countably infinitely
many of these. ♦
Finally, we present an example of an injectively immersed (but not embedded)
NHIM, see [HPS77, p. 68]. The mapping below is known as Arnold’s cat map.
Example 1.16 (Injectively immersed dense line in the torus) The matrix
21
A=
11
In this work the topologies for spaces of vector fields, submanifold embeddings, et
cetera, are (implicitly) defined by norms and distance functions. The norms we use are
uniform C k -norms for bounded functions, and families with additional exponential
growth rates. Let us call the topologies induced by these norms Cbk -topologies and
consider how they compare to two common topologies: the weak and strong Whitney
topologies for maps between manifolds, alternatively known as the compact-open
and fine topology, see [Hir76].
The weak topology has a subbasis generated by the set of functions g that are
close to some function f in C k -norm on compact subsets in local coordinate charts.
This means that for example the function family
f δ : R → R : x → δ exp(x 2 )
converges to zero for δ → 0 in this topology. On any compact set f δ will become
arbitrarily small when δ → 0 while it does not converge in uniform norm (nor with
additional exponential growth rate). Hence the weak topology is weaker than our
induced Cbk -topologies.
The strong topology has as basis all sets of functions g that are close to some
function f on a locally finite cover by compact sets K i , where g must approximate
f in C k -norm on each K i in local coordinates up to a given chart-dependent size εi .
For any function without compact support, a collection εi > 0 can be found that
converges faster to zero on each larger K i than the function to zero when x → ∞.
Hence the only sequences of functions R → R that converge to the zero function in
the strong topology are those with (eventually) compact support. A family f δ of func-
tions with noncompact support cannot converge to the zero function, as can be seen
by using a diagonal argument. The family f δ (x) = δ exp(−x 2 ), for example, does
not converge to the zero function in the strong topology. Given a locally finite cover
of R by compact sets K i , we choose xi ∈ K i and corresponding εi = exp(−xi2 )/i.
Then for any given δ > 0, we will have | f δ (xi )| > εi for some large i. On the
other hand, this family f δ obviously converges under the uniform norm with any
exponential growth rate. Thus, the strong topology is stronger than our induced Cbk -
topologies, see also the remark in [GG73, p. 43] for noncompact manifolds.
We conclude that the Cbk -topologies induced by our uniform norms are not equiva-
lent to either the weak or strong Whitney topology, because the weak topology allows
arbitrary behavior of functions outside compact sets, while the strong topology com-
pletely restricts that behavior. Our norms allow moderate variations at infinity. In
general, ‘moderate behavior’ is not well-defined on a general noncompact manifold,
as it depends on the choice of charts. In the setting of bounded geometry, though,
1.7 Induced Topology 31
the uniform, metric structure makes this behavior unambiguous; we can restrict to
normal coordinate charts and consider ‘moderate behavior’ with respect to these.
Note that these topologies are equivalent on compact domains.
1.8 Notation
Here, we will establish some notation and conventions to be used throughout this
work. See the index for more specific symbols.
• The letters I and J will denote intervals in R; I will typically represent an interval
that is unbounded on one side, while J will be bounded.
• ε, δ > 0 will denote (small) bounds for continuity-like estimates; C > 0 will
denote arbitrary bounds. The specific meaning of these symbols will vary depend-
ing on context. ε f (δ) will denote a uniform continuity modulus of the function f ,
that is, ε f : R≥0 → R≥0 satisfies
• The D denotes a total derivative, while Di with index i ∈ N denotes a partial deriva-
tive with respect to the i-th argument, or, when a subscript symbol is appended, say
Dx , then this denotes a partial derivative with respect to the argument commonly
referred to by that symbol.
• We use the following symbols to denote classes of function spaces:
Cb bounded, continuous functions;
Cb,u bounded, uniformly continuous functions;
Ck k times continuously differentiable functions;
C k,α C k functions with α-Hölder continuous k-th derivative. We will conventionally write
k,α
r = k + α ∈ R≥1 ; the Hölder estimates are assumed to be uniform in Cb,u spaces.
L continuous, i.e. bounded, (multi)linear operators;
X vector fields;
sections of a fiber bundle.
Unless otherwise specified, Cbk and Cbk,α spaces will be endowed with the canonical
norms that turn these into Banach spaces, that is,
k
n D f (x2 ) − Dk f (x1 )
f k,α = sup D f (x) + sup . (1.14)
x x2 =x1 d(x2 , x1 )α
0≤n≤k
particularly if we want to stress that this family satisfies some properties uniformly
in x.
• We use the notation B(x; δ) not only to indicate open balls of radius δ around a
single point x, but also B(M; δ) to indicate a (tubular) neighborhood of some set
or submanifold M, that is,
We also state here some basic facts about uniformly Hölder continuous functions.
Lemma 1.19 (Product rule for Hölder continuity) Let f, g ∈ Cb,u α be defined on
α .
spaces such that the product f · g is well-defined. Then also f · g ∈ Cb,u
Proof Let f 0 , g0 ≤ M and let C f,α , C g,α be the respective Hölder coefficients
of f, g. Then we have for all x1 = x2
f (x2 ) g(x2 ) − f (x1 ) g(x1 ) ≤ f (x2 ) g(x2 ) − g(x1 ) + f (x2 ) − f (x2 ) g(x2 )
≤ M (C f,α + C g,α ) x − yα ,
which exhibits the Hölder coefficient M (C f,α + C g,α ) for the product, and f · g is
clearly bounded by M 2 .
Proof Let M be the bound on f , and Cα its α-Hölder coefficient. For x2 − x1 ≤ 1
the estimate for β follows automatically from that of α. For x2 − x1 > 1 we use
boundedness to obtain
Typographical Conventions
As usual we close proofs with the symbol , while we shall use ♦ and to denote
the end of (a series of) remarks or examples, respectively.
Chapter 2
Manifolds of Bounded Geometry
We follow the definition in [Eic91] to introduce bounded geometry. Recall that the
injectivity radius rinj (x) at a point x ∈ M is the maximum radius for which the
exponential map at x is a diffeomorphism, see also Appendix F.
Definition 2.1 (Bounded geometry) We say that a complete, finite-dimensional
Riemannian manifold (M, g) has k-th order bounded geometry when the following
conditions are satisfied:
(I) the global injectivity radius rinj (M) = inf rinj (x) is positive, rinj (M) > 0;
x∈M
(Bk ) the Riemannian curvature R and its covariant derivatives up to k-th order are
uniformly bounded,
1 This is a noncompact surface with constant negative curvature, hence it cannot be isometrically
embedded into R3 , see [Hil01]. The embedding is nearly isometric for x
0 though, so the figure
is still a good representation there.
2.1 Bounded Geometry 37
is a set of local isomorphisms that acts transitively on M. That is, for any two
points (x1 , θ1 ), (x2 , θ2 ) ∈ M there exist ξ, α and a neighborhood U (x1 , θ1 ) such
that ϕξ,α : U → ϕξ,α (U ) is an isomorphism and ϕξ,α (x1 , θ1 ) = (x2 , θ2 ). For any
(x, θ) ∈ U and ν, w ∈ T(x,θ) M we have
∗
(ϕξ,α g)(x,θ) (ν, w) = g(x+ξ,eξ θ+α) Dϕξ,α (x, θ ) ν, Dϕξ,α (x, θ ) w
= dx(ν)dx(w) + e−2(x+ξ ) eξ dθ (ν) eξ dθ (w)
= g(x,θ) (ν, w)
so ϕξ,α∗ g = g on U . Since the curvature and its derivatives are locally determined,
this implies that these are constant across M, hence uniformly bounded (actually all
derivatives of R vanish). Note that these local isometries do not imply a finite global
injectivity radius since the size of the neighborhood U does depend on the points
(x 1 , θ1 ), (x2 , θ2 ) ∈ M. ♦
Example 2.3 (Manifolds of bounded geometry) The following are examples of man-
ifolds with bounded geometry of any (i.e. infinite) order.
• Euclidean space with the standard metric trivially has bounded geometry.
• A smooth, compact Riemannian manifold M has bounded geometry as well; both
the injectivity radius and the curvature including derivatives are continuous func-
tions, so these attain their finite minimum and maxima, respectively, on M. If
M ∈ C k+2 , then it has bounded geometry of order k.
• Noncompact, smooth Riemannian manifolds that possess a transitive group of
isomorphisms (such as hyperbolic space) have bounded geometry since the finite
injectivity radius and curvature estimates at any single point translate to a uniform
estimate for all points under isomorphisms. Note that the example in Remark 2.2
above shows that it is not sufficient to have local isometries.
More manifolds of bounded geometry can be constructed with these basic building
blocks in the following ways.
• The product of a finite number of manifolds of bounded geometry again has
bounded geometry, since the direct sum structure of the metric is inherited by
the exponential map and curvature. We give an outline of the proof. In a product
coordinate chart
(ϕ1 , ϕ2 ) : U1 × U2 → Rn 1 × Rn 2
and ijk denote the Christoffel symbols on M, while the dvi are one-forms on the
manifold TU .
Bounded geometry of (M, g) is not inherited by (TM, ĝ) since the extended Rie-
mannian curvature R̂ contains unbounded terms v when expressed in terms of
R, see [GK02,
Prop. 7.5]. These expressions
do readily show that the restriction
Tr M = (x, v) ∈ TM | gx (v, v) ≤ r 2 satisfies curvature bounds of order k − 1 if
(M, g) has k-bounded geometry. The geodesic flow equation is given in induced
coordinates by [Sas58, Eq. (7.7)]. By application of Theorem A.6, one can then
show that the injectivity radius is bounded.
Note that Tr M is a manifold with boundary, but this is not problematic in our setting
as long as the invariant submanifold stays away from the boundary. Alternatively
one could try to use results from [Sch01].
When we say that a manifold has bounded geometry without specifying the order
k, then it is assumed that the order is infinite, k = ∞, or sufficiently large. When
k ≥ 1 we have the following result, see [Eic91, Thm. 2.4 and Cor. 2.5]. In case
k = ∞ the converse also holds [Roe88, Prop. 2.4].
2.1 Bounded Geometry 39
Proof Let δ < 21 rinj (M) and x ∈ M. We consider a normal coordinate chart ϕx on
B(x; 2δ). According to Theorem 2.4, the metric g and its derivatives are bounded
in normal coordinates. We have (exp∗x g)(0) = gx , the Euclidean inner product on
Tx M, while the total derivative D(exp∗x g)(ξ ) is bounded on B(0; 2δ) ξ , say by
D(exp∗x g)(ξ ) ≤ C1 , independent of x ∈ M. By the mean value theorem this
induces the uniform bounds
1 − 2δ C1 ≤ (exp∗x g)(ξ ) ≤ 1 + 2δ C1 .
Let x1 , x2 ∈ B(x; δ) and let γ E be the straight curve between ϕx (x1 ) and ϕx (x2 )
in Tx M parametrized by arc length. This curve γ E attains the Euclidean distance
l E (γ E ) = d E (ϕx (x1 ), ϕx (x2 )). On the other hand, it gives an upper bound on the
metric distance
40 2 Manifolds of Bounded Geometry
l E (γ E )
d(x1 , x2 ) = inf l(γ ) ≤ (exp∗x g)γ E (t) γ E (t), γ E (t) dt
γ
0
≤ 1 + 2δ C1 d E ϕx (x1 ), ϕx (x2 ) .
From here on, we shall frequently represent objects living in B(x; δ) ⊂ M on nor-
mal coordinate neighborhoods B(0; δ) ⊂ Tx M via the normal coordinate chart ϕx .
We will mostly use B(x; δ) to clearly indicate the base point, or B(0x ; δ) ⊂ Tx M
to stress the tangent space domain of the coordinates as well. In spaces of bounded
geometry, normal coordinate charts are the natural charts to works in and coordinate
transition maps are not just smooth, but uniformly bounded, as stated in the following
lemma.
Lemma 2.6 (Boundedness of transition maps) Let (M, g) be a Riemannian man-
ifold of k-bounded geometry with k ≥ 2. There exists a δ with 0 < δ < rinj (M) and
constants C, L > 0 such that for all x1 , x2 ∈ M with d(x1 , x2 ) < δ the following
holds.
1. The coordinate transition map
Remark 2.7 One degree of smoothness is lost because the exponential map is defined
in terms of the geodesic flow. This flow in turn is defined in terms of the Christoffel
symbols, which depend on derivatives of the metric, so these are only C k−1 bounded.
We lose another degree of smoothness in estimating ϕ2,1 −(γ2,1 ) since the Lipschitz
estimate follows from a uniform bound on one higher derivative of these. ♦
We shall first compare both ϕ2,1 and (γ2,1 ) to the identity in normal coordinates
and finally conclude with the triangle inequality that their difference must be small.
We compare ϕ2,1 to the parallel transport (γ2,1 ) since this is the most natural way
to identify the tangent spaces Tx1 M and Tx2 M.
Proof Let B(x1 ; δ), B(x2 ; δ) be two normal coordinate neighborhoods with non-
empty intersection. The coordinate transition map ϕ2,1 = ϕ2 ◦ ϕ1−1 = exp−1 x2 ◦ exp x1
can be studied as the exponential map expx1 : Tx1 M → M in normal coordinates
on B(x2 ; δ), since ϕ2 = exp−1 x2 . From here on, we will implicitly be working in
normal coordinates around x2 , using some choice of basis to isometrically identify
Tx2 M ∼= Rn .
Let x ∈ B(x1 ; δ) ∩ B(x2 ; δ), hence x1 ∈ B(x2 ; 2 δ). We choose δ ≤ 1, and small
enough so that the results of Theorem 2.4 and Proposition 2.5 (with C = 2) hold for
2 δ. The exponential map is given by the time-one geodesic flow projected on the
base manifold. For the base point x2 , this is the identity map, while for the base point
x1 we will show that it is a small perturbation thereof. The geodesic flow on TM is
given in local coordinates by
ẋ i = vi , (2.5)
v̇ = −
i i
jk (x) v v ,
j k
where ijk denote the Christoffel symbols with respect to the coordinates x i on M
and the v j are induced additional coordinates on TM, see the explanation above (2.1).
The Christoffel symbols are C k−1 bounded due to Theorem 2.4. Let ϒ t denote the
geodesic flow of (2.5) on TM restricted to B(x2 ; 2 δ). We denote by (x(t), v(t)) a
solution curve of ϒ t . The geodesic flow preserves the length of tangent vectors with
respect to the metric g, so we have v(t) ≤ 2 v(0) ≤ 2 δ with respect to the
Euclidean distance in the normal coordinates. This implies that the vector field (2.5)
is bounded in these induced coordinates. Hence, by Theorem A.6, ϒ t ∈ Cbk−1 is
bounded as well on the interval [0, 1]. Moreover, Dϒ t ∈ Cbk−2 exhibits a Lipschitz
estimate for the base point dependence ϕ2 (x1 ) E . By Proposition 2.5 the local
Euclidean distance is equivalent to the distance on M, so ϕ2 (x1 ) E ≤ 2 d(x1 , x2 ).
These conclusions directly translate to expx ( · ) = π ◦ ϒ 1 (x, · ) and we conclude
that ϕ2,1 = exp−1 k−1
x2 ◦ exp x1 ∈ C b with bound C > 0 uniform in x1 , x2 ∈ M and
ϕ2,1 − 1k−2 ≤ L d(x1 , x2 ) for some L > 0.
42 2 Manifolds of Bounded Geometry
The parallel transport (γ2,1 ) is given by integrating the pullback of the con-
nection along γ2,1 . This yields a differential equation similar to (2.5) and similarly
leads to C k−1 boundedness estimates in normal coordinates and Lipschitz estimates
for the C k−2 -norm. Thus, the difference ϕ2,1 − (γ2,1 ) is C k−1 bounded, and has
C k−2 -norm that satisfies the Lipschitz estimate (2.4) for some L > 0.
Definition 2.8 (M-small coordinate radius) Let (M, g) be a Riemannian manifold
of bounded geometry. We define δ > 0 to be M-small if Theorem 2.4 and Lemma 2.6
hold on all normal coordinate charts of radius δ.
Note that such a δ > 0 always exists. From now on, we shall always assume to have
selected such a δ for any given manifold of bounded geometry and restrict its atlas
to include these normal coordinate charts only.
Lemma 2.6 shows that normal coordinate transformations respect C k boundedness
of functions in coordinate representations. Thus, it is natural to consider manifolds
of bounded geometry as the class of C k bounded manifolds with respect to this
restricted atlas. This also makes the following definition natural.
Definition 2.9 (Ck bounded maps) Let X, Y be Riemannian manifolds of k + 1-
bounded geometry and f ∈ C k (X ; Y ). We say that f is of class Cbk when there
exist X, Y -small δ X , δY > 0 such that for each x ∈ X we have f (B(x; δ X )) ⊂
B( f (x); δY ) and the representation
f˜x = exp−1
f (x) ◦ f ◦ expx : B(0; δ X ) ⊂ Tx X → T y Y (2.6)
in normal coordinates is of class Cbk and the associated C k -norms of f˜• are bounded
k (X ; Y ) and C k,α (X ; Y ) functions
uniformly in x ∈ X . We define the classes of Cb,u b,u
analogously when X, Y are of k +2-bounded geometry.
Remark 2.10 We shall say that a vector field v ∈ X(X ) is of class Cbk , also denoted
by v ∈ Xkb (X ), when v ∈ Cbk with respect to coordinates on TM induced by normal
coordinates on M. This is slightly different from normal coordinates on TM induced
by the metric (2.1). Note that since v ≤ r is assumed bounded, we could restrict
to the submanifold Tr M of bounded geometry and consider v ∈ Cbk (M; Tr M), but
this is less practical.
Remark 2.11 The manifolds X, Y need to have bounded geometry of one or two
degrees higher than the smoothness of the maps to preserve boundedness and uniform
continuity estimates under normal coordinate transformations. This shall from now
on always be an implicit assumption.
Remark 2.12 (Locally/globally defined continuity modulus) The continuity modulus
ε f of a function f ∈ Cb,u
k (X ; Y ) is only defined on the interval
[0, δ X ) ⊂ R. On the
other hand, D f (x) is globally well-defined in terms local charts and assumed to
k
∼ ∼
ϕ : Tx1 X → Tx2 X and ψ : T f (x1 ) Y → T f (x2 ) Y,
Note that the right-hand expression does not depend on the choice2 of isomorphisms.
Thus, with such isomorphisms at hand, we can use (2.7) to heuristically extend
the local to a global continuity modulus. That is, for nearby points x1 , x2 we use
an estimate in terms of local charts; if this is not possible, then the points must be
separated by a distance larger than a δ as in Definition 2.8. Since the functions we
consider are globally bounded, we then use some (non-canonical) choice to identify
the vector bundle fibers over x1 , x2 that the function lives in and estimate by the
right-hand side of (2.7). This estimate is crude but independent of the choice of
k,α
identification and will always satisfy our needs. For example, if f ∈ Cb,u (X ; Y ),
with Hölder coefficient Cα locally for d(x1 , x2 ) ≤ δ then we have
Cα d(x1 , x2 )α if d(x1 , x2 ) < δ,
D f (x2 ) − D f (x1 ) ≤
k k
2 f k α else.
δα d(x ,
1 2x )
2f
This shows that we can heuristically consider max Cα , δ α k as a global Hölder
coefficient. ♦
The following proposition shows that we may measure continuity of the deriva-
tives of a function f using local parallel transport. With the remark above we see how
it can be extended to a global continuity modulus if a (non-unique) choice is made
for how to connect non-close points x1 , x2 by a path; this idea will be developed in
Sect. 3.7.4.
Proposition 2.13 (Equivalence of continuity moduli) Let X, Y be Riemannian
manifolds of bounded geometry and f ∈ Cbk (X ; Y ). Then the following statements
are equivalent:
k,α
1. f ∈ Cb,u (X ; Y ) according to Definition 2.9;
2. we have the continuity estimate
2 In practice, we shall use isomorphisms defined by parallel transport onX = Y , cf. Proposition 2.13.
This is a non-canonical choice, since it depends on the path connecting x1 , x2 . A canonical choice
that depends continuously on x1 , x2 cannot be made in general, since it would imply that the tangent
bundle is trivializable.
44 2 Manifolds of Bounded Geometry
where (η2,1 ) and (γ2,1 )⊗k denote parallel transport along the unique short-
est geodesic between f (x1 ), f (x2 ) and x1 , x2 , respectively, and ε f, denotes a
uniform or α-Hölder continuity modulus.
Proof We first prove the statement in case Y is a normed linear space, hence no
parallel transport term (η2,1 ) appears.
Let δ0 ≤ δ X as in Definition 2.9 (thus, in particular δ0 is X -small), and let
d(x1 , x2 ) ≤ δ0 . Then we have the Lipschitz estimate ϕ2,1 −(γ2,1 ) ≤ L d(x1 , x2 )
while the normal coordinate representations (2.6) of f at x1 , x2 are related by f˜x1 =
f˜x2 ◦ ϕ2,1 . This leads to
k−1
+ Dl f˜x2 (ϕ2 (x1 )) Pl,k D• ϕ2,1 (0)
l=1
k
k−1
≤ ε f (d(x2 , x1 )) + f k L d(x1 , x2 ) + f l Pl,k D• ϕ2,1 (0) ,
l=1
where ε f denotes the continuity modulus of f and its derivatives according to Def-
inition 2.9, and the Pl,k denote (l, k)-linear maps according to Proposition C.3. We
used the fact that both (γ2,1 ) and Dϕ2,1 (0) act on the k-tensor bundle as a k-tuple
of copies. By assumption f k is bounded, and to estimate the Pl,k terms, we note
that l < k, so each of the Pl,k contains at least a factor Di ϕ2,1 (0) with i ≥ 2. Since
ϕ2,1 is close to (γ2,1 ) and Di (γ2,1 ) = 0 for i ≥ 2, it follows that
Pl,k D• ϕ2,1 (0) ≤ C L d(x1 , x2 )
for some constant C independent of x1 , x2 . This shows that the continuity modulus
ε f, of (2.8) can be estimated by the continuity modulus ε f plus additional Lipschitz
terms. We can reverse the estimates above to arrive at the same conclusion when
expressing ε f in terms of ε f, . Hence, the continuity statements are equivalent for
any α ≤ 1.
If Y is a Riemannian manifold of bounded geometry, we just apply the same
estimates in the codomain. To this end, we must have d( f (x2 ), f (x1 )) ≤ δY , so we
choose δ0 small enough that
2.1 Bounded Geometry 45
such that if we have a transition function ϕ2,1 = τ2 ◦τ1 −1 between two trivializations
on B(m 1 ; δ) and B(m 2 ; δ), then the function g : B(m 1 ; δ) ∩ B(m 2 ; δ) → L(F)
defined by ϕ2,1 (m, f ) = g(m) f satisfies g ∈ Cbk independent of the points
m 1 , m 2 ∈ M.
Remark 2.15 Note that we could have replaced B(m; δ) by arbitrary (preferred)
coordinate charts. The relevant property is that we express uniformity of the transition
functions in terms of uniformity of the function g with respect to the underlying
coordinate charts of M, which are normal coordinates in our case. ♦
It follows from Lemma 2.6 that the tangent bundle TM has bounded geometry of
order k −2 if (M, g) has bounded geometry of order k ≥ 2. One order of smoothness
is lost (beyond the one expected) as noted in Remark 2.7.
We introduce the concept of a uniformly locally finite cover of a manifold of
bounded geometry. This is a natural extension of a locally finite cover. Uniformity
means that we require a global bound K on the number of sets in the cover that
intersect any small open ball.
Lemma 2.16 (Uniformly locally finite cover) Let (M, g) be a Riemannian mani-
fold of bounded geometry.
Then for δ2 > 0 small enough and any 0 < δ1 ≤ δ2 , M has a countable cover
B(xi ; δ1 ) i≥1 such that
1. ∀ i = j : d(xi , x j ) ≥ δ1 ;
2. there exists an explicit global bound K such that for each x ∈ M the ball B(x; δ2 )
intersects at most K of the B(xi ; δ2 ).
Note that the second result implies both that the cover is locally finite with fixed
neighborhood size, and that each set in the cover overlaps with at most K others, cf.
Lebesgue covering dimension.
Proof Using Proposition 2.5, choose δ > 0 such that Euclidean distance in normal
coordinates on each B(x; δ) is C = 2 equivalent to the metric distance and set
δ2 ≤ δ/3.
46 2 Manifolds of Bounded Geometry
(3 C δ2 )n
δ n
2
K ≤ = 24 . (2.10)
(δ1 /(2 C)) n δ1
Thus, only finitely many can intersect B(x; δ2 ). These estimates are uniform and do
not depend on x ∈ M so the bound K is global.
Lemma 2.17 (Uniform partition of unity) Let M be a manifold with a uniformly
locally finite cover with δ1 < δ2 and δ2 sufficiently small, as per Lemma 2.16.
Then there exists a partition of unity by functions χ• ∈ Cb,u k (B(x ; δ );
i 2 [0, 1])
subordinate to this cover.
We shall also apply this lemma to submanifolds which have a uniformly locally finite
cover due to Corollary 2.26 on page 54.
Proof Let δ2 be small enough that by Lemma 2.6 coordinate transition maps are
k . Define a standard radially symmetric smooth bump function ϕ ∈ C ∞ (Rn ;
Cb,u
[0, 1]) that is identically one on B(0; δ1 ) and has compact support in B(0; δ2 ), hence
k . We set ϕ = ϕ ◦ exp−1 by isometric identification T M ∼ Rn and zero
ϕ ∈ Cb,u i xi xi =
outside B(xi ; δ2 ). We have ϕ• ∈ Cb,u in any coordinate patch. Define in the usual
k
way
χi = ϕi ϕn . (2.11)
n≥1
The sum is finite as at most K of the B(xn ; δ2 ) overlap any B(xi ; δ2 ). The balls
B(xi ; δ1 ) already cover M, so the denominator is at least one, from which it follows
that χ• ∈ Cb,uk .
Corollary 2.18 Similar to a uniform partition of unity, we can construct a partition
by functions χ• ∈ Cb,u
k (B(x ; δ );
i 2 [0, 1]) whose squares sum to one.
To prove smoothness of the persistent manifold in Sect. 3.7, we shall want to estimate
the holonomy along closed loops to be close to the identity, that is, if c is a closed
loop, then we want (c) − 1 to be small. To this end, we relate the holonomy to the
curvature and finally obtain an estimate in terms of a global bound on the curvature
and the area of a surface enclosed by c.
The result that curvature is the generator of holonomy dates back at least to
Ambrose and Singer [AS53] who formulated this in differential form in the 1950’s;
they cite an even older statement (without proof) by Élie Cartan [Car26]. More recent
work by Reckziegel and Wilhelmus [RW06] shows explicit integral formulas for this
relation, formulated on fiber bundles, a context far more general than is required here.
We shall present a formulation for Riemannian manifolds (M, g).
Let denote the parallel transport functional, which takes C 1 curves to orthogonal
maps between the tangent spaces at their endpoints, see (F.3). If c is a closed loop,
then (c) is a linear endomorphism on Tc(0) M and we can measure (c) − 1.
Our goal is to bound this quantity by the integral of the curvature form R over a
surface with boundary precisely c. This result can be viewed as a generalization
of Stokes’ theorem where the curvature is the exterior derivative of the connection
form ω, while the connection on the other hand generates parallel transport along
the boundary of the surface A that the curvature is integrated over. Note though,
that we actually have R = dω + ω ∧ ω, so there is an additional term due to the
noncommutativity of the connection form.
Let
γ : D = 0, t¯ × [0, s̄] → M : (t, s) → γ (t, s) (2.13)
3 If the frame f is induced by local coordinates, then ω will precisely be given by the Christoffel
symbols and we recover Eq. (F.4).
48 2 Manifolds of Bounded Geometry
d
f (c|s0 ) = −ω(ċ(s)) ◦ f (c|s0 ), f (c|00 ) = 1Rn , (2.14)
ds
which has a unique solution s → f s,0 = f (c|s0 ). This can be viewed as time-
dependent flow in End(Rn ).
Let us define the parallel transport term
P(s) = s̄,s
t¯
◦ s0,t¯ ◦ s,0
0 : Tγ (0,0) M → Tγ (t¯,s̄) M, (2.15)
Expressing everything with respect to the frame f , we see that the first and last factor
of P(s) are easily differentiated using (2.14):
d s,0 ∂ d ∂
f 0 = −ω( ) ◦ f s,0 and s̄,s s̄,s
f t¯ = f t¯ ◦ ω( ). (2.17)
ds ∂s 0
ds ∂s
The middle term f st¯,0 can be differentiated by viewing s as parameter in the differ-
ential Eq. (2.14). Variation of constants yields (see e.g. [DK00, App. B] for a proof
of the differentiable dependence of a flow on parameters)
t¯
d d ∂
f t¯,t = f t¯,t ◦ − ω( ) ◦ f st,0 dt
s s
ds 0 ds ∂t
t¯ ∂ ∂ d ∂ ∂ ∂
= − f st¯,t ◦ dω( , ) + ω( ) + ω( , ) ◦ f st,0 dt
0 ∂s ∂t dt ∂s ∂s ∂t
(t , s)
(0, 0)
2.2 Curvature and Holonomy 49
∂ ∂
using standard rules for exterior derivatives. Next we note that
∂ ∂s , ∂t = 0, and
integrate by parts the term dtd ω( ∂s )
t¯ ∂ ∂ ∂ ∂ ∂ ∂
= − f st¯,t ◦ − ω( ) ◦ ω( ) + dω( , ) + ω( ) ◦ ω( ) ◦ f st,0 dt
0 ∂t ∂s ∂s ∂t ∂s ∂t
∂ ¯
t
− f st¯,t ◦ ω( ) ◦ f st,0
∂t t=0
t¯
∂ ∂ ∂ ∂
= f t¯,t ◦ (dω + ω ∧ ω)
s
, ◦ f st,0 dt − ω( ) ◦ f st¯,0 + f st¯,0 ◦ ω( ).
0 ∂t ∂s ∂s ∂s
(2.18)
The integrand on the last line is a two-form on D with values in L(Tγ (0,0) M;
Tγ (t¯,s̄) M). This final expression is clearly independent of a choice of frame, so we
have recovered an explicit integral formula relating holonomy along a null-homotopic
loop to the curvature.
We conclude from (2.19) that if c is a closed, null-homotopic loop, and the cur-
vature globally bounded, then (c) − 1 can be estimated by Rsup times the
surface area of any null-homotopy γ of c. Note that we do not require γ to be
an embedding; the integral is intrinsically defined on D by pullback. Furthermore,
γ is required to be C 1 only. This follows from the fact that both sides of the equation
are continuous with respect to γ in C 1 -norm; alternatively, an explicit calculation
∂2 γ
requires that the mixed partial derivative ∂s ∂t is continuous to perform integration
by parts. Both lead to the to the following result.
Lemma 2.19 (Exponential growth bound on holonomy) Let (M, g) be a man-
ifold of bounded geometry with normal coordinate radius δ that is M-small as in
Definition 2.8. Fix T > 0 and ρ > 0 and let x1 , x2 be two C 1 curves on M with
derivatives bounded by N such that dρ (x1 , x2 ) eρ T ≤ δ < rinj (M). Denote by γt the
unique shortest geodesic connecting x1 (t) to x2 (t) for any t ∈ [0, T ].
50 2 Manifolds of Bounded Geometry
If δ is sufficiently small, then the closed loop η = x2 |0T ◦ γT ◦ x1 |0T ◦ γ0−1 satisfies
the holonomy bound
eρ T
(η) − 1 ≤ C̃ R0 N dρ (x1 , x2 ) (2.20)
ρ
Since expx is a local diffeomorphism at least for d(x1 (t), x2 (t)) < δ eρ t < rinj (M),
that depends smoothly on x, it follows that (s, t) → γt (s) defines a homotopy
between the curves x1 , x2 restricted to the interval [0, T ]. The map γt (s) has contin-
uous mixed derivatives with respect to s, t (even though the double derivative with
respect to t does not exist since x1 , x2 ∈ C 1 only), so integration by parts is allowed
in (2.18).
We estimate the surface area mapped by γt (s). We use shorthand notation
ξ = s exp−1x1 (t) (x 2 (t)) ∈ Tx1 (t) M and denote by Dx expx the derivative of the expo-
nential map with respect to the base point parameter x. Then
d
γt (s) =D expx1 (t) (ξ ) · exp−1 x1 (t) (x 2 (t)),
ds
d
γt (s) =Dx expx1 (t) (ξ ) · ẋ1 (t) + D expx1 (t) (ξ )·
dt
s Dx (exp−1 −1
x1 (t) )(x 2 (t)) · ẋ 1 (t) + s D expx1 (t) (x 2 (t)) · ẋ 2 (t) .
Since M has bounded geometry, D expx and its inverse are bounded by Theorem 2.4,
while Dx expx and its inverse are bounded by Lemma 2.6, say by C > 1. This leads
to estimates
d
γt (s) ≤ C d(x1 (t), x2 (t)),
ds
d
γt (s) ≤ C ẋ1 (t) + C s[C ẋ1 (t) + C ẋ2 (t)] ≤ 3 C 2 N ,
dt
From this section on, we shall prove results that—although they may be of interest
independently within bounded geometry—are building up towards the final section
of this chapter, where we prove how to reduce Theorem 3.1 on persistence in general
manifolds of bounded geometry to the setting of a trivial bundle. These results form
the more technical part of this chapter and are not required elsewhere.
In the following, we assume that (Q, g) is an ambient manifold that has bounded
geometry of large or infinite order and M ∈ C k will denote a submanifold of Q. Only
a finite order l > k of bounded geometry is required of (Q, g), but for simplicity we
shall assume l = ∞. Recovering the explicit additional order l −k would amount to
tediously tracking the details throughout all the proofs; it should be sufficient if l is
larger than k by some number between 2 and 10.
Let ι : M → Q be a C 1 immersion. With abuse of notation we denote by
Tx M = Im(Dι(x)) and N x = Im(Dι(x))⊥ the tangent and normal spaces of M with
respect to the immersion. Note that even if ι is not injective, the original point x ∈ M
uniquely selects the tangent and normal spaces in Tι(x) Q.
Definition 2.21 (Uniformly immersed submanifold) Let ι : M → Q be a C k≥1
immersion of M into the Riemannian manifold (Q, g) of bounded geometry. Denote
by Mx,δ the image under ι of the connected component of x in ι−1 B(ι(x); δ)∩ι(M) .
k immersed submanifold when there exists a δ > 0 such that
We define M to be a Cb,u
for all x ∈ M, the connected component Mx,δ is represented in normal coordinates
on B(ι(x); δ) ⊂ Q by the graph of a function h x : Tx M → N x and the family of
functions h • ∈ Cb,u
k (T M; N ) has uniform continuity and boundedness estimates
• •
independent of x. We define Cbk≥1 immersions in a similar way.
52 2 Manifolds of Bounded Geometry
Nx Nx
B (0; δ )
Nx
hx
x′ x′
Tx M Tx M
ξ′ 0 0
Fig. 2.2 An immersed submanifold represented by the graph of h x in normal coordinates. In the
left figure, another part of M intersects transversely on the right; the right figure contains an orbit
of the geodesic flow along a normal vector at x ∼= ι(x )
Remark 2.22 By taking the connected component Mx,δ in M, we allow for immersed
submanifolds that intersect, or nearly intersect themselves. See Fig. 2.2 on the left:
Mx,δ is described by the graph of h x , while on the right side, a different part of
M embeds into this same neighborhood B(ι(x); δ). See Fig. 3.2 on page 86 for
an example of a nearly self-intersecting submanifold. If we want to rule out such
cases, we can assume that Mx,δ is the unique component of M ∩ B(ι(x); δ). This
will turn M into an embedded submanifold, but more strongly, the nearly self-
intersecting case is also ruled out. We will refer to this as a uniformly embedded
submanifold. ♦
Remark 2.23 The sets Mx,δ play a similar role as ‘plaques’ in [HPS77, p. 72–73].
♦
Remark 2.24 In case k = 1, boundedness is automatically implied by uniform conti-
nuity. This follows from the representation in normal coordinates. We have Dh x (0) =
0, so by uniform continuity there exists a δ > 0 such that Dh x (ξ ) < ε = 1 when
ξ < δ, hence Dh x is bounded. Put another way, there is no intrinsic measure for
the ‘size of the derivative or tangent’ of a submanifold. ♦
Note that the function h x is only defined on that part of the domain B(0; δ) ⊂ Tx M
where its graph is contained in B(0; δ) ⊂ Tι(x) Q, as can be seen in Fig. 2.2. In the
splitting Tι(x) Q = Tx M ⊕ N x , we denote with p1 , p2 orthogonal projections onto
the Tx M and N x subspaces, respectively.
k≥1
From now on we shall continually assume that M ∈ Cb,u is a uniformly immersed
submanifold of Q. We will often identify M with its image ι(M) ⊂ Q, as well as
identify points x ∈ M with ι(x), keeping in mind the definition of Mx,δ to track local
injectivity. Furthermore, denote by d M the distance on M induced by the pulled back
Riemannian metric ι∗ (g). This distance function measures whether points are close
when viewed along the domain of the immersion, disallowing ‘shortcuts’ through Q.
2.3 Submanifolds and Tubular Neighborhoods 53
It also distinguishes different points with the same immersion image. Note that it is
different from the distance d on Q pulled back to M. This we denote by d Q = ι∗ (d)
but it is not a distance on M when ι is not injective. Still, we have the following local
result, which will be useful for later estimates.
Lemma 2.25 (Local equivalence of distance) Let M ∈ Cb,u 1 be a uniformly
immersed submanifold of the bounded geometry manifold (Q, g). Then d Q and d M
are locally equivalent in the following sense:
1. ∀x1 , x2 ∈ M : d Q (x1 , x2 ) ≤ d M (x1 , x2 );
2. for any C > 1 there exists a δ > 0 such that for all d M (x1 , x2 ) < δ, we have the
local converse d M (x1 , x2 ) ≤ C d Q (x1 , x2 ).
Proof The first assertion follows directly from the fact that any path in M induces
a path of equal length in Q via the immersion ι.
For the second part, we first note that if δ is small enough and d M (x1 , x2 ) < δ, then
we must have x2 ∈ Mx1 ,δ . If this would not be the case, then any path γ connecting
x1 , x2 through M cannot be contained in Mx1 ,δ . But this implies that the path runs
out of B(x1 ; δ), so its length is greater than δ. This contradicts the assumption that
d M (x1 , x2 ) < δ. Hence, x2 can be represented as a point on the graph of h x1 in
B(x1 ; δ).
Let C > 1, ε > 0 be constants to be fixed later and let δ be small enough
such that the metric coefficients are bounded by C in normal coordinate charts, that
Proposition 2.5 holds with C, and we have h • 1 ≤ ε as in Remark 2.24. We consider
the normal coordinate chart on B(x1 ; δ) and construct a path in M to find an upper
bound for d M (x1 , x2 ). Let x2 = (ξ, h x1 (ξ )) and define γ (t) = (t ξ, h x1 (t ξ )) for
t ∈ [0, 1]. We estimate the length of γ as
1
√
We conclude that
where
S : TM × TM → N : X, Y → (∇ X Y )⊥ (2.22)
is the second fundamental form, and it is indeed pointwise defined. In normal coor-
dinates we find
Sx (X, Y ) = D2 h x (0)(X, Y ). (2.23)
2.3 Submanifolds and Tubular Neighborhoods 55
Since h ∈ Cbk and g, g −1 ∈ Cbk as well, it follows that S ∈ Cbk−2 and by (2.21) then
that R M ∈ Cbk−2 , so condition (Bk−2 ) of Definition 2.1 is satisfied.
Condition (I) on the injectivity radius follows from an implicit function argument
applied to the geodesic flow using Theorem A.6. We consider local coordinates
around x ∈ M by projecting the representation M ∩ B(x; δ) onto Tx M in normal
coordinates in Q. That is, we have the coordinate chart map
context of splittings of tangent spaces at any two nearby points, while the results for
coordinates along M follow as an easy corollary.
Lemma 2.29 (Coordinate transformations of splittings) Let (Q, g) be a smooth
Riemannian manifold of bounded geometry, let C be sufficiently large and let δ, ζ > 0
be sufficiently small. Let x1 , x2 ∈ Q and let Txi Q = Hi ⊕ Vi , i = 1, 2 be split-
tings along ‘horizontal’ and ‘vertical’ perpendicular subspaces with dim(H1 ) =
dim(H2 ). Assume that d(x1 , x2 ) < δ and that, for i = j, Hi is represented in
tangent normal coordinates at x j by the graph of L i ∈ L(H j ; V j ) with L i ≤ ζ .
Then the coordinate transformation ϕ2,1 in Lemma 2.6 is of the form
ϕ2,1 = O H ⊕ OV + ϕ̃2,1 with ϕ̃2,1 k ≤ C ζ + d(x1 , x2 ) , (2.25)
There exists an ε > 0 and a tubular neighborhood B(O(V ); η) ⊂ G L(V ) with fiber
projection π, such that on {A ∈ G L(V ) | f (A) < ε, A ≤ 2}, the map ϕ : A →
(π(A), f (A)) is a smooth diffeomorphism. As a direct corollary, if f (A) < ε and
A ≤ 2 then U = π(A) ∈ O(V ) is an orthogonal approximation of A in the sense
that U − A ≤ f (A).
Proof The map f is smooth and invariant under the left action of the orthogo-
nal maps O(V ), while O(V ) = ker( f ). Since G L(V ) is a Lie group, we have
the canonical trivialization TG L(V ) = G L(V ) × gl(V ) by left multiplication.
The similar trivialization TO(V ) = O(V ) × o(V ) can be viewed as a subbundle of
where o(V ) is identified with the skew-symmetric linear maps. We restrict the
exponential map exp : TG L(V ) → G L(V ) to O(V ) × Sym(V ). At 1 ∈ O(V )
this restriction has bijective derivative, hence it is a local diffeomorphism. Since
exp is O(V )-invariant, it defines a diffeomorphism onto a tubular neighborhood
B(O(V ); η) ⊂ G L(V ) of O(V ) of size η > 0 and a corresponding smooth fiber
projection map π : B(O(V ); η) → O(V ).
Now D f (1) : a → a T + a has image precisely Sym(V ). Thus, if we restrict
f to the fiber over 1 ∈ O(V ) in the tubular neighborhood, then D f (1)|Sym(V ) =
2 and f is a diffeomorphism with D f −1 ≤ 1 in some neighborhood of 0 ∈
π −1 (1); if necessary, we reduce η > 0 for D f −1 ≤ 1 to hold on B(O(V ); η) ∩
2.3 Submanifolds and Tubular Neighborhoods 57
π −1 (1). By O(V ) invariance of f , this holds globally on all (fibers) of the tubular
neighborhood. Since, Dπ and D f have complementary image at O(V ), ϕ = (π, f )
is a diffeomorphism on B(O(V ); η).
The set B(0; 2) \ B(O(V ); η) ⊂ L(V ) is compact, so f ( · ) attains its nonzero
minimum on it. Let ε be smaller than this minimum. Then, if f (A) < ε, we
must have A ∈ B(O(V ); η) and hence A = exp(U, a) for a unique (U, a) ∈
O(V ) × Sym(V ). By O(V )-invariance, we can assume w.l.o.g. that U = 1 and use
the mean value theorem to estimate
A − 1 ≤ D f −1 f (A) − f (1) ≤ f (A) < ε.
denote the orthogonal linear map induced by parallel transport. We decompose ϕ2,1 =
O + ϕ̂2,1 , where ϕ̂2,1 can be made arbitrarily small. Moreover, we write
A B
O= ∈ L(H1 ⊕ V1 ; H2 ⊕ V2 ),
C D
with the idea that B, C should be small and A, D should approximate orthogonal
maps O H , OV , respectively. Orthogonality of O implies
T
AT C T A B A A + C T C AT B + C T D
1=O O= T
· = .
B T DT C D B T A + DT C B T B + DT D
For the operator norm we have A, B, C, D ≤ O = 1, so if we assume
for the moment that B, C can be made sufficiently small, then, by writing A T A−1 =
−C T C and D T D − 1 = −B T B, Lemma 2.30 implies that we can find O H , OV
such that
A0 A0 OH 0
O − O H ⊕ OV ≤ O − + −
0 D 0 D 0 OV
≤ B + C + A − O H + D − OV
≤ B + C + C T C + B T B . (2.27)
since
AT C T
O −1 = O T = .
B T DT
We conclude that both B, C ≤ 4 ζ when δ is chosen small, hence O can be
approximated by O H ⊕ OV , and the error from (2.27) can be absorbed into ϕ̃2,1 :
ϕ̃2,1 = ϕ̂2,1 + (O − O H ⊕ OV ).
The errors introduced in ϕ̃2,1 from lemmas 2.6 and 2.30 are Lipschitz small in terms
of d(x1 , x2 ) and ζ , respectively, so these add up to the estimate in (2.25).
k≥1
Corollary 2.27 Let M ∈ Cb,u be a uniformly immersed submanifold of a smooth
Riemannian manifold (Q, g) of bounded geometry. Let x 1 , x2 ∈ M and let Txi Q =
Txi M ⊕ N xi , i = 1, 2, be the respective splittings in horizontal and vertical direc-
hold ford M (x1 , x2 ) < δ. If moreover M ∈ Cb ,
tions. Then the results of Lemma 2.29 2
Now, ϕ(ν ) = ϕ(ν) can only hold if the respective horizontal coordinates along Tx M
are equal. By definition of normal coordinates around x, we have exp(ν) = (0, σ2 ).
Therefore it is sufficient to prove that some η > 0 exists as a lower bound for
ν ∈ R | ϕ(ν )1 = 0, π(ν ) = x .
We view the exponential map as the time-one geodesic flow, which is given in local
coordinates by (2.5). The geodesic flow along ν = (σ1 , σ2 ) starting at (ξ , h(ξ )) is
60 2 Manifolds of Bounded Geometry
a small perturbation of the flow along (0, σ2 ) starting at (0, 0). The latter has solution
curve t → (0, t σ2 ) ∈ Tx M ⊕ N x .
By Theorem 2.4 we arrange for g − 1, ≤ 2 and D ≤ C in local
coordinates. We have estimates
(x (t)) − (x(t)) ≤ D x (t) − x(t) ≤ C x (t) − x(t),
√
σ (t) ≤ g(σ (t), σ (t)) = g(σ (0), σ (0)) ≤ 2 σ (0).
d
x (t) − (0, t σ ) ≤ σ (t) − (0, σ ),
2 2
dt
d
σ (t) − (0, σ ) ≤ D x (t) − (0, t σ ) σ (t)2
2 2
dt
+ σ (t) + (0, σ2 ) σ (t) − (0, σ2 )
√
≤ 4 C η2 + 4 2 η σ (t) − (0, σ ),2
Now, if η is chosen sufficiently small, then using (2.30), we have for all 0 ≤ t ≤ 1
that t
1
x (t) − x1 (0) ≤ C ξ η eC̃ η t dτ ≤ ξ .
1
0 2
This shows that there exists an explicit η > 0 such that ϕ is injective on N≤η
restricted to a neighborhood Mx,δ , and by construction this η is uniform over M.
Later modifications to choose η smaller will only depend on the global geometry of
Q, but not on any details of M.
If moreover Mx,δ = M ∩ B(x; δ) is the unique connected component of M in each
normal coordinate chart, then ϕ is injective globally on N≤η . This follows easily by
taking η < 2δ . Then, any ν and ν that have the same image, must have base points
x , x separated by a distance less than δ, as ϕ = exp will only map onto points at
most η away from the base point. Therefore, x must lie in B(x; δ) and in M, hence
on Graph(h) = Mx,δ . This case was already treated.
Finally, we will show that ϕ is surjective onto B(M; η) when η < rinj (Q). Take
y ∈ B(M; η), then M ∩ B(y; rinj (Q)) contains a nonempty compact set, so there
exists an x ∈ M such that d(y, M) = d(y, x). This distance must be realized by a
(unique) geodesic γ . We will derive a contradiction if γ (0) ∈ N x , by showing that
then the minimum distance is not attained at x. Let (ξ1 , ξ2 ) ∈ Tx M ⊕ N x be the
normalized tangent vector of γ (0). By assumption we have ξ1 = 0, so ξ2 < 1.
We parametrize
γ : [0, d(y, x)] → Tx Q : t → t (ξ1 , ξ2 )
2.3 Submanifolds and Tubular Neighborhoods 61
by arc length in normal coordinates, thus d(x, γ (t)) = t. Consider the Euclid-
ean distance in normal coordinates at x of γ (t) to its vertical projection onto
M = Graph(h). This shows that
d(γ (t), M)
lim ≤ lim C ξ2 + o(t)/t = C ξ2 .
t↓0 d(x, γ (t)) t↓0
k≥0
Lemma 2.34 (Smoothing by convolution) Let r, δr > 0, f ∈ Cb,u B(0; r +
2 δr ) ⊂ R ; R , and fix l > k and ε > 0. If the mollifier support radius ν > 0 is
m n
up to one degree less, so we only need to prove f˜ ∈ Cbl ∩ C ∞ for l shifted by one.
We construct f˜ by a combination of convolution and cut-off. Let
χ̂(x) = χr,r +δr (x) for x ∈ Rm and define
f˜(x) = (1 − χ̂ (x)) f (x) + χ̂(x) f (x − y) ϕν (y)dy. (2.32)
Rm
When ν < δr/2, this f˜ is smooth on B(0; r ) and equal to f outside B(0; r + δr ).
The convolution approximates f in C k -norm, as for any 0 ≤ j ≤ k and x ∈
B(0; r + δr )
2.4 Smoothing of Submanifolds 63
j
j
D f˜(x) − D j f (x) ≤
j
Di χ̂ (x) · D j−i ϕν ∗ f − f (x) ≤ C j ε(ν).
i
i=0
X = H1 × V2 , Y = H2 × Cb,u
k
(H1 ; V1 ), Z = H2 × V2 , and
F : X × Y → Z : (ξ1 , η2 ), (ξ2 , f 1 ) → ϕ2,1 (ξ1 , f 1 (ξ1 )) − (ξ2 , η2 ). (2.33)
Note that X and Z are isomorphic vector spaces, so we can apply the implicit function
theorem with Y as parameter space. Moreover, if we have two functions f 1 , f 2 whose
graphs represent the same manifold on the intersection B(x1 ; δ) ∩ B(x2 ; δ), then we
have −1
F p1 ◦ ϕ2,1 ξ2 , f 2 (ξ2 ) , f 2 (ξ2 ), ξ2 , f 1 = 0
is unitary when ϕ̃2,1 = f 1 = 0. When δ, ε are sufficiently small, then these functions
are still small enough such that D1 F is invertible with uniformly bounded inverse,
using Lemma A.1. Furthermore, F ∈ Cb,u k , as the dependence on ξ , η , ξ is clearly
1 2 2
k
Cb,u , while the omega Lemma [AMR88, p 101] guarantees joint Cb,u k -dependence
f 2 k ≤ A f 1 k + B (2.34)
f 2 k ≤ f 2 − g2 k + g2 k ≤ C̃ f 1 k + B
are used for sizes of (changes in) functions in these coordinates. The parameter ν
from Lemma 2.34 depends on most of the foregoing, while only the C l bound (but
not the C k bound) of Mσ depends on the choice of ν. The various ε’s will be fixed
later, and depend on σ and global properties of M and (Q, g), but not on δ’s. Also
note that everything is independent of points x ∈ M, Q.
We fix 2 δ1 = δ2 = 21 δ3 , ε∞ = 2 ε0 , and C∞ = C0 + 1 and choose
δ3 , ζ, ε∞ , εϕ > 0 sufficiently small such that all the following statements hold
true.
1. By Proposition 2.5 and Lemma 2.25, distances and metrics are C = 2 equivalent
on balls B(x; δ3 ), and d Q , d M are locally equivalent, all up to order l + 1.
2. By assumption of M ∈ Cb,u k , we have for each x ∈ M the representation M
x,δ3 =
graph(h x ) with h • 1 ≤ ε0 and h • k ≤ C0 .
3. By Corollary 2.26, there exists a uniformly locally finite cover i≥1 Mxi ,δ2 of M
with all xi separated by at least δ1 , the balls Mxi ,δ1 already covering M, and bound
K on the maximum number of Mxi ,δ2 intersecting any Mx,δ2 . Formula (2.10)
shows that K depends on the ratio δ2 /δ1 , but does not increase when δ3 → 0.
4. By Lemma 2.29, all coordinate transition maps ϕ2,1 between any x1 , x2 ∈ Q,
d(x1 , x2 ) < δ3 are C l bounded. When the graph representations Hi = Graph(L i )
are bounded by ζ > 0, then these are of the form ϕ2,1 = O H ⊕ OV + ϕ̃2,1 , with
ϕ̃2,1 l+1 ≤ εϕ . And by Corollary 2.31, this holds for the coordinate transfor-
mations between the xi chosen for the cover of M.
5. By Lemma 2.35, there exists a constant C̃ that estimates the graph change under
coordinate transformations from the previous point, when f 1 , g1 ≤ ε∞ and
f k , gk ≤ C∞ , while Corollary 2.37 holds on balls of size δ = 10 9
δ3 > δ2 .
6. If M was a uniformly embedded submanifold, then let Mx,δ3 be the unique
connected component of M in B(x; δ3 ) for each x ∈ M.
Let M j denote a modification of M after applying smoothing operations in the
j
first j charts, and let h i denote the graph representation of M j in chart i. So, initially
j
we have M 0 = M and h i0 = h i . Note that the sequence {h i } j≥1 is constant after some
finite index j (i), since it is changed at most K times by smoothing in overlapping
j (i)
charts. Thus, the final graphs are given by h i∞ = h i .
Initially, we have h • 1 ≤ ε0 and h • k ≤ C0 ,and we assume that throughout
j j
the sequential smoothings it holds for all i, j that h i ≤ ε∞ and h i ≤ C∞ ,
1 k
and therefore the final h i∞ satisfy these estimates as well. Let ε0 be sufficiently small,
such that by a mean value theorem estimate
h • (ξ ) = h • (ξ ) − h • (0) ≤ Dh • ξ ≤ ε0 ξ ,
j
so we must be careful that (at most) K repeated smoothing operations keep the h •
within the bounds required to apply this lemma, while at the same time we must
l at some
ensure that each point on the sequence of manifolds M j is smoothed to Cb,u
stage, even though M j changes to Mσ = M ∞ throughout the sequential smoothings.
Let us first show that each point is smoothed. We can keep track of each original
point x ∈ M as a sequence of points x j ∈ M j throughout the smoothings, and once
M j is smoothed around x j , then the convolution lemma guarantees that smoothness
is preserved around the sequence x j under further smoothing in other charts. Let
∼
: M → Mσ denote the diffeomorphism that assigns to x ∈ M the final point
∞
x ∈ Mσ . Each point x ∈ M is element of a graph h i in at least one ball B(xi ; δ1 ),
so if the corresponding sequence of points x j ∈ M j moves less than r − δ1 , then it
is smoothed in B(xi ; r ). Therefore, we choose ν in Lemma 2.34 small enough, such
that
r − δ1
f˜ − f k ≤ ε(ν) < .
2K
is if either the graph moves outside of B(0; δ3 ) or the modified manifold cannot
j
be represented by a graph anymore. The latter cannot occur if Dh i stays bounded,
j
while the former can be controlled by bounding h i 0 . Both can be controlled by
j j−1
estimating the C 1 changes h i − h i . First, in coordinate chart i = j we can directly
j j−1
use the convolution smoothing Lemma 2.34 to conclude that h i − h i 1 ≤ ε(ν).
In any other chart i = j, this change is amplified by a bounded factor C̃, as per
Lemma 2.35, so we have
j j−1
h i − h i 1 ≤ C̃ ε(ν).
1
K C̃ ε(ν) < min ε0 , δ3 , 1
10
holds, then this leads to
j 2
h i 0 ≤ h i 0 + K C̃ ε(ν) ≤ δ3 ,
10
j
h i 1 ≤ ε0 + ε0 = ε∞ ,
j
h i k ≤ h i k + K C̃ ε(ν) ≤ C0 + 1 = C∞ .
68 2 Manifolds of Bounded Geometry
j
This shows that indeed the assumed bounds ε∞ and C∞ hold, and that h i is defined
at least on the ball B(0; r + 2 δr ).
The sequential smoothings create and preserve Cb,ul ∩ C ∞ smoothness, while
every ‘point’ x j ∈ M j is touched by these operations. Moreover, Lemma 2.34 and
Corollary 2.37 together guarantee that the smoothing in each chart keeps
j j−1 j−1
h i l ≤ A h j 0 + B ≤ A C(ν, l) h j 0 + B ≤ A C(ν, l) ε∞ + B
for all points in the domains of h x and h ∞ x − h x within B(0; δ). So if we set
C̃ K ε(ν) < σ , then Mσ is C close to M in normal coordinate charts of radius δ
2 k
normally hyperbolic vector field to this trivial bundle setting. This procedure is also
alluded to in [Sak94, p. 333–334], but especially in the case of bounded geometry
requires a more careful inspection.
Theorem 2.39 (Uniform embedding of a normal bundle in a trivial bundle) Let
k≥1
M ∈ Cb,u be a uniformly immersed submanifold of the bounded geometry manifold
(Q, g). Then there exists an embedding λ : N → N of the (nontrivial) normal bundle
π : N → M into a larger, trivial vector bundle N = M × Rn̄ . The embedding map
k−1
λ ∈ Cb,u is a quasi-isometry when restricted to N≤η for any η > 0 and the splitting
N = λ(N ) ⊕ N ⊥ is Cb,u k , where N ⊥ is chosen perpendicular to λ(N ) according to
τx
N ⊃ N | Mx,δ / Mx,δ × N x
xxx
xx
π
xxxp1 (2.35)
xx
{x
x
M ⊃ Mx,δ
Then we take a uniformly locally finite cover of M by sets Mxi ,δ . The trivializations
on each Mxi ,δ induce a spanning set of sections, i.e. a frame. Using the uniformity
of the cover, we can globally glue these frames together to obtain rank(N ) = n̄ =
(K+1) rank(N ). Here, K is the maximum number of overlapping charts in the cover.
This identifies λ(N ) as the subbundle of N = M ×Rn̄ spanned by these glued frames.
Proof Let δ be Q-small as in Definition 2.8, as well as sufficiently small such that M
is given as the graph of h • : T• M → N• in normal coordinates as in Definition 2.21.
For any x ∈ M we have a trivialization map
∼
τx = (expx , p2 ) ◦ D exp−1
x : N | Mx,δ → M x,δ × N x , (2.36)
where we canonically identified T(Tx Q) ∼ = (Tx Q)2 and apply expx only on the base
Tx Q and p2 on the fibers of T(Tx Q). In a normal coordinate representation (see
page 52, Fig. 2.2 on the right) this just means that we project the normal fiber N x at
any point x = expx (ξ, h x (ξ )) ∈ Mx,δ onto N x . By Corollary 2.31 this projection
70 2 Manifolds of Bounded Geometry
k−1
This mapping is Cb,u as a composition of such maps (and can be extended, albeit
non-boundedly so, to a map N → N ). The τxi are quasi-isometries, while ēc(i) ei∗ is
isometric on each Mxi ,δ . Each frame ēc(i) is orthogonal to the frame of
any overlap-
ping set Mx j ,δ since c( j) = c(i) and the χi squared sum to one. Thus i χi ēc(i) ei∗
is an isometry, and so λ is a quasi-isometry.
Let p and p ⊥ = 1− p denote the projections from N onto N and N ⊥ , respectively.
One can verify that
∗
p = χi χ j ēc(i) ēc( j) (2.38)
i, j≥1
is the projection onto N by noting that λ(N ) equals the image of i≥1 χi ēc(i) , while
the identities
ēk∗ ēl = δkl and χi2 = 1
i≥1
2.5 Embedding into a Trivial Bundle 71
can be used to show that p 2 = p. Formula (2.38) shows that both p, p ⊥ ∈ Cb,u
k ,
Next, we must extend a vector field v on N to the larger bundle N . There are
additional directions along the fibers of N ⊥ and the extended vector field v̄ must be
such that it is normally hyperbolic in these directions as well. On the other hand,
the uniform boundedness of v must be preserved. We do not assume here that M
is the exact invariant manifold, since these results shall be applied after application
of Theorem 2.38, which has smoothed and slightly altered M such that it is not the
original NHIM anymore. The extension v̄ will keep N invariant and is identical to v
on N , so in the end, we can conclude that the perturbed manifold is contained in N
and restrict to the original setting again.
Lemma 2.40 (Normally hyperbolic extension of a vector field) Let λ : N →
N = M × Rn̄ be a trivializing embedding of vector bundles as in Theorem 2.39 and
l,α
let v ∈ Cb,u be a vector field on N with l + α ≤ k − 2. Let N |≤η be the restriction
to some radius η > 0. Then v can be extended to a vector field v̄ on N , such that v̄ is
l,α
Cb,u on N ≤η , leaves N invariant, and contracts at a given exponential rate ρ < 0
along the fiber direction of N ⊥ towards N ⊂ N .
To extend v to a vector field v̄ on N with the required properties, we must do two
things. First of all, v must be extended from N through λ to the whole of λ(N ) ⊕ N ⊥
and secondly, a normal component along the fibers of N ⊥ must be added to make v̄
contracting, thus normally hyperbolic in that direction. The idea can be expressed in
local coordinates (m, y, z) ∈ λ(N ) ⊕ N ⊥ as
v̄(m, y, z) = v̂(m, y, z) + v⊥ (m, y, z) = v(m, y), ρ z ,
∂ p⊥
We read off that the Christoffel symbols are given by p ⊥ ∂m i
(Dπ N )i , so they are
k−1
Cb,u . The horizontal lift
∂ p⊥
v̂(m, y, z) = v(m, y) − p ⊥ · ( z)(Dπ N v(m, y))i , (2.39)
∂m i
l,α l,α
then, is also Cb,u since all functions involved are at least Cb,u in these coordinates,
and z is bounded on N ≤η .
We define v⊥ as the Euler vector field along the fibers of N ∼ = E, taking values in
Vert(E). Each fiber E (m,y) = Nm⊥ is a linear space, so the tangent space at any point
is canonically identified with the fiber itself, which allows us to canonically define
This vector field leaves N ⊂ N invariant, while generating a flow that attracts towards
N at the exponential rate ρ < 0. It is clear that v⊥ ∈ Cb,uk for any k when z is bounded
on N ≤η .
We conclude that the vector field v̄ = v̂ + v⊥ indeed leaves N invariant. Since v̂
is neutral in the fiber directions of E and v⊥ contracting at rate ρ, it follows that v̄ is
contracting with rate ρ as well. The combined vector field v̄ is defined in terms of v
l,α l,α
and other functions that are all at least Cb,u , hence v̄ ∈ Cb,u .
2.6 Reduction of a NHIM to a Trivial Bundle 73
Having set up the theory of bounded geometry spaces, we are finally in a position
to reduce a general normally hyperbolic system to the setting of a trivial bundle.
That setting is required to apply our basic Theorem 3.2 on persistence of NHIMs.
k,α
Let M ∈ Cb,u with k ≥ 2 and 0 ≤ α ≤ 1 be a uniformly immersed or embedded
submanifold in (Q, g) of bounded geometry and furthermore assume that M is an
k,α
r -NHIM with r = k + α for the vector field v ∈ Cb,u on Q.
Remark 2.41 The bounded smoothness requirement k ≥ 2 is dictated by
Theorem 2.33. It is not present in the compact case where the normal bundle can
be “jiggled slightly” [Fen72, Prop. 2] to make it sufficiently smooth to model a C k
flow for k ≥ 1. Hypotheses 2 and 3 in [BLZ99, p. 987] require similar conditions in
the noncompact setting in Banach spaces, see also the discussion after Corollary 3.5
and Remark 3.13. I have not investigated in detail whether the Cb2 requirement is
necessary, or if M ∈ Cb,u1 could be modeled as a sufficiently small section of the
λ : N → N = Mσ × Rn̄ = N ⊕ N ⊥ .
4 This is similar to Fenichel’s argument in [Fen72, p 203] that normal hyperbolicity is independent
of a choice of metric when M is compact.
74 2 Manifolds of Bounded Geometry
section, then the family vσ can simply be taken constant and all σ dependence
can be dropped from the formulation.
4. We only obtain a C k−1 -norm estimate for the perturbation distance of M̃ away
from M, even though M̃ ∈ C k,α is preserved. This is due to a linearization along Y
and the smoothing convolution used to restore C k,α smoothness after linearization.
See Sect. 3.4, in particular Remarks 3.13 and 3.15, for more details. I fully expect
it to hold that M̃ and M are C k,α close when ṽ − vk,α is small.
5. The minimum smoothness requirement k ≥ 2 is a stronger assumption than k ≥ 1
in the well-known compact case. This seems to be intrinsic to the noncompact
case. If the spectral gap condition only holds for some 1 ≤ r < 2, then we can
still obtain a perturbed manifold M̃. This manifold M̃ will generally not have
better than C r smoothness, though.
6. We allow both values α = 0 and α = 1, where α = 0 is considered an empty
condition (besides the boundedness and uniform continuity). Thus, if r = k + 1
satisfies the spectral gap condition (1.11), then we can choose both C k,1 or C k+1,0
as resulting smoothness for M̃, if M had the same smoothness. Thus, if M was
sufficiently smooth, then the choice M̃ ∈ C k+1,0 yields the best result. Note,
though, that by Rademacher’s theorem, Lipschitz functions are differentiable
almost everywhere, so the difference is not that big.
Finally, it should also be noted that the spectral gap condition is a strict inequality
on r , so if we can choose r = k integer, then we can also find an α > 0 such that
r = k + α satisfies the spectral gap as well. This shows that in this context Cb,uk
k,α
‘integer’ smoothness really is a special case of Cb,u ‘fractional’ smoothness.
7. Both Riemannian manifolds Q in Theorem 3.1 and X in Theorem 3.2 are assumed
to be finite-dimensional; multiple results on bounded geometry crucially depend
on this fact. On the other hand, we allow Y to be an infinite-dimensional Banach
space simply because everything naturally generalizes to that setting. Note that
we do not allow semi-flows as in [Hen81; BLZ08], so the case that Y is infinite-
dimensional may not be that useful.
8. These results are weaker than those in the well-known compact case in a
few aspects. First of all, we use a stricter notion of normal hyperbolicity, see
Remark 1.10. This seems to be a fundamental restriction of the Perron method;
the more general definition of normal hyperbolicity is successfully applied to
noncompact manifolds in [BLZ08]. Secondly, we only include the stable nor-
mal bundle E − . Adding the unstable bundle E + as well should be possible, see
Sect. 4.4 for more details.
9. While we do prove that the NHIM persists into a new invariant manifold M̃,
we do not prove that M̃ is again normally hyperbolic. I fully expect this to be
true though: the perturbed flow satisfies slightly perturbed exponential growth
conditions and the spectral gap is an open condition, so should be preserved
under sufficiently small perturbations. It remains to be proven that M̃ again has a
continuous invariant splitting (1.19) with bounded projections. This is one possible
reason for breakdown of normal hyperbolicity [HL06]. In Sect. 4.5 we sketch how
to recover the invariant stable fibration and the invariant splitting.
78 3 Persistence of Noncompact NHIMs
k,α
10. In both theorems, we assumed that M ∈ Cb,u . Just as in the compact case, there
is forced smoothness, see [HPS77]. That is, if we only have M ∈ Cb2 , while it is
k,α k,α
r -normally hyperbolic and the system is Cb,u , then we have in fact M ∈ Cb,u .
Note that we do require k ≥ 2 unlike the compact case, see also point 3.3.
This statement can be verified by reviewing the persistence proof under zero per-
turbation. We construct a smoothed approximation of M in Theorem 3.1 to model
everything on, and in Theorem 3.2 we smoothen the linearization of the vertical
part vY of the original vector field. These can be made as smooth as required, while
k,α
we only use the perturbed flow (3.16) or the Cb,u smooth decomposition (3.24)
(hence with remainder f˜ ∈ C ) in the rest of the proof. Therefore the resulting
k,α
b,u
k,α
manifold will satisfy M̃ = M ∈ Cb,u . ♦
These two theorems reduce to the corollaries formulated below, when M is com-
pact or when Q = Rm+n with standard Euclidean metric and M = Rm × {0}. The
statements then significantly reduce in complexity, and are comparable to well-known
results.
Firstly, the case that M is compact. Then we can take a (pre)compact neigh-
borhood B(M; ε) ⊂ Q of M and thus conclude that bounded geometry holds on
B(M; ε), ignoring irrelevant boundary problems. Any C k,α function on B(M; ε) is
k,α
automatically Cb,u , so Theorem 3.1 reduces to the following corollary. For simplicity
we leave out α-Hölder continuity and the C k distance estimate between M and M̃.
Corollary 3.4 (Persistence of compact NHIMs) Let k ∈ Z≥2 , let (Q, g) be a
smooth Riemannian manifold and v ∈ C k be a vector field on Q. Let M ∈ C k be
a connected, compact submanifold of Q that is k-normally hyperbolic for the flow
defined by v, with empty unstable bundle, i.e. rank(E + ) = 0.
Then for each sufficiently small η > 0 there exists a δ > 0 such that for any
vector field ṽ ∈ C k with ṽ − v1 < δ, there is a unique submanifold M̃ in the
η-neighborhood of M, such that M̃ is diffeomorphic to M and invariant under the
flow defined by ṽ. Moreover, M̃ is C k .
This corollary closely resembles [Fen72, Thm. 1] in the absence of a boundary ∂ M
(a boundary is allowed when M is overflowing invariant, see also Sects. 1.6.3 and 4.3).
Note that our definition of normal hyperbolicity is less general (see Remark 1.10),
and that we exclude unstable normal directions and the case1 k = 1, while we do
allow M to be an immersed submanifold. The persistence result of Hirsch, Pugh,
and Shub [HPS77, Thm. 4.1 (f )] is similar to that in Fenichel’s work; it additionally
includes Hölder smoothness and allows immersed submanifolds as well.
Secondly, the case of M = Rm × {0} ⊂ Q = Rm+n . Again, Rm+n has bounded
k if it can (locally) be
geometry with one trivial, global chart. Thus, any object is Cb,u
k functions, but with common global bound and continuity modulus.
described by Cb,u
Then Theorem 3.2 reduces to the following corollary. We again suppress Hölder
1 This was for technical reasons in the noncompact setting, see Remark 3.3, 5, and could be repaired
in the compact setting.
3.1 Statement of the Main Theorems 79
continuity and drop the σ parameter dependence, which was only relevant for the
reduction of Theorem 3.1.
Corollary 3.5 (Persistence of a trivial NHIM in Euclidean space) Let k ∈ Z≥2 .
k be a vector field on Rm+n and let M = Rm × {0} be a k-NHIM for the
Let v ∈ Cb,u
flow defined by v, with empty unstable normal bundle.
Then for each sufficiently small η > 0 there exists a δ > 0 such that for any vector
k with ṽ − v < δ, there is a unique submanifold M̃ = Graph(h̃),
field ṽ ∈ Cb,u
1
h̃ : Rm → Rn , h̃ ≤ η such that M̃ is invariant under the flow defined by ṽ.
0
k and h̃
Moreover, h̃ ∈ Cb,u can be made arbitrary small by choosing ṽ − vk
k−1
sufficiently small.
This theorem can be compared, for example, to [Sak90, Thm. 2.1]. Sakamoto’s
theorem is specifically targeted to singular perturbation problems. His conditions
are more specific and concrete: the invariant manifold M is assumed to consist of
stationary points and normal hyperbolicity is formulated in terms of the eigenvalues
of normal derivatives of the vector field at M. He starts with an invariant manifold
that is the graph of a nonzero function h ∈ Cbk ; this he reduces to the zero graph
case M = Rm × {0}, while he incurs a loss of one degree of smoothness, obtaining
a Cbk−1 persistent manifold and he requires k ≥ 3, see [Sak90, p. 50]. He does allow
both stable and unstable normal bundles.
In their series of papers [BLZ98; BLZ99; BLZ08], Bates, Lu, and Zeng obtained
multiple results on noncompact NHIMs, including a persistence result. Most impor-
tantly, they work in Banach spaces with semi-flows, which adds nontrivial obstacles
to be overcome, and allows application to PDE problems. On the other hand, my
setting allows the ambient space to be a manifold, albeit finite-dimensional. They
use the graph transform instead of the Perron method. This allows for the more gen-
eral definition of relative normal hyperbolicity as in Remark 1.10. They include both
stable and unstable normal directions, while they do not prove Hölder regularity.
Finally, in [BLZ08] the interesting idea is developed to start with an approximate
NHIM only.
If we ignore these differences, then their results fit in between the formulations of
Theorem 3.1 and Corollary 3.5. Their invariant manifold M is immersed in a Banach
space, but not necessarily described by the graph of a function h. Their hypothe-
sis [BLZ08, p. 363] that the splitting does not twist too much is a bounded Lipschitz
condition on the (approximate) splitting of the (un)stable and tangent bundles over M.
This condition is similar, but slightly weaker than our condition M ∈ Cb,u 2 , see also
of Rn ; this procedure can be compared to the reduction in Sect. 2.6. The problem
that arises is that the Nash embedding theorem provides no control on the extrinsic
curvature of the embedding2 , so M need not be Cb2 in Rn . It might be possible to
work around this by proving a ‘bounded geometry version’ of the Nash embedding
theorem. Alternatively, their proof could likely be adapted to work with a uniform
atlas of charts, e.g. using theory similar to that developed in Chap. 2.
We should also mention the paper [JS99] by Jones and Shkoller. They generalize
Fenichel’s results on persistence of overflowing invariant manifolds to semi-flows on
infinite-dimensional Riemannian manifolds. They do assume the invariant manifold
itself to be compact.
The proof of Theorem 3.2 is lengthy and involves a lot of details. We therefore first
present an overview of the separate steps involved in the proof.
First, in Sect. 3.4 we bring the system into a form that is suitable for application
of further analytical techniques. That is, we decompose the vector fields along X and
Y directions and linearize the vertical direction, leading to equations
ẋ = v X (x, y),
ẏ = vY (x, y) = A(x) y + f (x, y)
2 This can be seen from the result that the Nash embedding can be obtained into an arbitrarily
small ball. As an explicit example, take Q = R with standard metric and embed it into R2 via the
map r (θ) = arctan(θ)/π + 21 in polar coordinates. Since the integral of r (θ) diverges both when
θ → ±∞, we obtain (after arc length reparametrization) an isometric embedding of R ‘curled up’
into B(0; 1), while the extrinsic curvature grows unbounded for θ → −∞.
3.2 Outline of the Proof 81
respectively, that are ‘close’ to solution curves of the original system. We directly
include the perturbation ṽ−v into the horizontal component of the vector field; for the
vertical component we include the perturbation in the nonlinear term f˜. This gives
rise to a nonlinear, horizontal flow
y (t, t0 , x0 ) and a linear, vertical flow x (t, t0 ) y0
that depend on a curve in the other space and satisfy estimates
∀t ≤ t0 : D
y (t, t0 , x0 ) ≤ C X eρ X (t−t0 ) ,
∀t ≥ t0 : x (t, t0 ) ≤ CY eρY (t−t0 ) ,
h̃ : X → Y : x0 → (x0 )(0)
must describe the unique perturbed invariant manifold M̃. Application of the contrac-
tion principle immediately implies that h̃, and therefore M̃, is Lipschitz continuous.
In Sect. 3.7 we continue to prove that M̃ is C k,α . We start that section with a more
detailed overview of this smoothness part of the proof and in Sect. 3.7.1 we present
a scheme to obtain the first derivative in a number of steps. Higher smoothness then
follows along the same lines, just with more complex expressions, see Sect. 3.7.9.
Let us focus here on the basic ideas.
Smoothness of M follows directly from smoothness of
. We study the derivatives
of
by formal differentiation of the fixed point equation; this leads to Eq. (3.42).
But let us consider for a moment a simpler heuristic formulation, similar to Eq. (1.8)
for the hyperbolic fixed point in Sect. 1.4.3. Then the derivatives of the Perron fixed
point map are
82 3 Persistence of Noncompact NHIMs
t
Dk T (y) δy1 , . . . , δyk (t) = (t, τ ) · Dk f˜(y(τ )) δy1 (τ ), . . . , δyk (τ ) dτ.
−∞
is tailored to the problem; ‘plain’ uniform continuity does not provide this, forcing
us to use an (arbitrarily) small amount of the spectral gap to compensate.
In the compact case, Fenichel [Fen72, p. 200] argues that persistence of the invari-
ant manifold should be independent of the choice of a Riemannian metric. Indeed,
he proves that the exponential growth rates are independent of such a choice, as all
metrics are equivalent. In a noncompact setting, however, non-equivalent metrics do
exist and we do expect persistence to depend on the choice of metric, since it deter-
mines which perturbations are globally C 1 small. Moreover, we make a technical
uniformity assumption of bounded geometry (see Chap. 2, also for example spaces of
bounded geometry) on both the underlying space and the invariant manifold. These
assumptions are automatically satisfied in the compact case. It is not clear though to
what extend they are essential in the noncompact case.
The remainder of this section is devoted to examples that show multiple aspects
that should be treated carefully in the noncompact setting, while being trivially
fulfilled in the compact case. Some interesting examples can also be found in the early
work [Hop66] by Hoppensteadt. He presents counterexamples to uniform stability of
solutions in a time-dependent singular perturbation setting when the stability criteria
do not have sufficient uniformity.
with λ < 0. This makes the vertical, y direction uniformly attracting with exponent λ,
while in the plane X × {0} the origin is an expanding fixed point and the exponential
growth rate is everywhere non-negative. Thus, M = X × {0} is an r -NHIM for
arbitrarily large r and v ∈ Cb,u
k for any k ≥ 1 on a tubular neighborhood of M of
Fig. 3.1 A normally hyperbolic flow with respect to Euclidean and sphere induced metrics
This shows that the vector field is still Cb1 , although not Cb,u
1 anymore in this met-
ric. More importantly, the system can be extended to include the North Pole as an
attracting fixed point. The rate of attraction along the perpendicular y direction has
not changed from λ, but along the horizontal directions of the sphere, the attraction
rate now is π
lim D1 (
∗ v)(ϕ, 0, 0) = − . (3.4)
ϕ→π 2
In both metrics the normal exponential attraction rate is ρY = λ < 0 since the linear
flow on Y is decoupled from X . The exponential growth rate on X does depend on
the choice of metric. For the Euclidean metric ge , we have ρ X = 0. This follows
from an analysis of the radial component ṡ = arctan(s) of the system. At s = 0
this has unstable exponent 1, while away from s = 0 the tangent flow is uniformly
bounded away from both zero and infinity, so there the Lyapunov exponent is zero.
With respect to the metric gs , it follows from 3.4 that the Lyapunov exponent is
ρ X = − π2 for solutions approaching planar infinity. Thus, we see that if λ ≥ − π2 ,
then the system is not normally hyperbolic with respect to the metric gs , while
3.3 Compactness and Uniformity 85
X × {0} is an r -NHIM for any r ≥ 1 under the metric ge . On the other hand, if
λ < − π2 , then the system is still only r -normally hyperbolic with respect to gs
for r < π/(2λ). Again, we can construct an explicit perturbation similar to that in
Example 1.1. If we add a small vertical perturbation ε χ with support away from
the poles (compare with Fig. 1.3), then along meridians passing through supp χ , the
invariant manifold is lifted and approaches the North Pole ϕ = π approximately
along a graph y = C (π − ϕ)−2λ/π , while on meridians not passing through supp χ ,
the invariant manifold stays at y = 0. See the perturbed flow lines in Fig. 3.1. This
results in unbounded C r derivatives of the perturbed invariant manifold at the North
Pole for r > −2λ/π .
We conclude that in the noncompact setting, normal hyperbolicity explicitly
depends on the choice of metric since metrics need not be equivalent. Moreover,
the allowed size of the perturbations depends on the metric.
Example 3.7 (Perturbation sizes depend on the choice of metric) We extend
Example 3.6 above. Let λ < − π2 so that the system is normally hyperbolic with
respect to both metrics, and set
w(s, θ, y) = 0, 0, arctan(s) sin(θ ) . (3.5)
Let us give another example which shows that an embedded invariant manifold need
not persist. This example clarifies the remarks already made in the introduction in
86 3 Persistence of Noncompact NHIMs
Sect. 1.6.2: noncompact embedded NHIMs can perturb into immersed manifolds. On
the one hand, this example shows that it is natural to consider immersed NHIMs. It
also shows that a noncompact NHIM must have a uniformly sized tubular neighbor-
hood that does not self-intersect, in order to guarantee perturbation as an embedded
manifold. Further details can be found in Sect. 2.3 where the concept of a uniformly
embedded submanifold is defined.
In the example presented here, the unperturbed manifold is normally hyperbolic
but noncompact and ‘touches’ itself in the limit to infinity, see Fig. 3.2, the top image.
In this case, we can find arbitrarily small perturbations that will let the two persisting
branches collapse into one at a finite point.
Example 3.8 (A non-uniformly embedded NHIM) Let (x, y) ∈ R2 = Q. For x ≤ 0
we define the vector field v of the system in polar coordinates, and for x ≥ 21 in
Cartesian coordinates as
t (x, y) = (log(e x + t) , y e−t ),
x
3.3 Compactness and Uniformity 87
which exhibits the rates of contraction in the normal and tangential directions by
considering either projection in
1
lim log πx,y ◦ D
t (x, y) .
t→∞ t
Let us now introduce the very simple perturbation vector field w(x, y) = (−ε , 0)
for x ≥ 1 and smoothly cut off to zero left of x = 1. When this perturbation is added
to the vector field v, the vertical line x = − log(ε) becomes a stable, invariant set,
see Fig. 3.2 the bottom image. The upper and lower branch of the original NHIM
will both converge to the newly created fixed point (− log(ε) , 0). On the right side
of this point the manifold is given by the single line y = 0.
Each branch separately persists as a C ∞ manifold, as could (naively) be expected.
The problem is that we have no control on the distance between the two branches,
so for any ε > 0, these branches will collapse at some point where the persisting
object ceases to be an embedded manifold. As already remarked, there are two ways
to address this issue. One can abandon the implicit assumption that the NHIM is an
embedded submanifold and replace this by immersed submanifolds; this idea was
introduced already in [HPS77]. If one insists on having embedded submanifolds,
even under perturbations, then one must eliminate the possibility of these ‘collapses’
occurring. A sufficient condition is the existence of a uniformly sized tubular neigh-
borhood of the invariant manifold that does not intersect itself. Global control on the
perturbation distance of the invariant manifold will imply that the perturbed manifold
stays inside this tubular neighborhood and thus will not self-intersect.
The previous examples were set in Euclidean space. The next two examples show that
additional uniformity conditions must be imposed on a nontrivial ambient space. It is
not enough to assume uniform continuity and boundedness for the dynamical system.
The first example is an extension to the previous one and shows that the ambient space
must have a uniformly finite injectivity radius. The second example indicates that
even if the ambient space has finite injectivity radius and trivial topology, persistence
might be lost due to non-bounded curvature of the ambient space.
Example 3.9 (Zero injectivity radius) We construct as ambient space Q a cylinder
whose radius shrinks exponentially. That is, we take Q = R × S 1 with metric
g(x, θ ) = dx 2 + e−2x dθ 2 . See Fig. 3.3 for an impression, but note that the metric
induced by the embedding in R3 is not (and cannot be made) the same as g. The
vector field
v(x, θ ) = (1, 0) (3.7)
88 3 Persistence of Noncompact NHIMs
generates a simple flow along the cylinder, and each solution curve is a NHIM purely
due to the fact that all curves flow into an exponentially shrinking tube, while there
is no contraction along the curve.
Let us consider the invariant manifold M = { θ = 0 }. We add a perturbation to
the vector field that is given by θ̇ = ε for x ≥ 0 and is smoothly cut off to zero left
of x = 0. This perturbation is smooth and C 1 small with respect to the metric3 g.
When the original curve M enters the region x ≥ 0, it is modified to a curve M̃ that
starts winding around the cylinder, as indicated in Fig. 3.3.
This clearly cannot be represented in a tubular neighborhood of M in Q since
the curve would leave the neighborhood ‘above’ and reenter ‘from below’. On the
other hand, the normal bundle of M can be viewed as a covering of Q, and on that
covering, M̃ is represented by the function θ (x) = ε x, which is still a bounded
graph with norm θ (x) = ε x e−x , but which winds around since θ ∈ [0, 2 π ).
Thus, a globally finite injectivity radius seems a necessary requirement if we want
the perturbed manifold to be represented in a diffeomorphic tubular neighborhood
of M.
The second example indicates that a finite injectivity radius is not enough;
unbounded curvature of the ambient manifold might lead to loss of persistence of the
NHIM. It should be pointed out that this example satisfies all properties of normal
hyperbolicity with uniform estimates up to C 1 smoothness, except that the vector
field has no uniformly continuous derivative. I have not been able to add this final
property to create a complete counterexample where persistence fails in the absence
of the curvature property of bounded geometry only.
Example 3.10 (Unbounded curvature) Let Q = R3 with metric
g(x, y, z) = dx 2 + exp − 2 |x| arctan(x z) dy 2 + exp − 2 |x| dz 2 , (3.8)
3 Measuring the C 1 size with respect to g requires taking covariant derivatives and may introduce
results not directly apparent in coordinates (x, θ). A perturbation term θ̇ = ε exp(x) would still be
globally small in this metric.
3.3 Compactness and Uniformity 89
and smoothly glued together in a neighborhood of the boundary |x| = 1, see Fig. 3.4.
Thus, the whole system is invariant under translations in y, and within any plane
{ y = y0 }, the point (x, z) = (0, 0) is a hyperbolic fixed point with eigenvalues 1
and −1 in the x and z direction, respectively. The system also has a mirror symmetry
around x = 0; from now on we only consider x ≥ 0.
The plane M = { z = 0 } is a NHIM; it is clearly invariant under the flow, and
similar to the exponentially shrinking cylinder, the metric contracts in the z direction
along solution curves x → ∞, while no contraction occurs along the manifold. On
TM the metric reduces to g|TM = dx 2 + dy 2 while the flow is linear in time. On a
neighborhood of the y-axis, finally, normal hyperbolicity follows from the attraction
along the z directions due to the term − arctan(z) in (3.9).
The vector field v and its covariant derivative are uniformly bounded with respect
to the metric. For x ≤ 2, this follows from the fact that g including its inverse
and derivatives, as well as v and its derivatives are bounded. For x ≥ 2, explicit
calculations in local coordinates show that v = 1, while we have
⎛ ⎞
0 0 0
∇ v = ⎝ 0 − 1+(x z)2 − arctan(x z) 0 ⎠ ,
x z
0 0 −1
We proceed with checking that exp : N → Q has finite injectivity radius on the
normal bundle4 N of M. Then all assumptions for persistence are fulfilled, except for
bounded curvature (and uniform continuity of ∇v). As each submanifold { y = y0 }
is invariant, we can restrict our investigation to y = 0, such that x denotes the
coordinate along the base manifold of the normal bundle; let t denote the normalized
coordinate in the vertical direction. The exponential map of (x, t) is generated by
the geodesic flow as follows: start at x with vertical unit vector and then follow a
geodesic for time t. For x = 2, say, this flow is well-defined and stays inside the
region x ≥ 1 for some bounded time |t| ≤ r . The diffeomorphism group
ϕ(x, z) = (x + ξ, z eξ ) with ξ ∈ R
translates along x while simultaneously scaling z, see also Remark 2.2. In the region
x ≥ 1 this is an isometry, so the exponential mapping defined for x = 2, |t| ≤ r can
be isometrically mapped onto the whole region x ≥ 2, |t| ≤ r . For x on the compact
interval [0, 2] the exponential map must have a finite injectivity radius too, so there
exists a global r > 0 such that exp : N≤r → Q is diffeomorphic onto its image.
Now we add a perturbation in a similar spirit to that in Sect. 1.2.1: we lift M by
a local, vertical perturbation of the vector field, varying along y. In a neighborhood
of the plane x = 2 we add a small vertical component
−1
ż = ε (2 − cos(y)) exp if |x − 2| ≤ 1.
1 − (x − 2)2
In the region x ≤ 1 the flow is unmodified, so there the perturbed manifold M̃ must
coincide with the original M; otherwise it would not stay in a bounded neighborhood
of M under the backward flow. Around x = 2, the flow lifts M̃ to at least a height
3 −1
z≥ε exp dx ≥ ε/4
1 1 − (x − 2)2
and the height z depends on y, see Fig. 3.5. Then in the region x ≥ 3 the manifold
M̃ continues along ẋ = 1 at the same y, z coordinates. Now we have ε/4 ≤ z ≤ 6 ε,
so for all small ε > 0 the y component of the metric along the flow on the invariant
manifold eventually shrinks at an exponential rate that is stronger than in the z
direction, while M̃ has variable height z(y) independent of x. Hence the Lipschitz
norm of M̃ can be estimated by measuring z (y) with respect to the metric g along the
manifold. But z (y) is nonzero and constant along x in coordinates, while horizontal
distances along y shrink faster than vertical distances. This means that the Lipschitz
norm of M̃ grows unbounded for x → ∞. Moreover, the normal exponential growth
4 We should actually show that the injectivity radius of Q is finite, i.e. rinj (Q) > 0, at least in a
neighborhood of M. I have not been able to do this. Finite injectivity radius of the normal bundle
does allow us to construct a tubular neighborhood to model persistent manifolds close to M, though.
3.3 Compactness and Uniformity 91
x=1 x=3
z
y
z
y
Fig. 3.5 The graph of the perturbed manifold with respect to the Euclidean metric (top) and an
approximate image of the same graph with respect to the metric g (bottom)
rate does not dominate the tangential rate anymore, so the perturbed manifold is not
normally hyperbolic anymore.
As a first step towards proving Theorem 3.2 we shall bring the system in a form
suitable to apply analytical tools to it. Let
92 3 Persistence of Noncompact NHIMs
ẋ = v X (x, y) ∈ Tx X,
ẏ = vY (x, y) ∈ Y, (3.10)
be the decomposition of the vector field v along X and Y . The invariant manifold is
given as the graph M = {y = h(x)}. We dropped the explicit dependence on σ from
the notation. The full flow of v will be denoted by ϒ t , while
, are reserved for
flows defined in terms of the horizontal and vertical components of v, respectively. To
shorten notation we write g(x) = (x, h(x)). We shall always assume that y ≤ η.
Our goal is to establish a linearized form
and generated by
t t
ẋ = v X (g(x)),
ẏ = A(x(t)) y with g(x(t)) a solution curve on M, (3.12)
T M (X × Y ) = TM⊕E − , 1 = πTM + π E − , Dϒ t = Dϒ M
t
⊕ Dϒ−
t
with associated exponential growth rates (1.10). On the other hand we have the
splitting
T(X × Y ) = πY∗ (TX ) ⊕ π X∗ (TY ) ∼
= TX × (Y × Y )
Tm M
m
X
x
h
3.4 Preparation of the System 93
the norms on the different tangent spaces. Recall that vσ depends on a parameter
σ ∈ (0, σ0 ]. We choose the bound σ1 small enough such that for all σ ≤ σ1 we have
∀ t ≤ 0 : D
t ≤ C̃ M e−ρ M t with C̃ M = 2 C M .
Let N = π X∗ (TY )| M denote the vertical bundle over M, whose fibers can be
canonically identified with Y . Just as above, we want to project the flow Dϒ− t onto
N while preserving the exponential growth rate. The projection π E − along TM is uni-
formly bounded for all σ . This means that the angle between TM and E − is bounded
away from zero. Since TM can be chosen arbitrarily close to the horizontal TX by
choosing σ sufficiently small, it follows that the projection DπY | E − : E − → N and
its inverse π E −,N are bounded for all σ ≤ σ1 when σ1 is sufficiently small, see also
Fig. 3.6. To this end, let (0, ϕ) ∈ Tg(x) (X × Y ) and consider the identity
−
thatDπY | E − has an inverse π E −,N : N → E for which
from which it follows
we have the bound π E −,N ≤ (1 − σ1 πTM )−1 π E − ≤ 2 π E − if we choose
σ1 ≤ 2π1TM .
Consider the flow
ˆ t = DπY ◦ Dϒ t ◦ π E −,N : N → N ,
(3.13)
generated by DvY ◦ π E −,N along solution curves g(x(t)). Both DπY | E − and π E −,N
are uniformly bounded, so the exponential estimates of Dϒ− ˆ t up to
t carry over to
a constant factor:
ˆ t
∀ t ≥ 0 : ≤ C̃− eρ− t with C̃− = 2 π E − C− . (3.14)
with a solution curve x(t) of the vector field v X ◦ g inserted. These flows are of
the form (3.12) and satisfy exponential estimates (1.10) inherited from the invariant
bundle splitting.
The vector field v X ◦ g already has sufficient smoothness5 , but  is not smooth
k,α
enough since the projection π E −,N is only continuous. We construct A ∈ Cb,u as a
smoothed approximation of D y vY ◦ g. This term is C -close to Â, since
0
 − D y vY ◦ g = (Dx vY ◦ g) · Dπ X · π E −,N 0 ≤ Dx vY ◦ g0 π E −,N 0
0
(3.15)
because DπY · π E −,N = 1 N and Dx vY ◦ g is small. Lemma 3.17 will imply that
the flow t of this approximation has exponential growth estimates close to those
of ˆ t . The following lemma will be used to obtain A from D y vY ◦ g. We apply it
with l = k − 1 to obtain A ∈ Cbk (X ; L(Y )) such that A − D y vY ◦ g k−1 ≤ ε(ν).
This lemma is a (strongly) simplified version of Theorem 2.38; the notation of l, k
is reversed to match the context here.
Lemma 3.12 (Uniform smoothing of a vector bundle section) Let (X, g) be a Rie-
mannian manifold of bounded geometry and V a Banach space. Let f ∈ Cb,ul (X ; V )
on the number of charts in the cover that intersect any one point, this guarantees that
f˜ satisfies estimates equivalent to those of the f˜i . Note that f˜l does not depend
on the smoothing parameter ν, but the higher derivative norms do.
5 It may seem impossible to define a C k vector field v on the tangent bundle TM of a C k manifold
M since TM ∈ C k−1 . See [PT77, App. 1] or [PT83, p. 398] for a method to endow an invariant
submanifold M ∈ C k with a compatible topology that makes v| M ∈ C k . We effectively used this
in our definition of v X ◦ g ∈ C k .
3.4 Preparation of the System 95
Remark 3.13 (On loss of smoothness). We must carefully construct the system 3.11
in order not to lose one degree of smoothness, while at the same time retaining
exponential growth rates and proximity estimates.
The invariant complementary bundle E − is only continuous, while the normal
bundle of M is only C k−1 , even if disguised in coordinate expressions. We use
k−1,α
the linearization at y = 0, but not directly, since D y v y ( · , 0) ∈ Cb,u artificially
decreases the smoothness as well. The loss of smoothness in [Sak90] occurs for
these reasons. Note that even though we retain C k,α smoothness by a convolution
smoothing, this does not preserve higher than C k−1 bounds. This seems to be an
artifact of the proof, inherent to the partial linearization along Y .
In the proof of Theorem 3.1 we construct a smoother, approximate manifold Mσ
exactly to circumvent these problems. In the trivial bundle setting of Theorem 3.2
then, we must be careful not to pick a representation that reintroduces this loss of
smoothness. On the other hand, we do not seem to obtain optimal results in the
sense that we require h2 small, while the classical results in the compact case only
require M = Graph(h) ∈ C 1 . Similarly, h ∈ Cbk with k ≥ 3 is assumed in [Sak90,
p. 50], while hypothesis H2 in [BLZ99, p. 987] is imposed to bound ‘twisting’ of
the invariant manifold. This requirement seems closely related to our condition on h,
and is necessary for the same reason as in our Theorem 3.1: to construct a tubular
neighborhood of uniform size. I do not know whether these stronger assumptions
can be weakened or removed. ♦
We shall finally put all the ingredients together to obtain exponential growth estimates
for perturbed flows contained in the tubular neighborhood y ≤ η of X × Y . We
write the perturbed vector field ṽ on X × Y as
ẋ = ṽ X (x, y),
ẏ = ṽY (x, y) = A(x) y + f˜(x, y), (3.16)
where f˜(x, y) = f (x, y) + ṽY (x, y) − vY (x, y) .
Let us assume that the conditions of Theorem 3.2 hold true. First, if ρ M = 0,
then for any fixed r we can always slightly increase6 to ρ M > 0, such that the
growth rates (1.10) and spectral gap condition ρ− < −r ρ M still hold true; this way,
we get rid of degenerate exponentials in integrals. We have some ‘spectral space’
ρ = r ρ M − ρ− > 0 that we use to define modified exponential growth numbers
6 We have ρ M ≥ 0 from (1.10). Note that we are interested in −ρ M for the stable side of the
spectrum. In the rest of this chapter, all exponential rates will be negative.
96 3 Persistence of Noncompact NHIMs
ρ
ρ X = −ρ M − , C X = 2 C̃ M ,
4
ρ
ρY = ρ− + , CY = C̃− . (3.17)
4
This allows us to get all perturbed flows within these slightly modified growth rates,
while we reserve another ρ/2 spectral space for later use, such as proving (higher
order) differentiability. Note that both ρY , ρ X are negative since we focus on the
stable normal bundle.
We first fix some notation to be used throughout the proof:
• Cv denotes the global C k,α bound on v and ṽ.
• ε(ν) denotes the perturbation size in Lemma 3.12 depending on the smooth-
ing convolution parameter ν from Lemma 2.34, while Cv (ν) denotes the C k,α
bound onA, f˜, which may grow due to smoothing when ν → 0. We also have
Ak−1 , f˜k−1 ≤ Cv .
• ζ denotes a small bound both on the derivative of f˜ and on perturbations of the
horizontal vector field on X , that is, we impose bounds
sup D f˜(x, y) ≤ ζ and sup ṽ X (x, y) − v X (x, h(x)) ≤ ζ,
x∈X x∈X
y≤η y≤η
The constants and bounds include (3.17) and Cv , and are all fixed. Note that there are
no circular dependencies, so we are free to choose any of these parameters smaller
if necessary without the risk of having unsatisfiable constraints.
By invariance of M = {y = h(x)} we have
dy
vY (x, h(x)) = = Dh(x) · v X (x, h(x)). (3.19)
dt
This can be used to estimate vY (x, h(x)) ≤ σ1 Cv and derived estimates, such as
(taking the derivative with respect to x)
3.5 Growth Estimates for the Perturbed System 97
where σ1 ≤ 1 has been assumed. Together with previous estimates, this leads to
while y − h(x) ≤ η + σ1 . We insert (3.19) and apply Proposition C.3 another time
to obtain
•
Dix vY (x, h(x)) = Dix Dh(x) · v X (x, h(x)) − y vY (x, h(x)) · Pm,i−l D h(x) .
Dlx Dm
l≥0,m≥1
l+m≤i
This expression can be made small since Dlx Dmy vY ≤ Cv and each term contains
at least one factor D j h(x) for some 0 ≤ j ≤ k.
Remark 3.15 Note that we cannot improve the result to a C k size estimate, since
Ak ≤ C(ν) may grow with ν → 0, while compensating this by choosing η smaller
would introduce a circular dependency in (3.18). ♦
As the next step, we will derive exponential growth estimates for the perturbed
system (3.16). More generally, we consider the horizontal flow
y and vertical, linear
flow x generated by
∀ x0 ∈ X, t ≤ t0 : D
(t, t0 , x0 ) ≤ C eρ(t−t0 ) . (3.25)
Lemma
3.17 (Perturbation of linear flow) Let Y be a Banach space and let A ∈
Cb0 R; L(Y ) generate a flow (t, t0 ) with growth estimate
˜ t0 ) ≤ ψ̃(t, t0 ).
(t,
Thus, let
˜ t0 ) > ψ̃(t, t0 ).
t1 = inf {t ∈ R|t ≥ t0 and (t,
We insert these estimates into the integral equality (3.27) and obtain
t
˜ t0 ) = ψ̃(t, t0 ) + f (t) ≤ ψ(t, t0 ) +
(t, ψ(t, τ ) δ ψ̃(τ, t0 ) + f (τ ) dτ
t0
t
≤ ψ̃(t, t0 ) + ψ(t, τ ) δ f (τ )dτ
t1
≤ ψ̃(t, t0 ) + (t1 − t) δ sup ψ(t, τ ) sup f,
τ ∈[t1 ,t] [t1 ,t]
where we used that ψ̃(t, t0 ) satisfies (3.28) and that f |[t0 ,t1 ] ≤ 0. Now we choose t2
and therefore t sufficiently small that (t − t1 ) δ supτ ∈[t1 ,t] ψ(t, τ ) ≤ q < 1, which
leads to the contradiction
Here Bη (R; Y ) denotes the closed ball of radius η in the space of bounded continuous
functions R → Y .
Proof Define the non-autonomous system v(t, x) = ṽ X (t, x, y(t)). This system is
a C 1 small perturbation of (v X ◦ g)(x), uniformly in t:
v(t, x) − (v X ◦ g)(x) ≤ ṽ X (x, y(t)) − v X (x, y(t)) + v X (x, (t)) − v X (x, h(x))
≤ δ + εDv (η + σ1 ),
Dx v(t, x) − D(v X ◦ g)(x) ≤ Dx ṽ X (x, y(t)) − Dx v X (x, y(t))
+ Dx v X (x, y(t)) − Dx v X (x, h(x))
≤ δ + εDv (η + σ1 ),
where εDv denotes the uniform continuity modulus of v and its first derivative,
which can be made small by choice of η, σ1 . We apply Lemma 3.16 to obtain
exponential growth numbers C X , ρ X for (3.24a) by choosing δ + εDv (η + σ1 )
sufficiently small.
The following definition and lemma for flows on X are again formulated in back-
ward time, similar to Lemma 3.16.
Definition 3.19 (Approximate solution) Let X be a Riemannian manifold and
v(t, x) a time-dependent vector field on X . We call a continuous curve x : R → X
a (β, T )-approximate solution of v if for each interval [t2 , t1 ] ⊂ R with t1 − t2 ≤ T
and associated exact solution curve ξ of v with initial condition ξ(t1 ) = x(t1 ), it
holds that
sup d(x(t), ξ(t)) < β. (3.29)
t2 ≤t≤t1
It would have been easier to define approximate solutions as C 1 curves x such that
ẋ(t) − v(t, x(t)) < β. We shall want to work with C 0 -norms, though, and C 1
curves do not form a complete space under such norms. We use this continuous
curve definition to avoid any complications associated with non-completeness. We
still have the following result, as a discretized variant on variation by constants
estimates.
Lemma 3.20 (Growth of approximate solutions) Let X be a Riemannian mani-
fold, v(t, x) a time-dependent vector field on X , and x a (β, T )-approximate solution
of v. Assume that v generates a flow
t,t0 that satisfies the exponential growth esti-
mate (3.25) with C ≥ 1, ρ < 0. Let ξ0 denote the exact solution of v with initial
condition ξ0 (0) = x(0).
Then the distance dρ (x, ξ0 ) is finite on the interval (−∞, 0], and explicitly
bounded by
102 3 Persistence of Noncompact NHIMs
C
dρ (x, ξ0 ) ≤ β 1 + . (3.30)
1 − eρ T
Proof Let ξi with i ∈ N be the associated exact solutions of x that satisfy (3.29)
on
the interval [−(i + 1)T, −i T ]. We have d ξi (−(i + 1)T ), ξi+1 (−(i + 1)T ) < β.
Hence, dρ (ξi , ξi+1 ) < β C eρ(i+1)T on the interval (−∞, (i + 1)T ] by the exponen-
tial growth estimate.
Thus on each interval [−(i + 1)T, −i T ] we can use the triangle inequality to
estimate
i−1
dρ (x, ξ0 ) ≤ dρ (x, ξi ) + dρ (ξ j , ξ j+1 )
j=0
i−1
< β eρ i T + β C eρ( j+1)T
j=0
C
≤β 1+ .
1 − eρ T
Note that A1 is bounded close to D y vY 1 ≤ Cv , and (3.15) and (3.20) were used
to estimate the third term. We thus have B(t) ≤ δ for any δ > 0 when σ1 , ν, β
are sufficiently small. Hence by Lemma 3.17, we have x (τ, τ0 ) ≤ C̃− eρ̃(τ −τ0 )
for any τ, τ0 ∈ [ti , ti + T ].
Now we cover the interval [t0 , t] by intervals [t0 + (i − 1)T, t0 + i T ] with cor-
responding exact solutions xi that approximate x. As in the proof of Lemma 3.16,
we write t − t0 = n T + τ and use the group property of the flow to obtain
3.5 Growth Estimates for the Perturbed System 103
n
x (t, t0 ) ≤ C̃− eρ̃ T C̃− eρ̃ τ ≤ eρY n T C̃− eρY τ = C̃− eρY (t−t0 ) .
Cα C 2 ρ(t−τ )
(1 − 2 )(t, τ ) ≤ e dρ (x1 , x2 )α eα ρ τ (3.31)
−α ρ
d
ϒ(t, τ ) = A(x1 (t)) ϒ(t, τ ) + A(x1 (t)) − A(x2 (t)) 2 (t, τ ), ϒ(t, t) = 0.
dt
By variation of constants we obtain
t
ϒ(t, τ ) ≤ 1 (t, σ ) Cα d(x1 (σ ), x2 (σ ))α 2 (σ, τ )dσ
τ
t
≤ Cα C 2 eρ(t−τ ) dρ (x1 , x2 )α eα ρ σ dσ
τ
Cα C2
≤ eρ(t−τ ) dρ (x1 , x2 )α eα ρ τ .
−α ρ
ρ ρ
T : B̄η (I ; Y ) × X → Bη (I ; Y ),
ρ ρ
TX : B̄η (I ; Y ) × X → Bβ (I ; X ), (3.32)
ρ ρ ρ ρ
TY : Bβ (I ; X ) × Bη (I ; Y ) → Bη (I ; Y ) ⊂ B̄η (I ; Y ),
TX (y, x0 )(t) =
y (t, 0, x0 ). (3.33)
where x is the flow of A(x(t)). This should ease proving smoothness properties of
T , which will subsequently imply smoothness of the invariant manifold.
ρ
Remark 3.24 Note that Bβ (I ; X ) is not a Banach space or even a complete metric
space.
It will only appear as an intermediate space in the composition T = TY ◦
TX , pr1 though, so this does not affect the Banach fixed point arguments, as long
ρ
as the mappings TX , TY compose to a contraction T on B̄η (I ; Y ) uniformly in the
parameter x0 ∈ X . ♦
The following two propositions show that the maps TX , TY do indeed map into
their specified codomains, when parameters are chosen sufficiently small.
Proposition 3.25 If ζ is chosen such that (3.36) holds and δ, σ1 are sufficiently
small, then TY maps into Bη (I ; Y ).
ρ
Proof The conditions of Proposition 3.21 are satisfied for any x ∈ Bβ (I ; X ), so
the flow x of system (3.24b) satisfies exponential growth estimates with numbers
ρY , C Y .
Now, TY maps into Bη (I ; Y ) since
t
TY (x, y)(t) ≤ (t, τ ) f˜(x(τ ), 0) + D y f˜ y(τ ) dτ
−∞
CY
≤ (Cv σ1 + δ + ζ η) (3.35)
−ρY
CY ζ 1
≤ (3.36)
−ρY 2
C X ζ ρX T
e < β, (3.37)
−ρ X
Hence, a fixed point of (TX , T̂Y ) is a fixed point of (TX , TY ). The last implication
can readily be verified by calculating the time derivatives of x = TX (y, x0 ) and
y = TY (x, y) to show that (x, y) is a solution of (3.16) with x(0) = x0 .
Next, we prove that both TX , TY are Lipschitz, while the Lipschitz constant of TY
can be made arbitrarily small.
Lemma 3.28 Let ρY < ρ ≤ ρ X and 0 < qY < 1. If ζ is sufficiently small, then
Lip(TY ) ≤ qY .
ρ ρ
Proof Let (xi , yi ), i = 1, 2 be curves from Bβ (I ; X ) × Bη (I ; Y ). Let i be the
corresponding flows of A along the curves xi . Then the application of Lemma 3.22
in the Lipschitz case α = 1 leads to a Lipschitz estimate on TY for any ρY < ρ ≤ ρ X :
Here C < ∞ depends only on the constants and additional integration factors
(ρ − ρY )−1 in the last integral, hence ζ C ≤ qY when ζ is small enough.
Lemma 3.29 Let ρY < ρ < ρ X . If ζ, η are sufficiently small, then Lip(TX ) ≤ q X
for some q X > 1 independent of all small parameters.
ρ
Proof Let ξ1 , ξ2 ∈ X and y1 , y2 ∈ Bη (I ; Y ). For i = 1, 2, define vi (t, · ) =
ṽ X ( · , yi (t)) and let xi ∈ C 1 (I ; X ) be a solution of the system vi with initial condition
ξi . We compare the systems v1 , v2 :
v1 (t, · ) − v2 (t, · ) ≤ D y ṽ X y1 (t) − y2 (t) ≤ Cv y1 (t) − y2 (t).
The flow
y1 has exponential growth numbers ρ X , C X . We view v2 as a small per-
turbation of v1 and apply the nonlinear variation of constants estimate (E.2) to obtain
108 3 Persistence of Noncompact NHIMs
0
ρX t
d(x1 (t), x2 (t)) ≤ C X e d(ξ1 , ξ2 ) + C X eρ X (t−τ ) Cv y1 (τ ) − y2 (τ )dτ
t
0
ρX t
≤ CX e d(ξ1 , ξ2 ) + C X Cv y1 − y2 ρ eρ X (t−τ ) eρ τ dτ
t
C X Cv
≤ C X eρ X t d(x1 , x2 ) + y1 − y2 ρ eρ X t .
ρX − ρ
Now
C X Cv
dρ TX (y1 , ξ1 ), TX (y2 , ξ2 ) ≤ sup C X d(ξ1 , ξ2 ) e(ρ X −ρ) t + y1 − y2 ρ e(ρ X −ρ) t
t≤0 ρX − ρ
C X Cv
≤ C X d(ξ1 , ξ2 ) + y1 − y2 ρ
ρX − ρ
exhibits a Lipschitz constant q X for TX that does not depend on any of the small
parameters.
Proposition 3.30 (Extension of solution is bounded in Y ) Let (x, y)(t) be a solu-
tion of the perturbed system (3.16) satisfying y ∈ Bη (I ; Y ) for t ≤ 0. For ζ, δ, σ1
sufficiently small, the forward extension to t ≥ 0 has y ∈ Bη (R; Y ).
Proof First of all, choose ζ, δ, σ1 sufficiently small such that by (3.35), we have
y ∈ Bη/2 (I ; Y ). Proceeding by contradiction, let t0 be the first time after which y(t)
becomes larger than η, thus
We finally put things together and prove that a unique persistent manifold M̃ exists
and that it is Lipschitz.
Since TX satisfies a fixed Lipschitz estimate, we can choose ζ small enough to
ρ
obtain q X ·qY < 1. Thus, T is a contraction on B̄η (I ; Y ) for each fixed ρY < ρ < ρ X ;
ρ
ζ will depend on ρ though. According to Proposition 3.23, B̄η (I ; Y ) is a complete
metric space, so the Banach fixed point theorem shows that there is a unique y ∈
ρ ρ
B̄η (I ; Y ) fixed point of T ; it holds moreover that y ∈ Bη (I ; Y ). This contraction
also depends (uniformly) on the parameter x0 ∈ X , hence we obtain a fixed point
map
∞ : X → Bηρ (I ; Y ), (3.39)
3.6 Existence and Lipschitz Regularity 109
The superscript ∞ indicates that this map is obtained as a limit of applying the
uniform contraction T . The parameter dependence in T is Lipschitz, so the map
∞
will be Lipschitz as well.
By Proposition 3.30, the fixed point y =
∞ (x0 ) is bounded by η for all time
ρ
and Bη (I ; Y ) = Bη (I ; Y ) as sets, so y is the unique η-bounded solution with partial
initial data x(0) = x0 . In combination with the evaluation map y → y(0), we obtain
the mapping
h̃ : X → B(0; η) ⊂ Y : x0 →
∞ (x0 )(0). (3.41)
Its graph M̃ = Graph(h̃) is the unique invariant manifold of the modified sys-
tem (3.16) and is Lipschitz as well.
by graphs of small functions X → Y , it follows
Since both M and M̃ are described
that they are homeomorphic and h̃ 0 ≤ η. We can choose another, arbitrarily small
η instead. This requires us to choose smaller δ , σ1 parameters as well. But as can
be seen from (3.18), η does not depend on δ, σ1 , so the newly found h̃ 0 ≤ η will
actually be unique in the original η-sized neighborhood as well.
3.7 Smoothness
All the remaining terms are expressions in the lower order derivatives Dn
(x0 ) for
n < k only; these form the base space in the fiber contraction theorem.
ρ
The derivatives Dk
and Dly Dm
x0 T in (3.42) do not exist on the space Bη (I ; Y ) as
codomain, however. Indeed, if they did, we could have applied the implicit function
110 3 Persistence of Noncompact NHIMs
theorem right away. Instead, the derivatives Dk
are only well-defined on spaces7
B kρ+μ (I ; Y ), where μ < 0 is an arbitrarily small additional exponential growth
rate. Derivatives of the maps TX , TY do not exist at all. By using the fiber contraction
theorem and interpreting the Dk TX , Dk TY as ‘formal derivatives’ in some appropriate
way, we can still show, though, that the Dk
are higher derivatives of
that converge
to the derivatives of
∞ under iteration of the fiber contraction maps (3.42). The
gap condition ρY < r ρ X will show up in the requirement that (3.42) is contractive
for k ≤ r (and finally k + α ≤ r when considering Hölder continuity). In case of
uniform continuity (i.e. when α = 0) we make use of the strict inequality to seize
some of the spectral space left for the terms μ < 0.
The interpretation of DT and its constituents DTX and DTY as true derivatives is
ρ ρ
obstructed already by the fact that neither Bη (I ; Y ) nor Bβ (I ; X ) are smooth Banach
manifolds8 , hence these can never be the (co)domain of differentiable maps. Thus,
the chain rule
D
n+1 = D y T · D
n + Dx0 T
7 Note that the spaces B kρ+μ (I ; Y ) are to be understood as the codomains of the maps
, hence
the Dk
as multilinear operators into these. The spaces Yη play the same role in [Van89, Def. 3.10].
8 At least, they are not smooth Banach manifolds in a natural way, see the discussion in Sect. 3.7.4.
3.7 Smoothness 111
The map T is not differentiable. Instead, we shall use the scheme below to obtain
ρ
differentiability of
∞ . The sequence {
n }n≥0 of maps X → Bη (I ; Y ) is defined
by
(x0 ) = T (
(x0 ), x0 ) and
≡ 0. We prove the differentiability of the
n
n+1 n 0
(
n , D
n ) : TX → Bηρ (I ; Y ) × B ρ+μ (I ; Y )
→ DT (
) · D
∞ : C 0 X ; Bηρ (I ; Y ) → b L TX ; T Bηρ+μ (I ; Y )
ρ+μ
into bounded sections of the bundle π : L TX ; T Bη (I ; Y ) → X is contin-
uous. Thus we can conclude that D
n converges in b L TX ; B ρ+μ (I ; Y )
to the unique fixed point D
∞ , simultaneously with
n →
∞ . See (3.45)
and (3.47) for precise definitions of these spaces. Moreover, D
∞ is uniformly
or Hölder continuous.
5. There is a family of maps, given by restricting the domain I of curves,
that approximate T , and moreover these T b,a are differentiable maps between
Banach manifolds whose derivatives DT b,a approximate the formal deriva-
tive DT .
6. With the continuous embedding B ρ (I ; Y ) → B ρ+μ (I ; Y ) and the previous point,
we show that if
n : X → B ρ+μ (I ; Y ) is differentiable, then
112 3 Persistence of Noncompact NHIMs
D
n+1 = D y T · D
n + Dx0 T
We first explicitly give the candidate mappings for the derivatives of TX , TY . From
now on, we will use shorthand notation x y (t) = TX (y, x0 )(t) =
y (t, 0, x0 ). The
spaces that these maps act on will be made more precise in the following sections;
ρ ρ
δx, δy denote variations of curves x ∈ Bβ (I ; X ) and y ∈ Bη (I ; Y ), respectively,
and δx0 ∈ Tx0 X .
Dx0 TX (y, x0 ) δx0 (t) = D
y (t, 0, x0 ) · δx0 (3.43a)
0
D y TX (y, x0 ) δy (t) = D
y (t, τ, x y (τ )) D y ṽ X (x y (τ ), y(τ )) δy(τ )dτ,
t
(3.43b)
t
Dx TY (x, y) δx (t) = x (t, τ ) Dx f˜(x(τ ), y(τ )) δx(τ )
−∞
+ Dx x · δx (t, τ ) f˜(x(τ ), y(τ ))dτ, (3.43c)
t
D y TY (x, y) δy (t) = x (t, τ ) D y f˜(x(τ ), y(τ )) δy(τ )dτ, (3.43d)
−∞
t
Dx x · δx (t, τ ) = x (t, σ ) DA(x(σ )) δx(σ ) x (σ, τ )dσ. (3.43e)
τ
We establish uniform boundedness of the formal derivative maps (??) as linear oper-
ators on δx, δy, and δx0 . The estimates are straightforward generalizations of those
in Sect. 3.6. The operator norms are induced by · ρ norms. We have the following
list of estimates:
Dx0 TX (y, x0 ) = sup D
y (t, 0, x0 )δx0 e−ρ t ≤ sup C X eρ X t e−ρ t ≤ C X ,
t∈I t∈I
δx0 =1
D y TX (y, x0 ) ≤ sup
t∈I
δyρ =1
0
D
y (t, τ, x y (τ )) D y ṽ X (x y (τ ), y(τ )) δy(τ )dτ · e−ρ t
t
0
C X Cv
≤ sup C X eρ X (t−τ ) Cv eρ(τ −t) dτ ≤ ,
t∈I t ρX − ρ
t
D y TY (x, y) ≤ sup x (t, τ ) D y f˜(x(τ ), y(τ )) δy(τ )dτ · e−ρ t
t∈I −∞
δyρ =1
t
CY ζ
≤ sup CY eρY (t−τ ) ζ eρ(τ −t) dτ ≤ ,
t∈I −∞ ρ − ρY
t
Dx x · δx (t, τ ) ≤ x (t, σ ) DA(x(σ )) δx(σ ) x (σ, τ )dσ
τ
t
≤ CY eρY (t−σ ) Cv δxρ eρ σ CY eρY (σ −τ ) dσ
τ
C2 C
δxρ eρY (t−τ ) eρ t ,
v
≤ Y
−ρ
t
Dx TY (x, y) ≤ sup x (t, τ ) Dx f˜(x(τ ), y(τ )) δx(τ )
t∈I −∞
δxρ =1
+ Dx x ·δx (t, τ ) f˜(x(τ ), y(τ)) dτ · e−ρ t
t 2
C Cv ζ ρY (t−τ ) ρ t
≤ sup CY eρY (t−τ ) ζ eρ τ + Y e e dτ · e−ρ t
t∈I −∞ −ρ
CY ζ C 2 Cv ζ
≤ + Y .
ρ − ρY ρ · ρY
114 3 Persistence of Noncompact NHIMs
D y T = Dx TY · D y TX + D y TY ,
Dx0 T = Dx TY · Dx0 TX (3.44)
are bounded linear maps when ρY < ρ < ρ X . Since we have some spectral elbow
room, we can first choose a value for ρ and then choose μ < 0 sufficiently close to
zero, such that this inequality holds both for ρ and ρ + μ. If ζ is sufficiently small,
then D y T ≤ q < 1 can be satisfied. This shows that (T, DT ) is a uniform fiber
ρ
contraction on δy ∈ B ρ+μ (I ; Y ) over base curves y ∈ Bη (I ; Y ), and with additional
parameter x0 ∈ X . It can also be viewed as a fiber mapping of maps D
over base
maps
. Let us define
μ
S0 = C 0 X ; Bηρ (I ; Y ) and S1 = b L TX ; B ρ+μ (I ; Y ) , (3.45)
μ
where S0 is equipped with the supremum norm and S1 is interpreted as bounded
sections of the bounded geometry bundle over X of linear maps between TX and the
trivial bundle π : X × B ρ+μ (I ; Y ) → X , equipped with the (supremum/operator)
norm
D
= sup D
(x0 )L(Tx0 X ;B ρ+μ (I ;Y )) .
x0 ∈X
μ μ
(T, DT ) : S0 × S1 → S0 × S1 ,
(
, D
) → (x0 , δx0 ) → T (
(x0 ), x0 ), (3.46)
D y T (
(x0 ), x0 ) · D
(x0 ) + Dx0 T (
(x0 ), x0 ) · δx0 .
As such, it is again a uniform fiber contraction since the contraction was uniform
in y and x0 to begin with, and the supremum norm does not affect the contraction
factor q < 1.
is no clear way to define local coordinates around a solution curve x. The obvious
method would be by constructing a tubular neighborhood of x and represent nearby
ρ
curves x̃ in the tubular neighborhood. But the metric on Bβ (I ; X ) allows x̃ to diverge
exponentially from x even if dρ (x, x̃) is small. Thus the tubular neighborhood would
need to be of infinite size to contain x̃(t) for all t ∈ I , which is generally not possible.
Since any finite-size tubular neighborhood does not contain a full neighborhood of
the curve x, we cannot use local coordinates to define tangent spaces.
Instead, we shall construct formal tangent bundles. These are just convenient
spaces to model variations of curves on; they are natural extensions of true Banach
tangent spaces, see Sect. 3.7.7. The primary role of these bundles is to introduce a
topology that allows us to show that DT is uniformly or Hölder continuous.
ρ
A formal tangent bundle of Bη (I ; Y ) can be constructed rather easily: B ρ (I ; Y )
is a Banach space, so its tangent bundle is canonically identified as TB ρ (I ; Y ) =
ρ
B ρ (I ; Y ) × B ρ (I ; Y ). We then define the formal tangent bundle of Bη (I ; Y ) by
restricting the base:
T Bηρ (I ; Y ) = TB ρ (I ; Y )| Bηρ (I ;Y ) ∼
= Bηρ (I ; Y ) × B ρ (I ; Y ) (3.47)
ρ
TBβ (I ; X ) = B ρ (I ; x ∗ (TX )). (3.48)
ρ
x∈Bβ (I ;X )
A curve δx lives above a specific base curve x, so there is no direct way of com-
paring two curves δx1 , δx2 with different base curves x1 , x2 ; (3.48) was constructed
as a coproduct without topological structure. We add a topology based on parallel
ρ
transport. This requires the base curves x ∈ Bβ (I ; X ) to be differentiable, so we
consider the bundle
ρ
TBβ (I ; X )C 1 (3.49)
116 3 Persistence of Noncompact NHIMs
ρ
restricted to differentiable9 base curves x ∈ Bβ (I ; X ) ∩ C 1 . Variational curves
ρ ∗ ρ
δx 0∈ B (I ; x (TX )) are isometrically mapped onto curves δx ∈ B (I ; Tx(0) X ) =
C (I ; Tx(0) X ), · ρ by
˜ x : δx → δx,
δx(t) = (x|t0 )−1 δx(t). (3.50)
Let the normal coordinate radius δ X be X -small as in Definition 2.8. If we now restrict
all base curves x under consideration to a small neighborhood
ρ
Ux̄ = {x ∈ Bβ (I ; X ) ∩ C 1 |x(0) ∈ B(x̄; δ X ), (3.51)
i.e., the curves x that start in the open ball B(x̄; δ X ) ⊂ X , then there exists a unique
shortest geodesic γx̄,x(0) from x(0) to x̄ for each x ∈ Ux̄ . Parallel transport along
these geodesics induces a local trivialization 10 of TB( x̄; δ ). This in turn induces a
ρ X
local trivialization of TBβ (I ; X ) C 1 :
ρ τx̄
TBβ (I ; X )|Ux̄ / Ux̄ × B ρ (I ; Tx̄ X ) (3.52)
ttt
t
tt
π ttt
t
tt p1
t tt
ρ yt
Bβ (I ; X )|C 1 ⊃ Ux̄
˜ x (δx) . The tran-
The trivialization map is given by τx̄ (x, δx) = x, (γx̄,x(0) ) ◦
sition maps between overlapping local trivializations Ux̄1 ∩ Ux̄2
= ∅ are induced by
transition functions
ϕ2,1 : B(x̄2 ; δ X ) ∩ B(x̄1 ; δ X ) × Tx̄2 X → Tx̄1 X : (ξ, ν) → (γx̄2 ,ξ ◦ γξ,x̄1 ) · ν
9 This does not cause problems since TX actually maps into curves x ∈ C 1 . The fiber contraction
theorem only requires that the base space has a globally attractive fixed point. Since the fixed point
is a C 1 curve, we can simply restrict to this subset of curves.
10 We make the specific choice to trivialize TB( x̄; δ ) by parallel transport along geodesics. Any
X
other trivialization with uniformly bounded transition maps would also suffice for our purposes
ρ
and induce a trivialization of TBβ (I ; X )|Ux̄ (see also the alternative viewpoint on this trivializa-
tion below). This explicit choice is somewhat natural in this context, though, and it shows that a
trivialization with these properties does exist.
3.7 Smoothness 117
ρ
We endow the bundle TBβ (I ; X )C 1 with the topology induced by these local
trivializations. Note that this topology is induced by a locally defined distance func-
ρ
tion, so we can express uniform and Hölder continuity ofmaps on TBβ (I ; X )C 1 .
ρ
That is, if (x1 , δx1 ) and (x2 , δx2 ) are elements of TBβ (I ; X )C 1 such that dρ (x1 , x2 ) <
δ X , then the topology is induced by the locally defined distance function
˜ x2 (δx2 ) −
d (x1 , δx1 ), (x2 , δx2 ) = dρ (x2 , x1 ) + (γx1 (0),x2 (0) ) ˜ x1 (δx1 )ρ .
(3.53)
The transition functions τx̄2 ◦ τx̄−1
1
are uniformly Lipschitz, so they preserve uniform
and Hölder continuity moduli up to a constant. Therefore, overlapping trivializations
define the same topology on their intersection, with compatible local distances. To
summarize, we have
ρ ρ
Proposition 3.31 The spaces T Bη (I ; Y ) and TBβ (I ; X )C 1 are well-defined nor-
med vector bundles of bounded geometry, and they have a (local) distance structure.
The topologies introduced above allow us to express uniform and Hölder con-
ρ
tinuity of the maps (??). The topology on T Bη (I ; Y ) is clear and explicit from
ρ ρ
the topology on B ρ (I ; Y ). For TBβ (I ; X )C 1 let x ∈ Bβ (I ; X ) ∩ C 1 be a curve
and δx ∈ B ρ (I ; x ∗ (TX )) a variational curve at x. The topology is induced by the
isometric representation
˜ x · δx ∈ B ρ (I ; Tx̄ X )
= (γx̄,x(0) ) ·
δx
ρ
of δx. Uniform continuity of maps (??) that have TBβ (I ; X )C 1 as (co)domain can
thus be checked by switching to a local trivialization, that is, substitute
δx(t) = (x|t0 ) · (γx(0),x̄ ) · δx(t)
and then use the known topology on Ux̄ × B ρ (I ; Tx̄ X ). In explicit calculations of
ρ
continuity with respect to the base Bβ (I ; X )∩C 1 , we shall thus add parallel transport
terms such as those above to the maps (??) and let these act on δx ∈ B ρ (I ; Tx̄ X ).
Alternative Viewpoints
ρ
Instead of the immediate trivialization (3.52) of the bundle TBβ (I ; X )C 1 , we can also
introduce an intermediate viewpoint that corresponds to only applying the parallel
transport term ˜ x , but not (γx̄,x(0) ) in the local neighborhood B(x̄; δ X ). We view
ρ ρ
ev0 : Bβ (I ; X ) → X as a bundle; this identifies TBβ (I ; X )C 1 as a bundle over X
as well, via ev0 ◦ π . Let B ρ (I ; TX ) X denote the space of (continuous, exponential
growth) functions δx : I → TX such that π ◦ δx is constant into X , viewed as a
bundle over X .
118 3 Persistence of Noncompact NHIMs
ρ ˜
TBβ (I ; X )C 1 / B ρ (I ; X )|C 1 × X B ρ (I ; TX ) X
β (3.54)
qqq q
qqq
qqqqq
π qqqp qq
qqq evt ◦ p2
xqqq 1 xqqq
ρ
Bβ (I ; X )|C 1 TX
oo
oo
ev0
o oooo
o π
woooo
X
ρ
This commutative diagram shows that TBβ (I ; X )C 1 can be identified via
˜ with
ρ
the fiber product bundle Bβ (I ; X )|C 1 × X B ρ (I ; TX ) X over X . This identification
is natural in the sense that no local trivialization of X or TX is used. The second
component B ρ (I ; TX ) X of this bundle contains the variational curves δx. This is a
(nontrivial) bundle over X , but its projection onto the base π ◦evt : B ρ (I ; TX ) X → X
factors through TX . The fact that π ◦ evt is constant for t ∈ I simply expresses that
∈ B ρ (I ; TX ) X maps into a fixed tangent space Tξ X . This shows that a local
each δx
trivialization σ : TX | B(x̄;δ X ) → B(x̄; δ X ) × Rn naturally lifts to a local trivialization
σ̃ : B ρ (I ; TX )| B(x̄;δ X ) → B(x̄; δ X ) × B ρ (I ; Rn ).
11 The precise choice does not matter and will drop out in the final, relevant equations. The relative
choice of frame along curves is what matters.
12 Note that e does not define a (global) frame on TX . The choice of frame at x(t) depends not just
ρ
on the point x(t) ∈ X , but on the whole curve x ∈ Bβ (I ; X )|C 1 . Another curve x̃ with x(t) = x̃(t)
will generally induce a different frame in Tx(t) X .
3.7 Smoothness 119
v(x(t))e = e−1
x̄ · (γx̄,x(0) ) · (x|t ) · v(x(t)),
0
γt
γ0
x
x1 (t)
120 3 Persistence of Noncompact NHIMs
d t,τ
ϒ = A(x2 (t)) x2 (t, τ ) − A(x1 (t)) x1 (t, τ )
dt
= A(x2 (t)) ϒ t,τ + A(x2 (t)) − A(x1 (t)) x1 (t, τ ),
Here we use ideas from Appendix B; we applied Corollary B.3 to obtain A as a uni-
ρ
formly continuous fiber mapping Bβ (I ; X ) → B μ (I ; L(Y )) with continuity modu-
lus ε Ã (that depends on μ).
ρ
Thus, the flow xt,τ depends uniformly continuously on x ∈ Bβ (I ; X ) when viewed
as a flow with ρY -exponential growth and measured with an additional exponential
factor eμ τ .
Remark 3.33 In the previous proposition, if A is α-Hölder continuous, then we can
replace μ by α ρ to obtain a similar, α-Hölder continuous result using Lemma B.2.
♦
To show that x → Dx x (t, τ ) is continuous as well, we first write down the
corresponding variation in the bundle trivialization chart:
3.7 Smoothness 121
− Dx x1 · δx
Dx x2 · δx (t, τ )
t
= ) x2 (t, σ )
x2 (t, σ ) DA(x2 (σ )) (x2 |σ0 ) (γx2 (0),x̄ ) δx(σ
τ
− (2 1)dσ
t
= ) x2 (t, σ ) − (2 1)dσ,
x2 (t, σ ) DA(x2 (σ ))e δx(σ (3.58)
τ
where the notation (2 1) means that we take the first expression and replace all
2’s by 1’s (note that γx2 (0),x̄ = γ0 in the first term and (γx1 (0),x̄ ) = 1 in the second
term). The last line is just a rewrite in terms of the frame as in (3.55) and suppresses
all parallel transport terms. We separately estimate continuity of the three factors
in the integrand, and insert the estimate of Proposition 3.32 for the variation • in
the first and third factor. Note that δx is the same over both curves x1 and x2 in this
trivialization.
For the middle factor DA(x(t))e , we again apply Nemytskii operator techniques
from Appendix B. But in this case we have to combine these with holonomy terms,
due to the fact that comparison of DA at nearby points ξ2 , ξ1 ∈ X only makes sense
after identification of the tangent spaces Tξ2 X and Tξ1 X .
1 according to Definition 2.9. Then for any μ < 0,
Proposition 3.34 Let A ∈ Cb,u
the map
ρ
x → t → DA(x(t))e : Ux̄ ⊂ Bβ (I ; X ) → B μ I ; L(Tx̄ X ; L(Y )) (3.59)
1,α
is uniformly continuous. If moreover A ∈ Cb,u , then the map (3.59) is α-Hölder with
μ replaced by α ρ.
Proof Let x1 , x2 ∈ Ux̄ . We introduce another frame f to directly compare DA
at points x1 (t), x2 (t). Let f x1 (t) = ex1 (t) : Rn → Tx1 (t) X and define f x2 (t) =
(γt ) · f x1 (t) . Thus, the frames e and f at x2 (t) are both defined in terms of the
frame ex̄ ; ex2 (t) by parallel transport along x2 ◦ γ0 and f x2 (t) by parallel transport
along γt ◦x1 , see Fig. 3.7. Since f x1 (t) = ex1 (t) , we can rewrite the difference of (3.59)
at points on these curves as
DA(x2 (t))e − DA(x1 (t))e = DA(x2 (t))e − DA(x2 (t)) f
+ DA(x2 (t)) f − DA(x1 (t)) f .
The first term can be estimated by the holonomy defect along the loop
using Lemma 2.19 and the second term using the continuity of DA and
Proposition 2.13. Together, this leads to
122 3 Persistence of Noncompact NHIMs
1
We absorbed all constants and integration factors such as −μ into the general constant
C and combine the continuity moduli into one; Cv is a global bound on all vector
fields including A and its derivatives.
We finally plug estimate (3.60) into Eq. (3.43c). We repeat the Nemytskii and
holonomy arguments for f˜ and Dx f˜ (just as for A and DA) to obtain a uniform
continuity estimate for
3.7 Smoothness 123
x → Dx TY (x, y) : Ux̄ → L B ρ (I ; Tx̄ X ); B ρ+μ (I ; Y ) .
1,α
both for any μ < 0, or with μ = α ρ when A, f ∈ Cb,u . That is, Dx TY ( · , y) is a
ρ
map that given a curve x ∈ Bβ (I ; X )|C 1 , linearly maps a variational curve δx over
ρ
x to a variational curve δy in the trivial bundle T Bη (I ; Y ). We can formulate this
more abstractly as
ρ ρ
α
Dx TY ( · , y) ∈ b,u Bβ (I ; X )|C 1 ; L TBβ (I ; X )C 1 ; B ρ+μ (I ; Y ) ,
We treated the continuity for one of the maps (??) with respect to a single variable.
The continuity in all other cases can be shown in a similar fashion. Many arguments
can be repeated, but each of these maps also has its own peculiar details which
makes that I have not been able to find one general, abstract way to prove continuity
of all of these maps at once. In this section we shall focus on these specific details
and not repeat the recurring elements. Let me reiterate that the uniform continuity
results hold for any μ < 0 sufficiently small, and these can be replaced by α-Hölder
continuity when μ is replaced by α ρ and the spectral gap condition (1.11) is satisfied
for r = 1 + α.
First of all, note that continuity with respect to the combined variables follows
directly from continuity with respect to each separate variable since we have explicit
uniform or Hölder continuity moduli. If f (x, y) has continuity moduli εx , ε y with
respect to x, y, respectively, then
ρ+μ
with a local trivialization Ux̄ × B ρ+μ (I ; Tx̄ X ) within13 the bundle TBβ (I ; X )C 1
with additional μ in the exponential growth norm on the fibers. Note that Dx0 TX (y, · )
could actually be considered as a bundle map on the vector bundle TX that is linear
on each tangent space Tx0 X . We consider a local trivialization of TB(x̄; δ X ) ⊂
TX by parallel transport along geodesics: this is equivalent to trivialization by a
normal coordinate chart for the purpose of measuring continuity, while it matches
ρ
the trivialization of TBβ (I ; X )|Ux̄ . This will lead to a holonomy term.
For any x0 ∈ B(x̄; δ X ) we have TX (y, x0 ) ∈ Ux̄ by construction, so let e denote
ρ
the frame introduced by the trivialization of TBβ (I ; X )|Ux̄ , i.e. by parallel transport
along solution curves TX (y, x0 ). On the other hand, let f denote a frame introduced
by local parallel transport. We define
13 Embeddings B ρ → B ρ+μ are continuous, so we can view Ux̄ × B ρ+μ (I ; Tx̄ X ) as a local
ρ
trivialization of a subset of TBβ (I ; X ) with ρ = ρ + μ.
3.7 Smoothness 125
D
y (t, 0, x0,2 )e − D
y (t, 0, x0,1 )e
= D
y (t, 0, x0,2 )e − D
y (t, 0, x0,2 ) f
+ D
y (t, 0, x0,2 ) f − D
y (t, 0, x0,1 ) f
= (γ0−1 ◦ x2 |0t ) − (x1 |0t ◦ γt−1 ) · D
y (t, 0, x0,2 ) · (γ0 )
+ D
y (t, 0, x0,2 ) f − D
y (t, 0, x0,1 ) f .
This shows that uniform and Hölder continuity with respect to the topology of
ρ
TBβ (I ; X )C 1 is equivalent to the same continuity with respect to normal coor-
dinate charts, since the additional holonomy term can be estimated in the same way
as in Proposition 3.34. Continuity of
y → Dx0 TX (y, x0 ) : Bηρ (I ; Y ) → L Tx0 X ; B ρ+μ (I ; Tx̄ X )
follows in the same way, if we first apply Corollary C.12 to obtain the continuity
estimates with respect to the frame f .
Finally, we consider continuity of the map (3.43b),
D y TX (y, x0 ) ∈ L B ρ (I ; Y ); B ρ+μ (I ; Tx̄ X )
ρ
with respect to y ∈ Bη (I ; Y ) and x0 ∈ X . We apply Corollary C.12 and Lemma 3.29
to conclude that D
y (t, τ, x y (τ )) depends α-Hölder or uniformly continuously
on y. Lemma 3.29 in combination with a Nemytskii operator argument shows that
D y ṽ X induces a uniformly continuous map
y → t → D y ṽ X (x y (t), y(t)) : Bηρ (I ; Y ) → B μ I ; L(Y ; TX )
In Sect. 3.7.3 we already established that the fiber mapping (T, DT ) in formula (3.46)
is uniformly contractive. With the results of the previous sections on formal tangent
bundles and continuous formal derivatives, we can now apply the fiber contraction
theorem, see Appendix D.
126 3 Persistence of Noncompact NHIMs
Proposition 3.36 For any μ < 0, the fiber mapping (3.46) has a unique, globally
μ
attractive fixed point (
∞ , D
∞ ) ∈ S0 × S1 , while it also holds that D
∞ ∈ S10 .
μ
Proof In the notation of Theorem D.1 we take X = S0 and Y = S1 as in (3.45) with
ρ, μ such that ρY < ρ + μ < ρ < ρ X holds. The fiber mapping is F = (T, DT ), as
in (3.46). The first two conditions of Theorem D.1 are satisfied due to the arguments
in Sect. 3.7.3, while the third condition that DT is continuous can be obtained from
the results in Sect. 3.7.5 as follows.
First, note that (T, DT ) is a well-defined, uniformly contractive fiber mapping
both when acting on B ρ (I ; Y ) and on B ρ+μ (I ; Y ) variational curves. Thus, for each
μ
n ≥ 0 we have D
n ∈ S10 → S1 , where the embedding is continuous. The same
∞
conclusion holds for D
by a simple uniform contraction argument. Next, we
view DT as a map
μ
DT : S0 × S10 → S1 . (3.61)
Note that we set μ = 0 in the domain only. To obtain continuity of (3.61) with respect
to the base variable
∈ S0 , it is sufficient to check that the maps
y → D y T (y, x0 ) : Bηρ (I ; Y ) → L B ρ (I ; Y ); B ρ+μ (I ; Y ) ,
y → Dx0 T (y, x0 ) : Bηρ (I ; Y ) → L Tx0 X ; B ρ+μ (I ; Y ) (3.62)
The · ρ2 ,ρ1 denote operator norms on linear (bundle) maps from B ρ1 to B ρ2 spaces.
In the factor that is not varied we can simply take the operator norm between functions
of either ρ or ρ + μ exponential growth: in Sect. 3.7.3 we have seen that the fiber
3.7 Smoothness 127
maps are uniformly bounded linear in both cases. The factor that is varied satisfies a
uniform continuity estimate in · ρ+μ,ρ -norm, a result from Sect. 3.7.5. Note that
ρ
we use the topology defined in Sect. 3.7.4 on the intermediate space TBβ (I ; X )C 1 ,
as well as a local trivialization to express the difference D y TX (y2 , x0 )−D y TX (y1 , x0 ).
As a result of the fiber contraction theorem, we conclude that there is a unique,
globally attractive fixed point (
∞ , D
∞ ) of the fiber mapping (3.46). Note that
D
∞ is already well-defined as an element of S10 , although it is only proven to be
μ
attractive in S1 .
As a next step, we show that the fixed point map D
∞ that we found is actually
continuous. This follows from a standard uniform contraction argument.
μ
Proposition 3.37 For any μ < 0, the map D
∞ ∈ S1 is uniformly continuous. If
we set μ ≤ α ρ and the assumptions of Theorem 3.2 are satisfied with r ≥ 1 + α,
then it is α-Hölder continuous.
Proof First note that it is sufficient to prove the statement for μ < 0 sufficiently
small, or μ = α ρ in case of α-Hölder continuity; by continuous embedding of
exponential growth spaces, it then automatically follows for any μ that is more neg-
ative. We use local trivializations by parallel transport to express continuity moduli
of functions with domain TX .
The assumptions of Theorem 3.2 imply that the spectral gap condition ρY <
ρ + μ < ρ < ρ X is satisfied. Since D
∞ is (the fiber part of) the fixed point of the
uniform contraction (T, DT ), we have for any two x1 , x2 ∈ B(x̄; δ X ) ⊂ X that
1
D
∞ (x2 ) − D
∞ (x1 )ρ+μ ≤ εD y T (L + 1)d(x2 , x1 )
1−q
+ εDx T (L + 1)d(x2 , x1 ) .
μ
This shows that D
∞ ∈ S1 has the same type of continuity modulus as DT .
We can recover the maps (??) as true derivatives on Banach manifolds if we restrict
to bounded time intervals J ⊂ I = R≤0 . The maps TX , TY naturally restrict to such
intervals, either exactly, or in a well-behaved approximate way. By restricting to
ρ ρ
intervals J = [a, 0] with a < 0, the spaces Bη (J ; Y ) and Bβ (J ; X ) become Banach
manifolds and the restrictions of TX , TY become continuously differentiable maps
on these.
ρ ρ
Lemma 3.38 For any −∞ < a < 0, the spaces Bη (J ; Y ) and Bβ (J ; X ) with
J = [a, 0] a bounded interval are well-defined Banach manifolds.
ρ
Proof We first treat the easy case Bη (J ; Y ). For any −∞ < a < 0, the norms · ρ
ρ
and · 0 are equivalent on B ρ (J ; Y ). The set Bη (J ; Y ) is an open ball of radius η
ρ
in the Banach space B (J ; Y ), so it follows that Bη (J ; Y ) is a Banach manifold as
0
ρ
an open subset of B (J ; Y ).
In the same way, the metrics dρ and d0 are equivalent on B ρ (J ; X ), but here we
need to do a little more work to show the following.
ρ
Proposition 3.39 The set Bβ (J ; X ) is open in B ρ (J ; X ).
ρ
Proof Let x ∈ Bβ (J ; X ), hence by Definition 3.19, x is approximated on each
interval of length |[t1 , t2 ]| ≤ T by t →
(t, t2 , x(t2 )), where
denotes the flow of
v X ◦ g, the horizontal part of the unperturbed vector field (3.10). The map
(t, t2 ) → d x(t),
(t, t2 , x(t2 ))
e−ρ a
η2 = (β − η1 ) .
1 + C X eρ X T
3.7 Smoothness 129
This shows that all functions in the ball B(x; η2 ) ⊂ B ρ (J ; X ) are still (β, T )-
ρ
approximate solutions of v X ◦ g, and thus Bβ (J ; X ) is open.
ρ
From here on we shall not always precisely distinguish between Bβ (J ; X ) and
B ρ (J ;
X ) anymore.
We introduce a local coordinate chart κx around a curve x ∈ B ρ (J ; X ) using the
exponential map (see also [Kli95, Sect. 2.3]):
κx : Ux ⊂ B ρ (J ; X ) → B ρ (J ; x ∗ (TX )) : ξ → t → exp−1
x(t) (ξ(t)) . (3.64)
κx2 ◦ κx−1
1
: B ρ (J ; x1∗ (TX )) → B ρ (J ; x2∗ (TX )) (3.65)
Tx B ρ (J ; X ) ∼
= B ρ (J ; x ∗ (TX )) (3.66)
shows that · ρ is the canonical norm on the chart B ρ (J ; x ∗ (TX )). Then v =
d
ρ ρ ∗
ds ξs s=0 represents a tangent vector in Tx B (J ; X ), while v ∈ B (J ; x (TX )) by
construction.
This completes our exposition of the manifold structure of B ρ (J ; X ). We shall
again exclusively make use of induced normal coordinate charts (3.64), in order to
use results on bounded geometry.
The map TX can be restricted to curves on any subinterval J = [a, 0] ⊂ I = R≤0 .
Let us introduce the restriction operator on curves
be an induced normal coordinate chart centered around the curve x = TXa (y, x0 ).
The map TX is Lipschitz, so for s sufficiently small, TXa (y + s δy, x0 ) maps into
B(x; δa ). The vector field ṽ X ( · , (y + s δy)(t)) depends smoothly on the parameter
s and generates xs = TXa (y + s δy, x0 ). We apply Theorem E.2 with
3.7 Smoothness 131
d
ṽ X x(t), (y + s δy)(t) = D y ṽ X (x(t), y(t)) · δy(t)
ds s=0
for some σt ∈ (0, s). Note that σt will in general depend on t ∈ J , so there is (a priori)
not one curve y + σ δy such that (3.70) holds for all t ∈ J at once. In Sect. 3.7.5 we
ρ
showed that D y TX : T Bη (I ; Y ) → TB ρ+μ (I ; X )|C 1 is continuous; on the bounded
interval J the norms · ρ and · ρ+μ are equivalent, so D y TX is continuous into
ρ ∗
B (J ; x (TX )) as well. Using this fact, we plug the result above into the definition
of (directional) derivative and verify
1
lim T a (y + s δy, x0 ) − TXa (y, x0 ) − D y TX (y, x0 ) · s δyρ
s→0 s X
|s|
≤ lim sup D y TX (y + σt δy, x0 ) − D y TX (y, x0 ) δyρ = 0.
s→0 t∈J s
ρ
TYb,a : Bβ ([a, 0]; X ) × Bηρ ([a, 0]; Y ) → Bηρ ([b, 0]; Y ),
t
(x, y) → t → x (t, τ ) f˜(x(τ ), y(τ ))dτ for each t ∈ [b, 0]. (3.71)
a
Proposition 3.42 The family TYb,a approximates TY in the sense that for any fixed
b ∈ (−∞, 0], we have
TYb,a ◦ ρa → ρb ◦ TY (3.72)
ρ ρ
when a → −∞, uniformly in x, y ∈ Bβ (I ; X ) × Bη (I ; Y ).
Proof This follows from straightforward estimates:
a
TYb,a ◦ ρa (x, y) − ρb ◦ TY (x, y)ρ ≤ sup e−ρ t x (t, τ ) f˜(x(τ ), y(τ ))dτ
t∈[b,0] −∞
a
≤ sup e−ρ t CY eρY (t−τ ) ζ dτ
t∈[b,0] −∞
CY ζ ρY (b−a)−ρ b
≤ e .
−ρY
In a same way we define approximate families for (3.43c) and (3.43d), denoted
by Dx TYb,a and D y TYb,a , respectively.
Corollary 3.43 The families Dx TYb,a and D y TYb,a approximate (3.43c) and (3.43d)
in the same way as in Proposition 3.42.
Proposition 3.44 Let −∞ < a ≤ b < 0. Then TYb,a is a differentiable map between
Banach manifolds.
Proof We shall only show that TYb,a is continuously partially differentiable respect
to x. Continuous partial differentiability with respect to y follows along the same
lines and total differentiability then is a direct consequence of these (also in the
Banach manifold setting, see [Lan95, Prop. 3.5]).
ρ ρ
Let x ∈ Bβ (J ; X ) and y ∈ Bη (J ; Y ) with J = [a, 0]. Let κx be an induced
normal coordinate chart around x and let
xs = x + s δx ∈ B ρ (J ; x ∗ (TX ))
ρ
be a one-parameter family of curves in Bβ (J ; X ), represented in the chart κx (for s
sufficiently
small). Then δx ∈ B ρ (J ; x ∗ (TX )) is naturally identified as the derivative
d
x .
ds s s=0
We shall show that the partial derivative Dx TYb,a is given by the formal deriva-
tive (3.43c), but with J as domain of integration and interpreted as a mapping into
B ρ ([b, 0]; Y ). Again, we split the full expression into manageable pieces and apply
the mean value theorem.
3.7 Smoothness 133
TYb,a (xs , y) − TYb,a (x, y) − Dx TYb,a (x, y) · s δx (t)
t
≤ x (t, τ ) f˜(xs (τ ), y(τ )) − x (t, τ ) f˜(x(τ ), y(τ ))
s
a
− x (t, τ ) Dx f˜(x(τ ), y(τ )) sδx(τ ) − (Dx x · sδx)(t, τ ) f˜(x(τ ), y(τ ))dτ
t
≤ xs (t, τ ) − x (t, τ ) − Dx x · s δx (t, τ ) f˜(x(τ ), y(τ ))
a
and application of Theorem E.2 shows that formula (3.43e) for Dx xs · δx is the
derivative of xs . We use this for a mean value theorem estimate14 in the first and
third15 term to arrive at
t
≤ Dx xσ · s δx (t, τ ) − Dx x · s δx (t, τ ) ζ
a
+ CY e ρY (t−τ )
Dx f˜(xσ (τ ), y(τ )) − Dx f˜(x(τ ), y(τ )) |s| δxρ eρ τ
+ Dx xσ · s δx (t, τ ) Dx f˜(xσ (τ ), y(τ )) |s| δxρ eρ τ dτ
t
≤ C eρY (t−τ ) ε(|σ | δxρ ) e(ρ+μ)τ |s| δxρ ζ
a
+ CY eρY (t−τ ) εDx f |σ | δxρ eρ τ |s| δxρ eρ τ
CY2 Cv
+ δxρ eρY (t−τ ) eρ t ζ |s| δxρ eρ τ dτ.
−ρ
14 The intermediate point σ in the mean value theorem implicitly depends on both t and τ and will
be different in each term. This does not affect the uniform estimates, so we suppress this dependence
in the notation.
15 We applied the intermediate value theorem to both factors in the third term. This is not strictly
necessary: we could also have applied it to only one of these, and apply a uniform continuity
estimate to the other term. That would still have yielded a size estimate ε(|s|) |s| = o(|s|). When
we generalize to higher derivatives, we shall make use of this fact: at least one of the factors will
be differentiable and yield a factor |s|, while the other term(s) can be estimated by a continuity
modulus ε(|s|).
134 3 Persistence of Noncompact NHIMs
were already in place, this technicality allows us to draw the conclusions of the final
points 6 and 7 in the scheme in Sect. 3.7.1.
Lemma 3.45 (The
n have true derivatives) Fix μ < 0 and let
n : X →
ρ
Bη (I ; Y ) be differentiable into B ρ+μ (I ; Y ). Recursively define
n+1 (x0 ) = T (
n
(x0 ), x0 ). Then
n+1 is again differentiable into B ρ+μ (I ; Y ).
Proof We define D
n+1 ∈ S10 using (3.46) and proceed to show that it is the
μ
derivative of
n+1 as a function D
n+1 ∈ S1 by a direct estimate
for some b(ε) that is sufficiently negative. We use the differentiability of T b,a =
TYb,a ◦ (TXa , pr1 ) on the finite interval [b, 0] that is left. We define
n+1
b,a = T
b,a ◦
ρa ◦
and estimate
n
sup
n+1 (x0 + h) −
n+1 (x0 ) − D
n+1 (x0 ) · h (t) e−(ρ+μ) t
t∈[b,0]
This holds for all a ≤ b and the first three terms can be made arbitrarily small when
a → −∞ due to Proposition 3.42 and Corollary 3.43, while the last term is o(h)
since
n+1
b,a is differentiable by the chain rule. If the estimate o(h) is independent
of a, then we can finally, for any ε > 0 and h ≤ δ sufficiently small, estimate this
by ε h.
That the term o(h) is independent of a follows from another application of the
mean value theorem:
3.7 Smoothness 135
since the continuity estimates for the formal derivatives DT directly translate into
the same estimates for the true derivative counterparts DT b,a on restricted intervals.
Thus, we can now conclude by induction, starting at
0 ≡ 0, that for each n ≥ 0
the map
n ∈ S0 is differentiable when viewed as map into B ρ+μ (I ; Y ), while the
results in Sect. 3.7.5 show that we actually have
1,α
n ∈ Cb,u X ; B ρ+μ (I ; Y ) . (3.73)
by the fiber contraction theorem, so now we apply Theorem D.2 (taking into
account Remark D.3) to conclude that D
∞ is the derivative of
∞ as a map
X → B ρ+μ (I ; Y ). It was already shown in Proposition 3.37 that D
∞ is bounded
and continuous, just as the D
n in (3.73).
Remark 3.46 (on topologies used) The convergence in (3.74) is with respect to uni-
form supremum norms as in Definition 2.9. These induce a topology that is stronger
than the weak Whitney (or compact-open) topology, cf. Sect. 1.7. The convergence
in Theorem D.2 is with respect to the weak Whitney topology, both the assumption
and result. This is sufficient, since we are primarily interested in the result that
∞ is
differentiable, not in what sense
n and its derivatives converge to
∞ . On the other
hand, we did already have convergence of D
n → D
∞ = D
∞ with respect to
these stronger uniform norms, so clearly
n →
∞ in uniform C 1 -norm as well. ♦
136 3 Persistence of Noncompact NHIMs
The evaluation map ev0 : B ρ+μ (I ; Y ) → Y is bounded linear so the graph (3.41) of
the persistent invariant manifold also satisfies
h̃ = ev0 ◦
∞ ∈ Cb,u
1,α
(X ; Y ).
The size of Dh̃ can be estimated using the fixed point equation for D
∞ . This yields
q
D
∞ ρ ≤ Dx TX ,
1−q 0
and the contraction factor q < 1 can be made arbitrarily small by choosing ζ small.
As indicated in (3.18), ζ is in turn controlled by δ, σ1 from Theorem 3.2, and ν from
Lemma 3.12, which can be chosen arbitrarily small. This completes the proof of all
statements in Theorem 3.2 for r = 1 + α with α ∈ [0, 1]. Note that this is the case
k = 1 as in Remark 3.3, 5.
We reuse the scheme already defined in Sect. 3.7.1 for the first order deriva-
tives (??). Let us walk through these items step by step and indicate the changes that
need to be made.
1. Candidate functions for the higher order derivatives can be found by formal dif-
ferentiation and application of Theorem E.2. This is a straightforward procedure,
although tedious and quite unenlightening to perform. Let us show just one exam-
ple16 :
D2y TX (y, x0 ) δy1 , δy2 (t)
0
= D
y (t, τ, x y (τ )) · D2y ṽ X (x y (τ ), y(τ )) δy1 (τ ), δy2 (τ )
t
+ Dx D y ṽ X (x y (τ ), y(τ )) δy1 (τ ), D y TX (y, x0 )δy2 (τ )
2
+ D
y (t, τ, x y (τ )) · D y TX (y, x0 )δy2 (τ )
τ
+ D
y (t, σ, x y (σ )) · D y Dx ṽ X (x y (σ ), y(σ )) · δy2 (σ ) · D
y (σ, τ, x y (τ ))dσ
t
· D y ṽ X (x y (τ ), y(τ )) · δy1 (τ ) dτ. (3.76)
2. For contractivity in the fibers we still only need to consider the map D y T as
in (3.44), since that is the principal term in (3.42). This map is contractive for any
ρ ∈ (ρY , ρ X ), hence also for ρ = k ρ + μ for any μ ≤ 0 sufficiently small, when
ρ ∈ (ρY , ρ X ) is chosen appropriately. The other terms in (3.42) are bounded maps
as well, and linear in the D j
(x0 ). It follows from Proposition C.3 that each of
these terms has weighted degree
k−1
i · pi = k − m
i=1
with respect to the D j
(x0 ), while they incur an additional exponential factor
em ρ t from taking m derivatives with respect to x0 ∈ X , due to Lemma C.1. Thus
the combined exponential growth rates sum to k ρ, and ρY < k ρ implies that the
variation of constants integrals still converge, so these terms are bounded maps
into B kρ spaces. This still holds if we add μ j ’s that satisfy the conditions set out
above.
In the notation of Appendix C we define spaces of higher order derivatives,
μ
Sk = b Lk TX ; B kρ+μ (I ; Y ) (3.77)
16 Even though it is not obvious from (3.76), this expression is in fact symmetric in δy1 , δy2 . To
verify this for the termscontaining Dx D yṽ X , one should change the order of integration of τ, σ and
expand the expression D y TX (y, x0 )δy2 (τ ) using (3.43b).
138 3 Persistence of Noncompact NHIMs
with norms
Dk
= sup Dk
(x0 )Lk (Tx
0 X ;B
kρ+μ (I ;Y ))
x0 ∈X
μ
These are again uniform fiber contractions with respect to the final factor Sk k as
fiber, for any choice of μ j ≤ 0 sufficiently small.
3. Instead of trying to construct higher order formal tangent bundles, we represent
the higher derivatives on ‘formal tensor bundles’
ρ
TBβ (I ; X )k = B ρ (I ; x ∗ (TX ))⊗k . (3.79)
ρ
x∈Bβ (I ;X )
Note that this choice of representation along base curves x matches our choice
to represent higher derivatives as in Definition C.6 when the former is evaluated
at a fixed t. The trivializations, then, are defined by tensor products of parallel
transport terms (x|t0 )⊗k , again when restricted to curves x ∈ C 1 . The resulting
holonomy terms can be estimated by either the kþpower of the single holonomy
term, or, k − 1 factors can be bounded by (γ ) ≤ 2 such that the remaining
factor fulfills the required α-Hölder estimate. Then, all details in Sect. 3.7.5 can
be repeated to obtain uniform or α-Hölder continuity of the higher derivatives
of TX , TY as maps on these formal tensor bundles. Note that we can break each
expression into parts such that only one factor is varied for the continuity estimate
and thus only once adds either α ρ or μ to the exponential growth rate. Thus, the
spectral gap condition is still satisfied.
μ μk−1
4. We apply the fiber contraction theorem to (3.78) with base S0 ×S1 1 ×· · ·×Sk−1
μk
and fiber Sk . In case of α = 0, we again seize some of the unused spectral space
for the carefully chosen μ j ’s, such that condition 3 of Theorem D.1 holds. Let
us assume by induction that F (k−1) already is a globally attractive fiber map. The
μ
conditions (3.75) imply that if we insert elements D j
∈ S j j into F (k) , then their
μ
exponents sum at most to k ρ̃, so the mapping onto the fiber Sk k is continuous
by application of Corollary B.3. For α-Hölder continuity we can simply choose
μ j = α ρ for all 1 ≤ j ≤ k. Thus, we find a globally attractive fixed point
μ μ
(
∞ , D
∞ , . . . , Dk
∞ ) ∈ S0 × S1 1 × · × Sk k with Dk
∞ ∈ Cb,u
α
.
ρ
5. We constructed a manifold structure on B ρ (J ; X ) (and a trivial one on Bη (J ; Y ) as
well) with an atlas of charts induced by normal coordinate charts of the underlying
3.7 Smoothness 139
This precisely matches the representation of the formal higher derivatives on the
tensor space B ρ (I ; x ∗ (TX ))⊗k in point 3. Higher differentiability of the restricted
maps TXa and TYb,a follows as in Sect. 3.7.7.
6. Lemma 3.45 can be generalized to prove by induction over n that higher derivatives
Dk
n exist; we define Dk
n+1 ∈ Sk0 by (3.78).
7. By induction we may assume that it was already proven that
k−1
n →
∞ ∈ Cb,u X ; B (k−1)ρ+μk−1 (I ; Y ) as n → ∞.
1 • ∞
Dk−1
∞ (x0 ) ≤ Dly Dm ∞
x0 T (
(x 0 ), x 0 ) · Pl,k−m D
(x 0 ) .
1−q
l,m≥0
l+m≤k−1
(l,m)
=(0,0),(1,0)
Each term with l ≥ 1 contains at least one factor D j
∞ (x0 ) with j < k −1; these can
be assumed to be small by induction. The one remaining term with (l, m) = (0, k −1)
can be expanded using Proposition C.3. This yields, suppressing arguments
∞ (x0 )
and x0 ,
k−1
Dx TY · P j,k−1 (D•x0 TX ).
j
x0 T =
Dk−1
j=1
Since all terms are uniformly bounded in appropriate norms, it suffices to show
j
that the Dx TY can be made small. Recall formula (3.43c) and the fiber contraction
estimate for Dx TY in Sect. 3.7.3, where we saw that Dx TY could be made small
by choosing f˜, Dx f˜ ≤ ζ small. The higher derivatives Dx TY , too, contain
j
17 It would probably be more natural to consider the higher derivatives as maps into Banach
manifolds with exponents k ρ, but these norms are equivalent anyways.
140 3 Persistence of Noncompact NHIMs
a factor Dix f˜ with 0 ≤ i ≤ j in each term, so by Proposition 3.14 these can be made
small. Hence, Dk−1
∞ and consequently h̃k−1 can be made uniformly small.
Note that h̃k cannot be made small though, see Remark 3.15.
Chapter 4
Extension of Results
In this chapter we discuss some ways to extend the main result of Theorem 3.1 to
slightly more general situations. These extensions are known from the compact and
Euclidean settings, but a bit scattered over the literature. We try to collect a number
of these results here, while extending them to our noncompact setting.
We proved the main theorem for an autonomous system and perturbation. The Perron
method admits without difficulty a time-dependent formulation; we refrained from
including this, since it would only have cluttered the already detailed proof, while
time-dependence is easily added as an afterthought, as already noted in Sect. 1.6.1.
Let us assume that M is an r -NHIM for the (time-independent) vector field v
on (Q, g) and that all assumptions of Theorem 3.1 are fulfilled. We can allow time-
dependent perturbations by the standard trick to extend the phase space of the system
by R t. Define
Q̂ = R × Q with metric ĝ = dt 2 + g. (4.1)
Then ( Q̂, ĝ) is again of bounded geometry. We trivially extend the vector field v to
v̂(t, x) = 1, v(x) ∈ T(t,x) Q̂ (4.2)
and set M̂ = R× M ⊂ Q̂. Then the flow ˆ of v̂ has the same hyperbolicity properties
as since the additional flow along t˙ = 1 is completely neutral and decoupled
from the original system. It follows that M̂ is again an r -NHIM for the dynamical
ˆ R). Note that we need a theory for noncompact NHIMs to perform
system ( Q̂, ,
this extension by the time interval R. Now we can choose a perturbed vector ṽ that
k,α
depends explicitly on time, as long as ṽ ∈ Cb,u ( Q̂) is close to v̂. This means that
the perturbation must be small in C k,α -norm (including derivatives with respect to
time), uniformly for all time. As a result we find that the perturbed manifold M̃ will
depend on time, i.e. it is not exactly of the form M̃ = R × M for some M ⊂ Q. We
do find that M̃ is uniformly close to M̂ = R × M, however, so M̃ is approximately
of this product form.
Remark 4.1 A direct application of Theorem 3.1 requires the perturbed vector field
to be C k,α with respect to time, too, since t ∈ R is added to the phase space variables.
Note that the result thus depends C k,α smoothly on time as well. A closer inspection of
k,α
the proof shows that this can in fact be replaced by the condition that v̂(t, · ) ∈ Cb,u ,
uniformly in t ∈ R, just as in Remark A.7. In that case the resulting manifold M̃
cannot be expected to be differentiable with respect to time anymore, but it still
satisfies all uniform C k,α smoothness and boundedness properties with respect to
x ∈ Q. In particular, M̃ is still uniformly close to M̂, uniformly for all t ∈ R. ♦
Instead of starting with an autonomous system v, we can also take an initial non-
autonomous system v̂ and perturb that. As long as v̂ truly describes a non-autonomous
system, that is, it is defined on a space R × Q and has component 1 along R, then
normal hyperbolicity is easily tested. The R-component of the flow is trivially neutral,
while the other Q-component must be checked in a context where, for example, also
the invariant splitting (1.9) may depend on time, but this introduces no fundamental
changes.
as a vector field
on Q̂. Note
that M̂ is an r -NHIM for v̂0 by trivial extension. One
can verify that v̂α − v̂0 r can be chosen small with α. Uniformity with respect to p
follows automatically from χ having compact support. As a result of Theorem 3.1
we conclude that there exists an α > 0 such that v̂α has a C r family of invariant
4.2 Smooth Parameter Dependence 143
manifolds
M̃ = M̃ p
, (4.4)
p
∈P
where M̃ p
is the invariant manifold corresponding to the vector field v( p, · ) with
p = χ ( p ) α p
. This parametrizes a full neighborhood B(0; α) ⊂ P.
Remark 4.5 Note that Definition 4.2 implies 4.4 when ṽ − v 1 is small enough and
ṽ ∈ Cb,u
1 .
Remark 4.6 A useful generalization of Definition 4.4 to the setting of Theorem 3.1
is less trivial. There we do not have canonical vertical fibers over ∂ M in the tubular
neighborhood, nor the associated projection of v onto TX at ∂ M. We cannot simply
take a non-vertical fiber; the Perron method adapts the curves x and y separately, so
it may happen that while x(0) ∈ ∂ M is kept fixed, y(0) is updated to a new value
such that (x(0), y(0)) lies outside of the tubular neighborhood over M̄, and control
is lost. ♦
Let us demonstrate the application of this more general definition with the fol-
lowing simple example, see also Fig. 4.1.
Example 4.7 (Persistence under a priori overflowing invariance) Let X × Y =
R × R and let the unperturbed vector field be given by
v(x, y) = − (x − 1)2 , (x 2 − 4) y .
M
X
0
−1 1 2
Remark 4.10 We can restrict to a smaller open subset U of X that contains M̄, so
k,α
we do not need vδ ∈ Cb,u to hold on all of X . If this subset U is not convex, though,
we may run into difficulties when applying the mean value theorem, see Fig. 4.2:
an intermediate point ξ ∈ M̄ on the line between x1 , x2 may be selected, so we need
to make sure that the uniform estimates still hold there. Thus the need for U ⊃ M̄
to be convex, see also the remark in [Hen81, p. 289]. ♦
Proof The proof of Theorem 3.2 requires minimal changes. Note that regardless
of the modifications and smoothing preparations performed in Sect. 3.4, the vector
field ṽ X is precisely the horizontal component of the perturbed vector field vδ . In
Sect. 3.6 where we proved existence and uniqueness of M̃, we take η small enough
4.4. This guaranteesρ that x = TX (y, x0 )
that it satisfies condition (2) of Definition
is a solution curve such that x (−∞, 0] ⊂ M̄ for any y ∈ Bη (I ; Y ) and x0 ∈ M̄.
Hence, the contraction mapping T = TY ◦(TX , pr1 ) is well-defined with intermediate
space Bβ (I ; M̄) and we find a unique Lipschitz continuous fixed point map
∞ :
ρ
M̄ → Bη (I ; Y ).
No essential changes are needed with respect to the smoothness proof in Sect. 3.7.
The formal derivatives (3.43) are well-defined along all curves x and y that are
considered, since the derivatives of ṽ X , A, f are defined on an open neighborhood
of M̄. In Sect. 3.7.7 we use the mean value theorem to prove that the restricted
maps T b,a have true derivatives. Remark 4.10 is not problematic here, since T b,a is
defined on the finite interval J = [a, 0] and thus we can restrict to arbitrarily small
open neighborhoods along the curves x, y when restricted to J . Hence we find that
∞ ∈ Cb,u k,α
on M.
We made the assumption in our main theorems that the unstable bundle E + was
absent, that is, that M was a normally attracting invariant manifold. As already noted
in Remark 3.3, 8, it should be possible to generalize this to the case of full normal
hyperbolicity where both stable and unstable normal directions are present. Let us
indicate here how this more general result can be obtained.
Assume that in Theorem 3.1 we have an invariant splitting (1.9) with both stable
and unstable bundles E ± present. The reduction principle in Sect. 2.6 leads to a
formulation of Theorem 3.2 with a trivial bundle
π : X × Y × Z → X, (4.5)
ρ
with z ∈ Bη (R≥0 ; Z ) and adapt the other maps to incorporate z as an argument. We
use Lemma 3.30 and extend all curves in X, Y, Z to the full real line. This should
yield a contraction
T = (TY , TZ ) ◦ TX , pr1 , pr2 on Bηρ (R; Y ) × Bηρ (R; Z ), (4.7)
(h̃ − , h̃ + ) : X → Y × Z , (4.8)
whose graph describes the persistent invariant manifold M̃. See e.g. [VG87] for an
application of this technique in the center manifold setting.
An alternative method to obtain M̃ is by constructing it as the intersection of
its (center-)stable and (center-)unstable manifolds. Here we make use of over- and
+
inflowing invariance. First we construct a tubular neighborhood P≤η of M along
+
(a smooth approximation of) the unstable bundle E , see Fig. 4.3. This manifold
is approximately invariant and overflowing according to Definition 4.2 if η is suffi-
ciently small since it lies approximately along the unstable direction of M. Careful
modifications to the setup of the proof in Sects. 3.4 and 3.5 will allow us recover the
local unstable manifold Wloc U of M̃ as the graph of
+
h̃ U : P≤η → ē− ,
M
η
P ≤+ η
1 Note that since t ≤ τ , we have a reverse flow + (t, τ ) for the unstable directions, which indeed
satisfies the growth estimates (1.10).
148 4 Extension of Results
+
where ē− is an extension of the bundle E − over P≤η . Vice versa we can find the
S −
local stable bundle Wloc as a graph over P≤η . Their intersection yields
k,α
M̃ = Wloc
U
∩ Wloc
S
∈ Cb,u
since the intersection is uniformly transversal. This is a standard trick which is also
applied in [Fen72; HPS77] using the graph transform.
In our main Theorem 3.1 we did not prove that the persistent manifold is again
normally hyperbolic. As stated in Remark 3.3, 9, we must recover the invariant
splitting of the perturbed manifold M̃ to be able to conclude that M̃ again satisfies
Definition 1.8 of normal hyperbolicity. We will sketch how to find the invariant fibers
of the stable manifold of M̃ in the setting of Theorem 3.2 with a trivial bundle Q =
X × Y . These fibers are easily recovered using the Perron method, and smoothness
of each single fiber essentially comes for free. We then readily obtain the vertical
part Ẽ − of the invariant splitting as the tangent planes to these fibers at M̃. Since
T M̃ is already invariant, we have
T M̃ Q = T M̃ ⊕ Ẽ − (4.9)
which is invariant under the tangent flow, cf. (1.9). Note that we should still prove
boundedness and uniform continuity of this splitting.
Each single invariant fiber can be found by application of the non-autonomous
Perron method. Let ϒ̃ : R × M̃ → M̃ denote the flow of the perturbed system,
restricted to the persistent manifold, and let γ (t) = ϒ̃ t (m) denote a solution curve
in M̃. Consider the pullback bundle
π
Em = γ ∗ (TQ) −→ R. (4.10)
Note that γ ∗ (T M̃) is a subbundle of Em in a natural way, and that it is invariant under
the tangent flow Dϒ̃ t . Next we consider γ ∗ (E − ) as a subbundle of Em : even though
E − was defined over the original manifold M, it can simply be translated along the
canonical fiber Y to be identified with a bundle over M̃ and then be pulled back
along γ . Thus we have a splitting
TM̃ Q = T M̃ (X × Y ) = T M̃ ⊕ E − .
Y S
W loc (γ (t ))
Eγ−(t )
Tγ (t ) M̃
M̃
X
S
h̃ m : Ê m− → Tm M̃,
That is, we look at the system in normal coordinates that follow the solution curve
γ . This means that the vector field ϕ ∗ ( ∂t∂ , ṽ) on Em has constant speed 1 on the base
R, i.e. the time axis, and preserves the zero section, i.e. the origin of the coordinates
at γ (t). Since D expγ (t) (0) = 1, the exponential growth conditions are preserved
under pullback. Hence, the splitting
Em = γ ∗ (T M̃) ⊕ γ ∗ ( Ê − )
2 We only can (and need to) perform the pullback in an η-sized tubular neighborhood of γ . Outside
this neighborhood we smoothly cut off the vector field to a suitable linearization. Therefore we only
recover the local stable manifold.
150 4 Extension of Results
∞
T (x, y, y0 )(t) = Dϒ̃ t−τ (γ (τ )) f X (τ, x(τ ), y(τ ))dτ, (4.12)
t
t
γ (t, 0) y0 + γ (t, τ ) f Y (τ, x(τ ), y(τ ))dτ
0
which is a contraction on pairs of curves (x, y) ∈ B ρ (R≥0 ; Em ) with the partial initial
condition y0 ∈ Em,0 = Ê m− as parameter. Here we chose ρ X < ρ < ρ X and used the
fact that f X ,Y (t, 0, 0) = 0. The functions f X ,Y are C 1 -small nonlinear terms. These
contain the linear and higher nonlinear perturbations away from the linear flows
Dϒ and γ along γ ∗ (T M̃) and γ ∗ ( Ê − ) respectively, corrected for the bounded
coordinate changes introduced by ϕ in (4.11). Thus, we find h̃ m S by evaluating the
This bundle can be endowed with a topology by parallel transport along the curves γ
as in Sect. 3.7.4. Then each derivative with respect to m adds an exponential growth
factor eρ M̃ t since the curves γ can diverge at that rate in forward time. Note that
ρ M̃ corresponds to ρ M in (1.11), that is, the upper bound on the spectrum of the
tangential flow. This leads to the spectral gap condition
from which it follows that the mapping m → h̃ m S has C k−1,α smoothness. This
b,u
implies that the invariant splitting is at least uniformly continuous and bounded. The
exponential growth rates with respect to the splitting follow from small perturbation
estimates similar to those in Sects. 3.4 and 3.5. This completes the requirements for
normal hyperbolicity of M̃ in Definition 1.8.
In this appendix, we carefully examine the implicit function theorem. We extend this
standard theorem to classes of functions with additional properties such as bound-
edness and uniform and Hölder continuity. The crucial ingredient is the explicit
formula (A.2) for the derivative of the implicit function, which allows us to transfer
regularity conditions onto the implicit function.
As an application of the implicit function theorem in Banach spaces, we will
establish existence, uniqueness and smooth dependence on parameters for the flow
of a system of ordinary differential equations. Essentially, these are standard results
from differential calculus, see e.g. Zeidler [Zei86, p. 150,165] or [Irw72, Rob68]. We
consider a general setting of ODEs in Banach spaces and show smooth dependence,
both on the initial data, as well as on the vector field itself. Moreover, our extension
of the implicit function theorem yields boundedness and uniform continuity results.
We start with some results on inversion of linear maps.
Lemma A.1 (Invertibility of linear maps) Let X be a Banach space and let A ∈
L(X ) be a continuous linear operator with continuous inverse. Let B ∈ L(X ) be
another linear operator such that B < A1−1 . Then A + B is also a continuous
linear operator with continuous inverse, given by the absolutely convergent series
−1 n n
A+B = − A−1 B A−1 = A−1 − B A−1 . (A.1)
n≥0 n≥0
Proof First of all, note that there exists an M ≥ 1 such that A, A−1 ≤ M. The
base of the geometric series can be estimated in operator norm as −A−1 B < 1, so
the series is absolutely convergent and the limit is a well-defined continuous linear
operator, whose operator norm can be estimated as
n −1
A
(A + B)−1 ≤ A−1 −A−1 B ≤ < ∞.
n≥0
1 − A−1 B
That the limit is again a well-defined linear operator follow from the fact that L(X )
is a Banach space.
Applying A + B to the left-hand side of (A.1), we see that the candidate is a right
inverse:
n n n+1
(A + B) A−1 − B A−1 = − B A−1 − − B A−1 = 1.
n≥0 n≥0 n≥0
n −1 2
A
(A + B)−1 − A−1 ≤ −A−1 B A−1 ≤ B.
n≥1
1 − A−1 B
See [Zei86, pp. 150–155] for a proof. Note that we do not need to assume that Y is
a complete space, as the contraction theorem is only applied on X . Recall that we
use notation where D denotes a total derivative, while Di with index i ∈ N denotes
a partial derivative with respect to the i-th argument.
Formula (A.2) for the derivative of the implicit function g will be crucial for the
extension of the implicit function theorem to many classes of regularity, extending
C k smoothness. We use the Lipschitz estimate for the inversion map and require that
the regularity conditions are preserved under composition, addition, multiplication
and localization of functions. By Proposition C.3, the derivatives of g are expressed
in terms of D1 f (g(y), y)−1 acting on a polynomial expression of same or lower
order derivatives of f and strictly lower order derivatives of g.
As an example, let us take Cbk,α functions. Using Lemma 1.19 and induction over
k, this function class is preserved under products. For composition, we check Hölder
continuity,
α
f (g(x2 )) − f (g(x1 )) ≤ C f C g x2 − x1 α ≤ (C f C gα ) x2 − x1 α
Thus, Hölder continuity is preserved with some new Hölder constant, while bound-
edness is trivially preserved as well. We conclude that if f ∈ Cbk,α , and (D1 f )−1 is
globally bounded, then we can read off from formula (A.2) that g ∈ Cbk,α . The same
k functions, or any other class of functions whose
results hold for the class of Cb,u
properties are preserved when inserted into (A.2). Together, interpreting α = 0 as
an empty condition, these lead to
k,α
Corollary A.4 Let in the Implicit Function Theorem A.3, f ∈ Cb,u with k ≥ 1 and
0 ≤ α ≤ 1. Assume moreover that D1 f (x, y) −1 ≤ M is bounded on U × V
k,α
for some constant M < ∞. Then g ∈ Cb,u , and the boundedness and continuity
estimates depend in an explicit way on those of f .
Remark A.5 Formula (A.2) only provides control on the derivatives of the implicit
function, but the size of g itself can be controlled by choice of the neighborhood U .
In our applications, this will match up with choosing coordinate charts around the
origin in Rn . ♦
Let us now consider an ordinary differential equation
has exactly the solution curves of (A.3) as fixed points. It also implicitly depends on
k,α
f ∈ Cb,u (R × B; B) and (t0 , x0 ) ∈ I × B. From now on we denote by Dx a partial
derivative with respect to the argument that is typically described by the variable x.
This T is a contraction for |I | = b − a small enough:
t
T (x1 ) − T (x2 ) = sup f (τ, x1 (τ )) − f (τ, x2 (τ )) dτ
t∈I t0
t
≤ sup Dx f (τ, ξ(τ ))x1 (τ ) − x2 (τ ) dτ
t∈I t0
≤ sup |t − t0 |Dx f x1 − x2
t∈I
≤ |I | Dx f x1 − x2 .
k,α
F ⊂ Cb,u (R × B; B), sup f k,α ≤ R.
f ∈F
1 Note that any actual solution x will be C 1 at least, but only x ∈ C 0 is required. This makes X a
complete space without the need to introduce norms more complicated than the supremum norm.
Appendix A: Explicit Estimates in the Implicit Function Theorem 155
Note that these are all bounded linear operators; Dt0 F(x) is because f ≤ R.
Hence, F ∈ Cb1 as a function of x, t0 , x0 , f , so by the implicit function theorem, the
solution x(t; t0 , x0 , f ) depends Cb1 on t0 , x0 , f .
k,α
Next, we establish Cb,u dependence on the initial conditions t0 , x0 and Cbk depen-
dence on f and t0 , x0 together. Uniform and Hölder dependence on f are lost because
k,α
the variations δ f ∈ Cb,u are not uniformly equicontinuous. The first derivatives can
be differentiated another k−1 times with respect to each of the variables, using similar
estimates as in (A.5). These derivatives are continuous as f is uniformly continuous
on the interval I . Uniform and Hölder continuity with respect to t0 , x0 can be read
off directly from the expressions (A.5), (A.6) or their higher order derivatives, as
k,α
f ∈ Cb,u . The implicit function theorem only gives an explicit formula (A.2) for
the derivative. Here, this translates into the fact that no boundedness follows for the
C 0 -norm of the solution curve, only for the norms on the derivatives.
We have thus shown that the conditions of Corollary A.4 of the implicit function
theorem have been satisfied, so there exists a neighborhood of (t0 , x0 , f ) in I × B ×F
such that for each (t 0 , x 0 , f ) in that neighborhood there is a unique solution to (A.3)
and the solutions x depend in a Cb,u k,α
way on t0 , x0 and Cbk on all of t 0 , x 0 , f . Note
that this result is obtained only on the interval I . We can however extend these
results to any bounded interval, by using the composition property of a flow; the
estimates may grow with interval size though. Hence, we have the following result,
see also [DK00, App. B].
Theorem A.6 (Uniform dependence on parameters of ODE solutions) Let an
k,α
ordinary differential equation (A.3) be given, where f ∈ F ⊂ Cb,u (R × B; B)
with k ≥ 1, 0 ≤ α ≤ 1, B a Banach space, and F a bounded subset. Let I ⊂ R
be a bounded interval and X = C 0 (I ; B) the Banach space of (solution) curves,
endowed with the supremum norm.
Then the flow is a Cbk mapping
: I × B × F → X : (t0 , x0 , f ) → t → x(t) .
k,α
The boundedness is understood to hold only for the derivatives. Moreover, ∈ Cb,u
holds as a mapping from I × B for fixed f ∈ F.
156 Appendix A: Explicit Estimates in the Implicit Function Theorem
Remark A.7 Differentiable dependence on time can be dropped from this theorem.
That is, let us instead assume that f (t, x) and its derivatives Dix f (t, x), i ≤ k with
respect to x are bounded continuous with respect to (t, x). Then the flow is a Cbk
mapping
: B × F → X : (x0 , f ) → t → x(t)
when I ⊂ R is a bounded interval. This result follows directly from the proof, since
we only used differentiability with respect to t for differentiable dependence of
on t.
Remark A.8 Instead of a Banach space B, we can also choose the setting of a
Riemannian manifold (M, g). Solving for the flow of a differential equation is defined
in terms of local charts, so by standard arguments the C k smoothness result extends
to this setting.
If we assume moreover in the context of Chap. 2 that (M, g) has bounded geometry
k,α
and that f ∈ Cb,u , then we can obtain stronger results close to those of Theorem A.6.
In any single normal coordinate chart the results of Theorem A.6 hold. To extend the
flow beyond one chart, we use the fact that coordinate chart transitions are uniformly
k,α
C k -bounded maps. It follows that ∈ Cb,u on any domain such that all image
curves are covered by a uniformly bounded number of charts. This includes the
domain M × I for any finite interval I ⊂ R, since f itself is assumed bounded. The
bounds and continuity moduli will depend on |I | though.
Alternatively, uniform (Hölder) continuity estimates independent of charts can be
obtained by using Proposition 2.13 to express continuity moduli in terms of parallel
transport. See Lemma C.10, which is proven via a variation of constants method. ♦
Appendix B
The Nemytskii Operator
The Nemytskii operator creates a mapping on curves from a simple function between
spaces. That is, in its simplest form, if we have a function f : Rn → Rm , then the
associated Nemytskii operator
2 For our purposes, a sufficient definition of a normed vector bundle π : X → R is that there exist
local trivializations τ : π −1 (U ) → U × F that are isometric with respect to the norms on X and
the normed linear space F. Note that we canonically have such trivializations by parallel transport,
see (3.52) and Proposition 3.34.
This leaves only the compact interval [0, T ] for which we still have to show that
f (x1 (t)) − f (x2 (t)) e−ρ2 t ≤ ε. Let us denote g : I → R : t → e−ρ2 t , then the
continuity estimate of f · g : X → Y is uniform on the compact set x1 ([0, T ]).
Hence, there exists a δ > 0 such that for all t ∈ [0, T ] and ξ2 ∈ π −1 (t) ⊂ X ,
We have that d(x1 (t), x2 (t)) ≤ e|ρ1 T | dρ1 (x1 , x2 ), so choosing δ = e−|ρ1 T | δ yields
the required estimate for dρ1 (x1 , x2 ) ≤ δ. This proves that F is continuous at x1 .
Secondly, assume (2) and let Cα be the Hölder coefficient of f . Then we can
estimate
3 We formulate all statements in this section with respect to exponentially bounded flows in the
(more natural) forward time direction. That is, we work with t ∈ R≥t0 and typical exponents ρ > 0.
In our applications in Chap. 3 we use the time-reversed statements. See also Remark 1.18.
Proof Let D denote the partial derivative with respect to the spatial variable x ∈ Rm .
We suppress the time dependence in the notation of v since we have the bound
Dn v ≤ V for all 1 ≤ n ≤ k, uniformly in space and time.
Since t,t0 is a flow, we have
d n t n
D = Dn (v ◦ t ) = Dv ◦ t · Dn t + Dl v ◦ t · Pl,n D1 t , . . . , Dn−1 t ,
dt
l=2
(C.3)
where the Pl,n are homogeneous, weighted polynomials as in Definition C.2 below.
In the first equality, the switching of partial derivatives is well-defined, because the
spatial derivative in the middle expression is well-defined and the resulting function
continuous. In the right-hand expression we have already used Proposition C.3 and
separated the homogeneous term with Dn t (when l = 1). The result is a linear
differential equation for Dn t with the inhomogeneous terms in the sum consisting
of lower order derivatives Di t , i < n, only.
For n = 1, statement (C.1) is already true by assumption and in that case we also
see that (C.3) is a homogeneous linear differential equation. Denote by x (t, t0 ) the
solution operator for this system with initial point x ∈ Rm , then
Dt,t0 (x) = x (t, t0 ) Dt0 ,t0 (x) = x (t, t0 ) · 1 = x (t, t0 ). (C.4)
This solution operator acts by left-composition on linear maps, so we read off that
x (t, t0 ) = Dt,t0 (x) and find the estimate x (t, t0 ) ≤ C1 eρ(t−t0 ) . Now we turn
to the induction step. For n > 1, we still have essentially the same solution operator
x (t, t0 ) for the homogeneous part, only now acting by composition on multilinear
maps Dn t,t0 (x) ∈ Ln (Rm ): the solution operator is not influenced by considering
multilinear maps, as Dv and act by linear composition from the left, essentially
on tangent vectors. Therefore, the same growth estimate for x (t, t0 ) still holds.
on thei ρDt , i < n and by
The inhomogeneous terms in (C.3) depend i t
i only
the induction hypothesis we can estimate D ≤ Ci e . Using variation of
t
where the homogeneous part of the solution is zero because Dn t0 ,t0 (x) = 0 for
n > 1. Given that the weighted degree of Pl,n is n, we can directly estimate
Appendix C: Exponential Growth Estimates 163
n t t n
D (x) ≤ x (t, τ ) Dl vP l,n D1 τ , . . . , Dn−1 τ dτ
t0 l=2
t
≤ C1 eρ(t−τ ) V R {Ci }i<n en ρ(τ −t0 ) dτ
t0
en ρ(t−t0 ) − eρ(t−t0 )
= C1 V R {Ci }i<n . (C.6)
(n − 1) ρ
The bound R depends on finite sums and products of finite terms, so is finite again.
When ρ > 0, the denominator is positive and the numerator can be estimated by
en ρ(t−t0 ) ; when ρ < 0, the numerator can be estimated by eρ(t−t0 ) , adding a minus
sign to both parts of the fraction. Thus, in both cases (C.1) holds. This completes the
induction step.
Before generalizing this lemma to Riemannian manifolds, we first refine some
previous notation. Instead of Rm , we more generally consider linear spaces V, W and
spaces Lk (V ; W ) of (multi)linear maps for the (higher order) derivatives of maps
f : V → W.
Definition C.2 (Homogeneous weighted polynomial) Let Pa,b (y1 , . . . , yn ) be a
polynomial in the variables y1 to yn . We call P a homogeneous weighted polynomial
of degree (a, b) if it is a homogeneous polynomial of degree a and moreover, each
p p
term y1 1 . . . yn n has weighted degree
n
i · pi = b. (C.7)
i=1
As a consequence, such a polynomial cannot have factors yn for n > b and the factor
yb can only occur as a term on itself when a = 1.
This definition can now be used to denote the higher derivatives of a composition
of two functions f, g on vector spaces.
Proposition C.3 (Higher order derivatives of compositions of functions) Let the
mapping x → f (g(x), x) be given with f : V × U → W and g : U → V two suffi-
ciently differentiable functions between vector spaces U, V, W . Then the k-th order
derivative of this mapping with respect to x is of the form
d k 1
f (g(x), x) = 2 f (g(x), x) · Pl,k−m D g(x), . . . , D
Dl1 Dm k−m
g(x) ,
dx
l,m≥0
l+m≤k
(l,m) =(0,0)
(C.8)
where Pl,k−m is a homogeneous weighted polynomial of degree (l, k−m) with l higher
order derivatives Di g(x) in each term, and weighted degree k−m: the total number
of derivatives that either produced an additional Dg(x) term or differentiated an
existing one.
164 Appendix C: Exponential Growth Estimates
Remark C.4 We will shorten the notation Pl,k D1 g(x), . . . , Dk g(x) = Pl,k D• g(x) .
d
f (g(x), x) = D1 f (g(x), x) · Dg(x) + D2 f (g(x), x),
dx
which satisfies (C.8). For the induction step we have
d k+1 d •
f (g(x), x) = 2 f (g(x), x) · Pl,k−m D g(x)
Dl1 Dm
dx dx
l,m≥0
l+m≤k
(l,m) =(0,0)
•
= Dl+1
1 D2 f (g(x), x) · D g(x) · Pl,k−m D g(x)
m 1
l,m≥0
l+m≤k
(l,m) =(0,0)
+ Dl1 D2m+1 f (g(x), x) · Pl,k−m D• g(x)
d •
+ Dl1 Dm2 f (g(x), x) · Pl,k−m D g(x)
dx
= D1 D2 f (g(x), x) · Pl+1,k+1−m D• g(x)
l+1 m
l,m≥0
l+m≤k
(l,m) =(0,0)
+ Dl1 D2m+1 f (g(x), x) · Pl,k+m D• g(x)
•
+ Dl1 Dm2 f (g(x), x) · Pl,k+1−m D g(x)
= D1 D2 f (g(x), x) · Pl,k+1−m D• g(x) .
l m
l,m≥0
l+m≤k+1
(l,m) =(0,0)
D1 f (g(x), x) · Dk g(x).
This can easily be seen by direct calculation or induction. Finally, when the compo-
sition mapping is of the form x → f (g(x)), then we only have terms with m = 0
and all polynomials in (C.8) have weighted degree k in that case.
The next step is to generalize Lemma C.1 to a Riemannian manifold (M, g). Here
we first need to define what we mean by higher derivatives of the flow. The tangent
flow Dt is well-defined as a mapping on TM, but higher derivatives live on higher
order tangent bundles Tk M. These abstract bundles make doing explicit estimates
as in the proof of Lemma C.1 difficult. Instead, we reuse the idea of Definition 2.9
and introduce a different representation of higher derivatives in terms of normal
coordinate charts.
Definition C.6 (Higher derivative on Riemannian manifolds) Let M, N be
Riemannian manifolds and f : M → N a smooth map. With the notation f x =
exp−1
f (x) ◦ f ◦ expx of f represented in normal coordinate charts, we define for k ≥ 1
and x ∈ M the higher order derivative
Dk f (x) = Dk f x (0) = Dk exp−1
f (x) ◦ f ◦ expx (0) (C.9)
that is, these operator norms as defined via normal coordinate charts are truly norms
and satisfy the usual product rules for compositions of (multi)linear maps.
The operator norms are induced by the norms on the tangent spaces of TM, which
in turn are induced by the metric. These norms depend smoothly on the base point,
so they glue together to a smooth function · : TM → R≥0 that we will call a
‘bundle norm’ on the tangent bundle4 or sometimes refer to as just a norm on TM.
Higher derivatives can be viewed as partial sections of the vector bundle
and the disjoint union of all these bundles. The bundle norms on TM and TN together
naturally induce bundle operator norms on these. From here on, we set M = N and
assume that f = t is a flow.
To finally generalize Lemma C.1 to Riemannian manifolds, there is still one issue
to tackle. When taking the time-derivative as in (C.3), the target base point t (x)
changes. This suggests that a covariant derivative is required. The Ll,k (TM; TM) are
smooth manifolds in a natural way, however, so both the tangent vector dtd Dk t (x)
and the differential of · are well defined in
this interpretation
and independent of
a connection, and certainly their product dtd Dk t (x) is. The tangent exponential
maps D exp at x and t (x) together induce a local coordinate chart on Ll,k (TM; TM)
in a neighborhood of Ll,k (Tx M; Tt (x) M). We will use these local coordinates for
explicit calculations.
The dependence on the base point of the norms and normal coordinate charts
in (C.9) introduces additional terms when formulating equations (C.3) and (C.5) on
a Riemannian manifold. Under the assumption that (M, g) is of bounded geometry,
however, all these additional terms will be globally bounded. Hence, these will only
contribute to the overall constants Cn in Lemma C.1, but not influence the basic
result.
Lemma C.8 (Exponential growth estimates on a Riemannian manifold) Let
t,t0 ∈ C k≥1 be the flow of a time-dependent vector field v on a Riemannian manifold
4 Note that this is stronger than a Finsler manifold as the Finsler structure F : TM → R≥0 is allowed
to be asymmetric, that is, on each tangent space, F need only scale linearly for positive scalars. I
did not investigate whether it is possible to generalize this theory to Finsler manifolds.
Appendix C: Exponential Growth Estimates 167
(M, g) of (k+3)-bounded geometry. Let v(t, · ) ∈ Xkb (M) with all derivatives jointly
continuous in (t, x) ∈ R × M and uniformly
bounded by V < ∞ with respect to
Definition C.6. Suppose that Dt,t0 (x) ≤ C1 eρ(t−t0 ) for all x ∈ M, t ≥ t0 and
fixed C1 > 0, ρ = 0. Then for each n, 1 ≤ n ≤ k there exists a bound Cn > 0 such
that
n t,t Cn en ρ(t−t0 ) if ρ > 0,
∀ x ∈ M, t ≥ t0 : D (x) ≤
0 (C.12)
Cn eρ(t−t0 ) if ρ < 0.
Proof The proof is basically the same as the proof of Lemma C.1, with additional
technicalities due to M being a manifold. We will focus on these.
Equation (C.3) can be formulated in terms of the tangent normal coordinate chart
D exp−1 ∼
y : TB(y; δ) ⊂ TM → T(T y M) = (T y M)
2
with y =
t (x) fixed. Note that we are finally interested in the growth behavior
of t → D t (x); this is defined in a coordinate-free way, so it is not influenced
n
d
D exp−1
y ◦D (x)
n t
dt
d −1
= D exp−1
y ◦D expt (x) ◦ ◦ expx (0)
n t
dt
d n
•
= D exp−1
y ◦ Dl exp−1
t (x) ◦ exp y (0) · Pl,n D exp−1
y ◦ ◦ expx (0) .
t
dt
l=1
This splits the dependence on t in the target base point t (x) from that in the
derivatives Dn t itself. Note that the sum must be interpreted as a sum of terms in
the single fiber Ln (Tx M; T y M) over the base point (x, y). By using the coordinate
map D exp−1 y , we transferred the problem to fixed linear spaces, which allows us to
make sense of the differentiation with respect to t. In other words, D exp−1 y induces
locally trivializing coordinates for Ln (Tx M; TM) in a neighborhood of y with x
fixed. As D exp−1y is linear on the fibers, we can distribute it over the sum to further
obtain
d
n
−1
•
= D exp−1
y ◦Dl
exp (x)
t ◦ exp y (0) · Pl,n D exp −1
y ◦ t
◦ exp x (0)
dt
l=1
d
n
−1
−1
−1 d
= D exp y ◦D expt (x) ◦ exp y (0) · D exp y ◦ ◦ expx (0) +
n t
... .
dt dt
l=2
(C.13)
168 Appendix C: Exponential Growth Estimates
In the last line, the homogeneous part is separated from the non-homogeneous terms
as in (C.3).
Working out the details of the homogeneous part, we obtain5
d
−1
D exp−1 −1
y ◦D expt (x) ◦ exp y (0) · D exp y ◦ ◦ expx (0)
n t
dt
d
−1
n t
= D exp−1
y ◦D expt (x) ◦ exp y (0) · D (x)
dt
nd
+ D D exp−1y ·D exp−1
y ◦ ◦ expx (0)
t
dt
d
n t
= D exp y ◦ D exp−1
−1
(x)
t ◦ exp y (0) · D (x)
dt
n
+ D D exp−1y · Dl D exp−1 • t
y ◦v ◦ exp y (0) · Pl,n (D (x)).
l=1
Note that again all terms l ≥ 2 in the sum are inhomogeneous terms that we will
add to those already present in (C.13). The homogeneous term is some linear vector
field acting (from the left) on Dn t (x) and it is precisely the vector field generating
Dt (x), which is the original case n = 1. Hence, we can again define the operator
xt,t0 as post-composition with Dt (x) and write the flow of Dn t (x) using a vari-
ation of constants integral with all the non-homogeneous terms. These terms again
contain only lower order derivative flows Dl t (x), l < n.
We can now take the operator norm of this expression. In principle we should be
careful that this bundle norm depends on the changing target point t (x). The normal
coordinates were chosen around y = t (x), however, and in these coordinates the
derivative of the metric at the origin (corresponding to y) is zero, hence the norm
has zero derivative. We can thus simply apply the operator norm to the variation of
constants integral and obtain estimates as in (C.6). The additional factors introduced
by differentiation of normal coordinate transition maps are bounded by Lemma 2.6
under the assumption that (M, g) is of (k+3)-bounded geometry. The inhomogeneous
terms still contain at least one factor Dl t (x), so the result in case ρ < 0 holds as
well.
These exponential growth results can be extended further to uniform and Hölder
continuity in the highest derivatives. The Hölder continuity then is with respect to
the growth rate (k + α)ρ, where k is the order of the derivative and 0 < α ≤ 1 the
Hölder constant. Thus, α-Hölder continuity can be viewed as a fractional derivative;
Lipschitz continuity (when α = 1) can indeed be viewed as almost differentiability to
one higher order. The case α = 0 we shall identify with uniform continuity. Here we
−1
5 The time derivative of D exp−1
y ◦ D expt (x) ◦ exp y (0) actually turns out to be zero in local
coordinates. This follows from an analysis of the exponential map as the time-one geodesic flow
in normal coordinates around y. This result is not relevant for us, so we leave out this tedious
calculation.
Appendix C: Exponential Growth Estimates 169
If instead v(t, · ) ∈ Xkb,u (M), i.e. the special case α = 0, then for each μ > 0 there
exists a continuity modulus εk,μ such that
∀ t ≥ t0 : Dk t,t0 (x2 ) − Dk t,t0 (x1 ) ≤ εk,μ (d(x1 , x2 )) e(k ρ+μ)(t−t0 ) , (C.15)
that is, x → t → Dk t,t0 (x) is uniformly continuous in x, in · k ρ+μ -norm.
Remark C.11 We restricted this lemma to the case ρ > 0 only. A result similar to
that in C.8 for ρ < 0 could be obtained for completeness sake, but it clutters the
already detailed proof, while we do not need the result. ♦
6 Note that the higher derivatives Dk t (x) of a flow are actually interpreted as elements of a
bundle of type (C.10). These bundles are still naturally induced by the tangent bundles of underlying
manifolds, so all bounded geometry techniques, such as uniformity of normal coordinate charts,
unique local trivializations by parallel transport, are induced on these bundles as well.
170 Appendix C: Exponential Growth Estimates
Proof The idea of the proof is essentially the same as that of Lemma C.8. The addi-
tional difficulty is that (Hölder) continuity requires finite, non-differential estimates
when comparing any two flows starting from different initial points x1 , x2 ∈ M.
Let d(x1 , x2 ) < δ M where δ M is M-small as in Definition 2.8. We drop t0 from
the notation and define ξi (t) = t (xi ), i = 1, 2 as the solution curves with xi as
initial conditions. We want to study the growth behavior of
Note that this difference is defined with respect to coordinate charts at source and
target that contain x1 , x2 and ξ1 (t), ξ2 (t), respectively, but not in general.
We denote by γt the unique shortest geodesic that connects ξ1 (t) to ξ2 (t) when
d(ξ1 (t), ξ2 (t)) < δ M . Next, we set
ϒ t = Dk t (x2 ) · (γ0 )⊗k − (γt ) · Dk t (x1 ) ∈ Lk Tx1 M; Tξ2 (t) M (C.17)
to be the difference of the respective k-th order derivative flows, parallel transported
to matching spaces at their source and target. It is easily verified that ϒ t satisfies
initial conditions ϒ t0 ,t0 = 0 for any k ≥ 1.
Due to Proposition 2.13, the formulation in (C.17) with parallel transport to mea-
sure variation of the flows isequivalent
to measuring (C.16) in normal coordinate
charts. Hence, if we study ϒ t in charts, we may drop7 the parallel transport
terms at the cost of an (unimportant) global factor in the estimates. We assume
that d(ξ1 (t), ξ2 (t)) < δ M and study ϒ t in a normal coordinate chart covering both
points. Taking the difference of (C.3) with x1 , x2 inserted, we see that ϒ t satisfies
the differential equation
d t
ϒ = Dv ◦ t (x2 ) · ϒ t + Dv ◦ t (x2 ) − Dv ◦ t (x1 ) · Dk t (x1 )
dt
k
+ Dl v ◦ t (x2 ) · Pl,k D• t (x2 ) − (x2 x1 ).
l=2
This equation provides a variation of constants integral for ϒ t based on the flow
x2 (t, t0 ):
7 We could include the parallel transport terms, repeat similar arguments as in the proof of
Lemma C.8 and express everything in (induced) normal coordinate charts, but this would clut-
ter the proof here even more. These terms would all be bounded and Lipschitz continuous by
bounded geometry, hence not essentially alter the result.
Appendix C: Exponential Growth Estimates 171
t
ϒt = x2 (t, τ ) · Dv ◦ τ (x2 ) − Dv ◦ τ (x1 ) · Dk τ (x1 )
t0
k
τ
• τ
+ x2 (t, τ ) · D v ◦ (x2 ) · Pl,k D (x2 ) − (x2 x1 ) dτ.
l
l=2
(C.18)
We proceed by induction over k. For k = 1 we only have the first term of the
integrand. Using that Dv is uniformly α-Hölder, we have
t t
ϒ ≤ x (t, τ ) Dv ◦ τ (x2 ) − Dv ◦ τ (x1 ) Dτ (x1 ) dτ
2
t0
t α
≤ C1 eρ(t−τ ) Dvα τ (x2 ) − τ (x1 ) C1 eρ(τ −t0 ) dτ
t0
t α
≤ C12 Dvα eρ(t−τ ) C1 eρ(τ −t0 ) x2 − x1 dτ
t0
eα ρ(t−t0 )
≤ C12+α Dvα eρ(t−τ ) x2 − x1 α .
αρ
Next, in the induction step for k > 1, we get the additional terms from (C.18) in
the integrand. These are (up to constants) a product of the flow , Dl v ◦ τ (x) and
Di (x)’s with weighted degree k. The terms Dl v ◦ τ (x) are uniformly α-Hölder
continuous in x analogous to the case k = 1 above. Each of the Di (x)’s satisfies the
Hölder estimate of this lemma by the induction hypothesis and the growth estimates
of Lemma C.1. Hence, for each term in the integrand, we obtain Hölder continuity
with respect to x with growth behavior at most eρ(t−τ ) e(k+α)ρ(τ −t0 ) . Integration then
yields the stated result.
Finally, the uniformly continuous case is an extension along the same lines as
Corollary B.3. The map x → Dl v ◦ τ (x) is uniformly continuous when measured
in · μ -norm and by induction the alternative result (C.15) follows.
Finally, we extend the Hölder continuous growth estimates to a parameter depen-
dent version. This is formulated to exactly fit the context of derivatives of TX with
ρ
respect to y ∈ Bη (I ; Y ), such as in (3.43b). Note that Remark C.9 applies again.
Corollary C.12 (Exponential growth with Hölder continuity and a parameter)
Assume the setting of Lemma C.10. Let the vector field v furthermore depend on a
α for all 0 ≤ l ≤ k, uniformly in
third variable y ∈ Y such that Dlx v(t, x, · ) ∈ Cb,u
t, x and that all original bounds are uniform in y as well. Let η ∈ B ρ (R; Y ) denote
a curve in Y and t,t t,t0
η the flow of v(t, · , η(t)). Assume that η → t → η (x) is
0
In case of uniform continuity (i.e. α = 0), then for each μ > 0 there exists a continuity
modulus εk,μ,Y such that
0 (x) − Dk t,t0 (x) ≤ ε (k ρ+μ)(t−t0 )
∀ t ≥ t0 , x ∈ X : Dk t,t
η2 η1 k,μ,Y (d(η1 , η2 )) e .
(C.20)
In both cases we interpret the continuity moduli as globally defined using Remark 2.12.
Proof The proof closely follows that of Lemma C.10; let us indicate the differences.
We define the variation
ϒt = η2 (t, τ ) · Dv τ, τη2 (x), η2 (τ ) − Dv τ, τη1 (x), η1 (τ ) · Dk τη2 (x)
t0
k
+ l
Dv τ, τη2 (x), η2 (τ ) · Pl,k D• τη2 (x) − (2 1) dτ. (C.22)
l=2
As in Lemma C10, the factors η2 (t, τ ) and Dl τη2 (x) satisfy appropriate exponential
growth conditions. By induction over l < k the maps η → Dl tη (x) are α-Hölder
continuous, while all Dl v are uniformly α-Hölder in x, y, and η → Dl tη (x) is
uniformly Lipschitz by assumption, so η → Dl v t, tη (x), η(t) is also α-Hölder
when measured in · α ρ -norm (or in · μ -norm in case of uniform continuity, see
Appendix B).
In each term of the integrand, we can estimate the variation with respect to η as a
sum of the variations with respect to each factor (a product rule). The factor that is
being varied adds eα ρ(τ −t0 ) (or eμ(τ −t0 ) in case α = 0) to the overall growth estimate.
The proof is completed by inserting all these estimates into (C.22) and again using
the fact that we have a finite number of globally bounded terms.
Appendix D
The Fiber Contraction Theorem
In this appendix, we give a proof of the fiber contraction theorem. This result is
originally due to Hirsch and Pugh [HP70]; the proof presented here is taken from
Vanderbauwhede [Van89, p. 105]. The fiber contraction theorem is a convenient
general tool to obtain convergence of functions in C k -norm when a direct contraction
in C k -norm is not available. Instead, one inductively constructs contractions for the k-
thderivative with all lower order derivatives assumed fixed. If this contraction depends
continuously on the lower order derivatives, then the fiber contraction theorem can
be applied to conclude that the sequence of the function together with its derivatives
converges to a fixed point. With the additional theorem on the differentiability of
limit functions, it can then be concluded that the sequence converges in C k -norm.
Theorem D.1 (Fiber contraction theorem) Let X be a topological space, (Y, d)
a complete metric space and let F : X × Y → X × Y be a fiber mapping, that is,
F(x, y) = F1 (x), F2 (x, y) , with the following properties:
1. F1 has a unique, globally attracting fixed point x ∈ X , that is,
ᾱ
δ = .
1−q
Let ε > 0 be given and choose k large enough that ᾱk < 21 (1−q)ε. As limn→∞ δn,k =
δk we see that there exists some N such that
2 ᾱk
∀n ≥ N: δn,k < 2δk = < ε.
1−q
f (x + h) − f (x) − g(x) · h
lim = 0.
h→0 h
In this appendix we collect two results on variation of nonlinear flows. The first is
a generalization of Lagrange’s variation of constants formula and the second is an
application of it to calculate the derivative of a flow with respect to parameters. Both
results are formulated for fully nonlinear flows.
The classical variation of constants integral due to Lagrange is well known.
Although Lagrange applied this method to the nonlinear problem of orbital mechan-
ics, a less known result of Alekseev [Ale61] (see also [LL69, p. 78]) generalizes the
variation of constants integral to the full nonlinear case.
Theorem E.1 (Nonlinear variation of constants) Let X be a smooth manifold and
let t,t0 (x) be the flow generated by the time-dependent vector field v(t, x), locally
Lipschitz in x. Let r (t, x) be an arbitrary (not necessarily small) perturbation, locally
Lipschitz in x as well. Then rt,t0 (x) is the flow generated by v + r if and only if it
satisfies the nonlinear variation of constants formula
t
rt,t0 (x) = t,t0 (x) + D(t, τ, rτ,t0 (x)) r (τ, rτ,t0 (x)) dτ. (E.1)
t0
Proof Using uniqueness of solutions, it is sufficient to show that (E.1) satisfies the
differential equation and initial conditions rt,t (x) = x. The latter follows automat-
ically from t,t (x) = x. For the first part, we differentiate
d
t,τ ∂ t,τ d τ,t0
◦ rτ,t0 (x) = (y) τ,t0 + Dt,τ (rτ,t0 (x)) · (x)
dτ ∂τ y=r (x) dτ r
= −Dt,τ (rτ,t0 (x)) · v(τ, rτ,t0 (x))
+ Dt,τ (rτ,t0 (x)) · (v + r )(τ, rτ,t0 (x))
= Dt,τ (rτ,t0 (x)) · r (τ, rτ,t0 (x)).
Notice that (E.1) looks ill-defined on a manifold, but should be read as integration
from the point x along the vector field defined by the integrand, which is indeed, for
each τ ∈ [t0 , t], exactly defined to be the tangent vector to the curve τ → t,τ ◦
rτ,t0 (x), making the equation self-consistent. If (X, g) is a Riemannian manifold,
then this formula yields the distance estimate
t
d rt,t0 (x), t,t0 (x) ≤ D(t, τ, τ,t0 (x)) r (τ, τ,t0 (x)) dτ. (E.2)
r r
t0
defined by a vector field vs (t, x) that also depends on time and an external parameter
s ∈ R. Let (s, x) → vs (t, x) ∈ Cb1 with derivative jointly continuous in (s, t, x).
Then the derivative of the flow with respect to s is given by
t
d t,t0 d
(x0 ) = s (x(τ ))
Dt,τ vs (τ, x(τ )) dτ, (E.3)
ds s t0 ds
See [DK00, Thm. B.3] for a proof of the formula for differentiation of a flow with
respect to a parameter. This is a slightly modified case where the vector field is time-
dependent. Theorem A.6 and Remark A.7 show that the result can be generalized to
the non-autonomous case and differentiable time-dependence of v is not required.
Appendix F
Riemannian Geometry
In this appendix we recall standard facts from Riemannian geometry and establish
some notational conventions. This appendix is targeted at the reader who has basic
knowledge of Riemannian manifolds, but wants to have a quick refresh. For more
detailed expositions see for example [GHL04, Jos08], or [Lan95] for a more abstract
presentation in the context of Banach manifolds. We shall not try to be exhaustive or
as general as possible in this overview.
A Riemannian manifold (M, g) is a pair of a smooth (or at least C 1 , respectively
2
C for defining curvature) manifold together with a metric g: a family of positive-
definite bilinear forms gx on each tangent space Tx M. The metric is a generalization
of the Euclidean inner product on Rn and gx depends in a smooth way on the point
x ∈ M in the manifold. The metric can be used to measure angles and lengths of
tangent vectors, so we can define the length of a piecewise C 1 curve γ : [a, b] → M
as b
l(γ ) = gγ (t) (γ (t), γ (t)) dt.
a
on M, where the infimum is taken over all piecewise C 1 curves γ connecting the
points x and y. This turns M into a metric space.
Simple examples of Riemannian manifolds are Rn with the standard Euclidean
inner product and the sphere S n−1 ⊂ Rn with the induced metric on its tangent
bundle. Due to the Nash embedding theorem, any C k≥3 Riemannian manifold can
actually be realized as a submanifold of Rn equipped with the induced metric.
Each Riemannian manifold (M, g) has an associated linear connection, or, covari-
ant derivative ∇ on the tangent bundle TM. This so-called Levi-Civita connection is
uniquely defined by the requirements that it is torsion-free and compatible with the
metric, i.e.
for all smooth vector fields X, Y, Z on M. The connection is given in local coordinates
x i by the Christoffel symbols ijk ,
∇∂ j ∂k = ijk ∂i ,
where we used the Einstein summation convention for the repeated index i. The
connection can be extended to the tensor bundle of M so that it satisfies the Leibniz
rule.
A connection, more generally on a vector bundle π : E → M, can also be viewed
as a choice of a horizontal subbundle in TE. There is a naturally defined vertical
subbundle Vert(E) ⊂ TE where Vert(E)ξ = Tξ E x for ξ ∈ E x = π −1 (x). A
horizontal bundle Hor(E) is any subbundle complementary to the vertical bundle,
so
TE = Hor(E) ⊕ Vert(E).
This definition of a connection is related to the definition via the covariant derivative.
The horizontal bundle precisely corresponds to the tangent plane to a section s of E
that is flat at a given point x ∈ M:
Hor(E)s(x) = Im Ds(x) ⇐⇒ (∇• s)(x) = 0.
The Levi-Civita connection induces two important concepts: the geodesic flow
and parallel transport. Intuitively, the geodesic flow says how to follow a straight line
from an initial point along a given direction, while parallel transport defines how to
keep a tangent vector fixed while carrying it along a path.8 Both maps are defined
in local coordinates as solutions of (subtly different) differential equations involving
the Christoffel symbols.
The geodesic flow ϒ t is a flow on the tangent bundle TM and defined in local
coordinates x i by
ẋ i = vi ,
(F.2)
v̇i = − ijk (x) v j vk .
Here, the vi denote the induced additional coordinates on the tangent bundle. This
geodesic flow need not be complete, that is, defined for all times. However, by the
Hopf–Rinow theorem, the geodesic flow is complete if and only if M is complete
as a metric space with respect to (F.1). In the following we shall assume that M is
complete to simplify the exposition.
If we restrict the geodesic flow map to the tangent space T p M at a fixed point p ∈
M and to time t = 1, and finally project onto M, then we obtain the exponential map
8 If the path is a geodesic, then parallel transport carries the initial velocity vector to the velocity
vector along the entire path.
Appendix F: Riemannian Geometry 181
exp p = π ◦ ϒ 1 |T p M : T p M → M.
If M is noncompact then this global injectivity radius need not be positive. The
shortest path from p ∈ M to any point x within distance rinj ( p) is uniquely realized
by one geodesic curve. In normal coordinates these curves are rays emanating from
the origin. That is, let v = exp−1
p (x) and γ (t) = exp p (t v) with t ∈ [0, 1], then
d( p, x) = l(γ ) = v.
Let γ : [a, b] → M be a C 1 curve, then parallel transport is a linear isometry (i.e.
it preserves the metric g)
between the tangent spaces at the endpoints. We use the notation (γ |at ) for parallel
transport along a part of the curve. Parallel transport is defined in local coordinates
x i by the differential equation
d
(γ |at )i = − ijk (γ (t)) γ (t) j (γ |at )k with (γ |aa ) = 1. (F.4)
dt
In (F.4) the x i are local coordinates around the point γ (t) with additional induced
coordinates ∂i on the tangent bundle. The representation (γ |at )i is defined by
(γ |at ) = (γ |at )i ∂i . Put more abstractly, parallel transport defines a horizontal
extension of a vector v ∈ Tγ (a) M to a section of the pullback bundle γ ∗ (TM), that
is, a vector field v(t) defined along γ (t), which has covariant derivative zero.
On a Riemannian manifold there is the concept of curvature. A manifold is flat,
i.e. it has zero curvature, if it is (locally) isometric to Rn . The Riemann curvature R
measures non-flatness on an infinitesimal level. It is given by
R(X, Y ) Z = ∇ X ∇Y Z − ∇Y ∇ X Z − ∇[X,Y ] Z ,
182 Appendix F: Riemannian Geometry
which measures how much the direction of a vector Z changes when parallel trans-
porting it around an infinitesimal loop spanned by the directions X, Y . There is a
relation between the curvature and parallel transport that is important to us. If we
consider holonomy, that is, parallel transport along a closed loop γ , then the deficit
(γ ) − 1 is (heuristically put) equal to the curvature form R integrated over any
surface enclosed by γ . This relation can be seen as an application of Stokes’ the-
orem and the differential statement is that the curvature R is the generator of the
infinitesimal holonomy group [AS53, RW06].
References
[Ale61] V.M. Alekseev, An estimate for the perturbations of the solutions of ordinary differential
equations. Vestnik Moskov. Univ. Ser. I Mat. Mech. (2), 28–36 (1961).
[AMR88] R. Abraham, J.E. Marsden, T. Ratiu, Manifolds, tensor analysis, and applications, 2nd
edn, in Applied Mathematical Sciences, vol. 75 (Springer, New York, 1988)
[Ano69] D.V. Anosov, Geodesic flows on closed Riemann manifolds with negative curvature,
in Proceedings of the Steklov Institute of Mathematics, No. 90, (Translated from the
Russian by S. Feder, American Mathematical Society, Providence, R.I., 1967), 1969.
[APS02] B. Aulbach, C. Pötzsche, S. Siegmund, A smoothness theorem for invariant fiber bundles.
J. Dyn. Differ. Equ. 14(3), 519–547 (2002)
[AS53] W. Ambrose, I.M. Singer, A theorem on holonomy. Trans. Am. Math. Soc. 75, 428–443
(1953)
[Att94] O. Attie, Quasi-isometry classification of some manifolds of bounded geometry. Math.
Z. 216(4), 501–527 (1994)
[Bro+09] H.W. Broer, M.C. Ciocci, H. Hanßman, A. Vanderbauwhede, Quasi-periodic stability
of normally resonant tori. Physica D 238, 309–318 (2009)
[BLZ98] P.W. Bates, K. Lu, C. Zeng, Existence and persistence of invariant manifolds for semi-
flows in Banach space. Mem. Am. Math. Soc. 135(645), viii+129 (1998).
[BLZ99] P.W. Bates, K. Lu, C. Zeng, Persistence of overflowing manifolds for semiflow. Comm.
Pure Appl. Math. 52(8), 983–1046 (1999)
[BLZ08] P.W. Bates, K. Lu, C. Zeng, Approximately invariant manifolds and global dynamics of
spike states. Invent. Math. 174(2), 355–433 (2008)
[Bre81] V.N. Brendelev, On the realization of constraints in nonholonomic mechanics. J. Appl.
Math. Mech. 45(3), 481–487 (1981)
[Car26] É. Cartan, Les groupes d’holonomie des espaces généralisés. Acta Math. 48, 1–42 (1926)
[Cot11] É. Cotton, Sur les solutions asymptotiques des équations différentielles. Ann. Sci. École
Norm. Sup. 28(3), 473–521 (1911)
[DLS06] A. Delshams, R. de la Llave, T.M. Seara, Orbits of unbounded energy in quasi-periodic
perturbations of geodesic flows. Adv. Math. 202(1), 64–188 (2006)
[DK00] J.J. Duistermaat, J.A.C. Kolk, Lie groups. Universitext (Springer, Berlin, 2000)
[Dui76] J.J. Duistermaat, in “Stable Manifolds” Unpublished preprint, Utrecht University,
Department of Mathematics (1976)
[Eic91] J. Eichhorn, The Banach manifold structure of the space of metrics on noncompact
manifolds. Differ. Geom. Appl. 1(2), 89–108 (1991)
[Fen72] N. Fenichel, Persistence and smoothness of invariant manifolds for flows. Indiana Univ.
Math. J. 21, 193–226 (1971/1972).
[Fen74] N. Fenichel, Asymptotic stability with rate conditions. Indiana Univ. Math. J. 23, 1109–
1137 (1973/74).
[Fen79] N. Fenichel, Geometric singular perturbation theory for ordinary differential equations.
J. Differ. Equ. 31(1), 53–98 (1979)
[GG73] M. Golubitsky, V. Guillemin, Stable mappings and their singularities, in Graduate Texts
in Mathematics, vol. 14 (Springer, New York, 1973)
[GHL04] S. Gallot, D. Hulin, J. Lafontaine, Riemannian Geometry. Universitext, 3rd edn.
(Springer, Berlin, 2004).
[GK02] S. Gudmundsson, E. Kappos, On the geometry of tangent bundles. Expo. Math. 20(1),
1–41 (2002)
[Guc75] J. Guckenheimer, Isochrons and phaseless sets. J. Math. Biol. 1(3), 259–273 (1974/75).
[Had01] J. Hadamard, Sur l’itération et les solutions asymptotiques des equations différentielles.
Bull. Soc. Math. Fr. 29, 224–228 (1901)
[Hal61] J.K. Hale, Integral manifolds of perturbed differential systems. Ann. Math. 73(2), 496–
531 (1961)
[Hal69] J.K. Hale, Ordinary differential equations, in Pure and Applied Mathematics, vol. XXI
(Wiley-Interscience, New York, 1969)
[Has94] B. Hasselblatt, Regularity of the Anosov splitting and of horospheric foliations. Ergodic
Theory Dyn. Syst. 14(4), 645–666 (1994)
[HL06] À. Haro, R. de la Llave, Manifolds on the verge of a hyperbolicity breakdown. Chaos
16(1), 013120–013128 (2006)
[Hen81] D. Henry, in Geometric Theory of Semilinear Parabolic Equations, Lecture Notes in
Mathematics, vol. 840 (Springer, Berlin, 1981)
[Hil01] D. Hilbert, Ueber Flächen von constanter Gaussscher Krümmung. Trans. Amer. Math.
Soc. 2(1), 87–99 (1901)
[Hir76] M.W. Hirsch, Differential topology, in Graduate Texts in Mathematics, No. 33 (Springer,
New York, 1976).
[Hop66] F.C. Hoppensteadt, Singular perturbations on the infinite interval. Trans. Am. Math.
Soc. 123, 521–535 (1966)
[Hör03] L. Hörmander, The analysis of linear partial differential operators, in I, Classics in
Mathematics, Distribution theory and Fourier analysis. Reprint of the second (1990)
edition (Springer, Berlin, 2003).
[HP70] M.W. Hirsch, C.C. Pugh, Stable Manifolds and Hyperbolic Sets, Global analysis (Proc.
Sympos. Pure Math., Vol. XIV, Berkeley, California, 1968). Amer. Math. Soc., Provi-
dence, R.I., 133–163 (1970).
[HPS77] M.W. Hirsch, C.C. Pugh, M. Shub, Invariant manifolds, Lecture Notes in Mathematics,
vol. 583 (Springer, Berlin, 1977)
[HW99] B. Hasselblatt, A. Wilkinson, Prevalence of non-Lipschitz Anosov foliations. Ergodic
Theory Dyn. Syst. 19(3), 643–656 (1999)
[Irw70] M.C. Irwin, On the stable manifold theorem. Bull. London Math. Soc. 2, 196–198 (1970)
[Irw72] M.C. Irwin, On the smoothness of the composition map. Quart. J. Math. Oxford Ser.
23(2), 113–133 (1972)
[Jon95] C.K.R.T. Jones, Geometric singular perturbation theory, in Dynamical Systems
(Springer, Berlin, 1995), pp. 44–118 (Montecatini Terme, Lecture Notes in Math., vol.
1609, 1994).
[Jos08] J. Jost, Riemannian geometry and geometric analysis. Universitext, 5th edn. (Springer,
Berlin, 2008).
[JS99] A.D. Jones, S. Shkoller, Persistence of invariant manifolds for nonlinear PDEs. Stud.
Appl. Math. 102(1), 27–67 (1999)
[Kap99] T.J. Kaper, An introduction to geometric methods and dynamical systems theory for
singular perturbation problems, in Proceedings of Symposia in Applied Mathematics,
Analyzing Multiscale Phenomena using Singular Perturbation Methods, ed. by M.D.
Baltimore (American Mathematical Society, Providence, 1999), pp. 85–131
References 185
[Kar81] A.V. Karapetian, On realizing nonholonomic constraints by viscous friction forces and
Celtic stones stability. J. Appl. Math. Mech. 45(1), 42–51 (1981)
[Kli95] W.P.A. Klingenberg, Riemannian geometry, 2nd ed., in de Gruyter Studies in Mathe-
matics, vol. 1 (Walter de Gruyter & Co., Berlin, 1995).
[KN90] V.V. Kozlov, A.I. Neı̆shtadt, Realization of holonomic constraints. Prikl. Mat. Mekh.
54(5), 858–861 (1990)
[Koz92] V.V. Kozlov, On the realization of constraints in dynamics. Prikl. Mat. Mekh. 56(4),
692–698 (1992)
[Lan95] S. Lang, Differential and Riemannian manifolds, in Graduate Texts in Mathematics, 3rd
edn, vol. 160 (Springer, New York, 1995)
[LL69] V. Lakshmikantham, S. Leela, Differential and integral inequalities: Theory and appli-
cations, vol. I, in Ordinary Differential Equations (Academic Press, New York, 1969),
Mathematics in Science and Engineering, vol. 55-I.
[LM95] A.D. Lewis, R.M. Murray, Variational principles for constrained systems: theory and
experiment. Int. J. Non-Linear Mech. 30(6), 793–815 (1995)
[Lya07] A.M. Lyapunov, Problème général de la stabilité du mouvement. Ann. Fac. Sci. Toulouse
Sci. Math. Sci. Phys. 9(2), 203–474 (1907).
[Lya92] A.M. Lyapunov, The general problem of the stability of motion. Int. J. Control 55(3),
521–790 (1992). Trans. by A.T. Fuller, Édouard Davaux’s French translation (1907) of
the 1892 Russian original.
[Mañ78] R. Mañé, Persistent manifolds are normally hyperbolic. Trans. Amer. Math. Soc. 246,
261–283 (1978)
[Muj86] J. Mujica, Complex analysis in Banach spaces, in North-Holland Mathematics Studies,
vol. 120 (North-Holland Publishing Co., Amsterdam, 1986). Holomorphic functions
and domains of holomorphy in finite and infinite dimensions, Mathematical Notes, 107
[Pal75] J. Kenneth, Palmer, linearization near an integral manifold. J. Math. Anal. Appl. 51,
243–255 (1975)
[Per29] O. Perron, Über Stabilität und asymptotisches Verhalten der Integrale von Differential-
gleichungssystemen. Math. Z. 29(1), 129–160 (1929)
[Per30] O. Perron, Die Stabilitätsfrage bei Differentialgleichungen. Math. Z. 32(1), 703–728
(1930)
[Poi92] H. Poincaré, Les méthodes nouvelles de la mécanique céleste (Gauthier-Villars, Paris,
1892). Tome I (Les Grands Classiques).
[PS04] C. Pötzsche, S. Siegmund, C m -smoothness of invariant fiber bundles. Topol. Methods
Nonlinear Anal. 24(1), 107–145 (2004)
[PT77] J. Palis, F. Takens, Topological equivalence of normally hyperbolic dynamical systems.
Topology 16(4), 335–345 (1977)
[PT83] J. Palis, F. Takens, Stability of parametrized families of gradient vector fields. Ann.
Math. 118(2, 3), 383–421 (1983).
[Rob68] W.J. Robbin, On the existence theorem for differential equations. Proc. Amer. Math.
Soc. 19, 1005–1006 (1968)
[Roe88] J. Roe, An index theorem on open manifolds. I, II, J. Diff. Geom. 27(1), 87–113, 115–136
(1988).
[RU57] H. Rubin, P. Ungar, Motion under a strong constraining force. Commun. Pure Appl.
Math. 10, 65–87 (1957)
[RW06] H. Reckziegel, E. Wilhelmus, How the curvature generates the holonomy of a connection
in an arbitrary fibre bundle. Results Math. 49(3–4), 339–359 (2006)
[Sak90] K. Sakamoto, Invariant manifolds in singular perturbation problems for ordinary differ-
ential equations. Proc. Roy. Soc. Edinburgh Sect. A 116(1–2), 45–78 (1990)
[Sak94] K. Sakamoto, Smooth linearization of vector fields near invariant manifolds. Hiroshima
Math. J. 24(2), 331–355 (1994)
[Sas58] S. Sasaki, On the differential geometry of tangent bundles of Riemannian manifolds.
Tôhoku Math. J. 10(2), 338–354 (1958)
186 References
[Sch01] T. Schick, Manifolds with boundary and of bounded geometry. Math. Nachr. 223, 103–
120 (2001)
[Shu92] M.A. Shubin, Spectral theory of elliptic operators on noncompact manifolds. Astérisque
5(207), 35–108 (1992), in Méthodes semi-classiques, vol. 1 (Nantes, 1991).
[Šil69] L.P. Šil nikov, A certain new type of bifurcation of multidimensional dynamic systems.
Dokl. Akad. Nauk SSSR 189, 59–62 (1969).
[Tak80] F. Takens, Motion under the influence of a strong constraining force, Global theory of
dynamical systems, in Proceedings of the International Conference, Northwestern Uni-
versity, Evanston, Ill., Lecture Notes in Mathematics, 1979, vol. 819 (Springer, Berlin,
1980), pp. 425–445.
[Uze+02] T. Uzer, C. Jaffé, J. Palacián, P. Yanguas, s Wiggin. The geometry of reaction dynamics.
Nonlinearity 15(4), 957–992 (2002)
[Van89] A. Vanderbauwhede, Centre manifolds, normal forms and elementary bifurcations,
Dynamics reported, in Dyn. Report. Ser. Dynam. Systems Appl., vol. 2 (Wiley, Chich-
ester, 1989), pp. 89–169.
[Ver05] F. Verhulst, Methods and applications of singular perturbations, in Texts in Applied Math-
ematics, vol. 50 (Springer, New York, 2005) (Boundary layers and multiple timescale
dynamics).
[Str79] S.J. van Strien, Center manifolds are not C ∞ . Math. Z. 166(2), 143–145 (1979)
[VG87] A. Vanderbauwhede, S.A. van Gils, Center manifolds and contractions on a scale of
Banach spaces. J. Funct. Anal. 72(2), 209–224 (1987)
[Whi36] H. Whitney, Differentiable manifolds. Ann. Math. 37(2, 3), 645–680 (1936).
[Wig94] S. Wiggins, Normally hyperbolic invariant manifolds in dynamical systems, in Applied
Mathematical Sciences ed. by G. Haller, I. Mezić, vol. 105 (Springer, New York, 1994)
[Yi93] Y. Yi, A generalized integral manifold theorem. J. Diff. Equ. 102(1), 153–187 (1993)
[Zei86] E. Zeidler, Nonlinear functional analysis and its applications, in Fixed-point theorems
(Springer, New York, 1986). (Translated from the German by P.R., Wadsack)
Index
N
D Nemytskii operator, 157
Dynamical system NHIM, 2, see also normally hyperbolic
continuous, 1 invariant manifold
discrete, 1, 21 Non-autonomous system, 24, 141
Noncompactness, 10, 19, 24, 82
Nonholonomic system, 12
E Normal coordinates, 39, 129, 181
Exponential growth numbers, 33, 95 Normal hyperbolicity, 21
Exponential growth rate, 6, 21, 33, 161 history, 13
r-normal hyperbolicity, 22, 61
Normally hyperbolic invariant manifold, 1
F definition, 21
Fiber contraction theorem, 173 parameter dependence, 142
Fixed point, 2 persistence, 3, 76
Forced smoothness, 23, 78
Formal tangent bundle, 114
topology, 115 O
Overflow invariance, 29, 143
a priori, 144
G persistence, 145
Geodesic flow, 41, 180
Graph transform, 16
P
Parallel transport, 47, 181
H differential equation, 47, 181
Hölder continuity, 23, 33 of a frame, 118
Higher order derivative, 18, 136, 161 Partition of unity (uniform), 46
chain rule, 163 Perron method, 14, 16, 106
on a Riemannian manifold, 165 Persistence, see normally hyperbolic
Holonomy, 47, 119 invariant
Hyperbolic fixed point, 2, 15 manifold
non-, 9, 85
I
Immersed submanifold, 25 Q
Implicit function theorem, 152 Quasi-isometry, 58
Index 189
R T
Riemannian manifold, 179 Trivial bundle, 69, 76
S U
Shil’nikov bifurcation, 28 Uniform tubular neighborhood, 58
Smoothness, 3, 7, 18, 109 Uniformly embedded submanifold, 52, 86
forced, see forced smoothness Uniformly immersed submanifold, 51
loss of, 62, 77, 95 Uniformly locally finite cover, 45
non-C?, 6, 9, 23 of a submanifold, 54
of a flow, 155 Unstable manifold, see stable manifold
Spectral gap condition, 4, 6, 22
Stable (unstable) fibration, 4, 148
Stable (unstable) manifold, 2, 15, 147