PDF High Dimensional Probability Viii The Oaxaca Volume Nathael Gozlan Ebook Full Chapter
PDF High Dimensional Probability Viii The Oaxaca Volume Nathael Gozlan Ebook Full Chapter
PDF High Dimensional Probability Viii The Oaxaca Volume Nathael Gozlan Ebook Full Chapter
https://textbookfull.com/product/coulson-and-richardsons-
chemical-engineering-fourth-edition-volume-3a-chemical-and-
biochemical-reactors-and-reaction-engineering-r-ravi/
https://textbookfull.com/product/viii-international-scientific-
siberian-transport-forum-transsiberia-2019-volume-2-zdenka-
popovic/
https://textbookfull.com/product/high-dimensional-microarray-
data-analysis-cancer-gene-diagnosis-and-malignancy-indexes-by-
microarray-shuichi-shinmura/
https://textbookfull.com/product/analyzing-high-dimensional-gene-
expression-and-dna-methylation-data-with-r-1st-edition-hongmei-
zhang-author/
Advanced High Resolution Tomography in Regenerative
Medicine Three Dimensional Exploration into the
Interactions between Tissues Cells and Biomaterials
Alessandra Giuliani
https://textbookfull.com/product/advanced-high-resolution-
tomography-in-regenerative-medicine-three-dimensional-
exploration-into-the-interactions-between-tissues-cells-and-
biomaterials-alessandra-giuliani/
https://textbookfull.com/product/current-critical-debates-in-the-
field-of-transsexual-studies-in-transition-1st-edition-oren-
gozlan/
https://textbookfull.com/product/archaeological-chemistry-
viii-1st-edition-armitage/
https://textbookfull.com/product/a-revolution-unfinished-the-
chegomista-rebellion-and-the-limits-of-revolutionary-democracy-
in-juchitan-oaxaca-colby-ristow/
https://textbookfull.com/product/germany-and-the-second-world-
war-volume-viii-the-eastern-front-1943-1944-the-war-in-the-east-
and-on-the-neighbouring-fronts-1st-edition-frieser/
Progress in Probability
74
Nathael Gozlan
Rafał Latała
Karim Lounici
Mokshay Madiman
Editors
High Dimensional
Probability VIII
The Oaxaca Volume
Progress in Probability
Volume 74
Series Editors
Steffen Dereich, Universität Münster, Münster, Germany
Davar Khoshnevisan, The University of Utah, Salt Lake City, UT, USA
Andreas E. Kyprianou, University of Bath, Bath, UK
Sidney I. Resnick, Cornell University, Ithaca, NY, USA
High Dimensional
Probability VIII
The Oaxaca Volume
Editors
Nathael Gozlan Rafał Latała
MAP 5 Institute of Mathematics
Université Paris Descartes University of Warsaw
Paris, France Warsaw, Poland
This book is published under the imprint Birkhäuser, www.birkhauser-science.com, by the registered
company Springer Nature Switzerland AG.
The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland
Preface
v
vi Preface
Ef (X − Y ) ≤ Ef (X + Y )
u(x, y) = u(x, y) + f (y), x, y ∈ S,
where u is symmetric and f has some good properties. In turn, they define con-
ditions that determine whether the kernel
u is symmetrizable or asymptotically
symmetrizable.
• Pointwise properties of martingales with values in Banach function spaces,
by M. Veraar and I. Yaroslavtsev. In this paper, the authors consider local
viii Preface
martingales with values in a UMD Banach function space and prove that
such martingales have a version which is a martingale field. Moreover, a new
Burkholder–Davis–Gundy type inequality is obtained.
High-Dimensional Statistics
• Concentration inequalities for randomly permuted sums, by M. Albert. The
author proves a deviation inequality for random permutations and uses it to
analyze the second kind error rate in a test of independence.
• Uncertainty quantification for matrix compressed sensing and quantum tomog-
raphy problems, by A. Carpentier, J. Eisert, D. Gross and R. Nickl. The authors
construct minimax optimal non-asymptotic confidence sets for low-rank matrix
recovery algorithms such as the Matrix Lasso and Dantzig selector.
• Uniform-in-bandwidth estimation of the gradient lines of a density, by D. Mason
and B. Pelletier. This paper exploits non parametric statistical techniques to
estimate the gradient flow of a stochastic differential equation. The results can
be of interest in clustering applications or the analysis of stochastic gradient
schemes.
ix
x Contents
M. B. Marcus
CUNY Graduate Center, New York, NY, USA
e-mail: [email protected]
G. Peskir
Department of Mathematics, The University of Manchester, Manchester, UK
e-mail: [email protected]
J. Rosiński ()
Department of Mathematics, University of Tennessee, Knoxville, TN, USA
e-mail: [email protected]
(first edition), mostly dealing with random Fourier series, also came out in 1968.
Functional analysts in the circle of Laurent Schwartz were using properties of sums
of independent Banach space valued random variables to classify Banach spaces.
Engaged in this work, Jørgen published his most cited papers, ‘Sums of
independent Banach space valued random variables’, as a publication of the Institute
of Mathematics in Aarhus in 1972, and a paper with the same title, in Studia
Mathematica in 1974 (cf. [9]). The two papers overlap but each has material that
is not in the other. They contain the important and very useful relationship, between
the norm of the maximal term in a series and the norm of the series, that is now
commonly referred to as ‘Hoffmann-Jørgensen’s inequality’.
Continuing in this study, Jørgen collaborated on two important papers; with
Gilles Pisier on the law of large numbers and the central limit theorem in Banach
spaces [12], and with Richard Dudley and Larry Shepp on the lower tails of
Gaussian seminorms [13]. He returned repeatedly to the topics of these and his
other early papers, examining them in more general and abstract spaces. In this vein
Jørgen reexamined the concept of weak convergence from a new perspective that
completely changed the paradigm of its applications in statistics. He formulated his
new definition of weak convergence in the 1980s1. This is now referred to as ‘weak
convergence in Hoffmann-Jørgensen’s sense’.
Jørgen remained an active researcher throughout his life. He was completing a
paper with Andreas Basse-O’Connor and Jan Rosiński on the extension of the Itô-
Nisio theorem to non-separable Banach spaces, when he died.
Jørgen was also a very fine teacher and advisor with great concern for his
students. He wrote 10 sets of lecture notes for his courses, 2,620 pages in total, and
a monumental 1,184 page, two volume, ‘Probability with a view toward Statistics’,
published by Chapman and Hall in 1994. He was the principal advisor of seven
Ph.D. students.
Reflecting the interest in sums of independent Banach space valued random
variables, and the related field of Gaussian processes in Europe, Laurent Schwarz
and Jacques Neveu organized an auspicious conference on Gaussian Processes in
Strasbourg in 1973. This stimulated research and collaborations that continue to
this day. The Strasbourg conference was followed, every two or three years, by
nine conferences on Probability in Banach Spaces and eight conferences on High
Dimensional Probability. The last one was in Oaxaca, Mexico in 2017. The change
in the conference name reflected a broadening of the interests of the participants.
Jørgen was one of a core group, many of whom attended the 1973 conference,
who took part in all or most of the eighteen conferences throughout their careers,
and often were the conference organizers and editors of the conference proceedings.
Most significantly, Jørgen was the principal organizer of three of these conferences
in the beautiful, serene, conference center in Sandbjerg, Denmark in 1986, 1993
and 2002, and was an editor of the proceedings of these conferences. Moreover, his
1 Some authors have claimed, as we did in [14], that this definition was introduced in Jørgen’s paper
Probability in Banach space [10] in 1977. However, after a careful reading of this paper, we do not
think that this is correct.
1 Jørgen Hoffmann-Jørgensen (1942–2017) 3
n
Sn = Xj , N = sup Xn , M = sup Sn .
n n
j =1
This is how he uses the inequality to prove this theorem. Assume that the
elements of (Xn ) are symmetric and let R(t) = P(M ≥ t) and Q(t) = P(N ≥ t)
for t ≥ 0. Using the relationship
∞
E(M p ) = px p−1 R(x)dx,
0
and similarly for N and Q, it follows from (1.1) that for A > 0
A A/3
px p−1 R(x)dx = p 3p px p−1 R(3x)dx (1.2)
0 0
A/3 A/3
≤ 2p 3p px p−1 Q(x)dx + 8p 3p px p−1 R 2 (x)dx
0 0
A/3
≤ 2p 3p E(N p ) + 8p 3p px p−1 R 2 (x)dx.
0
Choose t0 > 0 such that R(t0 ) < (16p3p )−1 . The condition that P(M < ∞) = 1
implies that t0 < ∞. Then choose A > 3t0 . Note that
A/3 t0 A/3
px p−1 R 2 (x)dx = px p−1 R 2 (x)dx + px p−1 R 2 (x)dx
0 0 t0
A/3
p
≤ t0 + R(t0 ) px p−1 R(x)dx. (1.3)
t0
It follows from (1.4) that when the elements of (Xn ) are symmetric and E(N p ) <
∞, then E(M p ) < ∞. Eliminating the condition that (Xn ) is symmetric is routine.
Inequalities for sums of independent random variables that relate the sum to
the supremum of the individual terms are often referred to as Hoffmann-Jørgensen
type inequalities. Jørgen’s original inequality has been generalized and extended.
Many of these results are surveyed in [5] which obtains Hoffmann-Jørgensen type
inequalities for U statistics. See [4] for a more recent treatment of Hoffmann-
Jørgensen type inequalities in statistics.
Weak Convergence in Hoffmann-Jørgensen’s Sense The classic concept of
convergence in distribution, dating back to de Moivre’s central limit theorem in
1737, admits the following well-known characterisation, traditionally referred to as
weak convergence (cf. [3]).
1 Jørgen Hoffmann-Jørgensen (1942–2017) 5
Let (, F, P) be a probability space, let S be a metric (topological) space, and let
B(S) be the Borel σ -algebra on S. Let X1 , X2 , . . . and X be measurable functions
from to S with respect to F and B(S). If
1
n
Fn (t, ω) := I (ξi (ω) ≤ t) (1.7)
n
i=1
√ 1
n
Xn (f, ω) := n f (Xi (ω))−Ef (X1 ) (1.9)
n
i=1
Q(s, t, A) = P((Xs , Xt ) ∈ A)
References
1. N.T. Andersen, The central limit theorem for nonseparable valued functions. Z. Wahrsch. Verw.
Gebiete 70, 445–455 (1985)
2. P. Berti, P. Rigo, Convergence in distribution of nonmeasurable random elements. Ann. Probab.
32, 365–379 (2004)
3. P. Billingsley, Convergence of Probability Measures (Willey, New York, 1968)
4. E. Giné, R. Nickl, Mathematical Foundations of Infinite-Dimensional Statistical Models
(Cambridge University Press, New York, 2016)
5. R. Giné, E. Latała, J. Zinn, Exponential and moment inequalities for U-statistics, in High
Dimensional Probability II (Seattle 1999). Programs and Probability, vol. 47 (Birkhäuser,
Boston, 2000), pp. 13–38
6. J. Hoffmann-Jørgensen, Markov sets. Math. Scand. 24, 145–166 (1969)
7. J. Hoffmann-Jørgensen, The Theory of Analytic Spaces, vol. 10 (Matematisk Institut, Aarhus
Universitet, Aarhus, 1970)
8. J. Hoffmann-Jørgensen, Existence of measurable modifications of stochastic processes. Z.
Wahrsch. Verw. Gebiete 25, 205–207 (1973)
9. J. Hoffmann-Jørgensen, Sums of independent Banach space valued random variables. Stud.
Math. 52, 159–186 (1974)
10. J. Hoffmann-Jørgensen, Probability in Banach Space. Lecture Notes in Mathematics, vol. 598
(Springer, Berlin, 1977), pp. 1–186
11. J. Hoffmann-Jørgensen, Stochastic Processes on Polish Spaces, vol. 39 (Matematisk Institut,
Aarhus Universitet, Aarhus, 1991)
12. J. Hoffmann-Jørgensen, G. Pisier, The law of large numbers and the central limit theorem in
Banach spaces. Ann. Probab. 4, 587–599 (1976)
13. J. Hoffmann-Jørgensen, L.A. Shepp, R.M. Dudley, On the lower tail of Gaussian seminorms.
Ann. Probab. 7, 319–342 (1979)
14. M.B. Marcus, G. Peskir, J. Rosiński, Jørgen Hoffmann-Jørgensen (1942–2017), vol. 54
(Danish Mathematical Society, Matilde, 2018), pp. 14–15
Chapter 2
Moment Estimation Implied
by the Bobkov-Ledoux Inequality
with f locally Lipschitz. It is easy to see that above results with the following bound
√ √
f − Eμ f p D p d|∇f |∞ p .
Adamczak and Wolff has conjectured in [1] that Bobkov-Ledoux inequality (2.1)
imply
√ √
f − Eμ f p D p|∇f |2 p + Cp|∇f |∞ p .
Their result is based on tricky modification of given function so that (2.1) could be
used. In our paper we are trying to understand this phenomenon and apply its more
advanced form.
2 Moment Estimation Implied by the Bobkov-Ledoux Inequality 11
In this section we investigate possible estimates for gpα , with a given α > 0,
when we know that g α is globally Lipschitz. This bounds will be useful when we
start dealing with the exponential Orlicz norms.
Theorem 2.1 If measure μ satisfies (2.1), function g is non-negative, locally
Lipschitz and p 1, then
for 0 < α 1
1 1 1 1 √
gpα 2 α max p α |∇g α |∞ α
∞
, g2α , αp 2 D |∇g|2 pα
1 1 1 1 1 √
gpα max 2 α p α |∇g α |∞ α
∞
, 2 α g2α , αp 2 D |∇g|2 pα .
gαpα 2p |∇g α |∞ ∞
(2.5)
then following the idea of the proof of (2.4) from [1] we define function h =
1
max{g, c}, where c = gpα /2 α . Obviously, for 2 t p
|∇hα |∞ 2|∇g α |∞
.
hα gαpα
|∇hαt /2|∞ t 1
.
hαt /2 ∞ 2p 2
12 W. Bednorz and G. Głowienko
Therefore, we may apply (2.1) to the function hαt /2 and thus by the Aida Stroock
[2] argument i.e.
d α 2 2 2/t −1 D αt 2/t −1
h t = 2 Ehαt Ent(hαt /2 )2 Eh E|hαt /2−α ∇hα |22 ,
dt t 2
combined with Hölder inequality with exponents t/(t − 2) and t/2 applied to the
last term, gives us
D
hα 2p − hα 22 (p − 2)|∇hα |2 2p . (2.7)
2
Now we have to consider two cases. First suppose that α 1 and then
α2 D
hα 2p hα 22 + (p − 2)c2α−2|∇g|2 2p .
2
1 α 2
hα 22 c2α + g α 22 g p + g α 22 ,
4
which combined together gives us
3 α 2 α2 D
g p g α 22 + (p − 2)c2α−2|∇g|2 2p . (2.8)
4 2
Noting that the case of
1 1 α 2
g α 22 = g2α
2α g2α
pα = g p
4 4
2 Moment Estimation Implied by the Bobkov-Ledoux Inequality 13
g2α 2
pα α D(p − 2)c
2α−2
|∇g|2 2p . (2.10)
Combining together (2.5), (2.9), and (2.11) implies the result in the case of 0 < α
1.
Consider now case of α > 1, following the same reasoning as in previous case,
up to the (2.7) after that Hölder inequality is used, we get
Therefore, by (2.7)
h2α D
h2pα (1 − 2α
) α 2 (p − 2)|∇g|2 2pα . (2.12)
hpα
2α 2
1 α 2 1 1
h2α α 2
2α = h 2 c
2α
+ g α 22 = g p + g α 2p = g2α
pα .
4 4 2
pα gpα , we get
Since obviously h2α 2α
h2α
h2pα (1 − 2α
) 2−1 g2pα
h2α
pα
Clearly (2.5), (2.9), and (2.13) cover the case of α > 1, which ends whole proof.
Next step of the reasoning is to apply previous result to g = |f −Eμ f | and combine
it with Poincaré inequality. Let us gather everything together in form of
14 W. Bednorz and G. Głowienko
|∇g|2 pα
= |∇f |2 pα
,
First, let us recall the notion of exponential Orlicz norms. For any α > 0
1 1
|f |α + |g|α ϕ(1)
α
(|f |α ϕ(1) + |g|α ϕ(1) ) α
1 1
= (f αϕ(α) + gαϕ(α) ) α 2 α −1 (f ϕ(α) + gϕ(α) ).
2 Moment Estimation Implied by the Bobkov-Ledoux Inequality 15
f kα
It is worth to know that f ϕ(α) is always comparable with supk1 k 1/α
. More
precisely, observe that for all k 1 and a positive g
gkα
kα
gkα
ϕ(α) .
k!
Note that, just by the definition of gϕ(α) , there exists k 1 for which
gkα
kα
2−k gkα
ϕ(α) .
k!
Let us denote the set of such k 1 by J (g, α) and note that for any k ∈ J (g, α)
1 1 1
(k!)− kα gkα gϕ(α) 2 α (k!)− kα gkα . (2.14)
1
Next let M e be such a constant that (k!) k k/M for all k 1. We have
following crucial observation namely for all k ∈ J (g, α)
1 gkα
gϕ(α) (2M) α 1
. (2.15)
kα
Therefore, we may use Theorem 2.1 in order to obtain
Corollary 2.2 If μ satisfies (2.1) and g is non-negative locally Lipschitz function,
then for any k ∈ J (g, α) in case of 0 < α 1
1 √
, k − α g2α , αk 2 − α
1 1 1 1
gϕ(α) (4M) α max |∇g α |∞ α
∞
D |∇g|2 kα
.
Note that set J (g, α) is stable with respect to g → h, where h = max{g, c} i.e. if c
is comparable to gϕ(α) there exists C 1 such that for k ∈ J (g, α)
hkα 1
kα
k hkα
ϕ(α) ,
k! C
which means that we cannot easily improve the result using the trick.
In the same way as we have established Corollary 2.1 we can deduce the
following result.
16 W. Bednorz and G. Głowienko
Corollary 2.3 If μ satisfies (2.1) and g is locally Lipschitz function, then for any
k ∈ J (g, α) in case of 0 < α 1
1
1 α
f − Eμ f ϕ(α) (4M) α max ∇|f − Eμ f |α ∞ ∞
,
1√ 1 1√
k − α D |∇f |2 2 , αk 2 − α D |∇f |2 kα .
The result shows that at least for globally Lipschitz function |f |α , α 1 the
exponential moment f − Eμ f ϕ(α) has to bounded, though it is still far from
1
replacement of |∇|f − Eμ f |α |∞ ∞
α
by the expected |∇f |∞ ϕ( 1−α
α .
)
1
Note that it is not possible to simply replace the constant C(α) ∼ (4M) α in
Corollary 2.4 by 1 which would be a natural choice for the question. In the next
section we will show another approach which allows to obtain such a result.
Theorem 2.2 If μ satisfies (2.1) and 0 < α 2, then for any locally Lipschitz
function f
1 √
f − Eμ f ϕ(α) ∇|f − Eμ f |α α
∞ ∞
+ C(α) D |∇f |2 2α
ϕ( 2−α )
1
α
where C(α) = α 2
ln 2 .
Then for any t 1 and a function h = exp(g α t/2) we can apply (2.1), indeed
|∇h|∞ t 1
= |∇g α |∞ ∞
h ∞ 2 2
In fact there are three possibilities we should acknowledge.
The first case we should consider is Eμ exp(g α ) 2, but then
gϕ(α) 1. (2.16)
1
Otherwise there must exist t∗ 1 such that E exp(g α t∗ ) = 2. Clearly 1/t∗α =
gϕ(α) . For simplicity let us denote V (t) = ln E exp(g α t), t 0. It is well known
that V is convex, increasing and V (0) = 0. Now we use (2.1), in order to get for all
t ∈ [0, 1]
V (t) D
Eμ |∇g α |22 exp(g α t − V (t)). (2.17)
t 4
Since V is convex V (0) = 0 we know that V (t)/t is increasing and also V (0) =
Eμ g α . Consequently, integrating (2.18) on [0, t∗ ]
∞
V (t∗ ) D g kα t∗k+1
− Eμ g α Eμ |∇g α |22 .
t∗ 4 (k + 1)!
k=0
1 2 1
α
gϕ(α) = 1
gα . (2.19)
ln 2
t∗α
18 W. Bednorz and G. Głowienko
1
t∗ Eμ g α ln 2.
2
Obviously, we have then
∞
ln 2 D t∗k+2
Eμ |∇g α |22 g kα .
2 4 (k + 1)!
k=0
(k+2)α−2
α 2 |∇g|2 2(k+2)α
α 2 |∇g|2 2(k+2)α g(k+2)α = Eμ g (k+2)α .
g2(k+2)α
Therefore,
∞
Dα 2 (k + 2)|∇g|2 (k+2)α t∗k+2 Eμ g (k+2)α
2
1
ln 2 . (2.20)
2 4 g2(k+2)α (k + 2)!
k=0
where the constant M 1 will be chosen later. First, we bound summands over the
set I , i.e.
(k+2)k+2
Obviously it is easy to choose M close to 2e so that k∈I M k+2 (k+2)! 1. Thus,
we may state our bound over I in the following form
|∇g|2 kα
where K = maxk1 1−1 . On the set J we do as follows
kα 2
But now
t k+2 Eμ g (k+2)α t k+2 Eμ g (k+2)α
∗ ∗
eV (t∗ ) − 1 = 1.
(k + 2)! (k + 2)!
k∈J k0
2 ln 2 2 2
M α K 2t α
∗
Dα 2
but this implies
1 α
(D ) 2 α α K α .
t∗
Note that K is comparable with |∇g|2 ϕ( 2α . It leads to the formula
2−α )
1 √
gϕ(α) = α D |∇g|2 ϕ( 2α . (2.23)
2−α )
1
α
t∗
Bound (2.16), (2.19), and (2.23) implies that for any positive g
1 √
1 2 α
gϕ(α) max |∇g α |∞
α
∞
, gα , α D |∇g|2 ϕ( 2α . (2.24)
ln 2 2−α )