\addbibresource

GSSSources.bib

Geodesic slice sampling on Riemannian manifolds

Alain Durmus École Polytechnique, France, Email: [email protected]    Samuel Gruffaz Université Paris Saclay, France, Email: [email protected]    Mareike Hasenpflug University of Passau, Germany, Email: [email protected], [email protected]    Daniel Rudolf33footnotemark: 3
Abstract

We propose a theoretically justified and practically applicable slice sampling based Markov chain Monte Carlo (MCMC) method for approximate sampling from probability measures on Riemannian manifolds. The latter naturally arise as posterior distributions in Bayesian inference of matrix-valued parameters, for example belonging to either the Stiefel or the Grassmann manifold. Our method, called geodesic slice sampling, is reversible with respect to the distribution of interest, and generalizes Hit-and-run slice sampling on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT to Riemannian manifolds by using geodesics instead of straight lines. We demonstrate the robustness of our sampler’s performance compared to other MCMC methods dealing with manifold valued distributions through extensive numerical experiments, on both synthetic and real data. In particular, we illustrate its remarkable ability to cope with anisotropic target densities, without using gradient information and preconditioning.

1 Introduction

In statistical models for real world phenomena it is natural to incorporate geometric knowledge in terms of manifolds into state- or parameter-spaces. This allows to better capture dependencies, to easily embed additional constraints and to include expert/data guidance. Extracting information with Bayesian inference from such models requires the ability to sample (at least approximately) from the usually highly intractable posterior distribution on the manifold. However, due to the involved geometric features, standard dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT approaches are not directly applicable. This highlights the importance of developing efficient sampling techniques specifically designed for manifolds.

In this paper we tackle this problem and consider a target measure π𝜋\piitalic_π on a general Riemannian manifold 𝖬𝖬\mathsf{M}sansserif_M that has a density with respect to the Riemannian measure ν𝔤subscript𝜈𝔤\nu_{\mathfrak{g}}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT of the manifold, i.e., that is of the form

π(dx)=p(x)𝖬p(y)ν𝔤(dy)ν𝔤(dx),𝜋d𝑥𝑝𝑥subscript𝖬𝑝𝑦subscript𝜈𝔤d𝑦subscript𝜈𝔤d𝑥\pi({\rm d}x)=\frac{p(x)}{\int_{\mathsf{M}}p(y)\,\nu_{\mathfrak{g}}({\rm d}y)}% \nu_{\mathfrak{g}}({\rm d}x),italic_π ( roman_d italic_x ) = divide start_ARG italic_p ( italic_x ) end_ARG start_ARG ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT italic_p ( italic_y ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_y ) end_ARG italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) , (1)

where p:𝖬(0,):𝑝𝖬0p:\mathsf{M}\to(0,\infty)italic_p : sansserif_M → ( 0 , ∞ ) is integrable with respect to the Riemannian measure ν𝔤subscript𝜈𝔤\nu_{\mathfrak{g}}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT. Such frameworks are encountered in various applications, e.g., brain connectivity network analysis [mantoux2021understanding], dimensionality reduction [holbrook2016bayesian], computer vision [lui2012advances], texture analysis [kunze2004bingham] and protein conformation modeling [hamelryck2006sampling]. In most instances, the Riemannian manifolds that arise are matrix manifolds, such as the Stiefel manifold or the Grassmann manifold [edelman1998geometry].

One popular way to approximately sample from intractable distributions are Markov chain Monte Carlo (MCMC) methods. We follow this approach and propose a practical MCMC algorithm based on slice sampling techniques. This method that we refer to as geodesic slice sampler (GSS) incorporates the geometry of the underlying Riemannian manifold by using the geodesics. We briefly describe the transition mechanism of GSS. It crucially exploits the fact that on a Riemannian manifold for every pair (x,v)𝑥𝑣(x,v)( italic_x , italic_v ), where x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M and vTx𝖬𝑣subscript𝑇𝑥𝖬v\in T_{x}\mathsf{M}italic_v ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M is an element of the tangent space at x𝑥xitalic_x, there exists a unique geodesic γ(x,v)subscript𝛾𝑥𝑣\gamma_{(x,v)}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT emanating from x𝑥xitalic_x in direction v𝑣vitalic_v.

More precisely, given the current state x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, a single transition of the GSS targeting the distribution π𝜋\piitalic_π proceeds in three steps. First, a level t𝑡titalic_t is uniformly sampled from the interval (0,p(x))0𝑝𝑥(0,p(x))( 0 , italic_p ( italic_x ) ). Second, a geodesic γ(x,v)subscript𝛾𝑥𝑣\gamma_{(x,v)}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT that passes through x𝑥xitalic_x is randomly chosen. Lastly, a point is generated from the intersection of the geodesic γ(x,v)subscript𝛾𝑥𝑣\gamma_{(x,v)}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT and the level set L(t):={y𝖬p(y)>t}assign𝐿𝑡conditional-set𝑦𝖬𝑝𝑦𝑡L(t):=\{y\in\mathsf{M}\mid p(y)>t\}italic_L ( italic_t ) := { italic_y ∈ sansserif_M ∣ italic_p ( italic_y ) > italic_t }. This final step presents the most significant challenge and requires special care to ensure invariance of the target distribution. To this end, we carefully adapt Neal’s stepping-out and shrinkage procedure (detailed in [neal2003slice]) to our manifold setting.

Metropolis-Hastings, Hamiltonian Monte Carlo and Langevin-type algorithms have already been adapted to the Riemannian manifold setting. Moreover, there exist several tailor-made algorithms for specific manifolds. Consult Section 2.3 for a literature review of MCMC-methods on Riemannian manifolds. However, to the best of our knowledge GSS represents the first practical slice sampling-based MCMC method applicable to general Riemannian manifolds. Following a slice sampling paradigm is appealing because, by design, the length of the transition step is fitly chosen for each transition. This is especially advantageous when efficiently exploring the target distribution requires varying the length of transition steps based on the position and direction of the move, e.g., due to anisotropy. It is worth noting that some slice sampling algorithms achieve this without the need for any tunable hyperparameters. However, a practical implementation usually introduces additional tunable parameters. Nonetheless, it is expected that the resulting slice sampler algorithms’ performance will be more robust with regard to the choice of these parameters compared to, for example, the sensitivity of a Metropolis-Hastings or Hamiltonian Monte Carlo algorithm to the selection of step size. We refer to [neal2003slice] for some more details on these properties of slice sampling; see also [murray2010elliptical] for a further discussion of advantageous and limitations of a slice sampling approach.

To conclude this introduction, our main contributions can be summarized as follows:

  • We propose a practical slice sampling based MCMC-method, which we call geodesic slice sampling, to target distributions of the form (1). It combines a geodesical Hit-and-run algorithm with a 1-dimensional slice sampler arriving at a method that generalizes Hit-and-run slice sampling to Riemannian manifolds.

  • We demonstrate the applicability of GSS in numerical experiments and evaluate its strong suits as well as its drawbacks. Its main feature being its capacity to be quite robust to the geometry of the target, without using gradient information and with an easy tuning of parameters using the diameter of the manifold.

  • We verify that GSS has the correct invariant distribution by showing reversibility with respect to π𝜋\piitalic_π.

The structure of the paper is as follows. First we provide an introduction to slice sampling on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT in Section 2.1, before turning to GSS on general Riemannian manifolds in Section 2.2. This is followed by a literature review of MCMC-methods on Riemannian manifolds in Section 2.3. Readers that wish more details on the differential geometry background used in Section 2 may find it in Appendix C. Section 3 is devoted to numerical experiments. The proof of the reversibility of GSS with respect to π𝜋\piitalic_π is given in Section 4. A formal treatment of the stepping-out and shrinkage procedure is also included in this section.

1.1 General notation

We introduce some general notation that is valid throughout the whole paper. Let \mathbb{N}blackboard_N be the set of strictly positive integers and call 0:={0}assignsubscript00\mathbb{N}_{0}:=\mathbb{N}\cup\{0\}blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT := blackboard_N ∪ { 0 }. We denote by LebdsubscriptLeb𝑑\mathrm{Leb}_{d}roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT the d𝑑ditalic_d-dimensional Lebesgue measure on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and by (d)superscript𝑑\mathcal{B}(\mathbb{R}^{d})caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ) the Borel-σ𝜎\sigmaitalic_σ-algebra. Similar, for a set 𝖲(d)𝖲superscript𝑑\mathsf{S}\in\mathcal{B}(\mathbb{R}^{d})sansserif_S ∈ caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ) we write (𝖲)𝖲\mathcal{B}(\mathsf{S})caligraphic_B ( sansserif_S ) for the trace Borel-σ𝜎\sigmaitalic_σ-algebra on 𝖲𝖲\mathsf{S}sansserif_S. Moreover, we set 𝕊d1:={xdx=1}assignsuperscript𝕊𝑑1conditional-set𝑥superscript𝑑norm𝑥1\mathbb{S}^{d-1}:=\{x\in\mathbb{R}^{d}\mid\|x\|=1\}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT := { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ∣ ∥ italic_x ∥ = 1 } to be the d1𝑑1d-1italic_d - 1-dimensional Euclidean unit sphere. For xd𝑥superscript𝑑x\in\mathbb{R}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, let xsuperscript𝑥topx^{\top}italic_x start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT be its transpose, and write Iddd×dsubscriptId𝑑superscript𝑑𝑑\mathrm{Id}_{d}\in\mathbb{R}^{d\times d}roman_Id start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d end_POSTSUPERSCRIPT for the identity matrix. If 𝖲(d)𝖲superscript𝑑\mathsf{S}\in\mathcal{B}(\mathbb{R}^{d})sansserif_S ∈ caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ) is a finite set or satisfies Lebd(𝖲)(0,)subscriptLeb𝑑𝖲0\mathrm{Leb}_{d}(\mathsf{S})\in(0,\infty)roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( sansserif_S ) ∈ ( 0 , ∞ ), denote the discrete, respectively continuous, uniform distribution on 𝖲𝖲\mathsf{S}sansserif_S as Unif(𝖲)Unif𝖲\mathrm{Unif}(\mathsf{S})roman_Unif ( sansserif_S ). Whenever we introduce random variables in the sequel, we assume them to be defined on some rich enough probability space (Ω,,)Ω(\Omega,\mathcal{F},\mathbb{P})( roman_Ω , caligraphic_F , blackboard_P ). Let (𝖷,𝒳)𝖷𝒳(\mathsf{X},\mathcal{X})( sansserif_X , caligraphic_X ) be a measurable space and let R𝑅Ritalic_R be an (𝖷,𝒳)𝖷𝒳(\mathsf{X},\mathcal{X})( sansserif_X , caligraphic_X )-valued random variable. Then we denote by

R(𝖠):=(R𝖠),A𝒳,formulae-sequenceassignsuperscript𝑅𝖠𝑅𝖠𝐴𝒳\mathbb{P}^{R}(\mathsf{A}):=\mathbb{P}(R\in\mathsf{A}),\qquad A\in\mathcal{X},blackboard_P start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT ( sansserif_A ) := blackboard_P ( italic_R ∈ sansserif_A ) , italic_A ∈ caligraphic_X ,

the distribution of R𝑅Ritalic_R. If R=μsuperscript𝑅𝜇\mathbb{P}^{R}=\mublackboard_P start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT = italic_μ for some probability measure μ𝜇\muitalic_μ, then we also write Rμsimilar-to𝑅𝜇R\sim\muitalic_R ∼ italic_μ. Furthermore, let 𝖸𝖸\mathsf{Y}sansserif_Y be a possibly different set and let f:𝖷𝖸:𝑓𝖷𝖸f:\mathsf{X}\to\mathsf{Y}italic_f : sansserif_X → sansserif_Y be a map from 𝖷𝖷\mathsf{X}sansserif_X to 𝖸𝖸\mathsf{Y}sansserif_Y. For some set 𝖲𝖷𝖲𝖷\mathsf{S}\subseteq\mathsf{X}sansserif_S ⊆ sansserif_X, we denote by f|𝖲evaluated-at𝑓𝖲f|_{\mathsf{S}}italic_f | start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT the restriction of f𝑓fitalic_f to 𝖲𝖲\mathsf{S}sansserif_S. Now equip also 𝖸𝖸\mathsf{Y}sansserif_Y with a σ𝜎\sigmaitalic_σ-algebra and turn it into the measurable space (𝖸,𝒴)𝖸𝒴(\mathsf{Y},\mathcal{Y})( sansserif_Y , caligraphic_Y ). Additionally assume f𝑓fitalic_f to be measurable and let μ𝜇\muitalic_μ be a measure on (𝖷,𝒳)𝖷𝒳(\mathsf{X},\mathcal{X})( sansserif_X , caligraphic_X ). We call

fμ(𝖠):=μ(f1(𝖠)),A𝒴formulae-sequenceassignsubscript𝑓𝜇𝖠𝜇superscript𝑓1𝖠𝐴𝒴f_{\sharp}\mu(\mathsf{A}):=\mu\left(f^{-1}(\mathsf{A})\right),\qquad A\in% \mathcal{Y}italic_f start_POSTSUBSCRIPT ♯ end_POSTSUBSCRIPT italic_μ ( sansserif_A ) := italic_μ ( italic_f start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A ) ) , italic_A ∈ caligraphic_Y

the push forward measure of μ𝜇\muitalic_μ under f𝑓fitalic_f, and for 𝖲𝒳𝖲𝒳\mathsf{S}\in\mathcal{X}sansserif_S ∈ caligraphic_X we call

μ|𝖲(𝖠):=μ(𝖲A),A𝒳,formulae-sequenceassignevaluated-at𝜇𝖲𝖠𝜇𝖲𝐴𝐴𝒳\mu|_{\mathsf{S}}(\mathsf{A}):=\mu(\mathsf{S}\cap A),\qquad A\in\mathcal{X},italic_μ | start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT ( sansserif_A ) := italic_μ ( sansserif_S ∩ italic_A ) , italic_A ∈ caligraphic_X ,

the restriction of μ𝜇\muitalic_μ to 𝖲𝖲\mathsf{S}sansserif_S. To emphasize that a union is disjoint we write square-union\sqcup. Finally, for simplicity we also use \land and \lor to denote the minimum respectively the maximum between two real numbers, i.e., rs:=min{r,s}assign𝑟𝑠𝑟𝑠r\land s:=\min\{r,s\}italic_r ∧ italic_s := roman_min { italic_r , italic_s } and rs:=max{r,s}assign𝑟𝑠𝑟𝑠r\lor s:=\max\{r,s\}italic_r ∨ italic_s := roman_max { italic_r , italic_s } for r,s𝑟𝑠r,s\in\mathbb{R}italic_r , italic_s ∈ blackboard_R.

2 Methodology: Geodesic slice sampling

2.1 Slice sampling on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT

In this section we revisit slice sampling on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT in order to provide a general introduction to the ideas of (uniform simple) slice sampling. The slice sampler, as all MCMC-methods, defines a Markov chain which, for a given measurable unnormalized density p:d(0,):𝑝superscript𝑑0p:\mathbb{R}^{d}\to(0,\infty)italic_p : blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT → ( 0 , ∞ ) that satisfies dp(y)Lebd(dy)(0,)subscriptsuperscript𝑑𝑝𝑦subscriptLeb𝑑d𝑦0\int_{\mathbb{R}^{d}}p(y)\mathrm{Leb}_{d}({\rm d}y)\in(0,\infty)∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_p ( italic_y ) roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_y ) ∈ ( 0 , ∞ ), can be used to approximately sample from the probability measure

π(dx)=p(x)dp(y)Lebd(dy)Lebd(dx).𝜋d𝑥𝑝𝑥subscriptsuperscript𝑑𝑝𝑦subscriptLeb𝑑d𝑦subscriptLeb𝑑d𝑥\pi({\rm d}x)=\frac{p(x)}{\int_{\mathbb{R}^{d}}p(y)\mathrm{Leb}_{d}({\rm d}y)}% \ \mathrm{Leb}_{d}({\rm d}x).italic_π ( roman_d italic_x ) = divide start_ARG italic_p ( italic_x ) end_ARG start_ARG ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_p ( italic_y ) roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_y ) end_ARG roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_x ) .

At its heart are the (super) level sets of p𝑝pitalic_p defined by

L(t):={xdp(x)>t},t(0,),formulae-sequenceassign𝐿𝑡conditional-set𝑥superscript𝑑𝑝𝑥𝑡𝑡0L(t):=\{x\in\mathbb{R}^{d}\mid p(x)>t\},\qquad t\in(0,\infty),italic_L ( italic_t ) := { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ∣ italic_p ( italic_x ) > italic_t } , italic_t ∈ ( 0 , ∞ ) ,

which contain all points that have a function value with respect to p𝑝pitalic_p that is greater than a specified value. The uniform simple slice sampler, which we also call idealized slice sampler, generates approximate samples from π𝜋\piitalic_π by drawing suitably from these level sets.

We denote by (Yk)ksubscriptsubscript𝑌𝑘𝑘(Y_{k})_{k\in\mathbb{N}}( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ blackboard_N end_POSTSUBSCRIPT the Markov chain that corresponds to the idealized slice sampler. A transition from step Yk=xdsubscript𝑌𝑘𝑥superscript𝑑Y_{k}=x\in\mathbb{R}^{d}italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT to step Yk+1subscript𝑌𝑘1Y_{k+1}italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT works as follows:

  1. 1.

    Draw a random level Tk+1Unif((0,p(x)))similar-tosubscript𝑇𝑘1Unif0𝑝𝑥T_{k+1}\sim\mathrm{Unif}\big{(}(0,p(x))\big{)}italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∼ roman_Unif ( ( 0 , italic_p ( italic_x ) ) ), call the result t𝑡titalic_t. This specifies a level set L(t)𝐿𝑡L(t)italic_L ( italic_t ).

  2. 2.

    Draw Yk+1Unif(L(t))similar-tosubscript𝑌𝑘1Unif𝐿𝑡Y_{k+1}\sim\mathrm{Unif}(L(t))italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∼ roman_Unif ( italic_L ( italic_t ) ) uniformly from this level set L(t)𝐿𝑡L(t)italic_L ( italic_t ).

Consequently, for any k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N this defines the conditional distributions

(Tk+1Y1,,Yk,T1,,Tk)=(Tk+1Yk)=Unif((0,p(Yk))),\displaystyle\mathbb{P}\left(T_{k+1}\in\cdot\mid Y_{1},\ldots,Y_{k},T_{1},% \ldots,T_{k}\right)=\mathbb{P}\left(T_{k+1}\in\cdot\mid Y_{k}\right)=\mathrm{% Unif}\big{(}(0,p(Y_{k}))\big{)},blackboard_P ( italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = blackboard_P ( italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = roman_Unif ( ( 0 , italic_p ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ) ) ,
(Yk+1Y1,,Yk,T1,,Tk+1)=(Yk+1Tk+1)=Unif(L(Tk+1)).\displaystyle\mathbb{P}\left(Y_{k+1}\in\cdot\mid Y_{1},\ldots,Y_{k},T_{1},% \ldots,T_{k+1}\right)=\mathbb{P}\left(Y_{k+1}\in\cdot\mid T_{k+1}\right)=% \mathrm{Unif}\big{(}L(T_{k+1})\big{)}.blackboard_P ( italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) = blackboard_P ( italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∈ ⋅ ∣ italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) = roman_Unif ( italic_L ( italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) ) .

A graphic representation of the transition mechanism for d=1𝑑1d=1italic_d = 1 can be found in Figure 1.

Refer to caption
(a) Sample the level t𝑡titalic_t uniformly from (0,p(x))0𝑝𝑥(0,p(x))( 0 , italic_p ( italic_x ) ).
Refer to caption
(b) The level t𝑡titalic_t specifies a level set L(t)𝐿𝑡L(t)italic_L ( italic_t ).
Refer to caption
(c) Sample the next point y𝑦yitalic_y uniformly from L(t)𝐿𝑡L(t)italic_L ( italic_t ).
Figure 1: Transition mechanism of the idealized slice sampler for initial point x𝑥xitalic_x.

We can also describe the idealized slice sampler through its transition kernel given by

H:d×(d):𝐻superscript𝑑superscript𝑑\displaystyle H:\mathbb{R}^{d}\times\mathcal{B}(\mathbb{R}^{d})italic_H : blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT × caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ) [0,1]absent01\displaystyle\to[0,1]→ [ 0 , 1 ]
(x,𝖠)𝑥𝖠\displaystyle(x,\mathsf{A})( italic_x , sansserif_A ) 1p(x)(0,p(x))1Lebd(L(t))L(t)𝟙𝖠(y)Lebd(dy)Leb1(dt).maps-toabsent1𝑝𝑥subscript0𝑝𝑥1subscriptLeb𝑑𝐿𝑡subscript𝐿𝑡subscript1𝖠𝑦subscriptLeb𝑑d𝑦subscriptLeb1d𝑡\displaystyle\mapsto\frac{1}{p(x)}\int_{(0,p(x))}\frac{1}{\mathrm{Leb}_{d}\big% {(}L(t)\big{)}}\int_{L(t)}\mathbbm{1}_{\mathsf{A}}(y)\ \mathrm{Leb}_{d}({\rm d% }y)\,\mathrm{Leb}_{1}({\rm d}t).↦ divide start_ARG 1 end_ARG start_ARG italic_p ( italic_x ) end_ARG ∫ start_POSTSUBSCRIPT ( 0 , italic_p ( italic_x ) ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_L ( italic_t ) ) end_ARG ∫ start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_y ) roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_y ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) .

Since slice sampling was brought to the attention of the statistics community in [besag1993spatial], the properties of idealized slice sampling, including reversibility with respect to π𝜋\piitalic_π and conditions for ergodicity, have been investigated in several works, e.g., [mira2002efficiency, natarovskii2021quantitative, roberts2002convergence, rudolf2013positivity, rudolf2018comparison]. Exemplary, the following result illustrates a major advantage of slice sampling: In [natarovskii2021quantitative, Corollary 3.7], Natarovskii et. al. show that the spectral gap of H𝐻Hitalic_H only depends on the level set function tLebd(L(t))maps-to𝑡subscriptLeb𝑑𝐿𝑡t\mapsto\mathrm{Leb}_{d}(L(t))italic_t ↦ roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_L ( italic_t ) ), i.e., on the volume of the level sets, not their shape. This means that the performance of the idealized slice sampler is ignorant of the introduction of, e.g., multimodality, local modes or anisotropy as long as the volume of the level sets is not modified.

Unfortunately, each transition of the idealized slice sampler requires to sample from the uniform distribution Unif(L(t))Unif𝐿𝑡\mathrm{Unif}(L(t))roman_Unif ( italic_L ( italic_t ) ), t>0𝑡0t>0italic_t > 0, of a level set. They are d𝑑ditalic_d-dimensional, measurable sets and in general there is no efficient algorithm to tackle this problem. This is a major prevention for the implementation of the idealized slice sampler. One modification strategy to obtain a practical algorithm is called hybrid slice sampling, see [latuszyinski2014convergence]. Here, the uniform distribution on the level sets is replaced by a family of kernels (Ht)t>0subscriptsubscript𝐻𝑡𝑡0(H_{t})_{t>0}( italic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t > 0 end_POSTSUBSCRIPT such that for any t>0𝑡0t>0italic_t > 0 (where it is well-defined) Unif(L(t))Unif𝐿𝑡\mathrm{Unif}(L(t))roman_Unif ( italic_L ( italic_t ) ) is invariant for Htsubscript𝐻𝑡H_{t}italic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. This leads to a transition kernel of the form

(x,A)1p(x)(0,p(x))Ht(x,A)Leb1(dt),xd,A(d).formulae-sequencemaps-to𝑥𝐴1𝑝𝑥subscript0𝑝𝑥subscript𝐻𝑡𝑥𝐴subscriptLeb1d𝑡formulae-sequence𝑥superscript𝑑𝐴superscript𝑑(x,A)\mapsto\frac{1}{p(x)}\int_{(0,p(x))}H_{t}(x,A)\ \mathrm{Leb}_{1}({\rm d}t% ),\qquad x\in\mathbb{R}^{d},A\in\mathcal{B}(\mathbb{R}^{d}).( italic_x , italic_A ) ↦ divide start_ARG 1 end_ARG start_ARG italic_p ( italic_x ) end_ARG ∫ start_POSTSUBSCRIPT ( 0 , italic_p ( italic_x ) ) end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , italic_A ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) , italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT , italic_A ∈ caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ) .

In order to get a better understanding of this strategy, we exemplary consider the case d=1𝑑1d=1italic_d = 1 introduced in [neal2003slice]. There a stepping-out and a shrinkage procedure is proposed. In the following we aim to provide a basic understanding of the concepts behind these two schemes. To this end, we give a verbal description, and a visualization in Figure 3. The stepping-out procedure is treated in more detail in Section 4.1. This includes pseudocode and a careful definition of the generated distributions. For an extensive treatment of the shrinkage procedure, we refer to [ReversibilityEllipticalSliceSampler].

The stepping-out and shrinkage based hybrid slice sampler takes a point x𝑥x\in\mathbb{R}italic_x ∈ blackboard_R (current state of Markov chain) and a level set L(t)𝐿𝑡L(t)italic_L ( italic_t ) (level generated as in the first step of the transition mechanism of the idealized slice sampler), and proceeds in two steps.

1. Stepping-out:

Under the specification of two hyper parameters w>0𝑤0w>0italic_w > 0 and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N, the stepping-out procedure chooses a random segment of \mathbb{R}blackboard_R containing x𝑥xitalic_x. To this end, an interval of length w𝑤witalic_w is placed randomly around x𝑥xitalic_x by sampling the left interval boundary point L1Unif((xw,x))similar-tosubscript𝐿1Unif𝑥𝑤𝑥L_{1}\sim\mathrm{Unif}\big{(}(x-w,x)\big{)}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ roman_Unif ( ( italic_x - italic_w , italic_x ) ) and setting the right interval boundary to R1=L1+wsubscript𝑅1subscript𝐿1𝑤R_{1}=L_{1}+witalic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_w. Then this interval is extended iteratively to the left by intervals of length w𝑤witalic_w until for the first time the left boundary leaves the level set L(t)𝐿𝑡L(t)italic_L ( italic_t ), or the maximal number of stepping-out steps to the left ιι\upiotaroman_ι is reached. Here ιι\upiotaroman_ι is obtained by randomly splitting m+1𝑚1m+1italic_m + 1, the maximal number of total stepping-out steps, into two summands ιι\upiotaroman_ι and m+1ι𝑚1ιm+1-\upiotaitalic_m + 1 - roman_ι. Similarly the interval is extended iteratively to the right by intervals of length w𝑤witalic_w until the right boundary hits L(t)𝐿𝑡\mathbb{R}\setminus L(t)blackboard_R ∖ italic_L ( italic_t ) for the first time, or m+1ι𝑚1ιm+1-\upiotaitalic_m + 1 - roman_ι steps have been performed. This provides a randomly generated interval I=(L,R)𝐼superscript𝐿superscript𝑅I=(L^{\ast},R^{\ast})italic_I = ( italic_L start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_R start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), where

L=L1(τ(ι1))wandR=R1+(τr(mι))wformulae-sequencesuperscript𝐿subscript𝐿1subscriptτι1𝑤andsuperscript𝑅subscript𝑅1subscriptτ𝑟𝑚ι𝑤L^{\ast}=L_{1}-\big{(}\uptau_{\ell}\land(\upiota-1)\big{)}w\qquad\text{and}% \qquad R^{\ast}=R_{1}+\big{(}\uptau_{r}\land(m-\upiota)\big{)}witalic_L start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - ( roman_τ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∧ ( roman_ι - 1 ) ) italic_w and italic_R start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ( roman_τ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∧ ( italic_m - roman_ι ) ) italic_w

with

τ:=inf{k0L1kwL(t)}andτr:={k0R1+kwL(t)}.formulae-sequenceassignsubscriptτinfimumconditional-set𝑘0subscript𝐿1𝑘𝑤𝐿𝑡andassignsubscriptτ𝑟conditional-set𝑘0subscript𝑅1𝑘𝑤𝐿𝑡\uptau_{\ell}:=\inf\{k\geqslant 0\mid L_{1}-kw\notin L(t)\}\qquad\text{and}% \qquad\uptau_{r}:=\{k\geqslant 0\mid R_{1}+kw\notin L(t)\}.roman_τ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT := roman_inf { italic_k ⩾ 0 ∣ italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k italic_w ∉ italic_L ( italic_t ) } and roman_τ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT := { italic_k ⩾ 0 ∣ italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_k italic_w ∉ italic_L ( italic_t ) } .
2. Shrinkage:

We generate a point from IL(t)𝐼𝐿𝑡I\cap L(t)italic_I ∩ italic_L ( italic_t ) with the shrinkage procedure. Roughly speaking, it is an adaptive acceptance/rejection scheme that shrinks the proposal area with each rejection.111Note that we describe here a scheme that differs slightly from the original one in [neal2003slice] and rather resembles the one of the elliptical slice sampler, see [murray2010elliptical]. Crucially, we view here the interval I𝐼Iitalic_I as a circle, i.e., if one \ldqleaves\rdq the interval at the right boundary, it is immediately \ldqreentered\rdq at the left boundary. Observe that a set 𝖩𝖩\mathsf{J}\subseteq\mathbb{R}sansserif_J ⊆ blackboard_R (viewed as a circle) is divided by two points y,z𝖩𝑦𝑧𝖩y,z\in\mathsf{J}italic_y , italic_z ∈ sansserif_J into two segments, namely (yz,yz)𝖩𝑦𝑧𝑦𝑧𝖩(y\land z,y\lor z)\cap\mathsf{J}( italic_y ∧ italic_z , italic_y ∨ italic_z ) ∩ sansserif_J and 𝖩(yz,yz)𝖩𝑦𝑧𝑦𝑧\mathsf{J}\setminus(y\land z,y\lor z)sansserif_J ∖ ( italic_y ∧ italic_z , italic_y ∨ italic_z ). If 𝖩𝖩\mathsf{J}sansserif_J contains the initial point x𝑥xitalic_x, we set

𝕁(y,z,𝖩):={(yz,yz)𝖩,if x(yz,yz),𝖩(yz,yz),otherwise,assign𝕁𝑦𝑧𝖩cases𝑦𝑧𝑦𝑧𝖩if 𝑥𝑦𝑧𝑦𝑧𝖩𝑦𝑧𝑦𝑧otherwise\mathbb{J}(y,z,\mathsf{J}):=\begin{dcases}(y\land z,y\lor z)\cap\mathsf{J},&% \text{if }x\in(y\land z,y\lor z),\\ \mathsf{J}\setminus(y\land z,y\lor z),&\text{otherwise},\end{dcases}blackboard_J ( italic_y , italic_z , sansserif_J ) := { start_ROW start_CELL ( italic_y ∧ italic_z , italic_y ∨ italic_z ) ∩ sansserif_J , end_CELL start_CELL if italic_x ∈ ( italic_y ∧ italic_z , italic_y ∨ italic_z ) , end_CELL end_ROW start_ROW start_CELL sansserif_J ∖ ( italic_y ∧ italic_z , italic_y ∨ italic_z ) , end_CELL start_CELL otherwise , end_CELL end_ROW

to be the segment containing x𝑥xitalic_x. The shrinkage procedure now builds a sequence of such segments. First we sample Y1Unif(I)similar-tosubscript𝑌1Unif𝐼Y_{1}\sim\mathrm{Unif}(I)italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ roman_Unif ( italic_I ), and set 𝖩1=Isubscript𝖩1𝐼\mathsf{J}_{1}=Isansserif_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_I. For k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N, we let Yk+1subscript𝑌𝑘1Y_{k+1}italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT be a random variable with conditional distribution

(Yk+1Y1,,Yk,𝖩1,,𝖩k)=Unif(𝖩k),\mathbb{P}\left(Y_{k+1}\in\cdot\mid Y_{1},\ldots,Y_{k},\mathsf{J}_{1},\ldots,% \mathsf{J}_{k}\right)=\mathrm{Unif}(\mathsf{J}_{k}),blackboard_P ( italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , sansserif_J start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = roman_Unif ( sansserif_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ,

that is, we draw the next proposal uniformly from the current segment. Observe that Yk+1subscript𝑌𝑘1Y_{k+1}italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT divides 𝖩ksubscript𝖩𝑘\mathsf{J}_{k}sansserif_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT into two segments as described above. Set 𝖩k+1=𝕁(Yk,Yk+1,𝖩k)subscript𝖩𝑘1𝕁subscript𝑌𝑘subscript𝑌𝑘1subscript𝖩𝑘\mathsf{J}_{k+1}=\mathbb{J}(Y_{k},Y_{k+1},\mathsf{J}_{k})sansserif_J start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT = blackboard_J ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , sansserif_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) for k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N, i.e., we keep the segment of 𝖩ksubscript𝖩𝑘\mathsf{J}_{k}sansserif_J start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT that contains the initial point x𝑥xitalic_x. This is continued until we generate a proposal that lies in L(t)𝐿𝑡L(t)italic_L ( italic_t ). Hence, overall the shrinkage procedure yields a random point Y=Yτsuperscript𝑌subscript𝑌τY^{\ast}=Y_{\uptau}italic_Y start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_Y start_POSTSUBSCRIPT roman_τ end_POSTSUBSCRIPT where τ:=inf{kYkL(t)}assignτinfimumconditional-set𝑘subscript𝑌𝑘𝐿𝑡\uptau:=\inf\{k\in\mathbb{N}\mid Y_{k}\in L(t)\}roman_τ := roman_inf { italic_k ∈ blackboard_N ∣ italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ italic_L ( italic_t ) }. If the stepping-out and shrinkage scheme is embedded into a 1-dimensional hybrid slice sampler this point is then the next random variable of the chain.

We comment on the hyperparameters of the stepping-out procedure.

Remark 1.

The output of the stepping-out procedure can be viewed as a \ldqloose\rdq approximation of the set L(t)𝐿𝑡L(t)italic_L ( italic_t ). The maximal possible length of this approximation is given by mw𝑚𝑤mwitalic_m italic_w, but also the individual choice of m𝑚mitalic_m and w𝑤witalic_w affects the quality of the approximation depending on the shape of L(t)𝐿𝑡L(t)italic_L ( italic_t ). Choosing m𝑚mitalic_m larger and w𝑤witalic_w smaller can lead to an interval that lies \ldqtighter\rdq around L(t)𝐿𝑡L(t)italic_L ( italic_t ). However, if L(t)𝐿𝑡L(t)italic_L ( italic_t ) has \ldqholes\rdq, it is also more likely that parts of L(t)𝐿𝑡L(t)italic_L ( italic_t ) are \ldqcut off\rdq. Moreover, increasing m𝑚mitalic_m increases the computational cost of the stepping-out procedure.

Refer to caption Refer to caption Refer to caption Refer to caption

(a) Stepping-out procedure with one step to the left and three steps to the right.

Refer to caption Refer to caption Refer to caption Refer to caption

(b) Shrinkage procedure where fourth proposal is accepted.
Figure 3: One run of the stepping-out and shrinkage procedure each.

Now we have a practical slice sampling algorithm on \mathbb{R}blackboard_R. One way to lift the stepping-out and shrinkage approach to dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT is to combine it with the Hit-and-run algorithm arriving at something called the Hit-and-run slice sampler222Hit-and-run slice sampling is already mentioned in [Mackay, Section 29.7]. Convergence and comparison results for this sampler are provided in [latuszyinski2014convergence, rudolf2018comparison], and it is used as a benchmark approach in [murray2010elliptical, schaer2023gibbsian]., which essentially samples a random line though the current point and then runs a slice sampler on this line. For xd𝑥superscript𝑑x\in\mathbb{R}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and v𝕊d1𝑣superscript𝕊𝑑1v\in\mathbb{S}^{d-1}italic_v ∈ blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT we define

γ(x,v)(θ)=x+θv,θ,formulae-sequencesubscript𝛾𝑥𝑣𝜃𝑥𝜃𝑣𝜃\gamma_{(x,v)}(\theta)=x+\theta v,\qquad\theta\in\mathbb{R},italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) = italic_x + italic_θ italic_v , italic_θ ∈ blackboard_R ,

to be the line through x𝑥xitalic_x in direction v𝑣vitalic_v, and for t>0𝑡0t>0italic_t > 0

L(x,v,t):={αp(γ(x,v)(α))>t}={αγ(x,v)(α)L(t)}assign𝐿𝑥𝑣𝑡conditional-set𝛼𝑝subscript𝛾𝑥𝑣𝛼𝑡conditional-set𝛼subscript𝛾𝑥𝑣𝛼𝐿𝑡L(x,v,t):=\{\alpha\in\mathbb{R}\mid p\left(\gamma_{(x,v)}(\alpha)\right)>t\}=% \{\alpha\in\mathbb{R}\mid\gamma_{(x,v)}(\alpha)\in L(t)\}italic_L ( italic_x , italic_v , italic_t ) := { italic_α ∈ blackboard_R ∣ italic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) > italic_t } = { italic_α ∈ blackboard_R ∣ italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ∈ italic_L ( italic_t ) }

to be the parameterized intersection of a straight line with a level set. The transition mechanism of the Hit-and-run slice sampler then proceeds as depicted in Figure 4.

Input: current state xd𝑥superscript𝑑x\in\mathbb{R}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPTSample level t𝑡titalic_t uniformlydistributed from (0,p(x))0𝑝𝑥(0,p(x))( 0 , italic_p ( italic_x ) ).Sample direction v𝑣vitalic_v uniformlydistributed from 𝕊d1superscript𝕊𝑑1\mathbb{S}^{d-1}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT.Generate I𝐼Iitalic_I with stepping-outprocedure for L(x,v,t)𝐿𝑥𝑣𝑡L(x,v,t)italic_L ( italic_x , italic_v , italic_t ).Refer to captionRefer to captionRefer to captionRefer to captionGenerate θ𝜃\thetaitalic_θ from IL(x,v,t)𝐼𝐿𝑥𝑣𝑡I\cap L(x,v,t)italic_I ∩ italic_L ( italic_x , italic_v , italic_t )with shrinkage procedure.Refer to captionRefer to captionRefer to captionRefer to captionOutput: next state y=γ(x,v)(θ)𝑦subscript𝛾𝑥𝑣𝜃y=\gamma_{(x,v)}(\theta)italic_y = italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ )
Figure 4: Flow chart describing the transition mechanism of the Hit-and-run slice sampler.

For a complete algorithmic description see Algorithm 1 with σd1(x)superscriptsubscript𝜎𝑑1𝑥\sigma_{d-1}^{(x)}italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT being the uniform distribution on 𝕊d1superscript𝕊𝑑1\mathbb{S}^{d-1}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT for all xd𝑥superscript𝑑x\in\mathbb{R}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. Equivalently, the transition mechanism can be described as first sampling a direction v𝑣vitalic_v uniformly from 𝕊d1superscript𝕊𝑑1\mathbb{S}^{d-1}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT, and then running a 1-dimensional stepping-out and shrinkage based slice sampler for the unnormalized density αp(γ(x,v)(α))maps-to𝛼𝑝subscript𝛾𝑥𝑣𝛼\alpha\mapsto p\left(\gamma_{(x,v)}(\alpha)\right)italic_α ↦ italic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) with initial point 0.

Our interest into this framework arises from the fact that, by generalizing straight lines to geodesics, it can be leveraged to general Riemannian manifolds, which we discuss in the next section.

2.2 Geodesic slice sampling

We now turn to slice sampling on Riemannian manifolds. More precise, in the following we consider sampling from measures π𝜋\piitalic_π defined on a state space 𝖬𝖬\mathsf{M}sansserif_M satisfying the following assumption:

Assumption A.

Let 𝖬𝖬\mathsf{M}sansserif_M be a d𝑑ditalic_d-dimensional, smooth, connected manifold. In addition we assume that 𝖬𝖬\mathsf{M}sansserif_M is endowed with a Riemannian metric 𝔤𝔤\mathfrak{g}fraktur_g and is complete.

(For the sake of brevity, we keep the introduction of objects from differential and Riemannian geometry to a bare minimum here. References and details on certain aspects can be found in Appendix C.) First, we need to equip 𝖬𝖬\mathsf{M}sansserif_M with a suitable reference measure. Under Assumption A there exists an atlas (𝖴i,φi)isubscriptsubscript𝖴𝑖subscript𝜑𝑖𝑖(\mathsf{U}_{i},\varphi_{i})_{i\in\mathbb{N}}( sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT consisting of homeomorphisms φi:𝖴id:subscript𝜑𝑖subscript𝖴𝑖superscript𝑑\varphi_{i}:\mathsf{U}_{i}\to\mathbb{R}^{d}italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT such that for all i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N the map φiφj1:φj(𝖴j𝖴i)φi(𝖴j𝖴i):subscript𝜑𝑖superscriptsubscript𝜑𝑗1subscript𝜑𝑗subscript𝖴𝑗subscript𝖴𝑖subscript𝜑𝑖subscript𝖴𝑗subscript𝖴𝑖\varphi_{i}\circ\varphi_{j}^{-1}:\varphi_{j}\left(\mathsf{U}_{j}\cap\mathsf{U}% _{i}\right)\to\varphi_{i}\left(\mathsf{U}_{j}\cap\mathsf{U}_{i}\right)italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∘ italic_φ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT : italic_φ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( sansserif_U start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∩ sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) → italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( sansserif_U start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∩ sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is infinitely often continuously differentiable. We denote the tangent space to 𝖬𝖬\mathsf{M}sansserif_M at x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M as Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M. The Riemannian metric 𝔤:x𝔤x:𝔤maps-to𝑥subscript𝔤𝑥\mathfrak{g}:x\mapsto\mathfrak{g}_{x}fraktur_g : italic_x ↦ fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT is a smooth field of symmetric, positive definite covariant 2-tensors. Let (𝖬)𝖬\mathcal{B}(\mathsf{M})caligraphic_B ( sansserif_M ) be the Borel-σ𝜎\sigmaitalic_σ-algebra induced by the topology of 𝖬𝖬\mathsf{M}sansserif_M. The measure ν𝔤subscript𝜈𝔤\nu_{\mathfrak{g}}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT on 𝖬𝖬\mathsf{M}sansserif_M is defined for any 𝖠(𝖬)𝖠𝖬\mathsf{A}\in\mathcal{B}(\mathsf{M})sansserif_A ∈ caligraphic_B ( sansserif_M ) as

ν𝔤(𝖠):=i=1φi(𝖴i)(ρi𝟙𝖠det(g,φi))φi1(z)Lebd(dz),assignsubscript𝜈𝔤𝖠superscriptsubscript𝑖1subscriptsubscript𝜑𝑖subscript𝖴𝑖subscript𝜌𝑖subscript1𝖠𝑔subscript𝜑𝑖superscriptsubscript𝜑𝑖1𝑧subscriptLeb𝑑d𝑧\nu_{\mathfrak{g}}(\mathsf{A}):=\sum_{i=1}^{\infty}\int_{\varphi_{i}(\mathsf{U% }_{i})}\left(\rho_{i}\cdot\mathbbm{1}_{\mathsf{A}}\cdot\sqrt{\det(g,\varphi_{i% })}\right)\circ\varphi_{i}^{-1}(z)\ \mathrm{Leb}_{d}({\rm d}z),italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( sansserif_A ) := ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ⋅ square-root start_ARG roman_det ( italic_g , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) ∘ italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_z ) roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_z ) , (2)

where {ρi}isubscriptsubscript𝜌𝑖𝑖\{\rho_{i}\}_{i\in\mathbb{N}}{ italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT is a partition of unity subordinate to {𝖴i}isubscriptsubscript𝖴𝑖𝑖\{\mathsf{U}_{i}\}_{i\in\mathbb{N}}{ sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT, and for i𝑖i\in\mathbb{N}italic_i ∈ blackboard_N

det(g,φi):𝖴i:𝑔subscript𝜑𝑖subscript𝖴𝑖\displaystyle\sqrt{\det(g,\varphi_{i})}:\mathsf{U}_{i}square-root start_ARG roman_det ( italic_g , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG : sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT [0,)absent0\displaystyle\to[0,\infty)→ [ 0 , ∞ )
x𝑥\displaystyle xitalic_x det[(𝔤x(Ej,xφi,Ek,xφi)){1j,kd}]maps-toabsentdelimited-[]subscriptsubscript𝔤𝑥superscriptsubscript𝐸𝑗𝑥subscript𝜑𝑖superscriptsubscript𝐸𝑘𝑥subscript𝜑𝑖formulae-sequence1𝑗𝑘𝑑\displaystyle\mapsto\sqrt{\det\left[\left(\mathfrak{g}_{x}(E_{j,x}^{\varphi_{i% }},E_{k,x}^{\varphi_{i}})\right)_{\{1\leqslant j,k\leqslant d\}}\right]}↦ square-root start_ARG roman_det [ ( fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_E start_POSTSUBSCRIPT italic_j , italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_E start_POSTSUBSCRIPT italic_k , italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ) start_POSTSUBSCRIPT { 1 ⩽ italic_j , italic_k ⩽ italic_d } end_POSTSUBSCRIPT ] end_ARG

with E1φi,,Edφisuperscriptsubscript𝐸1subscript𝜑𝑖superscriptsubscript𝐸𝑑subscript𝜑𝑖E_{1}^{\varphi_{i}},\ldots,E_{d}^{\varphi_{i}}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , … , italic_E start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT being the coordinate frames associated to (𝖴i,φi)subscript𝖴𝑖subscript𝜑𝑖(\mathsf{U}_{i},\varphi_{i})( sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). We call ν𝔤subscript𝜈𝔤\nu_{\mathfrak{g}}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT the Riemannian measure induced by 𝔤𝔤\mathfrak{g}fraktur_g. It can be viewed as an extension of the Lebesgue measure to Riemannian manifolds, see e.g. [AnalysisIII, Section XII.1]. We provide some examples for manifolds satisfying Assumption A.

Example 2.
  1. 1.

    The most simple example for a d𝑑ditalic_d-dimensional, smooth, connected manifold is dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. Since for each xd𝑥superscript𝑑x\in\mathbb{R}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT the tangent space to dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT at x𝑥xitalic_x is again dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we can equip it with the Riemannian metric

    𝔤x(v1,v2)=v1v2,xd,v1,v2Tx𝖬,formulae-sequencesubscript𝔤𝑥subscript𝑣1subscript𝑣2superscriptsubscript𝑣1topsubscript𝑣2formulae-sequence𝑥superscript𝑑subscript𝑣1subscript𝑣2subscript𝑇𝑥𝖬\mathfrak{g}_{x}(v_{1},v_{2})=v_{1}^{\top}v_{2},\qquad x\in\mathbb{R}^{d},v_{1% },v_{2}\in T_{x}\mathsf{M},fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M ,

    rendering dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT complete. The induced Riemannian measure is the Lebesgue measure.

  2. 2.

    We equip the Euclidean unit sphere 𝕊d1superscript𝕊𝑑1\mathbb{S}^{d-1}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT with the standard Riemannian metric 𝔤^^𝔤\widehat{\mathfrak{g}}over^ start_ARG fraktur_g end_ARG induced by its embedding Id:𝕊d1d:Idsuperscript𝕊𝑑1superscript𝑑\mathrm{Id}:\mathbb{S}^{d-1}\to\mathbb{R}^{d}roman_Id : blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, that is,

    𝔤^x=(Id(v1))Id(v2),x𝕊d1,v1,v2Tx𝕊d1,formulae-sequencesubscript^𝔤𝑥superscriptsubscriptIdsubscript𝑣1topsubscriptIdsubscript𝑣2formulae-sequence𝑥superscript𝕊𝑑1subscript𝑣1subscript𝑣2subscript𝑇𝑥superscript𝕊𝑑1\widehat{\mathfrak{g}}_{x}=\left(\mathrm{Id}_{*}(v_{1})\right)^{\top}\mathrm{% Id}_{*}(v_{2}),\qquad x\in\mathbb{S}^{d-1},v_{1},v_{2}\in T_{x}\mathbb{S}^{d-1},over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT = ( roman_Id start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Id start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , italic_x ∈ blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT ,

    where IdsubscriptId\mathrm{Id}_{*}roman_Id start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT is the map on the tangent spaces induced by IdId\mathrm{Id}roman_Id. Then (𝕊d1,𝔤^)superscript𝕊𝑑1^𝔤(\mathbb{S}^{d-1},\widehat{\mathfrak{g}})( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT , over^ start_ARG fraktur_g end_ARG ) satisfies Assumption A. The corresponding Riemannian measure ν𝔤^subscript𝜈^𝔤\nu_{\widehat{\mathfrak{g}}}italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG end_POSTSUBSCRIPT is the standard volume measure.

  3. 3.

    Let k,n𝑘𝑛k,n\in\mathbb{N}italic_k , italic_n ∈ blackboard_N with kn𝑘𝑛k\leqslant nitalic_k ⩽ italic_n. The k(k1)/2+k(nk)𝑘𝑘12𝑘𝑛𝑘k(k-1)/2+k(n-k)italic_k ( italic_k - 1 ) / 2 + italic_k ( italic_n - italic_k )-dimensional, smooth, connected Stiefel manifold

    𝒱(n,k):={Γn×kΓΓ=Idk},assign𝒱𝑛𝑘conditional-setΓsuperscript𝑛𝑘superscriptΓtopΓsubscriptId𝑘\mathcal{V}(n,k):=\{\Gamma\in\mathbb{R}^{n\times k}\mid\Gamma^{\top}\Gamma=% \mathrm{Id}_{k}\},caligraphic_V ( italic_n , italic_k ) := { roman_Γ ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT ∣ roman_Γ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Γ = roman_Id start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } ,

    consists of (ordered) k𝑘kitalic_k-tuples of vectors in nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT that from an orthonormal system. This means that each point on the Stiefel manifold describes (not uniquely) a k𝑘kitalic_k-dimensional subspace of nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. To characterize the tangent space TΓ𝒱(n,k)subscript𝑇Γ𝒱𝑛𝑘T_{\Gamma}\mathcal{V}(n,k)italic_T start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) to a point Γ𝒱(n,k)Γ𝒱𝑛𝑘\Gamma\in\mathcal{V}(n,k)roman_Γ ∈ caligraphic_V ( italic_n , italic_k ), we need a matrix Γn×(nk)subscriptΓperpendicular-tosuperscript𝑛𝑛𝑘\Gamma_{\perp}\in\mathbb{R}^{n\times(n-k)}roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × ( italic_n - italic_k ) end_POSTSUPERSCRIPT such that the columns of ΓΓ\Gammaroman_Γ and ΓsubscriptΓperpendicular-to\Gamma_{\perp}roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT form an orthonormal basis of nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. Then we have

    TΓ𝒱(n,k):={ΓΠ+ΓΣn×kΠk×k skew symmetric,Σ(nk)×k}.assignsubscript𝑇Γ𝒱𝑛𝑘conditional-setΓΠsubscriptΓperpendicular-toΣsuperscript𝑛𝑘formulae-sequenceΠsuperscript𝑘𝑘 skew symmetricΣsuperscript𝑛𝑘𝑘T_{\Gamma}\mathcal{V}(n,k):=\{\Gamma\Pi+\Gamma_{\perp}\Sigma\in\mathbb{R}^{n% \times k}\mid\Pi\in\mathbb{R}^{k\times k}\text{ skew symmetric},\Sigma\in% \mathbb{R}^{(n-k)\times k}\}.italic_T start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) := { roman_Γ roman_Π + roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT ∣ roman_Π ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT skew symmetric , roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_n - italic_k ) × italic_k end_POSTSUPERSCRIPT } .

    If we introduce the Riemannian metric

    𝔤Γ(Δ1,Δ2)=Tr(Δ1(Idn12ΓΓ)Δ2)=12Tr(Π1Π2)+Tr(Σ1Σ2),subscript𝔤ΓsubscriptΔ1subscriptΔ2TrsuperscriptsubscriptΔ1topsubscriptId𝑛12ΓsuperscriptΓtopsubscriptΔ212TrsuperscriptsubscriptΠ1topsubscriptΠ2TrsuperscriptsubscriptΣ1topsubscriptΣ2\mathfrak{g}_{\Gamma}(\Delta_{1},\Delta_{2})=\textrm{Tr}\big{(}\Delta_{1}^{% \top}(\mathrm{Id}_{n}-\frac{1}{2}\Gamma\Gamma^{\top})\Delta_{2}\big{)}=\frac{1% }{2}\textrm{Tr}(\Pi_{1}^{\top}\Pi_{2})+\textrm{Tr}(\Sigma_{1}^{\top}\Sigma_{2}),fraktur_g start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ( roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = Tr ( roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( roman_Id start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_Γ roman_Γ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) roman_Δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG Tr ( roman_Π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + Tr ( roman_Σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ,

    where Δ1=ΓΠ1+ΓΣ1,Δ2=ΓΠ2+ΓΣ2TΓ𝒱(n,k)formulae-sequencesubscriptΔ1ΓsubscriptΠ1subscriptΓperpendicular-tosubscriptΣ1subscriptΔ2ΓsubscriptΠ2subscriptΓperpendicular-tosubscriptΣ2subscript𝑇Γ𝒱𝑛𝑘\Delta_{1}=\Gamma\Pi_{1}+\Gamma_{\perp}\Sigma_{1},\Delta_{2}=\Gamma\Pi_{2}+% \Gamma_{\perp}\Sigma_{2}\in T_{\Gamma}\mathcal{V}(n,k)roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = roman_Γ roman_Π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = roman_Γ roman_Π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ italic_T start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) and Tr denotes the trace of a matrix, Assumption A holds true for 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ). For more details see [edelman1998geometry].

The need to sample from measures π𝜋\piitalic_π defined on connected, complete Riemannian manifolds occurs in several applications in Bayesian statistics, see e.g. [holbrook2016bayesian, holbrook2020nonparametric, lieAccepteddimension, mantoux2021understanding].

We assume in this paper that the measure π𝜋\piitalic_π on 𝖬𝖬\mathsf{M}sansserif_M admits a density with respect to the Riemannian measure ν𝔤subscript𝜈𝔤\nu_{\mathfrak{g}}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT, i.e.,

π(dx):=p(x)𝖬p(y)ν𝔤(dy)ν𝔤(dx).assign𝜋d𝑥𝑝𝑥subscript𝖬𝑝𝑦subscript𝜈𝔤d𝑦subscript𝜈𝔤d𝑥\pi({\rm d}x):=\frac{p(x)}{\int_{\mathsf{M}}p(y)\ \nu_{\mathfrak{g}}({\rm d}y)% }\nu_{\mathfrak{g}}({\rm d}x).italic_π ( roman_d italic_x ) := divide start_ARG italic_p ( italic_x ) end_ARG start_ARG ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT italic_p ( italic_y ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_y ) end_ARG italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) . (3)

Very much in parallel to the (special) case dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we are now able to develop a slice sampling approach to target π𝜋\piitalic_π defined on 𝖬𝖬\mathsf{M}sansserif_M. We can easily extend the uniform simple slice sampler (also called idealized slice sampler) from dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT to 𝖬𝖬\mathsf{M}sansserif_M. (In fact this works for every measure space, see Appendix B.) The level sets, containing all points where the unnormalized density p𝑝pitalic_p is greater than a given value t(0,)𝑡0t\in(0,\infty)italic_t ∈ ( 0 , ∞ ), take the form

L(t):={x𝖬p(x)>t}.assign𝐿𝑡conditional-set𝑥𝖬𝑝𝑥𝑡L(t):=\{x\in\mathsf{M}\mid p(x)>t\}.italic_L ( italic_t ) := { italic_x ∈ sansserif_M ∣ italic_p ( italic_x ) > italic_t } .

The idealized slice sampler with initial point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M then proceeds, as in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, by first sampling a level t𝑡titalic_t uniformly from (0,p(x))0𝑝𝑥\big{(}0,p(x)\big{)}( 0 , italic_p ( italic_x ) ). Then the next state of the chain is sampled according to the uniform distribution on L(t)𝐿𝑡L(t)italic_L ( italic_t ), that is, according to ν𝔤(L(t))1ν𝔤|L(t)evaluated-atsubscript𝜈𝔤superscript𝐿𝑡1subscript𝜈𝔤𝐿𝑡\nu_{\mathfrak{g}}(L(t))^{-1}\nu_{\mathfrak{g}}|_{L(t)}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( italic_L ( italic_t ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT. More precisely, the idealized slice sampler has transition kernel

H:𝖬×(𝖬):𝐻𝖬𝖬\displaystyle H:\mathsf{M}\times\mathcal{B}(\mathsf{M})italic_H : sansserif_M × caligraphic_B ( sansserif_M ) [0,1]absent01\displaystyle\to[0,1]→ [ 0 , 1 ]
(x,𝖠)𝑥𝖠\displaystyle(x,\mathsf{A})( italic_x , sansserif_A ) 1p(x)(0,p(x))1ν𝔤(L(t))L(t)𝟙𝖠(y)ν𝔤(dy)Leb1(dt).maps-toabsent1𝑝𝑥subscript0𝑝𝑥1subscript𝜈𝔤𝐿𝑡subscript𝐿𝑡subscript1𝖠𝑦subscript𝜈𝔤d𝑦subscriptLeb1d𝑡\displaystyle\mapsto\frac{1}{p(x)}\int_{(0,p(x))}\frac{1}{\nu_{\mathfrak{g}}% \big{(}L(t)\big{)}}\int_{L(t)}\mathbbm{1}_{\mathsf{A}}(y)\ \nu_{\mathfrak{g}}(% {\rm d}y)\,\mathrm{Leb}_{1}({\rm d}t).↦ divide start_ARG 1 end_ARG start_ARG italic_p ( italic_x ) end_ARG ∫ start_POSTSUBSCRIPT ( 0 , italic_p ( italic_x ) ) end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( italic_L ( italic_t ) ) end_ARG ∫ start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_y ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_y ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) .

The implementation of the kernel requires sampling from the manifold-equivalent ν𝔤(L(t))1ν𝔤|L(t)evaluated-atsubscript𝜈𝔤superscript𝐿𝑡1subscript𝜈𝔤𝐿𝑡\nu_{\mathfrak{g}}(L(t))^{-1}\nu_{\mathfrak{g}}|_{L(t)}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( italic_L ( italic_t ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT of the uniform distribution on a level set L(t)𝐿𝑡L(t)italic_L ( italic_t ). Doing this efficiently poses a problem, as in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, because in general we have no knowledge about the shape of the level sets other than that they are d𝑑ditalic_d-dimensional, measurable sets. Therefore, we propose a hybrid slice sampler that lifts Hit-and-run slice sampling from dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT to the general Riemannian manifold 𝖬𝖬\mathsf{M}sansserif_M. To this end, we need a generalization of straight lines to 𝖬𝖬\mathsf{M}sansserif_M, the geodesics. A curve γ:𝖨𝖬:𝛾𝖨𝖬\gamma:\mathsf{I}\to\mathsf{M}italic_γ : sansserif_I → sansserif_M, where 𝖨𝖨\mathsf{I}\subseteq\mathbb{R}sansserif_I ⊆ blackboard_R is an interval, is called a geodesic if the covariant derivative of its velocity vector field is zero, i.e., if the equation (D/dt)((dγ(x,v))/(dt))=0𝐷𝑑𝑡dsubscript𝛾𝑥𝑣d𝑡0(D/dt)\big{(}({\rm d}\gamma_{(x,v)})/({\rm d}t)\big{)}=0( italic_D / italic_d italic_t ) ( ( roman_d italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ) / ( roman_d italic_t ) ) = 0 holds. Under Assumption A, we know that for all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M and all vTx𝖬𝑣subscript𝑇𝑥𝖬v\in T_{x}\mathsf{M}italic_v ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M there exists a unique geodesic

γ(x,v):𝖬:subscript𝛾𝑥𝑣𝖬\gamma_{(x,v)}:\mathbb{R}\to\mathsf{M}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT : blackboard_R → sansserif_M (4)

satisfying γ(x,v)(0)=xsubscript𝛾𝑥𝑣0𝑥\gamma_{(x,v)}(0)=xitalic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( 0 ) = italic_x and (dγ(x,v))/(dt)|0=vevaluated-atdsubscript𝛾𝑥𝑣d𝑡0𝑣({\rm d}\gamma_{(x,v)})/({\rm d}t)|_{0}=v( roman_d italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ) / ( roman_d italic_t ) | start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_v. We may interpret γ(x,v)subscript𝛾𝑥𝑣\gamma_{(x,v)}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT as the geodesic emanating from x𝑥xitalic_x in direction v𝑣vitalic_v.

As in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we want to index the geodesics emanating from a point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M by a \ldqunit sphere of directions\rdq, which naturally is given by the unit tangent sphere in Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M

𝕊xd1:={vTx𝖬𝔤x(v,v)=1}.assignsuperscriptsubscript𝕊𝑥𝑑1conditional-set𝑣subscript𝑇𝑥𝖬subscript𝔤𝑥𝑣𝑣1\mathbb{S}_{x}^{d-1}:=\{v\in T_{x}\mathsf{M}\mid\mathfrak{g}_{x}(v,v)=1\}.blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT := { italic_v ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M ∣ fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_v , italic_v ) = 1 } .

The natural immersion of 𝕊xd1subscriptsuperscript𝕊𝑑1𝑥\mathbb{S}^{d-1}_{x}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT into the inner product space (Tx𝖬,𝔤x)subscript𝑇𝑥𝖬subscript𝔤𝑥(T_{x}\mathsf{M},\mathfrak{g}_{x})( italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M , fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) via the identity induces a Riemannian metric 𝔤^xsubscript^𝔤𝑥\widehat{\mathfrak{g}}_{x}over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT on 𝕊xd1superscriptsubscript𝕊𝑥𝑑1\mathbb{S}_{x}^{d-1}blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT. We call the normalization

σd1(x):=1ν𝔤^x(𝕊xd1)ν𝔤^x,x𝖬,formulae-sequenceassignsuperscriptsubscript𝜎𝑑1𝑥1subscript𝜈subscript^𝔤𝑥subscriptsuperscript𝕊𝑑1𝑥subscript𝜈subscript^𝔤𝑥𝑥𝖬\sigma_{d-1}^{(x)}:=\frac{1}{\nu_{\widehat{\mathfrak{g}}_{x}}(\mathbb{S}^{d-1}% _{x})}\nu_{\widehat{\mathfrak{g}}_{x}},\qquad x\in\mathsf{M},italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT := divide start_ARG 1 end_ARG start_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) end_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_x ∈ sansserif_M ,

of the Riemannian measure ν𝔤^xsubscript𝜈subscript^𝔤𝑥\nu_{\widehat{\mathfrak{g}}_{x}}italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT, induced by 𝔤^xsubscript^𝔤𝑥\widehat{\mathfrak{g}}_{x}over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT, uniform distribution on 𝕊xd1superscriptsubscript𝕊𝑥𝑑1\mathbb{S}_{x}^{d-1}blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT . Then for x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT and t>0𝑡0t>0italic_t > 0 we immediately obtain

L(x,v,t):={αp(γ(x,v)(α))>t}={αγ(x,v)(α)L(t)}assign𝐿𝑥𝑣𝑡conditional-set𝛼𝑝subscript𝛾𝑥𝑣𝛼𝑡conditional-set𝛼subscript𝛾𝑥𝑣𝛼𝐿𝑡L(x,v,t):=\{\alpha\in\mathbb{R}\mid p\left(\gamma_{(x,v)}(\alpha)\right)>t\}=% \{\alpha\in\mathbb{R}\mid\gamma_{(x,v)}(\alpha)\in L(t)\}italic_L ( italic_x , italic_v , italic_t ) := { italic_α ∈ blackboard_R ∣ italic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) > italic_t } = { italic_α ∈ blackboard_R ∣ italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ∈ italic_L ( italic_t ) }

as the parameterized intersection of a geodesic with a level set. We now present the extension of the Hit-and-run slice sampler to manifolds replacing straight lines by geodesics. We call this sampler the geodesic slice sampler. Roughly speaking, we arrive at a transition mechanism which at each step randomly chooses a geodesic and then runs a stepping-out and shrinkage based 1-dimensional slice sampler on this geodesic: Given a point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, a new point y𝖬𝑦𝖬y\in\mathsf{M}italic_y ∈ sansserif_M is generated by first sampling a level t𝑡titalic_t uniformly from (0,p(x))0𝑝𝑥(0,p(x))( 0 , italic_p ( italic_x ) ), and a direction v𝑣vitalic_v uniformly from 𝕊xd1subscriptsuperscript𝕊𝑑1𝑥\mathbb{S}^{d-1}_{x}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT, i.e., from σd1(x)superscriptsubscript𝜎𝑑1𝑥\sigma_{d-1}^{(x)}italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT. The sampled level t𝑡titalic_t defines a level set L(t)𝐿𝑡L(t)italic_L ( italic_t ), and the sampled direction v𝑣vitalic_v specifies a geodesic γ(x,v)subscript𝛾𝑥𝑣\gamma_{(x,v)}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT emanating from x𝑥xitalic_x in direction v𝑣vitalic_v. Now we use Neal’s stepping-out and shrinkage techniques described in Section 2.1 to generate a point θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R from the intersection L(x,v,t)𝐿𝑥𝑣𝑡L(x,v,t)italic_L ( italic_x , italic_v , italic_t ) of the level set L(t)𝐿𝑡L(t)italic_L ( italic_t ) and the geodesic γ(x,v)subscript𝛾𝑥𝑣\gamma_{(x,v)}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT. The new point y𝖬𝑦𝖬y\in\mathsf{M}italic_y ∈ sansserif_M is then given by y=γ(x,v)(θ)𝑦subscript𝛾𝑥𝑣𝜃y=\gamma_{(x,v)}(\theta)italic_y = italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ).

Algorithm 1 Geodesic slice sampler.

Input: point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, hyperparameters w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N
Output: point y𝖬𝑦𝖬y\in\mathsf{M}italic_y ∈ sansserif_M

1:  Draw TUnif((0,p(x)))similar-to𝑇Unif0𝑝𝑥T\sim\mathrm{Unif}\big{(}(0,p(x))\big{)}italic_T ∼ roman_Unif ( ( 0 , italic_p ( italic_x ) ) ), call the result t𝑡titalic_t.
2:  Draw Vσd1(x)similar-to𝑉superscriptsubscript𝜎𝑑1𝑥V\sim\sigma_{d-1}^{(x)}italic_V ∼ italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT, call the result v𝑣vitalic_v.
3:  Generate a realization of (L,R)=Step-outw,m(x,v,t)𝐿𝑅subscriptStep-out𝑤𝑚𝑥𝑣𝑡(L,R)=\texttt{Step-out}_{w,m}(x,v,t)( italic_L , italic_R ) = Step-out start_POSTSUBSCRIPT italic_w , italic_m end_POSTSUBSCRIPT ( italic_x , italic_v , italic_t ), call the result (,r)𝑟(\ell,r)( roman_ℓ , italic_r ).
4:  Generate a realization of Θ=Shrink,r(x,v,t)ΘsubscriptShrink𝑟𝑥𝑣𝑡\Theta=\texttt{Shrink}_{\ell,r}(x,v,t)roman_Θ = Shrink start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_x , italic_v , italic_t ), call the result θ𝜃\thetaitalic_θ.
5:  return  y=γ(x,v)(θ)𝑦subscript𝛾𝑥𝑣𝜃y=\gamma_{(x,v)}(\theta)italic_y = italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ).
Algorithm 2 Stepping-out procedure. Call as Step-outw,m(x,v,t)subscriptStep-out𝑤𝑚𝑥𝑣𝑡\texttt{Step-out}_{w,m}(x,v,t)Step-out start_POSTSUBSCRIPT italic_w , italic_m end_POSTSUBSCRIPT ( italic_x , italic_v , italic_t ).

Input: point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, direction v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT, level t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ), hyperparameters w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N
Output: two points ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R such that <0<r0𝑟\ell<0<rroman_ℓ < 0 < italic_r

1:  Draw ΥUnif([0,w])similar-toΥUnif0𝑤\Upsilon\sim\mathrm{Unif}\big{(}[0,w]\big{)}roman_Υ ∼ roman_Unif ( [ 0 , italic_w ] ), call the result u𝑢uitalic_u.
2:  Set :=uassign𝑢\ell:=-uroman_ℓ := - italic_u and r:=+wassign𝑟𝑤r:=\ell+witalic_r := roman_ℓ + italic_w.
3:  Draw JUnif({1,,m})similar-to𝐽Unif1𝑚J\sim\mathrm{Unif}(\{1,\ldots,m\})italic_J ∼ roman_Unif ( { 1 , … , italic_m } ), call the result ιι\upiotaroman_ι.
4:  Set i=2𝑖2i=2italic_i = 2 and j=2𝑗2j=2italic_j = 2.
5:  while iι𝑖ιi\leqslant\upiotaitalic_i ⩽ roman_ι and p(γ(x,v)())>t𝑝subscript𝛾𝑥𝑣𝑡p\left(\gamma_{(x,v)}(\ell)\right)>titalic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( roman_ℓ ) ) > italic_t do
6:     Set =w𝑤\ell=\ell-wroman_ℓ = roman_ℓ - italic_w.
7:     Update i=i+1𝑖𝑖1i=i+1italic_i = italic_i + 1.
8:  end while
9:  while jm+1ι𝑗𝑚1ιj\leqslant m+1-\upiotaitalic_j ⩽ italic_m + 1 - roman_ι and p(γ(x,v)(r))>t𝑝subscript𝛾𝑥𝑣𝑟𝑡p\left(\gamma_{(x,v)}(r)\right)>titalic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_r ) ) > italic_t do
10:     Set r=r+w𝑟𝑟𝑤r=r+witalic_r = italic_r + italic_w.
11:     Update j=j+1𝑗𝑗1j=j+1italic_j = italic_j + 1.
12:  end while
13:  return  (,r)𝑟(\ell,r)( roman_ℓ , italic_r )
Algorithm 3 Shrinkage procedure. Call as Shrink,r(x,v,t)subscriptShrink𝑟𝑥𝑣𝑡\texttt{Shrink}_{\ell,r}(x,v,t)Shrink start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_x , italic_v , italic_t ).

Input: point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, direction v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT, level t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ) and parameters <0<r0𝑟\ell<0<rroman_ℓ < 0 < italic_r
Output: point θL(x,v,t)[,r)𝜃𝐿𝑥𝑣𝑡𝑟\theta\in L(x,v,t)\cap[\ell,r)italic_θ ∈ italic_L ( italic_x , italic_v , italic_t ) ∩ [ roman_ℓ , italic_r )

1:  Draw ΘUnif((0,rl))similar-toΘUnif0𝑟𝑙\Theta\sim\mathrm{Unif}\big{(}(0,r-l)\big{)}roman_Θ ∼ roman_Unif ( ( 0 , italic_r - italic_l ) ), call the result θhsubscript𝜃\theta_{h}italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
2:  Set θ:=θh𝟙{θh>r}(rl)assign𝜃subscript𝜃subscript1subscript𝜃𝑟𝑟𝑙\theta:=\theta_{h}-\mathbbm{1}_{\{\theta_{h}>r\}}(r-l)italic_θ := italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - blackboard_1 start_POSTSUBSCRIPT { italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT > italic_r } end_POSTSUBSCRIPT ( italic_r - italic_l ).
3:  Set θmin:=θhassignsubscript𝜃subscript𝜃\theta_{\min}:=\theta_{h}italic_θ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT := italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
4:  Set θmax:=θhassignsubscript𝜃subscript𝜃\theta_{\max}:=\theta_{h}italic_θ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT := italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
5:  while (γ(x,v)(θ))tsubscript𝛾𝑥𝑣𝜃𝑡\left(\gamma_{(x,v)}(\theta)\right)\leqslant t( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) ⩽ italic_t do
6:     if θh[θmin,rl)subscript𝜃subscript𝜃𝑟𝑙\theta_{h}\in[\theta_{\min},r-l)italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ∈ [ italic_θ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_r - italic_l ) then
7:        Set θmin=θhsubscript𝜃subscript𝜃\theta_{\min}=\theta_{h}italic_θ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
8:     else
9:        Set θmax=θhsubscript𝜃subscript𝜃\theta_{\max}=\theta_{h}italic_θ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
10:     end if
11:     Draw ΘUnif((0,θmax)[θmin,rl))similar-toΘUnif0subscript𝜃subscript𝜃𝑟𝑙\Theta\sim\mathrm{Unif}\big{(}(0,\theta_{\max})\cup[\theta_{\min},r-l)\big{)}roman_Θ ∼ roman_Unif ( ( 0 , italic_θ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) ∪ [ italic_θ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_r - italic_l ) ), call result θhsubscript𝜃\theta_{h}italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
12:     Set θ=θh𝟙{θh>r}(rl)𝜃subscript𝜃subscript1subscript𝜃𝑟𝑟𝑙\theta=\theta_{h}-\mathbbm{1}_{\{\theta_{h}>r\}}(r-l)italic_θ = italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - blackboard_1 start_POSTSUBSCRIPT { italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT > italic_r } end_POSTSUBSCRIPT ( italic_r - italic_l ).
13:  end while
14:  return  θ𝜃\thetaitalic_θ.

A complete algorithmic description of the geodesic slice sampler in pseudo code can be found in Algorithm 1. It calls Algorithm 2 and Algorithm 3 representing the stepping-out and shrinkage procedure on the geodesic respectively. We also provide a description in terms of random variables.

Remark 3.

Let (Yk)ksubscriptsubscript𝑌𝑘𝑘(Y_{k})_{k\in\mathbb{N}}( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ blackboard_N end_POSTSUBSCRIPT be the Markov chain corresponding to the geodesic slice sampler. For k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N let Tk+1subscript𝑇𝑘1T_{k+1}italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT and Vk+1subscript𝑉𝑘1V_{k+1}italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT be a random variables with conditional distributions

(Tk+1Y1,,Yk)=Unif((0,p(Yk)))and(Vk+1Y1,,Yk,Tk+1)=σd1(Yk).\mathbb{P}\left(T_{k+1}\in\cdot\mid Y_{1},\ldots,Y_{k}\right)=\mathrm{Unif}% \big{(}(0,p(Y_{k}))\big{)}\qquad\text{and}\qquad\mathbb{P}\left(V_{k+1}\in% \cdot\mid Y_{1},\ldots,Y_{k},T_{k+1}\right)=\sigma_{d-1}^{(Y_{k})}.blackboard_P ( italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = roman_Unif ( ( 0 , italic_p ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ) ) and blackboard_P ( italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) = italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT .

For fixed hyperparameters w>0𝑤0w>0italic_w > 0 and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N, let Jk+1Unif({1,,m})similar-tosubscript𝐽𝑘1Unif1𝑚J_{k+1}\sim\mathrm{Unif}(\{1,\ldots,m\})italic_J start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∼ roman_Unif ( { 1 , … , italic_m } ) and Υk+1Unif((0,w))similar-tosubscriptΥ𝑘1Unif0𝑤\Upsilon_{k+1}\sim\mathrm{Unif}\big{(}(0,w)\big{)}roman_Υ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ∼ roman_Unif ( ( 0 , italic_w ) ) be independent of all previous random variables. We set

Lk=Υk+1(inf{i0Υk+1iwL(Yk,Tk+1,Vk+1)}(J1))w,subscriptsuperscript𝐿𝑘subscriptΥ𝑘1infimumconditional-set𝑖subscript0subscriptΥ𝑘1𝑖𝑤𝐿subscript𝑌𝑘subscript𝑇𝑘1subscript𝑉𝑘1𝐽1𝑤\displaystyle L^{\ast}_{k}=-\Upsilon_{k+1}-\big{(}\inf\{i\in\mathbb{N}_{0}\mid% -\Upsilon_{k+1}-iw\notin L(Y_{k},T_{k+1},V_{k+1})\}\land(J-1)\big{)}w,italic_L start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = - roman_Υ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT - ( roman_inf { italic_i ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∣ - roman_Υ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT - italic_i italic_w ∉ italic_L ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) } ∧ ( italic_J - 1 ) ) italic_w ,
Rk=wΥk+1+(inf{i0wΥk+1+iwL(Yk,Tk+1,Vk+1)}(mJ))w,subscriptsuperscript𝑅𝑘𝑤subscriptΥ𝑘1infimumconditional-set𝑖subscript0𝑤subscriptΥ𝑘1𝑖𝑤𝐿subscript𝑌𝑘subscript𝑇𝑘1subscript𝑉𝑘1𝑚𝐽𝑤\displaystyle R^{\ast}_{k}=w-\Upsilon_{k+1}+\big{(}\inf\{i\in\mathbb{N}_{0}% \mid w-\Upsilon_{k+1}+iw\notin L(Y_{k},T_{k+1},V_{k+1})\}\land(m-J)\big{)}w,italic_R start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_w - roman_Υ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT + ( roman_inf { italic_i ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∣ italic_w - roman_Υ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT + italic_i italic_w ∉ italic_L ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) } ∧ ( italic_m - italic_J ) ) italic_w ,

and define 𝖨(k+1)=(Lk,Rk)superscript𝖨𝑘1subscriptsuperscript𝐿𝑘subscriptsuperscript𝑅𝑘\mathsf{I}^{(k+1)}=(L^{\ast}_{k},R^{\ast}_{k})sansserif_I start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = ( italic_L start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_R start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). Let the random variable Θ1(k+1)superscriptsubscriptΘ1𝑘1\Theta_{1}^{(k+1)}roman_Θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT have conditional distribution

(Θ1(k+1)Y1,,Yk,Tk+1,Vk+1,𝖨(k+1))=Unif(𝖨(k+1)),\displaystyle\mathbb{P}\left(\Theta_{1}^{(k+1)}\in\cdot\mid Y_{1},\ldots,Y_{k}% ,T_{k+1},V_{k+1},\mathsf{I}^{(k+1)}\right)=\mathrm{Unif}(\mathsf{I}^{(k+1)}),blackboard_P ( roman_Θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , sansserif_I start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) = roman_Unif ( sansserif_I start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) ,

and set 𝖨1(k+1)=𝖨(k+1)superscriptsubscript𝖨1𝑘1superscript𝖨𝑘1\mathsf{I}_{1}^{(k+1)}=\mathsf{I}^{(k+1)}sansserif_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = sansserif_I start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT. Using the segment-notation 𝕁𝕁\mathbb{J}blackboard_J from the description of the shrinkage procedure in Section 2.1, we define Θi+1(k+1)superscriptsubscriptΘ𝑖1𝑘1\Theta_{i+1}^{(k+1)}roman_Θ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT to have conditional distribution

(Θi+1(k+1)Y1,,Yk,Tk+1,Vk+1,𝖨1(k+1),,𝖨i(k+1),Θ1(k+1),,Θik+1)=Unif(𝖨i(k+1)),\mathbb{P}\left(\Theta_{i+1}^{(k+1)}\in\cdot\mid Y_{1},\ldots,Y_{k},T_{k+1},V_% {k+1},\mathsf{I}_{1}^{(k+1)},\ldots,\mathsf{I}_{i}^{(k+1)},\Theta_{1}^{(k+1)},% \ldots,\Theta_{i}^{k+1}\right)=\mathrm{Unif}(\mathsf{I}_{i}^{(k+1)}),blackboard_P ( roman_Θ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ∈ ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , sansserif_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , … , sansserif_I start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , roman_Θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , … , roman_Θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ) = roman_Unif ( sansserif_I start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) ,

and

𝖨i+1(k+1)=𝕁(Θi+1(k+1),Θi(k+1),𝖨i(k+1))superscriptsubscript𝖨𝑖1𝑘1𝕁superscriptsubscriptΘ𝑖1𝑘1superscriptsubscriptΘ𝑖𝑘1superscriptsubscript𝖨𝑖𝑘1\mathsf{I}_{i+1}^{(k+1)}=\mathbb{J}(\Theta_{i+1}^{(k+1)},\Theta_{i}^{(k+1)},% \mathsf{I}_{i}^{(k+1)})sansserif_I start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = blackboard_J ( roman_Θ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , roman_Θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , sansserif_I start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT )

for all i𝑖i\in\mathbb{N}italic_i ∈ blackboard_N. Then set Θk=Θτk+1(k+1)subscriptsuperscriptΘ𝑘superscriptsubscriptΘsubscriptτ𝑘1𝑘1\Theta^{\ast}_{k}=\Theta_{\uptau_{k+1}}^{(k+1)}roman_Θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_Θ start_POSTSUBSCRIPT roman_τ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT, where

τk+1:=inf{iΘi(k+1)L(Yk,Vk+1,Tk+1)}.assignsubscriptτ𝑘1infimumconditional-set𝑖superscriptsubscriptΘ𝑖𝑘1𝐿subscript𝑌𝑘subscript𝑉𝑘1subscript𝑇𝑘1\uptau_{k+1}:=\inf\{i\in\mathbb{N}\mid\Theta_{i}^{(k+1)}\in L(Y_{k},V_{k+1},T_% {k+1})\}.roman_τ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT := roman_inf { italic_i ∈ blackboard_N ∣ roman_Θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ∈ italic_L ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) } .

The next state of the geodesic slice sampler is then given by

Yk+1=γ(Yk,Vk+1)(Θk).subscript𝑌𝑘1subscript𝛾subscript𝑌𝑘subscript𝑉𝑘1subscriptsuperscriptΘ𝑘Y_{k+1}=\gamma_{(Y_{k},V_{k+1})}\left(\Theta^{\ast}_{k}\right).italic_Y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT = italic_γ start_POSTSUBSCRIPT ( italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( roman_Θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) .

We comment on the prerequisites of the geodesic slice sampler.

Remark 4.
  1. 1.

    In order to implement Algorithm 1 we need to be able to perform the following operations:

    • Evaluation of the unnormalized density p(x)𝑝𝑥p(x)italic_p ( italic_x ) at every x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M.

    • Sampling from σd1(x)superscriptsubscript𝜎𝑑1𝑥\sigma_{d-1}^{(x)}italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT for all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M. If we know an isometric isomorphism dTx𝖬superscript𝑑subscript𝑇𝑥𝖬\mathbb{R}^{d}\to T_{x}\mathsf{M}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT → italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M for all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M this is an easy task.

    • Evaluation of geodesics γ(x,v)(θ)subscript𝛾𝑥𝑣𝜃\gamma_{(x,v)}(\theta)italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) for all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, v𝕊xd1𝑣subscriptsuperscript𝕊𝑑1𝑥v\in\mathbb{S}^{d-1}_{x}italic_v ∈ blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT and θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R. Some cases where this is possible are provided in Example 5.

  2. 2.

    The geodesic slice sampler takes two hyperparameters, namely w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N. They arise from the usage of Algorithm 2 (the stepping-out procedure), and their influence on the geodesic slice sampler is derived from their influence on the stepping-out procedure, see Remark 1. Roughly speaking, mw𝑚𝑤mwitalic_m italic_w determines the maximal possible size of the neighborhood of the initial point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M that the geodesic slice sampler takes into account when performing its transition. This affects the reach of the algorithm as well as its ability to jump between modes of the target distribution π𝜋\piitalic_π. Choosing m𝑚mitalic_m larger and w𝑤witalic_w smaller can be seen as increasing the likelihood that a smaller neighborhood of x𝑥xitalic_x is considered for the transition, compared to the maximal possible reach of the algorithm. Depending on the shape of the unnormalized density p𝑝pitalic_p, this can hamper the ability of the geodesic slice sampler to jump between modes or allow the consideration of more \ldqrelevant\rdq neighborhoods. Observe that larger m𝑚mitalic_m leads to higher computational cost by increasing the cost of Algorithm 2.

We provide some illustrative scenarios.

Example 5.
  1. 1.

    The Hit-and-run slice sampler on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT described in Section 2.1 fits into the framework of the geodesic slice sampler.

  2. 2.

    We consider 𝕊dd+1superscript𝕊𝑑superscript𝑑1\mathbb{S}^{d}\subseteq\mathbb{R}^{d+1}blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT italic_d + 1 end_POSTSUPERSCRIPT. For x𝕊d𝑥superscript𝕊𝑑x\in\mathbb{S}^{d}italic_x ∈ blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT we have 𝕊xd1={v𝕊dxv=0}subscriptsuperscript𝕊𝑑1𝑥conditional-set𝑣superscript𝕊𝑑superscript𝑥top𝑣0\mathbb{S}^{d-1}_{x}=\{v\in\mathbb{S}^{d}\mid x^{\top}v=0\}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT = { italic_v ∈ blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ∣ italic_x start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_v = 0 }. The projection onto the subspace orthogonal to x𝕊d𝑥superscript𝕊𝑑x\in\mathbb{S}^{d}italic_x ∈ blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT

    ψx:𝕊d𝕊xd1,v(Idxx)v,:subscript𝜓𝑥formulae-sequencesuperscript𝕊𝑑subscriptsuperscript𝕊𝑑1𝑥maps-to𝑣Id𝑥superscript𝑥top𝑣\displaystyle\psi_{x}:\mathbb{S}^{d}\to\mathbb{S}^{d-1}_{x},\qquad v\mapsto% \left(\mathrm{Id}-xx^{\top}\right)v,italic_ψ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT : blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT → blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_v ↦ ( roman_Id - italic_x italic_x start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) italic_v ,

    where IdId\mathrm{Id}roman_Id denotes the identity on 𝕊dsuperscript𝕊𝑑\mathbb{S}^{d}blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, yields a simple formula for

    σd1(x)=(ψx)(1ν𝔤^(𝕊d)ν𝔤^).superscriptsubscript𝜎𝑑1𝑥subscriptsubscript𝜓𝑥1subscript𝜈^𝔤superscript𝕊𝑑subscript𝜈^𝔤\sigma_{d-1}^{(x)}=\left(\psi_{x}\right)_{\sharp}\left(\frac{1}{\nu_{\widehat{% \mathfrak{g}}}(\mathbb{S}^{d})}\nu_{\widehat{\mathfrak{g}}}\right).italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT = ( italic_ψ start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT ♯ end_POSTSUBSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG end_POSTSUBSCRIPT ( blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ) end_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG end_POSTSUBSCRIPT ) .

    The geodesics of 𝕊dsuperscript𝕊𝑑\mathbb{S}^{d}blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT are the great circles given by the explicit formula

    γ(x,v)(θ)=cos(θ)x+sin(θ)v,θ,formulae-sequencesubscript𝛾𝑥𝑣𝜃𝜃𝑥𝜃𝑣𝜃\gamma_{(x,v)}(\theta)=\cos(\theta)x+\sin(\theta)v,\qquad\theta\in\mathbb{R},italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) = roman_cos ( italic_θ ) italic_x + roman_sin ( italic_θ ) italic_v , italic_θ ∈ blackboard_R ,

    for x𝕊d𝑥superscript𝕊𝑑x\in\mathbb{S}^{d}italic_x ∈ blackboard_S start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and v𝕊xd1𝑣subscriptsuperscript𝕊𝑑1𝑥v\in\mathbb{S}^{d-1}_{x}italic_v ∈ blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT. Of course we may apply the geodesic slice sampler for arbitrary hyperparameters as described in Algorithm 1. However, since all geodesics are periodic of a known period length (namely 2π2𝜋2\pi2 italic_π), this renders Algorithm 2 (the stepping-out procedure) somehow superfluous. For wisely chosen hyperparameters (w=2π,m=1formulae-sequence𝑤2𝜋𝑚1w=2\pi,m=1italic_w = 2 italic_π , italic_m = 1), the geodesic segment sampled by Algorithm 2 always equals exactly one winding of the great circle, and we can simply replace line 3 in Algorithm 1 by deterministically setting (,r)=(π,π)𝑟𝜋𝜋(\ell,r)=(-\pi,\pi)( roman_ℓ , italic_r ) = ( - italic_π , italic_π ). The resulting algorithm is the geodesic shrinkage slice sampler on the sphere from [habeck2023geodesic].

  3. 3.

    For the Stiefel manifold defined in Example 2.3, an isometric isomorphism between k(k1)/2+k(nk)superscript𝑘𝑘12𝑘𝑛𝑘\mathbb{R}^{k(k-1)/2+k(n-k)}blackboard_R start_POSTSUPERSCRIPT italic_k ( italic_k - 1 ) / 2 + italic_k ( italic_n - italic_k ) end_POSTSUPERSCRIPT and the tangent space TΓ𝒱(n,k)subscript𝑇Γ𝒱𝑛𝑘T_{\Gamma}\mathcal{V}(n,k)italic_T start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) at a point Γ𝒱(n,k)Γ𝒱𝑛𝑘\Gamma\in\mathcal{V}(n,k)roman_Γ ∈ caligraphic_V ( italic_n , italic_k ) is given by using the first k(k1)/2𝑘𝑘12k(k-1)/2italic_k ( italic_k - 1 ) / 2 components of vk(k1)/2+k(nk)𝑣superscript𝑘𝑘12𝑘𝑛𝑘v\in\mathbb{R}^{k(k-1)/2+k(n-k)}italic_v ∈ blackboard_R start_POSTSUPERSCRIPT italic_k ( italic_k - 1 ) / 2 + italic_k ( italic_n - italic_k ) end_POSTSUPERSCRIPT to determine a skew symmetric matrix ΠΠ\Piroman_Π and the remaining ones to form a matrix Σ(nk)×kΣsuperscript𝑛𝑘𝑘\Sigma\in\mathbb{R}^{(n-k)\times k}roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_n - italic_k ) × italic_k end_POSTSUPERSCRIPT. These two matrices determine an element of TΓ𝒱(n,k)subscript𝑇Γ𝒱𝑛𝑘T_{\Gamma}\mathcal{V}(n,k)italic_T start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) (after fixing ΓsubscriptΓperpendicular-to\Gamma_{\perp}roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT) by Δ=ΓΠ+ΓΣΔΓΠsubscriptΓperpendicular-toΣ\Delta=\Gamma\Pi+\Gamma_{\perp}\Sigmaroman_Δ = roman_Γ roman_Π + roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ. We provide an explicit formula for the geodesic γ(Γ,Δ)subscript𝛾ΓΔ\gamma_{(\Gamma,\Delta)}italic_γ start_POSTSUBSCRIPT ( roman_Γ , roman_Δ ) end_POSTSUBSCRIPT. To this end let QR=ΓΣ𝑄𝑅subscriptΓperpendicular-toΣQR=\Gamma_{\perp}\Sigmaitalic_Q italic_R = roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ be the compact QR-decomposition of ΓΣ=(IdnΓΓ)ΔsubscriptΓperpendicular-toΣsubscriptId𝑛ΓsuperscriptΓtopΔ\Gamma_{\perp}\Sigma=(\mathrm{Id}_{n}-\Gamma\Gamma^{\top})\Deltaroman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ = ( roman_Id start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - roman_Γ roman_Γ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) roman_Δ. For θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R set N1(θ)k×ksubscript𝑁1𝜃superscript𝑘𝑘N_{1}(\theta)\in\mathbb{R}^{k\times k}italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_θ ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT and N2(θ)n×ksubscript𝑁2𝜃superscript𝑛𝑘N_{2}(\theta)\in\mathbb{R}^{n\times k}italic_N start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_θ ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT to be

    (N1(θ)N2(θ))=exp(θ(ΠRR𝟎))(Idk𝟎),matrixsubscript𝑁1𝜃subscript𝑁2𝜃𝜃matrixΠsuperscript𝑅top𝑅0matrixsubscriptId𝑘0\begin{pmatrix}N_{1}(\theta)\\ N_{2}(\theta)\end{pmatrix}=\exp\left(\theta\begin{pmatrix}\Pi&-R^{\top}\\ R&\boldsymbol{0}\end{pmatrix}\right)\begin{pmatrix}\mathrm{Id}_{k}\\ \boldsymbol{0}\end{pmatrix},( start_ARG start_ROW start_CELL italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_θ ) end_CELL end_ROW start_ROW start_CELL italic_N start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_θ ) end_CELL end_ROW end_ARG ) = roman_exp ( italic_θ ( start_ARG start_ROW start_CELL roman_Π end_CELL start_CELL - italic_R start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_R end_CELL start_CELL bold_0 end_CELL end_ROW end_ARG ) ) ( start_ARG start_ROW start_CELL roman_Id start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL end_ROW end_ARG ) ,

    where exp\exproman_exp denotes here the matrix exponential and 𝟎k×k0superscript𝑘𝑘\boldsymbol{0}\in\mathbb{R}^{k\times k}bold_0 ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT is the matrix with all entries zero. Then

    γ(Γ,Δ)(θ)=ΓN1(θ)+QN2(θ),θ.formulae-sequencesubscript𝛾ΓΔ𝜃Γsubscript𝑁1𝜃𝑄subscript𝑁2𝜃𝜃\gamma_{(\Gamma,\Delta)}(\theta)=\Gamma N_{1}(\theta)+QN_{2}(\theta),\qquad% \theta\in\mathbb{R}.italic_γ start_POSTSUBSCRIPT ( roman_Γ , roman_Δ ) end_POSTSUBSCRIPT ( italic_θ ) = roman_Γ italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_θ ) + italic_Q italic_N start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_θ ) , italic_θ ∈ blackboard_R .

    The derivation of these results can be found in [edelman1998geometry].

Finally, we present the Markov transition kernel corresponding to the geodesic slice sampler. To this end, we first give a rigorous specification of the unnormalized density p𝑝pitalic_p:

Assumption B.

The unnormalized density p:𝖬(0,):𝑝𝖬0p:\mathsf{M}\to(0,\infty)italic_p : sansserif_M → ( 0 , ∞ ) is a lower semi-continuous333All level sets L(t):={x𝖬p(x)>t}assign𝐿𝑡conditional-set𝑥𝖬𝑝𝑥𝑡L(t):=\{x\in\mathsf{M}\mid p(x)>t\}italic_L ( italic_t ) := { italic_x ∈ sansserif_M ∣ italic_p ( italic_x ) > italic_t }, t𝑡t\in\mathbb{R}italic_t ∈ blackboard_R, are open. function such that 𝖬p(x)ν𝔤(dx)(0,)subscript𝖬𝑝𝑥subscript𝜈𝔤d𝑥0\int_{\mathsf{M}}p(x)\ \nu_{\mathfrak{g}}({\rm d}x)\in(0,\infty)∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT italic_p ( italic_x ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) ∈ ( 0 , ∞ ).

We denote by

p:=supx𝖬|p(x)|assignsubscriptnorm𝑝subscriptsupremum𝑥𝖬𝑝𝑥\|p\|_{\infty}:=\sup_{x\in\mathsf{M}}|p(x)|∥ italic_p ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT := roman_sup start_POSTSUBSCRIPT italic_x ∈ sansserif_M end_POSTSUBSCRIPT | italic_p ( italic_x ) |

the supremum norm of p𝑝pitalic_p. Observe that Assumption B gives p(0,]subscriptnorm𝑝0\|p\|_{\infty}\in(0,\infty]∥ italic_p ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ∈ ( 0 , ∞ ].

Remark 6.

We impose lower semicontinuity of the unormalized density p𝑝pitalic_p in Assumption B to ensure that Algorithm 3 (the shrinkage procedure) terminates almost surely. This guarantees that its output has indeed a distribution on \mathbb{R}blackboard_R. For more details see [ReversibilityEllipticalSliceSampler].

Next we fix w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N. For simplicity we drop these two hyperparameters of the geodesic slice sampler in our subsequent notation. Let x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT and t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ). We denote by

ξL(x,v,t)(0)(𝖠):=(Step-outw,m(x,v,t)𝖠),𝖠(2),formulae-sequenceassignsuperscriptsubscript𝜉𝐿𝑥𝑣𝑡0𝖠subscriptStep-out𝑤𝑚𝑥𝑣𝑡𝖠𝖠superscript2\xi_{L(x,v,t)}^{(0)}(\mathsf{A}):=\mathbb{P}(\texttt{Step-out}_{w,m}(x,v,t)\in% \mathsf{A}),\qquad\mathsf{A}\in\mathcal{B}(\mathbb{R}^{2}),italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( sansserif_A ) := blackboard_P ( Step-out start_POSTSUBSCRIPT italic_w , italic_m end_POSTSUBSCRIPT ( italic_x , italic_v , italic_t ) ∈ sansserif_A ) , sansserif_A ∈ caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) , (5)

the distribution of the output of Algorithm 2 and by

QL(x,v,t),r(0,𝖠):=(Shrink,r(x,v,t)𝖠),𝖠(),formulae-sequenceassignsuperscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟0𝖠subscriptShrink𝑟𝑥𝑣𝑡𝖠𝖠Q_{L(x,v,t)}^{\ell,r}(0,\mathsf{A}):=\mathbb{P}(\texttt{Shrink}_{\ell,r}(x,v,t% )\in\mathsf{A}),\qquad\mathsf{A}\in\mathcal{B}(\mathbb{R}),italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( 0 , sansserif_A ) := blackboard_P ( Shrink start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_x , italic_v , italic_t ) ∈ sansserif_A ) , sansserif_A ∈ caligraphic_B ( blackboard_R ) , (6)

where <0<r0𝑟\ell<0<rroman_ℓ < 0 < italic_r, the distribution of the output of Algorithm 3. Note that in (5) and (6) the right hand side only depends on x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT and t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ) through the set L(x,v,t)𝐿𝑥𝑣𝑡L(x,v,t)italic_L ( italic_x , italic_v , italic_t ). A formal definition of these distributions and some of their properties can be found in Section 4.1 and Section 4.2. For x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ) and 𝖠(𝖬)𝖠𝖬\mathsf{A}\in\mathcal{B}(\mathsf{M})sansserif_A ∈ caligraphic_B ( sansserif_M ) we define the auxiliary Markov kernels

Kt(x,𝖠):=𝕊xd12[,r)𝟙𝖠(γ(x,v)(θ))QL(x,v,t),r(0,dθ)ξL(x,v,t)(0)(d(,r))σd1(x)(dv).assignsubscript𝐾𝑡𝑥𝖠subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝖠subscript𝛾𝑥𝑣𝜃superscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟0d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣K_{t}(x,\mathsf{A}):=\int_{\mathbb{S}_{x}^{d-1}}\int_{\mathbb{R}^{2}}\int_{[% \ell,r)}\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\theta)\right)\ Q_{L(x,v,% t)}^{\ell,r}(0,{\rm d}\theta)\ \xi_{L(x,v,t)}^{(0)}\big{(}{\rm d}(\ell,r)\big{% )}\ \sigma_{d-1}^{(x)}({\rm d}v).italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_A ) := ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT [ roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) .

Then the Markov kernel

K:𝖬×(𝖬)[0,1](x,𝖠)1p(x)(0,p(x))Kt(x,𝖠)Leb1(dt):𝐾𝖬𝖬01𝑥𝖠maps-to1𝑝𝑥subscript0𝑝𝑥subscript𝐾𝑡𝑥𝖠subscriptLeb1d𝑡\begin{split}K:\mathsf{M}\times\mathcal{B}(\mathsf{M})&\to[0,1]\\ (x,\mathsf{A})&\mapsto\frac{1}{p(x)}\int_{(0,p(x))}K_{t}(x,\mathsf{A})\ % \mathrm{Leb}_{1}({\rm d}t)\end{split}start_ROW start_CELL italic_K : sansserif_M × caligraphic_B ( sansserif_M ) end_CELL start_CELL → [ 0 , 1 ] end_CELL end_ROW start_ROW start_CELL ( italic_x , sansserif_A ) end_CELL start_CELL ↦ divide start_ARG 1 end_ARG start_ARG italic_p ( italic_x ) end_ARG ∫ start_POSTSUBSCRIPT ( 0 , italic_p ( italic_x ) ) end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_A ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) end_CELL end_ROW (7)

corresponds to Algorithm 1. Observe that K𝐾Kitalic_K has the correct invariant distribution, as implied by the following theorem.

Theorem 7.

Suppose Assumption A and B are satisfied, and let π𝜋\piitalic_π be defined as in (3). Fix w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N. Then the Markov kernel K𝐾Kitalic_K given in (7) is reversible with respect to π𝜋\piitalic_π.

The proof of this statement can be found in Section 4.3.

2.3 Literature review of MCMC-methods on Riemannian manifolds

In this section, we aim to provide an overview of existing MCMC-methods on Riemannian manifolds in the literature. Roughly speaking they can be assigned to three different categories.

The first class of MCMC algorithms are defined on open sets of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT but consider non canonical Riemannian metrics. In [girolami2011riemann], Girolami and Calderhead generalize Hamiltonian Monte Carlo (HMC) to dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT equipped with an arbitrary metric tensor obtaining Riemannian manifold HMC (RMHMC), which assumes knowledge about the Riemannian metric of the underlying manifold. They also introduce a MALA-type algorithm in this setting.

A second class of MCMC algorithms consists of methods defined on submanifolds of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT associated with the canonical metric introduced by the embedding in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT that are not necessary open sets. This includes Hamiltonian based MCMC-methods tailor-made for specific classes of manifolds that have been further developed upon RMHMC such as for implicitly defined manifolds [brubaker2012family], manifolds embedded in Euclidean space [byrne2013geodesic] and the sphere [lan2014spherical].

Distributions on submanifolds of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT can also be approximated by proposing a sample from the ambient dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT projected to the manifold and then running a Metropolis-Hastings acceptance rejection step, see e.g. [mantoux2021understanding, zappa2018monte]. However, as usually the conditional distribution of the proposal is intractable and is therefore not taken into account in the acceptance rejection step, the resulting Metropolis-Hastings algorithm is biased. In [zappa2018monte], Zappa et. al propose a bias-free modification of this method for submanifolds of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT defined by inequalities and equality constraints. In the special case when the underlying manifold is a hypersphere equipped with the angular Gaussian distribution as reference measure, other specialized bias-free reprojected MCMC algorithms have been proposed such as reprojected preconditioned Crank–Nicolson algorithm or reprojected Elliptical Slice sampling, see [lieAccepteddimension].

The third class of MCMC-methods on Riemannian manifolds employs geodesic flows. In [mangoubi2018rapid], the authors analyze a geodesic random walk on Riemannian manifolds with positive bounded curvature, which is invariant with respect to the Riemannian measure. This analysis has been extended in [goyal2019sampling] to arbitrary target probability measures on manifolds with bounded non-negative curvature by adding a Metropolis-Hastings-like acceptance step resulting in a geodesic Metropolis-Hastings algorithm. Note that a similar approach has been taken before in [lee2017geodesic] to target the uniform distribution on a polytope by equipping it with a Hessian structure. In addition to this class of Metropolis-Hastings algorithms, there already exist MCMC-methods for specific manifolds that combine slice sampling with geodesic, that is, for distributions on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT there is Hit-and-Run slice sampling [latuszyinski2014convergence, Mackay], and for distributions on hyperspheres there is geodesic slice sampling on the sphere which uses great circles in stead of straight lines [habeck2023geodesic]. They can both be viewed as special cases of GSS.

3 Application

We numerically asses the performance of GSS (Algorithm 1) in comparison with other Riemannian MCMC algorithms. In our experiments, we consider the compact Stiefel manifold 𝒱(n,k)n×k𝒱𝑛𝑘superscript𝑛𝑘\mathcal{V}(n,k)\subseteq\mathbb{R}^{n\times k}caligraphic_V ( italic_n , italic_k ) ⊆ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT and the Grassmann manifold 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) with, k,n𝑘𝑛k,n\in\mathbb{N}italic_k , italic_n ∈ blackboard_N, k<n𝑘𝑛k<nitalic_k < italic_n. They find applications in shape analysis [hong2017regression], dimensionality reduction [holbrook2016bayesian] and computer vision [lui2012advances, nguyen2019neural]. We give a brief overview of the conducted experiments. All the code is available on GitHub444https://github.com/samuelgruffaz/Geodesic_Slice_Sampling_on_Riemannian_Manifold.git.

  • We first consider the case where the target distribution belongs to the class of von Mises-Fisher distributions, defining families of distributions on 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ) and 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ). When examining the Stiefel manifold 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ), we compare GSS with an adaptive random walk Metropolis Hastings (RMH) sampler. This comparison highlights the impact of the GSS approach compared to a well-chosen random walk. On the Grassmann manifold, we contrast the GSS with a gradient-informed sampler referred to as the geodesics Metropolis-adjusted Langevin algorithm (GeoMALA), inspired by the framework proposed by [byrne2013geodesic].

  • We compare GSS and RMH in inferring a latent variable model developed in [mantoux2021understanding]. This model is used for the analysis of brain network structures. Specifically, it encodes principal directions of adjacency matrices obtained from MRI as points on a Stiefel manifold.

  • Finally, we introduce a Bayesian von Mises-Fisher clustering model for action recognition in videos. We approximate the posterior distribution using GSS and compare our resulting model with other existing approaches [lin2017bayesian, sengupta2017bayesian].

We do not conduct comparisons with the Riemannian Hamiltonian Monte Carlo (RHMC) algorithm [girolami2011riemann], since it uses the expression of the metric of the manifold rather than its geodesics as GSS. Throughout this whole section we denote the Gaussian distribution on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT with mean xd𝑥superscript𝑑x\in\mathbb{R}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and covariance matrix Σd×dΣsuperscript𝑑𝑑\Sigma\in\mathbb{R}^{d\times d}roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d end_POSTSUPERSCRIPT by 𝒩(x,Σ)𝒩𝑥Σ\mathcal{N}(x,\Sigma)caligraphic_N ( italic_x , roman_Σ ). For κd𝜅superscript𝑑\kappa\in\mathbb{R}^{d}italic_κ ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, let diag(κ)d×ddiag𝜅superscript𝑑𝑑\mathrm{diag}(\kappa)\in\mathbb{R}^{d\times d}roman_diag ( italic_κ ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d end_POSTSUPERSCRIPT be the diagonal matrix with diagonal entries given by the components of κ𝜅\kappaitalic_κ. Moreover, we define 𝟎ddsubscript0𝑑superscript𝑑\boldsymbol{0}_{d}\in\mathbb{R}^{d}bold_0 start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT to be the vector and 𝟎d,kd×ksubscript0𝑑𝑘superscript𝑑𝑘\boldsymbol{0}_{d,k}\in\mathbb{R}^{d\times k}bold_0 start_POSTSUBSCRIPT italic_d , italic_k end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_k end_POSTSUPERSCRIPT to be the matrix with all entries zero, and write Tr(Σ)TrΣ\textrm{Tr}(\Sigma)Tr ( roman_Σ ) for the trace of a square matrix ΣΣ\Sigmaroman_Σ.

We comment on the structures of the Stiefel and the Grassmann manifold needed to implement GSS.

Remark 8.
  1. 1.

    (Stiefel manifold.) The Stiefel manifold 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ), including an explicit formula for its geodesic and the uniform distribution on its tangent spheres, is introduced in Example 2-2 and 5-3. It is worth noting that the computation of a geodesic requires evaluating the matrix exponential map on skew symmetric matrices of size 2k×2k2𝑘2𝑘2k\times 2k2 italic_k × 2 italic_k , which can be efficiently done using the eigenvalue decomposition of skew symmetric matrices.

  2. 2.

    (Grassmann manifold.) Let n,k𝑛𝑘n,k\in\mathbb{N}italic_n , italic_k ∈ blackboard_N with kn𝑘𝑛k\leqslant nitalic_k ⩽ italic_n. The k(nk)𝑘𝑛𝑘k(n-k)italic_k ( italic_n - italic_k )-dimensional Grassmann manifold 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) can be defined as the set of all k𝑘kitalic_k-dimensional subspaces of the Euclidean space nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT (see [bendokat2020grassmann] for a complete overview), i.e.,

    𝒢(n,k)={WnW is a k-dimensional subspace}.𝒢𝑛𝑘conditional-set𝑊superscript𝑛𝑊 is a 𝑘-dimensional subspace\mathcal{G}(n,k)=\{W\subseteq\mathbb{R}^{n}\mid W\text{ is a }k\text{-% dimensional subspace}\}.caligraphic_G ( italic_n , italic_k ) = { italic_W ⊆ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∣ italic_W is a italic_k -dimensional subspace } . (8)

    The following representation allows for an efficient implementation of points on the Grassmann manifold. First, observe that for any subspace W𝒢(n,k)𝑊𝒢𝑛𝑘W\in\mathcal{G}(n,k)italic_W ∈ caligraphic_G ( italic_n , italic_k ), there exists a (non-unique) orthonormal basis formed by the column of Γ𝒱(n,k)Γ𝒱𝑛𝑘\Gamma\in\mathcal{V}(n,k)roman_Γ ∈ caligraphic_V ( italic_n , italic_k ) such that W=spanΓ𝑊spanΓW=\operatorname{span}\Gammaitalic_W = roman_span roman_Γ. Thus, by defining the equivalence relation similar-to\sim on 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ) as Γ1Γ2similar-tosubscriptΓ1subscriptΓ2\Gamma_{1}\sim\Gamma_{2}roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT if and only if spanΓ1=spanΓ2spansubscriptΓ1spansubscriptΓ2\operatorname{span}\Gamma_{1}=\operatorname{span}\Gamma_{2}roman_span roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = roman_span roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, the Grassmann manifold 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) can be identified with the quotient manifold 𝒱(n,k)/\mathcal{V}(n,k)/\simcaligraphic_V ( italic_n , italic_k ) / ∼. As a result, an element W𝑊Witalic_W of 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) can be represented by an element ΓΓ\Gammaroman_Γ of 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ).

    Fixing Γn×(nk)subscriptΓperpendicular-tosuperscript𝑛𝑛𝑘\Gamma_{\perp}\in\mathbb{R}^{n\times(n-k)}roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × ( italic_n - italic_k ) end_POSTSUPERSCRIPT such that the columns of ΓΓ\Gammaroman_Γ and ΓsubscriptΓperpendicular-to\Gamma_{\perp}roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT form an orthonormal basis of nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, the tangent space to 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) at W𝑊Witalic_W is given by

    TW𝒢(n,k):={ΓΣn×kΣ(nk)×k}.assignsubscript𝑇𝑊𝒢𝑛𝑘conditional-setsubscriptΓperpendicular-toΣsuperscript𝑛𝑘Σsuperscript𝑛𝑘𝑘T_{W}\mathcal{G}(n,k):=\{\Gamma_{\perp}\Sigma\in\mathbb{R}^{n\times k}\mid% \Sigma\in\mathbb{R}^{(n-k)\times k}\}.italic_T start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) := { roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT ∣ roman_Σ ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_n - italic_k ) × italic_k end_POSTSUPERSCRIPT } .

    We equip 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) with the Riemannian metric

    𝔤W(Δ1,Δ2)=Tr(Δ1Δ2),Δ1,Δ2TW𝒢(n,k),formulae-sequencesubscript𝔤𝑊subscriptΔ1subscriptΔ2TrsuperscriptsubscriptΔ1topsubscriptΔ2subscriptΔ1subscriptΔ2subscript𝑇𝑊𝒢𝑛𝑘\mathfrak{g}_{W}(\Delta_{1},\Delta_{2})=\textrm{Tr}(\Delta_{1}^{\top}\Delta_{2% }),\qquad\Delta_{1},\Delta_{2}\in T_{W}\mathcal{G}(n,k),fraktur_g start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT ( roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = Tr ( roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , roman_Δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ italic_T start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) ,

    such that ΣΓΣmaps-toΣsubscriptΓperpendicular-toΣ\Sigma\mapsto\Gamma_{\perp}\Sigmaroman_Σ ↦ roman_Γ start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT roman_Σ is an isometric isomorphism between n×(nk)superscript𝑛𝑛𝑘\mathbb{R}^{n\times(n-k)}blackboard_R start_POSTSUPERSCRIPT italic_n × ( italic_n - italic_k ) end_POSTSUPERSCRIPT and TW𝒢(n,k)subscript𝑇𝑊𝒢𝑛𝑘T_{W}\mathcal{G}(n,k)italic_T start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ). Let Δ=U^DV^Δ^𝑈𝐷superscript^𝑉top\Delta=\widehat{U}D\widehat{V}^{\top}roman_Δ = over^ start_ARG italic_U end_ARG italic_D over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT be the compact singular value decomposition (SVD) of an element ΔTW𝒢(n,k)Δsubscript𝑇𝑊𝒢𝑛𝑘\Delta\in T_{W}\mathcal{G}(n,k)roman_Δ ∈ italic_T start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ). Then, the geodesic at W𝑊Witalic_W with direction ΔTW𝒢(n,k)Δsubscript𝑇𝑊𝒢𝑛𝑘\Delta\in T_{W}\mathcal{G}(n,k)roman_Δ ∈ italic_T start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) admits the following explicit expression

    γ(W,Δ)(θ)=(ΓV^cos(Dθ)+U^sin(Dθ))V^,θ.formulae-sequencesubscript𝛾𝑊Δ𝜃Γ^𝑉𝐷𝜃^𝑈𝐷𝜃superscript^𝑉top𝜃\gamma_{(W,\Delta)}(\theta)=\big{(}\Gamma\widehat{V}\cos(D\theta)+\widehat{U}% \sin(D\theta)\big{)}\widehat{V}^{\top},\qquad\theta\in\mathbb{R}.italic_γ start_POSTSUBSCRIPT ( italic_W , roman_Δ ) end_POSTSUBSCRIPT ( italic_θ ) = ( roman_Γ over^ start_ARG italic_V end_ARG roman_cos ( italic_D italic_θ ) + over^ start_ARG italic_U end_ARG roman_sin ( italic_D italic_θ ) ) over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_θ ∈ blackboard_R .

    Further details and derivations can be found in [edelman1998geometry].

Refer to caption
Figure 5: Examples of geodesics on the Stiefel and the Grassmann manifold. Points on the Stiefel manifold 𝒱(3,2)𝒱32\mathcal{V}(3,2)caligraphic_V ( 3 , 2 ) and Grassmann manifold 𝒢(3,2)𝒢32\mathcal{G}(3,2)caligraphic_G ( 3 , 2 ) are represented by two orthogonal vectors (v1,v2)3×2subscript𝑣1subscript𝑣2superscript32(v_{1},v_{2})\in\mathbb{R}^{3\times 2}( italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT 3 × 2 end_POSTSUPERSCRIPT. The circles beginning from the red point (0,0,1)001(0,0,1)( 0 , 0 , 1 ) and the green point (1,0,0)100(1,0,0)( 1 , 0 , 0 ) are related to the trajectory of v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT respectively. The velocity V𝑉Vitalic_V is sampled randomly in the tangent space of the starting point X0subscript𝑋0X_{0}italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT such that Tr(VV)=1Trsuperscript𝑉top𝑉1\mathrm{Tr}(V^{\top}V)=1roman_Tr ( italic_V start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_V ) = 1. The points are colored according to the time, beginning with dark blue and finishing with light yellow. The periodicity of the geodesics visible in the figure is a special characteristic of a few choices of dimension parameters, amongst them (n,k)=(3,2)𝑛𝑘32(n,k)=(3,2)( italic_n , italic_k ) = ( 3 , 2 ), and does not generalize.

For the convenience of the reader we provide more details on the three samplers appearing in our numerical experiments. Each sampler defines a Markov chain (Xi)isubscriptsubscript𝑋𝑖𝑖(X_{i})_{i\in\mathbb{N}}( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT, and we describe the respective transition mechanisms from Xisubscript𝑋𝑖X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to Xi+1subscript𝑋𝑖1X_{i+1}italic_X start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT for i𝑖i\in\mathbb{N}italic_i ∈ blackboard_N.

(a) adaptive random walk Metropolis Hastings (RMH).

Given Xi𝒱(n,k)subscript𝑋𝑖𝒱𝑛𝑘X_{i}\in\mathcal{V}(n,k)italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_V ( italic_n , italic_k ) and a step size ha(0,)subscript𝑎0h_{a}\in(0,\infty)italic_h start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ∈ ( 0 , ∞ ) this sampler uses the following Metropolis-Hastings (MH) like transition mechanism :

  1. 1.

    Sample Vi+1𝒩(𝟎nk,Idnk)similar-tosubscript𝑉𝑖1𝒩subscript0𝑛𝑘subscriptId𝑛𝑘V_{i+1}\sim\mathcal{N}(\boldsymbol{0}_{nk},\mathrm{Id}_{nk})italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ∼ caligraphic_N ( bold_0 start_POSTSUBSCRIPT italic_n italic_k end_POSTSUBSCRIPT , roman_Id start_POSTSUBSCRIPT italic_n italic_k end_POSTSUBSCRIPT ) interpreted as a matrix in n×ksuperscript𝑛𝑘\mathbb{R}^{n\times k}blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT. Let proj𝒱(n,k):n×k𝒱(n,k),V=U^DV^U^V^:subscriptproj𝒱𝑛𝑘formulae-sequencesuperscript𝑛𝑘𝒱𝑛𝑘𝑉^𝑈𝐷superscript^𝑉topmaps-to^𝑈superscript^𝑉top\operatorname{proj}_{\mathcal{V}(n,k)}:\mathbb{R}^{n\times k}\to\mathcal{V}(n,% k),\ V=\widehat{U}D\widehat{V}^{\top}\mapsto\widehat{U}\widehat{V}^{\top}roman_proj start_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT → caligraphic_V ( italic_n , italic_k ) , italic_V = over^ start_ARG italic_U end_ARG italic_D over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ↦ over^ start_ARG italic_U end_ARG over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT be the projection on the Stiefel manifold, where U^DV^^𝑈𝐷superscript^𝑉top\widehat{U}D\widehat{V}^{\top}over^ start_ARG italic_U end_ARG italic_D over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is the SVD of V𝑉Vitalic_V. Then define X~i+1=proj𝒱(n,k)(Xi+haVi+1)subscript~𝑋𝑖1subscriptproj𝒱𝑛𝑘subscript𝑋𝑖subscript𝑎subscript𝑉𝑖1\widetilde{X}_{i+1}=\operatorname{proj}_{\mathcal{V}(n,k)}(X_{i}+h_{a}V_{i+1})over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = roman_proj start_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_h start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ).

  2. 2.

    Sample Υi+1Unif([0,1])similar-tosubscriptΥ𝑖1Unif01\Upsilon_{i+1}\sim\mathrm{Unif}\big{(}[0,1]\big{)}roman_Υ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ∼ roman_Unif ( [ 0 , 1 ] ) independent of all previously appearing random variables. If Υi+1<α(Xi,X~i+1)subscriptΥ𝑖1αsubscript𝑋𝑖subscript~𝑋𝑖1\Upsilon_{i+1}<\upalpha(X_{i},\widetilde{X}_{i+1})roman_Υ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT < roman_α ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ), where

    α(Xi,X~i+1)=p(X~i+1)p(Xi),αsubscript𝑋𝑖subscript~𝑋𝑖1𝑝subscript~𝑋𝑖1𝑝subscript𝑋𝑖\upalpha(X_{i},\widetilde{X}_{i+1})=\frac{p(\widetilde{X}_{i+1})}{p(X_{i})},roman_α ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) = divide start_ARG italic_p ( over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_p ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG , (9)

    then set Xi+1=X~i+1subscript𝑋𝑖1subscript~𝑋𝑖1X_{i+1}=\widetilde{X}_{i+1}italic_X start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT, otherwise Xi+1=Xisubscript𝑋𝑖1subscript𝑋𝑖X_{i+1}=X_{i}italic_X start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

The hyperparameter hasubscript𝑎h_{a}italic_h start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is adaptively tuned to target an acceptance probability of 0.2340.2340.2340.234, a constant proposed by optimal design analysis when using the same sampler in an Euclidean space [roberts2001optimal]. As remarked in [zappa2018monte], it is worth noting the proposal mechanism is not symmetric here. More precisely, if the conditional distribution of X~i+1subscript~𝑋𝑖1\widetilde{X}_{i+1}over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT given Xi=Γ𝒱(n,k)subscript𝑋𝑖Γ𝒱𝑛𝑘X_{i}=\Gamma\in\mathcal{V}(n,k)italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = roman_Γ ∈ caligraphic_V ( italic_n , italic_k ) admits a transition density denoted by q𝑞qitalic_q, this function is not symmetric, i.e., q(Γ1|Γ2)q(Γ2|Γ1)𝑞conditionalsubscriptΓ1subscriptΓ2𝑞conditionalsubscriptΓ2subscriptΓ1q(\Gamma_{1}|\Gamma_{2})\neq q(\Gamma_{2}|\Gamma_{1})italic_q ( roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≠ italic_q ( roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) for Γ1,Γ2𝒱(n,k)subscriptΓ1subscriptΓ2𝒱𝑛𝑘\Gamma_{1},\Gamma_{2}\in\mathcal{V}(n,k)roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_V ( italic_n , italic_k ). Therefore, using (9) as the acceptance ratio leads to a biased MCMC algorithm, which is justified in [mantoux2021understanding] by the observation that, for sufficiently small ha>0subscript𝑎0h_{a}>0italic_h start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT > 0, q(Γ1|Γ2)q(Γ2|Γ1)𝑞conditionalsubscriptΓ1subscriptΓ2𝑞conditionalsubscriptΓ2subscriptΓ1q(\Gamma_{1}|\Gamma_{2})\approx q(\Gamma_{2}|\Gamma_{1})italic_q ( roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≈ italic_q ( roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) holds for all Γ1,Γ2𝒱(n,k)subscriptΓ1subscriptΓ2𝒱𝑛𝑘\Gamma_{1},\Gamma_{2}\in\mathcal{V}(n,k)roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_V ( italic_n , italic_k ). However, this introduces a bias that grows with increasing hasubscript𝑎h_{a}italic_h start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT. In Section 3.2, we compare adaptive RMH with GSS using the same application as discussed in [mantoux2021understanding].
It is worth noting that [zappa2018monte] proposes an MCMC algorithm that, by employing another projection and an additional accept-reject step, results in an unbiased MCMC algorithm. However, their approach does not exploit the specificity of a \ldqtractable geodesics\rdq-framework.

(b) geodesic adaptive Metropolis Hastings (GeoRMH).

This methods is a bias free modification of RMH. It essentially uses the same transition mechanism, but replaces the proposal at step i𝑖i\in\mathbb{N}italic_i ∈ blackboard_N with γ(Xi,Vi+1)(ha)subscript𝛾subscript𝑋𝑖subscript𝑉𝑖1subscript𝑎\gamma_{(X_{i},V_{i+1})}(h_{a})italic_γ start_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) where Vi+1subscript𝑉𝑖1V_{i+1}italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT is distributed according to a standard Gaussian on TXi𝒱(n,k)subscript𝑇subscript𝑋𝑖𝒱𝑛𝑘T_{X_{i}}\mathcal{V}(n,k)italic_T start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ). In [goyal2019sampling], a similar algorithm is proposed to sample uniformly from a convex subset of a manifold with non-negative curvature.

(c) geodesic Metropolis adjusted Langevin algorithm (GeoMALA).

On the Grassmann manifold, GeoMALA is used and presented in [holbrook2016bayesian, Algorithm 1] to estimate parameters in Bayesian inference models involving dimensionality reduction. Fix a step h(0,)0h\in(0,\infty)italic_h ∈ ( 0 , ∞ ) and denote by projTW𝒢(n,k):n×kTW𝒢(n,k),V(IdnΓΓ)V:subscriptprojsubscript𝑇𝑊𝒢𝑛𝑘formulae-sequencesuperscript𝑛𝑘subscript𝑇𝑊𝒢𝑛𝑘maps-to𝑉subscriptId𝑛ΓsuperscriptΓtop𝑉\operatorname{proj}_{T_{W}\mathcal{G}(n,k)}:\mathbb{R}^{n\times k}\to T_{W}% \mathcal{G}(n,k),V\mapsto(\mathrm{Id}_{n}-\Gamma\Gamma^{\top})Vroman_proj start_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT → italic_T start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) , italic_V ↦ ( roman_Id start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - roman_Γ roman_Γ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) italic_V for W=spanΓ𝒢(n,k)𝑊spanΓ𝒢𝑛𝑘W=\mathrm{span}\ \Gamma\in\mathcal{G}(n,k)italic_W = roman_span roman_Γ ∈ caligraphic_G ( italic_n , italic_k ) the projection onto the tangent space. The transition mechanism of GeoMALA given Xi𝒢(n,k)subscript𝑋𝑖𝒢𝑛𝑘X_{i}\in\mathcal{G}(n,k)italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_G ( italic_n , italic_k ) works as follows:

  1. 1.

    Sample Vi+1𝒩(𝟎nk,Idnk)similar-tosubscript𝑉𝑖1𝒩subscript0𝑛𝑘subscriptId𝑛𝑘V_{i+1}\sim\mathcal{N}(\boldsymbol{0}_{nk},\mathrm{Id}_{nk})italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ∼ caligraphic_N ( bold_0 start_POSTSUBSCRIPT italic_n italic_k end_POSTSUBSCRIPT , roman_Id start_POSTSUBSCRIPT italic_n italic_k end_POSTSUBSCRIPT ) interpreted as a matrix in n×ksuperscript𝑛𝑘\mathbb{R}^{n\times k}blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT. Then define V¯i+1=projTXi𝒢(n,k)(Vi+1)subscript¯𝑉𝑖1subscriptprojsubscript𝑇subscript𝑋𝑖𝒢𝑛𝑘subscript𝑉𝑖1\bar{V}_{i+1}=\operatorname{proj}_{T_{X_{i}}\mathcal{G}(n,k)}(V_{i+1})over¯ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = roman_proj start_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) end_POSTSUBSCRIPT ( italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) and E0=logp(Xi)Tr(V¯i+1V¯i+1)/2subscript𝐸0𝑝subscript𝑋𝑖Trsuperscriptsubscript¯𝑉𝑖1topsubscript¯𝑉𝑖12E_{0}=\log p(X_{i})-\mathrm{Tr}(\bar{V}_{i+1}^{\top}\bar{V}_{i+1})/2italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_log italic_p ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - roman_Tr ( over¯ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) / 2.

  2. 2.

    Define V~i+1=projTXi𝒢(n,k)(V¯i+1+hlogp(Xi)/2)subscript~𝑉𝑖1subscriptprojsubscript𝑇subscript𝑋𝑖𝒢𝑛𝑘subscript¯𝑉𝑖1𝑝subscript𝑋𝑖2\widetilde{V}_{i+1}=\operatorname{proj}_{T_{X_{i}}\mathcal{G}(n,k)}(\bar{V}_{i% +1}+h\nabla\log p(X_{i})/2)over~ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = roman_proj start_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) end_POSTSUBSCRIPT ( over¯ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT + italic_h ∇ roman_log italic_p ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / 2 ) and X¯i+1=γ(Xi,V~i+1)(h)subscript¯𝑋𝑖1subscript𝛾subscript𝑋𝑖subscript~𝑉𝑖1\bar{X}_{i+1}=\gamma_{(X_{i},\widetilde{V}_{i+1})}(h)over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_γ start_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over~ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_h ), V¯i+1=γ˙(Xi,V~i+1)(h)subscript¯𝑉𝑖1subscript˙𝛾subscript𝑋𝑖subscript~𝑉𝑖1\bar{V}_{i+1}=\dot{\gamma}_{(X_{i},\widetilde{V}_{i+1})}(h)over¯ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = over˙ start_ARG italic_γ end_ARG start_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over~ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_h ), as well as Vi+1=projTXi𝒢(n,k)(V¯i+1+hlogp(X¯i)/2)superscriptsubscript𝑉𝑖1subscriptprojsubscript𝑇subscript𝑋𝑖𝒢𝑛𝑘subscript¯𝑉𝑖1𝑝subscript¯𝑋𝑖2V_{i+1}^{*}=\operatorname{proj}_{T_{X_{i}}\mathcal{G}(n,k)}(\bar{V}_{i+1}+h% \nabla\log p(\bar{X}_{i})/2)italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_proj start_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_G ( italic_n , italic_k ) end_POSTSUBSCRIPT ( over¯ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT + italic_h ∇ roman_log italic_p ( over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) / 2 ) and E1=logp(X¯i+1)Tr(Vi+1Vi+1)/2subscript𝐸1𝑝subscript¯𝑋𝑖1Trsuperscriptsubscript𝑉𝑖1absenttopsuperscriptsubscript𝑉𝑖12E_{1}=\log p(\bar{X}_{i+1})-\mathrm{Tr}(V_{i+1}^{*\top}V_{i+1}^{*})/2italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = roman_log italic_p ( over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) - roman_Tr ( italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ ⊤ end_POSTSUPERSCRIPT italic_V start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) / 2.

  3. 3.

    Sample Υi+1Unif([0,1])similar-tosubscriptΥ𝑖1Unif01\Upsilon_{i+1}\sim\mathrm{Unif}\big{(}[0,1]\big{)}roman_Υ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ∼ roman_Unif ( [ 0 , 1 ] ) independently. If log(Υi+1)<E1E0subscriptΥ𝑖1subscript𝐸1subscript𝐸0\log(\Upsilon_{i+1})<E_{1}-E_{0}roman_log ( roman_Υ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) < italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT then set Xi+1=X¯i+1subscript𝑋𝑖1subscript¯𝑋𝑖1X_{i+1}=\bar{X}_{i+1}italic_X start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = over¯ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT, otherwise Xi+1=Xisubscript𝑋𝑖1subscript𝑋𝑖X_{i+1}=X_{i}italic_X start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

Note that for all the algorithms that we implement, GSS, GeoRMH, and GeoMALA, we always reproject the (final) state of the Markov chain at each step onto the Stiefel manifold to mitigate numerical errors arising from the geodesic computations. This reprojection step ensures that the resulting samples lie on the manifold, preserving the desired manifold structure and improving the accuracy of the sampling algorithms.

3.1 Sampling the von Mises–Fisher distribution

In this section we present numerical experiments targeting the von Mises-Fisher distribution. Given a matrix-valued parameter Fn×k𝐹superscript𝑛𝑘F\in\mathbb{R}^{n\times k}italic_F ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT, the von Mises–Fisher distribution vMF(F)vMF𝐹\operatorname{vMF}(F)roman_vMF ( italic_F ) on the Stiefel manifold 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ) has unnormalized density with respect to its Riemannian measure

pvMF(F)(Γ)=exp(Tr(FΓ)),Γ𝒱(n,k).formulae-sequencesubscript𝑝vMF𝐹ΓTrsuperscript𝐹topΓΓ𝒱𝑛𝑘p_{\operatorname{vMF}(F)}(\Gamma)=\exp\left(\textrm{Tr}(F^{\top}\Gamma)\right)% ,\qquad\Gamma\in\mathcal{V}(n,k).italic_p start_POSTSUBSCRIPT roman_vMF ( italic_F ) end_POSTSUBSCRIPT ( roman_Γ ) = roman_exp ( Tr ( italic_F start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Γ ) ) , roman_Γ ∈ caligraphic_V ( italic_n , italic_k ) . (10)

However, this expression can not be used for the Grassmann manifold since spanΓ1=spanΓ2spansubscriptΓ1spansubscriptΓ2\operatorname{span}\Gamma_{1}=\operatorname{span}\Gamma_{2}roman_span roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = roman_span roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT with (Γ1,Γ2)𝒱(n,k)2subscriptΓ1subscriptΓ2𝒱superscript𝑛𝑘2(\Gamma_{1},\Gamma_{2})\in\mathcal{V}(n,k)^{2}( roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∈ caligraphic_V ( italic_n , italic_k ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT does not imply pvMF(F)(Γ1)=pvMF(F)(Γ2)subscript𝑝vMF𝐹subscriptΓ1subscript𝑝vMF𝐹subscriptΓ2p_{\operatorname{vMF}(F)}(\Gamma_{1})=p_{\operatorname{vMF}(F)}(\Gamma_{2})italic_p start_POSTSUBSCRIPT roman_vMF ( italic_F ) end_POSTSUBSCRIPT ( roman_Γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = italic_p start_POSTSUBSCRIPT roman_vMF ( italic_F ) end_POSTSUBSCRIPT ( roman_Γ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). Note that an element W𝑊Witalic_W of the Grassmann manifold can be identified with its orthogonal projector ΓΓΓsuperscriptΓtop\Gamma\Gamma^{\top}roman_Γ roman_Γ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, which does not depend on the choice of the representative Γ𝒱(n,k)Γ𝒱𝑛𝑘\Gamma\in\mathcal{V}(n,k)roman_Γ ∈ caligraphic_V ( italic_n , italic_k ) for W𝑊Witalic_W. Therefore, given a positive semi-definite matrix Pn×n𝑃superscript𝑛𝑛P\in\mathbb{R}^{n\times n}italic_P ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT, the von Mises–Fisher distribution vMF(P)vMF𝑃\operatorname{vMF}(P)roman_vMF ( italic_P ) (also called matrix Langevin distribution [chikuse2003concentrated]) on the Grassmann manifold 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) has unnormalized density

pvMF(P)(W)=exp(Tr(PΓΓ)),spanΓ=W(Γ,W)𝒱(n,k)×𝒢(n,k).formulae-sequencesubscript𝑝vMF𝑃𝑊Trsuperscript𝑃topΓsuperscriptΓtopformulae-sequencespanΓ𝑊Γ𝑊𝒱𝑛𝑘𝒢𝑛𝑘p_{\operatorname{vMF}(P)}(W)=\exp\left(\textrm{Tr}(P^{\top}\Gamma\Gamma^{\top}% )\right),\quad\operatorname{span}\Gamma=W\qquad(\Gamma,W)\in\mathcal{V}(n,k)% \times\mathcal{G}(n,k)\;.italic_p start_POSTSUBSCRIPT roman_vMF ( italic_P ) end_POSTSUBSCRIPT ( italic_W ) = roman_exp ( Tr ( italic_P start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Γ roman_Γ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ) , roman_span roman_Γ = italic_W ( roman_Γ , italic_W ) ∈ caligraphic_V ( italic_n , italic_k ) × caligraphic_G ( italic_n , italic_k ) . (11)

In the following, given n𝑛nitalic_n and k𝑘kitalic_k, we always choose the parameters F𝐹Fitalic_F and P𝑃Pitalic_P to be of the form

F=(D𝟎nk,k)n×k,P=FF.formulae-sequence𝐹matrix𝐷subscript0𝑛𝑘𝑘superscript𝑛𝑘𝑃𝐹superscript𝐹topF=\begin{pmatrix}D\\ \boldsymbol{0}_{n-k,k}\end{pmatrix}\in\mathbb{R}^{n\times k},\qquad P=FF^{\top}.italic_F = ( start_ARG start_ROW start_CELL italic_D end_CELL end_ROW start_ROW start_CELL bold_0 start_POSTSUBSCRIPT italic_n - italic_k , italic_k end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT , italic_P = italic_F italic_F start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT . (12)

where Dk×k𝐷superscript𝑘𝑘D\in\mathbb{R}^{k\times k}italic_D ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT.

We consider three different experimental setups on 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ). Firstly, k𝑘kitalic_k is fixed at 2222, we vary n𝑛nitalic_n in the set {3,30,100}330100\{3,30,100\}{ 3 , 30 , 100 } and set D=diag((1,,k))d×d𝐷diagsuperscript1𝑘topsuperscript𝑑𝑑D=\mathrm{diag}((1,\ldots,k)^{\top})\in\mathbb{R}^{d\times d}italic_D = roman_diag ( ( 1 , … , italic_k ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d end_POSTSUPERSCRIPT in (12). Secondly, for the same choice of D𝐷Ditalic_D as in the first experiment, we fix n=30𝑛30n=30italic_n = 30 and vary k𝑘kitalic_k in the set {3,30,100}330100\{3,30,100\}{ 3 , 30 , 100 }. Thirdly, the pair (n,k)𝑛𝑘(n,k)( italic_n , italic_k ) is fixed at (30,2)302(30,2)( 30 , 2 ), but we choose D=diag((1,λ))2×2𝐷diagsuperscript1𝜆topsuperscript22D=\mathrm{diag}((1,\lambda)^{\top})\in\mathbb{R}^{2\times 2}italic_D = roman_diag ( ( 1 , italic_λ ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT 2 × 2 end_POSTSUPERSCRIPT, where λ𝜆\lambdaitalic_λ varies in the set {1,10,100}110100\{1,10,100\}{ 1 , 10 , 100 }. This allows us to study the impact of the target distribution’s anisotropy on the samplers performance.

On the Grassmann manifold 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ) we repeat similar experiments. However in the first two setups, we use D=Idk𝐷subscriptId𝑘D=\mathrm{Id}_{k}italic_D = roman_Id start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in (12). For the third experiment, we set (n,k)=(3,2)𝑛𝑘32(n,k)=(3,2)( italic_n , italic_k ) = ( 3 , 2 ) and D=λIdk𝐷𝜆subscriptId𝑘D=\sqrt{\lambda}\mathrm{Id}_{k}italic_D = square-root start_ARG italic_λ end_ARG roman_Id start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for λ{1,10,100}𝜆110100\lambda\in\{1,10,100\}italic_λ ∈ { 1 , 10 , 100 }. There, we also vary the stepping-out parameter m𝑚mitalic_m of GSS in the set {1,3,10}1310\{1,3,10\}{ 1 , 3 , 10 }. In all the other experiments, we fix the hyperparameter m𝑚mitalic_m of GSS to 1 for the sake of simplicity. The samplers run for N=100,000𝑁100000N=100,000italic_N = 100 , 000 iterations, and the stepsize hasubscript𝑎h_{a}italic_h start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is initialized to 0.01 and updated every 20 steps for RMH and GeoRMH.

We define the initialization X1subscript𝑋1X_{1}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT by first sampling X~1Unif([0,1]n×k)similar-tosubscript~𝑋1Unifsuperscript01𝑛𝑘\widetilde{X}_{1}\sim\mathrm{Unif}\left([0,1]^{n\times k}\right)over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ roman_Unif ( [ 0 , 1 ] start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT ). Then, X~1subscript~𝑋1\widetilde{X}_{1}over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is projected onto 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ) using [mantoux2021understanding, Lemma 2] to obtain X1subscript𝑋1X_{1}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, thereby identifying a point on 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ) with its equivalence class when we work on 𝒢(n,k)𝒢𝑛𝑘\mathcal{G}(n,k)caligraphic_G ( italic_n , italic_k ). To evaluate the performance of the samplers, we compute the effective sample size (ESS) of (log(pvMF(F)(xi)))i{1,,N}Nsubscriptsubscript𝑝vMF𝐹subscript𝑥𝑖𝑖1𝑁superscript𝑁(\log(p_{\operatorname{vMF}(F)}(x_{i})))_{i\in\{1,\ldots,N\}}\in\mathbb{R}^{N}( roman_log ( italic_p start_POSTSUBSCRIPT roman_vMF ( italic_F ) end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ) start_POSTSUBSCRIPT italic_i ∈ { 1 , … , italic_N } end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT, where (xi)i{1,,N}subscriptsubscript𝑥𝑖𝑖1𝑁(x_{i})_{i\in\{1,\ldots,N\}}( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ { 1 , … , italic_N } end_POSTSUBSCRIPT denotes the samples generated by a single sampler. The results are averaged over 10 different resamplings, but the initialization is kept fixed since the use of a burn-in period has no significant impact.

Table 1: Experiment performances recorded with the effective sample size [min, median, max] over 10 repetitions for varying dimension n𝑛nitalic_n, on the Stiefel manifold 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ).
(n,k) (3,2) (30,2) (100,2)
GSS w=1𝑤1w=1italic_w = 1 [1842,2156,2549]184221562549[1842,2156,2549][ 1842 , 2156 , 2549 ] [1668,2000,2254]166820002254[1668,2000,2254][ 1668 , 2000 , 2254 ] [1693,2033,2181]169320332181[1693,2033,2181][ 1693 , 2033 , 2181 ]
GSS w=3𝑤3w=3italic_w = 3 [10232,9315,11460]10232931511460[10232,9315,11460][ 10232 , 9315 , 11460 ] [12290,13445,15014]122901344515014[12290,13445,15014][ 12290 , 13445 , 15014 ] [14355,15786,18229]143551578618229[14355,15786,18229][ 14355 , 15786 , 18229 ]
GSS w=5𝑤5w=5italic_w = 5 [11899,13986,18806]118991398618806[11899,13986,18806][ 11899 , 13986 , 18806 ] [22756,26360,32073]227562636032073[22756,26360,32073][ 22756 , 26360 , 32073 ] [31946,35769,42168]319463576942168[31946,35769,42168][ 31946 , 35769 , 42168 ]
GSS w=7𝑤7w=7italic_w = 7 [13160,14734,18276]131601473418276[13160,14734,\textbf{18276}][ 13160 , 14734 , 18276 ] [30215,33844,35793]302153384435793[30215,33844,35793][ 30215 , 33844 , 35793 ] [43254,52119,57652]432545211957652[43254,52119,57652][ 43254 , 52119 , 57652 ]
GSS w=9𝑤9w=9italic_w = 9 [12810,14159,16628]128101415916628[12810,14159,16628][ 12810 , 14159 , 16628 ] [34512,38960,48829]345123896048829[34512,38960,\textbf{48829}][ 34512 , 38960 , 48829 ] [47771,57845,68873]477715784568873[47771,57845,68873][ 47771 , 57845 , 68873 ]
GSS w=11𝑤11w=11italic_w = 11 [14477, 15615,16733]14477, 1561516733[\textbf{14477, 15615},16733][ 14477, 15615 , 16733 ] [35254, 39762,46525]35254, 3976246525[\textbf{35254, 39762},46525][ 35254, 39762 , 46525 ] [51742,57700,62830]517425770062830[51742,57700,62830][ 51742 , 57700 , 62830 ]
RMH: [10173,11980,13985]101731198013985[10173,11980,13985][ 10173 , 11980 , 13985 ] [30329,39098,44068]303293909844068[30329,39098,44068][ 30329 , 39098 , 44068 ] [53187,60005, 72752]delimited-[]53187,60005, 72752[\textbf{53187,60005, 72752}][ 53187,60005, 72752 ]
GeoRMH: [7047,8399,10792]7047839910792[7047,8399,10792][ 7047 , 8399 , 10792 ] [28481,34113,37743]284813411337743[28481,34113,37743][ 28481 , 34113 , 37743 ] [45427,51125,56370]454275112556370[45427,51125,56370][ 45427 , 51125 , 56370 ]
Table 2: Experiment performances recorded with the effective sample size [min, median, max] on 10 repetitions for varying dimension k𝑘kitalic_k, on the Stiefel manifold 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ).
(n,k) (30,5) (30,10) (30,20)
GSS w=1𝑤1w=1italic_w = 1 [728,794,859]728794859[728,794,859][ 728 , 794 , 859 ] [428,448,479]428448479[428,448,479][ 428 , 448 , 479 ] [316,331,344]316331344[316,331,344][ 316 , 331 , 344 ]
GSS w=3𝑤3w=3italic_w = 3 [3141,3628,4002]314136284002[3141,3628,4002][ 3141 , 3628 , 4002 ] [975,1111,1264]97511111264[975,1111,1264][ 975 , 1111 , 1264 ] [365,390,427]365390427[365,390,427][ 365 , 390 , 427 ]
GSS w=5𝑤5w=5italic_w = 5 [5327, 5843, 6581]delimited-[]5327, 5843, 6581[\textbf{5327, 5843, 6581}][ 5327, 5843, 6581 ] [1172, 1242, 1364]delimited-[]1172, 1242, 1364[\textbf{1172, 1242, 1364}][ 1172, 1242, 1364 ] [381, 397, 426]delimited-[]381, 397, 426[\textbf{381, 397, 426}][ 381, 397, 426 ]
RMH: [2803,3664,4609]280336644609[2803,3664,4609][ 2803 , 3664 , 4609 ] [1028,1140,1286]102811401286[1028,1140,1286][ 1028 , 1140 , 1286 ] [354,370,387]354370387[354,370,387][ 354 , 370 , 387 ]
GeoRMH: [1006,4249,4704]100642494704[1006,4249,4704][ 1006 , 4249 , 4704 ] [1069,1148,1260]106911481260[1069,1148,1260][ 1069 , 1148 , 1260 ] [363,376,392]363376392[363,376,392][ 363 , 376 , 392 ]

Varying (n,k)𝑛𝑘(n,k)( italic_n , italic_k ) on the Stiefel manifold.

First, we observe in Table 1 and 2 that the larger the value of w𝑤witalic_w, the higher the effective sample size (ESS). This is coherent with the fact that the average distance between the proposal and the current state increases with w𝑤witalic_w, leading to a higher ESS as the space is better explored. However, note that on a compact manifold like 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ), following a geodesic for a long time may cause the proposal to return close to the starting point, similar to following the great circle of a sphere as shown in Figure 5. As a result, the gain in ESS reaches a plateau when w72π𝑤72𝜋w\geqslant 7\geqslant 2\piitalic_w ⩾ 7 ⩾ 2 italic_π.

The second observation is that for all samplers, as n𝑛nitalic_n increases for fixed k𝑘kitalic_k, the ESS increases, and inversely, as k𝑘kitalic_k increases with n𝑛nitalic_n fixed, the ESS decreases. Considering (12), the number of directions that impact the density is equal to k𝑘kitalic_k since Tr(FΓ)=i=1kfiΓiTrsuperscript𝐹topΓsuperscriptsubscript𝑖1𝑘superscriptsubscript𝑓𝑖topsubscriptΓ𝑖\textrm{Tr}(F^{\top}\Gamma)=\sum_{i=1}^{k}f_{i}^{\top}\Gamma_{i}Tr ( italic_F start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Γ ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for any F=(fi)i{1,,k},Γ=(Γi)i{1,,k}n×kformulae-sequence𝐹subscriptsubscript𝑓𝑖𝑖1𝑘ΓsubscriptsubscriptΓ𝑖𝑖1𝑘superscript𝑛𝑘F=(f_{i})_{i\in\{1,\ldots,k\}},\Gamma=(\Gamma_{i})_{i\in\{1,\ldots,k\}}\in% \mathbb{R}^{n\times k}italic_F = ( italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ { 1 , … , italic_k } end_POSTSUBSCRIPT , roman_Γ = ( roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ { 1 , … , italic_k } end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT. Thus, as k/n𝑘𝑛k/nitalic_k / italic_n is small, the target density is \ldqflat\rdq in many directions and thus the risk of proposal rejection is small if the sampler attempts to move in these directions.

In Table 1, we see that RMH outperforms GSS and GeoRMH for n=100𝑛100n=100italic_n = 100, and remains competitive for n{3,30}𝑛330n\in\{3,30\}italic_n ∈ { 3 , 30 }. This, maybe at first glance surprising, performance of RMH can be explained by the fact that when k/n𝑘𝑛k/nitalic_k / italic_n is small, the number of constraints is low compared to the dimensionality of the space, making 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ) nearly Euclidean and the Stiefel projection nearly equal to the identity. Consequently, since the optimal acceptance rate of 0.2340.2340.2340.234 for the adaptive mechanism was found in an Euclidean space, it is reasonably explainable that RMH performs better in this scenario. This interpretation is further confirmed by Table 2, where we see that GSS(w=5𝑤5w=5italic_w = 5) outperforms RMH and GeoRMH for any k{5,10,20}𝑘51020k\in\{5,10,20\}italic_k ∈ { 5 , 10 , 20 }, though this advantage is less significant for GeoRMH due to its ability to explore the space according to its intrinsic geometry. Recall, to put our observations into perspective, that while RMH may outperform GSS and GeoRMH from ESS, it is fundamentally biased.

In Table 1, we observe that GeoRMH is not as efficient as RMH but performs comparably to GSS(w=7𝑤7w=7italic_w = 7) when n{30,100}𝑛30100n\in\{30,100\}italic_n ∈ { 30 , 100 }. This difference can be linked to the fact that GeoRMH and GSS only differ in the method they employ to move on a geodesic. The first uses a Metropolis Hastings mechanism whereas the latter runs a slice sampler. Using (uniform) slice sampling, can be viewed as first fixing an acceptance level and then drawing proposals (with the help of the stepping-out and shrinkage procedure) until acceptance is reached. This ensures that the sampled direction in the tangent space is not wasted, but it increases the computational cost of each transition step, since each proposal within the shrinkage procedure involves new computations. The parameter w𝑤witalic_w affects the average number of attempts in the shrinkage procedure since it widens the portion of the geodesic where the shrinkage proposal is sampled, thus increasing the possibility of not meeting the acceptance level. For example, in the case n=3𝑛3n=3italic_n = 3, there are, on average, 1.111.111.111.11 attempts when w=1𝑤1w=1italic_w = 1, but 1.411.411.411.41 attempts when w=5𝑤5w=5italic_w = 5. Therefore, there is a trade-off when selecting w𝑤witalic_w to optimize the time efficiency of sampling, as larger values of w𝑤witalic_w increase both the ESS and the computation time.

Varying (n,k)𝑛𝑘(n,k)( italic_n , italic_k ) on the Grassmann manifold.

In Table 3, we present only a subset of our experiments to convey the following message: When GeoMALA is well tuned, the gradient-informed sampler outperforms GSS regardless of the choice of w𝑤witalic_w. However, the tuning of GeoMALA is very sensitive. For instance, in the case (n,k)=(100,2)𝑛𝑘1002(n,k)=(100,2)( italic_n , italic_k ) = ( 100 , 2 ) with h=11h=1italic_h = 1, the gradient information encourages to focus on high density area and does not explore enough to outperform GSS, but if we increase the stepsize hhitalic_h to 2222, GeoMALA is better than GSS. Regarding the role of the hyperparameter w𝑤witalic_w, the conclusions are consistent with those on the Stiefel manifold. Increasing w𝑤witalic_w beyond a certain point does not pay off, since the manifold is compact. The sweet spot appears to be around w=72π𝑤72𝜋w=7\approx 2\piitalic_w = 7 ≈ 2 italic_π. Both methods have the same complexity, as in both cases, we need to sample a point on the tangent space and compute a geodesic using SVD.

Table 3: Experiment performances recorded with the effective sample size [min, median, max] on 10 repetitions for varying dimension (n,k)𝑛𝑘(n,k)( italic_n , italic_k ) with a fixed shape of distribution, on the Grassmann manifold. We show the results only for w=7𝑤7w=7italic_w = 7 since it does not affect the comparison with GeoMALA, and the dependence according to w𝑤witalic_w is globally the same as on the Stiefel manifold. \ldqGeoMALA Best hhitalic_h\rdq means that we provide the result for the best stepsize parameter hhitalic_h in {0.01,0.1,0.5,1,2}0.010.10.512\{0.01,0.1,0.5,1,2\}{ 0.01 , 0.1 , 0.5 , 1 , 2 }.
(n,k)𝑛𝑘(n,k)( italic_n , italic_k ) (3,2) (30,20) (100,2)
GSS w=7𝑤7w=7italic_w = 7 [29520,37691,41409]295203769141409\small{[29520,37691,41409]}[ 29520 , 37691 , 41409 ] [22434,24668,28531]224342466828531\small{[22434,24668,28531]}[ 22434 , 24668 , 28531 ] [27294,30578,35995]272943057835995\small{[27294,30578,35995]}[ 27294 , 30578 , 35995 ]
GeoMALA Best hhitalic_h [46086, 51165, 55630]delimited-[]46086, 51165, 55630\small{[\textbf{46086, 51165, 55630}]}[ 46086, 51165, 55630 ] [39445,41776,48142]delimited-[]39445,41776,48142\small{[\textbf{39445,41776,48142}]}[ 39445,41776,48142 ] [56437, 61601, 71250]delimited-[]56437, 61601, 71250\small{[\textbf{56437, 61601, 71250}]}[ 56437, 61601, 71250 ]

Varying anisotropy on the Stiefel manifold.

Table 4: Experiment performances recorded with the effective sample size [min, median, max] on the 10 repetitions for varying anisotropy factor λ𝜆\lambdaitalic_λ when (n,k)=(30,2)𝑛𝑘302(n,k)=(30,2)( italic_n , italic_k ) = ( 30 , 2 ), on the Stiefel manifold.
λ𝜆\lambdaitalic_λ 1 10 100
GSS w=5𝑤5w=5italic_w = 5 [28375,34262,37405]283753426237405[28375,34262,37405][ 28375 , 34262 , 37405 ] [4901, 5283, 5477]delimited-[]4901, 5283, 5477[\textbf{4901, 5283, 5477}][ 4901, 5283, 5477 ] [1153, 1328, 1453]delimited-[]1153, 1328, 1453[\textbf{1153, 1328, 1453}][ 1153, 1328, 1453 ]
RMH [50772,54243,59906]507725424359906[\textbf{50772},54243,59906][ 50772 , 54243 , 59906 ] [1492,2314,3214]149223143214[1492,2314,3214][ 1492 , 2314 , 3214 ] [669,878,998]669878998[669,878,998][ 669 , 878 , 998 ]
GeoRMH [49007,57195, 68948]4900757195, 68948[49007,\textbf{57195, 68948}][ 49007 , 57195, 68948 ] [1978,2336,3217]197823363217[1978,2336,3217][ 1978 , 2336 , 3217 ] [682,870,1075]6828701075[682,870,1075][ 682 , 870 , 1075 ]

In Table 4, the samplers’ performances worsen as λ𝜆\lambdaitalic_λ increases, indicating the difficulty of exploring the space when the density is sharp in a specific direction.

We observe that RMH and GeoRMH outperform GSS only when λ=1𝜆1\lambda=1italic_λ = 1, highlighting the effectiveness of the slice sampling approach of GSS in dealing with sharp densities.

In our experiments, we notice that the number of attempts in the shrinkage procedure is more sensitive to the variance of the target distribution than the value of the parameter w𝑤witalic_w. Therefore, in cases where GSS is a good fit for the sampling task, the computation time increases accordingly.

Varying the variance on the Grassmann manifold.

In Table 5, we observe that GSS outperforms GeoMALA when λ{10,100}𝜆10100\lambda\in\{10,100\}italic_λ ∈ { 10 , 100 }, indicating its advantage in situations where the density is concentrated. It highlights the fact that GeoMALA is not robust to the choice of the stepsize parameter.

Furthermore, we find that using a lower value for w𝑤witalic_w is more suitable when the distribution is sharp (e.g., w=1,λ=100formulae-sequence𝑤1𝜆100w=1,\lambda=100italic_w = 1 , italic_λ = 100). Additionally, choosing a large value of the stepping out parameter m𝑚mitalic_m can improve the performance when w𝑤witalic_w is chosen too small.

This experiment reaffirms the previous findings, demonstrating that GSS adapts to the geometry of the density, which is promising for practical applications. Moreover, the performance of GSS appears to be quite robust across different choices of w𝑤witalic_w, and taking a large value for m𝑚mitalic_m strengthens this feature.

Table 5: Experiment performances recorded with the effective sample size [min, median, max] on 10 repetitions for varying variance factor λ𝜆\lambdaitalic_λ when (n,k)=(3,2)𝑛𝑘32(n,k)=(3,2)( italic_n , italic_k ) = ( 3 , 2 ), on the Grassmann manifold. \ldqGeoMALA Best hhitalic_h\rdq means that we provide the result for the best stepsize parameter hhitalic_h in {0.01,0.1,0.5,1}0.010.10.51\{0.01,0.1,0.5,1\}{ 0.01 , 0.1 , 0.5 , 1 }
λ𝜆\lambdaitalic_λ 1 10 100
GSS w=1,m=1formulae-sequence𝑤1𝑚1w=1,m=1italic_w = 1 , italic_m = 1 [8877,10070,11010]88771007011010\small{[8877,10070,11010]}[ 8877 , 10070 , 11010 ] [10905,12282,13582]109051228213582\small{[10905,12282,13582]}[ 10905 , 12282 , 13582 ] [16111,19371,23164]161111937123164\small{[16111,19371,23164]}[ 16111 , 19371 , 23164 ]
GSS w=1,m=3formulae-sequence𝑤1𝑚3w=1,m=3italic_w = 1 , italic_m = 3 [31097,35149,42968]310973514942968\small{[31097,35149,42968]}[ 31097 , 35149 , 42968 ] [13630,16001,19422]136301600119422\small{[13630,16001,19422]}[ 13630 , 16001 , 19422 ] [12298,20144,21981]122982014421981\small{[12298,20144,21981]}[ 12298 , 20144 , 21981 ]
GSS w=1,m=10formulae-sequence𝑤1𝑚10w=1,m=10italic_w = 1 , italic_m = 10 [32075,36968,42086]320753696842086\small{[32075,36968,42086]}[ 32075 , 36968 , 42086 ] [17346, 18913, 22446]delimited-[]17346, 18913, 22446\small{[\textbf{17346, 18913, 22446}]}[ 17346, 18913, 22446 ] [16390,19436,23697]163901943623697\small{[16390,19436,23697]}[ 16390 , 19436 , 23697 ]
GSS w=7,m=1formulae-sequence𝑤7𝑚1w=7,m=1italic_w = 7 , italic_m = 1 [33768,36142,41443]337683614241443\small{[33768,36142,41443]}[ 33768 , 36142 , 41443 ] [14807,17197,20787]148071719720787\small{[14807,17197,20787]}[ 14807 , 17197 , 20787 ] [18110, 19526,23424]18110, 1952623424\small{[\textbf{18110, 19526},23424]}[ 18110, 19526 , 23424 ]
GeoMALA Best hhitalic_h [44628, 49368, 56009]delimited-[]44628, 49368, 56009\small{[\textbf{44628, 49368, 56009}]}[ 44628, 49368, 56009 ] [11173,12986,14294]111731298614294\small{[11173,12986,14294]}[ 11173 , 12986 , 14294 ] [395,2611,43087]395261143087\small{[395,2611,\textbf{43087}]}[ 395 , 2611 , 43087 ]

We discuss some further aspects of the conducted numerical experiments.

Remark 9.

(Complexity.) Up to this point, a discerning reader might consider the comparison of ESS between RMH, GSS, and GeoRMH unfair, as GSS involves additional computations due to the shrinkage procedure. While this is true, the main computational bottlenecks in all methods are the reprojection of the proposal on the Stiefel manifold at the end of each step using an SVD O(nk2)𝑂𝑛superscript𝑘2O(nk^{2})italic_O ( italic_n italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), the sampling on the tangent space using a QR decomposition O(k3)𝑂superscript𝑘3O(k^{3})italic_O ( italic_k start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ), and the eigenvalues of the skew-symmetric matrices O((2k)3)𝑂superscript2𝑘3O((2k)^{3})italic_O ( ( 2 italic_k ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ).

In the case of the shrinkage procedure in GSS, an eigenvalue decomposition is initially computed, enabling the generation of geodesics with some matrix products for the subsequent attempts. Thus, from a computational perspective, the only difference between GeoRMH and GSS is the cost of these matrix products O(n2.37)𝑂superscript𝑛2.37O(n^{2.37})italic_O ( italic_n start_POSTSUPERSCRIPT 2.37 end_POSTSUPERSCRIPT ).

However, when comparing RMH and GSS, we need to account for the additional computational cost of the QR decomposition and eigenvalue decomposition, both in O(k3)𝑂superscript𝑘3O(k^{3})italic_O ( italic_k start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ). Notably, the computational cost related to constraints naturally increases with k𝑘kitalic_k. In practice, without any engineering optimization of the codes, we observe that GSS takes between two to four times longer to execute than RMH (which is biased), but it is nearly equivalent to GeoRMH in terms of computation time when n𝑛nitalic_n is not excessively large. The relative speed of RMH compared to GSS has to be weighted by its bias (of unknown size).

Remark 10.

(Choice of the hyperparameters.) The performance of GSS seems to be quite robust to the choice of the hyperparameters m𝑚mitalic_m and w𝑤witalic_w as soon as mw2π𝑚𝑤2𝜋mw\geqslant 2\piitalic_m italic_w ⩾ 2 italic_π (see Tables 1, 2, 5). The number 2π2𝜋2\pi2 italic_π coincides with twice the diameter of the Grassmann and the Stiefel manifold. Therefore, as a heuristic we propose to choose m𝑚mitalic_m and w𝑤witalic_w such that mw𝑚𝑤mwitalic_m italic_w is about twice the diameter for compact manifolds in general.

3.2 A practical case: Understanding the variability in graph data sets.

We consider in this section a model introduced in [mantoux2021understanding] that aims to infer the structure of adjacency matrices from functional connectivity networks of brains. Consider (Φ(j))j{1,,J}subscriptsuperscriptΦ𝑗𝑗1𝐽(\Upphi^{(j)})_{j\in\{1,\ldots,J\}}( roman_Φ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ { 1 , … , italic_J } end_POSTSUBSCRIPT, J𝐽Jitalic_J adjacency matrices of different networks with n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N nodes each. Let k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N with kn𝑘𝑛k\leqslant nitalic_k ⩽ italic_n. The model that we consider has parameter θ=(σκ2,σϵ2,μ,F)𝜃superscriptsubscript𝜎𝜅2subscriptsuperscript𝜎2italic-ϵ𝜇𝐹\theta=(\sigma_{\kappa}^{2},\sigma^{2}_{\epsilon},\mu,F)italic_θ = ( italic_σ start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT , italic_μ , italic_F ), σκ2,σϵ2>0,μkformulae-sequencesuperscriptsubscript𝜎𝜅2subscriptsuperscript𝜎2italic-ϵ0𝜇superscript𝑘\sigma_{\kappa}^{2},\sigma^{2}_{\epsilon}>0,\mu\in\mathbb{R}^{k}italic_σ start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT > 0 , italic_μ ∈ blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, and Fn×k𝐹superscript𝑛𝑘F\in\mathbb{R}^{n\times k}italic_F ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT, and is defined for j{1,,J}𝑗1𝐽j\in\{1,\ldots,J\}italic_j ∈ { 1 , … , italic_J } as

Φ(j)superscriptΦ𝑗\displaystyle\Upphi^{(j)}roman_Φ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT =Γ(j)diag(κ(j))(Γ(j))+(ϵ(j)),ϵ(j)i.i.d.𝒩(0,σϵ2Idn(n+1)/2),absentsuperscriptΓ𝑗diagsuperscript𝜅𝑗superscriptsuperscriptΓ𝑗topsuperscriptitalic-ϵ𝑗superscriptitalic-ϵ𝑗i.i.d.similar-to𝒩0superscriptsubscript𝜎italic-ϵ2subscriptId𝑛𝑛12\displaystyle=\Gamma^{(j)}\operatorname{diag}(\kappa^{(j)})(\Gamma^{(j)})^{% \top}+\mathcal{E}(\epsilon^{(j)}),\qquad\epsilon^{(j)}\overset{\text{i.i.d.}}{% \sim}\mathcal{N}(0,\sigma_{\epsilon}^{2}\mathrm{Id}_{n(n+1)/2}),= roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT roman_diag ( italic_κ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) ( roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + caligraphic_E ( italic_ϵ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) , italic_ϵ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT overi.i.d. start_ARG ∼ end_ARG caligraphic_N ( 0 , italic_σ start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Id start_POSTSUBSCRIPT italic_n ( italic_n + 1 ) / 2 end_POSTSUBSCRIPT ) , (13)
κ(j)superscript𝜅𝑗\displaystyle\kappa^{(j)}italic_κ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT i.i.d.𝒩(μ,σκ2Idk),Γ(j)i.i.d.vMF(F),i.i.d.similar-to𝒩𝜇superscriptsubscript𝜎𝜅2subscriptId𝑘superscriptΓ𝑗i.i.d.similar-tovMF𝐹\displaystyle\overset{\text{i.i.d.}}{\sim}\mathcal{N}(\mu,\sigma_{\kappa}^{2}% \mathrm{Id}_{k}),\qquad\Gamma^{(j)}\overset{\text{i.i.d.}}{\sim}\operatorname{% vMF}(F),overi.i.d. start_ARG ∼ end_ARG caligraphic_N ( italic_μ , italic_σ start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Id start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT overi.i.d. start_ARG ∼ end_ARG roman_vMF ( italic_F ) ,

where Γ(j)𝒱(n,k)superscriptΓ𝑗𝒱𝑛𝑘\Gamma^{(j)}\in\mathcal{V}(n,k)roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ∈ caligraphic_V ( italic_n , italic_k ), κ(j)ksuperscript𝜅𝑗superscript𝑘\kappa^{(j)}\in\mathbb{R}^{k}italic_κ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT is called pattern weight vector, and ϵ(j)n(n+1)/2superscriptitalic-ϵ𝑗superscript𝑛𝑛12\epsilon^{(j)}\in\mathbb{R}^{n(n+1)/2}italic_ϵ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n ( italic_n + 1 ) / 2 end_POSTSUPERSCRIPT represents the symmetric residual noise (i.e., \mathcal{E}caligraphic_E maps a vector in n(n+1)/2superscript𝑛𝑛12\mathbb{R}^{n(n+1)/2}blackboard_R start_POSTSUPERSCRIPT italic_n ( italic_n + 1 ) / 2 end_POSTSUPERSCRIPT to the symmetric matrix in n×nsuperscript𝑛𝑛\mathbb{R}^{n\times n}blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT determined by its components). The unobserved variables Γ(j)𝒱(n,k)superscriptΓ𝑗𝒱𝑛𝑘\Gamma^{(j)}\in\mathcal{V}(n,k)roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ∈ caligraphic_V ( italic_n , italic_k ) and κ(j)superscript𝜅𝑗\kappa^{(j)}italic_κ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT determine the individual-level specificity of network j𝑗jitalic_j.

The original paper estimates the parameters using the Markov chain Monte Carlo-stochastic algorithm expectation maximization (MCMC-SAEM) procedure [kuhn2004coupling]. MCMC-SAEM is an extension of the expectation maximization (EM) algorithm, where the E-step involves approximating integrals using MCMC methods. In this context, the E-step samples the density pΓ=p(Γ(j)|Φ(j),κ(j),θn)subscript𝑝Γ𝑝conditionalsuperscriptΓ𝑗superscriptΦ𝑗superscript𝜅𝑗subscript𝜃𝑛p_{\Gamma}=p(\Gamma^{(j)}|\Upphi^{(j)},\kappa^{(j)},\theta_{n})italic_p start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT = italic_p ( roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT | roman_Φ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT , italic_κ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ), where θnsubscript𝜃𝑛\theta_{n}italic_θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is the current estimate for the parameter θ𝜃\thetaitalic_θ, within a Gibbs procedure using RMH (see [mantoux2021understanding, Algorithm 3]).

For the optimization procedure, we propose to replace RMH with GSS(w=1,m=5formulae-sequence𝑤1𝑚5w=1,m=5italic_w = 1 , italic_m = 5) to compare performances. Since our implementation of RMH is four times faster than GSS, we chose to multiply the number of MCMC iterations by four when using RMH for the E-step.

On a synthetic dataset.

We follow the same procedure as [mantoux2021understanding] to generate synthetic data (Φ(j),Γ(j),κ(j),ϵ(j))j{1,,J}subscriptsubscriptsuperscriptΦ𝑗subscriptsuperscriptΓ𝑗subscriptsuperscript𝜅𝑗subscriptsuperscriptitalic-ϵ𝑗𝑗1𝐽(\Upphi^{(j)}_{*},\Gamma^{(j)}_{*},\kappa^{(j)}_{*},\epsilon^{(j)}_{*})_{j\in% \{1,\ldots,J\}}( roman_Φ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , italic_κ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , italic_ϵ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ { 1 , … , italic_J } end_POSTSUBSCRIPT with J=100𝐽100J=100italic_J = 100, (n,k)=(30,5)𝑛𝑘305(n,k)=(30,5)( italic_n , italic_k ) = ( 30 , 5 ) and (n,k)=(3,2)𝑛𝑘32(n,k)=(3,2)( italic_n , italic_k ) = ( 3 , 2 ). The generation parameters are fixed as σϵ2=0.1superscriptsubscript𝜎italic-ϵ20.1\sigma_{\epsilon}^{2}=0.1italic_σ start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0.1, σκ2=2superscriptsubscript𝜎𝜅22\sigma_{\kappa}^{2}=2italic_σ start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 2, μ=(10,2,,2)k𝜇1022superscript𝑘\mu=(10,2,\ldots,2)\in\mathbb{R}^{k}italic_μ = ( 10 , 2 , … , 2 ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT, and Fn×ksuperscript𝐹superscript𝑛𝑘F^{*}\in\mathbb{R}^{n\times k}italic_F start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT is chosen as the matrix with columns a1f1,,akfknsubscript𝑎1subscript𝑓1subscript𝑎𝑘subscript𝑓𝑘superscript𝑛a_{1}f_{1},\ldots,a_{k}f_{k}\in\mathbb{R}^{n}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, where (fi)i{1,,k}𝒱(n,k)subscriptsubscript𝑓𝑖𝑖1𝑘𝒱𝑛𝑘(f_{i})_{i\in\{1,\ldots,k\}}\in\mathcal{V}(n,k)( italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ { 1 , … , italic_k } end_POSTSUBSCRIPT ∈ caligraphic_V ( italic_n , italic_k ) is sampled uniformly from 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ), and a1=λ{1,100}subscript𝑎1𝜆1100a_{1}=\lambda\in\{1,100\}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_λ ∈ { 1 , 100 }, ai=1subscript𝑎𝑖1a_{i}=1italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 for any i{2,,k}𝑖2𝑘i\in\{2,\ldots,k\}italic_i ∈ { 2 , … , italic_k }. The factor λ𝜆\lambdaitalic_λ in this context serves as an anisotropy factor incorporated to examine how performance can be influenced by anisotropy. The optimization process involves random initialization of the parameters F𝐹Fitalic_F and κ𝜅\kappaitalic_κ, and we deterministically set σκ2=σϵ2=1superscriptsubscript𝜎𝜅2subscriptsuperscript𝜎2italic-ϵ1\sigma_{\kappa}^{2}=\sigma^{2}_{\epsilon}=1italic_σ start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT = 1. Then, (Γ(j))j{1,,J}subscriptsuperscriptΓ𝑗𝑗1𝐽(\Gamma^{(j)})_{j\in\{1,\ldots,J\}}( roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ { 1 , … , italic_J } end_POSTSUBSCRIPT is initialized either by performing 200 iterations of GSS on pΓsubscript𝑝Γp_{\Gamma}italic_p start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT or 800 iterations of RMH. The results are averaged over 10 repetitions with different random initializations and generation parameters.

We run the MCMC-SAEM procedure for 100 iterations, and at each step of MCMC-SAEM, 20 iterations of MCMC are performed when GSS is used, while 80 iterations are performed when RMH is used for the E-step of the EM algorithm.

Refer to caption
Figure 6: We compare the performance of RMH and GSS(w=1,m=5formulae-sequence𝑤1𝑚5w=1,m=5italic_w = 1 , italic_m = 5) used as MCMC-method within the MCMC-SAEM procedure to maximize the likelihood of the model. The log complete likelihood is used as a proxy to perform the comparison since the log likelihood is not tractable. The rRMSE is computed for the difference between the estimated parameter μ^^𝜇\hat{\mu}over^ start_ARG italic_μ end_ARG and the true parameter μ𝜇\muitalic_μ. The convergence of the estimated parameter μ^^𝜇\hat{\mu}over^ start_ARG italic_μ end_ARG is depicted for the case (n,k)=(30,5)𝑛𝑘305(n,k)=(30,5)( italic_n , italic_k ) = ( 30 , 5 ). To ensure a fair comparison, we use four times more MCMC steps for RMH compared to GSS, since RMH is four times faster.

First, upon examining Figure 6, we can observe that the complete log-likelihood curve555Denoting by θ𝜃\thetaitalic_θ the model parameter, the log complete likelihood is logp(Φ(j),κ(i),Γ(i)|θ)𝑝superscriptΦ𝑗superscript𝜅𝑖conditionalsuperscriptΓ𝑖𝜃\log p(\Upphi^{(j)},\kappa^{(i)},\Gamma^{(i)}|\theta)roman_log italic_p ( roman_Φ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT , italic_κ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , roman_Γ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT | italic_θ ) and the log likelihood is logp(Φ(j)|θ)𝑝conditionalsuperscriptΦ𝑗𝜃\log p(\Upphi^{(j)}|\theta)roman_log italic_p ( roman_Φ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT | italic_θ ). exhibits higher values when λ=100𝜆100\lambda=100italic_λ = 100 in contrast to λ=1𝜆1\lambda=1italic_λ = 1. This can be attributed to the unobserved variables (Γ(j))j1,,JsubscriptsuperscriptΓ𝑗𝑗1𝐽(\Gamma^{(j)})_{j\in{1,\ldots,J}}( roman_Γ start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ 1 , … , italic_J end_POSTSUBSCRIPT being more concentrated in the direction indicated by the first column of Fsuperscript𝐹F^{*}italic_F start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, making recovery easier.

Secondly, GSS outperforms RMH regarding the complete likelihood. The curve increases more rapidly with the number of iterations and attains a higher final value. This improvement is particularly striking for (n,k)=(30,5)𝑛𝑘305(n,k)=(30,5)( italic_n , italic_k ) = ( 30 , 5 ), which is consistent with the findings in Table 2. Moreover, as illustrated by the third graph of Figure 6, the relative root mean square error (rRMSE) of the estimated parameter to the true parameter is always smaller when using GSS, especially when λ=100𝜆100\lambda=100italic_λ = 100.

Missing links imputation.

We follow the experimental setup proposed in [mantoux2021understanding, Section 5.1.2] for missing links imputation: A synthetic data set of N=200𝑁200N=200italic_N = 200 adjacency matrices {Φ(i)}i=1NsuperscriptsubscriptsuperscriptΦ𝑖𝑖1𝑁\{\Upphi^{(i)}\}_{i=1}^{N}{ roman_Φ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT with n=20𝑛20n=20italic_n = 20 nodes and k=5𝑘5k=5italic_k = 5 is generated from the model specified by (13) with parameters θ=(σκ2,σϵ2,μ,F)𝜃superscriptsubscript𝜎𝜅2subscriptsuperscript𝜎2italic-ϵ𝜇𝐹\theta=(\sigma_{\kappa}^{2},\sigma^{2}_{\epsilon},\mu,F)italic_θ = ( italic_σ start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT , italic_μ , italic_F ) given in Appendix D.1. In a first stage, the MCMC-SAEM algorithm with GSS is applied to perform the estimation of the parameters resulting in an estimator θ^^𝜃\hat{\mathbf{\theta}}over^ start_ARG italic_θ end_ARG. Then, from the same model, we generate another 200 samples {Φ¯(i)}i=1Nsuperscriptsubscriptsuperscript¯Φ𝑖𝑖1𝑁\{\bar{\Upphi}^{(i)}\}_{i=1}^{N}{ over¯ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT. For each of these samples, 16% of the edge weights corresponding to the interactions between the last eight nodes are masked. Denote by {Φ~(i)}i=1Nsuperscriptsubscriptsuperscript~Φ𝑖𝑖1𝑁\{\tilde{\Upphi}^{(i)}\}_{i=1}^{N}{ over~ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT the resulting adjacency matrices. We then aim to reconstruct the missing links from the samples using the following three procedures:

  • The missing links are found from the maximum a posteriori (MAP) approximated as MAP=Γdiag(κ)(Γ)MAPΓdiag𝜅superscriptΓtop\text{MAP}=\Gamma\operatorname{diag}(\kappa)(\Gamma)^{\top}MAP = roman_Γ roman_diag ( italic_κ ) ( roman_Γ ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, where Γ,κΓ𝜅\Gamma,\kapparoman_Γ , italic_κ are the result of 4000 iterations of gradient ascent on the posterior density p(Γ,κ|Φ~(i),θ^)𝑝Γconditional𝜅superscript~Φ𝑖^𝜃p(\Gamma,\kappa|\tilde{\Upphi}^{(i)},\hat{\mathbf{\theta}})italic_p ( roman_Γ , italic_κ | over~ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , over^ start_ARG italic_θ end_ARG ), the conditional density of (Γ,κ)Γ𝜅(\Gamma,\kappa)( roman_Γ , italic_κ ) given the masked observation Φ~(i)superscript~Φ𝑖\tilde{\Upphi}^{(i)}over~ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT in the model (13).

  • The missing links are found from the posterior mean (PM) defined as

    PM=d𝒱(n,k)Γdiag(κ)(Γ)p(Γ,κ|Φ~(i),θ^)ν𝔤(dΓ)Lebd(dκ).PMsubscriptsuperscript𝑑subscript𝒱𝑛𝑘Γdiag𝜅superscriptΓtop𝑝Γconditional𝜅superscript~Φ𝑖^𝜃subscript𝜈𝔤dΓsubscriptLeb𝑑d𝜅\text{PM}=\int_{\mathbb{R}^{d}}\int_{\mathcal{V}(n,k)}\Gamma\operatorname{diag% }(\kappa)(\Gamma)^{\top}p(\Gamma,\kappa|\tilde{\Upphi}^{(i)},\hat{\mathbf{% \theta}})\ \nu_{\mathfrak{g}}(\mathrm{d}\Gamma)\,\mathrm{Leb}_{d}(\mathrm{d}% \kappa).PM = ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT caligraphic_V ( italic_n , italic_k ) end_POSTSUBSCRIPT roman_Γ roman_diag ( italic_κ ) ( roman_Γ ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_p ( roman_Γ , italic_κ | over~ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , over^ start_ARG italic_θ end_ARG ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d roman_Γ ) roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_κ ) . (14)

    This distribution is approximated using θ^^𝜃\hat{\mathbf{\theta}}over^ start_ARG italic_θ end_ARG and GSS or RMH within Gibbs sampler.

  • Finally, we consider a reconstruction using simply the link from N1i=1NΦ¯(i)superscript𝑁1superscriptsubscript𝑖1𝑁superscript¯Φ𝑖N^{-1}\sum_{i=1}^{N}\bar{\Upphi}^{(i)}italic_N start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT over¯ start_ARG roman_Φ end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT.

When computing the PM and the MAP, GSS achieves a rRMSE of 50% (±plus-or-minus\pm± 24% standard deviation over the dataset) and 52% (±plus-or-minus\pm± 16%) on average respectively, in contrast to 57% (±plus-or-minus\pm± 24%) and 58% (±plus-or-minus\pm± 28%) for RMH, and 85% (±plus-or-minus\pm± 10 %) for the mean sample. Then the same experiment is repeated, but this time, 40% of the edges are uniformly masked instead of constraining the mask. With GSS, we find a rRMSE of 30% (±plus-or-minus\pm± 7% standard deviation over the dataset) and 35% (±plus-or-minus\pm± 8%) on average for the PM and the MAP, compared to 34% (±plus-or-minus\pm± 9%) and 35% (±plus-or-minus\pm± 7%) when using RMH, and 75% (±plus-or-minus\pm± 5 %) with the mean sample. This clearly indicates that GSS outperforms RMH on this example.

Refer to caption
Figure 7: The box plots of the rRMSE when reconstructing the observation Φ(i)superscriptΦ𝑖\Upphi^{(i)}roman_Φ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT with the posterior mean (14) estimated with GSS or RMH and a five components PCA in the model(k=5𝑘5k=5italic_k = 5). To ensure a fair comparison, we use four times more MCMC steps for RMH compared to GSS, since RMH is four times faster.

On a real dataset.

We use real data to perform a comparison similar to the one proposed in [mantoux2021understanding] where the authors use connectivity matrices with n=21𝑛21n=21italic_n = 21 nodes and k=5𝑘5k=5italic_k = 5 in their model to analyze N=1000𝑁1000N=1000italic_N = 1000 subjects666We did not use the same data, since their dataset was unavailable to us.. Data were provided by the Human Connectome Project777https://www.humanconnectome.org/study/hcp-young-adult/document/extensively-processed-fmri-data-documentation. The dataset is composed of brain connectivity matrices generated from resting-state functional MRI (rs-fMRI) by following the pipeline described in this documentation888https://www.humanconnectome.org/storage/app/media/documentation/s1200/HCP1200-DenseConnectome+PTN+Appendix-July2017.pdf. On a subject which receives no stimulation (at rest), the rs-fMRI records fluctuations in blood oxygenation levels throughout the brain. By maximizing the signal coherence in each region of the brain with a spatial independent component analysis (ICA) [beckmann2004probabilistic], it yields a partition of the brain depending on its structure and variations from one individual to another. Finally, the temporal correlations between the mean blood oxygenation levels in each region are assembled into a matrix. This matrix is called the brain’s functional connectivity network. It should be noted that this network is not necessarily derived from physical reality, since it only represents correlations between brain regions. This is why the term “functional” is coined. In this study, the connectivity matrices are defined on a parcellation of the brain into n=25𝑛25n=25italic_n = 25 regions999The network modeling is related to “netmats” in the documentation.. We chose the “recon2” group of the dataset leaving us 812 matrices of dimension 25×25252525\times 2525 × 25 to analyze.

We choose k=5𝑘5k=5italic_k = 5, and run 1000 MCMC-SAEM iterations with 20 MCMC steps per SAEM iteration when using GSS(w=1,m=10formulae-sequence𝑤1𝑚10w=1,m=10italic_w = 1 , italic_m = 10), and 80 iterations when using RMH. The initialization procedure is the same as for synthetic data. To compare the samplers, we compute the rRMSE between PM and the observations similarly as in the previous paragraph. To asses the benefit of the model, we also compute the approximation given by the projection onto the subspace of the first five principal component analysis (PCA) components of the full data set, where each matrix Φ(i)superscriptΦ𝑖\Upphi^{(i)}roman_Φ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT has been vectorized.

The boxplots in Figure 7 show that GSS outperforms RMH since it reduces the tail of the error distribution. In addition, GSS achieves better results than RMH when looking at the evolution of the complete likelihood, as already observed on synthetic data. The model proves to be better suited than a simple PCA, as shown in [mantoux2021understanding], but often fails to offer a good representation of the observations in large dimensions, even for synthetic data.

3.3 ARMA model

Time series related to different types of data, such as dynamic textures, shape sequences and videos, are often modeled as auto-regressive and moving average (ARMA) models [doretto2003dynamic, aggarwal2004system, bissacco2001recognition, veeraraghavan2005matching]. Provided observations z=(zt)t{1,,T}(n)T𝑧subscriptsubscript𝑧𝑡𝑡1𝑇superscriptsuperscript𝑛𝑇z=(z_{t})_{t\in\{1,\ldots,T\}}\in(\mathbb{R}^{n})^{T}italic_z = ( italic_z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ∈ { 1 , … , italic_T } end_POSTSUBSCRIPT ∈ ( blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, the ARMA model equations are

ztsubscript𝑧𝑡\displaystyle z_{t}italic_z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =Hxt+wt,wti.i.d.𝒩(𝟎n,R),absent𝐻subscript𝑥𝑡subscript𝑤𝑡subscript𝑤𝑡i.i.d.similar-to𝒩subscript0𝑛𝑅\displaystyle=Hx_{t}+w_{t},\quad w_{t}\overset{\text{i.i.d.}}{\sim}\mathcal{N}% (\boldsymbol{0}_{n},R),= italic_H italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT overi.i.d. start_ARG ∼ end_ARG caligraphic_N ( bold_0 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_R ) ,
xt+1subscript𝑥𝑡1\displaystyle x_{t+1}italic_x start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT =Bxt+vt,vti.i.d.𝒩(𝟎k,Q),t{1,,T},formulae-sequenceabsent𝐵subscript𝑥𝑡subscript𝑣𝑡subscript𝑣𝑡i.i.d.similar-to𝒩subscript0𝑘𝑄𝑡1𝑇\displaystyle=Bx_{t}+v_{t},\quad v_{t}\overset{\text{i.i.d.}}{\sim}\mathcal{N}% (\boldsymbol{0}_{k},Q),\qquad t\in\{1,\ldots,T\},= italic_B italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT overi.i.d. start_ARG ∼ end_ARG caligraphic_N ( bold_0 start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_Q ) , italic_t ∈ { 1 , … , italic_T } ,

where x=(xt)t{1,,T}(k)T𝑥subscriptsubscript𝑥𝑡𝑡1𝑇superscriptsuperscript𝑘𝑇x=(x_{t})_{t\in\{1,\ldots,T\}}\in(\mathbb{R}^{k})^{T}italic_x = ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ∈ { 1 , … , italic_T } end_POSTSUBSCRIPT ∈ ( blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT is the hidden state vector, Bk×k𝐵superscript𝑘𝑘B\in\mathbb{R}^{k\times k}italic_B ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT the transition matrix, H𝒱(n,k)𝐻𝒱𝑛𝑘H\in\mathcal{V}(n,k)italic_H ∈ caligraphic_V ( italic_n , italic_k ) the measurement matrix and Rn×n,Qk×kformulae-sequence𝑅superscript𝑛𝑛𝑄superscript𝑘𝑘R\in\mathbb{R}^{n\times n},Q\in\mathbb{R}^{k\times k}italic_R ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT , italic_Q ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × italic_k end_POSTSUPERSCRIPT covariance matrices. We can wrap this model in a Bayesian framework by considering the priors x1𝒩(𝟎k,Q)similar-tosubscript𝑥1𝒩subscript0𝑘𝑄x_{1}\sim\mathcal{N}(\boldsymbol{0}_{k},Q)italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ caligraphic_N ( bold_0 start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_Q ) and HvMF(F)similar-to𝐻vMF𝐹H\sim\operatorname{vMF}(F)italic_H ∼ roman_vMF ( italic_F ) with Fn×k𝐹superscript𝑛𝑘F\in\mathbb{R}^{n\times k}italic_F ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_k end_POSTSUPERSCRIPT, such that (F,B,R,Q)𝐹𝐵𝑅𝑄(F,B,R,Q)( italic_F , italic_B , italic_R , italic_Q ) are seen as hyperparameters.

In this experiment, we compare the ESS related to the sampling of the posterior p(H|z)p(z|H)p(H)proportional-to𝑝conditional𝐻𝑧𝑝conditional𝑧𝐻𝑝𝐻p(H|z)\propto p(z|H)p(H)italic_p ( italic_H | italic_z ) ∝ italic_p ( italic_z | italic_H ) italic_p ( italic_H ) by using GSS and RMH. The ESS is computed from (Tr(FHi))i{1,,T}subscriptTrsuperscript𝐹topsubscript𝐻𝑖𝑖1𝑇(\textrm{Tr}(F^{\top}H_{i}))_{i\in\{1,\ldots,T\}}( Tr ( italic_F start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) start_POSTSUBSCRIPT italic_i ∈ { 1 , … , italic_T } end_POSTSUBSCRIPT where (Hi)i{1,,T}subscriptsubscript𝐻𝑖𝑖1𝑇(H_{i})_{i\in\{1,\ldots,T\}}( italic_H start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ { 1 , … , italic_T } end_POSTSUBSCRIPT are the samples. The expression of p(H)𝑝𝐻p(H)italic_p ( italic_H ) is given in (10) and p(z|H)𝑝conditional𝑧𝐻p(z|H)italic_p ( italic_z | italic_H ) can be computed using Kalman filter updates. The observations (zt)t{1,,T}subscriptsubscript𝑧𝑡𝑡1𝑇(z_{t})_{t\in\{1,\ldots,T\}}( italic_z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ∈ { 1 , … , italic_T } end_POSTSUBSCRIPT are synthetically generated from the model where the parameters are chosen randomly with T=10𝑇10T=10italic_T = 10 and varying dimensions (n,k)𝑛𝑘(n,k)( italic_n , italic_k ). We choose a low variance for the observation covariance matrix R𝑅Ritalic_R and a prior close to the true parameter in order to have a concentrated posterior.

In Table 6, GSS outperforms RMH, and a large w𝑤witalic_w increases the ESS. Surprisingly, increasing n𝑛nitalic_n improves the ESS for GSS and not for RMH, contrary to the experiments with the von Mises-Fisher distribution in Section 3.1. This highlights the impact of the target distribution on the quality of the sampling and the robustness of GSS. The case (n,k)=(30,5)𝑛𝑘305(n,k)=(30,5)( italic_n , italic_k ) = ( 30 , 5 ) reveals the influence of the parameter m𝑚mitalic_m: increasing m𝑚mitalic_m while reducing w𝑤witalic_w increases the performances. Sometimes, large transitions can be achieved only in specific contexts, and large m𝑚mitalic_m allows this when necessary.

Table 6: Experiment performances recorded with the effective sample size [min, median, max] on 10 repetitions for varying dimensions (n,k)𝑛𝑘(n,k)( italic_n , italic_k ) of the Stiefel manifold.
(n,k𝑛𝑘n,kitalic_n , italic_k) (30,2)302(30,2)( 30 , 2 ) (30,5)305(30,5)( 30 , 5 ) (100,2)1002(100,2)( 100 , 2 )
GSS w=1,m=1formulae-sequence𝑤1𝑚1w=1,m=1italic_w = 1 , italic_m = 1 : [389,425,453] [386,404,439] [305,312,337]
GSS w=5,m=2formulae-sequence𝑤5𝑚2w=5,m=2italic_w = 5 , italic_m = 2 : [418,467,484] [470,511,565] [1146,1414,1779]
GSS w=10,m=1formulae-sequence𝑤10𝑚1w=10,m=1italic_w = 10 , italic_m = 1 : [506,549,589] [436,484,527] [1513,1966,2674]
RMH : [293,367,387] [402,421,468] [286,296,323]

3.4 Bayesian clustering on the KTH video action dataset.

In this section, GSS is used to estimate parameters of a Bayesian clustering model on the KTH video action data [schuldt2004recognizing]. The pipeline used in [chakraborty2019statistics] is followed. For four different scenarios (called “d1”, “d2”, “d3” and “d4”), the dataset records 6 actions carried out by 25 humans, which yields 125 videos per scenario. From each video, a sequence of frames is extracted, and each frame is resized to 64×1286412864\times 12864 × 128, before computing its histogram of oriented gradients (HOG) [dalal2005histograms] features. Their dimension is d=3780𝑑3780d=3780italic_d = 3780. Finally, an ARMA model is used to model the sequence of HOG features by estimating the parameter with the closed-form formula given in [doretto2003dynamic]. For each video, let T𝑇Titalic_T be the number of frames and fed×Tsubscript𝑓𝑒superscript𝑑𝑇f_{e}\in\mathbb{R}^{d\times T}italic_f start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_T end_POSTSUPERSCRIPT be the matrix formed by stacking the HOG feature vectors from each frame. Let fe=U^diag(λ)V^subscript𝑓𝑒^𝑈diag𝜆superscript^𝑉topf_{e}=\widehat{U}\mathrm{diag}(\lambda)\widehat{V}^{\top}italic_f start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT = over^ start_ARG italic_U end_ARG roman_diag ( italic_λ ) over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT be the SVD of fesubscript𝑓𝑒f_{e}italic_f start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT by taking only the dl=50subscript𝑑𝑙50d_{l}=50italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = 50 first components, i.e., U^d×dl^𝑈superscript𝑑subscript𝑑𝑙\widehat{U}\in\mathbb{R}^{d\times d_{l}}over^ start_ARG italic_U end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, λdl𝜆superscriptsubscript𝑑𝑙\lambda\in\mathbb{R}^{d_{l}}italic_λ ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and V^T×dl^𝑉superscript𝑇subscript𝑑𝑙\widehat{V}\in\mathbb{R}^{T\times d_{l}}over^ start_ARG italic_V end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT italic_T × italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Then the video is represented by (U^,λ,Σ)𝒱(d,dl)×dl×𝒱(dl,dl)=:𝒳(\widehat{U},\lambda,\Sigma)\in\mathcal{V}(d,d_{l})\times\mathbb{R}^{d_{l}}% \times\mathcal{V}(d_{l},d_{l})=:\mathcal{X}( over^ start_ARG italic_U end_ARG , italic_λ , roman_Σ ) ∈ caligraphic_V ( italic_d , italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) × blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × caligraphic_V ( italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) = : caligraphic_X where

Σ=diag(λ)V^M1V^(V^M2V^)1diag(λ)1,M1=(00IdT10),M2=(IdT1000)T×T.formulae-sequenceΣdiag𝜆superscript^𝑉topsubscript𝑀1^𝑉superscriptsuperscript^𝑉topsubscript𝑀2^𝑉1diagsuperscript𝜆1formulae-sequencesubscript𝑀1matrix00subscriptId𝑇10subscript𝑀2matrixsubscriptId𝑇1000superscript𝑇𝑇\Sigma=\mathrm{diag}(\lambda)\widehat{V}^{\top}M_{1}\widehat{V}(\widehat{V}^{% \top}M_{2}\widehat{V})^{-1}\mathrm{diag}(\lambda)^{-1},\quad M_{1}=\begin{% pmatrix}0&0\\ \mathrm{Id}_{T-1}&0\end{pmatrix},\,\,M_{2}=\begin{pmatrix}\mathrm{Id}_{T-1}&0% \\ 0&0\end{pmatrix}\in\mathbb{R}^{T\times T}.roman_Σ = roman_diag ( italic_λ ) over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG ( over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over^ start_ARG italic_V end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_diag ( italic_λ ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL roman_Id start_POSTSUBSCRIPT italic_T - 1 end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) , italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( start_ARG start_ROW start_CELL roman_Id start_POSTSUBSCRIPT italic_T - 1 end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_T × italic_T end_POSTSUPERSCRIPT .

Clustering models using mixture of von Mises-Fisher distributions have already proposed in [lin2017bayesian, sengupta2017bayesian], where the distribution on dlsuperscriptsubscript𝑑𝑙\mathbb{R}^{d_{l}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is a mixture of multivariate Gaussian distributions with diagonal covariance matrix. The number of clusters equals six as the number of actions. The parameter of each cluster is θk=(Fk1,Fk2,μk,sk)(d×dl×dl×dl×dl×(0,)dl)subscript𝜃𝑘superscriptsubscript𝐹𝑘1superscriptsubscript𝐹𝑘2subscript𝜇𝑘subscript𝑠𝑘superscript𝑑subscript𝑑𝑙superscriptsubscript𝑑𝑙subscript𝑑𝑙superscriptsubscript𝑑𝑙superscript0subscript𝑑𝑙\theta_{k}=(F_{k}^{1},F_{k}^{2},\mu_{k},s_{k})\in(\mathbb{R}^{d\times d_{l}}% \times\mathbb{R}^{d_{l}\times d_{l}}\times\mathbb{R}^{d_{l}}\times(0,\infty)^{% d_{l}})italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∈ ( blackboard_R start_POSTSUPERSCRIPT italic_d × italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT × italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × ( 0 , ∞ ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) and the mixing weight is mk[0,1]subscript𝑚𝑘01m_{k}\in[0,1]italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ [ 0 , 1 ]. The observations yi=(U^i,λi,Σi)𝒳subscript𝑦𝑖subscript^𝑈𝑖subscript𝜆𝑖subscriptΣ𝑖𝒳y_{i}=(\widehat{U}_{i},\lambda_{i},\Sigma_{i})\in\mathcal{X}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( over^ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X and their cluster assignment Zisubscript𝑍𝑖Z_{i}italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are assumed to follow this generation process:

Zii.i.d.Mult((mk)k{1,,6}),yii.i.d.k=16mkp(|Zi=k,θk),\displaystyle Z_{i}\overset{\text{i.i.d.}}{\sim}\mathrm{Mult}((m_{k})_{k\in\{1% ,\ldots,6\}}),\quad y_{i}\overset{\text{i.i.d.}}{\sim}\sum_{k=1}^{6}m_{k}p(% \cdot|Z_{i}=k,\theta_{k})\ ,italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT overi.i.d. start_ARG ∼ end_ARG roman_Mult ( ( italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT ) , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT overi.i.d. start_ARG ∼ end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_p ( ⋅ | italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_k , italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , (15)
p(yi|Zi=k,θk)=pvMF(Fk1)(U^i)pvMF(Fk2)(Σi)exp(12(λμk)diag(sk)1(λμk)),𝑝conditionalsubscript𝑦𝑖subscript𝑍𝑖𝑘subscript𝜃𝑘subscript𝑝vMFsuperscriptsubscript𝐹𝑘1subscript^𝑈𝑖subscript𝑝vMFsuperscriptsubscript𝐹𝑘2subscriptΣ𝑖12superscript𝜆subscript𝜇𝑘topdiagsuperscriptsubscript𝑠𝑘1𝜆subscript𝜇𝑘\displaystyle p(y_{i}|Z_{i}=k,\theta_{k})=p_{\operatorname{vMF}(F_{k}^{1})}(% \widehat{U}_{i})p_{\operatorname{vMF}(F_{k}^{2})}(\Sigma_{i})\exp\left(-\frac{% 1}{2}(\lambda-\mu_{k})^{\top}\mathrm{diag}(s_{k})^{-1}(\lambda-\mu_{k})\right),italic_p ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_k , italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = italic_p start_POSTSUBSCRIPT roman_vMF ( italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT ( over^ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_p start_POSTSUBSCRIPT roman_vMF ( italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT ( roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) roman_exp ( - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_λ - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_diag ( italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_λ - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ) , (16)

where Mult((mk)k{1,,6})Multsubscriptsubscript𝑚𝑘𝑘16\mathrm{Mult}((m_{k})_{k\in\{1,\ldots,6\}})roman_Mult ( ( italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT ) is the multinomial distribution with parameter (mk)k{1,,6}subscriptsubscript𝑚𝑘𝑘16(m_{k})_{k\in\{1,\ldots,6\}}( italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT. In this Bayesian setting, we take a uniform non-informative prior on every parameter. We want to assess if the clustering separates the 6666 actions. To this end, for each environment, the dataset is split into a training set (67% of the data) and a test set (33%). The clustering model is trained with the training set and evaluated on the test set.

The parameter estimation is performed on the training set by adapting the EM algorithm described in [sengupta2017bayesian] to the product space 𝒳𝒳\mathcal{X}caligraphic_X. We denote by (θ^k)k{1,,6}subscriptsubscript^𝜃𝑘𝑘16(\hat{\theta}_{k})_{k\in\{1,\ldots,6\}}( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT the estimated parameters. The cluster label for a point y𝑦yitalic_y in the test set is given by li=argmaxk{1,,6}{p(y|θ^k,Zi=k)}subscript𝑙𝑖subscriptargmax𝑘16𝑝conditional𝑦subscript^𝜃𝑘subscript𝑍𝑖𝑘l_{i}=\operatorname*{arg\,max}_{k\in\{1,\ldots,6\}}\{p(y|\hat{\theta}_{k},Z_{i% }=k)\}italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT { italic_p ( italic_y | over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_k ) }. The related results are reported in Table 7 as “vMF clustering EM”.

The result of the EM algorithm is then used as an initialization for the sampling of the posterior. However, the sampling is only done for some parameters for computational reasons and to highlight the use of GSS on the Stiefel manifold. The vMF distributions parameters (Fk1,Fk2)k{1,,6}subscriptsuperscriptsubscript𝐹𝑘1superscriptsubscript𝐹𝑘2𝑘16(F_{k}^{1},F_{k}^{2})_{k\in\{1,\ldots,6\}}( italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT are parameterized using their SVD Fk1=U¯kD¯kV¯k,Fk2=U~kD~kV~kformulae-sequencesuperscriptsubscript𝐹𝑘1subscript¯𝑈𝑘subscript¯𝐷𝑘superscriptsubscript¯𝑉𝑘topsuperscriptsubscript𝐹𝑘2subscript~𝑈𝑘subscript~𝐷𝑘superscriptsubscript~𝑉𝑘topF_{k}^{1}=\bar{U}_{k}\bar{D}_{k}\bar{V}_{k}^{\top},F_{k}^{2}=\widetilde{U}_{k}% \widetilde{D}_{k}\widetilde{V}_{k}^{\top}italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = over¯ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over¯ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = over~ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over~ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over~ start_ARG italic_V end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT belonging to 𝒱(d,dl)×dl×𝒱(dl,dl)𝒱𝑑subscript𝑑𝑙superscriptsubscript𝑑𝑙𝒱subscript𝑑𝑙subscript𝑑𝑙\mathcal{V}(d,d_{l})\times\mathbb{R}^{d_{l}}\times\mathcal{V}(d_{l},d_{l})caligraphic_V ( italic_d , italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) × blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × caligraphic_V ( italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) and 𝒱(dl,dl)×dl×𝒱(dl,dl)𝒱subscript𝑑𝑙subscript𝑑𝑙superscriptsubscript𝑑𝑙𝒱subscript𝑑𝑙subscript𝑑𝑙\mathcal{V}(d_{l},d_{l})\times\mathbb{R}^{d_{l}}\times\mathcal{V}(d_{l},d_{l})caligraphic_V ( italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) × blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × caligraphic_V ( italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) respectively. Then we approximately sample from the posterior associated with (U¯k,U~k)k{1,,6}subscriptsubscript¯𝑈𝑘subscript~𝑈𝑘𝑘16(\bar{U}_{k},\widetilde{U}_{k})_{k\in\{1,\ldots,6\}}( over¯ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over~ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT using GSS, and the resulting posterior samples (U¯kj,U~kj)k{1,,6}j{1,,N}superscriptsubscriptsuperscriptsubscript¯𝑈𝑘𝑗superscriptsubscript~𝑈𝑘𝑗𝑘16𝑗1𝑁(\bar{U}_{k}^{j},\widetilde{U}_{k}^{j})_{k\in\{1,\ldots,6\}}^{j\in\{1,\ldots,N\}}( over¯ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , over~ start_ARG italic_U end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j ∈ { 1 , … , italic_N } end_POSTSUPERSCRIPT yield pseudo-posterior cluster parameter samples (θ~kj)k{1,,6}j{1,,N}subscriptsuperscriptsuperscriptsubscript~𝜃𝑘𝑗𝑗1𝑁𝑘16(\tilde{\theta}_{k}^{j})^{j\in\{1,\ldots,N\}}_{k\in\{1,\ldots,6\}}( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_j ∈ { 1 , … , italic_N } end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT where N=500𝑁500N=500italic_N = 500 is the number of samples. Then, the samples (θ~kj)k{1,,6}j{1,,N}superscriptsubscriptsuperscriptsubscript~𝜃𝑘𝑗𝑘16𝑗1𝑁(\tilde{\theta}_{k}^{j})_{k\in\{1,\ldots,6\}}^{j\in\{1,\ldots,N\}}( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j ∈ { 1 , … , italic_N } end_POSTSUPERSCRIPT are used to compute Bayesian assignment weights wi,k=jp(yi|θ~kj,Zi=k)/Nsubscript𝑤𝑖𝑘subscript𝑗𝑝conditionalsubscript𝑦𝑖superscriptsubscript~𝜃𝑘𝑗subscript𝑍𝑖𝑘𝑁w_{i,k}=\sum_{j}p(y_{i}|\tilde{\theta}_{k}^{j},Z_{i}=k)/Nitalic_w start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_p ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_k ) / italic_N for each observation yi𝒳subscript𝑦𝑖𝒳y_{i}\in\mathcal{X}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X of the test set and each cluster k𝑘kitalic_k. The label of each observation is finally assigned as li=argmaxk{1,,6}{wi,k}subscript𝑙𝑖subscriptargmax𝑘16subscript𝑤𝑖𝑘l_{i}=\operatorname*{arg\,max}_{k\in\{1,\ldots,6\}}\{w_{i,k}\}italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT { italic_w start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT }. This procedure is referred to as “vMF clustering MCMC” in Table 7.

To demonstrate the benefit of our Bayesian approach, for each observation yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT of the test set, we compute the empirical variance of the sample

Li=(li,j=argmaxk{1,,6}p(yi|θ~kj,Zi=k))j{1,,N}subscript𝐿𝑖subscriptsubscript𝑙𝑖𝑗subscriptargmax𝑘16𝑝conditionalsubscript𝑦𝑖superscriptsubscript~𝜃𝑘𝑗subscript𝑍𝑖𝑘𝑗1𝑁L_{i}=(l_{i,j}=\operatorname*{arg\,max}_{k\in\{1,\ldots,6\}}p(y_{i}|\tilde{% \theta}_{k}^{j},Z_{i}=k))_{j\in\{1,\ldots,N\}}italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_l start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_k ∈ { 1 , … , 6 } end_POSTSUBSCRIPT italic_p ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_k ) ) start_POSTSUBSCRIPT italic_j ∈ { 1 , … , italic_N } end_POSTSUBSCRIPT (17)

and remove the point i𝑖iitalic_i if it is positive. The previous procedure is then applied to this reduced test set. The scores resulting from this procedure referred to as “vMF clustering MCMC lower variance”, are given in Table 7.

Table 7: Clustering results on the KTH action recognition dataset reported with f1 score weighted (%).
Scenario d1 d2 d3 d4
vMF clustering EM : 49.54 56.61 57.99 49.98
vMF clustering MCMC : 52.21 56.61 57.99 49.98
vMF clustering MCMC lower variance : 50.15 57.75 59.62 54.06

The Bayesian approach strengthens the estimation by averaging different plausible weights, as in the ensemble methods [dietterich2000ensemble], which always turns out to be better than the simple maximum likelihood estimator (MLE) in 7.

4 Validity

The aim of this section is to prove the reversibility of the geodesic slice sampler. To this end, we introduce a stepping-out distribution to describe Algorithm 2 and a shrinkage kernel to describe Algorithm 3. Their properties collected in Lemma 11, Lemma 12, Lemma 15 and Lemma 16 intuitively ensure ‘reversibility on every parameterized intersection of geodesic and levelset’. Together with the invariance of the Liouville measure under a certain map on the tangent bundle, both introduced in Section 4.3, they are the essential indigence for the proof of the reversibility of the geodesic slice sampler.

4.1 Stepping-out procedure

Throughout this section fix w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N. We consider a generalization of the stepping-out procedure described in Section 2.1 that targets an arbitrary set 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ), see Algorithm 4.

Algorithm 4 Stepping-out procedure targeting 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ).

Input: point θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R, hyperparameters w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N
Output: two points ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R such that <θ<r𝜃𝑟\ell<\theta<rroman_ℓ < italic_θ < italic_r

1:  Draw ΥUnif([0,w])similar-toΥUnif0𝑤\Upsilon\sim\mathrm{Unif}([0,w])roman_Υ ∼ roman_Unif ( [ 0 , italic_w ] ), call the result u𝑢uitalic_u.
2:  Set :=θuassign𝜃𝑢\ell:=\theta-uroman_ℓ := italic_θ - italic_u and r:=+wassign𝑟𝑤r:=\ell+witalic_r := roman_ℓ + italic_w.
3:  Draw JUnif({1,,m})similar-to𝐽Unif1𝑚J\sim\mathrm{Unif}(\{1,\ldots,m\})italic_J ∼ roman_Unif ( { 1 , … , italic_m } ), call the result ιι\upiotaroman_ι.
4:  Set i=2𝑖2i=2italic_i = 2 and j=2𝑗2j=2italic_j = 2.
5:  while iι𝑖ιi\leqslant\upiotaitalic_i ⩽ roman_ι and 𝖲𝖲\ell\in\mathsf{S}roman_ℓ ∈ sansserif_S do
6:     Set =w𝑤\ell=\ell-wroman_ℓ = roman_ℓ - italic_w.
7:     Update i=i+1𝑖𝑖1i=i+1italic_i = italic_i + 1.
8:  end while
9:  while jm+1ι𝑗𝑚1ιj\leqslant m+1-\upiotaitalic_j ⩽ italic_m + 1 - roman_ι and r𝖲𝑟𝖲r\in\mathsf{S}italic_r ∈ sansserif_S do
10:     Set r=r+w𝑟𝑟𝑤r=r+witalic_r = italic_r + italic_w.
11:     Update j=j+1𝑗𝑗1j=j+1italic_j = italic_j + 1.
12:  end while
13:  return  (,r)𝑟(\ell,r)( roman_ℓ , italic_r )

To formally describe the resulting distribution on 2superscript2\mathbb{R}^{2}blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT we use stopped random variables. Let ΥUnif([0,w])similar-toΥUnif0𝑤\Upsilon\sim\mathrm{Unif}([0,w])roman_Υ ∼ roman_Unif ( [ 0 , italic_w ] ). For every θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R let

Li(θ):=θΥ(i1)w,i,Rj(θ):=θ+(wΥ)+(j1)w=θΥ+jw,j,\begin{split}L_{i}^{(\theta)}&:=\theta-\Upsilon-(i-1)w,\qquad i\in\mathbb{N},% \\ R_{j}^{(\theta)}&:=\theta+(w-\Upsilon)+(j-1)w=\theta-\Upsilon+jw,\qquad j\in% \mathbb{N},\end{split}start_ROW start_CELL italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_CELL start_CELL := italic_θ - roman_Υ - ( italic_i - 1 ) italic_w , italic_i ∈ blackboard_N , end_CELL end_ROW start_ROW start_CELL italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_CELL start_CELL := italic_θ + ( italic_w - roman_Υ ) + ( italic_j - 1 ) italic_w = italic_θ - roman_Υ + italic_j italic_w , italic_j ∈ blackboard_N , end_CELL end_ROW (18)

be two sequences of random variables. Observe that setting

L0(θ):=R1(θ)=θΥ(1)wR0(θ):=L1(θ)=θΥ+0wassignsuperscriptsubscript𝐿0𝜃superscriptsubscript𝑅1𝜃𝜃Υ1𝑤superscriptsubscript𝑅0𝜃assignsuperscriptsubscript𝐿1𝜃𝜃Υ0𝑤\begin{split}L_{0}^{(\theta)}&:=R_{1}^{(\theta)}=\theta-\Upsilon-(-1)w\\ R_{0}^{(\theta)}&:=L_{1}^{(\theta)}=\theta-\Upsilon+0\cdot w\end{split}start_ROW start_CELL italic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_CELL start_CELL := italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_θ - roman_Υ - ( - 1 ) italic_w end_CELL end_ROW start_ROW start_CELL italic_R start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_CELL start_CELL := italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_θ - roman_Υ + 0 ⋅ italic_w end_CELL end_ROW (19)

is consistent with this definition. By construction both sequences are strictly monotone, i.e.,

Li+1(θ)=θΥiw<θΥ(i1)w=Li(θ),i0,θ,Rj(θ)=θΥ+jw<θΥ+(j+1)w=Rj+1(θ),j0,θ.\begin{split}L_{i+1}^{(\theta)}=\theta-\Upsilon-iw&<\theta-\Upsilon-(i-1)w=L_{% i}^{(\theta)},\qquad i\in\mathbb{N}_{0},\theta\in\mathbb{R},\\ R_{j}^{(\theta)}=\theta-\Upsilon+jw&<\theta-\Upsilon+(j+1)w=R_{j+1}^{(\theta)}% ,\qquad j\in\mathbb{N}_{0},\theta\in\mathbb{R}.\end{split}start_ROW start_CELL italic_L start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_θ - roman_Υ - italic_i italic_w end_CELL start_CELL < italic_θ - roman_Υ - ( italic_i - 1 ) italic_w = italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_i ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ ∈ blackboard_R , end_CELL end_ROW start_ROW start_CELL italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_θ - roman_Υ + italic_j italic_w end_CELL start_CELL < italic_θ - roman_Υ + ( italic_j + 1 ) italic_w = italic_R start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_j ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ ∈ blackboard_R . end_CELL end_ROW (20)

We now define appropriate stopping times depending on the target set 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ). To this end, let J𝐽Jitalic_J be a random variable that is independent of all previous objects satisfying JUnif({1,,m})similar-to𝐽Unif1𝑚J\sim\mathrm{Unif}(\{1,\ldots,m\})italic_J ∼ roman_Unif ( { 1 , … , italic_m } ), and set

τ𝖲(θ):=inf{iLi(θ)𝖲}J,𝔗𝖲(θ):=inf{jRj(θ)𝖲}(m+1J),θ,𝖲().\begin{split}\tau_{\mathsf{S}}^{(\theta)}&:=\inf\{i\in\mathbb{N}\mid L_{i}^{(% \theta)}\notin\mathsf{S}\}\land J,\\ \mathfrak{T}_{\mathsf{S}}^{(\theta)}&:=\inf\{j\in\mathbb{N}\mid R_{j}^{(\theta% )}\notin\mathsf{S}\}\land(m+1-J),\qquad\theta\in\mathbb{R},\mathsf{S}\in% \mathcal{B}(\mathbb{R}).\end{split}start_ROW start_CELL italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_CELL start_CELL := roman_inf { italic_i ∈ blackboard_N ∣ italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∉ sansserif_S } ∧ italic_J , end_CELL end_ROW start_ROW start_CELL fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_CELL start_CELL := roman_inf { italic_j ∈ blackboard_N ∣ italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∉ sansserif_S } ∧ ( italic_m + 1 - italic_J ) , italic_θ ∈ blackboard_R , sansserif_S ∈ caligraphic_B ( blackboard_R ) . end_CELL end_ROW (21)

Note that τ𝖲(θ)superscriptsubscript𝜏𝖲𝜃\tau_{\mathsf{S}}^{(\theta)}italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT and 𝔗𝖲(θ)superscriptsubscript𝔗𝖲𝜃\mathfrak{T}_{\mathsf{S}}^{(\theta)}fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT are finite stopping times with respect to the filtration generated by the sequence (J,L1(θ),,Ln(θ),R1(θ),,Rn(θ))nsubscript𝐽superscriptsubscript𝐿1𝜃superscriptsubscript𝐿𝑛𝜃superscriptsubscript𝑅1𝜃superscriptsubscript𝑅𝑛𝜃𝑛(J,L_{1}^{(\theta)},\ldots,L_{n}^{(\theta)},R_{1}^{(\theta)},\ldots,R_{n}^{(% \theta)})_{n\in\mathbb{N}}( italic_J , italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT. More precisely, we have the bounds

1τ𝖲(θ)Jm,1𝔗𝖲(θ)m+1Jm+11=m,2τ𝖲(θ)+𝔗𝖲(θ)J+m+1Jm+1.formulae-sequence1superscriptsubscript𝜏𝖲𝜃𝐽𝑚1superscriptsubscript𝔗𝖲𝜃𝑚1𝐽𝑚11𝑚2superscriptsubscript𝜏𝖲𝜃superscriptsubscript𝔗𝖲𝜃𝐽𝑚1𝐽𝑚1\begin{split}1\leqslant\tau_{\mathsf{S}}^{(\theta)}\leqslant J\leqslant m,\\ 1\leqslant\mathfrak{T}_{\mathsf{S}}^{(\theta)}\leqslant m+1-J\leqslant m+1-1=m% ,\\ 2\leqslant\tau_{\mathsf{S}}^{(\theta)}+\mathfrak{T}_{\mathsf{S}}^{(\theta)}% \leqslant J+m+1-J\leqslant m+1.\end{split}start_ROW start_CELL 1 ⩽ italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩽ italic_J ⩽ italic_m , end_CELL end_ROW start_ROW start_CELL 1 ⩽ fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩽ italic_m + 1 - italic_J ⩽ italic_m + 1 - 1 = italic_m , end_CELL end_ROW start_ROW start_CELL 2 ⩽ italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT + fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩽ italic_J + italic_m + 1 - italic_J ⩽ italic_m + 1 . end_CELL end_ROW (22)

As τ𝖲(θ)superscriptsubscript𝜏𝖲𝜃\tau_{\mathsf{S}}^{(\theta)}italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT and 𝔗𝖲(θ)superscriptsubscript𝔗𝖲𝜃\mathfrak{T}_{\mathsf{S}}^{(\theta)}fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT are finite stopping times,

(𝑳𝖲(θ),𝑹𝖲(θ)):=(Lτ𝖲(θ)(θ),R𝔗𝖲(θ)(θ))assignsuperscriptsubscript𝑳𝖲𝜃superscriptsubscript𝑹𝖲𝜃superscriptsubscript𝐿superscriptsubscript𝜏𝖲𝜃𝜃superscriptsubscript𝑅superscriptsubscript𝔗𝖲𝜃𝜃\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)},\boldsymbol{R}_{\mathsf{S}}^{(% \theta)}\right):=\left(L_{\tau_{\mathsf{S}}^{(\theta)}}^{(\theta)},R_{% \mathfrak{T}_{\mathsf{S}}^{(\theta)}}^{(\theta)}\right)( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) := ( italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) (23)

are random variables for all θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R and 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ), see [Klenke, Lemma 9.23]. We define the stepping-out distributions

ξ𝖲(θ):=(𝑳𝖲(θ),𝑹𝖲(θ)),θ,𝖲(),formulae-sequenceassignsuperscriptsubscript𝜉𝖲𝜃superscriptsuperscriptsubscript𝑳𝖲𝜃superscriptsubscript𝑹𝖲𝜃formulae-sequence𝜃𝖲\xi_{\mathsf{S}}^{(\theta)}:=\mathbb{P}^{\big{(}\boldsymbol{L}_{\mathsf{S}}^{(% \theta)},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\big{)}},\qquad\theta\in\mathbb% {R},\mathsf{S}\in\mathcal{B}(\mathbb{R}),italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT := blackboard_P start_POSTSUPERSCRIPT ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT , italic_θ ∈ blackboard_R , sansserif_S ∈ caligraphic_B ( blackboard_R ) ,

on (2,(2))superscript2superscript2(\mathbb{R}^{2},\mathcal{B}(\mathbb{R}^{2}))( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ). Observe that the output of Algorithm 4 has distribution ξ𝖲(θ)superscriptsubscript𝜉𝖲𝜃\xi_{\mathsf{S}}^{(\theta)}italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT. Since Algorithm 2 is a special case of Algorithm 4 with 𝖲=L(x,v,t)𝖲𝐿𝑥𝑣𝑡\mathsf{S}=L(x,v,t)sansserif_S = italic_L ( italic_x , italic_v , italic_t ) and θ=0𝜃0\theta=0italic_θ = 0, this definition is coherent with (5). Moreover, note that (θ,C)ξ𝖲(θ)(C)maps-to𝜃𝐶superscriptsubscript𝜉𝖲𝜃𝐶(\theta,C)\mapsto\xi_{\mathsf{S}}^{(\theta)}(C)( italic_θ , italic_C ) ↦ italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( italic_C ) is a transition kernel. More details on this can be found in Remark 21.

We collect two properties of the stepping-out distribution that are useful to show reversibility of the geodesic slice sampler. Their proof is postponed to Appendix A.

Lemma 11.

Let 𝖲(),𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R}),sansserif_S ∈ caligraphic_B ( blackboard_R ) , and let f:2[0,):𝑓superscript20f:\mathbb{R}^{2}\to[0,\infty)italic_f : blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → [ 0 , ∞ ) be a measurable function. We have for all θ,α𝜃𝛼\theta,\alpha\in\mathbb{R}italic_θ , italic_α ∈ blackboard_R

2f(,r)𝟙(,r)(α)ξ𝖲(θ)(d(,r))=2f(,r)𝟙(,r)(θ)ξ𝖲(α)(d(,r)).subscriptsuperscript2𝑓𝑟subscript1𝑟𝛼superscriptsubscript𝜉𝖲𝜃d𝑟subscriptsuperscript2𝑓𝑟subscript1𝑟𝜃superscriptsubscript𝜉𝖲𝛼d𝑟\displaystyle\int_{\mathbb{R}^{2}}f(\ell,r)\mathbbm{1}_{(\ell,r)}(\alpha)\ \xi% _{\mathsf{S}}^{(\theta)}\big{(}{\rm d}(\ell,r)\big{)}=\int_{\mathbb{R}^{2}}f(% \ell,r)\mathbbm{1}_{(\ell,r)}(\theta)\ \xi_{\mathsf{S}}^{(\alpha)}\big{(}{\rm d% }(\ell,r)\big{)}.∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( roman_ℓ , italic_r ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( italic_α ) italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) = ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( roman_ℓ , italic_r ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( italic_θ ) italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) .

The previous lemma essentially states that, conditioned on the event that the initial point lies inside the resulting interval, the stepping-out distribution does not depend on the initial point. To formulate the second property we need the collection

Λα:,θαθ:subscriptΛ𝛼formulae-sequencemaps-to𝜃𝛼𝜃\Lambda_{\alpha}:\mathbb{R}\to\mathbb{R},\qquad\theta\mapsto\alpha-\thetaroman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT : blackboard_R → blackboard_R , italic_θ ↦ italic_α - italic_θ (24)

of linear functions index by α𝛼\alpha\in\mathbb{R}italic_α ∈ blackboard_R. Intuitively, they express a U-turn at α𝛼\alpha\in\mathbb{R}italic_α ∈ blackboard_R. Moreover, for α𝛼\alpha\in\mathbb{R}italic_α ∈ blackboard_R we define

λα:22,(,r)(αr,α).:subscriptλ𝛼formulae-sequencesuperscript2superscript2maps-to𝑟𝛼𝑟𝛼\uplambda_{\alpha}:\mathbb{R}^{2}\to\mathbb{R}^{2},\qquad(\ell,r)\mapsto(% \alpha-r,\alpha-\ell).roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ( roman_ℓ , italic_r ) ↦ ( italic_α - italic_r , italic_α - roman_ℓ ) . (25)

The next lemma describes the behavior of the stepping-out distribution under U-turns.

Lemma 12.

Let 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ) and θ,α𝜃𝛼\theta,\alpha\in\mathbb{R}italic_θ , italic_α ∈ blackboard_R. We have

ξΛα(𝖲)(θ)=(λα)ξ𝖲(Λα(θ)).superscriptsubscript𝜉subscriptΛ𝛼𝖲𝜃subscriptsubscriptλ𝛼superscriptsubscript𝜉𝖲subscriptΛ𝛼𝜃\xi_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}=(\uplambda_{\alpha})_{\sharp}\xi% _{\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}.italic_ξ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = ( roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT ♯ end_POSTSUBSCRIPT italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT .

4.2 Shrinkage procedure

In this section, for every half open interval [,r)𝑟[\ell,r)\subseteq\mathbb{R}[ roman_ℓ , italic_r ) ⊆ blackboard_R, we introduce an algorithm that generalizes Algorithm 3 approximating the uniform distribution on 𝖲(,r)𝖲𝑟\mathsf{S}\cap(\ell,r)sansserif_S ∩ ( roman_ℓ , italic_r ) for an arbitrary open set 𝖲()𝖲\mathsf{S}\in\mathcal{B}\big{(}\mathbb{R}\big{)}sansserif_S ∈ caligraphic_B ( blackboard_R ). To express this scheme as a kernel, we employ the kernel of the shrinkage procedure, essentially operating on 𝕊1superscript𝕊1\mathbb{S}^{1}blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT parameterized by [0,2π)02𝜋[0,2\pi)[ 0 , 2 italic_π ), introduced in [ReversibilityEllipticalSliceSampler] and push it forward to an arbitrary interval [,r)𝑟[\ell,r)[ roman_ℓ , italic_r ) as described in [rudolf2022robust, Appendix A]. For the convenience of the reader we quickly sketch the kernel Q\textgothSsubscript𝑄\textgoth𝑆Q_{\textgoth}{S}italic_Q start_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_S from [ReversibilityEllipticalSliceSampler]. To this end set

𝕁(α,θ):={[α,θ),α<θ,[0,θ)[α,2π),αθ.assign𝕁𝛼𝜃cases𝛼𝜃𝛼𝜃0𝜃𝛼2𝜋𝛼𝜃\mathbb{J}(\alpha,\theta):=\begin{dcases}[\alpha,\theta),&\alpha<\theta,\\ [0,\theta)\cup[\alpha,2\pi),&\alpha\geqslant\theta.\end{dcases}blackboard_J ( italic_α , italic_θ ) := { start_ROW start_CELL [ italic_α , italic_θ ) , end_CELL start_CELL italic_α < italic_θ , end_CELL end_ROW start_ROW start_CELL [ 0 , italic_θ ) ∪ [ italic_α , 2 italic_π ) , end_CELL start_CELL italic_α ⩾ italic_θ . end_CELL end_ROW

We denote by δzsubscript𝛿𝑧\delta_{z}italic_δ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT the Dirac measure at z𝑧z\in\mathbb{R}italic_z ∈ blackboard_R. Let ΘΘ\Upthetaroman_Θ be a random variable on [0,2π)02𝜋[0,2\pi)[ 0 , 2 italic_π ), and let (Γn,Γnmin,Γnmax)nsubscriptsubscriptΓ𝑛superscriptsubscriptΓ𝑛superscriptsubscriptΓ𝑛𝑛(\Upgamma_{n},\Upgamma_{n}^{\min},\Upgamma_{n}^{\max})_{n\in\mathbb{N}}( roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT be a sequence of random variables with conditional distributions

((Γn+1,Γn+1min,Γn+1max)𝖢(Γn,Γnmin,Γnmax)=(z,zmin,zmax),Θ=θ)\displaystyle\mathbb{P}\big{(}(\Upgamma_{n+1},\Upgamma_{n+1}^{\min},\Upgamma_{% n+1}^{\max})\in\mathsf{C}\mid(\Upgamma_{n},\Upgamma_{n}^{\min},\Upgamma_{n}^{% \max})=(z,z^{\min},z^{\max}),\Uptheta=\theta\big{)}blackboard_P ( ( roman_Γ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) ∈ sansserif_C ∣ ( roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) = ( italic_z , italic_z start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , italic_z start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) , roman_Θ = italic_θ )
=𝟙𝕁(z,zmax)(θ)(Unif(𝕁(z,zmax))δzδzmax)(𝖢)absentsubscript1𝕁𝑧superscript𝑧𝜃tensor-productUnif𝕁𝑧superscript𝑧subscript𝛿𝑧subscript𝛿superscript𝑧𝖢\displaystyle\qquad=\mathbbm{1}_{\mathbb{J}(z,z^{\max})}(\theta)\cdot\left(% \mathrm{Unif}\big{(}\mathbb{J}(z,z^{\max})\big{)}\otimes\delta_{z}\otimes% \delta_{z^{\max}}\right)(\mathsf{C})= blackboard_1 start_POSTSUBSCRIPT blackboard_J ( italic_z , italic_z start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT ( italic_θ ) ⋅ ( roman_Unif ( blackboard_J ( italic_z , italic_z start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) ) ⊗ italic_δ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT ⊗ italic_δ start_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ( sansserif_C )
+𝟙𝕁(zmin,z)(θ)(Unif(𝕁(zmin,z))δzminδz)(𝖢)subscript1𝕁superscript𝑧𝑧𝜃tensor-productUnif𝕁superscript𝑧𝑧subscript𝛿superscript𝑧subscript𝛿𝑧𝖢\displaystyle\qquad\qquad+\mathbbm{1}_{\mathbb{J}(z^{\min},z)}(\theta)\cdot% \left(\mathrm{Unif}\big{(}\mathbb{J}(z^{\min},z)\big{)}\otimes\delta_{z^{\min}% }\otimes\delta_{z}\right)(\mathsf{C})+ blackboard_1 start_POSTSUBSCRIPT blackboard_J ( italic_z start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , italic_z ) end_POSTSUBSCRIPT ( italic_θ ) ⋅ ( roman_Unif ( blackboard_J ( italic_z start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , italic_z ) ) ⊗ italic_δ start_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⊗ italic_δ start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT ) ( sansserif_C )

for θ,z,zmin,zmax[0,2π)𝜃𝑧superscript𝑧superscript𝑧02𝜋\theta,z,z^{\min},z^{\max}\in[0,2\pi)italic_θ , italic_z , italic_z start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , italic_z start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ∈ [ 0 , 2 italic_π ) with θ,z𝕁(zmin,zmax)𝜃𝑧𝕁superscript𝑧superscript𝑧\theta,z\in\mathbb{J}(z^{\min},z^{\max})italic_θ , italic_z ∈ blackboard_J ( italic_z start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , italic_z start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ), 𝖢([0,2π)3)𝖢superscript02𝜋3\mathsf{C}\in\mathcal{B}([0,2\pi)^{3})sansserif_C ∈ caligraphic_B ( [ 0 , 2 italic_π ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) and n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N. Moreover, for every set \textgothS([0,2π))\textgoth𝑆02𝜋\textgoth{S}\in\mathcal{B}([0,2\pi))italic_S ∈ caligraphic_B ( [ 0 , 2 italic_π ) ) which is open in 𝕊1superscript𝕊1\mathbb{S}^{1}blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT, i.e., satisfies that for all θ\textgothS𝜃\textgoth𝑆\theta\in\textgoth{S}italic_θ ∈ italic_S there exists ε>0𝜀0\varepsilon>0italic_ε > 0 such that {αmod2π|αθ|<ε,α}\textgothSconditional-setmodulo𝛼2𝜋formulae-sequence𝛼𝜃𝜀𝛼\textgoth𝑆\{\alpha\mod 2\pi\mid|\alpha-\theta|<\varepsilon,\alpha\in\mathbb{R}\}% \subseteq\textgoth{S}{ italic_α roman_mod 2 italic_π ∣ | italic_α - italic_θ | < italic_ε , italic_α ∈ blackboard_R } ⊆ italic_S, define the stopping time

𝒯\textgothS:=inf{nΓn\textgothS}.assignsubscript𝒯\textgoth𝑆infimumconditional-set𝑛subscriptΓ𝑛\textgoth𝑆\mathcal{T}_{\textgoth}{S}:=\inf\{n\in\mathbb{N}\mid\Upgamma_{n}\in\textgoth{S% }\}.caligraphic_T start_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_S := roman_inf { italic_n ∈ blackboard_N ∣ roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ italic_S } .

Then the kernel of the shrinkage procedure targeting Unif(\textgothS)Unif\textgoth𝑆\mathrm{Unif}(\textgoth{S})roman_Unif ( italic_S ) is given by

Q\textgothS:\textgothS×(\textgothS)[0,1],(θ,𝖠)(Γ𝒯\textgothSA,𝒯\textgothS<Θ=θ).:subscript𝑄\textgoth𝑆formulae-sequence\textgoth𝑆\textgoth𝑆01maps-to𝜃𝖠formulae-sequencesubscriptΓsubscript𝒯\textgoth𝑆𝐴subscript𝒯\textgoth𝑆braΘ𝜃Q_{\textgoth{S}}:\textgoth{S}\times\mathcal{B}(\textgoth{S})\to[0,1],\quad(% \theta,\mathsf{A})\mapsto\mathbb{P}\left(\Upgamma_{\mathcal{T}_{\textgoth}{S}}% \in A,\mathcal{T}_{\textgoth}{S}<\infty\mid\Uptheta=\theta\right).italic_Q start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT : italic_S × caligraphic_B ( italic_S ) → [ 0 , 1 ] , ( italic_θ , sansserif_A ) ↦ blackboard_P ( roman_Γ start_POSTSUBSCRIPT caligraphic_T start_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ∈ italic_A , caligraphic_T start_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_S < ∞ ∣ roman_Θ = italic_θ ) .

To \ldqbend the interval [,r)𝑟[\ell,r)[ roman_ℓ , italic_r ) onto 𝕊1superscript𝕊1\mathbb{S}^{1}blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT\rdq, where ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R such that <r𝑟\ell<rroman_ℓ < italic_r, we use the family of maps

h,r:[,r)[0,2π),θ2πrθmod2π.:subscript𝑟formulae-sequence𝑟02𝜋maps-to𝜃modulo2𝜋𝑟𝜃2𝜋\displaystyle h_{\ell,r}:[\ell,r)\to[0,2\pi),\qquad\theta\mapsto\frac{2\pi}{r-% \ell}\theta\ \mod 2\pi.italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT : [ roman_ℓ , italic_r ) → [ 0 , 2 italic_π ) , italic_θ ↦ divide start_ARG 2 italic_π end_ARG start_ARG italic_r - roman_ℓ end_ARG italic_θ roman_mod 2 italic_π .

Note that due to the restriction of the domain, these functions are bijective and therefore have inverses h,r1superscriptsubscript𝑟1h_{\ell,r}^{-1}italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Let ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R such that <r𝑟\ell<rroman_ℓ < italic_r, and let 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ) be an open set such that (,r)𝖲𝑟𝖲(\ell,r)\cap\mathsf{S}\neq\emptyset( roman_ℓ , italic_r ) ∩ sansserif_S ≠ ∅. For such numbers ,r𝑟\ell,rroman_ℓ , italic_r and sets 𝖲𝖲\mathsf{S}sansserif_S we define the shrinkage kernel as the push forward of the kernel Q\textgothSsubscript𝑄\textgoth𝑆Q_{\textgoth{S}}italic_Q start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT for \textgothS=h,r(𝖲(,r))\textgoth𝑆subscript𝑟𝖲𝑟\textgoth{S}=h_{\ell,r}(\mathsf{S}\cap(\ell,r))italic_S = italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) )101010Observe that h,r(𝖲(,r))subscript𝑟𝖲𝑟h_{\ell,r}(\mathsf{S}\cap(\ell,r))italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) is open in 𝕊1superscript𝕊1\mathbb{S}^{1}blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT, since 𝖲(,r)𝖲𝑟\mathsf{S}\cap(\ell,r)sansserif_S ∩ ( roman_ℓ , italic_r ) is open as a set in \mathbb{R}blackboard_R, and non-empty by assumption. under h,r1superscriptsubscript𝑟1h_{\ell,r}^{-1}italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, i.e.,

Q𝖲,r(θ,𝖠):=Qh,r(𝖲(,r))(h,r(θ),h,r(𝖠)),θ𝖲(,r),𝖠(𝖲(,r)).formulae-sequenceassignsuperscriptsubscript𝑄𝖲𝑟𝜃𝖠subscript𝑄subscript𝑟𝖲𝑟subscript𝑟𝜃subscript𝑟𝖠formulae-sequence𝜃𝖲𝑟𝖠𝖲𝑟\displaystyle Q_{\mathsf{S}}^{\ell,r}(\theta,\mathsf{A}):=Q_{h_{\ell,r}(% \mathsf{S}\cap(\ell,r))}\big{(}h_{\ell,r}(\theta),h_{\ell,r}(\mathsf{A})\big{)% },\qquad\theta\in\mathsf{S}\cap(\ell,r),\mathsf{A}\in\mathcal{B}\big{(}\mathsf% {S}\cap(\ell,r)\big{)}.italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_θ , sansserif_A ) := italic_Q start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_θ ) , italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_A ) ) , italic_θ ∈ sansserif_S ∩ ( roman_ℓ , italic_r ) , sansserif_A ∈ caligraphic_B ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) .

Observe that this agrees with the definition made in (6), where we have θ=0𝜃0\theta=0italic_θ = 0 and 𝖲=L(x,v,t)𝖲𝐿𝑥𝑣𝑡\mathsf{S}=L(x,v,t)sansserif_S = italic_L ( italic_x , italic_v , italic_t ) for x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT and t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ).

We briefly discuss the measureability of the shrinkage kernel in the arguments (θ,,r)𝜃𝑟(\theta,\ell,r)( italic_θ , roman_ℓ , italic_r ).

Remark 13.

Let L𝐿Litalic_L and R𝑅Ritalic_R be two random variables independent of ΘΘ\Thetaroman_Θ and (Γn,Γnmin,Γnmax)nsubscriptsubscriptΓ𝑛superscriptsubscriptΓ𝑛superscriptsubscriptΓ𝑛𝑛(\Upgamma_{n},\Upgamma_{n}^{\min},\Upgamma_{n}^{\max})_{n\in\mathbb{N}}( roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_min end_POSTSUPERSCRIPT , roman_Γ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT satisfying L<R𝐿𝑅L<Ritalic_L < italic_R almost surely, and let 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ), 𝖠(𝖲)𝖠𝖲\mathsf{A}\in\mathcal{B}(\mathsf{S})sansserif_A ∈ caligraphic_B ( sansserif_S ). By a disintegration argument, we have for all θ[0,2π)𝜃02𝜋\theta\in[0,2\pi)italic_θ ∈ [ 0 , 2 italic_π ), ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R that

fk(θ,,r)subscript𝑓𝑘𝜃𝑟\displaystyle f_{k}(\theta,\ell,r)italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_θ , roman_ℓ , italic_r ) :=𝔼(𝟙hl,r(𝖠𝖲(l,r))(Γk)i=1k1𝟙[0,2π)hl,r(𝖲(l,r))(Γi)Θ=θ)assignabsent𝔼conditionalsubscript1subscript𝑙𝑟𝖠𝖲𝑙𝑟subscriptΓ𝑘superscriptsubscriptproduct𝑖1𝑘1subscript102𝜋subscript𝑙𝑟𝖲𝑙𝑟subscriptΓ𝑖Θ𝜃\displaystyle:=\mathbb{E}\left(\mathbbm{1}_{h_{l,r}(\mathsf{A}\cap\mathsf{S}% \cap(l,r))}(\Gamma_{k})\prod_{i=1}^{k-1}\mathbbm{1}_{[0,2\pi)\setminus h_{l,r}% (\mathsf{S}\cap(l,r))}(\Gamma_{i})\mid\Theta=\theta\right):= blackboard_E ( blackboard_1 start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_l , italic_r end_POSTSUBSCRIPT ( sansserif_A ∩ sansserif_S ∩ ( italic_l , italic_r ) ) end_POSTSUBSCRIPT ( roman_Γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT [ 0 , 2 italic_π ) ∖ italic_h start_POSTSUBSCRIPT italic_l , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( italic_l , italic_r ) ) end_POSTSUBSCRIPT ( roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∣ roman_Θ = italic_θ )
=𝔼(𝟙hL,R(𝖠𝖲(L,R))(Γk)i=1k1𝟙[0,2π)hL,R(𝖲(L,R))(Γi)Θ=θ,L=,R=r).absent𝔼formulae-sequenceconditionalsubscript1subscript𝐿𝑅𝖠𝖲𝐿𝑅subscriptΓ𝑘superscriptsubscriptproduct𝑖1𝑘1subscript102𝜋subscript𝐿𝑅𝖲𝐿𝑅subscriptΓ𝑖Θ𝜃formulae-sequence𝐿𝑅𝑟\displaystyle=\mathbb{E}\left(\mathbbm{1}_{h_{L,R}(\mathsf{A}\cap\mathsf{S}% \cap(L,R))}(\Gamma_{k})\prod_{i=1}^{k-1}\mathbbm{1}_{[0,2\pi)\setminus h_{L,R}% (\mathsf{S}\cap(L,R))}(\Gamma_{i})\mid\Theta=\theta,L=\ell,R=r\right).= blackboard_E ( blackboard_1 start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_L , italic_R end_POSTSUBSCRIPT ( sansserif_A ∩ sansserif_S ∩ ( italic_L , italic_R ) ) end_POSTSUBSCRIPT ( roman_Γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT [ 0 , 2 italic_π ) ∖ italic_h start_POSTSUBSCRIPT italic_L , italic_R end_POSTSUBSCRIPT ( sansserif_S ∩ ( italic_L , italic_R ) ) end_POSTSUBSCRIPT ( roman_Γ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∣ roman_Θ = italic_θ , italic_L = roman_ℓ , italic_R = italic_r ) .

Therefore

Q𝖲,r(θ,𝖠(,r))=k=1fk(h,r(θ),,r),θ𝖲(,r),<r,formulae-sequencesuperscriptsubscript𝑄𝖲𝑟𝜃𝖠𝑟superscriptsubscript𝑘1subscript𝑓𝑘subscript𝑟𝜃𝑟formulae-sequence𝜃𝖲𝑟𝑟Q_{\mathsf{S}}^{\ell,r}(\theta,\mathsf{A}\cap(\ell,r))=\sum_{k=1}^{\infty}f_{k% }\left(h_{\ell,r}(\theta),\ell,r\right),\qquad\theta\in\mathsf{S}\cap(\ell,r),% \ell<r,italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_θ , sansserif_A ∩ ( roman_ℓ , italic_r ) ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_θ ) , roman_ℓ , italic_r ) , italic_θ ∈ sansserif_S ∩ ( roman_ℓ , italic_r ) , roman_ℓ < italic_r ,

is measurable in (θ,,r)𝜃𝑟(\theta,\ell,r)( italic_θ , roman_ℓ , italic_r ). The equality above holds by definition of the shrinkage kernel, see also [ReversibilityEllipticalSliceSampler, Proof of Theorem 2.9]. Consequently

{(θ,,r)3<r,θ𝖲(,r)}×(𝖲)[0,1],((θ,,r),B)Q𝖲,r(θ,𝖡(,r))formulae-sequenceconditional-set𝜃𝑟superscript3formulae-sequence𝑟𝜃𝖲𝑟𝖲01maps-to𝜃𝑟𝐵superscriptsubscript𝑄𝖲𝑟𝜃𝖡𝑟\{(\theta,\ell,r)\in\mathbb{R}^{3}\mid\ell<r,\theta\in\mathsf{S}\cap(\ell,r)\}% \times\mathcal{B}(\mathsf{S})\to[0,1],\qquad\big{(}(\theta,\ell,r),B\big{)}% \mapsto Q_{\mathsf{S}}^{\ell,r}(\theta,\mathsf{B}\cap(\ell,r)){ ( italic_θ , roman_ℓ , italic_r ) ∈ blackboard_R start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ∣ roman_ℓ < italic_r , italic_θ ∈ sansserif_S ∩ ( roman_ℓ , italic_r ) } × caligraphic_B ( sansserif_S ) → [ 0 , 1 ] , ( ( italic_θ , roman_ℓ , italic_r ) , italic_B ) ↦ italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_θ , sansserif_B ∩ ( roman_ℓ , italic_r ) )

is a transition kernel.

Remark 14.

Note that the combination of stepping-out distribution and shrinkage kernel as in (7) is valid, i.e., the random interval generated by the stepping-out procedure can be used as an input for the shrinkage procedure. To see this, fix x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT and t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ). Let 𝑳L(x,v,t)(0)superscriptsubscript𝑳𝐿𝑥𝑣𝑡0\boldsymbol{L}_{L(x,v,t)}^{(0)}bold_italic_L start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT and 𝑹L(x,v,t)(0)superscriptsubscript𝑹𝐿𝑥𝑣𝑡0\boldsymbol{R}_{L(x,v,t)}^{(0)}bold_italic_R start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT be as in (23). We need to verify that

  • L(x,v,t)𝐿𝑥𝑣𝑡L(x,v,t)italic_L ( italic_x , italic_v , italic_t ) is open,

  • 0L(x,v,t)(𝑳L(x,v,t)(0),𝑹L(x,v,t)(0))0𝐿𝑥𝑣𝑡superscriptsubscript𝑳𝐿𝑥𝑣𝑡0superscriptsubscript𝑹𝐿𝑥𝑣𝑡00\in L(x,v,t)\cap\big{(}\boldsymbol{L}_{L(x,v,t)}^{(0)},\boldsymbol{R}_{L(x,v,% t)}^{(0)}\big{)}0 ∈ italic_L ( italic_x , italic_v , italic_t ) ∩ ( bold_italic_L start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , bold_italic_R start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) almost surely. In particular this implies that this intersection is almost surely non-empty.

The lower semi-continuity of p𝑝pitalic_p yields that L(x,v,t)𝐿𝑥𝑣𝑡L(x,v,t)italic_L ( italic_x , italic_v , italic_t ) is open. Since t(0,p(x))𝑡0𝑝𝑥t\in(0,p(x))italic_t ∈ ( 0 , italic_p ( italic_x ) ), we have 0L(x,v,t)0𝐿𝑥𝑣𝑡0\in L(x,v,t)0 ∈ italic_L ( italic_x , italic_v , italic_t ). Moreover, we have 0(𝑳L(x,v,t)(0),𝑹L(x,v,t)(0))0superscriptsubscript𝑳𝐿𝑥𝑣𝑡0superscriptsubscript𝑹𝐿𝑥𝑣𝑡00\in\big{(}\boldsymbol{L}_{L(x,v,t)}^{(0)},\boldsymbol{R}_{L(x,v,t)}^{(0)}\big% {)}0 ∈ ( bold_italic_L start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , bold_italic_R start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ) almost surely by construction. Therefore 0 is also almost surely contained in the intersection of these two sets.

We provide two properties of the shrinkage kernel Q𝖲,rsuperscriptsubscript𝑄𝖲𝑟Q_{\mathsf{S}}^{\ell,r}italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT that are useful to derive the reversibility of the geodesic slice sampler. Both are essentially extensions of corresponding properties of Q\textgothSsubscript𝑄\textgoth𝑆Q_{\textgoth{S}}italic_Q start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT.

Lemma 15.

Let ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R and 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ) an open set such that (,r)𝖲𝑟𝖲(\ell,r)\cap\mathsf{S}\neq\emptyset( roman_ℓ , italic_r ) ∩ sansserif_S ≠ ∅. Then the kernel Q𝖲,rsuperscriptsubscript𝑄𝖲𝑟Q_{\mathsf{S}}^{\ell,r}italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT is reversible with respect to Unif(𝖲(,r))Unif𝖲𝑟\mathrm{Unif}\big{(}\mathsf{S}\cap(\ell,r)\big{)}roman_Unif ( sansserif_S ∩ ( roman_ℓ , italic_r ) ).

To obtain this result, we push the reversibility statement for Q\textgothSsubscript𝑄\textgoth𝑆Q_{\textgoth{S}}italic_Q start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT formulated in [ReversibilityEllipticalSliceSampler] forward to the shrinkage kernel on arbitrary half open intervals.

Proof.

By [ReversibilityEllipticalSliceSampler, Theorem 2.9] we know that Qh,r(𝖲(,r))subscript𝑄subscript𝑟𝖲𝑟Q_{h_{\ell,r}(\mathsf{S}\cap(\ell,r))}italic_Q start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT is reversible with respect Unif(h,r(𝖲(,r)))Unifsubscript𝑟𝖲𝑟\mathrm{Unif}\big{(}h_{\ell,r}(\mathsf{S}\cap(\ell,r))\big{)}roman_Unif ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) ). Observe that by [rudolf2022robust, Proposition 19] this implies that Q𝖲,rsuperscriptsubscript𝑄𝖲𝑟Q_{\mathsf{S}}^{\ell,r}italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT is reversible with respect to (h,r1)Unif(h,r(𝖲(,r)))=Unif(𝖲(,r))subscriptsuperscriptsubscript𝑟1Unifsubscript𝑟𝖲𝑟Unif𝖲𝑟(h_{\ell,r}^{-1})_{\sharp}\mathrm{Unif}\big{(}h_{\ell,r}(\mathsf{S}\cap(\ell,r% ))\big{)}=\mathrm{Unif}\big{(}\mathsf{S}\cap(\ell,r)\big{)}( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT ♯ end_POSTSUBSCRIPT roman_Unif ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) ) = roman_Unif ( sansserif_S ∩ ( roman_ℓ , italic_r ) ).

The next lemma can be seen as describing the behavior of the shrinkage kernel under U-turns.

Lemma 16.

Let ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R such that <r𝑟\ell<rroman_ℓ < italic_r and let 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ) be an open set such that (,r)𝖲𝑟𝖲(\ell,r)\cap\mathsf{S}\neq\emptyset( roman_ℓ , italic_r ) ∩ sansserif_S ≠ ∅. For all α𝖲(,r)𝛼𝖲𝑟\alpha\in\mathsf{S}\cap(\ell,r)italic_α ∈ sansserif_S ∩ ( roman_ℓ , italic_r ) and 𝖠(Λα(𝖲(,r)))𝖠subscriptΛ𝛼𝖲𝑟\mathsf{A}\in\mathcal{B}(\Lambda_{\alpha}(\mathsf{S}\cap(\ell,r)))sansserif_A ∈ caligraphic_B ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) ) we have

QΛα(𝖲)λα(,r)(0,𝖠)=Q𝖲,r(α,Λα(𝖠)),superscriptsubscript𝑄subscriptΛ𝛼𝖲subscriptλ𝛼𝑟0𝖠superscriptsubscript𝑄𝖲𝑟𝛼subscriptΛ𝛼𝖠Q_{\Lambda_{\alpha}(\mathsf{S})}^{\uplambda_{\alpha}(\ell,r)}(0,\mathsf{A})=Q_% {\mathsf{S}}^{\ell,r}(\alpha,\Lambda_{\alpha}(\mathsf{A})),italic_Q start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUPERSCRIPT ( 0 , sansserif_A ) = italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_α , roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A ) ) ,

where ΛαsubscriptΛ𝛼\Lambda_{\alpha}roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT and λαsubscriptλ𝛼\uplambda_{\alpha}roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT are defined as in (24) and (25) respectively.

In order to leverage a similar property of the kernel Q\textgothSsubscript𝑄\textgoth𝑆Q_{\textgoth{S}}italic_Q start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT for showing the above statement, we need the collection of maps

Λ~α:[0,2π)[0,2π),θαθmod2π:subscript~Λ𝛼formulae-sequence02𝜋02𝜋maps-to𝜃modulo𝛼𝜃2𝜋\displaystyle\widetilde{\Lambda}_{\alpha}:[0,2\pi)\to[0,2\pi),\qquad\theta% \mapsto\alpha-\theta\mod 2\piover~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT : [ 0 , 2 italic_π ) → [ 0 , 2 italic_π ) , italic_θ ↦ italic_α - italic_θ roman_mod 2 italic_π

indexed by α[0,2π)𝛼02𝜋\alpha\in[0,2\pi)italic_α ∈ [ 0 , 2 italic_π ).

Proof.

We aim to apply [ReversibilityEllipticalSliceSampler, Lemma 2.10]. Let ,r𝑟\ell,r\in\mathbb{R}roman_ℓ , italic_r ∈ blackboard_R such that <r𝑟\ell<rroman_ℓ < italic_r. Moreover, let 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ) be an open set such that (,r)𝖲𝑟𝖲(\ell,r)\cap\mathsf{S}\neq\emptyset( roman_ℓ , italic_r ) ∩ sansserif_S ≠ ∅, and α(,r)𝖲𝛼𝑟𝖲\alpha\in(\ell,r)\cap\mathsf{S}italic_α ∈ ( roman_ℓ , italic_r ) ∩ sansserif_S. Observe that for θ𝖲(,r)𝜃𝖲𝑟\theta\in\mathsf{S}\cap(\ell,r)italic_θ ∈ sansserif_S ∩ ( roman_ℓ , italic_r ) we have

Λ~h,r(α)(h,r(θ))=(2πrαmod2π2πrθmod2π)mod2πsubscript~Λsubscript𝑟𝛼subscript𝑟𝜃modulomodulo2𝜋𝑟𝛼modulo2𝜋2𝜋𝑟𝜃2𝜋2𝜋\displaystyle\widetilde{\Lambda}_{h_{\ell,r}(\alpha)}\left(h_{\ell,r}(\theta)% \right)=\left(\frac{2\pi}{r-\ell}\alpha\mod 2\pi-\frac{2\pi}{r-\ell}\theta\mod 2% \pi\right)\mod 2\piover~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_θ ) ) = ( divide start_ARG 2 italic_π end_ARG start_ARG italic_r - roman_ℓ end_ARG italic_α roman_mod 2 italic_π - divide start_ARG 2 italic_π end_ARG start_ARG italic_r - roman_ℓ end_ARG italic_θ roman_mod 2 italic_π ) roman_mod 2 italic_π
=(2πr(αθ))mod2π=(2π(α)(αr)(αθ))mod2πabsentmodulo2𝜋𝑟𝛼𝜃2𝜋modulo2𝜋𝛼𝛼𝑟𝛼𝜃2𝜋\displaystyle\qquad=\left(\frac{2\pi}{r-\ell}(\alpha-\theta)\right)\mod 2\pi=% \left(\frac{2\pi}{(\alpha-\ell)-(\alpha-r)}(\alpha-\theta)\right)\mod 2\pi= ( divide start_ARG 2 italic_π end_ARG start_ARG italic_r - roman_ℓ end_ARG ( italic_α - italic_θ ) ) roman_mod 2 italic_π = ( divide start_ARG 2 italic_π end_ARG start_ARG ( italic_α - roman_ℓ ) - ( italic_α - italic_r ) end_ARG ( italic_α - italic_θ ) ) roman_mod 2 italic_π
=hλα(,r)(Λα(θ)).absentsubscriptsubscriptλ𝛼𝑟subscriptΛ𝛼𝜃\displaystyle\qquad=h_{\uplambda_{\alpha}(\ell,r)}\left(\Lambda_{\alpha}(% \theta)\right).= italic_h start_POSTSUBSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) .

Therefore

Λ~h,r(α)(h,r(𝖲(,r)))=hλα(,r)(Λα(𝖲(,r)))=hλα(,r)(Λα(𝖲)(Λα(r),Λα())),subscript~Λsubscript𝑟𝛼subscript𝑟𝖲𝑟subscriptsubscriptλ𝛼𝑟subscriptΛ𝛼𝖲𝑟subscriptsubscriptλ𝛼𝑟subscriptΛ𝛼𝖲subscriptΛ𝛼𝑟subscriptΛ𝛼\widetilde{\Lambda}_{h_{\ell,r}(\alpha)}\Big{(}h_{\ell,r}\big{(}\mathsf{S}\cap% (\ell,r)\big{)}\Big{)}=h_{\uplambda_{\alpha}(\ell,r)}\Big{(}\Lambda_{\alpha}% \big{(}\mathsf{S}\cap(\ell,r)\big{)}\Big{)}=h_{\uplambda_{\alpha}(\ell,r)}\Big% {(}\Lambda_{\alpha}(\mathsf{S})\cap\big{(}\Lambda_{\alpha}(r),\Lambda_{\alpha}% (\ell)\big{)}\Big{)},over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) ) = italic_h start_POSTSUBSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) ) = italic_h start_POSTSUBSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) ∩ ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_r ) , roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ ) ) ) ,

and

Λ~h,r(α)(h,r(Λα(𝖠)))=hλα(,r)(𝖠)subscript~Λsubscript𝑟𝛼subscript𝑟subscriptΛ𝛼𝖠subscriptsubscriptλ𝛼𝑟𝖠\widetilde{\Lambda}_{h_{\ell,r}(\alpha)}\left(h_{\ell,r}\big{(}\Lambda_{\alpha% }(\mathsf{A})\big{)}\right)=h_{\uplambda_{\alpha}(\ell,r)}(\mathsf{A})over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A ) ) ) = italic_h start_POSTSUBSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( sansserif_A )

for all 𝖠(Λα(𝖲(,r)))𝖠subscriptΛ𝛼𝖲𝑟\mathsf{A}\in\mathcal{B}(\Lambda_{\alpha}(\mathsf{S}\cap(\ell,r)))sansserif_A ∈ caligraphic_B ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) ), as Λα1=ΛαsuperscriptsubscriptΛ𝛼1subscriptΛ𝛼\Lambda_{\alpha}^{-1}=\Lambda_{\alpha}roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT. Since Λ~h,r(α)(h,r(α))=h,r(α)h,r(α)mod2π=0subscript~Λsubscript𝑟𝛼subscript𝑟𝛼modulosubscript𝑟𝛼subscript𝑟𝛼2𝜋0\widetilde{\Lambda}_{h_{\ell,r}(\alpha)}\left(h_{\ell,r}(\alpha)\right)=h_{% \ell,r}(\alpha)-h_{\ell,r}(\alpha)\mod 2\pi=0over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) ) = italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) - italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) roman_mod 2 italic_π = 0 and Λ~α1=Λ~αsuperscriptsubscript~Λ𝛼1subscript~Λ𝛼\widetilde{\Lambda}_{\alpha}^{-1}=\widetilde{\Lambda}_{\alpha}over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT, we get by [ReversibilityEllipticalSliceSampler, Lemma 2.10, note that gθ=Λ~αsubscript𝑔𝜃subscript~Λ𝛼g_{\theta}=\widetilde{\Lambda}_{\alpha}italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT = over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT] that for 𝖠(Λα(𝖲(,r)))𝖠subscriptΛ𝛼𝖲𝑟\mathsf{A}\in\mathcal{B}(\Lambda_{\alpha}(\mathsf{S}\cap(\ell,r)))sansserif_A ∈ caligraphic_B ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) )

QΛα(𝖲)λα(,r)(0,𝖠)superscriptsubscript𝑄subscriptΛ𝛼𝖲subscriptλ𝛼𝑟0𝖠\displaystyle Q_{\Lambda_{\alpha}(\mathsf{S})}^{\uplambda_{\alpha}(\ell,r)}(0,% \mathsf{A})italic_Q start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUPERSCRIPT ( 0 , sansserif_A ) =Qhλα(,r)(Λα(𝖲)(Λα(r),Λα()))(0,hλα(,r)(𝖠))absentsubscript𝑄subscriptsubscriptλ𝛼𝑟subscriptΛ𝛼𝖲subscriptΛ𝛼𝑟subscriptΛ𝛼0subscriptsubscriptλ𝛼𝑟𝖠\displaystyle=Q_{h_{\uplambda_{\alpha}(\ell,r)}\big{(}\Lambda_{\alpha}(\mathsf% {S})\cap(\Lambda_{\alpha}(r),\Lambda_{\alpha}(\ell))\big{)}}\left(0,h_{% \uplambda_{\alpha}(\ell,r)}(\mathsf{A})\right)= italic_Q start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) ∩ ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_r ) , roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ ) ) ) end_POSTSUBSCRIPT ( 0 , italic_h start_POSTSUBSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( sansserif_A ) )
=QΛ~h,r(α)(h,r(𝖲(,r)))(Λ~h,r(α)(h,r(α)),Λ~h,r(α)(h,r(Λα(𝖠))))absentsubscript𝑄subscript~Λsubscript𝑟𝛼subscript𝑟𝖲𝑟subscript~Λsubscript𝑟𝛼subscript𝑟𝛼subscript~Λsubscript𝑟𝛼subscript𝑟subscriptΛ𝛼𝖠\displaystyle=Q_{\widetilde{\Lambda}_{h_{\ell,r}(\alpha)}\big{(}h_{\ell,r}(% \mathsf{S}\cap(\ell,r))\big{)}}\left(\widetilde{\Lambda}_{h_{\ell,r}(\alpha)}% \left(h_{\ell,r}(\alpha)\right),\widetilde{\Lambda}_{h_{\ell,r}(\alpha)}\big{(% }h_{\ell,r}(\Lambda_{\alpha}(\mathsf{A}))\big{)}\right)= italic_Q start_POSTSUBSCRIPT over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) ) end_POSTSUBSCRIPT ( over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) ) , over~ start_ARG roman_Λ end_ARG start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A ) ) ) )
=Qh,r(𝖲(,r))(h,r(α),h,r(Λα(𝖠)))absentsubscript𝑄subscript𝑟𝖲𝑟subscript𝑟𝛼subscript𝑟subscriptΛ𝛼𝖠\displaystyle=Q_{h_{\ell,r}(\mathsf{S}\cap(\ell,r))}\big{(}h_{\ell,r}(\alpha),% h_{\ell,r}(\Lambda_{\alpha}(\mathsf{A}))\big{)}= italic_Q start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( sansserif_S ∩ ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( italic_α ) , italic_h start_POSTSUBSCRIPT roman_ℓ , italic_r end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A ) ) )
=Q𝖲,r(α,Λα(𝖠)).absentsuperscriptsubscript𝑄𝖲𝑟𝛼subscriptΛ𝛼𝖠\displaystyle=Q_{\mathsf{S}}^{\ell,r}(\alpha,\Lambda_{\alpha}(\mathsf{A})).= italic_Q start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_α , roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A ) ) .

4.3 Reversibility of the geodesic slice sampler

Note that the tangent bundle

T𝖬:=x𝖬{x}×Tx𝖬assign𝑇𝖬subscript𝑥𝖬𝑥subscript𝑇𝑥𝖬T\mathsf{M}:=\bigcup_{x\in\mathsf{M}}\{x\}\times T_{x}\mathsf{M}italic_T sansserif_M := ⋃ start_POSTSUBSCRIPT italic_x ∈ sansserif_M end_POSTSUBSCRIPT { italic_x } × italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M

of 𝖬𝖬\mathsf{M}sansserif_M is a smooth, 2d2𝑑2d2 italic_d-dimensional, connected manifold, see e.g. [Sakai, Section I.2.2]. To prove Theorem 7, we employ the Riemannian structure of T𝖬𝑇𝖬T\mathsf{M}italic_T sansserif_M. We only briefly sketch how this Riemannian structure is introduced, for more details see [Sakai, Section II.4].

We denote by

proj𝖬:T𝖬𝖬,(x,v)x:subscriptproj𝖬formulae-sequence𝑇𝖬𝖬maps-to𝑥𝑣𝑥\mathrm{proj}_{\mathsf{M}}:T\mathsf{M}\to\mathsf{M},\qquad(x,v)\mapsto xroman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT : italic_T sansserif_M → sansserif_M , ( italic_x , italic_v ) ↦ italic_x

the projection map from the tangent bundle onto 𝖬𝖬\mathsf{M}sansserif_M. Observe that for u=(x,v)T𝖬𝑢𝑥𝑣𝑇𝖬u=(x,v)\in T\mathsf{M}italic_u = ( italic_x , italic_v ) ∈ italic_T sansserif_M the tangent space TuT𝖬subscript𝑇𝑢𝑇𝖬T_{u}T\mathsf{M}italic_T start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT italic_T sansserif_M to T𝖬𝑇𝖬T\mathsf{M}italic_T sansserif_M at u𝑢uitalic_u can be identified with the direct sum

TuT𝖬=Tx𝖬Tx𝖬.subscript𝑇𝑢𝑇𝖬direct-sumsubscript𝑇𝑥𝖬subscript𝑇𝑥𝖬T_{u}T\mathsf{M}=T_{x}\mathsf{M}\oplus T_{x}\mathsf{M}.italic_T start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT italic_T sansserif_M = italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M ⊕ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M .

For u=(x,v)𝖬𝑢𝑥𝑣𝖬u=(x,v)\in\mathsf{M}italic_u = ( italic_x , italic_v ) ∈ sansserif_M this identification allows us to introduce a \ldqcanonical\rdq metric on T𝖬𝑇𝖬T\mathsf{M}italic_T sansserif_M referenced to as the Sasaki metric

𝔊u(η,η~):=𝔤x(ηh,η~h)+𝔤x(ηv,η~v),η=(ηh,ηv),η~=(η~h,η~v)Tx𝖬Tx𝖬.formulae-sequenceassignsubscript𝔊𝑢𝜂~𝜂subscript𝔤𝑥subscript𝜂subscript~𝜂subscript𝔤𝑥subscript𝜂𝑣subscript~𝜂𝑣formulae-sequence𝜂subscript𝜂subscript𝜂𝑣~𝜂subscript~𝜂subscript~𝜂𝑣direct-sumsubscript𝑇𝑥𝖬subscript𝑇𝑥𝖬\mathfrak{G}_{u}(\eta,\widetilde{\eta}):=\mathfrak{g}_{x}\left(\eta_{h},% \widetilde{\eta}_{h}\right)+\mathfrak{g}_{x}(\eta_{v},\widetilde{\eta}_{v}),% \qquad\eta=(\eta_{h},\eta_{v}),\widetilde{\eta}=(\widetilde{\eta}_{h},% \widetilde{\eta}_{v})\in T_{x}\mathsf{M}\oplus T_{x}\mathsf{M}.fraktur_G start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_η , over~ start_ARG italic_η end_ARG ) := fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_η start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , over~ start_ARG italic_η end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ) + fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_η start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT , over~ start_ARG italic_η end_ARG start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) , italic_η = ( italic_η start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_η start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) , over~ start_ARG italic_η end_ARG = ( over~ start_ARG italic_η end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , over~ start_ARG italic_η end_ARG start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ) ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M ⊕ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M .

Together with 𝔊𝔊\mathfrak{G}fraktur_G, the tangent bundle is a Riemannian manifold. However, in fact we are more interested into a submanifold of T𝖬𝑇𝖬T\mathsf{M}italic_T sansserif_M, that is, the unit tangent bundle

U𝖬:=x𝖬Ux𝖬:=x𝖬{x}×𝕊xd1.assign𝑈𝖬subscript𝑥𝖬subscript𝑈𝑥𝖬assignsubscript𝑥𝖬𝑥subscriptsuperscript𝕊𝑑1𝑥U\mathsf{M}:=\bigcup_{x\in\mathsf{M}}U_{x}\mathsf{M}:=\bigcup_{x\in\mathsf{M}}% \{x\}\times\mathbb{S}^{d-1}_{x}.italic_U sansserif_M := ⋃ start_POSTSUBSCRIPT italic_x ∈ sansserif_M end_POSTSUBSCRIPT italic_U start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M := ⋃ start_POSTSUBSCRIPT italic_x ∈ sansserif_M end_POSTSUBSCRIPT { italic_x } × blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT .

We call the Riemannian measure ν𝔊subscript𝜈𝔊\nu_{\mathfrak{G}}italic_ν start_POSTSUBSCRIPT fraktur_G end_POSTSUBSCRIPT, which is induced by the Sasaki metric 𝔊𝔊\mathfrak{G}fraktur_G onto the unit tangent bundle U𝖬𝑈𝖬U\mathsf{M}italic_U sansserif_M, the Liouville measure. Observe that the restriction proj𝖬|U𝖬evaluated-atsubscriptproj𝖬𝑈𝖬\mathrm{proj}_{\mathsf{M}}|_{U\mathsf{M}}roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_U sansserif_M end_POSTSUBSCRIPT is a Riemannian submersion, and for x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M the fiber proj𝖬|U𝖬1(x)evaluated-atsubscriptproj𝖬𝑈𝖬1𝑥\mathrm{proj}_{\mathsf{M}}|_{U\mathsf{M}}^{-1}(x)roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_U sansserif_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x ) equipped with the metric induced by 𝔤𝔤\mathfrak{g}fraktur_g is (𝕊xd1,𝔤^x)subscriptsuperscript𝕊𝑑1𝑥subscript^𝔤𝑥\left(\mathbb{S}^{d-1}_{x},\widehat{\mathfrak{g}}_{x}\right)( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ). Note that additionally ν𝔤^x(𝕊xd1)=ν𝔤^(𝕊d1)subscript𝜈subscript^𝔤𝑥subscriptsuperscript𝕊𝑑1𝑥subscript𝜈^𝔤superscript𝕊𝑑1\nu_{\widehat{\mathfrak{g}}_{x}}(\mathbb{S}^{d-1}_{x})=\nu_{\widehat{\mathfrak% {g}}}(\mathbb{S}^{d-1})italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) = italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG end_POSTSUBSCRIPT ( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT ) for all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M. Applying Fubini’s theorem for manifolds (see [Sakai, Theorem II.5.6]), this yields a nice expression for the Liouville measure, namely we have

U𝖬f(x,v)ν𝔊(d(x,v))=𝖬𝕊xd1f(x,v)ν𝔤^x(dv)ν𝔤(dx)subscript𝑈𝖬𝑓𝑥𝑣subscript𝜈𝔊d𝑥𝑣subscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1𝑓𝑥𝑣subscript𝜈subscript^𝔤𝑥d𝑣subscript𝜈𝔤d𝑥\int_{U\mathsf{M}}f(x,v)\ \nu_{\mathfrak{G}}\big{(}{\rm d}(x,v)\big{)}=\int_{% \mathsf{M}}\int_{\mathbb{S}_{x}^{d-1}}f(x,v)\ \nu_{\widehat{\mathfrak{g}}_{x}}% ({\rm d}v)\,\nu_{\mathfrak{g}}({\rm d}x)∫ start_POSTSUBSCRIPT italic_U sansserif_M end_POSTSUBSCRIPT italic_f ( italic_x , italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_G end_POSTSUBSCRIPT ( roman_d ( italic_x , italic_v ) ) = ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( italic_x , italic_v ) italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) (26)

for all measurable functions f:U𝖬[0,):𝑓𝑈𝖬0f:U\mathsf{M}\to[0,\infty)italic_f : italic_U sansserif_M → [ 0 , ∞ ).

In the following, we combine the Liouville measure with a family of maps T(θ):U𝖬U𝖬:superscript𝑇𝜃𝑈𝖬𝑈𝖬T^{(\theta)}:U\mathsf{M}\to U\mathsf{M}italic_T start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT : italic_U sansserif_M → italic_U sansserif_M indexed by θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R which can be interpreted as \ldqwalking along the geodesic specified by uU𝖬𝑢𝑈𝖬u\in U\mathsf{M}italic_u ∈ italic_U sansserif_M with step length θ𝜃\thetaitalic_θ and then doing a U-turn\rdq. To this end, for θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R we denote the geodesic flow by

ϕθ:U𝖬U𝖬,(x,v)(γ(x,v)(θ),dγ(x,v)dt|θ).:subscriptitalic-ϕ𝜃formulae-sequence𝑈𝖬𝑈𝖬maps-to𝑥𝑣subscript𝛾𝑥𝑣𝜃evaluated-atdsubscript𝛾𝑥𝑣d𝑡𝜃\displaystyle\phi_{\theta}:U\mathsf{M}\to U\mathsf{M},\qquad(x,v)\mapsto\left(% \left.\gamma_{(x,v)}(\theta),\frac{{\rm d}\gamma_{(x,v)}}{{\rm d}t}\right|_{% \theta}\right).italic_ϕ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT : italic_U sansserif_M → italic_U sansserif_M , ( italic_x , italic_v ) ↦ ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) , divide start_ARG roman_d italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT end_ARG start_ARG roman_d italic_t end_ARG | start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ) .

For more details on the geodesic flow see [Sakai, Section II.4.II]. Moreover, we define a flip on the unit tangent bundle

:U𝖬U𝖬,(x,v)(x,v).:formulae-sequence𝑈𝖬𝑈𝖬maps-to𝑥𝑣𝑥𝑣\displaystyle\mathfrak{I}:U\mathsf{M}\to U\mathsf{M},\qquad(x,v)\mapsto(x,-v).fraktur_I : italic_U sansserif_M → italic_U sansserif_M , ( italic_x , italic_v ) ↦ ( italic_x , - italic_v ) .

Then we set

T(θ):=ϕθ,θ.formulae-sequenceassignsuperscript𝑇𝜃subscriptitalic-ϕ𝜃𝜃T^{(\theta)}:=\mathfrak{I}\circ\phi_{\theta},\qquad\theta\in\mathbb{R}.italic_T start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT := fraktur_I ∘ italic_ϕ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT , italic_θ ∈ blackboard_R .

Observe that the Liouville measure is invariant under the geodesic flow (see [Sakai, Exercise II.16]) and under the flip \mathfrak{I}fraktur_I (see [Paternain, Lemma 1.34]). Therefore we have due to the representation in (26) for all measurable functions f:U𝖬[0,):𝑓𝑈𝖬0f:U\mathsf{M}\to[0,\infty)italic_f : italic_U sansserif_M → [ 0 , ∞ ) and all θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R that

𝖬𝕊xd1f(T(θ)(x,v))σd1(x)(dv)ν𝔤(dx)=1ν𝔤^(𝕊d1)U𝖬f(T(θ)(x,v))ν𝔊(d(x,v))=1ν𝔤^(𝕊d1)U𝖬f(x,v)ν𝔊(d(x,v))=𝖬𝕊xd1f(x,v)σd1(x)(dv)ν𝔤(dx),subscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1𝑓superscript𝑇𝜃𝑥𝑣superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥1subscript𝜈^𝔤superscript𝕊𝑑1subscript𝑈𝖬𝑓superscript𝑇𝜃𝑥𝑣subscript𝜈𝔊d𝑥𝑣1subscript𝜈^𝔤superscript𝕊𝑑1subscript𝑈𝖬𝑓𝑥𝑣subscript𝜈𝔊d𝑥𝑣subscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1𝑓𝑥𝑣superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥\begin{split}&\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1}}f\left(T^{(\theta)}(% x,v)\right)\ \sigma_{d-1}^{(x)}({\rm d}v)\,\nu_{\mathfrak{g}}({\rm d}x)=\frac{% 1}{\nu_{\widehat{\mathfrak{g}}}(\mathbb{S}^{d-1})}\int_{U\mathsf{M}}f\left(T^{% (\theta)}(x,v)\right)\ \nu_{\mathfrak{G}}\big{(}{\rm d}(x,v)\big{)}\\ &\qquad=\frac{1}{\nu_{\widehat{\mathfrak{g}}}(\mathbb{S}^{d-1})}\int_{U\mathsf% {M}}f(x,v)\ \nu_{\mathfrak{G}}\big{(}{\rm d}(x,v)\big{)}=\int_{\mathsf{M}}\int% _{\mathbb{S}_{x}^{d-1}}f(x,v)\ \sigma_{d-1}^{(x)}({\rm d}v)\,\nu_{\mathfrak{g}% }({\rm d}x),\end{split}start_ROW start_CELL end_CELL start_CELL ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( italic_T start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) = divide start_ARG 1 end_ARG start_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG end_POSTSUBSCRIPT ( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT ) end_ARG ∫ start_POSTSUBSCRIPT italic_U sansserif_M end_POSTSUBSCRIPT italic_f ( italic_T start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) ) italic_ν start_POSTSUBSCRIPT fraktur_G end_POSTSUBSCRIPT ( roman_d ( italic_x , italic_v ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG end_POSTSUBSCRIPT ( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT ) end_ARG ∫ start_POSTSUBSCRIPT italic_U sansserif_M end_POSTSUBSCRIPT italic_f ( italic_x , italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_G end_POSTSUBSCRIPT ( roman_d ( italic_x , italic_v ) ) = ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( italic_x , italic_v ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) , end_CELL end_ROW (27)

i.e., the Liouville measure is invariant under T(θ)superscript𝑇𝜃T^{(\theta)}italic_T start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT.

We shed some further light on the interaction of T(θ)superscript𝑇𝜃T^{(\theta)}italic_T start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT and the geodesics.

Remark 17.

Let x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M, v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT and θ,α𝜃𝛼\theta,\alpha\in\mathbb{R}italic_θ , italic_α ∈ blackboard_R. Using the rescaling property of geodesics (see [Lee, Lemma 5.18]) and the chain rule, we have

(ϕθ)(x,v)subscriptitalic-ϕ𝜃𝑥𝑣\displaystyle(\mathfrak{I}\circ\phi_{\theta})(x,v)( fraktur_I ∘ italic_ϕ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ) ( italic_x , italic_v ) =(γ(x,v)(θ),dγ(x,v)dt|θ)=(γ(x,v)(θ),dγ(x,v)dt|θ)absentsubscript𝛾𝑥𝑣𝜃evaluated-atdsubscript𝛾𝑥𝑣d𝑡𝜃subscript𝛾𝑥𝑣𝜃evaluated-atdsubscript𝛾𝑥𝑣d𝑡𝜃\displaystyle=\left(\gamma_{(x,v)}(\theta),-\frac{{\rm d}\gamma_{(x,v)}}{{\rm d% }t}|_{\theta}\right)=\left(\gamma_{(x,-v)}(-\theta),\frac{{\rm d}\gamma_{(x,-v% )}}{{\rm d}t}|_{-\theta}\right)= ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) , - divide start_ARG roman_d italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT end_ARG start_ARG roman_d italic_t end_ARG | start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ) = ( italic_γ start_POSTSUBSCRIPT ( italic_x , - italic_v ) end_POSTSUBSCRIPT ( - italic_θ ) , divide start_ARG roman_d italic_γ start_POSTSUBSCRIPT ( italic_x , - italic_v ) end_POSTSUBSCRIPT end_ARG start_ARG roman_d italic_t end_ARG | start_POSTSUBSCRIPT - italic_θ end_POSTSUBSCRIPT )
=(ϕθ)(x,v).absentsubscriptitalic-ϕ𝜃𝑥𝑣\displaystyle=(\phi_{-\theta}\circ\mathfrak{I})(x,v).= ( italic_ϕ start_POSTSUBSCRIPT - italic_θ end_POSTSUBSCRIPT ∘ fraktur_I ) ( italic_x , italic_v ) .

Since ϕθϕα=ϕθαsubscriptitalic-ϕ𝜃subscriptitalic-ϕ𝛼subscriptitalic-ϕ𝜃𝛼\phi_{\theta}\circ\phi_{-\alpha}=\phi_{\theta-\alpha}italic_ϕ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∘ italic_ϕ start_POSTSUBSCRIPT - italic_α end_POSTSUBSCRIPT = italic_ϕ start_POSTSUBSCRIPT italic_θ - italic_α end_POSTSUBSCRIPT111111This is a basic property of a flow, see e.g. [LeeSmooth, Chapter 9]. Observe that the geodesic flow is a flow on the (unit) tangent bundle., this yields

γT(α)(x,v)(θ)subscript𝛾superscript𝑇𝛼𝑥𝑣𝜃\displaystyle\gamma_{T^{(\alpha)}(x,v)}(\theta)italic_γ start_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) =proj𝖬(ϕθ(T(α)(x,v)))=proj𝖬((ϕθϕα)(x,v))absentsubscriptproj𝖬subscriptitalic-ϕ𝜃superscript𝑇𝛼𝑥𝑣subscriptproj𝖬subscriptitalic-ϕ𝜃subscriptitalic-ϕ𝛼𝑥𝑣\displaystyle=\mathrm{proj}_{\mathsf{M}}\big{(}\phi_{\theta}\left(T^{(\alpha)}% (x,v)\right)\big{)}=\mathrm{proj}_{\mathsf{M}}\big{(}(\phi_{\theta}\circ% \mathfrak{I}\circ\phi_{\alpha})(x,v)\big{)}= roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) ) ) = roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ( ( italic_ϕ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∘ fraktur_I ∘ italic_ϕ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ) ( italic_x , italic_v ) )
=proj𝖬((ϕθϕα)(x,v))=proj𝖬((ϕθα)(x,v))absentsubscriptproj𝖬subscriptitalic-ϕ𝜃subscriptitalic-ϕ𝛼𝑥𝑣subscriptproj𝖬subscriptitalic-ϕ𝜃𝛼𝑥𝑣\displaystyle=\mathrm{proj}_{\mathsf{M}}\big{(}(\phi_{\theta}\circ\phi_{-% \alpha}\circ\mathfrak{I})(x,v)\big{)}=\mathrm{proj}_{\mathsf{M}}\big{(}(\phi_{% \theta-\alpha}\circ\mathfrak{I})(x,v)\big{)}= roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ( ( italic_ϕ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∘ italic_ϕ start_POSTSUBSCRIPT - italic_α end_POSTSUBSCRIPT ∘ fraktur_I ) ( italic_x , italic_v ) ) = roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ( ( italic_ϕ start_POSTSUBSCRIPT italic_θ - italic_α end_POSTSUBSCRIPT ∘ fraktur_I ) ( italic_x , italic_v ) )
=proj𝖬((ϕΛα(θ))(x,v))=proj𝖬(ϕΛα(θ)(x,v))=γ(x,v)(Λα(θ)).absentsubscriptproj𝖬subscriptitalic-ϕsubscriptΛ𝛼𝜃𝑥𝑣subscriptproj𝖬subscriptitalic-ϕsubscriptΛ𝛼𝜃𝑥𝑣subscript𝛾𝑥𝑣subscriptΛ𝛼𝜃\displaystyle=\mathrm{proj}_{\mathsf{M}}\left((\mathfrak{I}\circ\phi_{\Lambda_% {\alpha}(\theta)})(x,v)\right)=\mathrm{proj}_{\mathsf{M}}\left(\phi_{\Lambda_{% \alpha}(\theta)}(x,v)\right)=\gamma_{(x,v)}\big{(}\Lambda_{\alpha}(\theta)\big% {)}.= roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ( ( fraktur_I ∘ italic_ϕ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) end_POSTSUBSCRIPT ) ( italic_x , italic_v ) ) = roman_proj start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) end_POSTSUBSCRIPT ( italic_x , italic_v ) ) = italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) .

In particular this implies

L(T(α)(x,v),t)𝐿superscript𝑇𝛼𝑥𝑣𝑡\displaystyle L\big{(}T^{(\alpha)}(x,v),t\big{)}italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) ={θp(γT(α)(x,v)(θ))>t}={θp(γ(x,v)(Λα(θ)))>t}absentconditional-set𝜃𝑝subscript𝛾superscript𝑇𝛼𝑥𝑣𝜃𝑡conditional-set𝜃𝑝subscript𝛾𝑥𝑣subscriptΛ𝛼𝜃𝑡\displaystyle=\{\theta\mid p\left(\gamma_{T^{(\alpha)}(x,v)}(\theta)\right)>t% \}=\{\theta\mid p\left(\gamma_{(x,v)}\big{(}\Lambda_{\alpha}(\theta)\big{)}% \right)>t\}= { italic_θ ∣ italic_p ( italic_γ start_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) > italic_t } = { italic_θ ∣ italic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) ) > italic_t }
=Λα(L(x,v,t)),absentsubscriptΛ𝛼𝐿𝑥𝑣𝑡\displaystyle=\Lambda_{\alpha}\left(L(x,v,t)\right),= roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ) ,

as Λα1=ΛαsuperscriptsubscriptΛ𝛼1subscriptΛ𝛼\Lambda_{\alpha}^{-1}=\Lambda_{\alpha}roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT.

Now we can prove the reversibility of the geodesic slice sampler.

Proof of Theorem 7.

Observe that by virtue of [latuszyinski2014convergence, Lemma 1] it suffices to show that Ktsubscript𝐾𝑡K_{t}italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is reversible with respect to ν𝔤(L(t))1ν𝔤|L(t)evaluated-atsubscript𝜈𝔤superscript𝐿𝑡1subscript𝜈𝔤𝐿𝑡\nu_{\mathfrak{g}}(L(t))^{-1}\nu_{\mathfrak{g}}|_{L(t)}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( italic_L ( italic_t ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT for all t(0,p)𝑡0subscriptnorm𝑝t\in(0,\|p\|_{\infty})italic_t ∈ ( 0 , ∥ italic_p ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ).

Let t(0,p)𝑡0subscriptnorm𝑝t\in(0,\|p\|_{\infty})italic_t ∈ ( 0 , ∥ italic_p ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) and 𝖠,𝖡(𝖬)𝖠𝖡𝖬\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathsf{M})sansserif_A , sansserif_B ∈ caligraphic_B ( sansserif_M ). After introducing the uniform distribution on L(x,v,t)𝐿𝑥𝑣𝑡L(x,v,t)italic_L ( italic_x , italic_v , italic_t ), we get by Lemma 11 that

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd12(,r)𝟙L(t)𝖠(x)𝟙𝖡(γ(x,v)(θ))QL(x,v,t),r(0,dθ)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx)absentsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝐿𝑡𝖠𝑥subscript1𝖡subscript𝛾𝑥𝑣𝜃superscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟0d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥\displaystyle\quad=\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1}}\int_{\mathbb{R% }^{2}}\int_{(\ell,r)}\mathbbm{1}_{L(t)\cap\mathsf{A}}(x)\mathbbm{1}_{\mathsf{B% }}\left(\gamma_{(x,v)}(\theta)\right)\ Q_{L(x,v,t)}^{\ell,r}(0,{\rm d}\theta)% \,\xi_{L(x,v,t)}^{(0)}\big{(}{\rm d}(\ell,r)\big{)}\,\sigma_{d-1}^{(x)}({\rm d% }v)\,\nu_{\mathfrak{g}}({\rm d}x)= ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd12(,r)𝟙L(t)𝖠(x)𝟙𝖡(γ(x,v)(θ))1Leb1(L(x,v,t)(,r))𝟙L(x,v,t)(,r)(α)absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝐿𝑡𝖠𝑥subscript1𝖡subscript𝛾𝑥𝑣𝜃1subscriptLeb1𝐿𝑥𝑣𝑡𝑟subscript1𝐿𝑥𝑣𝑡𝑟𝛼\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{(\ell,r)}\mathbbm{1}_{L(t)\cap\mathsf{A}}(x)% \mathbbm{1}_{\mathsf{B}}\left(\gamma_{(x,v)}(\theta)\right)\frac{1}{\mathrm{% Leb}_{1}(L(x,v,t)\cap(\ell,r))}\mathbbm{1}_{L(x,v,t)\cap(\ell,r)}(\alpha)= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) divide start_ARG 1 end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( italic_α )
×QL(x,v,t),r(0,dθ)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα)absentsuperscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟0d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 142.26378pt\times Q_{L(x,v,t)}^{\ell,r}(0,{\rm d}\theta)\ % \xi_{L(x,v,t)}^{(0)}\big{(}{\rm d}(\ell,r)\big{)}\ \sigma_{d-1}^{(x)}({\rm d}v% )\ \nu_{\mathfrak{g}}({\rm d}x)\ \mathrm{Leb}_{1}({\rm d}\alpha)× italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α )
=𝖬𝕊xd12(,r)𝟙L(t)𝖠(x)𝟙𝖡(γ(x,v)(θ))1Leb1(L(x,v,t)(,r))𝟙L(x,v,t)(α)𝟙(,r)(0)absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝐿𝑡𝖠𝑥subscript1𝖡subscript𝛾𝑥𝑣𝜃1subscriptLeb1𝐿𝑥𝑣𝑡𝑟subscript1𝐿𝑥𝑣𝑡𝛼subscript1𝑟0\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{(\ell,r)}\mathbbm{1}_{L(t)\cap\mathsf{A}}(x)% \mathbbm{1}_{\mathsf{B}}\left(\gamma_{(x,v)}(\theta)\right)\frac{1}{\mathrm{% Leb}_{1}(L(x,v,t)\cap(\ell,r))}\mathbbm{1}_{L(x,v,t)}(\alpha)\mathbbm{1}_{(% \ell,r)}(0)= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) divide start_ARG 1 end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT ( italic_α ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( 0 )
×QL(x,v,t),r(0,dθ)ξL(x,v,t)(α)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα).absentsuperscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟0d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡𝛼d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 142.26378pt\times Q_{L(x,v,t)}^{\ell,r}(0,{\rm d}\theta)\ % \xi_{L(x,v,t)}^{(\alpha)}\big{(}{\rm d}(\ell,r)\big{)}\ \sigma_{d-1}^{(x)}({% \rm d}v)\ \nu_{\mathfrak{g}}({\rm d}x)\ \mathrm{Leb}_{1}({\rm d}\alpha).× italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α ) .

Using (27), we obtain

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd12(,r)𝟙L(t)𝖠(γ(x,v)(α))𝟙𝖡(γT(α)(x,v)(θ))𝟙L(T(α)(x,v),t)(α)𝟙(,r)(0)Leb1(L(T(α)(x,v),t)(,r))absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝐿𝑡𝖠subscript𝛾𝑥𝑣𝛼subscript1𝖡subscript𝛾superscript𝑇𝛼𝑥𝑣𝜃subscript1𝐿superscript𝑇𝛼𝑥𝑣𝑡𝛼subscript1𝑟0subscriptLeb1𝐿superscript𝑇𝛼𝑥𝑣𝑡𝑟\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{(\ell,r)}\frac{\mathbbm{1}_{L(t)\cap\mathsf{A}}% \left(\gamma_{(x,v)}(\alpha)\right)\mathbbm{1}_{\mathsf{B}}\left(\gamma_{T^{(% \alpha)}(x,v)}(\theta)\right)\mathbbm{1}_{L\left(T^{(\alpha)}(x,v),t\right)}(% \alpha)\mathbbm{1}_{(\ell,r)}(0)}{\mathrm{Leb}_{1}\big{(}L(T^{(\alpha)}(x,v),t% )\cap(\ell,r)\big{)}}= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT divide start_ARG blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) end_POSTSUBSCRIPT ( italic_α ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( 0 ) end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG
×QL(T(α)(x,v),t),r(0,dθ)ξL(T(α)(x,v),t)(α)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα).absentsuperscriptsubscript𝑄𝐿superscript𝑇𝛼𝑥𝑣𝑡𝑟0d𝜃superscriptsubscript𝜉𝐿superscript𝑇𝛼𝑥𝑣𝑡𝛼d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 113.81102pt\times Q_{L\left(T^{(\alpha)}(x,v),t\right)}^{% \ell,r}(0,{\rm d}\theta)\,\xi_{L\left(T^{(\alpha)}(x,v),t\right)}^{(\alpha)}% \big{(}{\rm d}(\ell,r)\big{)}\,\sigma_{d-1}^{(x)}({\rm d}v)\,\nu_{\mathfrak{g}% }({\rm d}x)\,\mathrm{Leb}_{1}({\rm d}\alpha).× italic_Q start_POSTSUBSCRIPT italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α ) .

For all α𝛼\alpha\in\mathbb{R}italic_α ∈ blackboard_R, we have

γ(x,v)(α)L(t)p(γ(x,v)(α))>tαL(x,v,t),formulae-sequencesubscript𝛾𝑥𝑣𝛼𝐿𝑡formulae-sequence𝑝subscript𝛾𝑥𝑣𝛼𝑡𝛼𝐿𝑥𝑣𝑡\gamma_{(x,v)}(\alpha)\in L(t)\quad\Leftrightarrow\quad p\left(\gamma_{(x,v)}(% \alpha)\right)>t\quad\Leftrightarrow\quad\alpha\in L(x,v,t),italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ∈ italic_L ( italic_t ) ⇔ italic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) > italic_t ⇔ italic_α ∈ italic_L ( italic_x , italic_v , italic_t ) ,

and by Remark 17

αL(T(α)(x,v),t)p(x)=p(γ(x,v)(αα))=p(γT(α)(x,v)(α))>txL(t).formulae-sequenceformulae-sequence𝛼𝐿superscript𝑇𝛼𝑥𝑣𝑡𝑝𝑥𝑝subscript𝛾𝑥𝑣𝛼𝛼𝑝subscript𝛾superscript𝑇𝛼𝑥𝑣𝛼𝑡𝑥𝐿𝑡\alpha\in L\big{(}T^{(\alpha)}(x,v),t\big{)}\quad\Leftrightarrow\quad p(x)=p% \left(\gamma_{(x,v)}(\alpha-\alpha)\right)=p\left(\gamma_{T^{(\alpha)}(x,v)}(% \alpha)\right)>t\quad\Leftrightarrow\quad x\in L(t).italic_α ∈ italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) ⇔ italic_p ( italic_x ) = italic_p ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α - italic_α ) ) = italic_p ( italic_γ start_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) > italic_t ⇔ italic_x ∈ italic_L ( italic_t ) .

If we also apply Remark 17 to 𝟙𝖡subscript1𝖡\mathbbm{1}_{\mathsf{\mathsf{B}}}blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT and to the set in the 1-dimensional Lebesgue measure in the numerator, we overall obtain

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd12(,r)𝟙L(x,v,t)(α)𝟙𝖠(γ(x,v)(α))𝟙𝖡(γ(x,v)(Λα(θ)))𝟙L(t)(x)𝟙(,r)(0)Leb1(Λα(L(x,v,t))(,r))absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝐿𝑥𝑣𝑡𝛼subscript1𝖠subscript𝛾𝑥𝑣𝛼subscript1𝖡subscript𝛾𝑥𝑣subscriptΛ𝛼𝜃subscript1𝐿𝑡𝑥subscript1𝑟0subscriptLeb1subscriptΛ𝛼𝐿𝑥𝑣𝑡𝑟\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{(\ell,r)}\frac{\mathbbm{1}_{L(x,v,t)}\left(\alpha% \right)\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\alpha)\right)\mathbbm{1}_% {\mathsf{B}}\left(\gamma_{(x,v)}\big{(}\Lambda_{\alpha}(\theta)\big{)}\right)% \mathbbm{1}_{L(t)}(x)\mathbbm{1}_{(\ell,r)}(0)}{\mathrm{Leb}_{1}\left(\Lambda_% {\alpha}(L(x,v,t))\cap(\ell,r)\right)}= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT divide start_ARG blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT ( italic_α ) blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) ) blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( 0 ) end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ) ∩ ( roman_ℓ , italic_r ) ) end_ARG
×QL(T(α)(x,v),t),r(0,dθ)ξL(T(α)(x,v),t)(α)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα).absentsuperscriptsubscript𝑄𝐿superscript𝑇𝛼𝑥𝑣𝑡𝑟0d𝜃superscriptsubscript𝜉𝐿superscript𝑇𝛼𝑥𝑣𝑡𝛼d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 113.81102pt\times Q_{L(T^{(\alpha)}(x,v),t)}^{\ell,r}(0,{% \rm d}\theta)\ \xi_{L(T^{(\alpha)}(x,v),t)}^{(\alpha)}\big{(}{\rm d}(\ell,r)% \big{)}\ \sigma_{d-1}^{(x)}({\rm d}v)\ \nu_{\mathfrak{g}}({\rm d}x)\ \mathrm{% Leb}_{1}({\rm d}\alpha).× italic_Q start_POSTSUBSCRIPT italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α ) .

Then Lemma 12 together with Remark 17 yields

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd12Λα((,r))𝟙L(x,v,t)(α)𝟙𝖠(γ(x,v)(α))𝟙𝖡(γ(x,v)(Λα(θ)))𝟙L(t)(x)𝟙Λα((,r))(0)Leb1(Λα(L(x,v,t)(,r)))absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscriptsubscriptΛ𝛼𝑟subscript1𝐿𝑥𝑣𝑡𝛼subscript1𝖠subscript𝛾𝑥𝑣𝛼subscript1𝖡subscript𝛾𝑥𝑣subscriptΛ𝛼𝜃subscript1𝐿𝑡𝑥subscript1subscriptΛ𝛼𝑟0subscriptLeb1subscriptΛ𝛼𝐿𝑥𝑣𝑡𝑟\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{\Lambda_{\alpha}((\ell,r))}\frac{\mathbbm{1}_{L(x% ,v,t)}\left(\alpha\right)\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\alpha)% \right)\mathbbm{1}_{\mathsf{B}}\left(\gamma_{(x,v)}\big{(}\Lambda_{\alpha}(% \theta)\big{)}\right)\mathbbm{1}_{L(t)}(x)\mathbbm{1}_{\Lambda_{\alpha}((\ell,% r))}(0)}{\mathrm{Leb}_{1}\big{(}\Lambda_{\alpha}(L(x,v,t)\cap(\ell,r))\big{)}}= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT divide start_ARG blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT ( italic_α ) blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) ) blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT ( 0 ) end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) ) end_ARG
×QL(T(α)(x,v),t)λα(,r)(0,dθ)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα).absentsuperscriptsubscript𝑄𝐿superscript𝑇𝛼𝑥𝑣𝑡subscriptλ𝛼𝑟0d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 113.81102pt\times Q_{L(T^{(\alpha)}(x,v),t)}^{\uplambda_{% \alpha}(\ell,r)}(0,{\rm d}\theta)\ \xi_{L(x,v,t)}^{(0)}\big{(}{\rm d}(\ell,r)% \big{)}\ \sigma_{d-1}^{(x)}({\rm d}v)\ \nu_{\mathfrak{g}}({\rm d}x)\ \mathrm{% Leb}_{1}({\rm d}\alpha).× italic_Q start_POSTSUBSCRIPT italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α ) .

Observe that

Leb1(Λα(L(x,v,t)(,r)))=Leb1(L(x,v,t)(,r))subscriptLeb1subscriptΛ𝛼𝐿𝑥𝑣𝑡𝑟subscriptLeb1𝐿𝑥𝑣𝑡𝑟\displaystyle\mathrm{Leb}_{1}\big{(}\Lambda_{\alpha}(L(x,v,t)\cap(\ell,r))\big% {)}=\mathrm{Leb}_{1}\big{(}L(x,v,t)\cap(\ell,r)\big{)}roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) ) = roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) )

for all xM𝑥𝑀x\in Mitalic_x ∈ italic_M, v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT and α,,r𝛼𝑟\alpha,\ell,r\in\mathbb{R}italic_α , roman_ℓ , italic_r ∈ blackboard_R. Hence

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd12Λα((,r))𝟙L(x,v,t)(α)𝟙𝖠(γ(x,v)(α))𝟙𝖡(γ(x,v)(Λα(θ)))𝟙L(t)(x)𝟙Λα((,r))(0)Leb1(L(x,v,t)(,r))absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscriptsubscriptΛ𝛼𝑟subscript1𝐿𝑥𝑣𝑡𝛼subscript1𝖠subscript𝛾𝑥𝑣𝛼subscript1𝖡subscript𝛾𝑥𝑣subscriptΛ𝛼𝜃subscript1𝐿𝑡𝑥subscript1subscriptΛ𝛼𝑟0subscriptLeb1𝐿𝑥𝑣𝑡𝑟\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{\Lambda_{\alpha}((\ell,r))}\frac{\mathbbm{1}_{L(x% ,v,t)}\left(\alpha\right)\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\alpha)% \right)\mathbbm{1}_{\mathsf{B}}\left(\gamma_{(x,v)}\big{(}\Lambda_{\alpha}(% \theta)\big{)}\right)\mathbbm{1}_{L(t)}(x)\mathbbm{1}_{\Lambda_{\alpha}((\ell,% r))}(0)}{\mathrm{Leb}_{1}(L(x,v,t)\cap(\ell,r))}= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT divide start_ARG blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT ( italic_α ) blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) ) blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT ( 0 ) end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG
×QL(T(α)(x,v),t)λα(,r)(0,dθ)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα).absentsuperscriptsubscript𝑄𝐿superscript𝑇𝛼𝑥𝑣𝑡subscriptλ𝛼𝑟0d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 113.81102pt\times Q_{L(T^{(\alpha)}(x,v),t)}^{\uplambda_{% \alpha}(\ell,r)}(0,{\rm d}\theta)\ \xi_{L(x,v,t)}^{(0)}\big{(}{\rm d}(\ell,r)% \big{)}\ \sigma_{d-1}^{(x)}({\rm d}v)\ \nu_{\mathfrak{g}}({\rm d}x)\ \mathrm{% Leb}_{1}({\rm d}\alpha).× italic_Q start_POSTSUBSCRIPT italic_L ( italic_T start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( italic_x , italic_v ) , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUPERSCRIPT ( 0 , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α ) .

Note that due to Remark 14 and Remark 17, we may apply Lemma 16, such that we obtain

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd12(,r)𝟙L(x,v,t)(α)𝟙𝖠(γ(x,v)(α))𝟙𝖡(γ(x,v)(θ))1Leb1(L(x,v,t)(,r))absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝐿𝑥𝑣𝑡𝛼subscript1𝖠subscript𝛾𝑥𝑣𝛼subscript1𝖡subscript𝛾𝑥𝑣𝜃1subscriptLeb1𝐿𝑥𝑣𝑡𝑟\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{(\ell,r)}\mathbbm{1}_{L(x,v,t)}\left(\alpha\right% )\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\alpha)\right)\mathbbm{1}_{% \mathsf{B}}\left(\gamma_{(x,v)}(\theta)\right)\frac{1}{\mathrm{Leb}_{1}(L(x,v,% t)\cap(\ell,r))}= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT ( italic_α ) blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) divide start_ARG 1 end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG
𝟙L(t)(x)𝟙Λα((,r))(0)QL(x,v,t),r(α,dθ)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα)subscript1𝐿𝑡𝑥subscript1subscriptΛ𝛼𝑟0superscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟𝛼d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 56.9055pt\mathbbm{1}_{L(t)}(x)\mathbbm{1}_{\Lambda_{\alpha% }((\ell,r))}(0)\ Q_{L(x,v,t)}^{\ell,r}(\alpha,{\rm d}\theta)\ \xi_{L(x,v,t)}^{% (0)}\big{(}{\rm d}(\ell,r)\big{)}\ \sigma_{d-1}^{(x)}({\rm d}v)\ \nu_{% \mathfrak{g}}({\rm d}x)\ \mathrm{Leb}_{1}({\rm d}\alpha)blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( ( roman_ℓ , italic_r ) ) end_POSTSUBSCRIPT ( 0 ) italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_α , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α )
=𝖬𝕊xd12(,r)𝟙L(x,v,t)(α)𝟙𝖠(γ(x,v)(α))𝟙𝖡(γ(x,v)(θ))1Leb1(L(x,v,t)(,r))absentsubscriptsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript2subscript𝑟subscript1𝐿𝑥𝑣𝑡𝛼subscript1𝖠subscript𝛾𝑥𝑣𝛼subscript1𝖡subscript𝛾𝑥𝑣𝜃1subscriptLeb1𝐿𝑥𝑣𝑡𝑟\displaystyle\quad=\int_{\mathbb{R}}\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1% }}\int_{\mathbb{R}^{2}}\int_{(\ell,r)}\mathbbm{1}_{L(x,v,t)}\left(\alpha\right% )\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\alpha)\right)\mathbbm{1}_{% \mathsf{B}}\left(\gamma_{(x,v)}(\theta)\right)\frac{1}{\mathrm{Leb}_{1}(L(x,v,% t)\cap(\ell,r))}= ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT ( italic_α ) blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) divide start_ARG 1 end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG
𝟙L(t)(x)𝟙(,r)(α)QL(x,v,t),r(α,dθ)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx)Leb1(dα)subscript1𝐿𝑡𝑥subscript1𝑟𝛼superscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟𝛼d𝜃superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥subscriptLeb1d𝛼\displaystyle\hskip 56.9055pt\mathbbm{1}_{L(t)}(x)\mathbbm{1}_{(\ell,r)}(% \alpha)\ Q_{L(x,v,t)}^{\ell,r}(\alpha,{\rm d}\theta)\ \xi_{L(x,v,t)}^{(0)}\big% {(}{\rm d}(\ell,r)\big{)}\ \sigma_{d-1}^{(x)}({\rm d}v)\ \nu_{\mathfrak{g}}({% \rm d}x)\ \mathrm{Leb}_{1}({\rm d}\alpha)blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( italic_α ) italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_α , roman_d italic_θ ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α )
=𝖬𝕊xd121Leb1(L(x,v,t)(,r))L(x,v,t)(,r)(,r)𝟙𝖠(γ(x,v)(α))𝟙𝖡(γ(x,v)(θ))absentsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript21subscriptLeb1𝐿𝑥𝑣𝑡𝑟subscript𝐿𝑥𝑣𝑡𝑟subscript𝑟subscript1𝖠subscript𝛾𝑥𝑣𝛼subscript1𝖡subscript𝛾𝑥𝑣𝜃\displaystyle\quad=\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1}}\int_{\mathbb{R% }^{2}}\frac{1}{\mathrm{Leb}_{1}(L(x,v,t)\cap(\ell,r))}\int_{L(x,v,t)\cap(\ell,% r)}\int_{(\ell,r)}\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\alpha)\right)% \mathbbm{1}_{\mathsf{B}}\left(\gamma_{(x,v)}(\theta)\right)= ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG ∫ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) )
𝟙L(t)(x)QL(x,v,t),r(α,dθ)Leb1(dα)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx).subscript1𝐿𝑡𝑥superscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟𝛼d𝜃subscriptLeb1d𝛼superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥\displaystyle\hskip 56.9055pt\mathbbm{1}_{L(t)}(x)\ Q_{L(x,v,t)}^{\ell,r}(% \alpha,{\rm d}\theta)\ \mathrm{Leb}_{1}({\rm d}\alpha)\ \xi_{L(x,v,t)}^{(0)}% \big{(}{\rm d}(\ell,r)\big{)}\ \sigma_{d-1}^{(x)}({\rm d}v)\ \nu_{\mathfrak{g}% }({\rm d}x).blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_α , roman_d italic_θ ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) .

This expression is symmetric in 𝖠𝖠\mathsf{A}sansserif_A and 𝖡𝖡\mathsf{B}sansserif_B, because of the reversibility of the shrinkage kernel. Namely Lemma 15 yields

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x )
=𝖬𝕊xd121Leb1(L(x,v,t)(,r))L(x,v,t)(,r)(,r)𝟙𝖠(γ(x,v)(θ))𝟙𝖡(γ(x,v)(α))absentsubscript𝖬subscriptsuperscriptsubscript𝕊𝑥𝑑1subscriptsuperscript21subscriptLeb1𝐿𝑥𝑣𝑡𝑟subscript𝐿𝑥𝑣𝑡𝑟subscript𝑟subscript1𝖠subscript𝛾𝑥𝑣𝜃subscript1𝖡subscript𝛾𝑥𝑣𝛼\displaystyle\quad=\int_{\mathsf{M}}\int_{\mathbb{S}_{x}^{d-1}}\int_{\mathbb{R% }^{2}}\frac{1}{\mathrm{Leb}_{1}(L(x,v,t)\cap(\ell,r))}\int_{L(x,v,t)\cap(\ell,% r)}\int_{(\ell,r)}\mathbbm{1}_{\mathsf{A}}\left(\gamma_{(x,v)}(\theta)\right)% \mathbbm{1}_{\mathsf{B}}\left(\gamma_{(x,v)}(\alpha)\right)= ∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) ) end_ARG ∫ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) ∩ ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) ) blackboard_1 start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT ( italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_α ) )
𝟙L(t)(x)QL(x,v,t),r(α,dθ)Leb1(dα)ξL(x,v,t)(0)(d(,r))σd1(x)(dv)ν𝔤(dx).subscript1𝐿𝑡𝑥superscriptsubscript𝑄𝐿𝑥𝑣𝑡𝑟𝛼d𝜃subscriptLeb1d𝛼superscriptsubscript𝜉𝐿𝑥𝑣𝑡0d𝑟superscriptsubscript𝜎𝑑1𝑥d𝑣subscript𝜈𝔤d𝑥\displaystyle\hskip 56.9055pt\mathbbm{1}_{L(t)}(x)\ Q_{L(x,v,t)}^{\ell,r}(% \alpha,{\rm d}\theta)\ \mathrm{Leb}_{1}({\rm d}\alpha)\ \xi_{L(x,v,t)}^{(0)}% \big{(}{\rm d}(\ell,r)\big{)}\ \sigma_{d-1}^{(x)}({\rm d}v)\ \nu_{\mathfrak{g}% }({\rm d}x).blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) italic_Q start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ , italic_r end_POSTSUPERSCRIPT ( italic_α , roman_d italic_θ ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_α ) italic_ξ start_POSTSUBSCRIPT italic_L ( italic_x , italic_v , italic_t ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT ( roman_d italic_v ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) .

Consequently,

L(t)𝖠Kt(x,𝖡)ν𝔤(dx)=L(t)𝖡Kt(x,𝖠)ν𝔤(dx).subscript𝐿𝑡𝖠subscript𝐾𝑡𝑥𝖡subscript𝜈𝔤d𝑥subscript𝐿𝑡𝖡subscript𝐾𝑡𝑥𝖠subscript𝜈𝔤d𝑥\displaystyle\int_{L(t)\cap\mathsf{A}}K_{t}(x,\mathsf{B})\ \nu_{\mathfrak{g}}(% {\rm d}x)=\int_{L(t)\cap\mathsf{B}}K_{t}(x,\mathsf{A})\ \nu_{\mathfrak{g}}({% \rm d}x).∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_A end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_B ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) = ∫ start_POSTSUBSCRIPT italic_L ( italic_t ) ∩ sansserif_B end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x , sansserif_A ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) .

Acknowledgments

Data were kindly provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

We thank Rudrasis Chakraborty and Clément Mantoux for interesting discussions regarding the experiments on video actions and network data. We thank Eric Moulines, Randal Douc and Philip Schär for fruitful conversations regarding the theoretical part of this paper. MH and DR gratefully acknowledge support of the DFG within project 32680300 – SFB 1456 subproject B02. MH expresses her gratitude for the hospitality of École Polytechnique.

Appendix

Appendix A Properties of the stepping-out procedure

In this section we provide the proofs of Lemma 11 and Lemma 12. To this end, we suppose that we are in the setting of Section 4.1 summarized as follows:

Setting C.

Fix w(0,)𝑤0w\in(0,\infty)italic_w ∈ ( 0 , ∞ ) and m𝑚m\in\mathbb{N}italic_m ∈ blackboard_N, and let 𝖲()𝖲\mathsf{S}\in\mathcal{B}(\mathbb{R})sansserif_S ∈ caligraphic_B ( blackboard_R ). For all θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R define the sequences (Li(θ))isubscriptsuperscriptsubscript𝐿𝑖𝜃𝑖(L_{i}^{(\theta)})_{i\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT and (Rj(θ))jsubscriptsuperscriptsubscript𝑅𝑗𝜃𝑗(R_{j}^{(\theta)})_{j\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ blackboard_N end_POSTSUBSCRIPT as in (18), and the stopping times τ𝖲(θ)superscriptsubscript𝜏𝖲𝜃\tau_{\mathsf{S}}^{(\theta)}italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT and 𝔗𝖲(θ)superscriptsubscript𝔗𝖲𝜃\mathfrak{T}_{\mathsf{S}}^{(\theta)}fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT as in (21).

In the proof of both lemmas we push a corresponding property of the sequences (Li(θ))isubscriptsuperscriptsubscript𝐿𝑖𝜃𝑖(L_{i}^{(\theta)})_{i\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT and (Rj(θ))jsubscriptsuperscriptsubscript𝑅𝑗𝜃𝑗(R_{j}^{(\theta)})_{j\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ blackboard_N end_POSTSUBSCRIPT to the stopped sequences. Observe that it is relatively simple to establish the required properties for (Li(θ))isubscriptsuperscriptsubscript𝐿𝑖𝜃𝑖(L_{i}^{(\theta)})_{i\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT and (Rj(θ))jsubscriptsuperscriptsubscript𝑅𝑗𝜃𝑗(R_{j}^{(\theta)})_{j\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ blackboard_N end_POSTSUBSCRIPT, because their joint distributions are available explicitly.

Lemma 18.

Assume we are in Setting C. Let i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N and θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R. We have for all 𝖠1,,𝖠i,𝖡1,,𝖡j()subscript𝖠1subscript𝖠𝑖subscript𝖡1subscript𝖡𝑗\mathsf{A}_{1},\ldots,\mathsf{A}_{i},\mathsf{B}_{1},\ldots,\mathsf{B}_{j}\in% \mathcal{B}(\mathbb{R})sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ caligraphic_B ( blackboard_R ) that

(L1(θ)𝖠1,,Li(θ)𝖠i,R1(θ)𝖡1,,Rj(θ)𝖡j)formulae-sequencesuperscriptsubscript𝐿1𝜃subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscript𝖡1superscriptsubscript𝑅𝑗𝜃subscript𝖡𝑗\displaystyle\mathbb{P}\Big{(}L_{1}^{(\theta)}\in\mathsf{A}_{1},\ldots,L_{i}^{% (\theta)}\in\mathsf{A}_{i},R_{1}^{(\theta)}\in\mathsf{B}_{1},\ldots,R_{j}^{(% \theta)}\in\mathsf{B}_{j}\Big{)}blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT )
=1w0wk=1i𝟙𝖠k(θu(k1)w)l=1j𝟙𝖡l(θu+lw)Leb1(du).absent1𝑤superscriptsubscript0𝑤superscriptsubscriptproduct𝑘1𝑖subscript1subscript𝖠𝑘𝜃𝑢𝑘1𝑤superscriptsubscriptproduct𝑙1𝑗subscript1subscript𝖡𝑙𝜃𝑢𝑙𝑤subscriptLeb1d𝑢\displaystyle\qquad=\frac{1}{w}\int_{0}^{w}\prod_{k=1}^{i}\mathbbm{1}_{\mathsf% {A}_{k}}\big{(}\theta-u-(k-1)w\big{)}\cdot\prod_{l=1}^{j}\mathbbm{1}_{\mathsf{% B}_{l}}(\theta-u+lw)\ \mathrm{Leb}_{1}({\rm d}u).= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ - italic_u - ( italic_k - 1 ) italic_w ) ⋅ ∏ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ - italic_u + italic_l italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_u ) .
Proof.

Let i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N, θ𝜃\theta\in\mathbb{R}italic_θ ∈ blackboard_R and 𝖠1,,𝖠i,𝖡1,,𝖡j()subscript𝖠1subscript𝖠𝑖subscript𝖡1subscript𝖡𝑗\mathsf{A}_{1},\ldots,\mathsf{A}_{i},\mathsf{B}_{1},\ldots,\mathsf{B}_{j}\in% \mathcal{B}(\mathbb{R})sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ caligraphic_B ( blackboard_R ). Then, as ΥUnif([0,w])similar-toΥUnif0𝑤\Upsilon\sim\mathrm{Unif}([0,w])roman_Υ ∼ roman_Unif ( [ 0 , italic_w ] ), we have

(L1(θ)𝖠1,,Li(θ)𝖠i,R1(θ)𝖡1,,Rj(θ)𝖡j)formulae-sequencesuperscriptsubscript𝐿1𝜃subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscript𝖡1superscriptsubscript𝑅𝑗𝜃subscript𝖡𝑗\displaystyle\mathbb{P}\Big{(}L_{1}^{(\theta)}\in\mathsf{A}_{1},\ldots,L_{i}^{% (\theta)}\in\mathsf{A}_{i},R_{1}^{(\theta)}\in\mathsf{B}_{1},\ldots,R_{j}^{(% \theta)}\in\mathsf{B}_{j}\Big{)}blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT )
=(θΥ(k1)w𝖠k,k{1,,i},θΥ+lw𝖡l,l{1,,j})absentformulae-sequence𝜃Υ𝑘1𝑤subscript𝖠𝑘formulae-sequence𝑘1𝑖formulae-sequence𝜃Υ𝑙𝑤subscript𝖡𝑙𝑙1𝑗\displaystyle\qquad=\mathbb{P}\Big{(}\theta-\Upsilon-(k-1)w\in\mathsf{A}_{k},% \,k\in\{1,\ldots,i\},\ \theta-\Upsilon+lw\in\mathsf{B}_{l},\,l\in\{1,\ldots,j% \}\Big{)}= blackboard_P ( italic_θ - roman_Υ - ( italic_k - 1 ) italic_w ∈ sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k ∈ { 1 , … , italic_i } , italic_θ - roman_Υ + italic_l italic_w ∈ sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_l ∈ { 1 , … , italic_j } )
=1w0wk=1i𝟙𝖠k(θu(k1)w)l=1j𝟙𝖡l(θu+lw)Leb1(du).absent1𝑤superscriptsubscript0𝑤superscriptsubscriptproduct𝑘1𝑖subscript1subscript𝖠𝑘𝜃𝑢𝑘1𝑤superscriptsubscriptproduct𝑙1𝑗subscript1subscript𝖡𝑙𝜃𝑢𝑙𝑤subscriptLeb1d𝑢\displaystyle\qquad=\frac{1}{w}\int_{0}^{w}\prod_{k=1}^{i}\mathbbm{1}_{\mathsf% {A}_{k}}(\theta-u-(k-1)w)\cdot\prod_{l=1}^{j}\mathbbm{1}_{\mathsf{B}_{l}}(% \theta-u+lw)\ \mathrm{Leb}_{1}({\rm d}u).= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ - italic_u - ( italic_k - 1 ) italic_w ) ⋅ ∏ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ - italic_u + italic_l italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_u ) .

We present partitions of certain events that facilitate the extension of properties of the sequences (Li(θ))isubscriptsuperscriptsubscript𝐿𝑖𝜃𝑖(L_{i}^{(\theta)})_{i\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT and (Rj(θ))jsubscriptsuperscriptsubscript𝑅𝑗𝜃𝑗(R_{j}^{(\theta)})_{j\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ blackboard_N end_POSTSUBSCRIPT to the stepping-out distribution.

Lemma 19.

Suppose we are in Setting C. Then for all θ,α𝜃𝛼\theta,\alpha\in\mathbb{R}italic_θ , italic_α ∈ blackboard_R we have

  1. 1.

    {Li(θ)<α<Rj(θ)}=k=0i1{Lk+1(θ)<α<Lk(θ)}l=1j1{Rl(θ)<α<Rl+1(θ)}k=1i1{Lk(θ)=α}l=1i1{Rl(θ)=α} for all i,j,superscriptsubscript𝐿𝑖𝜃𝛼superscriptsubscript𝑅𝑗𝜃absentsquare-unionsuperscriptsubscriptsquare-union𝑘0𝑖1superscriptsubscript𝐿𝑘1𝜃𝛼superscriptsubscript𝐿𝑘𝜃superscriptsubscriptsquare-union𝑙1𝑗1superscriptsubscript𝑅𝑙𝜃𝛼superscriptsubscript𝑅𝑙1𝜃missing-subexpressionsquare-unionsquare-unionsuperscriptsubscriptsquare-union𝑘1𝑖1superscriptsubscript𝐿𝑘𝜃𝛼superscriptsubscriptsquare-union𝑙1𝑖1superscriptsubscript𝑅𝑙𝜃𝛼 for all 𝑖𝑗\begin{aligned} \left\{L_{i}^{(\theta)}<\alpha<R_{j}^{(\theta)}\right\}&=% \bigsqcup_{k=0}^{i-1}\left\{L_{k+1}^{(\theta)}<\alpha<L_{k}^{(\theta)}\right\}% \ \sqcup\ \bigsqcup_{l=1}^{j-1}\left\{R_{l}^{(\theta)}<\alpha<R_{l+1}^{(\theta% )}\right\}\\ &\qquad\sqcup\ \bigsqcup_{k=1}^{i-1}\left\{L_{k}^{(\theta)}=\alpha\right\}\ % \sqcup\ \bigsqcup_{l=1}^{i-1}\left\{R_{l}^{(\theta)}=\alpha\right\}\text{ for % all }i,j\in\mathbb{N},\end{aligned}start_ROW start_CELL { italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT } end_CELL start_CELL = ⨆ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT { italic_L start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT } ⊔ ⨆ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT { italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_R start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT } end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ⊔ ⨆ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT { italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α } ⊔ ⨆ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT { italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α } for all italic_i , italic_j ∈ blackboard_N , end_CELL end_ROW

  2. 2.

    {𝑳𝖲(θ)<α<L0(θ)}=i=1m{Lτ𝖲(θ)i+1(θ)<α<Lτ𝖲(θ)i(θ),τ𝖲(θ)i}i=1m1{Li(θ)=α,τ𝖲(θ)i+1},\begin{aligned} \{\boldsymbol{L}_{\mathsf{S}}^{(\theta)}<\alpha<L_{0}^{(\theta% )}\}&=\bigsqcup_{i=1}^{m}\{L_{\tau_{\mathsf{S}}^{(\theta)}-i+1}^{(\theta)}<% \alpha<L_{\tau_{\mathsf{S}}^{(\theta)}-i}^{(\theta)}\,,\ \tau_{\mathsf{S}}^{(% \theta)}\geqslant i\}\\ &\qquad\sqcup\ \bigsqcup_{i^{\prime}=1}^{m-1}\{L_{i^{\prime}}^{(\theta)}=% \alpha,\ \tau_{\mathsf{S}}^{(\theta)}\geqslant i^{\prime}+1\},\end{aligned}start_ROW start_CELL { bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT } end_CELL start_CELL = ⨆ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_i } end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ⊔ ⨆ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT { italic_L start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 } , end_CELL end_ROW

  3. 3.

    {R1(θ)<α<𝑹𝖲(θ)}=j=1m1{R𝔗𝖲(θ)j(θ)<α<R𝔗𝖲(θ)j+1(θ),𝔗𝖲(θ)j+1}j=2m1{Rj(θ)=α,𝔗𝖲(θ)j+1},\begin{aligned} \{R_{1}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta% )}\}&=\bigsqcup_{j=1}^{m-1}\{R_{\mathfrak{T}_{\mathsf{S}}^{(\theta)}-j}^{(% \theta)}<\alpha<R_{\mathfrak{T}_{\mathsf{S}}^{(\theta)}-j+1}^{(\theta)}\,,\ % \mathfrak{T}_{\mathsf{S}}^{(\theta)}\geqslant j+1\}\\ &\qquad\sqcup\ \bigsqcup_{j^{\prime}=2}^{m-1}\left\{R_{j^{\prime}}^{(\theta)}=% \alpha,\mathfrak{T}_{\mathsf{S}}^{(\theta)}\geqslant j^{\prime}+1\right\},\end% {aligned}start_ROW start_CELL { italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT } end_CELL start_CELL = ⨆ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT { italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_j + 1 } end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ⊔ ⨆ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT { italic_R start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 } , end_CELL end_ROW

  4. 4.

    (Ln(θ)=α)=(Rn(θ)=α)=0superscriptsubscript𝐿𝑛𝜃𝛼superscriptsubscript𝑅𝑛𝜃𝛼0\mathbb{P}\left(L_{n}^{(\theta)}=\alpha\right)=\mathbb{P}\left(R_{n}^{(\theta)% }=\alpha\right)=0blackboard_P ( italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α ) = blackboard_P ( italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α ) = 0 for all n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N.

Proof.

To 1: Observe that the monotonicity property (20) of (Lk(θ))ksubscriptsuperscriptsubscript𝐿𝑘𝜃𝑘(L_{k}^{(\theta)})_{k\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ blackboard_N end_POSTSUBSCRIPT and (Rl(θ))lsubscriptsuperscriptsubscript𝑅𝑙𝜃𝑙(R_{l}^{(\theta)})_{l\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_l ∈ blackboard_N end_POSTSUBSCRIPT implies the statement.

To 2: Let ω{𝑳𝖲(θ)<α<L0(θ)}ωsuperscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝐿0𝜃\upomega\in\{\boldsymbol{L}_{\mathsf{S}}^{(\theta)}<\alpha<L_{0}^{(\theta)}\}roman_ω ∈ { bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT }. Then there exists

k^^𝑘\displaystyle\widehat{k}over^ start_ARG italic_k end_ARG {0,,τ𝖲(θ)(ω)1} such that Lk^+1(θ)(ω)<α<Lk^(θ)(ω), orformulae-sequenceabsent0superscriptsubscript𝜏𝖲𝜃ω1 such that superscriptsubscript𝐿^𝑘1𝜃ω𝛼superscriptsubscript𝐿^𝑘𝜃ω or\displaystyle\in\{0,\ldots,\tau_{\mathsf{S}}^{(\theta)}(\upomega)-1\}\text{ % such that }L_{\widehat{k}+1}^{(\theta)}(\upomega)<\alpha<L_{\widehat{k}}^{(% \theta)}(\upomega),\text{ or}∈ { 0 , … , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - 1 } such that italic_L start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) < italic_α < italic_L start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) , or
ksuperscript𝑘\displaystyle k^{\prime}italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT {1,,τ𝖲(θ)(ω)1} such that Lk(θ)(ω)=α.absent1superscriptsubscript𝜏𝖲𝜃ω1 such that superscriptsubscript𝐿superscript𝑘𝜃ω𝛼\displaystyle\in\{1,\ldots,\tau_{\mathsf{S}}^{(\theta)}(\upomega)-1\}\text{ % such that }L_{k^{\prime}}^{(\theta)}(\upomega)=\alpha.∈ { 1 , … , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - 1 } such that italic_L start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) = italic_α .

In the first case, choosing k=τ𝖲(θ)(ω)k^𝑘superscriptsubscript𝜏𝖲𝜃ω^𝑘k=\tau_{\mathsf{S}}^{(\theta)}(\upomega)-\widehat{k}italic_k = italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - over^ start_ARG italic_k end_ARG and taking (22) into account, we get k{1,,m}𝑘1𝑚k\in\{1,\ldots,m\}italic_k ∈ { 1 , … , italic_m }, kτ𝖲(θ)(ω)𝑘superscriptsubscript𝜏𝖲𝜃ωk\leqslant\tau_{\mathsf{S}}^{(\theta)}(\upomega)italic_k ⩽ italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) and Lτ𝖲(θ)k+1(θ)(ω)<α<Lτ𝖲(θ)k(θ)(ω)superscriptsubscript𝐿superscriptsubscript𝜏𝖲𝜃𝑘1𝜃ω𝛼superscriptsubscript𝐿superscriptsubscript𝜏𝖲𝜃𝑘𝜃ωL_{\tau_{\mathsf{S}}^{(\theta)}-k+1}^{(\theta)}(\upomega)<\alpha<L_{\tau_{% \mathsf{S}}^{(\theta)}-k}^{(\theta)}(\upomega)italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_k + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) < italic_α < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ). Or, in the second case, k{1,,m1}superscript𝑘1𝑚1k^{\prime}\in\{1,\ldots,m-1\}italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ { 1 , … , italic_m - 1 }, k+1τ𝖲(θ)(ω)superscript𝑘1superscriptsubscript𝜏𝖲𝜃ωk^{\prime}+1\leqslant\tau_{\mathsf{S}}^{(\theta)}(\upomega)italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 ⩽ italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) and Lk(θ)(ω)=αsuperscriptsubscript𝐿superscript𝑘𝜃ω𝛼L_{k^{\prime}}^{(\theta)}(\upomega)=\alphaitalic_L start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) = italic_α. Hence

ωi=1m{Lτ𝖲(θ)i+1(θ)<α<Lτ𝖲(θ)i(θ),τ𝖲(θ)i}i=1m1{Li(θ)=α,τ𝖲(θ)i+1}.\upomega\in\bigsqcup_{i=1}^{m}\{L_{\tau_{\mathsf{S}}^{(\theta)}-i+1}^{(\theta)% }<\alpha<L_{\tau_{\mathsf{S}}^{(\theta)}-i}^{(\theta)}\,,\ \tau_{\mathsf{S}}^{% (\theta)}\geqslant i\}\ \sqcup\ \bigsqcup_{i^{\prime}=1}^{m-1}\{L_{i^{\prime}}% ^{(\theta)}=\alpha,\tau_{\mathsf{S}}^{(\theta)}\geqslant i^{\prime}+1\}.roman_ω ∈ ⨆ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_i } ⊔ ⨆ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT { italic_L start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 } .

Conversely, let there exists

i𝑖\displaystyle iitalic_i {1,,m} such that ω{Lτ𝖲(θ)i+1(θ)<α<Lτ𝖲(θ)i(θ),τ𝖲(θ)i}, or\displaystyle\in\{1,\ldots,m\}\text{ such that }\upomega\in\{L_{\tau_{\mathsf{% S}}^{(\theta)}-i+1}^{(\theta)}<\alpha<L_{\tau_{\mathsf{S}}^{(\theta)}-i}^{(% \theta)}\,,\ \tau_{\mathsf{S}}^{(\theta)}\geqslant i\},\text{ or}∈ { 1 , … , italic_m } such that roman_ω ∈ { italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_i } , or
isuperscript𝑖\displaystyle i^{\prime}italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT {1,,m1} such that {Li(θ)=α,τ𝖲(θ)i+1}.absent1𝑚1 such that formulae-sequencesuperscriptsubscript𝐿superscript𝑖𝜃𝛼superscriptsubscript𝜏𝖲𝜃superscript𝑖1\displaystyle\in\{1,\ldots,m-1\}\text{ such that }\{L_{i^{\prime}}^{(\theta)}=% \alpha,\tau_{\mathsf{S}}^{(\theta)}\geqslant i^{\prime}+1\}.∈ { 1 , … , italic_m - 1 } such that { italic_L start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_i start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 } .

Then clearly ω{𝑳𝖲(θ)<α<L0(θ)}ωsuperscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝐿0𝜃\upomega\in\{\boldsymbol{L}_{\mathsf{S}}^{(\theta)}<\alpha<L_{0}^{(\theta)}\}roman_ω ∈ { bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT }.

To 3: We apply similar arguments as for the statement 2. Let ω{R1(θ)<α<𝑹𝖲(θ)}ωsuperscriptsubscript𝑅1𝜃𝛼superscriptsubscript𝑹𝖲𝜃\upomega\in\{R_{1}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\}roman_ω ∈ { italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT }. Then there exists

k^^𝑘\displaystyle\widehat{k}over^ start_ARG italic_k end_ARG {1,,𝔗𝖲(θ)(ω)1} such that Rk^(θ)(ω)<α<Rk^+1(θ)(ω), orformulae-sequenceabsent1superscriptsubscript𝔗𝖲𝜃ω1 such that superscriptsubscript𝑅^𝑘𝜃ω𝛼superscriptsubscript𝑅^𝑘1𝜃ω or\displaystyle\in\{1,\ldots,\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)-1\}% \text{ such that }R_{\widehat{k}}^{(\theta)}(\upomega)<\alpha<R_{\widehat{k}+1% }^{(\theta)}(\upomega),\text{ or}∈ { 1 , … , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - 1 } such that italic_R start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) < italic_α < italic_R start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) , or
ksuperscript𝑘\displaystyle k^{\prime}italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT {2,,𝔗𝖲(θ)(ω)1} such that Rk(θ)(ω)=α.absent2superscriptsubscript𝔗𝖲𝜃ω1 such that superscriptsubscript𝑅superscript𝑘𝜃ω𝛼\displaystyle\in\{2,\ldots,\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)-1\}% \text{ such that }R_{k^{\prime}}^{(\theta)}(\upomega)=\alpha.∈ { 2 , … , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - 1 } such that italic_R start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) = italic_α .

In the first case, again choosing k=𝔗𝖲(θ)(ω)k^𝑘superscriptsubscript𝔗𝖲𝜃ω^𝑘k=\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)-\widehat{k}italic_k = fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - over^ start_ARG italic_k end_ARG and observing (22), we get k{1,,m1}𝑘1𝑚1k\in\{1,\ldots,m-1\}italic_k ∈ { 1 , … , italic_m - 1 }, k+1𝔗𝖲(θ)(ω)𝑘1superscriptsubscript𝔗𝖲𝜃ωk+1\leqslant\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)italic_k + 1 ⩽ fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) and R𝔗𝖲(θ)k(θ)(ω)<α<R𝔗𝖲(θ)k+1(θ)(ω)superscriptsubscript𝑅superscriptsubscript𝔗𝖲𝜃𝑘𝜃ω𝛼superscriptsubscript𝑅superscriptsubscript𝔗𝖲𝜃𝑘1𝜃ωR_{\mathfrak{T}_{\mathsf{S}}^{(\theta)}-k}^{(\theta)}(\upomega)<\alpha<R_{% \mathfrak{T}_{\mathsf{S}}^{(\theta)}-k+1}^{(\theta)}(\upomega)italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) < italic_α < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_k + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ). Or, in the second case, we get k{2,,m1}superscript𝑘2𝑚1k^{\prime}\in\{2,\ldots,m-1\}italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ { 2 , … , italic_m - 1 }, k+1𝔗𝖲(θ)(ω)superscript𝑘1superscriptsubscript𝔗𝖲𝜃ωk^{\prime}+1\leqslant\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 ⩽ fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) and Rk(θ)(ω)=αsuperscriptsubscript𝑅superscript𝑘𝜃ω𝛼R_{k^{\prime}}^{(\theta)}(\upomega)=\alphaitalic_R start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) = italic_α. Thus

ωj=1m1{R𝔗𝖲(θ)j(θ)<α<R𝔗𝖲(θ)j+1(θ),𝔗𝖲(θ)j+1}j=2m1{Rj(θ)=α,𝔗𝖲(θ)j+1}.\upomega\in\bigsqcup_{j=1}^{m-1}\{R_{\mathfrak{T}_{\mathsf{S}}^{(\theta)}-j}^{% (\theta)}<\alpha<R_{\mathfrak{T}_{\mathsf{S}}^{(\theta)}-j+1}^{(\theta)}\,,\ % \mathfrak{T}_{\mathsf{S}}^{(\theta)}\geqslant j+1\}\ \sqcup\ \bigsqcup_{j^{% \prime}=2}^{m-1}\left\{R_{j^{\prime}}^{(\theta)}=\alpha,\mathfrak{T}_{\mathsf{% S}}^{(\theta)}\geqslant j^{\prime}+1\right\}.roman_ω ∈ ⨆ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT { italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_j + 1 } ⊔ ⨆ start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT { italic_R start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 } .

Conversely, let

ωω\displaystyle\upomegaroman_ω {R𝔗𝖲(θ)j(θ)<α<R𝔗𝖲(θ)j+1(θ),𝔗𝖲(θ)j+1} for some j{1,,m1}, or\displaystyle\in\{R_{\mathfrak{T}_{\mathsf{S}}^{(\theta)}-j}^{(\theta)}<\alpha% <R_{\mathfrak{T}_{\mathsf{S}}^{(\theta)}-j+1}^{(\theta)},\mathfrak{T}_{\mathsf% {S}}^{(\theta)}\geqslant j+1\}\text{ for some }j\in\{1,\ldots,m-1\},\text{ or}∈ { italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT - italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_j + 1 } for some italic_j ∈ { 1 , … , italic_m - 1 } , or
ωω\displaystyle\upomegaroman_ω {Rj(θ)=α,𝔗𝖲(θ)j+1} for some j{2,,m1}.absentformulae-sequencesuperscriptsubscript𝑅superscript𝑗𝜃𝛼superscriptsubscript𝔗𝖲𝜃superscript𝑗1 for some superscript𝑗2𝑚1\displaystyle\in\left\{R_{j^{\prime}}^{(\theta)}=\alpha,\mathfrak{T}_{\mathsf{% S}}^{(\theta)}\geqslant j^{\prime}+1\right\}\text{ for some }j^{\prime}\in\{2,% \ldots,m-1\}.∈ { italic_R start_POSTSUBSCRIPT italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩾ italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 } for some italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ { 2 , … , italic_m - 1 } .

Then we have ω{R1(θ)<α<𝑹𝖲(θ)}ωsuperscriptsubscript𝑅1𝜃𝛼superscriptsubscript𝑹𝖲𝜃\upomega\in\{R_{1}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\}roman_ω ∈ { italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT }, as 𝔗𝖲(θ)(ω)jj+1j=1superscriptsubscript𝔗𝖲𝜃ω𝑗𝑗1𝑗1\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)-j\geqslant j+1-j=1fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - italic_j ⩾ italic_j + 1 - italic_j = 1 and 𝔗𝖲(θ)(ω)j+1<𝔗𝖲(θ)(ω)1+1=𝔗𝖲(θ)(ω)superscriptsubscript𝔗𝖲𝜃ω𝑗1superscriptsubscript𝔗𝖲𝜃ω11superscriptsubscript𝔗𝖲𝜃ω\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)-j+1<\mathfrak{T}_{\mathsf{S}}^{% (\theta)}(\upomega)-1+1=\mathfrak{T}_{\mathsf{S}}^{(\theta)}(\upomega)fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - italic_j + 1 < fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ) - 1 + 1 = fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_ω ).

To 4: Let n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N. We have by definition of Ln(θ)superscriptsubscript𝐿𝑛𝜃L_{n}^{(\theta)}italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT that

(Ln(θ)=α)superscriptsubscript𝐿𝑛𝜃𝛼\displaystyle\mathbb{P}\left(L_{n}^{(\theta)}=\alpha\right)blackboard_P ( italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_α ) =1w0w𝟙{α}(θu(n1)w)Leb1(du)absent1𝑤superscriptsubscript0𝑤subscript1𝛼𝜃𝑢𝑛1𝑤subscriptLeb1d𝑢\displaystyle=\frac{1}{w}\int_{0}^{w}\mathbbm{1}_{\{\alpha\}}\big{(}\theta-u-(% n-1)w\big{)}\ \mathrm{Leb}_{1}({\rm d}u)= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT { italic_α } end_POSTSUBSCRIPT ( italic_θ - italic_u - ( italic_n - 1 ) italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_u )
1wLeb1({θα(n1)w})=0.absent1𝑤subscriptLeb1𝜃𝛼𝑛1𝑤0\displaystyle\leqslant\frac{1}{w}\ \mathrm{Leb}_{1}\big{(}\{\theta-\alpha-(n-1% )w\}\big{)}=0.⩽ divide start_ARG 1 end_ARG start_ARG italic_w end_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( { italic_θ - italic_α - ( italic_n - 1 ) italic_w } ) = 0 .

The statement for Rn(θ)superscriptsubscript𝑅𝑛𝜃R_{n}^{(\theta)}italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT follows analogously. ∎

The following lemma collects the properties of (Li(θ))isubscriptsuperscriptsubscript𝐿𝑖𝜃𝑖(L_{i}^{(\theta)})_{i\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT and (Rj(θ))jsubscriptsuperscriptsubscript𝑅𝑗𝜃𝑗(R_{j}^{(\theta)})_{j\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ blackboard_N end_POSTSUBSCRIPT and the stopped sequences which lead to the proof of Lemma 11.

Lemma 20.

Assume we are in Setting C and let θ,α𝜃𝛼\theta,\alpha\in\mathbb{R}italic_θ , italic_α ∈ blackboard_R.

  1. 1.

    Let i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N and 𝖠1,,𝖠i,𝖡1,,𝖡j()subscript𝖠1subscript𝖠𝑖subscript𝖡1subscript𝖡𝑗\mathsf{A}_{1},\ldots,\mathsf{A}_{i},\mathsf{B}_{1},\ldots,\mathsf{B}_{j}\in% \mathcal{B}(\mathbb{R})sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ caligraphic_B ( blackboard_R ). We have for all q{0,,i1}𝑞0𝑖1q\in\{0,\ldots,i-1\}italic_q ∈ { 0 , … , italic_i - 1 } that

    (L1(θ)𝖠1,,Li(θ)𝖠i,R1(θ)𝖡1,,Rj(θ)𝖡j,Lq+1(θ)<α<Lq(θ))formulae-sequencesuperscriptsubscript𝐿1𝜃subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscript𝖡1formulae-sequencesuperscriptsubscript𝑅𝑗𝜃subscript𝖡𝑗superscriptsubscript𝐿𝑞1𝜃𝛼superscriptsubscript𝐿𝑞𝜃\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\mathsf{A}_{1},\ldots,L_{i}^{(% \theta)}\in\mathsf{A}_{i},R_{1}^{(\theta)}\in\mathsf{B}_{1},\ldots,R_{j}^{(% \theta)}\in\mathsf{B}_{j},L_{q+1}^{(\theta)}<\alpha<L_{q}^{(\theta)}\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
    =(Rq(α)𝖠1,,R1(α)𝖠q,L1(α)𝖠q+1,,Liq(α)𝖠i,\displaystyle\quad=\mathbb{P}\left(R_{q}^{(\alpha)}\in\mathsf{A}_{1},\ldots,R_% {1}^{(\alpha)}\in\mathsf{A}_{q},L_{1}^{(\alpha)}\in\mathsf{A}_{q+1},\ldots,L_{% i-q}^{(\alpha)}\in\mathsf{A}_{i},\right.= blackboard_P ( italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ,
    Rq+1(α)𝖡1,,Rj+q(α)𝖡j,Rq(α)<θ<Rq+1(α)).\displaystyle\quad\qquad\left.R_{q+1}^{(\alpha)}\in\mathsf{B}_{1},\ldots,R_{j+% q}^{(\alpha)}\in\mathsf{B}_{j},R_{q}^{(\alpha)}<\theta<R_{q+1}^{(\alpha)}% \right).italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j + italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ) .

    Observe that for q=0𝑞0q=0italic_q = 0 we use convention (19).

  2. 2.

    Let i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N. For all 𝖠,𝖡()𝖠𝖡\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ) and q{0,,i1}𝑞0𝑖1q\in\{0,\ldots,i-1\}italic_q ∈ { 0 , … , italic_i - 1 } we have

    (𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,Lq+1(θ)<α<Lq(θ),τ𝖲(θ)=i,𝔗𝖲(θ)=j)\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},L_{q+1}^{(\theta)}<% \alpha<L_{q}^{(\theta)},\tau_{\mathsf{S}}^{(\theta)}=i,\mathfrak{T}_{\mathsf{S% }}^{(\theta)}=j\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j )
    =(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Rq(α)<θ<Rq+1(α),τ𝖲(α)=iq,𝔗𝖲(α)=j+q).\displaystyle\qquad=\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in% \mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},R_{q}^{(\alpha)% }<\theta<R_{q+1}^{(\alpha)},\tau_{\mathsf{S}}^{(\alpha)}=i-q,\mathfrak{T}_{% \mathsf{S}}^{(\alpha)}=j+q\right).= blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i - italic_q , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_j + italic_q ) .
  3. 3.

    For all 𝖠,𝖡()𝖠𝖡\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ) holds

    (𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
    =(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,𝑳𝖲(α)<θ<𝑹𝖲(α)).absentformulae-sequencesuperscriptsubscript𝑳𝖲𝛼𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝛼𝖡superscriptsubscript𝑳𝖲𝛼𝜃superscriptsubscript𝑹𝖲𝛼\displaystyle\qquad=\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in% \mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},\boldsymbol{L}_% {\mathsf{S}}^{(\alpha)}<\theta<\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\right).= blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ) .
Proof.

To 1: Let i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N, q{0,,i1}𝑞0𝑖1q\in\{0,\ldots,i-1\}italic_q ∈ { 0 , … , italic_i - 1 } and 𝖠1,,𝖠i,𝖡1,,𝖡j()subscript𝖠1subscript𝖠𝑖subscript𝖡1subscript𝖡𝑗\mathsf{A}_{1},\ldots,\mathsf{A}_{i},\mathsf{B}_{1},\ldots,\mathsf{B}_{j}\in% \mathcal{B}(\mathbb{R})sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ caligraphic_B ( blackboard_R ). By Lemma 18 we have

(L1(θ)𝖠1,,Li(θ)𝖠i,R1(θ)𝖡1,,Rj(θ)𝖡j,Lq+1(θ)<α<Lq(θ))formulae-sequencesuperscriptsubscript𝐿1𝜃subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscript𝖡1formulae-sequencesuperscriptsubscript𝑅𝑗𝜃subscript𝖡𝑗superscriptsubscript𝐿𝑞1𝜃𝛼superscriptsubscript𝐿𝑞𝜃\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\mathsf{A}_{1},\ldots,L_{i}^{(% \theta)}\in\mathsf{A}_{i},R_{1}^{(\theta)}\in\mathsf{B}_{1},\ldots,R_{j}^{(% \theta)}\in\mathsf{B}_{j},L_{q+1}^{(\theta)}<\alpha<L_{q}^{(\theta)}\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=1w0wk=1i𝟙𝖠k(θu(k1)w)l=1j𝟙𝖡l(θu+lw)absent1𝑤superscriptsubscript0𝑤superscriptsubscriptproduct𝑘1𝑖subscript1subscript𝖠𝑘𝜃𝑢𝑘1𝑤superscriptsubscriptproduct𝑙1𝑗subscript1subscript𝖡𝑙𝜃𝑢𝑙𝑤\displaystyle\qquad=\frac{1}{w}\int_{0}^{w}\prod_{k=1}^{i}\mathbbm{1}_{\mathsf% {A}_{k}}\big{(}\theta-u-(k-1)w\big{)}\cdot\prod_{l=1}^{j}\mathbbm{1}_{\mathsf{% B}_{l}}\big{(}\theta-u+lw\big{)}= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ - italic_u - ( italic_k - 1 ) italic_w ) ⋅ ∏ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ - italic_u + italic_l italic_w )
𝟙(,α)(θuqw)𝟙(α,)(θu(q1)w)Leb1(du).absentsubscript1𝛼𝜃𝑢𝑞𝑤subscript1𝛼𝜃𝑢𝑞1𝑤subscriptLeb1d𝑢\displaystyle\qquad\qquad\qquad\qquad\cdot\mathbbm{1}_{(-\infty,\alpha)}(% \theta-u-qw)\mathbbm{1}_{(\alpha,\infty)}\big{(}\theta-u-(q-1)w\big{)}\ % \mathrm{Leb}_{1}({\rm d}u).⋅ blackboard_1 start_POSTSUBSCRIPT ( - ∞ , italic_α ) end_POSTSUBSCRIPT ( italic_θ - italic_u - italic_q italic_w ) blackboard_1 start_POSTSUBSCRIPT ( italic_α , ∞ ) end_POSTSUBSCRIPT ( italic_θ - italic_u - ( italic_q - 1 ) italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_u ) .

Substituting u~=uθ+α+qw~𝑢𝑢𝜃𝛼𝑞𝑤\widetilde{u}=u-\theta+\alpha+qwover~ start_ARG italic_u end_ARG = italic_u - italic_θ + italic_α + italic_q italic_w, we obtain

(L1(θ)𝖠1,,Li(θ)𝖠i,R1(θ)𝖡1,,Rj(θ)𝖡j,Lq+1(θ)<α<Lq(θ))formulae-sequencesuperscriptsubscript𝐿1𝜃subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscript𝖡1formulae-sequencesuperscriptsubscript𝑅𝑗𝜃subscript𝖡𝑗superscriptsubscript𝐿𝑞1𝜃𝛼superscriptsubscript𝐿𝑞𝜃\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\mathsf{A}_{1},\ldots,L_{i}^{(% \theta)}\in\mathsf{A}_{i},R_{1}^{(\theta)}\in\mathsf{B}_{1},\ldots,R_{j}^{(% \theta)}\in\mathsf{B}_{j},L_{q+1}^{(\theta)}<\alpha<L_{q}^{(\theta)}\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=1w𝟙(0,w)(u~+θαqw)k=1i𝟙𝖠k(αu~(kq1)w)absent1𝑤subscriptsubscript10𝑤~𝑢𝜃𝛼𝑞𝑤superscriptsubscriptproduct𝑘1𝑖subscript1subscript𝖠𝑘𝛼~𝑢𝑘𝑞1𝑤\displaystyle\qquad=\frac{1}{w}\int_{\mathbb{R}}\mathbbm{1}_{(0,w)}(\widetilde% {u}+\theta-\alpha-qw)\prod_{k=1}^{i}\mathbbm{1}_{\mathsf{A}_{k}}\big{(}\alpha-% \widetilde{u}-(k-q-1)w\big{)}= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT ( 0 , italic_w ) end_POSTSUBSCRIPT ( over~ start_ARG italic_u end_ARG + italic_θ - italic_α - italic_q italic_w ) ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG - ( italic_k - italic_q - 1 ) italic_w )
l=1j𝟙𝖡l(αu~+(l+q)w)𝟙(,α)(αu~)𝟙(α,)(αu~+w)Leb1(du~)\displaystyle\qquad\qquad\cdot\prod_{l=1}^{j}\mathbbm{1}_{\mathsf{B}_{l}}\big{% (}\alpha-\widetilde{u}+(l+q)w\big{)}\cdot\mathbbm{1}_{(-\infty,\alpha)}(\alpha% -\widetilde{u})\mathbbm{1}_{(\alpha,\infty)}\big{(}\alpha-\widetilde{u}+w\big{% )}\ \mathrm{Leb}_{1}({\rm d}\widetilde{u})⋅ ∏ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG + ( italic_l + italic_q ) italic_w ) ⋅ blackboard_1 start_POSTSUBSCRIPT ( - ∞ , italic_α ) end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG ) blackboard_1 start_POSTSUBSCRIPT ( italic_α , ∞ ) end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG + italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d over~ start_ARG italic_u end_ARG )
=1w𝟙(,θ)(αu~+qw)𝟙(θ,)(αu~+(q+1)w)k=1iq𝟙𝖠k+q(αu~(k1)w)absent1𝑤subscriptsubscript1𝜃𝛼~𝑢𝑞𝑤subscript1𝜃𝛼~𝑢𝑞1𝑤superscriptsubscriptproductsuperscript𝑘1𝑖𝑞subscript1subscript𝖠superscript𝑘𝑞𝛼~𝑢superscript𝑘1𝑤\displaystyle\qquad=\frac{1}{w}\int_{\mathbb{R}}\mathbbm{1}_{(-\infty,\theta)}% (\alpha-\widetilde{u}+qw)\mathbbm{1}_{(\theta,\infty)}\big{(}\alpha-\widetilde% {u}+(q+1)w\big{)}\prod_{k^{\prime}=1}^{i-q}\mathbbm{1}_{\mathsf{A}_{k^{\prime}% +q}}\big{(}\alpha-\widetilde{u}-(k^{\prime}-1)w\big{)}= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT ( - ∞ , italic_θ ) end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG + italic_q italic_w ) blackboard_1 start_POSTSUBSCRIPT ( italic_θ , ∞ ) end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG + ( italic_q + 1 ) italic_w ) ∏ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - italic_q end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG - ( italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - 1 ) italic_w )
l=1q𝟙𝖠q+1l(αu~+lw)l=q+1j+q𝟙𝖡lq(αu~+lw)𝟙(0,w)(u~)Leb1(du~).superscriptsubscriptproductsuperscript𝑙1𝑞subscript1subscript𝖠𝑞1superscript𝑙𝛼~𝑢superscript𝑙𝑤superscriptsubscriptproductsuperscript𝑙𝑞1𝑗𝑞subscript1subscript𝖡superscript𝑙𝑞𝛼~𝑢superscript𝑙𝑤subscript10𝑤~𝑢subscriptLeb1d~𝑢\displaystyle\qquad\qquad\prod_{l^{\prime}=1}^{q}\mathbbm{1}_{\mathsf{A}_{q+1-% l^{\prime}}}\big{(}\alpha-\widetilde{u}+l^{\prime}w\big{)}\cdot\prod_{l^{% \prime}=q+1}^{j+q}\mathbbm{1}_{\mathsf{B}_{l^{\prime}-q}}\big{(}\alpha-% \widetilde{u}+l^{\prime}w\big{)}\cdot\mathbbm{1}_{(0,w)}(\widetilde{u})\ % \mathrm{Leb}_{1}({\rm d}\widetilde{u}).∏ start_POSTSUBSCRIPT italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_q + 1 - italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG + italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_w ) ⋅ ∏ start_POSTSUBSCRIPT italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j + italic_q end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_B start_POSTSUBSCRIPT italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - over~ start_ARG italic_u end_ARG + italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_w ) ⋅ blackboard_1 start_POSTSUBSCRIPT ( 0 , italic_w ) end_POSTSUBSCRIPT ( over~ start_ARG italic_u end_ARG ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d over~ start_ARG italic_u end_ARG ) .

Applying again Lemma 18, we get

(L1(θ)𝖠1,,Li(θ)𝖠i,R1(θ)𝖡1,,Rj(θ)𝖡j,Lq+1(θ)<α<Lq(θ))formulae-sequencesuperscriptsubscript𝐿1𝜃subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscript𝖡1formulae-sequencesuperscriptsubscript𝑅𝑗𝜃subscript𝖡𝑗superscriptsubscript𝐿𝑞1𝜃𝛼superscriptsubscript𝐿𝑞𝜃\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\mathsf{A}_{1},\ldots,L_{i}^{(% \theta)}\in\mathsf{A}_{i},R_{1}^{(\theta)}\in\mathsf{B}_{1},\ldots,R_{j}^{(% \theta)}\in\mathsf{B}_{j},L_{q+1}^{(\theta)}<\alpha<L_{q}^{(\theta)}\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=(Rq(α)𝖠1,,R1(α)𝖠q,L1(α)𝖠q+1,,Liq(α)𝖠i,\displaystyle\qquad=\mathbb{P}\left(R_{q}^{(\alpha)}\in\mathsf{A}_{1},\ldots,R% _{1}^{(\alpha)}\in\mathsf{A}_{q},L_{1}^{(\alpha)}\in\mathsf{A}_{q+1},\ldots,L_% {i-q}^{(\alpha)}\in\mathsf{A}_{i},\right.= blackboard_P ( italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ,
Rq+1(α)𝖡1,,Rj+q(α)𝖡j,Rq(α)<θ<Rq+1(α)).\displaystyle\qquad\qquad\qquad\left.R_{q+1}^{(\alpha)}\in\mathsf{B}_{1},% \ldots,R_{j+q}^{(\alpha)}\in\mathsf{B}_{j},R_{q}^{(\alpha)}<\theta<R_{q+1}^{(% \alpha)}\right).italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j + italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ) .

To 2: Let i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N, q{0,,i1}𝑞0𝑖1q\in\{0,\ldots,i-1\}italic_q ∈ { 0 , … , italic_i - 1 } and 𝖠,𝖡()𝖠𝖡\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ). If i+j>m𝑖𝑗𝑚i+j>mitalic_i + italic_j > italic_m, the statement is true by (22), which implies that both probabilities are zero. We now consider i+jm𝑖𝑗𝑚i+j\leqslant mitalic_i + italic_j ⩽ italic_m. To this end, for all n,n~𝑛~𝑛n,\widetilde{n}\in\mathbb{N}italic_n , over~ start_ARG italic_n end_ARG ∈ blackboard_N with n+n~m𝑛~𝑛𝑚n+\widetilde{n}\leqslant mitalic_n + over~ start_ARG italic_n end_ARG ⩽ italic_m and ι{n,,m+1n~}ι𝑛𝑚1~𝑛\upiota\in\{n,\ldots,m+1-\widetilde{n}\}roman_ι ∈ { italic_n , … , italic_m + 1 - over~ start_ARG italic_n end_ARG } set

𝖠kι,n,n~=𝖲,k{1,,n1}formulae-sequencesuperscriptsubscript𝖠𝑘ι𝑛~𝑛𝖲𝑘1𝑛1\displaystyle\mathsf{A}_{k}^{\upiota,n,\widetilde{n}}=\mathsf{S},\quad k\in\{1% ,\ldots,n-1\}\qquadsansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT = sansserif_S , italic_k ∈ { 1 , … , italic_n - 1 } and𝖠nι,n,n~={𝖠,ι=n𝖠(𝖲),ι>n,andsuperscriptsubscript𝖠𝑛ι𝑛~𝑛cases𝖠ι𝑛𝖠𝖲ι𝑛\displaystyle\text{and}\qquad\mathsf{A}_{n}^{\upiota,n,\widetilde{n}}=\begin{% dcases}\mathsf{A},&\upiota=n\\ \mathsf{A}\cap(\mathbb{R}\setminus\mathsf{S}),&\upiota>n,\end{dcases}and sansserif_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT = { start_ROW start_CELL sansserif_A , end_CELL start_CELL roman_ι = italic_n end_CELL end_ROW start_ROW start_CELL sansserif_A ∩ ( blackboard_R ∖ sansserif_S ) , end_CELL start_CELL roman_ι > italic_n , end_CELL end_ROW
𝖡lι,n,n~=𝖲,l{1,,n~1}formulae-sequencesuperscriptsubscript𝖡𝑙ι𝑛~𝑛𝖲𝑙1~𝑛1\displaystyle\mathsf{B}_{l}^{\upiota,n,\widetilde{n}}=\mathsf{S},\quad l\in\{1% ,\ldots,\widetilde{n}-1\}\qquadsansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT = sansserif_S , italic_l ∈ { 1 , … , over~ start_ARG italic_n end_ARG - 1 } and𝖡n~ι,n,n~={𝖡,ι=m+1n~𝖡(𝖲),ι<m+1n~.andsuperscriptsubscript𝖡~𝑛ι𝑛~𝑛cases𝖡ι𝑚1~𝑛𝖡𝖲ι𝑚1~𝑛\displaystyle\text{and}\qquad\mathsf{B}_{\widetilde{n}}^{\upiota,n,\widetilde{% n}}=\begin{dcases}\mathsf{B},&\upiota=m+1-\widetilde{n}\\ \mathsf{B}\cap(\mathbb{R}\setminus\mathsf{S}),&\upiota<m+1-\widetilde{n}.\end{dcases}and sansserif_B start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT = { start_ROW start_CELL sansserif_B , end_CELL start_CELL roman_ι = italic_m + 1 - over~ start_ARG italic_n end_ARG end_CELL end_ROW start_ROW start_CELL sansserif_B ∩ ( blackboard_R ∖ sansserif_S ) , end_CELL start_CELL roman_ι < italic_m + 1 - over~ start_ARG italic_n end_ARG . end_CELL end_ROW

Then for q{0,,n1}𝑞0𝑛1q\in\{0,\ldots,n-1\}italic_q ∈ { 0 , … , italic_n - 1 } holds

𝖠kιq,nq,n~+q=𝖠k+qι,n,n~,k{1,,nq},𝖡lιq,nq,n~+q={𝖠ql+1ι,n,n~,lq,𝖡lqι,n,n~,l>q,l{1,,n~+q}.\begin{split}\mathsf{A}_{k}^{\upiota-q,n-q,\widetilde{n}+q}&=\mathsf{A}_{k+q}^% {\upiota,n,\widetilde{n}},\qquad\forall\ k\in\{1,\ldots,n-q\},\\ \mathsf{B}_{l}^{\upiota-q,n-q,\widetilde{n}+q}&=\begin{dcases}\mathsf{A}_{q-l+% 1}^{\upiota,n,\widetilde{n}},&l\leqslant q,\\ \mathsf{B}_{l-q}^{\upiota,n,\widetilde{n}},&l>q,\qquad\forall\ l\in\{1,\ldots,% \widetilde{n}+q\}.\end{dcases}\end{split}start_ROW start_CELL sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι - italic_q , italic_n - italic_q , over~ start_ARG italic_n end_ARG + italic_q end_POSTSUPERSCRIPT end_CELL start_CELL = sansserif_A start_POSTSUBSCRIPT italic_k + italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , ∀ italic_k ∈ { 1 , … , italic_n - italic_q } , end_CELL end_ROW start_ROW start_CELL sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι - italic_q , italic_n - italic_q , over~ start_ARG italic_n end_ARG + italic_q end_POSTSUPERSCRIPT end_CELL start_CELL = { start_ROW start_CELL sansserif_A start_POSTSUBSCRIPT italic_q - italic_l + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , end_CELL start_CELL italic_l ⩽ italic_q , end_CELL end_ROW start_ROW start_CELL sansserif_B start_POSTSUBSCRIPT italic_l - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , end_CELL start_CELL italic_l > italic_q , ∀ italic_l ∈ { 1 , … , over~ start_ARG italic_n end_ARG + italic_q } . end_CELL end_ROW end_CELL end_ROW (28)

We can use these sets to express events of the form {𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,τ𝖲(θ)=n,𝔗𝖲(θ)=n~}formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡formulae-sequencesuperscriptsubscript𝜏𝖲𝜃𝑛superscriptsubscript𝔗𝖲𝜃~𝑛\{\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{% S}}^{(\theta)}\in\mathsf{B},\tau_{\mathsf{S}}^{(\theta)}=n,\mathfrak{T}_{% \mathsf{S}}^{(\theta)}=\widetilde{n}\}{ bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_n , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = over~ start_ARG italic_n end_ARG }. Namely, observe that by (22) we have τ𝖲(θ)Jm+1𝔗𝖲(θ)superscriptsubscript𝜏𝖲𝜃𝐽𝑚1superscriptsubscript𝔗𝖲𝜃\tau_{\mathsf{S}}^{(\theta)}\leqslant J\leqslant m+1-\mathfrak{T}_{\mathsf{S}}% ^{(\theta)}italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ⩽ italic_J ⩽ italic_m + 1 - fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT. Together with the independence of J𝐽Jitalic_J from (Lk(θ))ksubscriptsuperscriptsubscript𝐿𝑘𝜃𝑘(L_{k}^{(\theta)})_{k\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ blackboard_N end_POSTSUBSCRIPT and (Rl(θ))lsubscriptsuperscriptsubscript𝑅𝑙𝜃𝑙(R_{l}^{(\theta)})_{l\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_l ∈ blackboard_N end_POSTSUBSCRIPT, we get

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,τ𝖲(θ)=n,𝔗𝖲(θ)=n~)=ι=nm+1n~(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,τ𝖲(θ)=n,𝔗𝖲(θ)=n~,J=ι)=ι=nm+1n~(L1(θ)𝖠1ι,n,n~,,Ln(θ)𝖠nι,n,n~,R1(θ)𝖡1ι,n,n~,,Rn~(θ)𝖡n~ι,n,n~,J=ι)=1mι=nm+1n~(L1(θ)𝖠1ι,n,n~,,Ln(θ)𝖠nι,n,n~,R1(θ)𝖡1ι,n,n~,,Rn~(θ)𝖡n~ι,n,n~).formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡formulae-sequencesuperscriptsubscript𝜏𝖲𝜃𝑛superscriptsubscript𝔗𝖲𝜃~𝑛superscriptsubscriptι𝑛𝑚1~𝑛formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡formulae-sequencesuperscriptsubscript𝜏𝖲𝜃𝑛formulae-sequencesuperscriptsubscript𝔗𝖲𝜃~𝑛𝐽ιsuperscriptsubscriptι𝑛𝑚1~𝑛formulae-sequencesuperscriptsubscript𝐿1𝜃superscriptsubscript𝖠1ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝐿𝑛𝜃superscriptsubscript𝖠𝑛ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝑅1𝜃superscriptsubscript𝖡1ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝑅~𝑛𝜃superscriptsubscript𝖡~𝑛ι𝑛~𝑛𝐽ι1𝑚superscriptsubscriptι𝑛𝑚1~𝑛formulae-sequencesuperscriptsubscript𝐿1𝜃superscriptsubscript𝖠1ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝐿𝑛𝜃superscriptsubscript𝖠𝑛ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝑅1𝜃superscriptsubscript𝖡1ι𝑛~𝑛superscriptsubscript𝑅~𝑛𝜃superscriptsubscript𝖡~𝑛ι𝑛~𝑛\begin{split}&\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf% {A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\tau_{\mathsf{S}}^{(% \theta)}=n,\mathfrak{T}_{\mathsf{S}}^{(\theta)}=\widetilde{n}\right)\\ &\quad=\sum_{\upiota=n}^{m+1-\widetilde{n}}\mathbb{P}\left(\boldsymbol{L}_{% \mathsf{S}}^{(\theta)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in% \mathsf{B},\tau_{\mathsf{S}}^{(\theta)}=n,\mathfrak{T}_{\mathsf{S}}^{(\theta)}% =\widetilde{n},J=\upiota\right)\\ &\quad=\sum_{\upiota=n}^{m+1-\widetilde{n}}\mathbb{P}\left(L_{1}^{(\theta)}\in% \mathsf{A}_{1}^{\upiota,n,\widetilde{n}},\ldots,L_{n}^{(\theta)}\in\mathsf{A}_% {n}^{\upiota,n,\widetilde{n}},R_{1}^{(\theta)}\in\mathsf{B}_{1}^{\upiota,n,% \widetilde{n}},\ldots,R_{\widetilde{n}}^{(\theta)}\in\mathsf{B}_{\widetilde{n}% }^{\upiota,n,\widetilde{n}},J=\upiota\right)\\ &\quad=\frac{1}{m}\sum_{\upiota=n}^{m+1-\widetilde{n}}\mathbb{P}\left(L_{1}^{(% \theta)}\in\mathsf{A}_{1}^{\upiota,n,\widetilde{n}},\ldots,L_{n}^{(\theta)}\in% \mathsf{A}_{n}^{\upiota,n,\widetilde{n}},R_{1}^{(\theta)}\in\mathsf{B}_{1}^{% \upiota,n,\widetilde{n}},\ldots,R_{\widetilde{n}}^{(\theta)}\in\mathsf{B}_{% \widetilde{n}}^{\upiota,n,\widetilde{n}}\right).\end{split}start_ROW start_CELL end_CELL start_CELL blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_n , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = over~ start_ARG italic_n end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = ∑ start_POSTSUBSCRIPT roman_ι = italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_n , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = over~ start_ARG italic_n end_ARG , italic_J = roman_ι ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = ∑ start_POSTSUBSCRIPT roman_ι = italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , italic_J = roman_ι ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ι = italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT ) . end_CELL end_ROW (29)

If we use this for n=i𝑛𝑖n=iitalic_n = italic_i and n~=j~𝑛𝑗\widetilde{n}=jover~ start_ARG italic_n end_ARG = italic_j, we get

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,Lq+1(θ)<α<Lq(θ),τ𝖲(θ)=i,𝔗𝖲(θ)=j)\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},L_{q+1}^{(\theta)}<% \alpha<L_{q}^{(\theta)},\tau_{\mathsf{S}}^{(\theta)}=i,\mathfrak{T}_{\mathsf{S% }}^{(\theta)}=j\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j )
=1mι=im+1j(L1(θ)𝖠1ι,i,j,,Li(θ)𝖠iι,i,j,R1(θ)𝖡1ι,i,j,,Rj(θ)𝖡jι,i,j,Lq+1(θ)<α<Lq(θ)).absent1𝑚superscriptsubscriptι𝑖𝑚1𝑗formulae-sequencesuperscriptsubscript𝐿1𝜃superscriptsubscript𝖠1ι𝑖𝑗formulae-sequencesuperscriptsubscript𝐿𝑖𝜃superscriptsubscript𝖠𝑖ι𝑖𝑗formulae-sequencesuperscriptsubscript𝑅1𝜃superscriptsubscript𝖡1ι𝑖𝑗formulae-sequencesuperscriptsubscript𝑅𝑗𝜃superscriptsubscript𝖡𝑗ι𝑖𝑗superscriptsubscript𝐿𝑞1𝜃𝛼superscriptsubscript𝐿𝑞𝜃\displaystyle\quad=\frac{1}{m}\sum_{\upiota=i}^{m+1-j}\mathbb{P}\left(L_{1}^{(% \theta)}\in\mathsf{A}_{1}^{\upiota,i,j},\ldots,L_{i}^{(\theta)}\in\mathsf{A}_{% i}^{\upiota,i,j},R_{1}^{(\theta)}\in\mathsf{B}_{1}^{\upiota,i,j},\ldots,R_{j}^% {(\theta)}\in\mathsf{B}_{j}^{\upiota,i,j},L_{q+1}^{(\theta)}<\alpha<L_{q}^{(% \theta)}\right).= divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ι = italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) .

Applying statement 1 and then (28) yields

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,Lq+1(θ)<α<Lq(θ),τ𝖲(θ)=i,𝔗𝖲(θ)=j)\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},L_{q+1}^{(\theta)}<% \alpha<L_{q}^{(\theta)},\tau_{\mathsf{S}}^{(\theta)}=i,\mathfrak{T}_{\mathsf{S% }}^{(\theta)}=j\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j )
=1mι=im+1j(Rq(α)𝖠1ι,i,j,,R1(α)𝖠qι,i,j,L1(α)𝖠q+1ι,i,j,,Liq(α)𝖠iι,i,j,\displaystyle\quad=\frac{1}{m}\sum_{\upiota=i}^{m+1-j}\mathbb{P}\left(R_{q}^{(% \alpha)}\in\mathsf{A}_{1}^{\upiota,i,j},\ldots,R_{1}^{(\alpha)}\in\mathsf{A}_{% q}^{\upiota,i,j},L_{1}^{(\alpha)}\in\mathsf{A}_{q+1}^{\upiota,i,j},\ldots,L_{i% -q}^{(\alpha)}\in\mathsf{A}_{i}^{\upiota,i,j},\right.= divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ι = italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT ,
Rq+1(α)𝖡1ι,i,j,,Rj+q(α)𝖡jι,i,j,Rq(α)<θ<Rq+1(α))\displaystyle\quad\qquad\qquad\qquad\qquad\left.R_{q+1}^{(\alpha)}\in\mathsf{B% }_{1}^{\upiota,i,j},\ldots,R_{j+q}^{(\alpha)}\in\mathsf{B}_{j}^{\upiota,i,j},R% _{q}^{(\alpha)}<\theta<R_{q+1}^{(\alpha)}\right)italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j + italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_i , italic_j end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT )
=1mι=im+1j(L1(α)𝖠1ιq,iq,j+q,,Liq(α)𝖠iqιq,iq,j+q,\displaystyle\quad=\frac{1}{m}\sum_{\upiota=i}^{m+1-j}\mathbb{P}\left(L_{1}^{(% \alpha)}\in\mathsf{A}_{1}^{\upiota-q,i-q,j+q},\ldots,L_{i-q}^{(\alpha)}\in% \mathsf{A}_{i-q}^{\upiota-q,i-q,j+q},\right.= divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ι = italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι - italic_q , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι - italic_q , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT ,
R1(α)𝖡1ιq,iq,j+q,,Rj+q(α)𝖡jιq,iq,j+q,Rq(α)<θ<Rq+1(α)).\displaystyle\quad\qquad\qquad\qquad\qquad\left.R^{(\alpha)}_{1}\in\mathsf{B}_% {1}^{\upiota-q,i-q,j+q},\ldots,R_{j+q}^{(\alpha)}\in\mathsf{B}_{j}^{\upiota-q,% i-q,j+q},R_{q}^{(\alpha)}<\theta<R_{q+1}^{(\alpha)}\right).italic_R start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι - italic_q , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j + italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι - italic_q , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ) .

Then doing an index shift and using (29) for n=iq𝑛𝑖𝑞n=i-qitalic_n = italic_i - italic_q and n~=j+q~𝑛𝑗𝑞\widetilde{n}=j+qover~ start_ARG italic_n end_ARG = italic_j + italic_q, we obtain

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,Lq+1(θ)<α<Lq(θ),τ𝖲(θ)=i,𝔗𝖲(θ)=j)\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},L_{q+1}^{(\theta)}<% \alpha<L_{q}^{(\theta)},\tau_{\mathsf{S}}^{(\theta)}=i,\mathfrak{T}_{\mathsf{S% }}^{(\theta)}=j\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j )
=1mι=iqm+1(j+q)(L1(α)𝖠1ι,iq,j+q,,Liq(α)𝖠iqι,iq,j+q,\displaystyle\quad=\frac{1}{m}\sum_{\upiota^{\prime}=i-q}^{m+1-(j+q)}\mathbb{P% }\left(L_{1}^{(\alpha)}\in\mathsf{A}_{1}^{\upiota^{\prime},i-q,j+q},\ldots,L_{% i-q}^{(\alpha)}\in\mathsf{A}_{i-q}^{\upiota^{\prime},i-q,j+q},\right.= divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ι start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - ( italic_j + italic_q ) end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i - italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT ,
R1(α)𝖡1ι,iq,j+q,,Rj+q(α)𝖡jι,iq,j+q,Rq(α)<θ<Rq+1(α))\displaystyle\quad\qquad\qquad\qquad\qquad\left.R^{(\alpha)}_{1}\in\mathsf{B}_% {1}^{\upiota^{\prime},i-q,j+q},\ldots,R_{j+q}^{(\alpha)}\in\mathsf{B}_{j}^{% \upiota^{\prime},i-q,j+q},R_{q}^{(\alpha)}<\theta<R_{q+1}^{(\alpha)}\right)italic_R start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j + italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_i - italic_q , italic_j + italic_q end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT )
=(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Rq(α)<θ<Rq+1(α),τ𝖲(α)=iq,𝔗𝖲(α)=j+q).\displaystyle\quad=\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in% \mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},R_{q}^{(\alpha)% }<\theta<R_{q+1}^{(\alpha)},\tau_{\mathsf{S}}^{(\alpha)}=i-q,\mathfrak{T}_{% \mathsf{S}}^{(\alpha)}=j+q\right).= blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i - italic_q , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_j + italic_q ) .

To 3: Let 𝖠,𝖡()𝖠𝖡\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ). Using (22), we obtain

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=i=1mj=1m+1i(Li(θ)𝖠,Rj(θ)𝖡,Li(θ)<α<Rj(θ),τ𝖲(θ)=i,𝔗𝖲(θ)=j).\displaystyle\qquad=\sum_{i=1}^{m}\sum_{j=1}^{m+1-i}\mathbb{P}\left(L_{i}^{(% \theta)}\in\mathsf{A},R_{j}^{(\theta)}\in\mathsf{B},L_{i}^{(\theta)}<\alpha<R_% {j}^{(\theta)},\tau_{\mathsf{S}}^{(\theta)}=i,\mathfrak{T}_{\mathsf{S}}^{(% \theta)}=j\right).= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_i end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j ) .

By virtue of Lemma 19.1 and 19.4 this yields

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=i=1mj=1m+1i[q=0i1(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,Lq+1(θ)<α<Lq(θ),τ𝖲(θ)=i,𝔗𝖲(θ)=j)\displaystyle\quad=\sum_{i=1}^{m}\sum_{j=1}^{m+1-i}\left[\sum_{q=0}^{i-1}% \mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{A},% \boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},L_{q+1}^{(\theta)}<\alpha<% L_{q}^{(\theta)},\tau_{\mathsf{S}}^{(\theta)}=i,\mathfrak{T}_{\mathsf{S}}^{(% \theta)}=j\right)\right.= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_i end_POSTSUPERSCRIPT [ ∑ start_POSTSUBSCRIPT italic_q = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j )
+q=1j1(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,Rq(θ)<α<Rq+1(θ),τ𝖲(θ)=i,𝔗𝖲(θ)=j)].\displaystyle\qquad\qquad\quad+\left.\sum_{q=1}^{j-1}\mathbb{P}\left(% \boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}% }^{(\theta)}\in\mathsf{B},R_{q}^{(\theta)}<\alpha<R_{q+1}^{(\theta)},\tau_{% \mathsf{S}}^{(\theta)}=i,\mathfrak{T}_{\mathsf{S}}^{(\theta)}=j\right)\right].+ ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j ) ] .

We apply statement 2 in the first sum straight forwardly. In the second sum we apply statement 2 with i+q𝑖𝑞i+qitalic_i + italic_q and jq𝑗𝑞j-qitalic_j - italic_q and reversed roles of θ𝜃\thetaitalic_θ and α𝛼\alphaitalic_α. Relocating the summand for q=0𝑞0q=0italic_q = 0 from the first to the second sum, we obtain

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=i=1mj=1m+1i[q=1i1(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Rq(α)<θ<Rq+1(α),τ𝖲(α)=iq,𝔗𝖲(α)=j+q)\displaystyle\quad=\sum_{i=1}^{m}\sum_{j=1}^{m+1-i}\left[\sum_{q=1}^{i-1}% \mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},% \boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},R_{q}^{(\alpha)}<\theta<R_% {q+1}^{(\alpha)},\tau_{\mathsf{S}}^{(\alpha)}=i-q,\mathfrak{T}_{\mathsf{S}}^{(% \alpha)}=j+q\right)\right.= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_i end_POSTSUPERSCRIPT [ ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i - italic_q , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_j + italic_q )
+q=0j1(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Lq+1(α)<θ<Lq(α),τ𝖲(α)=i+q,𝔗𝖲(α)=jq)]\displaystyle\qquad\qquad\quad+\left.\sum_{q=0}^{j-1}\mathbb{P}\left(% \boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}% }^{(\alpha)}\in\mathsf{B},L_{q+1}^{(\alpha)}<\theta<L_{q}^{(\alpha)},\tau_{% \mathsf{S}}^{(\alpha)}=i+q,\mathfrak{T}_{\mathsf{S}}^{(\alpha)}=j-q\right)\right]+ ∑ start_POSTSUBSCRIPT italic_q = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_q + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_L start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_q , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_j - italic_q ) ]
=i=1mj=1m+1i[q=1i1(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,R𝔗𝖲(α)j(α)<θ<R𝔗𝖲(α)j+1(α),𝔗𝖲(α)=j+q,\displaystyle\quad=\sum_{i=1}^{m}\sum_{j=1}^{m+1-i}\left[\sum_{q=1}^{i-1}% \mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},% \boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},R_{\mathfrak{T}_{\mathsf{S% }}^{(\alpha)}-j}^{(\alpha)}<\theta<R_{\mathfrak{T}_{\mathsf{S}}^{(\alpha)}-j+1% }^{(\alpha)},\mathfrak{T}_{\mathsf{S}}^{(\alpha)}=j+q,\right.\right.= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_i end_POSTSUPERSCRIPT [ ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_j + italic_q ,
τ𝖲(α)+𝔗𝖲(α)=i+j)\displaystyle\quad\hskip 284.52756pt\tau_{\mathsf{S}}^{(\alpha)}+\mathfrak{T}_% {\mathsf{S}}^{(\alpha)}=i+j\bigg{)}italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT + fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_j )
+q=0j1(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Lτ𝖲(α)i+1(α)<θ<Lτ𝖲(α)i(α),τ𝖲(α)=i+q,\displaystyle\quad\qquad\qquad+\left.\sum_{q=0}^{j-1}\mathbb{P}\left(% \boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}% }^{(\alpha)}\in\mathsf{B},L_{\tau_{\mathsf{S}}^{(\alpha)}-i+1}^{(\alpha)}<% \theta<L_{\tau_{\mathsf{S}}^{(\alpha)}-i}^{(\alpha)},\tau_{\mathsf{S}}^{(% \alpha)}=i+q,\right.\right.+ ∑ start_POSTSUBSCRIPT italic_q = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_q ,
τ𝖲(α)+𝔗𝖲(α)=i+j)].\displaystyle\quad\hskip 284.52756pt\tau_{\mathsf{S}}^{(\alpha)}+\mathfrak{T}_% {\mathsf{S}}^{(\alpha)}=i+j\bigg{)}\Bigg{]}.italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT + fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_j ) ] .

Then the lower bounds in (22) imply

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=i=1mj=1m+1i[(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,R𝔗𝖲(α)j(α)<θ<R𝔗𝖲(α)j+1(α),𝔗𝖲(α)j+1,\displaystyle\quad=\sum_{i=1}^{m}\sum_{j=1}^{m+1-i}\left[\mathbb{P}\left(% \boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}% }^{(\alpha)}\in\mathsf{B},R_{\mathfrak{T}_{\mathsf{S}}^{(\alpha)}-j}^{(\alpha)% }<\theta<R_{\mathfrak{T}_{\mathsf{S}}^{(\alpha)}-j+1}^{(\alpha)},\mathfrak{T}_% {\mathsf{S}}^{(\alpha)}\geqslant j+1\right.\right.,= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_i end_POSTSUPERSCRIPT [ blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ⩾ italic_j + 1 ,
τ𝖲(α)+𝔗𝖲(α)=i+j)\displaystyle\quad\hskip 284.52756pt\tau_{\mathsf{S}}^{(\alpha)}+\mathfrak{T}_% {\mathsf{S}}^{(\alpha)}=i+j\bigg{)}italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT + fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_j )
+(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Lτ𝖲(α)i+1(α)<θ<Lτ𝖲(α)i(α),τ𝖲(α)i,τ𝖲(α)+𝔗𝖲(α)=i+j)].\displaystyle\qquad\qquad\quad+\left.\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S% }}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B}% ,L_{\tau_{\mathsf{S}}^{(\alpha)}-i+1}^{(\alpha)}<\theta<L_{\tau_{\mathsf{S}}^{% (\alpha)}-i}^{(\alpha)},\tau_{\mathsf{S}}^{(\alpha)}\geqslant i,\tau_{\mathsf{% S}}^{(\alpha)}+\mathfrak{T}_{\mathsf{S}}^{(\alpha)}=i+j\right)\right].+ blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ⩾ italic_i , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT + fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_j ) ] .

Changing the order of summation, we obtain

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=j=1mi=1m+1j(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,R𝔗𝖲(α)j(α)<θ<R𝔗𝖲(α)j+1(α),𝔗𝖲(α)j+1,\displaystyle\quad=\sum_{j=1}^{m}\sum_{i=1}^{m+1-j}\mathbb{P}\left(\boldsymbol% {L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)% }\in\mathsf{B},R_{\mathfrak{T}_{\mathsf{S}}^{(\alpha)}-j}^{(\alpha)}<\theta<R_% {\mathfrak{T}_{\mathsf{S}}^{(\alpha)}-j+1}^{(\alpha)},\mathfrak{T}_{\mathsf{S}% }^{(\alpha)}\geqslant j+1,\right.= ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ⩾ italic_j + 1 ,
τ𝖲(α)+𝔗𝖲(α)=i+j)\displaystyle\quad\hskip 284.52756pt\tau_{\mathsf{S}}^{(\alpha)}+\mathfrak{T}_% {\mathsf{S}}^{(\alpha)}=i+j\bigg{)}italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT + fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_j )
+i=1mj=1m+1i(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Lτ𝖲(α)i+1(α)<θ<Lτ𝖲(α)i(α),τ𝖲(α)i,\displaystyle\quad\qquad+\sum_{i=1}^{m}\sum_{j=1}^{m+1-i}\mathbb{P}\left(% \boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}% }^{(\alpha)}\in\mathsf{B},L_{\tau_{\mathsf{S}}^{(\alpha)}-i+1}^{(\alpha)}<% \theta<L_{\tau_{\mathsf{S}}^{(\alpha)}-i}^{(\alpha)},\tau_{\mathsf{S}}^{(% \alpha)}\geqslant i,\right.+ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_i end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ⩾ italic_i ,
τ𝖲(α)+𝔗𝖲(α)=i+j).\displaystyle\quad\hskip 284.52756pt\tau_{\mathsf{S}}^{(\alpha)}+\mathfrak{T}_% {\mathsf{S}}^{(\alpha)}=i+j\bigg{)}.italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT + fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT = italic_i + italic_j ) .

Together with the upper bound in (22) this yields

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=j=1m(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,R𝔗𝖲(α)j(α)<θ<R𝔗𝖲(α)j+1(α),𝔗𝖲(α)j+1)\displaystyle\quad=\sum_{j=1}^{m}\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{% (\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},R_{% \mathfrak{T}_{\mathsf{S}}^{(\alpha)}-j}^{(\alpha)}<\theta<R_{\mathfrak{T}_{% \mathsf{S}}^{(\alpha)}-j+1}^{(\alpha)},\mathfrak{T}_{\mathsf{S}}^{(\alpha)}% \geqslant j+1\right)= ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_R start_POSTSUBSCRIPT fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ⩾ italic_j + 1 )
+i=1m(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,Lτ𝖲(α)i+1(α)<θ<Lτ𝖲(α)i(α),τ𝖲(α)i).\displaystyle\quad\qquad+\sum_{i=1}^{m}\mathbb{P}\left(\boldsymbol{L}_{\mathsf% {S}}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{% B},L_{\tau_{\mathsf{S}}^{(\alpha)}-i+1}^{(\alpha)}<\theta<L_{\tau_{\mathsf{S}}% ^{(\alpha)}-i}^{(\alpha)},\tau_{\mathsf{S}}^{(\alpha)}\geqslant i\right).+ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_L start_POSTSUBSCRIPT italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT - italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ⩾ italic_i ) .

By virtue of Lemma 19.2 to 19.4 we get

(𝑳𝖲(θ)𝖠,𝑹𝖲(θ)𝖡,𝑳𝖲(θ)<α<𝑹𝖲(θ))formulae-sequencesuperscriptsubscript𝑳𝖲𝜃𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝜃𝖡superscriptsubscript𝑳𝖲𝜃𝛼superscriptsubscript𝑹𝖲𝜃\displaystyle\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\theta)}\in\mathsf{% A},\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\in\mathsf{B},\boldsymbol{L}_{\mathsf% {S}}^{(\theta)}<\alpha<\boldsymbol{R}_{\mathsf{S}}^{(\theta)}\right)blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT < italic_α < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT )
=(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,R1(α)<θ<𝑹𝖲(α))+(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,𝑳𝖲(α)<θ<L0(α))absentformulae-sequencesuperscriptsubscript𝑳𝖲𝛼𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝛼𝖡superscriptsubscript𝑅1𝛼𝜃superscriptsubscript𝑹𝖲𝛼formulae-sequencesuperscriptsubscript𝑳𝖲𝛼𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝛼𝖡superscriptsubscript𝑳𝖲𝛼𝜃superscriptsubscript𝐿0𝛼\displaystyle\qquad=\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in% \mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},R_{1}^{(\alpha)% }<\theta<\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\right)+\mathbb{P}\left(% \boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in\mathsf{A},\boldsymbol{R}_{\mathsf{S}% }^{(\alpha)}\in\mathsf{B},\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}<\theta<L_{0}^% {(\alpha)}\right)= blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ) + blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < italic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT )
=(𝑳𝖲(α)𝖠,𝑹𝖲(α)𝖡,𝑳𝖲(α)<θ<𝑹𝖲(α)).absentformulae-sequencesuperscriptsubscript𝑳𝖲𝛼𝖠formulae-sequencesuperscriptsubscript𝑹𝖲𝛼𝖡superscriptsubscript𝑳𝖲𝛼𝜃superscriptsubscript𝑹𝖲𝛼\displaystyle\qquad=\mathbb{P}\left(\boldsymbol{L}_{\mathsf{S}}^{(\alpha)}\in% \mathsf{A},\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\in\mathsf{B},\boldsymbol{L}_% {\mathsf{S}}^{(\alpha)}<\theta<\boldsymbol{R}_{\mathsf{S}}^{(\alpha)}\right).= blackboard_P ( bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ∈ sansserif_B , bold_italic_L start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT < italic_θ < bold_italic_R start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ) .

Lemma 11 is now an easy corollary of the previous lemma.

Proof of Lemma 11.

Lemma 20.3 yields that the finite measures

2𝟙𝖢(,r)𝟙(,r)(α)ξ𝖲(θ)(d(,r)),𝖢(2),subscriptsuperscript2subscript1𝖢𝑟subscript1𝑟𝛼superscriptsubscript𝜉𝖲𝜃d𝑟𝖢superscript2\int_{\mathbb{R}^{2}}\mathbbm{1}_{\mathsf{C}}(\ell,r)\mathbbm{1}_{(\ell,r)}(% \alpha)\ \xi_{\mathsf{S}}^{(\theta)}\big{(}{\rm d}(\ell,r)\big{)},\qquad% \mathsf{C}\in\mathcal{B}(\mathbb{R}^{2}),∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_C end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( italic_α ) italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) , sansserif_C ∈ caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,

and

2𝟙𝖢(,r)𝟙(,r)(θ)ξ𝖲(α)(d(,r)),𝖢(2),subscriptsuperscript2subscript1𝖢𝑟subscript1𝑟𝜃superscriptsubscript𝜉𝖲𝛼d𝑟𝖢superscript2\int_{\mathbb{R}^{2}}\mathbbm{1}_{\mathsf{C}}(\ell,r)\mathbbm{1}_{(\ell,r)}(% \theta)\ \xi_{\mathsf{S}}^{(\alpha)}\big{(}{\rm d}(\ell,r)\big{)},\qquad% \mathsf{C}\in\mathcal{B}(\mathbb{R}^{2}),∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_C end_POSTSUBSCRIPT ( roman_ℓ , italic_r ) blackboard_1 start_POSTSUBSCRIPT ( roman_ℓ , italic_r ) end_POSTSUBSCRIPT ( italic_θ ) italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_α ) end_POSTSUPERSCRIPT ( roman_d ( roman_ℓ , italic_r ) ) , sansserif_C ∈ caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,

agree on the intersection stable generator {𝖠×𝖡𝖠,𝖡()}conditional-set𝖠𝖡𝖠𝖡\{\mathsf{A}\times\mathsf{B}\mid\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})\}{ sansserif_A × sansserif_B ∣ sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ) } of (2)superscript2\mathcal{B}(\mathbb{R}^{2})caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). By [Klenke, Lemma 1.42], this implies that these two measures are equal. ∎

Before addressing Lemma 12, we take a brief detour, because some of the notation of the proof of Lemma 20 is convenient to show measureability of the stepping-out distribution in the argument θ𝜃\thetaitalic_θ.

Remark 21.

Assume we are in Setting C. Let A,B()𝐴𝐵A,B\in\mathcal{B}(\mathbb{R})italic_A , italic_B ∈ caligraphic_B ( blackboard_R ), and for all n,n~𝑛~𝑛n,\widetilde{n}\in\mathbb{N}italic_n , over~ start_ARG italic_n end_ARG ∈ blackboard_N with n+n~m𝑛~𝑛𝑚n+\widetilde{n}\leqslant mitalic_n + over~ start_ARG italic_n end_ARG ⩽ italic_m and ι{n,,m+1n~}ι𝑛𝑚1~𝑛\upiota\in\{n,\ldots,m+1-\widetilde{n}\}roman_ι ∈ { italic_n , … , italic_m + 1 - over~ start_ARG italic_n end_ARG } define 𝖠1ι,n,n~,,𝖠nι,n,n~superscriptsubscript𝖠1ι𝑛~𝑛superscriptsubscript𝖠𝑛ι𝑛~𝑛\mathsf{A}_{1}^{\upiota,n,\widetilde{n}},\ldots,\mathsf{A}_{n}^{\upiota,n,% \widetilde{n}}sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , sansserif_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT, 𝖡1ι,n,n~,,𝖡n~ι,n,n~superscriptsubscript𝖡1ι𝑛~𝑛superscriptsubscript𝖡~𝑛ι𝑛~𝑛\mathsf{B}_{1}^{\upiota,n,\widetilde{n}},\ldots,\mathsf{B}_{\widetilde{n}}^{% \upiota,n,\widetilde{n}}sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , sansserif_B start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT as in the proof of Lemma 12. Combining Tonnelli’s theorem and Lemma 18, we get that

θ(L1(θ)𝖠1ι,n,n~,,Ln(θ)𝖠nι,n,n~,R1(θ)𝖡1ι,n,n~,,Rn~(θ)𝖡n~ι,n,n~)maps-to𝜃formulae-sequencesuperscriptsubscript𝐿1𝜃superscriptsubscript𝖠1ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝐿𝑛𝜃superscriptsubscript𝖠𝑛ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝑅1𝜃superscriptsubscript𝖡1ι𝑛~𝑛superscriptsubscript𝑅~𝑛𝜃superscriptsubscript𝖡~𝑛ι𝑛~𝑛\theta\mapsto\mathbb{P}\left(L_{1}^{(\theta)}\in\mathsf{A}_{1}^{\upiota,n,% \widetilde{n}},\ldots,L_{n}^{(\theta)}\in\mathsf{A}_{n}^{\upiota,n,\widetilde{% n}},R_{1}^{(\theta)}\in\mathsf{B}_{1}^{\upiota,n,\widetilde{n}},\ldots,R_{% \widetilde{n}}^{(\theta)}\in\mathsf{B}_{\widetilde{n}}^{\upiota,n,\widetilde{n% }}\right)italic_θ ↦ blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT )

is measurable for all n,n~𝑛~𝑛n,\widetilde{n}\in\mathbb{N}italic_n , over~ start_ARG italic_n end_ARG ∈ blackboard_N with n+n~m𝑛~𝑛𝑚n+\widetilde{n}\leqslant mitalic_n + over~ start_ARG italic_n end_ARG ⩽ italic_m and ι{n,,m+1n~}ι𝑛𝑚1~𝑛\upiota\in\{n,\ldots,m+1-\widetilde{n}\}roman_ι ∈ { italic_n , … , italic_m + 1 - over~ start_ARG italic_n end_ARG } . Thus also

θmaps-to𝜃absent\displaystyle\theta\mapstoitalic_θ ↦ ξ𝖲(θ)(A×B)superscriptsubscript𝜉𝖲𝜃𝐴𝐵\displaystyle\xi_{\mathsf{S}}^{(\theta)}(A\times B)italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( italic_A × italic_B )
=1mn=1mn~=1mnι=nm+1n~(L1(θ)𝖠1ι,n,n~,,Ln(θ)𝖠nι,n,n~,R1(θ)𝖡1ι,n,n~,,Rn~(θ)𝖡n~ι,n,n~)absent1𝑚superscriptsubscript𝑛1𝑚superscriptsubscript~𝑛1𝑚𝑛superscriptsubscriptι𝑛𝑚1~𝑛formulae-sequencesuperscriptsubscript𝐿1𝜃superscriptsubscript𝖠1ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝐿𝑛𝜃superscriptsubscript𝖠𝑛ι𝑛~𝑛formulae-sequencesuperscriptsubscript𝑅1𝜃superscriptsubscript𝖡1ι𝑛~𝑛superscriptsubscript𝑅~𝑛𝜃superscriptsubscript𝖡~𝑛ι𝑛~𝑛\displaystyle\quad=\frac{1}{m}\sum_{n=1}^{m}\sum_{\widetilde{n}=1}^{m-n}\sum_{% \upiota=n}^{m+1-\widetilde{n}}\mathbb{P}\left(L_{1}^{(\theta)}\in\mathsf{A}_{1% }^{\upiota,n,\widetilde{n}},\ldots,L_{n}^{(\theta)}\in\mathsf{A}_{n}^{\upiota,% n,\widetilde{n}},R_{1}^{(\theta)}\in\mathsf{B}_{1}^{\upiota,n,\widetilde{n}},% \ldots,R_{\widetilde{n}}^{(\theta)}\in\mathsf{B}_{\widetilde{n}}^{\upiota,n,% \widetilde{n}}\right)= divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_n end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT roman_ι = italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι , italic_n , over~ start_ARG italic_n end_ARG end_POSTSUPERSCRIPT )

is measurable. It is now straight forward to verify that (θ,C)ξ𝖲(θ)(C)maps-to𝜃𝐶superscriptsubscript𝜉𝖲𝜃𝐶(\theta,C)\mapsto\xi_{\mathsf{S}}^{(\theta)}(C)( italic_θ , italic_C ) ↦ italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( italic_C ) is a transition kernel.

We turn to the proof of Lemma 12. Its essential strategy is to leverage the following property of (Li)isubscriptsubscript𝐿𝑖𝑖(L_{i})_{i\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT and (Rj)jsubscriptsubscript𝑅𝑗𝑗(R_{j})_{j\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j ∈ blackboard_N end_POSTSUBSCRIPT to the stepping-out distribution.

Lemma 22.

Assume that we are in Setting C.

  1. 1.

    Let α𝛼\alpha\in\mathbb{R}italic_α ∈ blackboard_R, i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N and 𝖠1,,𝖠i,𝖡1,,𝖡j()subscript𝖠1subscript𝖠𝑖subscript𝖡1subscript𝖡𝑗\mathsf{A}_{1},\ldots,\mathsf{A}_{i},\mathsf{B}_{1},\ldots,\mathsf{B}_{j}\in% \mathcal{B}(\mathbb{R})sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ caligraphic_B ( blackboard_R ). Then we have

    (L1(θ)Λα(𝖠1),,Li(θ)Λα(𝖠i),R1(θ)Λα(𝖡1),,Rj(θ)Λα(𝖡j))formulae-sequencesuperscriptsubscript𝐿1𝜃subscriptΛ𝛼subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscriptΛ𝛼subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscriptΛ𝛼subscript𝖡1superscriptsubscript𝑅𝑗𝜃subscriptΛ𝛼subscript𝖡𝑗\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{1% }),\ldots,L_{i}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{i}),R_{1}^{(\theta)}% \in\Lambda_{\alpha}(\mathsf{B}_{1}),\ldots,R_{j}^{(\theta)}\in\Lambda_{\alpha}% (\mathsf{B}_{j})\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) )
    =(L1(Λα(θ))𝖡1,,Lj(Λα(θ))𝖡j,R1(Λα(θ))𝖠1,,Ri(Λα(θ))𝖠i).absentformulae-sequencesuperscriptsubscript𝐿1subscriptΛ𝛼𝜃subscript𝖡1formulae-sequencesuperscriptsubscript𝐿𝑗subscriptΛ𝛼𝜃subscript𝖡𝑗formulae-sequencesuperscriptsubscript𝑅1subscriptΛ𝛼𝜃subscript𝖠1superscriptsubscript𝑅𝑖subscriptΛ𝛼𝜃subscript𝖠𝑖\displaystyle\quad=\mathbb{P}\left(L_{1}^{\left(\Lambda_{\alpha}(\theta)\right% )}\in\mathsf{B}_{1},\ldots,L_{j}^{\left(\Lambda_{\alpha}(\theta)\right)}\in% \mathsf{B}_{j},R_{1}^{\left(\Lambda_{\alpha}(\theta)\right)}\in\mathsf{A}_{1},% \ldots,R_{i}^{\left(\Lambda_{\alpha}(\theta)\right)}\in\mathsf{A}_{i}\right).= blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) .
  2. 2.

    For all i{1,,m}𝑖1𝑚i\in\{1,\ldots,m\}italic_i ∈ { 1 , … , italic_m }, j{1,,m+1i}𝑗1𝑚1𝑖j\in\{1,\ldots,m+1-i\}italic_j ∈ { 1 , … , italic_m + 1 - italic_i }, θ,α𝜃𝛼\theta,\alpha\in\mathbb{R}italic_θ , italic_α ∈ blackboard_R and 𝖠,𝖡()𝖠𝖡\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ) holds

    (Li(θ)𝖠,Rj(θ)𝖡,τΛα(𝖲)(θ)=i,𝔗Λα(𝖲)(θ)=j)formulae-sequencesuperscriptsubscript𝐿𝑖𝜃𝖠formulae-sequencesuperscriptsubscript𝑅𝑗𝜃𝖡formulae-sequencesuperscriptsubscript𝜏subscriptΛ𝛼𝖲𝜃𝑖superscriptsubscript𝔗subscriptΛ𝛼𝖲𝜃𝑗\displaystyle\mathbb{P}\left(L_{i}^{(\theta)}\in\mathsf{A},R_{j}^{(\theta)}\in% \mathsf{B},\tau_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}=i,\mathfrak{T}_{% \Lambda_{\alpha}(\mathsf{S})}^{(\theta)}=j\right)blackboard_P ( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_τ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j )
    =(Lj(Λα(θ))Λα1(𝖡),Ri(Λα(θ))Λα1(𝖠),τ𝖲(Λα(θ))=j,𝔗𝖲(Λα(θ))=i).absentformulae-sequencesuperscriptsubscript𝐿𝑗subscriptΛ𝛼𝜃superscriptsubscriptΛ𝛼1𝖡formulae-sequencesuperscriptsubscript𝑅𝑖subscriptΛ𝛼𝜃superscriptsubscriptΛ𝛼1𝖠formulae-sequencesuperscriptsubscript𝜏𝖲subscriptΛ𝛼𝜃𝑗superscriptsubscript𝔗𝖲subscriptΛ𝛼𝜃𝑖\displaystyle\qquad=\mathbb{P}\left(L_{j}^{(\Lambda_{\alpha}(\theta))}\in% \Lambda_{\alpha}^{-1}(\mathsf{B}),R_{i}^{(\Lambda_{\alpha}(\theta))}\in\Lambda% _{\alpha}^{-1}(\mathsf{A}),\tau_{\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}=j,% \mathfrak{T}_{\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}=i\right).= blackboard_P ( italic_L start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_B ) , italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A ) , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT = italic_j , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT = italic_i ) .
Proof.

To 1: Let α𝛼\alpha\in\mathbb{R}italic_α ∈ blackboard_R, i,j𝑖𝑗i,j\in\mathbb{N}italic_i , italic_j ∈ blackboard_N and 𝖠1,,𝖠i,𝖡1,,𝖡j()subscript𝖠1subscript𝖠𝑖subscript𝖡1subscript𝖡𝑗\mathsf{A}_{1},\ldots,\mathsf{A}_{i},\mathsf{B}_{1},\ldots,\mathsf{B}_{j}\in% \mathcal{B}(\mathbb{R})sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ caligraphic_B ( blackboard_R ). We have by Lemma 18

(L1(θ)Λα(𝖠1),,Li(θ)Λα(𝖠i),R1(θ)Λα(𝖡1),,Rj(θ)Λα(𝖡j))formulae-sequencesuperscriptsubscript𝐿1𝜃subscriptΛ𝛼subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscriptΛ𝛼subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscriptΛ𝛼subscript𝖡1superscriptsubscript𝑅𝑗𝜃subscriptΛ𝛼subscript𝖡𝑗\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{1% }),\ldots,L_{i}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{i}),R_{1}^{(\theta)}% \in\Lambda_{\alpha}(\mathsf{B}_{1}),\ldots,R_{j}^{(\theta)}\in\Lambda_{\alpha}% (\mathsf{B}_{j})\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) )
=1w0wk=1i𝟙Λα(𝖠k)(θu(k1)w)l=1j𝟙Λα(𝖡l)(θu+lw)Leb1(du)absent1𝑤superscriptsubscript0𝑤superscriptsubscriptproduct𝑘1𝑖subscript1subscriptΛ𝛼subscript𝖠𝑘𝜃𝑢𝑘1𝑤superscriptsubscriptproduct𝑙1𝑗subscript1subscriptΛ𝛼subscript𝖡𝑙𝜃𝑢𝑙𝑤subscriptLeb1d𝑢\displaystyle\quad=\frac{1}{w}\int_{0}^{w}\prod_{k=1}^{i}\mathbbm{1}_{\Lambda_% {\alpha}(\mathsf{A}_{k})}\big{(}\theta-u-(k-1)w\big{)}\prod_{l=1}^{j}\mathbbm{% 1}_{\Lambda_{\alpha}(\mathsf{B}_{l})}\big{(}\theta-u+lw\big{)}\ \mathrm{Leb}_{% 1}({\rm d}u)= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_θ - italic_u - ( italic_k - 1 ) italic_w ) ∏ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_θ - italic_u + italic_l italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_u )
=1w0wk=1i𝟙𝖠k(αθ+u+(k1)w)l=1j𝟙𝖡l(αθ+ulw)Leb1(du).absent1𝑤superscriptsubscript0𝑤superscriptsubscriptproduct𝑘1𝑖subscript1subscript𝖠𝑘𝛼𝜃𝑢𝑘1𝑤superscriptsubscriptproduct𝑙1𝑗subscript1subscript𝖡𝑙𝛼𝜃𝑢𝑙𝑤subscriptLeb1d𝑢\displaystyle\quad=\frac{1}{w}\int_{0}^{w}\prod_{k=1}^{i}\mathbbm{1}_{\mathsf{% A}_{k}}\big{(}\alpha-\theta+u+(k-1)w\big{)}\prod_{l=1}^{j}\mathbbm{1}_{\mathsf% {B}_{l}}\big{(}\alpha-\theta+u-lw\big{)}\ \mathrm{Leb}_{1}({\rm d}u).= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - italic_θ + italic_u + ( italic_k - 1 ) italic_w ) ∏ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - italic_θ + italic_u - italic_l italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_u ) .

Performing a change of variables u~=u+w~𝑢𝑢𝑤\widetilde{u}=-u+wover~ start_ARG italic_u end_ARG = - italic_u + italic_w yields

(L1(θ)Λα(𝖠1),,Li(θ)Λα(𝖠i),R1(θ)Λα(𝖡1),,Rj(θ)Λα(𝖡j))formulae-sequencesuperscriptsubscript𝐿1𝜃subscriptΛ𝛼subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscriptΛ𝛼subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscriptΛ𝛼subscript𝖡1superscriptsubscript𝑅𝑗𝜃subscriptΛ𝛼subscript𝖡𝑗\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{1% }),\ldots,L_{i}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{i}),R_{1}^{(\theta)}% \in\Lambda_{\alpha}(\mathsf{B}_{1}),\ldots,R_{j}^{(\theta)}\in\Lambda_{\alpha}% (\mathsf{B}_{j})\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) )
=1w0wk=1i𝟙𝖠k(αθu~+kw)l=1j𝟙𝖡l(αθu~(l1)w)Leb1(du~).absent1𝑤superscriptsubscript0𝑤superscriptsubscriptproduct𝑘1𝑖subscript1subscript𝖠𝑘𝛼𝜃~𝑢𝑘𝑤superscriptsubscriptproduct𝑙1𝑗subscript1subscript𝖡𝑙𝛼𝜃~𝑢𝑙1𝑤subscriptLeb1d~𝑢\displaystyle\quad=\frac{1}{w}\int_{0}^{w}\prod_{k=1}^{i}\mathbbm{1}_{\mathsf{% A}_{k}}\big{(}\alpha-\theta-\widetilde{u}+kw\big{)}\prod_{l=1}^{j}\mathbbm{1}_% {\mathsf{B}_{l}}\big{(}\alpha-\theta-\widetilde{u}-(l-1)w\big{)}\ \mathrm{Leb}% _{1}({\rm d}\widetilde{u}).= divide start_ARG 1 end_ARG start_ARG italic_w end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - italic_θ - over~ start_ARG italic_u end_ARG + italic_k italic_w ) ∏ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_α - italic_θ - over~ start_ARG italic_u end_ARG - ( italic_l - 1 ) italic_w ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d over~ start_ARG italic_u end_ARG ) .

Using again Lemma 18, we obtain

(L1(θ)Λα(𝖠1),,Li(θ)Λα(𝖠i),R1(θ)Λα(𝖡1),,Rj(θ)Λα(𝖡j))formulae-sequencesuperscriptsubscript𝐿1𝜃subscriptΛ𝛼subscript𝖠1formulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscriptΛ𝛼subscript𝖠𝑖formulae-sequencesuperscriptsubscript𝑅1𝜃subscriptΛ𝛼subscript𝖡1superscriptsubscript𝑅𝑗𝜃subscriptΛ𝛼subscript𝖡𝑗\displaystyle\mathbb{P}\left(L_{1}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{1% }),\ldots,L_{i}^{(\theta)}\in\Lambda_{\alpha}(\mathsf{A}_{i}),R_{1}^{(\theta)}% \in\Lambda_{\alpha}(\mathsf{B}_{1}),\ldots,R_{j}^{(\theta)}\in\Lambda_{\alpha}% (\mathsf{B}_{j})\right)blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) )
=(L1(Λα(θ))𝖡1,,Lj(Λα(θ))𝖡j,R1(Λα(θ))𝖠1,,Ri(Λα(θ))𝖠i).absentformulae-sequencesuperscriptsubscript𝐿1subscriptΛ𝛼𝜃subscript𝖡1formulae-sequencesuperscriptsubscript𝐿𝑗subscriptΛ𝛼𝜃subscript𝖡𝑗formulae-sequencesuperscriptsubscript𝑅1subscriptΛ𝛼𝜃subscript𝖠1superscriptsubscript𝑅𝑖subscriptΛ𝛼𝜃subscript𝖠𝑖\displaystyle\quad=\mathbb{P}\left(L_{1}^{\left(\Lambda_{\alpha}(\theta)\right% )}\in\mathsf{B}_{1},\ldots,L_{j}^{\left(\Lambda_{\alpha}(\theta)\right)}\in% \mathsf{B}_{j},R_{1}^{\left(\Lambda_{\alpha}(\theta)\right)}\in\mathsf{A}_{1},% \ldots,R_{i}^{\left(\Lambda_{\alpha}(\theta)\right)}\in\mathsf{A}_{i}\right).= blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) .

To 2: Let i{1,,m}𝑖1𝑚i\in\{1,\ldots,m\}italic_i ∈ { 1 , … , italic_m }, j{1,,m+1i}𝑗1𝑚1𝑖j\in\{1,\ldots,m+1-i\}italic_j ∈ { 1 , … , italic_m + 1 - italic_i }, θ,α𝜃𝛼\theta,\alpha\in\mathbb{R}italic_θ , italic_α ∈ blackboard_R and 𝖠,𝖡()𝖠𝖡\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ). For ι{i,,m+1j}ι𝑖𝑚1𝑗\upiota\in\{i,\ldots,m+1-j\}roman_ι ∈ { italic_i , … , italic_m + 1 - italic_j } we define

𝖠kι=Λα(𝖲),k{1,,i1}formulae-sequencesuperscriptsubscript𝖠𝑘ιsubscriptΛ𝛼𝖲𝑘1𝑖1\displaystyle\mathsf{A}_{k}^{\upiota}=\Lambda_{\alpha}(\mathsf{S}),\quad k\in% \{1,\ldots,i-1\}\qquadsansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT = roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) , italic_k ∈ { 1 , … , italic_i - 1 } and𝖠iι={𝖠,ι=i𝖠(Λα(𝖲)),ι>i,andsuperscriptsubscript𝖠𝑖ιcases𝖠ι𝑖𝖠subscriptΛ𝛼𝖲ι𝑖\displaystyle\text{and}\qquad\mathsf{A}_{i}^{\upiota}=\begin{dcases}\mathsf{A}% ,&\upiota=i\\ \mathsf{A}\cap(\mathbb{R}\setminus\Lambda_{\alpha}(\mathsf{S})),&\upiota>i,% \end{dcases}and sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT = { start_ROW start_CELL sansserif_A , end_CELL start_CELL roman_ι = italic_i end_CELL end_ROW start_ROW start_CELL sansserif_A ∩ ( blackboard_R ∖ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) ) , end_CELL start_CELL roman_ι > italic_i , end_CELL end_ROW
𝖡lι=Λα(𝖲),l{1,,j1}formulae-sequencesuperscriptsubscript𝖡𝑙ιsubscriptΛ𝛼𝖲𝑙1𝑗1\displaystyle\mathsf{B}_{l}^{\upiota}=\Lambda_{\alpha}(\mathsf{S}),\quad l\in% \{1,\ldots,j-1\}\qquadsansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT = roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) , italic_l ∈ { 1 , … , italic_j - 1 } and𝖡jι={𝖡,ι=m+1j𝖡(Λα(𝖲)),ι>m+1j,andsuperscriptsubscript𝖡𝑗ιcases𝖡ι𝑚1𝑗𝖡subscriptΛ𝛼𝖲ι𝑚1𝑗\displaystyle\text{and}\qquad\mathsf{B}_{j}^{\upiota}=\begin{dcases}\mathsf{B}% ,&\upiota=m+1-j\\ \mathsf{B}\cap(\mathbb{R}\setminus\Lambda_{\alpha}(\mathsf{S})),&\upiota>m+1-j% ,\end{dcases}and sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT = { start_ROW start_CELL sansserif_B , end_CELL start_CELL roman_ι = italic_m + 1 - italic_j end_CELL end_ROW start_ROW start_CELL sansserif_B ∩ ( blackboard_R ∖ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) ) , end_CELL start_CELL roman_ι > italic_m + 1 - italic_j , end_CELL end_ROW

as well as

𝖠~kι=𝖲,k{1,,i1}formulae-sequencesuperscriptsubscript~𝖠𝑘ι𝖲𝑘1𝑖1\displaystyle\widetilde{\mathsf{A}}_{k}^{\upiota}=\mathsf{S},\quad k\in\{1,% \ldots,i-1\}\qquadover~ start_ARG sansserif_A end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT = sansserif_S , italic_k ∈ { 1 , … , italic_i - 1 } and𝖠~iι={Λα1(𝖠),ι=iΛα1(𝖠)(𝖲),ι>i,andsuperscriptsubscript~𝖠𝑖ιcasessuperscriptsubscriptΛ𝛼1𝖠ι𝑖superscriptsubscriptΛ𝛼1𝖠𝖲ι𝑖\displaystyle\text{and}\qquad\widetilde{\mathsf{A}}_{i}^{\upiota}=\begin{% dcases}\Lambda_{\alpha}^{-1}(\mathsf{A}),&\upiota=i\\ \Lambda_{\alpha}^{-1}(\mathsf{A})\cap(\mathbb{R}\setminus\mathsf{S}),&\upiota>% i,\end{dcases}and over~ start_ARG sansserif_A end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT = { start_ROW start_CELL roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A ) , end_CELL start_CELL roman_ι = italic_i end_CELL end_ROW start_ROW start_CELL roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A ) ∩ ( blackboard_R ∖ sansserif_S ) , end_CELL start_CELL roman_ι > italic_i , end_CELL end_ROW
𝖡~lι=𝖲,l{1,,j1}formulae-sequencesubscriptsuperscript~𝖡ι𝑙𝖲𝑙1𝑗1\displaystyle\widetilde{\mathsf{B}}^{\upiota}_{l}=\mathsf{S},\quad l\in\{1,% \ldots,j-1\}\qquadover~ start_ARG sansserif_B end_ARG start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = sansserif_S , italic_l ∈ { 1 , … , italic_j - 1 } and𝖡~jι={Λα1(𝖡),ι=m+1jΛα1(𝖡)(𝖲),ι>m+1j.andsubscriptsuperscript~𝖡ι𝑗casessuperscriptsubscriptΛ𝛼1𝖡ι𝑚1𝑗superscriptsubscriptΛ𝛼1𝖡𝖲ι𝑚1𝑗\displaystyle\text{and}\qquad\widetilde{\mathsf{B}}^{\upiota}_{j}=\begin{% dcases}\Lambda_{\alpha}^{-1}(\mathsf{B}),&\upiota=m+1-j\\ \Lambda_{\alpha}^{-1}(\mathsf{B})\cap(\mathbb{R}\setminus\mathsf{S}),&\upiota>% m+1-j.\end{dcases}and over~ start_ARG sansserif_B end_ARG start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = { start_ROW start_CELL roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_B ) , end_CELL start_CELL roman_ι = italic_m + 1 - italic_j end_CELL end_ROW start_ROW start_CELL roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_B ) ∩ ( blackboard_R ∖ sansserif_S ) , end_CELL start_CELL roman_ι > italic_m + 1 - italic_j . end_CELL end_ROW

Observe that

Λα(𝖠~kι)=𝖠kι,andΛα(𝖡~lι)=𝖡lι,for k{1,,i},l{1,,j},formulae-sequencesubscriptΛ𝛼superscriptsubscript~𝖠𝑘ιsuperscriptsubscript𝖠𝑘ιandformulae-sequencesubscriptΛ𝛼superscriptsubscript~𝖡𝑙ιsuperscriptsubscript𝖡𝑙ιformulae-sequencefor 𝑘1𝑖𝑙1𝑗\displaystyle\Lambda_{\alpha}(\widetilde{\mathsf{A}}_{k}^{\upiota})=\mathsf{A}% _{k}^{\upiota},\qquad\text{and}\qquad\Lambda_{\alpha}(\widetilde{\mathsf{B}}_{% l}^{\upiota})=\mathsf{B}_{l}^{\upiota},\qquad\textup{for }k\in\{1,\ldots,i\},l% \in\{1,\ldots,j\},roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( over~ start_ARG sansserif_A end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT ) = sansserif_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , and roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( over~ start_ARG sansserif_B end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT ) = sansserif_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , for italic_k ∈ { 1 , … , italic_i } , italic_l ∈ { 1 , … , italic_j } ,

as Λα()=subscriptΛ𝛼\Lambda_{\alpha}(\mathbb{R})=\mathbb{R}roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( blackboard_R ) = blackboard_R. Due to JUnif({1,,m})similar-to𝐽Unif1𝑚J\sim\mathrm{Unif}(\{1,\ldots,m\})italic_J ∼ roman_Unif ( { 1 , … , italic_m } ) being independent of (Lk(θ))ksubscriptsuperscriptsubscript𝐿𝑘𝜃𝑘(L_{k}^{(\theta)})_{k\in\mathbb{N}}( italic_L start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_k ∈ blackboard_N end_POSTSUBSCRIPT and (Rl(θ))lsubscriptsuperscriptsubscript𝑅𝑙𝜃𝑙(R_{l}^{(\theta)})_{l\in\mathbb{N}}( italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_l ∈ blackboard_N end_POSTSUBSCRIPT, and (22), using statement 1 gives

(Li(θ)𝖠,Rj(θ)𝖡,τΛα(𝖲)(θ)=i,𝔗Λα(𝖲)(θ)=j)formulae-sequencesuperscriptsubscript𝐿𝑖𝜃𝖠formulae-sequencesuperscriptsubscript𝑅𝑗𝜃𝖡formulae-sequencesuperscriptsubscript𝜏subscriptΛ𝛼𝖲𝜃𝑖superscriptsubscript𝔗subscriptΛ𝛼𝖲𝜃𝑗\displaystyle\mathbb{P}\left(L_{i}^{(\theta)}\in\mathsf{A},R_{j}^{(\theta)}\in% \mathsf{B},\tau_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}=i,\mathfrak{T}_{% \Lambda_{\alpha}(\mathsf{S})}^{(\theta)}=j\right)blackboard_P ( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_τ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j )
=ι=im+1j(L1(θ)𝖠1ι,,Li(θ)𝖠iι,R1(θ)𝖡1ι,,Rj(θ)𝖡jι)(J=ι)absentsuperscriptsubscriptι𝑖𝑚1𝑗formulae-sequencesuperscriptsubscript𝐿1𝜃superscriptsubscript𝖠1ιformulae-sequencesuperscriptsubscript𝐿𝑖𝜃superscriptsubscript𝖠𝑖ιformulae-sequencesuperscriptsubscript𝑅1𝜃superscriptsubscript𝖡1ιsuperscriptsubscript𝑅𝑗𝜃superscriptsubscript𝖡𝑗ι𝐽ι\displaystyle\qquad=\sum_{\upiota=i}^{m+1-j}\mathbb{P}\left(L_{1}^{(\theta)}% \in\mathsf{A}_{1}^{\upiota},\ldots,L_{i}^{(\theta)}\in\mathsf{A}_{i}^{\upiota}% ,R_{1}^{(\theta)}\in\mathsf{B}_{1}^{\upiota},\ldots,R_{j}^{(\theta)}\in\mathsf% {B}_{j}^{\upiota}\right)\mathbb{P}(J=\upiota)= ∑ start_POSTSUBSCRIPT roman_ι = italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT ) blackboard_P ( italic_J = roman_ι )
=1mι=im+1j(L1(θ)Λα(𝖠~1ι),,Li(θ)Λα(𝖠~iι),R1(θ)Λα(𝖡~1ι),,Rj(θ)Λα(𝖡~jι))absent1𝑚superscriptsubscriptι𝑖𝑚1𝑗formulae-sequencesuperscriptsubscript𝐿1𝜃subscriptΛ𝛼superscriptsubscript~𝖠1ιformulae-sequencesuperscriptsubscript𝐿𝑖𝜃subscriptΛ𝛼superscriptsubscript~𝖠𝑖ιformulae-sequencesuperscriptsubscript𝑅1𝜃subscriptΛ𝛼superscriptsubscript~𝖡1ιsuperscriptsubscript𝑅𝑗𝜃subscriptΛ𝛼superscriptsubscript~𝖡𝑗ι\displaystyle\qquad=\frac{1}{m}\sum_{\upiota=i}^{m+1-j}\mathbb{P}\left(L_{1}^{% (\theta)}\in\Lambda_{\alpha}(\widetilde{\mathsf{A}}_{1}^{\upiota}),\ldots,L_{i% }^{(\theta)}\in\Lambda_{\alpha}(\widetilde{\mathsf{A}}_{i}^{\upiota}),R_{1}^{(% \theta)}\in\Lambda_{\alpha}(\widetilde{\mathsf{B}}_{1}^{\upiota}),\ldots,R_{j}% ^{(\theta)}\in\Lambda_{\alpha}(\widetilde{\mathsf{B}}_{j}^{\upiota})\right)= divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ι = italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( over~ start_ARG sansserif_A end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT ) , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( over~ start_ARG sansserif_A end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT ) , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( over~ start_ARG sansserif_B end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT ) , … , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( over~ start_ARG sansserif_B end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT ) )
=1mι=im+1j(L1(Λα(θ))𝖡~1ι,,Lj(Λα(θ))𝖡~jι,R1(Λα(θ))𝖠~1ι,,Ri(Λα(θ))𝖠~iι)absent1𝑚superscriptsubscriptι𝑖𝑚1𝑗formulae-sequencesuperscriptsubscript𝐿1subscriptΛ𝛼𝜃superscriptsubscript~𝖡1ιformulae-sequencesuperscriptsubscript𝐿𝑗subscriptΛ𝛼𝜃superscriptsubscript~𝖡𝑗ιformulae-sequencesuperscriptsubscript𝑅1subscriptΛ𝛼𝜃superscriptsubscript~𝖠1ιsuperscriptsubscript𝑅𝑖subscriptΛ𝛼𝜃superscriptsubscript~𝖠𝑖ι\displaystyle\qquad=\frac{1}{m}\sum_{\upiota=i}^{m+1-j}\mathbb{P}\left(L_{1}^{% (\Lambda_{\alpha}(\theta))}\in\widetilde{\mathsf{B}}_{1}^{\upiota},\ldots,L_{j% }^{(\Lambda_{\alpha}(\theta))}\in\widetilde{\mathsf{B}}_{j}^{\upiota},R_{1}^{(% \Lambda_{\alpha}(\theta))}\in\widetilde{\mathsf{A}}_{1}^{\upiota},\ldots,R_{i}% ^{(\Lambda_{\alpha}(\theta))}\in\widetilde{\mathsf{A}}_{i}^{\upiota}\right)= divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ι = italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ over~ start_ARG sansserif_B end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , … , italic_L start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ over~ start_ARG sansserif_B end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ over~ start_ARG sansserif_A end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ over~ start_ARG sansserif_A end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ι end_POSTSUPERSCRIPT )
=(Lj(Λα(θ))Λα1(𝖡),Ri(Λα(θ))Λα1(𝖠),τ𝖲(Λα(θ))=j,𝔗𝖲(Λα(θ))=i).absentformulae-sequencesuperscriptsubscript𝐿𝑗subscriptΛ𝛼𝜃superscriptsubscriptΛ𝛼1𝖡formulae-sequencesuperscriptsubscript𝑅𝑖subscriptΛ𝛼𝜃superscriptsubscriptΛ𝛼1𝖠formulae-sequencesuperscriptsubscript𝜏𝖲subscriptΛ𝛼𝜃𝑗superscriptsubscript𝔗𝖲subscriptΛ𝛼𝜃𝑖\displaystyle\qquad=\mathbb{P}\left(L_{j}^{(\Lambda_{\alpha}(\theta))}\in% \Lambda_{\alpha}^{-1}(\mathsf{B}),R_{i}^{(\Lambda_{\alpha}(\theta))}\in\Lambda% _{\alpha}^{-1}(\mathsf{A}),\tau_{\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}=j,% \mathfrak{T}_{\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}=i\right).= blackboard_P ( italic_L start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_B ) , italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A ) , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT = italic_j , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT = italic_i ) .

Proof of Lemma 12.

Let 𝖠,𝖡()𝖠𝖡\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ). Using (22), we obtain

ξΛα(𝖲)(θ)(𝖠×𝖡)=(𝑳Λα(𝖲)(θ)𝖠,𝑹Λα(𝖲)(θ)𝖡)=i=1mj=1m+1i(Li(θ)𝖠,Rj(θ)𝖡,τΛα(𝖲)(θ)=i,𝔗Λα(𝖲)(θ)=j).superscriptsubscript𝜉subscriptΛ𝛼𝖲𝜃𝖠𝖡formulae-sequencesubscriptsuperscript𝑳𝜃subscriptΛ𝛼𝖲𝖠subscriptsuperscript𝑹𝜃subscriptΛ𝛼𝖲𝖡superscriptsubscript𝑖1𝑚superscriptsubscript𝑗1𝑚1𝑖formulae-sequencesuperscriptsubscript𝐿𝑖𝜃𝖠formulae-sequencesuperscriptsubscript𝑅𝑗𝜃𝖡formulae-sequencesuperscriptsubscript𝜏subscriptΛ𝛼𝖲𝜃𝑖superscriptsubscript𝔗subscriptΛ𝛼𝖲𝜃𝑗\begin{split}\xi_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}(\mathsf{A}\times% \mathsf{B})&=\mathbb{P}\left(\boldsymbol{L}^{(\theta)}_{\Lambda_{\alpha}(% \mathsf{S})}\in\mathsf{A},\boldsymbol{R}^{(\theta)}_{\Lambda_{\alpha}(\mathsf{% S})}\in\mathsf{B}\right)\\ &=\sum_{i=1}^{m}\sum_{j=1}^{m+1-i}\mathbb{P}\left(L_{i}^{(\theta)}\in\mathsf{A% },R_{j}^{(\theta)}\in\mathsf{B},\tau_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}% =i,\mathfrak{T}_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}=j\right).\end{split}start_ROW start_CELL italic_ξ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( sansserif_A × sansserif_B ) end_CELL start_CELL = blackboard_P ( bold_italic_L start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT ∈ sansserif_A , bold_italic_R start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT ∈ sansserif_B ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_i end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_A , italic_R start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ∈ sansserif_B , italic_τ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_i , fraktur_T start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT = italic_j ) . end_CELL end_ROW

Applying Lemma 22.2 and a reordering of the sum yields

ξΛα(𝖲)(θ)(𝖠×𝖡)superscriptsubscript𝜉subscriptΛ𝛼𝖲𝜃𝖠𝖡\displaystyle\xi_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}(\mathsf{A}\times% \mathsf{B})italic_ξ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT ( sansserif_A × sansserif_B ) =j=1mi=1m+1j(Lj(Λα(θ))Λα1(𝖡),Ri(Λα(θ))Λα1(𝖠),τ𝖲(Λα(θ))=j,𝔗𝖲(Λα(θ))=i)absentsuperscriptsubscript𝑗1𝑚superscriptsubscript𝑖1𝑚1𝑗formulae-sequencesuperscriptsubscript𝐿𝑗subscriptΛ𝛼𝜃superscriptsubscriptΛ𝛼1𝖡formulae-sequencesuperscriptsubscript𝑅𝑖subscriptΛ𝛼𝜃superscriptsubscriptΛ𝛼1𝖠formulae-sequencesuperscriptsubscript𝜏𝖲subscriptΛ𝛼𝜃𝑗superscriptsubscript𝔗𝖲subscriptΛ𝛼𝜃𝑖\displaystyle=\sum_{j=1}^{m}\sum_{i=1}^{m+1-j}\mathbb{P}\left(L_{j}^{(\Lambda_% {\alpha}(\theta))}\in\Lambda_{\alpha}^{-1}(\mathsf{B}),R_{i}^{(\Lambda_{\alpha% }(\theta))}\in\Lambda_{\alpha}^{-1}(\mathsf{A}),\tau_{\mathsf{S}}^{(\Lambda_{% \alpha}(\theta))}=j,\mathfrak{T}_{\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}=i\right)= ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m + 1 - italic_j end_POSTSUPERSCRIPT blackboard_P ( italic_L start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_B ) , italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A ) , italic_τ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT = italic_j , fraktur_T start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT = italic_i )
=(𝑳𝖲(Λα(θ))Λα1(𝖡),𝑹𝖲(Λα(θ))Λα1(𝖠))=ξ𝖲(Λα(θ))(λα1(𝖠×𝖡)),absentformulae-sequencesubscriptsuperscript𝑳subscriptΛ𝛼𝜃𝖲superscriptsubscriptΛ𝛼1𝖡subscriptsuperscript𝑹subscriptΛ𝛼𝜃𝖲superscriptsubscriptΛ𝛼1𝖠superscriptsubscript𝜉𝖲subscriptΛ𝛼𝜃superscriptsubscriptλ𝛼1𝖠𝖡\displaystyle=\mathbb{P}\left(\boldsymbol{L}^{(\Lambda_{\alpha}(\theta))}_{% \mathsf{S}}\in\Lambda_{\alpha}^{-1}(\mathsf{B}),\boldsymbol{R}^{(\Lambda_{% \alpha}(\theta))}_{\mathsf{S}}\in\Lambda_{\alpha}^{-1}(\mathsf{A})\right)=\xi_% {\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}\left(\uplambda_{\alpha}^{-1}(\mathsf% {A}\times\mathsf{B})\right),= blackboard_P ( bold_italic_L start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_B ) , bold_italic_R start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT ∈ roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A ) ) = italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT ( roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( sansserif_A × sansserif_B ) ) ,

such that the probability measures ξΛα(𝖲)(θ)superscriptsubscript𝜉subscriptΛ𝛼𝖲𝜃\xi_{\Lambda_{\alpha}(\mathsf{S})}^{(\theta)}italic_ξ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( sansserif_S ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_θ ) end_POSTSUPERSCRIPT and (λα)ξ𝖲(Λα(θ))subscriptsubscriptλ𝛼superscriptsubscript𝜉𝖲subscriptΛ𝛼𝜃(\uplambda_{\alpha})_{\sharp}\xi_{\mathsf{S}}^{(\Lambda_{\alpha}(\theta))}( roman_λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT ♯ end_POSTSUBSCRIPT italic_ξ start_POSTSUBSCRIPT sansserif_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_Λ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ) end_POSTSUPERSCRIPT agree on the intersection stable generator {𝖠×𝖡𝖠,𝖡()}conditional-set𝖠𝖡𝖠𝖡\{\mathsf{A}\times\mathsf{B}\mid\mathsf{A},\mathsf{B}\in\mathcal{B}(\mathbb{R})\}{ sansserif_A × sansserif_B ∣ sansserif_A , sansserif_B ∈ caligraphic_B ( blackboard_R ) } of (2)superscript2\mathcal{B}(\mathbb{R}^{2})caligraphic_B ( blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). Then [Klenke, Lemma 1.42] gives the result. ∎

Appendix B Uniform simple slice sampling

Let (𝖷,𝒳,μ)𝖷𝒳𝜇(\mathsf{X},\mathcal{X},\mu)( sansserif_X , caligraphic_X , italic_μ ) be a σ𝜎\sigmaitalic_σ-finite measure space. For illustrative purposes we introduce uniform simple slice sampling for probability measures on (𝖷,𝒳)𝖷𝒳(\mathsf{X},\mathcal{X})( sansserif_X , caligraphic_X ) that are absolutely continuos with respect to μ𝜇\muitalic_μ. More precisely, let

p:𝖷(0,):𝑝𝖷0p:\mathsf{X}\to(0,\infty)italic_p : sansserif_X → ( 0 , ∞ )

be a measurable function that satisfies

Z:=𝖷p(x)μ(dx)(0,).assign𝑍subscript𝖷𝑝𝑥𝜇d𝑥0Z:=\int_{\mathsf{X}}p(x)\ \mu({\rm d}x)\in(0,\infty).italic_Z := ∫ start_POSTSUBSCRIPT sansserif_X end_POSTSUBSCRIPT italic_p ( italic_x ) italic_μ ( roman_d italic_x ) ∈ ( 0 , ∞ ) .

We consider the probability measure

π(dx):=1Zp(x)μ(dx)assign𝜋d𝑥1𝑍𝑝𝑥𝜇d𝑥\pi({\rm d}x):=\frac{1}{Z}p(x)\mu({\rm d}x)italic_π ( roman_d italic_x ) := divide start_ARG 1 end_ARG start_ARG italic_Z end_ARG italic_p ( italic_x ) italic_μ ( roman_d italic_x ) (30)

that has unnormalized density p𝑝pitalic_p with respect to μ𝜇\muitalic_μ. We define the level sets as

L(t):={x𝖷p(x)>t},t(0,),formulae-sequenceassign𝐿𝑡conditional-set𝑥𝖷𝑝𝑥𝑡𝑡0L(t):=\{x\in\mathsf{X}\mid p(x)>t\},\qquad t\in(0,\infty),italic_L ( italic_t ) := { italic_x ∈ sansserif_X ∣ italic_p ( italic_x ) > italic_t } , italic_t ∈ ( 0 , ∞ ) ,

and the essential supremum norm of p𝑝pitalic_p

pess-:=inf𝖭𝒳,μ(𝖭)=0supx𝖷𝖭p(x).assignsubscriptnorm𝑝ess-subscriptinfimumformulae-sequence𝖭𝒳𝜇𝖭0subscriptsupremum𝑥𝖷𝖭𝑝𝑥\|p\|_{\text{ess-}\infty}:=\inf_{\mathsf{N}\in\mathcal{X},\mu(\mathsf{N})=0}% \sup_{x\in\mathsf{X}\setminus\mathsf{N}}p(x).∥ italic_p ∥ start_POSTSUBSCRIPT ess- ∞ end_POSTSUBSCRIPT := roman_inf start_POSTSUBSCRIPT sansserif_N ∈ caligraphic_X , italic_μ ( sansserif_N ) = 0 end_POSTSUBSCRIPT roman_sup start_POSTSUBSCRIPT italic_x ∈ sansserif_X ∖ sansserif_N end_POSTSUBSCRIPT italic_p ( italic_x ) .

By definition of pess-subscriptnorm𝑝ess-\|p\|_{\text{ess-}\infty}∥ italic_p ∥ start_POSTSUBSCRIPT ess- ∞ end_POSTSUBSCRIPT and because

0μ(L(t))Leb1(dt)=0𝖷𝟙L(t)(x)μ(dx)Leb1(dt)superscriptsubscript0𝜇𝐿𝑡subscriptLeb1d𝑡superscriptsubscript0subscript𝖷subscript1𝐿𝑡𝑥𝜇d𝑥subscriptLeb1d𝑡\displaystyle\int_{0}^{\infty}\mu\big{(}L(t)\big{)}\mathrm{Leb}_{1}({\rm d}t)=% \int_{0}^{\infty}\int_{\mathsf{X}}\mathbbm{1}_{L(t)}(x)\ \mu({\rm d}x)\,% \mathrm{Leb}_{1}({\rm d}t)∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_μ ( italic_L ( italic_t ) ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT sansserif_X end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT ( italic_x ) italic_μ ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t )
=𝖷0𝟙(0,p(x))(t)Leb1(dt)μ(dx)=𝖷p(x)μ(dx)=Z<,absentsubscript𝖷superscriptsubscript0subscript10𝑝𝑥𝑡subscriptLeb1d𝑡𝜇d𝑥subscript𝖷𝑝𝑥𝜇d𝑥𝑍\displaystyle\qquad=\int_{\mathsf{X}}\int_{0}^{\infty}\mathbbm{1}_{(0,p(x))}(t% )\ \mathrm{Leb}_{1}({\rm d}t)\,\mu({\rm d}x)=\int_{\mathsf{X}}p(x)\ \mu({\rm d% }x)=Z<\infty,= ∫ start_POSTSUBSCRIPT sansserif_X end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT blackboard_1 start_POSTSUBSCRIPT ( 0 , italic_p ( italic_x ) ) end_POSTSUBSCRIPT ( italic_t ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) italic_μ ( roman_d italic_x ) = ∫ start_POSTSUBSCRIPT sansserif_X end_POSTSUBSCRIPT italic_p ( italic_x ) italic_μ ( roman_d italic_x ) = italic_Z < ∞ ,

we have that for almost all t(0,pess-)𝑡0subscriptnorm𝑝ess-t\in(0,\|p\|_{\text{ess-}\infty})italic_t ∈ ( 0 , ∥ italic_p ∥ start_POSTSUBSCRIPT ess- ∞ end_POSTSUBSCRIPT ) holds μ(L(t))(0,)𝜇𝐿𝑡0\mu\big{(}L(t)\big{)}\in(0,\infty)italic_μ ( italic_L ( italic_t ) ) ∈ ( 0 , ∞ ). Consequently, the Markov kernel of the uniform simple slice sampler

H(x,𝖠):=1p(x)0p(x)μt(𝖠)Leb1(dt),x𝖷,𝖠𝒳,formulae-sequenceassign𝐻𝑥𝖠1𝑝𝑥superscriptsubscript0𝑝𝑥subscript𝜇𝑡𝖠subscriptLeb1d𝑡formulae-sequence𝑥𝖷𝖠𝒳H(x,\mathsf{A}):=\frac{1}{p(x)}\int_{0}^{p(x)}\mu_{t}(\mathsf{A})\ \mathrm{Leb% }_{1}({\rm d}t),\qquad x\in\mathsf{X},\mathsf{A}\in\mathcal{X},italic_H ( italic_x , sansserif_A ) := divide start_ARG 1 end_ARG start_ARG italic_p ( italic_x ) end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p ( italic_x ) end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( sansserif_A ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) , italic_x ∈ sansserif_X , sansserif_A ∈ caligraphic_X , (31)

where

μt:=1μ(L(t))μ|L(t),t(0,pess-),formulae-sequenceassignsubscript𝜇𝑡evaluated-at1𝜇𝐿𝑡𝜇𝐿𝑡𝑡0subscriptnorm𝑝ess-\mu_{t}:=\frac{1}{\mu(L(t))}\mu|_{L(t)},\qquad t\in(0,\|p\|_{\text{ess-}\infty% }),italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT := divide start_ARG 1 end_ARG start_ARG italic_μ ( italic_L ( italic_t ) ) end_ARG italic_μ | start_POSTSUBSCRIPT italic_L ( italic_t ) end_POSTSUBSCRIPT , italic_t ∈ ( 0 , ∥ italic_p ∥ start_POSTSUBSCRIPT ess- ∞ end_POSTSUBSCRIPT ) ,

is well defined.

Lemma 23.

Let (𝖷,𝒳,μ)𝖷𝒳𝜇(\mathsf{X},\mathcal{X},\mu)( sansserif_X , caligraphic_X , italic_μ ) be a σ𝜎\sigmaitalic_σ-finite measure space, and let let p:𝖷(0,):𝑝𝖷0p:\mathsf{X}\to(0,\infty)italic_p : sansserif_X → ( 0 , ∞ ) be a measurable function with Z:=𝖷p(x)μ(dx)(0,)assign𝑍subscript𝖷𝑝𝑥𝜇d𝑥0Z:=\int_{\mathsf{X}}p(x)\ \mu({\rm d}x)\in(0,\infty)italic_Z := ∫ start_POSTSUBSCRIPT sansserif_X end_POSTSUBSCRIPT italic_p ( italic_x ) italic_μ ( roman_d italic_x ) ∈ ( 0 , ∞ ). Then H𝐻Hitalic_H, as defined in (31), is reversible with respect to π𝜋\piitalic_π, given in (30).

Proof.

Let 𝖠,𝖡𝒳𝖠𝖡𝒳\mathsf{A},\mathsf{B}\in\mathcal{X}sansserif_A , sansserif_B ∈ caligraphic_X. We have

BH(x,𝖠)π(dx)=Bp(x)Zp(x)0p(x)μt(𝖠)Leb1(dt)μ(dx)subscript𝐵𝐻𝑥𝖠𝜋d𝑥subscript𝐵𝑝𝑥𝑍𝑝𝑥superscriptsubscript0𝑝𝑥subscript𝜇𝑡𝖠subscriptLeb1d𝑡𝜇d𝑥\displaystyle\int_{B}H(x,\mathsf{A})\ \pi({\rm d}x)=\int_{B}\frac{p(x)}{Z\,p(x% )}\int_{0}^{p(x)}\mu_{t}(\mathsf{A})\ \mathrm{Leb}_{1}({\rm d}t)\,\mu({\rm d}x)∫ start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT italic_H ( italic_x , sansserif_A ) italic_π ( roman_d italic_x ) = ∫ start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT divide start_ARG italic_p ( italic_x ) end_ARG start_ARG italic_Z italic_p ( italic_x ) end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p ( italic_x ) end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( sansserif_A ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) italic_μ ( roman_d italic_x )
=1Z0Bμt(𝖠)𝟙(0,p(x))(t)μ(dx)Leb1(dt)=1Z0μ(𝖠L(t))μ(𝖡L(t))μ(L(t))Leb1(dt).absent1𝑍superscriptsubscript0subscript𝐵subscript𝜇𝑡𝖠subscript10𝑝𝑥𝑡𝜇d𝑥subscriptLeb1d𝑡1𝑍superscriptsubscript0𝜇𝖠𝐿𝑡𝜇𝖡𝐿𝑡𝜇𝐿𝑡subscriptLeb1d𝑡\displaystyle\qquad=\frac{1}{Z}\int_{0}^{\infty}\int_{B}\mu_{t}(\mathsf{A})% \mathbbm{1}_{(0,p(x))}(t)\ \mu({\rm d}x)\,\mathrm{Leb}_{1}({\rm d}t)=\frac{1}{% Z}\int_{0}^{\infty}\frac{\mu(\mathsf{A}\cap L(t))\mu(\mathsf{B}\cap L(t))}{\mu% (L(t))}\ \mathrm{Leb}_{1}({\rm d}t).= divide start_ARG 1 end_ARG start_ARG italic_Z end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( sansserif_A ) blackboard_1 start_POSTSUBSCRIPT ( 0 , italic_p ( italic_x ) ) end_POSTSUBSCRIPT ( italic_t ) italic_μ ( roman_d italic_x ) roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) = divide start_ARG 1 end_ARG start_ARG italic_Z end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG italic_μ ( sansserif_A ∩ italic_L ( italic_t ) ) italic_μ ( sansserif_B ∩ italic_L ( italic_t ) ) end_ARG start_ARG italic_μ ( italic_L ( italic_t ) ) end_ARG roman_Leb start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( roman_d italic_t ) .

This expression is symmetric in 𝖠𝖠\mathsf{A}sansserif_A and 𝖡𝖡\mathsf{B}sansserif_B, such that

𝖡H(x,𝖠)π(dx)=𝖠H(x,𝖡)π(dx).subscript𝖡𝐻𝑥𝖠𝜋d𝑥subscript𝖠𝐻𝑥𝖡𝜋d𝑥\displaystyle\int_{\mathsf{B}}H(x,\mathsf{A})\ \pi({\rm d}x)=\int_{\mathsf{A}}% H(x,\mathsf{B})\ \pi({\rm d}x).∫ start_POSTSUBSCRIPT sansserif_B end_POSTSUBSCRIPT italic_H ( italic_x , sansserif_A ) italic_π ( roman_d italic_x ) = ∫ start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT italic_H ( italic_x , sansserif_B ) italic_π ( roman_d italic_x ) .

Appendix C Manifolds

We do not give a complete introduction to differential geometry in this section. The aim is rather to give a better understanding on the key objects used by the geodesic slice sampler and provide references for what is outside the scope of this paper.

C.1 Tangent space and Riemannian metric

We revise some selected basic concepts on manifolds. For a more thorough introduction to these objects see [Boothby, Sections I, III, IV, V]. Let 𝖬𝖬\mathsf{M}sansserif_M be a d𝑑ditalic_d-dimensional, smooth manifold that is connected as a set. The most defining property of such an object is the following: For all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M there exists an open set 𝖴𝖬𝖴𝖬\mathsf{U}\subseteq\mathsf{M}sansserif_U ⊆ sansserif_M containing x𝑥xitalic_x and an open set 𝖴dsuperscript𝖴superscript𝑑\mathsf{U}^{\prime}\subseteq\mathbb{R}^{d}sansserif_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT such that there is a homeomorphism φ:𝖴𝖴:𝜑𝖴superscript𝖴\varphi:\mathsf{U}\to\mathsf{U}^{\prime}italic_φ : sansserif_U → sansserif_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. We call the tuple (𝖴,φ)𝖴𝜑(\mathsf{U},\varphi)( sansserif_U , italic_φ ) a coordinate neighborhood. Since 𝖬𝖬\mathsf{M}sansserif_M is smooth, it is equipped with an atlas {(𝖴i,φi)}iIsubscriptsubscript𝖴𝑖subscript𝜑𝑖𝑖𝐼\{(\mathsf{U}_{i},\varphi_{i})\}_{i\in I}{ ( sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) } start_POSTSUBSCRIPT italic_i ∈ italic_I end_POSTSUBSCRIPT that contains coordinate neighborhoods for all points of 𝖬𝖬\mathsf{M}sansserif_M, such that for all i,jI𝑖𝑗𝐼i,j\in Iitalic_i , italic_j ∈ italic_I the map φiφj1:φj(𝖴j𝖴i)φi(𝖴j𝖴i):subscript𝜑𝑖superscriptsubscript𝜑𝑗1subscript𝜑𝑗subscript𝖴𝑗subscript𝖴𝑖subscript𝜑𝑖subscript𝖴𝑗subscript𝖴𝑖\varphi_{i}\circ\varphi_{j}^{-1}:\varphi_{j}\left(\mathsf{U}_{j}\cap\mathsf{U}% _{i}\right)\to\varphi_{i}\left(\mathsf{U}_{j}\cap\mathsf{U}_{i}\right)italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∘ italic_φ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT : italic_φ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( sansserif_U start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∩ sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) → italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( sansserif_U start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∩ sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), which is a map from a subset of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT to a subset of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, is a diffeomorphism. At all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M we can define the tangent space Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M to 𝖬𝖬\mathsf{M}sansserif_M at x𝑥xitalic_x, which is the set of all linear mappings from the space of germs at x𝑥xitalic_x to \mathbb{R}blackboard_R satisfying the Leibniz rule. Observe that each coordinate neighborhood (𝖴,φ)𝖴𝜑(\mathsf{U},\varphi)( sansserif_U , italic_φ ) of 𝖬𝖬\mathsf{M}sansserif_M induces d𝑑ditalic_d vector fields, the coordinate frames. Essentially a vector field E𝐸Eitalic_E associates to each point x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M an element ExTx𝖬subscript𝐸𝑥subscript𝑇𝑥𝖬E_{x}\in T_{x}\mathsf{M}italic_E start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M. The coordinate frames form a basis of the tangent spaces Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M, turning it into a d𝑑ditalic_d-dimensional vector space, at each x𝖴𝑥𝖴x\in\mathsf{U}italic_x ∈ sansserif_U. We denote the coordinate frames induced by (𝖴,φ)𝖴𝜑(\mathsf{U},\varphi)( sansserif_U , italic_φ ) as

E1φ,,Edφ.superscriptsubscript𝐸1𝜑superscriptsubscript𝐸𝑑𝜑E_{1}^{\varphi},\ldots,E_{d}^{\varphi}.italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ end_POSTSUPERSCRIPT , … , italic_E start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ end_POSTSUPERSCRIPT .

If there exists a smooth field 𝔤𝔤\mathfrak{g}fraktur_g of symmetric, positive definite bilinear forms, the manifold 𝖬𝖬\mathsf{M}sansserif_M is called Riemannian and 𝔤𝔤\mathfrak{g}fraktur_g is called a Riemannian metric on 𝖬𝖬\mathsf{M}sansserif_M. Essentially this means that 𝔤𝔤\mathfrak{g}fraktur_g associates to each x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M a symmetric, positive definite bilinear form 𝔤x:Tx𝖬×Tx𝖬:subscript𝔤𝑥subscript𝑇𝑥𝖬subscript𝑇𝑥𝖬\mathfrak{g}_{x}:T_{x}\mathsf{M}\times T_{x}\mathsf{M}\to\mathbb{R}fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT : italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M × italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M → blackboard_R, which turns Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M into an inner product space.

C.2 Geodesics

We call a map from an interval in \mathbb{R}blackboard_R to the manifold 𝖬𝖬\mathsf{M}sansserif_M a curve. The Riemannian structure on 𝖬𝖬\mathsf{M}sansserif_M induces a special class of curves on 𝖬𝖬\mathsf{M}sansserif_M, namely the geodesics. Intuitively, a geodesic can be thought of as a curve of constant velocity. For a formal definition of geodesics consult [Boothby, Section VII.5]. We say the manifold 𝖬𝖬\mathsf{M}sansserif_M is geodesically complete if all geodesics can be extended such that their domain is \mathbb{R}blackboard_R. In this case, by virtue of [Lee, Corollary 4.28], for all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M and all vTx𝖬𝑣subscript𝑇𝑥𝖬v\in T_{x}\mathsf{M}italic_v ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M there exists a unique geodesic

γ(x,v):𝖬:subscript𝛾𝑥𝑣𝖬\gamma_{(x,v)}:\mathbb{R}\to\mathsf{M}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT : blackboard_R → sansserif_M (32)

satisfying γ(x,v)(0)=xsubscript𝛾𝑥𝑣0𝑥\gamma_{(x,v)}(0)=xitalic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( 0 ) = italic_x and dγ(x,v)dt|0=vevaluated-atdsubscript𝛾𝑥𝑣d𝑡0𝑣\frac{{\rm d}\gamma_{(x,v)}}{{\rm d}t}|_{0}=vdivide start_ARG roman_d italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT end_ARG start_ARG roman_d italic_t end_ARG | start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_v for the velocity vector field at zero. Intuitively, γ(x,v)subscript𝛾𝑥𝑣\gamma_{(x,v)}italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT can be thought of as the geodesic through x𝑥xitalic_x in direction v𝑣vitalic_v. It can also be written in terms of the exponential map ExpExp\mathrm{Exp}roman_Exp (see [Boothby, Section VII.6]) as

γ(x,v)(θ)=Expx(θv),x𝖬,vTx𝖬,θ.formulae-sequencesubscript𝛾𝑥𝑣𝜃subscriptExp𝑥𝜃𝑣formulae-sequence𝑥𝖬formulae-sequence𝑣subscript𝑇𝑥𝖬𝜃\gamma_{(x,v)}(\theta)=\mathrm{Exp}_{x}(\theta v),\qquad x\in\mathsf{M},v\in T% _{x}\mathsf{M},\theta\in\mathbb{R}.italic_γ start_POSTSUBSCRIPT ( italic_x , italic_v ) end_POSTSUBSCRIPT ( italic_θ ) = roman_Exp start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_θ italic_v ) , italic_x ∈ sansserif_M , italic_v ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M , italic_θ ∈ blackboard_R .

By the Hopf-Rinow Theorem (see e.g [Lee, Theorem 6.19]) geodesic completeness is equivalent to metric completeness.

C.3 The Riemannian measure

We now introduce a measure on 𝖬𝖬\mathsf{M}sansserif_M which can be viewed as an extension of the Lebesgue measure to Riemannian manifolds, see also [Sakai, Section II.5]. The topology on 𝖬𝖬\mathsf{M}sansserif_M induces as a Borel-σ𝜎\sigmaitalic_σ-algebra, which we denote as (𝖬)𝖬\mathcal{B}(\mathsf{M})caligraphic_B ( sansserif_M ). We need the following family of functions: Given a coordinate neighborhood (𝖴,φ)𝖴𝜑(\mathsf{U},\varphi)( sansserif_U , italic_φ ), we use the Gram matrix of the coordinate frames E1,xφ,,Ed,xφsuperscriptsubscript𝐸1𝑥𝜑superscriptsubscript𝐸𝑑𝑥𝜑E_{1,x}^{\varphi},\ldots,E_{d,x}^{\varphi}italic_E start_POSTSUBSCRIPT 1 , italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ end_POSTSUPERSCRIPT , … , italic_E start_POSTSUBSCRIPT italic_d , italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ end_POSTSUPERSCRIPT evaluated at x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M to introduce the function

det(g,φ):𝖴:𝑔𝜑𝖴\displaystyle\sqrt{\det(g,\varphi)}:\mathsf{U}square-root start_ARG roman_det ( italic_g , italic_φ ) end_ARG : sansserif_U [0,)absent0\displaystyle\to[0,\infty)→ [ 0 , ∞ )
x𝑥\displaystyle xitalic_x det[(𝔤x(Ej,xφ,Ek,xφ)){1j,kd}].maps-toabsentdelimited-[]subscriptsubscript𝔤𝑥superscriptsubscript𝐸𝑗𝑥𝜑superscriptsubscript𝐸𝑘𝑥𝜑formulae-sequence1𝑗𝑘𝑑\displaystyle\mapsto\sqrt{\det\left[\left(\mathfrak{g}_{x}(E_{j,x}^{\varphi},E% _{k,x}^{\varphi})\right)_{\{1\leqslant j,k\leqslant d\}}\right]}.↦ square-root start_ARG roman_det [ ( fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_E start_POSTSUBSCRIPT italic_j , italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ end_POSTSUPERSCRIPT , italic_E start_POSTSUBSCRIPT italic_k , italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_φ end_POSTSUPERSCRIPT ) ) start_POSTSUBSCRIPT { 1 ⩽ italic_j , italic_k ⩽ italic_d } end_POSTSUBSCRIPT ] end_ARG .

Moreover, as 𝖬𝖬\mathsf{M}sansserif_M is by definition second countable, there exists a countable collection {(𝖴i,φi)}isubscriptsubscript𝖴𝑖subscript𝜑𝑖𝑖\{(\mathsf{U}_{i},\varphi_{i})\}_{i\in\mathbb{N}}{ ( sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) } start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT of coordinate neighborhoods such that i𝖴i=𝖬subscript𝑖subscript𝖴𝑖𝖬\bigcup_{i\in\mathbb{N}}\mathsf{U}_{i}=\mathsf{M}⋃ start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = sansserif_M.121212A space where every cover of open sets contains a countable subcover is called Lindelöf space. Second countable spaces are Lindelöf. Note that 𝖬𝖬\mathsf{M}sansserif_M admits a partition of unity {ρi}isubscriptsubscript𝜌𝑖𝑖\{\rho_{i}\}_{i\in\mathbb{N}}{ italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT subordinate to {𝖴i}isubscriptsubscript𝖴𝑖𝑖\{\mathsf{U}_{i}\}_{i\in\mathbb{N}}{ sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT, see [Sakai, Section I.2.1], i.e.,

  • ρi:𝖬[0,):subscript𝜌𝑖𝖬0\rho_{i}:\mathsf{M}\to[0,\infty)italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : sansserif_M → [ 0 , ∞ ) is a Csuperscript𝐶C^{\infty}italic_C start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT-function131313A function f:𝖬:𝑓𝖬f:\mathsf{M}\to\mathbb{R}italic_f : sansserif_M → blackboard_R is called a Csuperscript𝐶C^{\infty}italic_C start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT-function if for all coordinate neighborhoods (𝖴,φ)𝖴𝜑(\mathsf{U},\varphi)( sansserif_U , italic_φ ) the function fφ1:φ(𝖴):𝑓superscript𝜑1𝜑𝖴f\circ\varphi^{-1}:\varphi(\mathsf{U})\to\mathbb{R}italic_f ∘ italic_φ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT : italic_φ ( sansserif_U ) → blackboard_R is a smooth function. with support suppρi𝖴isuppsubscript𝜌𝑖subscript𝖴𝑖\mathrm{supp}\,\rho_{i}\subseteq\mathsf{U}_{i}roman_supp italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all i𝑖i\in\mathbb{N}italic_i ∈ blackboard_N,

  • {suppρi}isubscriptsuppsubscript𝜌𝑖𝑖\{\mathrm{supp}\,\rho_{i}\}_{i\in\mathbb{N}}{ roman_supp italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ blackboard_N end_POSTSUBSCRIPT is locally finite141414For all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M there exists a neighborhood 𝖴𝖴\mathsf{U}sansserif_U of x𝑥xitalic_x such that 𝖴suppρi𝖴suppsubscript𝜌𝑖\mathsf{U}\cap\mathrm{supp}\,\rho_{i}\neq\emptysetsansserif_U ∩ roman_supp italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≠ ∅ holds only for finitely many i𝑖i\in\mathbb{N}italic_i ∈ blackboard_N. ,

  • i=1ρi(x)=1superscriptsubscript𝑖1subscript𝜌𝑖𝑥1\sum_{i=1}^{\infty}\rho_{i}(x)=1∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) = 1 for all x𝖬𝑥𝖬x\in\mathsf{M}italic_x ∈ sansserif_M.

Then the Riemannian measure induced by the Riemannian metric 𝔤𝔤\mathfrak{g}fraktur_g

ν𝔤(𝖠):=i=1φi(𝖴i)(ρi𝟙𝖠det(g,φi))φi1(z)Lebd(dz),𝖠(𝖬),formulae-sequenceassignsubscript𝜈𝔤𝖠superscriptsubscript𝑖1subscriptsubscript𝜑𝑖subscript𝖴𝑖subscript𝜌𝑖subscript1𝖠𝑔subscript𝜑𝑖superscriptsubscript𝜑𝑖1𝑧subscriptLeb𝑑d𝑧𝖠𝖬\nu_{\mathfrak{g}}(\mathsf{A}):=\sum_{i=1}^{\infty}\int_{\varphi_{i}(\mathsf{U% }_{i})}\left(\rho_{i}\cdot\mathbbm{1}_{\mathsf{A}}\cdot\sqrt{\det(g,\varphi_{i% })}\right)\circ\varphi_{i}^{-1}(z)\ \mathrm{Leb}_{d}({\rm d}z),\qquad\mathsf{A% }\in\mathcal{B}(\mathsf{M}),italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( sansserif_A ) := ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ⋅ square-root start_ARG roman_det ( italic_g , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) ∘ italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_z ) roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_z ) , sansserif_A ∈ caligraphic_B ( sansserif_M ) , (33)

defines a measure on the measure space (M,(𝖬))𝑀𝖬(M,\mathcal{B}(\mathsf{M}))( italic_M , caligraphic_B ( sansserif_M ) ). We provide some brief arguments that the Riemannian measure is well-defined and indeed a measure. Observe that 𝟙𝖠φi1subscript1𝖠superscriptsubscript𝜑𝑖1\mathbbm{1}_{\mathsf{A}}\circ\varphi_{i}^{-1}blackboard_1 start_POSTSUBSCRIPT sansserif_A end_POSTSUBSCRIPT ∘ italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT is Borel measurable, and ρiφi1subscript𝜌𝑖superscriptsubscript𝜑𝑖1\rho_{i}\circ\varphi_{i}^{-1}italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∘ italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and det(g,φi)φi1𝑔subscript𝜑𝑖superscriptsubscript𝜑𝑖1\sqrt{\det(g,\varphi_{i})}\circ\varphi_{i}^{-1}square-root start_ARG roman_det ( italic_g , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ∘ italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT are smooth for all A(𝖬)𝐴𝖬A\in\mathcal{B}(\mathsf{M})italic_A ∈ caligraphic_B ( sansserif_M ) and all i𝑖i\in\mathbb{N}italic_i ∈ blackboard_N. Hence the appearing Lebesgue integrals are defined. For independence of the construction from the choice of open covering and partition of unity see [Sakai, page 62]. The σ𝜎\sigmaitalic_σ-additivity of ν𝔤subscript𝜈𝔤\nu_{\mathfrak{g}}italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT is inherited from the σ𝜎\sigmaitalic_σ-additivity of the Lebesgue integral. Applying standard extension arguments using the additivity of the Lebesgue integral and monotone convergence theorem, we can extend (33) to measurable functions f:𝖬[0,):𝑓𝖬0f:\mathsf{M}\to[0,\infty)italic_f : sansserif_M → [ 0 , ∞ ) yielding

𝖬f(x)ν𝔤(dx)=i=1φi(𝖴i)(ρifdet(g,φi))φi1(z)Lebd(dz).subscript𝖬𝑓𝑥subscript𝜈𝔤d𝑥superscriptsubscript𝑖1subscriptsubscript𝜑𝑖subscript𝖴𝑖subscript𝜌𝑖𝑓𝑔subscript𝜑𝑖superscriptsubscript𝜑𝑖1𝑧subscriptLeb𝑑d𝑧\int_{\mathsf{M}}f(x)\ \nu_{\mathfrak{g}}({\rm d}x)=\sum_{i=1}^{\infty}\int_{% \varphi_{i}(\mathsf{U}_{i})}\left(\rho_{i}\cdot f\cdot\sqrt{\det(g,\varphi_{i}% )}\right)\circ\varphi_{i}^{-1}(z)\ \mathrm{Leb}_{d}({\rm d}z).∫ start_POSTSUBSCRIPT sansserif_M end_POSTSUBSCRIPT italic_f ( italic_x ) italic_ν start_POSTSUBSCRIPT fraktur_g end_POSTSUBSCRIPT ( roman_d italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( sansserif_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ( italic_ρ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_f ⋅ square-root start_ARG roman_det ( italic_g , italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ) ∘ italic_φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_z ) roman_Leb start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( roman_d italic_z ) .

C.4 The uniform distribution on the unit tangent spheres

Throughout this section we fix xM𝑥𝑀x\in Mitalic_x ∈ italic_M. We denote by

𝕊xd1:={vTx𝖬𝔤x(v,v)=1}assignsuperscriptsubscript𝕊𝑥𝑑1conditional-set𝑣subscript𝑇𝑥𝖬subscript𝔤𝑥𝑣𝑣1\mathbb{S}_{x}^{d-1}:=\{v\in T_{x}\mathsf{M}\mid\mathfrak{g}_{x}(v,v)=1\}blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT := { italic_v ∈ italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M ∣ fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_v , italic_v ) = 1 }

the unit tangent sphere to 𝖬𝖬\mathsf{M}sansserif_M at x𝑥xitalic_x. It is immersed into Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M via the identity map Id:𝕊xd1Tx𝖬:Idsuperscriptsubscript𝕊𝑥𝑑1subscript𝑇𝑥𝖬\mathrm{Id}:\mathbb{S}_{x}^{d-1}\to T_{x}\mathsf{M}roman_Id : blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT → italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M. For all v𝕊xd1𝑣superscriptsubscript𝕊𝑥𝑑1v\in\mathbb{S}_{x}^{d-1}italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT, this induces a map Id:Tv𝕊xd1TvTx𝖬:subscriptIdsubscript𝑇𝑣superscriptsubscript𝕊𝑥𝑑1subscript𝑇𝑣subscript𝑇𝑥𝖬\mathrm{Id}_{*}:T_{v}\mathbb{S}_{x}^{d-1}\to T_{v}T_{x}\mathsf{M}roman_Id start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT : italic_T start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT → italic_T start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M on the tangent spaces, see [Boothby, Theorem IV.1.2]. As (Tx𝖬,𝔤x)subscript𝑇𝑥𝖬subscript𝔤𝑥(T_{x}\mathsf{M},\mathfrak{g}_{x})( italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M , fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) is a d𝑑ditalic_d-dimensional inner product space, the tangent space TvTx𝖬subscript𝑇𝑣subscript𝑇𝑥𝖬T_{v}T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M to Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M at vTvM𝑣subscript𝑇𝑣𝑀v\in T_{v}Mitalic_v ∈ italic_T start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT italic_M is again Tx𝖬subscript𝑇𝑥𝖬T_{x}\mathsf{M}italic_T start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT sansserif_M, see [Boothby, Section II.3] for more details on this construction. Because (𝔤x)vsubscriptsubscript𝔤𝑥𝑣(\mathfrak{g}_{x})_{v}( fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT then becomes 𝔤xsubscript𝔤𝑥\mathfrak{g}_{x}fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT, we can exploit this to define the Riemannian metric 𝔤^xsubscript^𝔤𝑥\widehat{\mathfrak{g}}_{x}over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT on 𝕊xd1superscriptsubscript𝕊𝑥𝑑1\mathbb{S}_{x}^{d-1}blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT given by

(𝔤^x)v(ξ1,ξ2):=𝔤x(Id(ξ1),Id(ξ2)),v𝕊xd1,ξ1,ξ2Tv𝕊xd1,formulae-sequenceassignsubscriptsubscript^𝔤𝑥𝑣subscript𝜉1subscript𝜉2subscript𝔤𝑥subscriptIdsubscript𝜉1subscriptIdsubscript𝜉2formulae-sequence𝑣superscriptsubscript𝕊𝑥𝑑1subscript𝜉1subscript𝜉2subscript𝑇𝑣superscriptsubscript𝕊𝑥𝑑1\left(\widehat{\mathfrak{g}}_{x}\right)_{v}(\xi_{1},\xi_{2}):=\mathfrak{g}_{x}% \big{(}\mathrm{Id}_{*}(\xi_{1}),\mathrm{Id}_{*}(\xi_{2})\big{)},\qquad v\in% \mathbb{S}_{x}^{d-1},\xi_{1},\xi_{2}\in T_{v}\mathbb{S}_{x}^{d-1},( over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) := fraktur_g start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( roman_Id start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , roman_Id start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ) , italic_v ∈ blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ italic_T start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT ,

see [Boothby, Corollary V.2.5]. As described in Appendix C.3, 𝔤^xsubscript^𝔤𝑥\widehat{\mathfrak{g}}_{x}over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT induces the (Riemannian) measure ν𝔤^xsubscript𝜈subscript^𝔤𝑥\nu_{\widehat{\mathfrak{g}}_{x}}italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT on 𝕊xd1superscriptsubscript𝕊𝑥𝑑1\mathbb{S}_{x}^{d-1}blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT. Since 𝕊xd1superscriptsubscript𝕊𝑥𝑑1\mathbb{S}_{x}^{d-1}blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT is compact, ν𝔤^x(𝕊xd1)subscript𝜈subscript^𝔤𝑥superscriptsubscript𝕊𝑥𝑑1\nu_{\widehat{\mathfrak{g}}_{x}}\left(\mathbb{S}_{x}^{d-1}\right)italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( blackboard_S start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT ) is finite, and we may define the probability measure

σd1(x):=1ν𝔤^x(𝕊xd1)ν𝔤^x.assignsuperscriptsubscript𝜎𝑑1𝑥1subscript𝜈subscript^𝔤𝑥subscriptsuperscript𝕊𝑑1𝑥subscript𝜈subscript^𝔤𝑥\sigma_{d-1}^{(x)}:=\frac{1}{\nu_{\widehat{\mathfrak{g}}_{x}}(\mathbb{S}^{d-1}% _{x})}\nu_{\widehat{\mathfrak{g}}_{x}}.italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT := divide start_ARG 1 end_ARG start_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) end_ARG italic_ν start_POSTSUBSCRIPT over^ start_ARG fraktur_g end_ARG start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

Note that, up to a push forward under an isometric isomorphism, σd1(x)superscriptsubscript𝜎𝑑1𝑥\sigma_{d-1}^{(x)}italic_σ start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_x ) end_POSTSUPERSCRIPT is the uniform distribution on the Euclidean unit sphere 𝕊d1superscript𝕊𝑑1\mathbb{S}^{d-1}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT.

Appendix D Experiments appendix

D.1 A practical case: Understanding the variability in graph data sets.

Table 8: Parameters used in experiments. Unif(n,k\operatorname{Unif}(n,kroman_Unif ( italic_n , italic_k) denotes the uniform sampling on 𝒱(n,k)𝒱𝑛𝑘\mathcal{V}(n,k)caligraphic_V ( italic_n , italic_k ) and \cdot the colomn wise multiplication.
Parameter σϵsubscript𝜎italic-ϵ\sigma_{\epsilon}italic_σ start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT σκsubscript𝜎𝜅\sigma_{\kappa}italic_σ start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT μ𝜇\muitalic_μ F𝐹Fitalic_F
Synthetic data estimation (n,k,λ)𝑛𝑘𝜆(n,k,\lambda)( italic_n , italic_k , italic_λ ) 0.1 2 [10,2,]102[10,2,\ldots][ 10 , 2 , … ] [λ,1,]Unif(n,k)𝜆1Unif𝑛𝑘[\lambda,1,\ldots]\cdot\operatorname{Unif}(n,k)[ italic_λ , 1 , … ] ⋅ roman_Unif ( italic_n , italic_k )
Missing link imputation 0.3 2 [20,10,5,2,10]20105210[20,10,5,2,-10][ 20 , 10 , 5 , 2 , - 10 ] [60,20,20,20,5]Unif(20,5)absentUnif205\cdot\operatorname{Unif}(20,5)⋅ roman_Unif ( 20 , 5 )
\printbibliography