Probabilistic Reachability Analysis of Stochastic Control Systems

Saber Jafarpour1∗, Zishun Liu2∗ and Yongxin Chen2 The first two authors contribute equally to this work1 Saber Jafarpour is with University of Colorado Boulder, Boulder, CO 80309 [email protected]2Zishun Liu and Yongxin Chen are with Georgia Institute of Technology, Atlanta, GA 30332 {zliu910}{yongchen}@gatech.edu
Abstract

We address the reachability problem for continuous-time stochastic dynamic systems. Our objective is to present a unified framework that characterizes the reachable set of a dynamic system in the presence of both stochastic disturbances and deterministic inputs. To achieve this, we devise a strategy that effectively decouples the effects of deterministic inputs and stochastic disturbances on the reachable sets of the system. For the deterministic part, many existing methods can capture the deterministic reachability. As for the stochastic disturbances, we introduce a novel technique that probabilistically bounds the difference between a stochastic trajectory and its deterministic counterpart. The key to our approach is introducing a novel energy function termed the Averaged Moment Generating Function that yields a high probability bound for this difference. This bound is tight and exact for linear stochastic dynamics and applicable to a large class of nonlinear stochastic dynamics. By combining our innovative technique with existing methods for deterministic reachability analysis, we can compute estimations of reachable sets that surpass those obtained with current approaches for stochastic reachability analysis. We validate the effectiveness of our framework through various numerical experiments. Beyond its immediate applications in reachability analysis, our methodology is poised to have profound implications in the broader analysis and control of stochastic systems. It opens avenues for enhanced understanding and manipulation of complex stochastic dynamics, presenting opportunities for advancements in related fields.

Index Terms:
Reachability analysis, stochastic dynamic systems, stochastic control

I Introduction

Reachability analysis is an important topic in systems and control theory that focuses on analyzing whether the trajectory of a system will reach a certain set within a time horizon starting from a given set of initial conditions and possibly subject to inputs or disturbances. It is essential in many applications including autonomous vehicles, aerospace systems, robotics, etc. For instance, in safety-critical applications where the system should be kept outside an unsafe region of the state space, reachability analysis is a key machinery to verify and design the control input to avoid the unsafe region.

Reachability analysis of dynamical systems is a fundamentally challenging task. For general dynamical systems, obtaining exact or close approximations of their reachable sets is only possible when the state dimension is low and generally demands substantial computational resources. However, there is a rapidly growing need for fast reachability analysis methods in various control applications. This motivates the need for rigorous methods that can efficiently upper bound the reachable sets of dynamical systems.

For deterministic systems with bounded inputs or disturbances, many methods have been proposed to over-approximate the reachable set. Several representative methods include Hamilton-Jacobi reachability that poses reachability as a game between two players [1, 2], contraction-based reachability that estimates the propagation of reachable sets using contraction rate of the system [3, 4], and Interval-based reachability that over-approximates reachable sets by leveraging techniques from interval analysis and monotone system theory [5, 6, 7, 8]. Other methods such as simulation-based reachability [9, 10] are also popular in a wide range of studies.

In this work, we are interested in reachability analysis for stochastic systems. In many real-world applications, systems are subject to unbounded and stochastic disturbances and are better modeled by stochastic dynamics. Despite the efficiency of the aforementioned deterministic reachability methods in the presence of bounded disturbances, they cannot be applied directly to systems subject to unbounded stochastic disturbances. For systems with stochastic disturbances, considering all possible disturbance scenarios will often result in unbounded reachable sets due to the unboundedness of stochastic noise. Moreover, this approach also ignores the statistical properties of the noise, leading to overly conservative results [11]. To better capture the effects of stochastic disturbances, reachability analysis in stochastic systems focuses on the probabilistic reachable set, which refers to the set that any possible trajectory starting from an initial set can reach with high probability (e.g., 99.9%).

There have been several attempts to approximate probabilistic reachable sets of stochastic systems and they can be divided into two categories. The first category is the optimization-based approaches that use Hamilton-Jacobi equations and dynamic programming [12, 13, 14, 15]. However, these approaches are usually computational heavy, rendering them impractical for large-scale systems. The second category is simulation-based approaches which provide guarantees for reachability using trajectory samples [16, 17, 18]. One drawback of these methods is that the amount of samples needed to obtain reasonable bounds on reachable sets can grow exponentially. Another tangentially related line of research is on the stochastic Lyapunov function [19] or barrier function [20, 21, 22, 23] for measuring the probability of a trajectory staying in a safe set. In these works, the goal is not to find the probabilistic reachable set but to verify whether a given safe set is in the probabilistic reachable set.

In this work, we establish a unified framework for computing the probabilistic reachable sets of nonlinear stochastic systems subject to both deterministic inputs and stochastic disturbances. Our method is both theoretically optimal and effective in practice. Theoretically, under standard assumptions, our method yields tight approximations of probabilistic reachable sets that cannot be improved further without additional assumptions. Implementation-wise, our approximations of probabilistic reachable sets can be computed efficiently and are scalable to high-dimensional systems.

Our framework is built upon a novel separation strategy, which decouples the effects of deterministic inputs and stochastic uncertainty on reachability analysis of the stochastic system (Proposition 1). The effects of stochastic uncertainty on the probabilistic reachable set can be represented using stochastic deviation, which refers to the distance between a stochastic trajectory and its associated deterministic trajectory. By developing a novel energy function termed the Averaged Moment Generating Function (AMGF), we provide a probabilistic bound on the stochastic deviation of general stochastic continuous trajectories (Theorem 1). Our bound has a dependence 𝒪(log(1/δ))𝒪1𝛿\mathcal{O}(\sqrt{\log(1/\delta)})caligraphic_O ( square-root start_ARG roman_log ( 1 / italic_δ ) end_ARG ) on the probability level 1δ1𝛿1-\delta1 - italic_δ, significantly better than existing techniques which result in a bound of the order 𝒪(1/δ)𝒪1𝛿\mathcal{O}(\sqrt{1/\delta})caligraphic_O ( square-root start_ARG 1 / italic_δ end_ARG ). Moreover, our bound coincides with that for linear stochastic systems under the same assumptions and cannot be improved further. The effects of deterministic input on the probabilistic reachable set can be captured using deterministic reachable sets of the associated deterministic system, i.e., the system obtained by removing the stochastic noise.

Consequently, our separation strategy enables a decomposition of the probabilistic reachable set into a deterministic reachable set capturing the deterministic input and a tight robustness buffer around it against the stochastic uncertainty (Theorem 2). As such, analyzing the reachability of the associated deterministic system is all we need to obtain a good probabilistic reachable set. This is a paradigm shift and brings tremendous flexibility to the reachability analysis of stochastic systems as any deterministic reachability framework can be incorporated. In particular, we combine our framework with two computationally efficient deterministic reachability approaches namely contraction-based reachability and interval-based reachability to obtain probabilistic reachable sets for stochastic systems.

Finally, our tight probabilistic bound of stochastic deviation is poised to have profound implications in the broader analysis and control of stochastic systems beyond its immediate applications in reachability analysis. To the best of our knowledge, this bound is the first non-conservative result that can quantitatively describe the behavior of a nonlinear stochastic system under standard assumptions. The bound is of independent interests and can potentially impact many other areas such as estimation, uncertainty quantification, finance, machine learning, statistics, etc. It opens avenues for enhanced understanding and manipulation of complex stochastic dynamics, presenting opportunities for advancements in related fields.

The rest of the paper is organized as follows. In Section II we briefly review reachability analysis for deterministic systems. In Section III we introduce and formulate the probabilistic reachability problem and present our overall strategy. The discussion of an existing method is given in Section IV. Section V contains the main technical contribution of this paper where we introduce a novel energy function termed the Averaged Moment Generating Function to bound the deviation of stochastic trajectories from their deterministic counterpart with high-probability. This high-probability bound of stochastic deviation is combined with deterministic reachability analysis in Section VI to approximate the probabilistic reachable set of stochastic systems. This is followed by case studies in Section VII and numerical experiments in Section VIII.

II Preliminaries

In this section, we briefly review reachability analysis for deterministic dynamics and related concepts.

II-A Notations

Vectors and matrices. Given a vector xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, xnorm𝑥\|x\|∥ italic_x ∥ denotes its Euclidean norm (2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm) and xP=x𝖳Pxsubscriptnorm𝑥𝑃superscript𝑥𝖳𝑃𝑥\|x\|_{P}=\sqrt{x^{\mathsf{T}}Px}∥ italic_x ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT = square-root start_ARG italic_x start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_P italic_x end_ARG with some positive definite matrix P𝑃Pitalic_P. Given a matrix Am×n𝐴superscript𝑚𝑛A\in\mathbb{R}^{m\times n}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT, Anorm𝐴\|A\|∥ italic_A ∥ denotes its spectral norm and APsubscriptnorm𝐴𝑃\|A\|_{P}∥ italic_A ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT denotes its weighted spectral norm with respect to some positive definite matrix P𝑃Pitalic_P. For two matrices A,Bn×n𝐴𝐵superscript𝑛𝑛A,B\in\mathbb{R}^{n\times n}italic_A , italic_B ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT, ABprecedes-or-equals𝐴𝐵A\preceq Bitalic_A ⪯ italic_B if BA𝐵𝐴B-Aitalic_B - italic_A is positive semi-definite. If An×n𝐴superscript𝑛𝑛A\in\mathbb{R}^{n\times n}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT is a positive definite matrix, we denote its square root by A12superscript𝐴12A^{\frac{1}{2}}italic_A start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT, i.e., A12superscript𝐴12A^{\frac{1}{2}}italic_A start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT is the unique matrix such that A12(A12)𝖳=(A12)𝖳A12=Asuperscript𝐴12superscriptsuperscript𝐴12𝖳superscriptsuperscript𝐴12𝖳superscript𝐴12𝐴A^{\frac{1}{2}}(A^{\frac{1}{2}})^{\mathsf{T}}=(A^{\frac{1}{2}})^{\mathsf{T}}A^% {\frac{1}{2}}=Aitalic_A start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( italic_A start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT = ( italic_A start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT = italic_A. Besides, we use ,\langle\cdot,\cdot\rangle⟨ ⋅ , ⋅ ⟩ to denote standard inner product, 00 to denote all-zero vectors and matrices, and Insubscript𝐼𝑛I_{n}italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT to denote n𝑛nitalic_n-dimensional identity matrix.

Set and Functions. We use n(r,y)superscript𝑛𝑟𝑦\mathcal{B}^{n}\left(r,y\right)caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r , italic_y ) to denote the ball {xn:xyr}conditional-set𝑥superscript𝑛norm𝑥𝑦𝑟\{x\in\mathbb{R}^{n}:\|x-y\|\leq r\}{ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : ∥ italic_x - italic_y ∥ ≤ italic_r } and 𝒮n1superscript𝒮𝑛1\mathcal{S}^{n-1}caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT to denote the unit sphere {xn:x=1}conditional-set𝑥superscript𝑛norm𝑥1\{x\in\mathbb{R}^{n}:\|x\|=1\}{ italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : ∥ italic_x ∥ = 1 }. For two sets A,B𝐴𝐵A,Bitalic_A , italic_B, their Minkowski sum is defined as AB={x+y:xA,yB}direct-sum𝐴𝐵conditional-set𝑥𝑦formulae-sequence𝑥𝐴𝑦𝐵A\oplus B=\{x+y:x\in A,~{}y\in B\}italic_A ⊕ italic_B = { italic_x + italic_y : italic_x ∈ italic_A , italic_y ∈ italic_B }. Given a set 𝒳n\mathcal{X}\subseteq{}^{n}caligraphic_X ⊆ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT and a matrix Tn×nT\in{}^{n\times n}italic_T ∈ start_FLOATSUPERSCRIPT italic_n × italic_n end_FLOATSUPERSCRIPT, we define T𝒳={Tx|x𝒳}𝑇𝒳conditional-set𝑇𝑥𝑥𝒳T\mathcal{X}=\{Tx\;|\;x\in\mathcal{X}\}italic_T caligraphic_X = { italic_T italic_x | italic_x ∈ caligraphic_X }. Given a continuously differentiable vector-valued function f:nmf:{}^{n}\to{}^{m}italic_f : start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT → start_FLOATSUPERSCRIPT italic_m end_FLOATSUPERSCRIPT, we denote the Jacobian of f𝑓fitalic_f at x𝑥xitalic_x by Dxf(x)subscript𝐷𝑥𝑓𝑥D_{x}f(x)italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x ). For a twice-differentiable scalar-valued function f:nf:{}^{n}\to\mathbb{R}italic_f : start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT → blackboard_R, its gradient at x𝑥xitalic_x is f(x)𝑓𝑥\nabla f(x)∇ italic_f ( italic_x ) and the Hessian matrix is denoted as 2f(x)superscript2𝑓𝑥\nabla^{2}f(x)∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f ( italic_x ).

Throughout the paper, we use 𝔼𝔼\mathbb{E}blackboard_E to denote expectation and \mathbb{P}blackboard_P to denote probability. For a set S𝑆Sitalic_S, XSsimilar-to𝑋𝑆X\sim Sitalic_X ∼ italic_S means X𝑋Xitalic_X is a random variable drawn uniformly from S𝑆Sitalic_S.

II-B Reachable Set of Deterministic Dynamics

Computing the reachable sets is a fundamental problem in dynamical systems and control theory. Consider the continuous-time deterministic system

x˙t=f(xt,ut,t),subscript˙𝑥𝑡𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡\dot{x}_{t}=f(x_{t},u_{t},t),over˙ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) , (1)

where xtnsubscript𝑥𝑡superscript𝑛x_{t}\in\mathbb{R}^{n}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is the state at time t𝑡titalic_t, utpsubscript𝑢𝑡superscript𝑝u_{t}\in\mathbb{R}^{p}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT is the input at time t𝑡titalic_t, and f:n×p×0nf:\mathbb{R}^{n}\times\mathbb{R}^{p}\times{}_{\geq 0}\to\mathbb{R}^{n}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT × blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT × start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT → blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is a parameterized vector field. Depending on the applications, utsubscript𝑢𝑡u_{t}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT can be a control action or a disturbance. The reachable set of a deterministic system is the set of all states that the system can reach, starting from an initial configuration, under all possible inputs within a specified time period [24].

Definition II.1 (DRS).

Consider the system (1) with initial set 𝒳0n\mathcal{X}_{0}\subseteq{}^{n}caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊆ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT and input set 𝒰p\mathcal{U}\subseteq{}^{p}caligraphic_U ⊆ start_FLOATSUPERSCRIPT italic_p end_FLOATSUPERSCRIPT. The deterministic reachable set (DRS) of (1) at time t𝑡titalic_t starting from 𝒳0subscript𝒳0\mathcal{X}_{0}caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with inputs in 𝒰𝒰\mathcal{U}caligraphic_U is

t={xt|τxτ is a trajectory of (1) with x0𝒳0 and uτ:0𝒰}\displaystyle\mathcal{R}_{t}=\left\{x_{t}\middle|\begin{aligned} &\tau\mapsto x% _{\tau}\mbox{ is a trajectory of~{}\eqref{eq:deterministic}}\\ &\mbox{ with }x_{0}\in\mathcal{X}_{0}\mbox{ and }u_{\tau}:{}_{\geq 0}\to% \mathcal{U}\end{aligned}\right\}caligraphic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = { italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | start_ROW start_CELL end_CELL start_CELL italic_τ ↦ italic_x start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT is a trajectory of ( ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL with italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and italic_u start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT : start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT → caligraphic_U end_CELL end_ROW } (2)

In general, computing the exact DRS of a dynamic system is computationally intractable [25]. Therefore, most methods in reachability analysis focus on providing over-approximation of DRS [24]. A set ¯tn\overline{\mathcal{R}}_{t}\subseteq{}^{n}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊆ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT is an over-approximation of the DRS (2) if, for every t0𝑡0t\geq 0italic_t ≥ 0,

t¯t.subscript𝑡subscript¯𝑡\displaystyle\mathcal{R}_{t}\subseteq\overline{\mathcal{R}}_{t}.caligraphic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊆ over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

In Section VII, we revisit two approaches to compute ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT: contraction-based reachability and interval-based reachability.

II-C Matrix Measure and Contraction Theory

A key tool in studying reachable sets of system (1) is the matrix measure [26, 27] defined as follows.

Definition II.2 (Matrix Measure).

Given a matrix An×n𝐴superscript𝑛𝑛A\in\mathbb{R}^{n\times n}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT, its matrix measure with respect to \|\cdot\|∥ ⋅ ∥, denoted by μ(A)𝜇𝐴\mu(A)italic_μ ( italic_A ), is defined as

μ(A)=limϵ0+In+ϵA1ϵ.𝜇𝐴subscriptitalic-ϵsuperscript0normsubscript𝐼𝑛italic-ϵ𝐴1italic-ϵ\displaystyle\mu(A)=\lim_{\epsilon\to 0^{+}}\frac{\|I_{n}+\epsilon A\|-1}{% \epsilon}.italic_μ ( italic_A ) = roman_lim start_POSTSUBSCRIPT italic_ϵ → 0 start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG ∥ italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_ϵ italic_A ∥ - 1 end_ARG start_ARG italic_ϵ end_ARG .

Intuitively, μ(A)𝜇𝐴\mu(A)italic_μ ( italic_A ) can be considered as the one-sided derivative of the norm \|\cdot\|∥ ⋅ ∥ at Insubscript𝐼𝑛I_{n}italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in the direction of A𝐴Aitalic_A. Although matrix measure can be defined with respect to any norm, in this paper we focus on the spectral norm. In this case, the matrix measure has a closed-form expression μ(A)=12λmax(A+A𝖳)𝜇𝐴12subscript𝜆𝐴superscript𝐴𝖳\mu(A)=\tfrac{1}{2}\lambda_{\max}(A+A^{\mathsf{T}})italic_μ ( italic_A ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_A + italic_A start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ).

For the system (1), the evolution of the distance of two arbitrary trajectories can be measured using μ(Dxf(x,u,t))𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡\mu(D_{x}f(x,u,t))italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ). The following lemma provides a variational characterization of μ(Dxf(x,u,t))𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡\mu(D_{x}f(x,u,t))italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) [28].

Lemma II.1.

Given a deterministic system (1), for every t0𝑡0t\geq 0italic_t ≥ 0, the following statement are equivalent

  1. (i)

    μ(Dxf(x,u,t))ct𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡subscript𝑐𝑡\mu(D_{x}f(x,u,t))\leq c_{t}italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT for all (x,u)n×𝒰𝑥𝑢superscript𝑛𝒰(x,u)\in\mathbb{R}^{n}\times\mathcal{U}( italic_x , italic_u ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT × caligraphic_U.

  2. (ii)

    (xy)𝖳(f(x,u,t)f(y,u,t))ctxy2superscript𝑥𝑦𝖳𝑓𝑥𝑢𝑡𝑓𝑦𝑢𝑡subscript𝑐𝑡superscriptnorm𝑥𝑦2(x-y)^{\mathsf{T}}(f(x,u,t)-f(y,u,t))\leq c_{t}\|x-y\|^{2}( italic_x - italic_y ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( italic_f ( italic_x , italic_u , italic_t ) - italic_f ( italic_y , italic_u , italic_t ) ) ≤ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ italic_x - italic_y ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, for all (x,y,u)n××n𝒰(x,y,u)\in\mathbb{R}^{n}\times{}^{n}\times\mathcal{U}( italic_x , italic_y , italic_u ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT × start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT × caligraphic_U.

A classical result in contraction theory states that if condition (i) holds then the distance between trajectories of the system (1) can be upper bounded exponentially with time [4]. If there exists α>0𝛼0\alpha>0italic_α > 0 such that ct<αsubscript𝑐𝑡𝛼c_{t}<-\alphaitalic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT < - italic_α for all t𝑡titalic_t, then the distance between two arbitrary trajectories of (1) is decreasing over time, and we say the system is contracting [29, 30, 31]. In practice, to apply the contraction theory for reachability analysis, one needs to estimate or bound μ(Dxf(x,u,t))𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡\mu(D_{x}f(x,u,t))italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ). Several approaches have been proposed in the literature to determine the upper bound of μ(Dxf(x,u,t))𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡\mu(D_{x}f(x,u,t))italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) (see e.g., [9],[4, Chapter 3,4],[32, 33]). These methods are applicable not only to contracting systems but also to systems with any ctsubscript𝑐𝑡c_{t}\in\mathbb{R}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R. In this paper, we allow ctsubscript𝑐𝑡c_{t}\in\mathbb{R}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R rather than restricting it to be negative.

III Reachability of Stochastic Systems

In many real-world applications, the underlying dynamics are driven not only by deterministic inputs but also by stochastic disturbances. Existing methods and techniques for deterministic reachability analysis designed for deterministic and often bounded inputs/disturbances are not applicable to these scenarios with stochastic disturbances. We aim to bridge this gap by developing a unified framework of reachability analysis for stochastic systems. In this section, we formulate our probabilistic reachability problem and introduce our overall strategy for addressing it.

III-A Problem Statement

Consider the stochastic system

dXt=f(Xt,ut,t)dt+gt(Xt)dWt,𝑑subscript𝑋𝑡𝑓subscript𝑋𝑡subscript𝑢𝑡𝑡𝑑𝑡subscript𝑔𝑡subscript𝑋𝑡𝑑subscript𝑊𝑡dX_{t}=f(X_{t},u_{t},t)dt+g_{t}(X_{t})dW_{t},italic_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) italic_d italic_t + italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , (3)

where the state Xtnsubscript𝑋𝑡superscript𝑛X_{t}\in\mathbb{R}^{n}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is a random vector, ut:0𝒰pu_{t}:{}_{\geq 0}\to\mathcal{U}\subseteq\mathbb{R}^{p}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT → caligraphic_U ⊆ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT is the input, gtsubscript𝑔𝑡g_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the diffusion coefficient, and Wtmsubscript𝑊𝑡superscript𝑚W_{t}\in\mathbb{R}^{m}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is an m𝑚mitalic_m-dimensional Wiener process (Brownian motion) modeling the stochastic uncertainty. This stochastic system can be viewed as a noisy version of the deterministic system

x˙t=f(xt,ut,t).subscript˙𝑥𝑡𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡\dot{x}_{t}=f(x_{t},u_{t},t).over˙ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) . (4)

To ensure (3) has a solution, we default standard Lipschitz and linear growth conditions [34, Theorem 5.2.1]. For reachability analysis, we impose the following assumption.

Assumption 1.

For the stochastic system (3), there exist integrable curves tctmaps-to𝑡subscript𝑐𝑡t\mapsto c_{t}italic_t ↦ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and tσtmaps-to𝑡subscript𝜎𝑡t\mapsto\sigma_{t}italic_t ↦ italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT such that,

  1. (i)

    μ(Dxf(x,u,t))ct𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡subscript𝑐𝑡\mu(D_{x}f(x,u,t))\leq c_{t}italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT for any t0𝑡0t\geq 0italic_t ≥ 0, u𝒰𝑢𝒰u\in\mathcal{U}italic_u ∈ caligraphic_U, and xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

  2. (ii)

    gt(x)gt(x)𝖳σt2Inprecedes-or-equalssubscript𝑔𝑡𝑥subscript𝑔𝑡superscript𝑥𝖳superscriptsubscript𝜎𝑡2subscript𝐼𝑛g_{t}(x)g_{t}(x)^{\mathsf{T}}\preceq\sigma_{t}^{2}I_{n}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_x ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ⪯ italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT for any t0𝑡0t\geq 0italic_t ≥ 0 and xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

We are interested in characterizing the reachable set of the stochastic system (3) under Assumption 1. Departing from the deterministic dynamics (4) driven only by the input utsubscript𝑢𝑡u_{t}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, the stochastic system (3) is driven by both the input utsubscript𝑢𝑡u_{t}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and stochastic disturbance dWt/dt𝑑subscript𝑊𝑡𝑑𝑡dW_{t}/dtitalic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT / italic_d italic_t. Deterministic reachability analysis falls short of capturing this stochastic disturbance. Indeed, most methods in deterministic reachability analysis assume bounded input/disturbance and approximate its DRS through worst-case type analysis [11]. However, the stochastic disturbance dWt/dt𝑑subscript𝑊𝑡𝑑𝑡dW_{t}/dtitalic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT / italic_d italic_t is unbounded [35, Chapter 4.1]. This unbounded stochastic disturbance often results in a trivial reachable set in the sense of (2). For example, the classical reachable set of the system dXt=dWt𝑑subscript𝑋𝑡𝑑subscript𝑊𝑡dX_{t}=dW_{t}italic_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the entire state space for any t>0𝑡0t>0italic_t > 0. We resort to a probabilistic notion of reachable sets to overcome these limitations of deterministic reachability analysis.

Definition III.1 (δ𝛿\deltaitalic_δ-PRS).

Consider the stochastic system (3) with initial set 𝒳0nsubscript𝒳0superscript𝑛\mathcal{X}_{0}\subseteq\mathbb{R}^{n}caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and input set 𝒰p𝒰superscript𝑝\mathcal{U}\subseteq\mathbb{R}^{p}caligraphic_U ⊆ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT. Given δ(0,1]𝛿01\delta\in(0,1]italic_δ ∈ ( 0 , 1 ] and t0𝑡0t\geq 0italic_t ≥ 0, the set δ,tn\mathcal{R}_{\delta,t}\subseteq{}^{n}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT ⊆ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT is a δ𝛿\deltaitalic_δ-probabilistic reachable set (δ𝛿\deltaitalic_δ-PRS) of (3) at time t𝑡titalic_t, if for any x0𝒳0subscript𝑥0subscript𝒳0x_{0}\in\mathcal{X}_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and piecewise continuous ut:0𝒰:subscript𝑢𝑡subscriptabsent0𝒰u_{t}:\mathbb{R}_{\geq 0}\to\mathcal{U}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT → caligraphic_U, we have

(Xtδ,t)1δ.subscript𝑋𝑡subscript𝛿𝑡1𝛿\mathbb{P}\left(X_{t}\in\mathcal{R}_{\delta,t}\right)\geq 1-\delta.blackboard_P ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT ) ≥ 1 - italic_δ . (5)

Briefly, a probabilistic reachable set of a stochastic system (3) is the set all possible trajectories can reach with high probability. An illustration of δ𝛿\deltaitalic_δ-PRS is given in Figure 1. For sufficiently small δ𝛿\deltaitalic_δ, δ,tsubscript𝛿𝑡\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT contains the DRS of the associated deterministic system (4) due to the stochastic disturbance, that is, tδ,tsubscript𝑡subscript𝛿𝑡\mathcal{R}_{t}\subseteq\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊆ caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT. By definition, the δ𝛿\deltaitalic_δ-PRS is not unique. If δ,tsubscript𝛿𝑡\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT is a δ𝛿\deltaitalic_δ-PRS, then any δ,tδ,tsubscript𝛿𝑡superscriptsubscript𝛿𝑡\mathcal{R}_{\delta,t}^{\prime}\supseteq\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊇ caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT is also a δ𝛿\deltaitalic_δ-PRS. We say δ,tsubscript𝛿𝑡\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT is a tighter δ𝛿\deltaitalic_δ-PRS than δ,tsuperscriptsubscript𝛿𝑡\mathcal{R}_{\delta,t}^{\prime}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT if δ,tδ,tsubscript𝛿𝑡superscriptsubscript𝛿𝑡\mathcal{R}_{\delta,t}\subseteq\mathcal{R}_{\delta,t}^{\prime}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT ⊆ caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT.

Refer to caption
Figure 1: An illustration of δ𝛿\deltaitalic_δ-PRS at time t𝑡titalic_t. Here δ,tsubscript𝛿𝑡\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT is a δ𝛿\deltaitalic_δ-PRS of the stochastic system (3), whose trajectories are in color, and tsubscript𝑡\mathcal{R}_{t}caligraphic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the DRS of the associated deterministic system (4), whose trajectories are in black.

In many applications involving reachability analysis, it is desirable to have a tight δ𝛿\deltaitalic_δ-PRS. For instance, for safety-critical control, the safety of the system can be guaranteed by ensuring that the δ𝛿\deltaitalic_δ-PRS does not overlap with the unsafe regions [36]. A loose δ𝛿\deltaitalic_δ-PRS can result in very conservative strategies. Therefore, we are interested in finding the tightest possible δ𝛿\deltaitalic_δ-PRS.

Problem 1.

Find an as tight as possible δ𝛿\deltaitalic_δ-PRS δ,tsubscript𝛿𝑡\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT of the stochastic system (3) under Assumption 1.

III-B Separation Strategy and Stochastic Deviation

The trajectories of the stochastic system (3) are driven by both deterministic input and stochastic disturbance/input. The effects of these two types of inputs on the trajectories are relatively independent and may be handled separately. Building on this intuition, we propose a strategy termed separation strategy for probabilistic reachability analysis. The effects of the deterministic input can be encoded by the DRS of the associated deterministic system (4). To capture the effects of the stochastic disturbance, we associate each trajectory Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of the system (3) with a trajectory xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of the system (4) with the same initial state x0=X0subscript𝑥0subscript𝑋0x_{0}=X_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the same deterministic input utsubscript𝑢𝑡u_{t}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. The influence of the stochastic disturbance can then be represented by the deviation Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥. The probabilistic reachable set of (3) can be approximated by combining these two components as formalized below.

Refer to caption
Figure 2: An illustration of separation strategy. Here δ,tsubscript𝛿𝑡\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT is a δ𝛿\deltaitalic_δ-PRS of the stochastic system (3), whose trajectory is Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in red. ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is an over-approximation of the DRS of the associated deterministic system (4), whose trajectory is xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in black. The Minkowski sum corresponds to Proposition 1.
Proposition 1 (Separation strategy).

Consider the stochastic system (3) with its associated deterministic system (4). Let ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be any over-approximation of the DRS of (4). If there exists rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT such that, for any given trajectory xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of (4) and any associated trajectory Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of (3) with the same initial condition x0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and input uτsubscript𝑢𝜏u_{\tau}italic_u start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT,

(Xtxtrδ,t)1δ,normsubscript𝑋𝑡subscript𝑥𝑡subscript𝑟𝛿𝑡1𝛿\mathbb{P}\left(\|X_{t}-x_{t}\|\leq r_{\delta,t}\right)\geq 1-\delta,blackboard_P ( ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT ) ≥ 1 - italic_δ , (6)

then ¯tn(rδ,t,0)direct-sumsubscript¯𝑡superscript𝑛subscript𝑟𝛿𝑡0\overline{\mathcal{R}}_{t}\oplus\mathcal{B}^{n}(r_{\delta,t},0)over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) is a δ𝛿\deltaitalic_δ-PRS of (3).

Proof.

Let Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be any trajectory of (3) associated with a trajectory xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of (4), then, by the assumption (6) and the definition of the Minkowski sum [37],

Xt{xt}n(rδ,t,0)subscript𝑋𝑡direct-sumsubscript𝑥𝑡superscript𝑛subscript𝑟𝛿𝑡0\displaystyle X_{t}\in\{x_{t}\}\oplus\mathcal{B}^{n}(r_{\delta,t},0)italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ { italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 )

with probability at least 1δ1𝛿1-\delta1 - italic_δ. By the definition of ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, xt¯tsubscript𝑥𝑡subscript¯𝑡x_{t}\in\overline{\mathcal{R}}_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. Therefore, with probability at least 1δ1𝛿1-\delta1 - italic_δ,

Xt¯tn(rδ,t,0),subscript𝑋𝑡direct-sumsubscript¯𝑡superscript𝑛subscript𝑟𝛿𝑡0X_{t}\in\overline{\mathcal{R}}_{t}\oplus\mathcal{B}^{n}(r_{\delta,t},0),italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) ,

which completes the proof. ∎

We term the difference Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ between associated trajectories stochastic deviation. A key ingredient of Proposition 1 is a probabilistic bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT that upper bounds the stochastic deviation with high probability. Proposition 1 states that if a probabilistic bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT exists, then the dilation of the reachable set of the deterministic system (4) with a ball of radius rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT is a δ𝛿\deltaitalic_δ-PRS of (3). This separation strategy decomposes the probabilistic reachability analysis problem into two parts: approximate the DRS of (4) and estimate the probabilistic bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT of the stochastic deviation. Once a bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT of the stochastic deviation is provided, one can combine it with any existing deterministic reachability method to approximate the δ𝛿\deltaitalic_δ-PRS.

The size of the δ𝛿\deltaitalic_δ-PRS ¯tn(rδ,t,0)direct-sumsubscript¯𝑡superscript𝑛subscript𝑟𝛿𝑡0\overline{\mathcal{R}}_{t}\oplus\mathcal{B}^{n}(r_{\delta,t},0)over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) in Proposition 1 increases with rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT. To ensure ¯tn(rδ,t,0)direct-sumsubscript¯𝑡superscript𝑛subscript𝑟𝛿𝑡0\overline{\mathcal{R}}_{t}\oplus\mathcal{B}^{n}(r_{\delta,t},0)over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) is not an overly-conservative δ𝛿\deltaitalic_δ-PRS of (3), it is crucial to establish an as tight as possible probabilistic bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT for the stochastic deviation. This is the main challenge addressed in this paper.

Problem 2.

Establish an as tight as possible probabilistic bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT of the stochastic deviation Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ associated with systems (3)-(4) under Assumption 1.

IV Expectation Bound and Limitations

To warm up, we first revisit an existing approach [38] for Problem 2 and highlight its limitations.

IV-A Expectation Bound on Stochastic Deviation

Inspired by [38] we present a method that probabilistically bounds the stochastic deviation Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ under Assumption 1 by bounding the expectation 𝔼(Xtxt2)𝔼superscriptnormsubscript𝑋𝑡subscript𝑥𝑡2\mathbb{E}(\|X_{t}-x_{t}\|^{2})blackboard_E ( ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ).

For a trajectory Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of the stochastic system (3) and the associated trajectory xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of the deterministic system (4), define the Lyapunov function Vt=Xtxt2subscript𝑉𝑡superscriptnormsubscript𝑋𝑡subscript𝑥𝑡2V_{t}=\|X_{t}-x_{t}\|^{2}italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Then a direct application of the Ito’s Lemma [35] yields

dVt𝑑subscript𝑉𝑡\displaystyle dV_{t}italic_d italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =2(Xtxt)𝖳(f(Xt,ut,t)f(xt,ut,t))dtabsent2superscriptsubscript𝑋𝑡subscript𝑥𝑡𝖳𝑓subscript𝑋𝑡subscript𝑢𝑡𝑡𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡𝑑𝑡\displaystyle=2\left(X_{t}-x_{t}\right)^{\mathsf{T}}(f(X_{t},u_{t},t)-f(x_{t},% u_{t},t))dt= 2 ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) - italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) ) italic_d italic_t
+tr(gt𝖳gt)dt+2(Xtxt)𝖳gtdWttrsuperscriptsubscript𝑔𝑡𝖳subscript𝑔𝑡𝑑𝑡2superscriptsubscript𝑋𝑡subscript𝑥𝑡𝖳subscript𝑔𝑡𝑑subscript𝑊𝑡\displaystyle+\mathrm{tr}(g_{t}^{\mathsf{T}}g_{t})dt+2(X_{t}-x_{t})^{\mathsf{T% }}g_{t}dW_{t}+ roman_tr ( italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_t + 2 ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (7)

Following standard Itó Calculus, for every t,h0𝑡0t,h\geq 0italic_t , italic_h ≥ 0,

𝔼(Vt+h)𝔼subscript𝑉𝑡\displaystyle\mathbb{E}(V_{t+h})blackboard_E ( italic_V start_POSTSUBSCRIPT italic_t + italic_h end_POSTSUBSCRIPT ) 𝔼(Vt)=𝔼(tt+h𝑑Vs)𝔼subscript𝑉𝑡𝔼superscriptsubscript𝑡𝑡differential-dsubscript𝑉𝑠\displaystyle-\mathbb{E}(V_{t})=\mathbb{E}\left(\int_{t}^{t+h}dV_{s}\right)- blackboard_E ( italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = blackboard_E ( ∫ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + italic_h end_POSTSUPERSCRIPT italic_d italic_V start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT )
tt+h𝔼(dVs)absentsuperscriptsubscript𝑡𝑡𝔼𝑑subscript𝑉𝑠\displaystyle\leq\int_{t}^{t+h}\mathbb{E}(dV_{s})≤ ∫ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + italic_h end_POSTSUPERSCRIPT blackboard_E ( italic_d italic_V start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT )
tt+h(2cs𝔼(Xsxs2)+nσs2)𝑑sabsentsuperscriptsubscript𝑡𝑡2subscript𝑐𝑠𝔼superscriptnormsubscript𝑋𝑠subscript𝑥𝑠2𝑛subscriptsuperscript𝜎2𝑠differential-d𝑠\displaystyle\leq\int_{t}^{t+h}(2c_{s}\mathbb{E}(\|X_{s}-x_{s}\|^{2})+n\sigma^% {2}_{s})ds≤ ∫ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + italic_h end_POSTSUPERSCRIPT ( 2 italic_c start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT blackboard_E ( ∥ italic_X start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + italic_n italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) italic_d italic_s
=tt+h(2cs𝔼(Vs)+nσs2)𝑑s.absentsuperscriptsubscript𝑡𝑡2subscript𝑐𝑠𝔼subscript𝑉𝑠𝑛subscriptsuperscript𝜎2𝑠differential-d𝑠\displaystyle=\int_{t}^{t+h}\left(2c_{s}\mathbb{E}(V_{s})+n\sigma^{2}_{s}% \right)ds.= ∫ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + italic_h end_POSTSUPERSCRIPT ( 2 italic_c start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT blackboard_E ( italic_V start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) + italic_n italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) italic_d italic_s .

where the first inequality holds by the triangle inequality and the second inequality holds by Lemma II.1. Taking the limsup of both side as h00h\to 0italic_h → 0, for every t0𝑡0t\geq 0italic_t ≥ 0, we get

D+𝔼(Vt)2ct𝔼(Vt)+nσt2,V0=0,formulae-sequencesuperscript𝐷𝔼subscript𝑉𝑡2subscript𝑐𝑡𝔼subscript𝑉𝑡𝑛superscriptsubscript𝜎𝑡2subscript𝑉00\displaystyle D^{+}\mathbb{E}(V_{t})\leq 2c_{t}\mathbb{E}(V_{t})+n\sigma_{t}^{% 2},\quad V_{0}=0,italic_D start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT blackboard_E ( italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≤ 2 italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT blackboard_E ( italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + italic_n italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 , (8)

where D+superscript𝐷D^{+}italic_D start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT is the upper Dini Derivative with respect to t𝑡titalic_t. By the generalized Gröwall-Bellman lemma [39, Appendix A1, Proposition 4], it follows the expectation bound

𝔼(Xtxt2)=𝔼(Vt)nΨt,𝔼superscriptnormsubscript𝑋𝑡subscript𝑥𝑡2𝔼subscript𝑉𝑡𝑛subscriptΨ𝑡\mathbb{E}(\|X_{t}-x_{t}\|^{2})=\mathbb{E}(V_{t})\leq n\Psi_{t},blackboard_E ( ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) = blackboard_E ( italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≤ italic_n roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , (9)

where

ΨtsubscriptΨ𝑡\displaystyle\Psi_{t}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =\displaystyle== e2ψt0tστ2e2ψτ𝑑τsuperscript𝑒2subscript𝜓𝑡superscriptsubscript0𝑡superscriptsubscript𝜎𝜏2superscript𝑒2subscript𝜓𝜏differential-d𝜏\displaystyle e^{2\psi_{t}}\int_{0}^{t}\sigma_{\tau}^{2}e^{-2\psi_{\tau}}d\tauitalic_e start_POSTSUPERSCRIPT 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - 2 italic_ψ start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_d italic_τ (10a)
ψtsubscript𝜓𝑡\displaystyle\psi_{t}italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =\displaystyle== 0tcτ𝑑τ.superscriptsubscript0𝑡subscript𝑐𝜏differential-d𝜏\displaystyle\int_{0}^{t}c_{\tau}d\tau.∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT italic_d italic_τ . (10b)

Applying Markov inequality to the expectation bound (9), we obtain the probabilistic bound

(XtxtnδΨt)=(VtnδΨt)1δnormsubscript𝑋𝑡subscript𝑥𝑡𝑛𝛿subscriptΨ𝑡subscript𝑉𝑡𝑛𝛿subscriptΨ𝑡1𝛿\mathbb{P}\left(\|X_{t}-x_{t}\|\leq\sqrt{\frac{n}{\delta}\Psi_{t}}\right)=% \mathbb{P}\left(V_{t}\leq\frac{n}{\delta}\Psi_{t}\right)\geq 1-\deltablackboard_P ( ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG divide start_ARG italic_n end_ARG start_ARG italic_δ end_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG ) = blackboard_P ( italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≤ divide start_ARG italic_n end_ARG start_ARG italic_δ end_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≥ 1 - italic_δ (11)

for any δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ).

IV-B Limitations of Expectation Bound

The bound (11) based on the expectation bound (9) turns out to be loose. To see this, consider the linear time-invariant (LTI) stochastic system

dXt=(AXt+But)dt+σdWt𝑑subscript𝑋𝑡𝐴subscript𝑋𝑡𝐵subscript𝑢𝑡𝑑𝑡𝜎𝑑subscript𝑊𝑡\displaystyle dX_{t}=(AX_{t}+Bu_{t})dt+\sigma dW_{t}italic_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ( italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_B italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_t + italic_σ italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (12)

and the associated deterministic system

x˙t=Axt+But.subscript˙𝑥𝑡𝐴subscript𝑥𝑡𝐵subscript𝑢𝑡\displaystyle\dot{x}_{t}=Ax_{t}+Bu_{t}.over˙ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_A italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_B italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT . (13)

In this case, the bound (11) reads

(Xtxtrδ,t(1))1δ,normsubscript𝑋𝑡subscript𝑥𝑡superscriptsubscript𝑟𝛿𝑡11𝛿\mathbb{P}\left(\|X_{t}-x_{t}\|\leq r_{\delta,t}^{(1)}\right)\geq 1-\delta,blackboard_P ( ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT ) ≥ 1 - italic_δ , (14)

where rδ,t(1)=nσ2(e2ct1)2cδsuperscriptsubscript𝑟𝛿𝑡1𝑛superscript𝜎2superscript𝑒2𝑐𝑡12𝑐𝛿r_{\delta,t}^{(1)}=\sqrt{\frac{n\sigma^{2}(e^{2ct}-1)}{2c\delta}}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT = square-root start_ARG divide start_ARG italic_n italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_e start_POSTSUPERSCRIPT 2 italic_c italic_t end_POSTSUPERSCRIPT - 1 ) end_ARG start_ARG 2 italic_c italic_δ end_ARG end_ARG with c=μ(A)𝑐𝜇𝐴c=\mu(A)italic_c = italic_μ ( italic_A ).

On the other hand, when initialized at X0=x0subscript𝑋0subscript𝑥0X_{0}=x_{0}italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is a Gaussian random variable [35] with mean 𝔼(Xt)=xt𝔼subscript𝑋𝑡subscript𝑥𝑡\mathbb{E}(X_{t})=x_{t}blackboard_E ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and covariance matrix

cov(Xt)=0tσ2eA(tτ)eA𝖳(tτ)𝑑τ.covsubscript𝑋𝑡superscriptsubscript0𝑡superscript𝜎2superscript𝑒𝐴𝑡𝜏superscript𝑒superscript𝐴𝖳𝑡𝜏differential-d𝜏\displaystyle\text{cov}(X_{t})=\int_{0}^{t}\sigma^{2}e^{A(t-\tau)}e^{A^{% \mathsf{T}}(t-\tau)}d\tau.cov ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_A ( italic_t - italic_τ ) end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( italic_t - italic_τ ) end_POSTSUPERSCRIPT italic_d italic_τ . (15)

Invoking the fact that eAteμ(A)tnormsuperscript𝑒𝐴𝑡superscript𝑒𝜇𝐴𝑡\|e^{At}\|\leq e^{\mu(A)t}∥ italic_e start_POSTSUPERSCRIPT italic_A italic_t end_POSTSUPERSCRIPT ∥ ≤ italic_e start_POSTSUPERSCRIPT italic_μ ( italic_A ) italic_t end_POSTSUPERSCRIPT for any t0𝑡0t\geq 0italic_t ≥ 0 [40], cov(Xt)covsubscript𝑋𝑡\text{cov}(X_{t})cov ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) can be bounded as

cov(Xt)0tσ2eA(tτ)eA𝖳(tτ)𝑑τInσ20te2c(tτ)𝑑τIn=σ22c(e2ct1)In.precedes-or-equalscovsubscript𝑋𝑡superscriptsubscript0𝑡superscript𝜎2delimited-∥∥superscript𝑒𝐴𝑡𝜏delimited-∥∥superscript𝑒superscript𝐴𝖳𝑡𝜏differential-d𝜏subscript𝐼𝑛precedes-or-equalssuperscript𝜎2superscriptsubscript0𝑡superscript𝑒2𝑐𝑡𝜏differential-d𝜏subscript𝐼𝑛superscript𝜎22𝑐superscript𝑒2𝑐𝑡1subscript𝐼𝑛\begin{split}\text{cov}(X_{t})&\preceq\int_{0}^{t}\sigma^{2}\|e^{A(t-\tau)}\|% \|e^{A^{\mathsf{T}}(t-\tau)}\|d\tau\,I_{n}\\ &\preceq\sigma^{2}\int_{0}^{t}e^{2c(t-\tau)}d\tau\,I_{n}\\ &=\tfrac{\sigma^{2}}{2c}(e^{2ct}-1)\,I_{n}.\end{split}start_ROW start_CELL cov ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_CELL start_CELL ⪯ ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_e start_POSTSUPERSCRIPT italic_A ( italic_t - italic_τ ) end_POSTSUPERSCRIPT ∥ ∥ italic_e start_POSTSUPERSCRIPT italic_A start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( italic_t - italic_τ ) end_POSTSUPERSCRIPT ∥ italic_d italic_τ italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ⪯ italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT 2 italic_c ( italic_t - italic_τ ) end_POSTSUPERSCRIPT italic_d italic_τ italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_c end_ARG ( italic_e start_POSTSUPERSCRIPT 2 italic_c italic_t end_POSTSUPERSCRIPT - 1 ) italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT . end_CELL end_ROW (16)

By the concentration property of Gaussian distribution [41, Chapter 7], for any δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ), with probability at least 1δ1𝛿1-\delta1 - italic_δ,

Xtxtcov(Xt)(4n+22log(1/δ)).normsubscript𝑋𝑡subscript𝑥𝑡normcovsubscript𝑋𝑡4𝑛221𝛿\|X_{t}-x_{t}\|\leq\sqrt{\|\text{cov}(X_{t})\|}(4\sqrt{n}+2\sqrt{2\log(1/% \delta)}).∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG ∥ cov ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ∥ end_ARG ( 4 square-root start_ARG italic_n end_ARG + 2 square-root start_ARG 2 roman_log ( 1 / italic_δ ) end_ARG ) . (17)

Plugging (16) into (17) yields

(Xtxtrδ,t(2))1δ,normsubscript𝑋𝑡subscript𝑥𝑡subscriptsuperscript𝑟2𝛿𝑡1𝛿\displaystyle\mathbb{P}\left(\|X_{t}-x_{t}\|\leq r^{(2)}_{\delta,t}\right)\geq 1% -\delta,blackboard_P ( ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ italic_r start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT ) ≥ 1 - italic_δ , (18)

where rδ,t(2)=σ22c(e2ct1)(4n+22log(1/δ))superscriptsubscript𝑟𝛿𝑡2superscript𝜎22𝑐superscript𝑒2𝑐𝑡14𝑛221𝛿r_{\delta,t}^{(2)}=\sqrt{\tfrac{\sigma^{2}}{2c}(e^{2ct}-1)}(4\sqrt{n}+2\sqrt{2% \log(1/\delta)})italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT = square-root start_ARG divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_c end_ARG ( italic_e start_POSTSUPERSCRIPT 2 italic_c italic_t end_POSTSUPERSCRIPT - 1 ) end_ARG ( 4 square-root start_ARG italic_n end_ARG + 2 square-root start_ARG 2 roman_log ( 1 / italic_δ ) end_ARG ).

The bound (18) is substantially better than (14). While the dependency of rδ,t(1)superscriptsubscript𝑟𝛿𝑡1r_{\delta,t}^{(1)}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT and rδ,t(2)superscriptsubscript𝑟𝛿𝑡2r_{\delta,t}^{(2)}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT on c𝑐citalic_c and n𝑛nitalic_n are of the same order, the dependency of rδ,t(2)subscriptsuperscript𝑟2𝛿𝑡r^{(2)}_{\delta,t}italic_r start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT on δ𝛿\deltaitalic_δ is 𝒪(log(1/δ))𝒪1𝛿\mathcal{O}(\sqrt{\log(1/\delta)})caligraphic_O ( square-root start_ARG roman_log ( 1 / italic_δ ) end_ARG ), much better than the 𝒪(1/δ)𝒪1𝛿\mathcal{O}\left(\sqrt{1/\delta}\right)caligraphic_O ( square-root start_ARG 1 / italic_δ end_ARG ) dependency of rδ,t(1)subscriptsuperscript𝑟1𝛿𝑡r^{(1)}_{\delta,t}italic_r start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT on δ𝛿\deltaitalic_δ. For small δ𝛿\deltaitalic_δ (e.g., 1010superscript101010^{-10}10 start_POSTSUPERSCRIPT - 10 end_POSTSUPERSCRIPT), which is crucial for safety-critical systems, log(1/δ)1𝛿\sqrt{\log(1/\delta)}square-root start_ARG roman_log ( 1 / italic_δ ) end_ARG is significantly smaller than 1/δ1𝛿\sqrt{1/\delta}square-root start_ARG 1 / italic_δ end_ARG (4.804.804.804.80 v.s. 105superscript10510^{5}10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT). As a result, the probabilistic reachable set based on the bound (11) can be conservative in practice.

Thus, there is a significant gap between the result (11) for nonlinear dynamics and probabilistic bounds for linear dynamics. The limitation of the expectation bound primarily lies in the quadratic Lyapunov function Vt=Xtxt2subscript𝑉𝑡superscriptnormsubscript𝑋𝑡subscript𝑥𝑡2V_{t}=\|X_{t}-x_{t}\|^{2}italic_V start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The analysis focuses only on the evolution of the second order moment 𝔼(Xtxt2)𝔼superscriptnormsubscript𝑋𝑡subscript𝑥𝑡2\mathbb{E}(\|X_{t}-x_{t}\|^{2})blackboard_E ( ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). It can at best guarantee a probabilistic bound for Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ of order 𝒪(1/δ)𝒪1𝛿\mathcal{O}(\sqrt{1/\delta})caligraphic_O ( square-root start_ARG 1 / italic_δ end_ARG ) via Markov inequality. This gives rise to the question: is the gap fundamental or an artifact of the analysis?

V Probabilistic Bound on Stochastic Deviation

In this section, we answer the aforementioned question by establishing a probabilistic bound for the stochastic deviation Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ of order 𝒪(log(1/δ))𝒪1𝛿\mathcal{O}(\sqrt{\log(1/\delta)})caligraphic_O ( square-root start_ARG roman_log ( 1 / italic_δ ) end_ARG ) for general nonlinear stochastic systems (3) under Assumption 1. We further show our bound is consistent with that for linear systems under the same assumption and is thus tight.

V-A Sub-Gaussian and MGF

The analysis (15)-(18) relying on the Gaussianity for linear systems can not be applied to (3) since Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is not necessarily Gaussian for nonlinear systems. Fortunately, the norm concentration property (17) holds not only for Gaussian random vectors (distributions) but also for a wider class of random vectors known as sub-Gaussian vectors (distributions).

Definition V.1.

A random variable Xn𝑋superscript𝑛X\in\mathbb{R}^{n}italic_X ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is said to be sub-Gaussian with variance proxy σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, denoted as XsubG(σ2)similar-to𝑋𝑠𝑢𝑏𝐺superscript𝜎2X\sim subG(\sigma^{2})italic_X ∼ italic_s italic_u italic_b italic_G ( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), if 𝔼X(X)=0subscript𝔼𝑋𝑋0\mathbb{E}_{X}(X)=0blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_X ) = 0 and

𝔼X(eλ,X)eλ2σ22,λ,𝒮n1.formulae-sequencesubscript𝔼𝑋superscript𝑒𝜆𝑋superscript𝑒superscript𝜆2superscript𝜎22formulae-sequencefor-all𝜆for-allsuperscript𝒮𝑛1\mathbb{E}_{X}\left(e^{\lambda\langle\ell,X\rangle}\right)\leq e^{\frac{% \lambda^{2}\sigma^{2}}{2}},~{}\forall\lambda\in\mathbb{R},~{}\forall\ell\in% \mathcal{S}^{n-1}.blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_X ⟩ end_POSTSUPERSCRIPT ) ≤ italic_e start_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT , ∀ italic_λ ∈ blackboard_R , ∀ roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT . (19)

Many distributions including Gaussian distribution, zero-mean uniform distribution, and any zero-mean distribution with bounded support are instances of sub-Gaussian distributions. For Gaussian distribution, the variance proxy σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is cov(X)normcov𝑋\|\text{cov}(X)\|∥ cov ( italic_X ) ∥.

Sub-Gaussian distributions share the same norm concentration property as Gaussian distributions. For the sake of completeness, we present a version of the concentration property and its proof in Appendix -A.

Lemma V.1.

Let Xn𝑋superscript𝑛X\in\mathbb{R}^{n}italic_X ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT be a sub-Gaussian random vector with variance proxy σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, then for any δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ) and any ε(0,1)𝜀01\varepsilon\in(0,1)italic_ε ∈ ( 0 , 1 ),

Xσε1n+ε2log(1/δ)norm𝑋𝜎subscript𝜀1𝑛subscript𝜀21𝛿\|X\|\leq\sigma\sqrt{\varepsilon_{1}n+\varepsilon_{2}\log(1/\delta)}∥ italic_X ∥ ≤ italic_σ square-root start_ARG italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) end_ARG (20)

holds with probability at least 1δ1𝛿1-\delta1 - italic_δ, where

ε1=2log(1+2/ε)(1ε)2,ε2=2(1ε)2.formulae-sequencesubscript𝜀1212𝜀superscript1𝜀2subscript𝜀22superscript1𝜀2\varepsilon_{1}=\frac{2\log(1+2/\varepsilon)}{(1-\varepsilon)^{2}},~{}% \varepsilon_{2}=\frac{2}{(1-\varepsilon)^{2}}.italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = divide start_ARG 2 roman_log ( 1 + 2 / italic_ε ) end_ARG start_ARG ( 1 - italic_ε ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = divide start_ARG 2 end_ARG start_ARG ( 1 - italic_ε ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (21)

Lemma V.1 states a probabilistic bound of the norm Xnorm𝑋\|X\|∥ italic_X ∥ of a sub-Gaussian random vector that scales as 𝒪(n)𝒪𝑛\mathcal{O}(\sqrt{n})caligraphic_O ( square-root start_ARG italic_n end_ARG ) and 𝒪(log(1/δ))𝒪1𝛿\mathcal{O}(\sqrt{\log(1/\delta)})caligraphic_O ( square-root start_ARG roman_log ( 1 / italic_δ ) end_ARG ), the same as (17). The parameter ε𝜀\varepsilonitalic_ε can be selected according to the values of n,δ𝑛𝛿n,\deltaitalic_n , italic_δ to minimize the bound. When ε=0.5𝜀0.5\varepsilon=0.5italic_ε = 0.5, ε1=8log516subscript𝜀18516\varepsilon_{1}=8\log 5\approx 16italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 8 roman_log 5 ≈ 16 and ε2=8subscript𝜀28\varepsilon_{2}=8italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 8. Since σ2=cov(X)superscript𝜎2normcov𝑋\sigma^{2}=\|\text{cov}(X)\|italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ cov ( italic_X ) ∥ for Gaussian, (20) becomes (17) after applying Jensen’s Inequality. The dependence 𝒪(n)𝒪𝑛\mathcal{O}(\sqrt{n})caligraphic_O ( square-root start_ARG italic_n end_ARG ) and 𝒪(log(1/δ))𝒪1𝛿\mathcal{O}(\sqrt{\log(1/\delta)})caligraphic_O ( square-root start_ARG roman_log ( 1 / italic_δ ) end_ARG ) in Lemma V.1 is tight, but the expressions of ε1,ε2subscript𝜀1subscript𝜀2\varepsilon_{1},\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in (21) are constructed in the proof and are by no means optimal, especially for specific values of n𝑛nitalic_n. For example, when the dimension n=1𝑛1n=1italic_n = 1, by Hoeffding’s Inequality [42, Chapter 1.2], a better choice is ε1=2log2subscript𝜀122\varepsilon_{1}=2\log 2italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 2 roman_log 2 and ε2=2subscript𝜀22\varepsilon_{2}=2italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 2.

To show a random variable X𝑋Xitalic_X is sub-Gaussian, one needs to verify 𝔼X(X)=0subscript𝔼𝑋𝑋0\mathbb{E}_{X}(X)=0blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_X ) = 0 and the inequality (19). Note that the left-hand side of (19) is the Moment Generating Function (MGF) [42, Chapter 1.1]

𝔼X(Mλ,(X)):=𝔼X(eλ,X),𝒮n1formulae-sequenceassignsubscript𝔼𝑋subscript𝑀𝜆𝑋subscript𝔼𝑋superscript𝑒𝜆𝑋superscript𝒮𝑛1\mathbb{E}_{X}\left(M_{\lambda,\ell}(X)\right):=\mathbb{E}_{X}\left(e^{\lambda% \langle\ell,X\rangle}\right),\quad\ell\in\mathcal{S}^{n-1}blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_M start_POSTSUBSCRIPT italic_λ , roman_ℓ end_POSTSUBSCRIPT ( italic_X ) ) := blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_X ⟩ end_POSTSUPERSCRIPT ) , roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT (22)

a common tool for concentration analysis. One advantage of the MGF compared with the second-order moment used in Section IV is that the MGF captures high-order information, and this is a major reason why MGF is useful for analyzing concentration properties.

Thus, a potential approach to bound Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ is to show Xtxtsubscript𝑋𝑡subscript𝑥𝑡X_{t}-x_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is sub-Gaussian. Unfortunately, this is not true. For associated trajectories Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, 𝔼(Xt)xt𝔼subscript𝑋𝑡subscript𝑥𝑡\mathbb{E}(X_{t})\neq x_{t}blackboard_E ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≠ italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT for general nonlinear dynamics [35, Chapter 5.5]. Moreover, (19) requires bounding the evolution of the MGF for all 𝒮n1superscript𝒮𝑛1\ell\in\mathcal{S}^{n-1}roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT, which can be too strong.

V-B Averaged Moment Generating Function

Inspired by the concentration properties of sub-Gaussian distributions and the limitations of MGF, we propose a weaker version of MGF termed the Averaged Moment Generating Function (AMGF) for probabilistic reachability analysis.

Definition V.2 (AMGF).

Given λ𝜆\lambda\in\mathbb{R}italic_λ ∈ blackboard_R, the Averaged Moment Generating Function Φn,λ:n:subscriptΦ𝑛𝜆superscript𝑛\Phi_{n,\lambda}:\mathbb{R}^{n}\to\mathbb{R}roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT : blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT → blackboard_R is defined as

𝔼X(Φn,λ(X)):=𝔼X𝔼𝒮n1(eλ,X).assignsubscript𝔼𝑋subscriptΦ𝑛𝜆𝑋subscript𝔼𝑋subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆𝑋\mathbb{E}_{X}(\Phi_{n,\lambda}(X)):=\mathbb{E}_{X}\mathbb{E}_{\ell\sim% \mathcal{S}^{n-1}}\left(e^{\lambda\langle\ell,X\rangle}\right).blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X ) ) := blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_X ⟩ end_POSTSUPERSCRIPT ) . (23)

The AMGF is an average of the MGF over the sphere 𝒮n1similar-tosuperscript𝒮𝑛1\ell\sim\mathcal{S}^{n-1}roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. It was recently proposed in [43] to study sampling problems. Thanks to the averaging, bounding the AMGF is easier than bounding MGF for each \ellroman_ℓ. The AMGF can also be viewed as an MGF by replacing the exponential energy function eλ,xsuperscript𝑒𝜆𝑥e^{\lambda\langle\ell,x\rangle}italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x ⟩ end_POSTSUPERSCRIPT by Φn,λ(x)=𝔼𝒮n1(eλ,x)subscriptΦ𝑛𝜆𝑥subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆𝑥\Phi_{n,\lambda}(x)=\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda% \langle\ell,x\rangle}\right)roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x ⟩ end_POSTSUPERSCRIPT ). This energy function Φn,λsubscriptΦ𝑛𝜆\Phi_{n,\lambda}roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT has several intriguing properties.

Lemma V.2 (Properties of Φn,λsubscriptΦ𝑛𝜆\Phi_{n,\lambda}roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT).

The following statements hold for Φn,λsubscriptΦ𝑛𝜆\Phi_{n,\lambda}roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT in (23):

  1. (i)

    Rotation invariance: For any xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and η𝒮n1𝜂superscript𝒮𝑛1\eta\in\mathcal{S}^{n-1}italic_η ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT,

    Φn,λ(x)=Φn,λ(xη).subscriptΦ𝑛𝜆𝑥subscriptΦ𝑛𝜆norm𝑥𝜂\displaystyle\Phi_{n,\lambda}(x)=\Phi_{n,\lambda}(\|x\|\eta).roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) = roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( ∥ italic_x ∥ italic_η ) .
  2. (ii)

    Monotonicity: For any x,yn𝑥𝑦superscript𝑛x,y\in\mathbb{R}^{n}italic_x , italic_y ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT such that xynorm𝑥norm𝑦\|x\|\leq\|y\|∥ italic_x ∥ ≤ ∥ italic_y ∥,

    1Φn,λ(x)Φn,λ(y).1subscriptΦ𝑛𝜆𝑥subscriptΦ𝑛𝜆𝑦1\leq\Phi_{n,\lambda}(x)\leq\Phi_{n,\lambda}(y).1 ≤ roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) ≤ roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_y ) .

Lemma V.2 implies that Φn,λ(x)subscriptΦ𝑛𝜆𝑥\Phi_{n,\lambda}(x)roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) only depends on the norm xnorm𝑥\|x\|∥ italic_x ∥ of x𝑥xitalic_x and is monotonically increasing as xnorm𝑥\|x\|∥ italic_x ∥. For a non-expanding deterministic system (4), that is, μ(Dxf(x,u,t))0𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡0\mu(D_{x}f(x,u,t))\leq 0italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ 0, these properties imply that Φn,λ(xtyt)subscriptΦ𝑛𝜆subscript𝑥𝑡subscript𝑦𝑡\Phi_{n,\lambda}(x_{t}-y_{t})roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) is decreasing for any two arbitrary trajectories xt,ytsubscript𝑥𝑡subscript𝑦𝑡x_{t},y_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. This can be formalized as follows.

Lemma V.3.

Consider the deterministic system (4) such that μ(Dxf(x,u,t))0𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡0\mu(D_{x}f(x,u,t))\leq 0italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ 0 for every (x,u,t)×n𝒰×0(x,u,t)\in{}^{n}\times\mathcal{U}\times{}_{\geq 0}( italic_x , italic_u , italic_t ) ∈ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT × caligraphic_U × start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT, then for any x,yn𝑥𝑦superscript𝑛x,y\in\mathbb{R}^{n}italic_x , italic_y ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, u𝒰𝑢𝒰u\in\mathcal{U}italic_u ∈ caligraphic_U and t0𝑡0t\geq 0italic_t ≥ 0:

𝔼𝒮n1(eλ,xyλ𝖳(f(x,u,t)f(y,u,t)))0.subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆𝑥𝑦𝜆superscript𝖳𝑓𝑥𝑢𝑡𝑓𝑦𝑢𝑡0\displaystyle\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\langle\ell% ,x-y\rangle}\lambda\ell^{\mathsf{T}}(f(x,u,t)-f(y,u,t))\right)\leq 0.blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x - italic_y ⟩ end_POSTSUPERSCRIPT italic_λ roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( italic_f ( italic_x , italic_u , italic_t ) - italic_f ( italic_y , italic_u , italic_t ) ) ) ≤ 0 .

An intriguing fact about AMGF is that it induces the same concentration property as MGF.

Lemma V.4.

If a random variable Xn𝑋superscript𝑛X\in\mathbb{R}^{n}italic_X ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT satisfies

𝔼X(Φn,λ(X))eλ2σ22,λ,formulae-sequencesubscript𝔼𝑋subscriptΦ𝑛𝜆𝑋superscript𝑒superscript𝜆2superscript𝜎22for-all𝜆\mathbb{E}_{X}\left(\Phi_{n,\lambda}(X)\right)\leq e^{\frac{\lambda^{2}\sigma^% {2}}{2}},~{}\forall\lambda\in\mathbb{R},blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X ) ) ≤ italic_e start_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT , ∀ italic_λ ∈ blackboard_R , (24)

then for any δ>0𝛿0\delta>0italic_δ > 0, (20) holds with probability at least 1δ1𝛿1-\delta1 - italic_δ.

At first sight, this is counter-intuitive, since upper-bounding AMGF is weaker than upper-bounding MGF for all \ellroman_ℓ. To see why Lemma V.4 holds, define an intermediate random variable X~=QX~𝑋𝑄𝑋\tilde{X}=QXover~ start_ARG italic_X end_ARG = italic_Q italic_X where Q𝕌nsimilar-to𝑄superscript𝕌𝑛Q\sim\mathbb{U}^{n}italic_Q ∼ blackboard_U start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is a random unitary matrix with 𝕌nsuperscript𝕌𝑛\mathbb{U}^{n}blackboard_U start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT denoting the set of all the unitary matrices in n×nsuperscript𝑛𝑛\mathbb{R}^{n\times n}blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT. Then the AMGF over X𝑋Xitalic_X is equal to the MGF over X~~𝑋\tilde{X}over~ start_ARG italic_X end_ARG, that is, 𝔼X(Φn,λ(X))=𝔼X~(eλ,X~)subscript𝔼𝑋subscriptΦ𝑛𝜆𝑋subscript𝔼~𝑋superscript𝑒𝜆~𝑋\mathbb{E}_{X}\left(\Phi_{n,\lambda}(X)\right)=\mathbb{E}_{\tilde{X}}\left(e^{% \lambda\langle\ell,\tilde{X}\rangle}\right)blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X ) ) = blackboard_E start_POSTSUBSCRIPT over~ start_ARG italic_X end_ARG end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , over~ start_ARG italic_X end_ARG ⟩ end_POSTSUPERSCRIPT ). This means X~~𝑋\tilde{X}over~ start_ARG italic_X end_ARG is sub-Gaussian with variance proxy σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Lemma V.4 then follows by noticing that the transformation X~=QX~𝑋𝑄𝑋\tilde{X}=QXover~ start_ARG italic_X end_ARG = italic_Q italic_X does not affect the norm.

V-C Theoretical Analysis

Equipped with the AMGF, we are ready to establish a tighter probabilistic bound for the stochastic deviation Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥. Thanks to Lemma V.4, it suffices to bound the evolution of the AMGF 𝔼(Φn,λ(Xtxt))𝔼subscriptΦ𝑛𝜆subscript𝑋𝑡subscript𝑥𝑡\mathbb{E}(\Phi_{n,\lambda}(X_{t}-x_{t}))blackboard_E ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) over time. Below we establish a probabilistic bound of the stochastic deviation of order 𝒪(log(1/δ))𝒪1𝛿\mathcal{O}(\sqrt{\log(1/\delta)})caligraphic_O ( square-root start_ARG roman_log ( 1 / italic_δ ) end_ARG ) for the stochastic system (3) satisfying Assumption 1 by developing a tight bound of 𝔼(Φn,λ(Xtxt))𝔼subscriptΦ𝑛𝜆subscript𝑋𝑡subscript𝑥𝑡\mathbb{E}(\Phi_{n,\lambda}(X_{t}-x_{t}))blackboard_E ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ).

Theorem 1.

Consider the stochastic system (3) and the deterministic system (4) under Assumption 1. Let Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be a trajectory of (3) and xtsubscript𝑥𝑡x_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be an associated trajectory of (4) with the same initial condition x0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and input ut:t𝒰:subscript𝑢𝑡𝑡𝒰u_{t}:t\to\mathcal{U}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : italic_t → caligraphic_U. Then, for any t>0𝑡0t>0italic_t > 0, δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ) and ε(0,1)𝜀01\varepsilon\in(0,1)italic_ε ∈ ( 0 , 1 ),

XtxtΨt(ε1n+ε2log(1/δ)),normsubscript𝑋𝑡subscript𝑥𝑡subscriptΨ𝑡subscript𝜀1𝑛subscript𝜀21𝛿\|X_{t}-x_{t}\|\leq\sqrt{\Psi_{t}(\varepsilon_{1}n+\varepsilon_{2}\log(1/% \delta))},∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG , (25)

holds with probability at least 1δ1𝛿1-\delta1 - italic_δ, where ΨtsubscriptΨ𝑡\Psi_{t}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is as in (10) and ε1subscript𝜀1\varepsilon_{1}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT,ε2subscript𝜀2\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are given by (21).

Proof.

We start with a special case where Assumption 1 holds with a global matrix measure bound ct=0subscript𝑐𝑡0c_{t}=0italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 0 and then generalize it to cases where Assumption 1 holds with arbitrary ctsubscript𝑐𝑡c_{t}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT.

V-C1 Special Case

Denote vt=Xtxtsubscript𝑣𝑡subscript𝑋𝑡subscript𝑥𝑡v_{t}=X_{t}-x_{t}italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and βt=f(Xt,ut,t)f(xt,ut,t)subscript𝛽𝑡𝑓subscript𝑋𝑡subscript𝑢𝑡𝑡𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡\beta_{t}=f(X_{t},u_{t},t)-f(x_{t},u_{t},t)italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_f ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) - italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ), then

dvt=βtdt+gtdWt.𝑑subscript𝑣𝑡subscript𝛽𝑡𝑑𝑡subscript𝑔𝑡𝑑subscript𝑊𝑡dv_{t}=\beta_{t}dt+g_{t}dW_{t}.italic_d italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_t + italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT . (26)

Based on the Fokker–Planck equation [35], ht=𝔼(Φn,λ(vt))subscript𝑡𝔼subscriptΦ𝑛𝜆subscript𝑣𝑡h_{t}=\mathbb{E}(\Phi_{n,\lambda}(v_{t}))italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = blackboard_E ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) satisfies

dhtdt=𝔼(Φn,λ(vt),βt)+12𝔼(2Φn,λ(vt),gtgt𝖳)𝑑subscript𝑡𝑑𝑡𝔼subscriptΦ𝑛𝜆subscript𝑣𝑡subscript𝛽𝑡12𝔼superscript2subscriptΦ𝑛𝜆subscript𝑣𝑡subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳\frac{dh_{t}}{dt}=\mathbb{E}\left(\langle\nabla\Phi_{n,\lambda}(v_{t}),\beta_{% t}\rangle\right)+\tfrac{1}{2}\mathbb{E}\left(\langle\nabla^{2}\Phi_{n,\lambda}% (v_{t}),g_{t}g_{t}^{\mathsf{T}}\rangle\right)divide start_ARG italic_d italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_d italic_t end_ARG = blackboard_E ( ⟨ ∇ roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ ) + divide start_ARG 1 end_ARG start_ARG 2 end_ARG blackboard_E ( ⟨ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ⟩ ) (27)

By (23),

𝔼(Φn,λ(vt),βt)=𝔼𝔼𝒮n1(eλ,vtλ𝖳βt).𝔼subscriptΦ𝑛𝜆subscript𝑣𝑡subscript𝛽𝑡𝔼subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆subscript𝑣𝑡𝜆superscript𝖳subscript𝛽𝑡\mathbb{E}\left(\langle\nabla\Phi_{n,\lambda}(v_{t}),\beta_{t}\rangle\right)=% \mathbb{E}\,\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\langle\ell,% v_{t}\rangle}\lambda\ell^{\mathsf{T}}\beta_{t}\right).blackboard_E ( ⟨ ∇ roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ ) = blackboard_E blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT italic_λ roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) . (28)

Applying Lemma V.3 with x=Xt𝑥subscript𝑋𝑡x=X_{t}italic_x = italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and y=xt𝑦subscript𝑥𝑡y=x_{t}italic_y = italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, we obtain

𝔼𝒮n1(eλ,vtλ𝖳βt)0.subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆subscript𝑣𝑡𝜆superscript𝖳subscript𝛽𝑡0\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\langle\ell,v_{t}\rangle% }\lambda\ell^{\mathsf{T}}\beta_{t}\right)\leq 0.blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT italic_λ roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ≤ 0 . (29)

Then

𝔼(Φn,λ(vt),βt)0𝔼subscriptΦ𝑛𝜆subscript𝑣𝑡subscript𝛽𝑡0\mathbb{E}\left(\langle\nabla\Phi_{n,\lambda}(v_{t}),\beta_{t}\rangle\right)\leq 0blackboard_E ( ⟨ ∇ roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ ) ≤ 0 (30)

follows by taking the expectation of (29).

The term 12𝔼(2Φn,λ(vt),gtgt𝖳)12𝔼superscript2subscriptΦ𝑛𝜆subscript𝑣𝑡subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳\frac{1}{2}\mathbb{E}\left(\langle\nabla^{2}\Phi_{n,\lambda}(v_{t}),g_{t}g_{t}% ^{\mathsf{T}}\rangle\right)divide start_ARG 1 end_ARG start_ARG 2 end_ARG blackboard_E ( ⟨ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ⟩ ) can be bounded as

12𝔼(2Φn,λ(vt),gtgt𝖳)=12𝔼𝔼𝒮n1(λ2eλ,vt𝖳,gtgt𝖳)12𝔼𝔼𝒮n1(λ2eλ,vttr(𝖳)gtgt𝖳)λ2σt22𝔼(Φn,λ(vt))=λ2σt22ht12𝔼superscript2subscriptΦ𝑛𝜆subscript𝑣𝑡subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳12𝔼subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝜆2superscript𝑒𝜆subscript𝑣𝑡superscript𝖳subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳12𝔼subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝜆2superscript𝑒𝜆subscript𝑣𝑡trsuperscript𝖳delimited-∥∥subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳superscript𝜆2superscriptsubscript𝜎𝑡22𝔼subscriptΦ𝑛𝜆subscript𝑣𝑡superscript𝜆2superscriptsubscript𝜎𝑡22subscript𝑡\begin{split}\tfrac{1}{2}&\mathbb{E}\left(\langle\nabla^{2}\Phi_{n,\lambda}(v_% {t}),g_{t}g_{t}^{\mathsf{T}}\rangle\right)\\ =&\tfrac{1}{2}\mathbb{E}\,\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(\langle% \lambda^{2}e^{\lambda\langle\ell,v_{t}\rangle}\ell\ell^{\mathsf{T}},g_{t}g_{t}% ^{\mathsf{T}}\rangle\right)\\ \leq&\tfrac{1}{2}\mathbb{E}\,\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(% \lambda^{2}e^{\lambda\langle\ell,v_{t}\rangle}\mathrm{tr}(\ell\ell^{\mathsf{T}% })\,\|g_{t}g_{t}^{\mathsf{T}}\|\right)\\ \leq&\frac{\lambda^{2}\sigma_{t}^{2}}{2}\mathbb{E}\left(\Phi_{n,\lambda}(v_{t}% )\right)=\frac{\lambda^{2}\sigma_{t}^{2}}{2}h_{t}\end{split}start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_CELL start_CELL blackboard_E ( ⟨ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ⟩ ) end_CELL end_ROW start_ROW start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 end_ARG blackboard_E blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( ⟨ italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT roman_ℓ roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT , italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ⟩ ) end_CELL end_ROW start_ROW start_CELL ≤ end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 end_ARG blackboard_E blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT roman_tr ( roman_ℓ roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ) ∥ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ∥ ) end_CELL end_ROW start_ROW start_CELL ≤ end_CELL start_CELL divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG blackboard_E ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) = divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW (31)

where the first inequality follows the Cauchy–Schwarz inequality and the last line uses the fact that tr(𝖳)=1trsuperscript𝖳1\mathrm{tr}(\ell\ell^{\mathsf{T}})=1roman_tr ( roman_ℓ roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ) = 1 for any 𝒮n1similar-tosuperscript𝒮𝑛1\ell\sim\mathcal{S}^{n-1}roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT and gtgt𝖳σt2normsubscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳superscriptsubscript𝜎𝑡2\|g_{t}g_{t}^{\mathsf{T}}\|\leq\sigma_{t}^{2}∥ italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ∥ ≤ italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT as in Assumption 1.

Plugging (30) and (31) into (27) we arrive at

dhtdtλ2σt22ht,h0=1,formulae-sequence𝑑subscript𝑡𝑑𝑡superscript𝜆2superscriptsubscript𝜎𝑡22subscript𝑡subscript01\frac{dh_{t}}{dt}\leq\frac{\lambda^{2}\sigma_{t}^{2}}{2}h_{t},\quad h_{0}=1,divide start_ARG italic_d italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_d italic_t end_ARG ≤ divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_h start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1 , (32)

and using the Grönwall inequality [44], we conclude

𝔼(Φn,λ(Xtxt))=hteλ20tστ2𝑑τ2.𝔼subscriptΦ𝑛𝜆subscript𝑋𝑡subscript𝑥𝑡subscript𝑡superscript𝑒superscript𝜆2superscriptsubscript0𝑡superscriptsubscript𝜎𝜏2differential-d𝜏2\mathbb{E}\left(\Phi_{n,\lambda}(X_{t}-x_{t})\right)=h_{t}\leq e^{\frac{% \lambda^{2}\int_{0}^{t}\sigma_{\tau}^{2}d\tau}{2}}.blackboard_E ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) = italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≤ italic_e start_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_τ end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT . (33)

By Lemma V.4, (33) implies that, for δ(0,1)for-all𝛿01\forall\delta\in(0,1)∀ italic_δ ∈ ( 0 , 1 ), with probability at least 1δ1𝛿1-\delta1 - italic_δ,

Xtxt(ε1n+ε2log(1/δ))0tστ2𝑑τ,normsubscript𝑋𝑡subscript𝑥𝑡subscript𝜀1𝑛subscript𝜀21𝛿superscriptsubscript0𝑡superscriptsubscript𝜎𝜏2differential-d𝜏\|X_{t}-x_{t}\|\leq\sqrt{(\varepsilon_{1}n+\varepsilon_{2}\log(1/\delta))\int_% {0}^{t}\sigma_{\tau}^{2}d\tau},∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_τ end_ARG , (34)

where ε1,ε2subscript𝜀1subscript𝜀2\varepsilon_{1},\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT satisfy (21). Since Ψt=0tστ2𝑑τsubscriptΨ𝑡superscriptsubscript0𝑡superscriptsubscript𝜎𝜏2differential-d𝜏\Psi_{t}=\int_{0}^{t}\sigma_{\tau}^{2}d\tauroman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_τ when ct=0subscript𝑐𝑡0c_{t}=0italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 0 by definition, (34) corresponds to (25). This completes the proof in the special case.

V-C2 General Cases

Next we consider the general cases where Assumption 1 holds with μ(Dxf(x,u,t))ct𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡subscript𝑐𝑡\mu(D_{x}f(x,u,t))\leq c_{t}italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT for arbitrary ctsubscript𝑐𝑡c_{t}\in\mathbb{R}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R. The strategy is to convert them into the above special case via scaling. Define scaled trajectories X~t=eψtXtsubscript~𝑋𝑡superscript𝑒subscript𝜓𝑡subscript𝑋𝑡\tilde{X}_{t}=e^{-\psi_{t}}X_{t}over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and x~t=eψtxtsubscript~𝑥𝑡superscript𝑒subscript𝜓𝑡subscript𝑥𝑡\tilde{x}_{t}=e^{-\psi_{t}}x_{t}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT where ψt=0tcτ𝑑τsubscript𝜓𝑡superscriptsubscript0𝑡subscript𝑐𝜏differential-d𝜏\psi_{t}=\int_{0}^{t}c_{\tau}d\tauitalic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT italic_d italic_τ, then x~tsubscript~𝑥𝑡\tilde{x}_{t}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is a trajectory of the deterministic system

x~˙t=ctx~t+eψtf(eψtx~t,ut,t)=:f~(x~t,ut,t),\dot{\tilde{x}}_{t}=-c_{t}\tilde{x}_{t}+e^{-\psi_{t}}f(e^{\psi_{t}}\tilde{x}_{% t},u_{t},t)=:\tilde{f}(\tilde{x}_{t},u_{t},t),over˙ start_ARG over~ start_ARG italic_x end_ARG end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = - italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_f ( italic_e start_POSTSUPERSCRIPT italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) = : over~ start_ARG italic_f end_ARG ( over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) , (35)

Similarly, X~tsubscript~𝑋𝑡\tilde{X}_{t}over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT satisfies

dX~t=f~(X~t,ut,t)dt+eψtgtdWt.𝑑subscript~𝑋𝑡~𝑓subscript~𝑋𝑡subscript𝑢𝑡𝑡𝑑𝑡superscript𝑒subscript𝜓𝑡subscript𝑔𝑡𝑑subscript𝑊𝑡d\tilde{X}_{t}=\tilde{f}(\tilde{X}_{t},u_{t},t)dt+e^{-\psi_{t}}g_{t}dW_{t}.italic_d over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = over~ start_ARG italic_f end_ARG ( over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) italic_d italic_t + italic_e start_POSTSUPERSCRIPT - italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT . (36)

Note that (36) and (35) have the same drift dynamics f~~𝑓\tilde{f}over~ start_ARG italic_f end_ARG. For any x~t,y~tnsubscript~𝑥𝑡subscript~𝑦𝑡superscript𝑛\tilde{x}_{t},\tilde{y}_{t}\in\mathbb{R}^{n}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, f~~𝑓\tilde{f}over~ start_ARG italic_f end_ARG satisfies

(x~ty~t)𝖳(f~(x~t,ut,t)f~(y~t,ut,t))=(x~ty~t)𝖳(ct(x~ty~t)+eψt(f(xt,ut,t)f(yt,ut,t)))=ctx~ty~t2+e2ψt(xtyt)𝖳(f(xt,ut,t)f(yt,ut,t))ctx~ty~t2+e2ψtctxtyt2=ctx~ty~t2+cte2ψte2ψtx~ty~t2=0,superscriptsubscript~𝑥𝑡subscript~𝑦𝑡𝖳~𝑓subscript~𝑥𝑡subscript𝑢𝑡𝑡~𝑓subscript~𝑦𝑡subscript𝑢𝑡𝑡superscriptsubscript~𝑥𝑡subscript~𝑦𝑡𝖳subscript𝑐𝑡subscript~𝑥𝑡subscript~𝑦𝑡superscript𝑒subscript𝜓𝑡𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡𝑓subscript𝑦𝑡subscript𝑢𝑡𝑡subscript𝑐𝑡superscriptdelimited-∥∥subscript~𝑥𝑡subscript~𝑦𝑡2superscript𝑒2subscript𝜓𝑡superscriptsubscript𝑥𝑡subscript𝑦𝑡𝖳𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡𝑓subscript𝑦𝑡subscript𝑢𝑡𝑡subscript𝑐𝑡superscriptdelimited-∥∥subscript~𝑥𝑡subscript~𝑦𝑡2superscript𝑒2subscript𝜓𝑡subscript𝑐𝑡superscriptdelimited-∥∥subscript𝑥𝑡subscript𝑦𝑡2subscript𝑐𝑡superscriptdelimited-∥∥subscript~𝑥𝑡subscript~𝑦𝑡2subscript𝑐𝑡superscript𝑒2subscript𝜓𝑡superscript𝑒2subscript𝜓𝑡superscriptdelimited-∥∥subscript~𝑥𝑡subscript~𝑦𝑡20\begin{split}&(\tilde{x}_{t}-\tilde{y}_{t})^{\mathsf{T}}\left(\tilde{f}(\tilde% {x}_{t},u_{t},t)-\tilde{f}(\tilde{y}_{t},u_{t},t)\right)\\ =&(\tilde{x}_{t}-\tilde{y}_{t})^{\mathsf{T}}\left(-c_{t}(\tilde{x}_{t}-\tilde{% y}_{t})+e^{-\psi_{t}}\left(f(x_{t},u_{t},t)-f(y_{t},u_{t},t)\right)\right)\\ =&-c_{t}\|\tilde{x}_{t}-\tilde{y}_{t}\|^{2}+e^{-2\psi_{t}}(x_{t}-y_{t})^{% \mathsf{T}}\left(f(x_{t},u_{t},t)-f(y_{t},u_{t},t)\right)\\ \leq&-c_{t}\|\tilde{x}_{t}-\tilde{y}_{t}\|^{2}+e^{-2\psi_{t}}c_{t}\|x_{t}-y_{t% }\|^{2}\\ =&-c_{t}\|\tilde{x}_{t}-\tilde{y}_{t}\|^{2}+c_{t}e^{-2\psi_{t}}e^{2\psi_{t}}\|% \tilde{x}_{t}-\tilde{y}_{t}\|^{2}=0,\end{split}start_ROW start_CELL end_CELL start_CELL ( over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( over~ start_ARG italic_f end_ARG ( over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) - over~ start_ARG italic_f end_ARG ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) ) end_CELL end_ROW start_ROW start_CELL = end_CELL start_CELL ( over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( - italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) + italic_e start_POSTSUPERSCRIPT - italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) - italic_f ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) ) ) end_CELL end_ROW start_ROW start_CELL = end_CELL start_CELL - italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) - italic_f ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) ) end_CELL end_ROW start_ROW start_CELL ≤ end_CELL start_CELL - italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL = end_CELL start_CELL - italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0 , end_CELL end_ROW

meaning Assumption 1 holds for scaled systems (35) and (36) with c~t=0subscript~𝑐𝑡0\tilde{c}_{t}=0over~ start_ARG italic_c end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 0 and the results for the special case can be applied. The diffusion coefficient of (36) satisfies e2ψtgtgt𝖳e2ψtσt2=:σ~t2\|e^{-2\psi_{t}}g_{t}g_{t}^{\mathsf{T}}\|\leq e^{-2\psi_{t}}\sigma_{t}^{2}=:% \tilde{\sigma}_{t}^{2}∥ italic_e start_POSTSUPERSCRIPT - 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ∥ ≤ italic_e start_POSTSUPERSCRIPT - 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = : over~ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Applying (33) to the scaled dynamics (36) we have that with probability at least 1δ1𝛿1-\delta1 - italic_δ,

X~tx~t(ε1n+ε2log(1/δ))0tσ~τ2𝑑τ.normsubscript~𝑋𝑡subscript~𝑥𝑡subscript𝜀1𝑛subscript𝜀21𝛿superscriptsubscript0𝑡superscriptsubscript~𝜎𝜏2differential-d𝜏\|\tilde{X}_{t}-\tilde{x}_{t}\|\leq\sqrt{(\varepsilon_{1}n+\varepsilon_{2}\log% (1/\delta))\int_{0}^{t}\tilde{\sigma}_{\tau}^{2}d\tau}.∥ over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT over~ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_τ end_ARG . (37)

Recalling Xt=eψtX~tsubscript𝑋𝑡superscript𝑒subscript𝜓𝑡subscript~𝑋𝑡X_{t}=e^{\psi_{t}}\tilde{X}_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT over~ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, xt=eψtx~tsubscript𝑥𝑡superscript𝑒subscript𝜓𝑡subscript~𝑥𝑡x_{t}=e^{\psi_{t}}\tilde{x}_{t}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, and ΨtsubscriptΨ𝑡\Psi_{t}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in (10), we conclude that with probability at least 1δ1𝛿1-\delta1 - italic_δ,

Xtxt(ε1n+ε2log(1/δ))e2ψt0tστ2e2ψτ𝑑τ=Ψt(ε1n+ε2log(1/δ)),delimited-∥∥subscript𝑋𝑡subscript𝑥𝑡subscript𝜀1𝑛subscript𝜀21𝛿superscript𝑒2subscript𝜓𝑡superscriptsubscript0𝑡superscriptsubscript𝜎𝜏2superscript𝑒2subscript𝜓𝜏differential-d𝜏subscriptΨ𝑡subscript𝜀1𝑛subscript𝜀21𝛿\begin{split}\|X_{t}-x_{t}\|\leq&\sqrt{(\varepsilon_{1}n+\varepsilon_{2}\log(1% /\delta))e^{2\psi_{t}}\int_{0}^{t}\sigma_{\tau}^{2}e^{-2\psi_{\tau}}d\tau}\\ =&\sqrt{\Psi_{t}(\varepsilon_{1}n+\varepsilon_{2}\log(1/\delta))},\end{split}start_ROW start_CELL ∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ end_CELL start_CELL square-root start_ARG ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) italic_e start_POSTSUPERSCRIPT 2 italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - 2 italic_ψ start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_d italic_τ end_ARG end_CELL end_ROW start_ROW start_CELL = end_CELL start_CELL square-root start_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG , end_CELL end_ROW

which completes the proof. ∎

Remark V.1.

When Assumption 1 holds with time-invariant ctcsubscript𝑐𝑡𝑐c_{t}\equiv citalic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≡ italic_c and σtσsubscript𝜎𝑡𝜎\sigma_{t}\equiv\sigmaitalic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≡ italic_σ, ψtsubscript𝜓𝑡\psi_{t}italic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT defined in (10) becomes ψt=ctsubscript𝜓𝑡𝑐𝑡\psi_{t}=ctitalic_ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_c italic_t, and (25) in Theorem 1 reduces to

Xtxtσ2(e2ct1)2c(ε1n+ε2log(1/δ)).normsubscript𝑋𝑡subscript𝑥𝑡superscript𝜎2superscript𝑒2𝑐𝑡12𝑐subscript𝜀1𝑛subscript𝜀21𝛿\|X_{t}-x_{t}\|\leq\sqrt{\frac{\sigma^{2}(e^{2ct}-1)}{2c}(\varepsilon_{1}n+% \varepsilon_{2}\log(1/\delta))}.∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_e start_POSTSUPERSCRIPT 2 italic_c italic_t end_POSTSUPERSCRIPT - 1 ) end_ARG start_ARG 2 italic_c end_ARG ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG . (38)

The probabilistic bound in Theorem 1 highly relies on the contraction rate of the dynamics. The bound (25) and (38) resemble the input-to-state bounds used in contraction-based reachability of deterministic systems [28]. Thus, our results can be viewed as the stochastic counterpart of the deterministic incremental input-to-state bounds in contraction theory.

V-D Extension to Weighted Norm

The probabilistic bound of the stochastic deviation Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ in Theorem 1 can be extended to bound the weighted deviation XtxtPsubscriptnormsubscript𝑋𝑡subscript𝑥𝑡𝑃\|X_{t}-x_{t}\|_{P}∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT for any positive-definite matrix P𝑃Pitalic_P. To this end, define the weighted matrix measure of a matrix A𝐴Aitalic_A as

μP(A)=limϵ0+In+ϵAP1ϵ,subscript𝜇𝑃𝐴subscriptitalic-ϵsuperscript0subscriptnormsubscript𝐼𝑛italic-ϵ𝐴𝑃1italic-ϵ\mu_{P}(A)=\lim_{\epsilon\to 0^{+}}\frac{\|I_{n}+\epsilon A\|_{P}-1}{\epsilon},italic_μ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_A ) = roman_lim start_POSTSUBSCRIPT italic_ϵ → 0 start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT end_POSTSUBSCRIPT divide start_ARG ∥ italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_ϵ italic_A ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT - 1 end_ARG start_ARG italic_ϵ end_ARG ,

which can be obtained using the expression μP(A)=μ(P12AP12)subscript𝜇𝑃𝐴𝜇superscript𝑃12𝐴superscript𝑃12\mu_{P}(A)=\mu(P^{\frac{1}{2}}AP^{-\frac{1}{2}})italic_μ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_A ) = italic_μ ( italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_A italic_P start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) [4]. Consider the systems (3) and (4) satisfying a modified version of Assumption 1 as μP(Dxf(x,u,t))ctsubscript𝜇𝑃subscript𝐷𝑥𝑓𝑥𝑢𝑡subscript𝑐𝑡\mu_{P}(D_{x}f(x,u,t))\leq c_{t}italic_μ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and P12gtgt𝖳P12σt2Inprecedes-or-equalssuperscript𝑃12subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳superscript𝑃12superscriptsubscript𝜎𝑡2subscript𝐼𝑛P^{\frac{1}{2}}g_{t}g_{t}^{\mathsf{T}}P^{\frac{1}{2}}\preceq\sigma_{t}^{2}I_{n}italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ⪯ italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

This setting with weighted norm can be converted to the unweighted version in Section V-C through a coordinate transformation. More specifically, given associated trajectories Xt,xtsubscript𝑋𝑡subscript𝑥𝑡X_{t},x_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of (3) and (4), define X^t=P12Xtsubscript^𝑋𝑡superscript𝑃12subscript𝑋𝑡\hat{X}_{t}=P^{\frac{1}{2}}X_{t}over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, x^t=P12xtsubscript^𝑥𝑡superscript𝑃12subscript𝑥𝑡\hat{x}_{t}=P^{\frac{1}{2}}x_{t}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, then Xt^^subscript𝑋𝑡\hat{X_{t}}over^ start_ARG italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG and x^tsubscript^𝑥𝑡\hat{x}_{t}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT satisfy

dX^t𝑑subscript^𝑋𝑡\displaystyle d\hat{X}_{t}italic_d over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =f^(X^t,ut,t)dt+g^tdWt,absent^𝑓subscript^𝑋𝑡subscript𝑢𝑡𝑡𝑑𝑡subscript^𝑔𝑡𝑑subscript𝑊𝑡\displaystyle=\hat{f}(\hat{X}_{t},u_{t},t)dt+\hat{g}_{t}dW_{t},= over^ start_ARG italic_f end_ARG ( over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) italic_d italic_t + over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , (39)
x^˙tsubscript˙^𝑥𝑡\displaystyle\dot{\hat{x}}_{t}over˙ start_ARG over^ start_ARG italic_x end_ARG end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =f^(x^t,u,t)absent^𝑓subscript^𝑥𝑡𝑢𝑡\displaystyle=\hat{f}(\hat{x}_{t},u,t)= over^ start_ARG italic_f end_ARG ( over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u , italic_t ) (40)

with f^(x^)=P12f(P12x^)^𝑓^𝑥superscript𝑃12𝑓superscript𝑃12^𝑥\hat{f}(\hat{x})=P^{\frac{1}{2}}f(P^{-\frac{1}{2}}\hat{x})over^ start_ARG italic_f end_ARG ( over^ start_ARG italic_x end_ARG ) = italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_f ( italic_P start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_x end_ARG ) and g^t=P12gtsubscript^𝑔𝑡superscript𝑃12subscript𝑔𝑡\hat{g}_{t}=P^{\frac{1}{2}}g_{t}over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. By definition, μP(A)=μ(P12AP12)subscript𝜇𝑃𝐴𝜇superscript𝑃12𝐴superscript𝑃12\mu_{P}(A)=\mu(P^{\frac{1}{2}}AP^{-\frac{1}{2}})italic_μ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_A ) = italic_μ ( italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_A italic_P start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) for any matrix A𝐴Aitalic_A, thus μ(Dx^f^(x^,u,t))=μP(Dxf(x,u,t))ct𝜇subscript𝐷^𝑥^𝑓^𝑥𝑢𝑡subscript𝜇𝑃subscript𝐷𝑥𝑓𝑥𝑢𝑡subscript𝑐𝑡\mu(D_{\hat{x}}\hat{f}(\hat{x},u,t))=\mu_{P}(D_{x}f(x,u,t))\leq c_{t}italic_μ ( italic_D start_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG end_POSTSUBSCRIPT over^ start_ARG italic_f end_ARG ( over^ start_ARG italic_x end_ARG , italic_u , italic_t ) ) = italic_μ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. Moreover, g^tg^t𝖳=P12gtgt𝖳P12σt2Insubscript^𝑔𝑡superscriptsubscript^𝑔𝑡𝖳superscript𝑃12subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳superscript𝑃12precedes-or-equalssuperscriptsubscript𝜎𝑡2subscript𝐼𝑛\hat{g}_{t}\hat{g}_{t}^{\mathsf{T}}=P^{\frac{1}{2}}g_{t}g_{t}^{\mathsf{T}}P^{% \frac{1}{2}}\preceq\sigma_{t}^{2}I_{n}over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT = italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ⪯ italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Therefore, the systems (39) and (40) satisfy Assumption 1 with the standrad 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT-norm. Then, by Theorem 1, with probability at least 1δ1𝛿1-\delta1 - italic_δ,

XtxtP=X^tx^tΨt(ε1n+ε2log(1/δ)).subscriptnormsubscript𝑋𝑡subscript𝑥𝑡𝑃normsubscript^𝑋𝑡subscript^𝑥𝑡subscriptΨ𝑡subscript𝜀1𝑛subscript𝜀21𝛿\|X_{t}-x_{t}\|_{P}=\|\hat{X}_{t}-\hat{x}_{t}\|\leq\sqrt{\Psi_{t}(\varepsilon_% {1}n+\varepsilon_{2}\log(1/\delta))}.∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT = ∥ over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG . (41)

This extension for weighted norm can sometimes be advantageous to establish a tighter bound. Given a matrix A𝐴Aitalic_A, μ(A)𝜇𝐴\mu(A)italic_μ ( italic_A ) can be much larger than the real parts of the eigenvalues of A𝐴Aitalic_A. In contrast, with a proper positive-definite matrix P𝑃Pitalic_P, μ(P12AP12)𝜇superscript𝑃12𝐴superscript𝑃12\mu(P^{\frac{1}{2}}AP^{-\frac{1}{2}})italic_μ ( italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_A italic_P start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) can be made arbitrarily close to the real parts of the eigenvalues of A𝐴Aitalic_A [4, Chapter 2.7]. In this circumstance, working with the weighted norm can lead to sharper results.

V-E Tightness of Probabilistic Bound

Finally, we show that the probabilistic bound in Theorem 1 is tight under Assumption 1 and it is impossible to achieve better probabilistic bounds than (25) without additional assumptions. In particular, we show that the bound (25) precisely captures the stochastic deviation of linear systems satisfying Assumption 1.

Consider the LTI stochastic system (12) and its associated deterministic system (13). They satisfy Assumption 1 with ctc=μ(A)subscript𝑐𝑡𝑐𝜇𝐴c_{t}\equiv c=\mu(A)italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≡ italic_c = italic_μ ( italic_A ) and σtσsubscript𝜎𝑡𝜎\sigma_{t}\equiv\sigmaitalic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≡ italic_σ. By Theorem 1, with probability at least 1δ1𝛿1-\delta1 - italic_δ,

Xtxtσ2(e2ct1)2c(ε1n+ε2log(1/δ)).normsubscript𝑋𝑡subscript𝑥𝑡superscript𝜎2superscript𝑒2𝑐𝑡12𝑐subscript𝜀1𝑛subscript𝜀21𝛿\|X_{t}-x_{t}\|\leq\sqrt{\frac{\sigma^{2}(e^{2ct}-1)}{2c}(\varepsilon_{1}n+% \varepsilon_{2}\log(1/\delta))}.∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ ≤ square-root start_ARG divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_e start_POSTSUPERSCRIPT 2 italic_c italic_t end_POSTSUPERSCRIPT - 1 ) end_ARG start_ARG 2 italic_c end_ARG ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG . (42)

This is the same, up to some constants chosen by convention, as the tight bound (18) calculated using Gaussian concentration properties [45, Chapter 4.4]. For Assumption 1 with time-varying ctsubscript𝑐𝑡c_{t}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and σtsubscript𝜎𝑡\sigma_{t}italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, we can construct linear dynamics

dXt𝑑subscript𝑋𝑡\displaystyle dX_{t}italic_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =\displaystyle== ctXtdt+σtdWtsubscript𝑐𝑡subscript𝑋𝑡𝑑𝑡subscript𝜎𝑡𝑑subscript𝑊𝑡\displaystyle c_{t}X_{t}dt+\sigma_{t}dW_{t}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_t + italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
x˙tsubscript˙𝑥𝑡\displaystyle\dot{x}_{t}over˙ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =\displaystyle== ctxt.subscript𝑐𝑡subscript𝑥𝑡\displaystyle c_{t}x_{t}.italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

With the same initial condition X0=x0subscript𝑋0subscript𝑥0X_{0}=x_{0}italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, Xtxtsubscript𝑋𝑡subscript𝑥𝑡X_{t}-x_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is a zero-mean Gaussian random variance with covariance ΨtInsubscriptΨ𝑡subscript𝐼𝑛\Psi_{t}I_{n}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT where ΨtsubscriptΨ𝑡\Psi_{t}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is as in (10). The Gaussianity of Xtxtsubscript𝑋𝑡subscript𝑥𝑡X_{t}-x_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT leads to the same probabilistic bound as (25). Therefore, Theorem 1 is tight and cannot be improved.

While in this work the probabilistic bound in Theorem 1 is designed for reachability analysis, we emphasize that this very bound is one of the first non-conservative results that can quantitatively describe the behavior of a stochastic system. The bound is of independent interests beyond reachability analysis and can potentially impact many other areas such as estimation, uncertainty quantification, finance, etc.

VI Probabilistic Reachable Set

Equipped with the probabilistic bound (25) for the stochastic deviation, we are ready to present our approach to approximating the δ𝛿\deltaitalic_δ-PRS of a general nonlinear stochastic system (3). Recalling the separation strategy in Proposition 1, we can combine our tight bound (25) with any existing methods for approximating the DRS of the associated deterministic system (4) to estimate the δ𝛿\deltaitalic_δ-PRS of (3).

Theorem 2.

Consider the stochastic system (3) with initial set 𝒳0nsubscript𝒳0superscript𝑛\mathcal{X}_{0}\subseteq\mathbb{R}^{n}caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and input set 𝒰p𝒰superscript𝑝\mathcal{U}\subseteq\mathbb{R}^{p}caligraphic_U ⊆ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT. Suppose Assumption 1 holds. Let ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be an over-approximation of the DRS of the associated deterministic system (4). Then, for any probability level δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ), a δ𝛿\deltaitalic_δ-PRS of (3) is

δ,t=¯tn(rδ,t,0),subscript𝛿𝑡direct-sumsubscript¯𝑡superscript𝑛subscript𝑟𝛿𝑡0\mathcal{R}_{\delta,t}=\overline{\mathcal{R}}_{t}\oplus\mathcal{B}^{n}\left(r_% {\delta,t},0\right),caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT = over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) , (43)

where rδ,t=Ψt(ε1n+ε2log(1/δ))subscript𝑟𝛿𝑡subscriptΨ𝑡subscript𝜀1𝑛subscript𝜀21𝛿r_{\delta,t}=\sqrt{\Psi_{t}(\varepsilon_{1}n+\varepsilon_{2}\log(1/\delta))}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT = square-root start_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG with ΨtsubscriptΨ𝑡\Psi_{t}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in (10) and ε1,ε2subscript𝜀1subscript𝜀2\varepsilon_{1},\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in (21).

Proof.

The result follows by replacing rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT in Proposition 1 by (25) in Theorem 1. ∎

Theorem 2 is a paradigm shift and essentially reduces the probabilistic reachability problem into a widely studied deterministic reachability problem. To compute the δ𝛿\deltaitalic_δ-PRS (43) for the stochastic system (3), one only needs to over-approximate the DRS for the deterministic system (4). Theoretically speaking, the δ𝛿\deltaitalic_δ-PRS in Theorem 2 is tight and cannot be improved further without additional assumptions. From a practical point of view, by combining the tight high probability bounds on stochastic deviation in Theorem 1 with the scalable deterministic reachability frameworks [3, 6, 7], the δ𝛿\deltaitalic_δ-PRS in Theorem 2 can be computed efficiently for high-dimensional systems.

Tightness. To be more precise, replacing ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT by tsubscript𝑡\mathcal{R}_{t}caligraphic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in (43) gives a tight δ𝛿\deltaitalic_δ-PRS. First, the probabilistic bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT is tight provided the coefficients ct,σtsubscript𝑐𝑡subscript𝜎𝑡c_{t},\sigma_{t}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in Assumption 1 is tight. Moreover, since the deterministic input and stochastic disturbance in (3) affects the δ𝛿\deltaitalic_δ-PRS in Definition III.1 in an independent manner, the separation strategy (Proposition 1) is also tight, meaning the decomposition in Proposition 1 is necessary. Thus, the tightness of δ,tsubscript𝛿𝑡\mathcal{R}_{\delta,t}caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT in Theorem 2 depends only on the tightness of the over-approximation ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT of the DRS of the associated deterministic system (4). It becomes tighter as ¯ttsubscript¯𝑡subscript𝑡\overline{\mathcal{R}}_{t}\to\mathcal{R}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT → caligraphic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT.

Computational complexity. The computational cost of (43) comes from two sources: computing ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and realizing the Minkowski sum direct-sum\oplus. The former depends on the choice of algorithms for approximating DRS. Computing the Minkowski sum in a parametrized form is challenging and efficient algorithms are only available for ellipsoids and polyhedral [46, 47, 48]. Fortunately, a parametrized Minkowski sum is not needed for reachability analysis. In practice, we only need an efficient membership oracle to determine whether a point x𝑥xitalic_x belongs to the Minkowski sum, which is an easier task. In particular, for (43), this oracle requires comparing miny¯tyxsubscript𝑦subscript¯𝑡norm𝑦𝑥\min_{y\in\overline{\mathcal{R}}_{t}}\|y-x\|roman_min start_POSTSUBSCRIPT italic_y ∈ over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_y - italic_x ∥ and rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT, which is a convex optimization when ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is convex. In the following section, we exemplify our framework with two popular methods for computing ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. These methods are scalable and result in convex ¯tsubscript¯𝑡\overline{\mathcal{R}}_{t}over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, rendering efficient algorithms for probabilistic reachability analysis.

Extension to weighted norm. Similar to stochastic deviation, Theorem 2 is also extendable to the case for P𝑃Pitalic_P-weighted 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm. Consider the modified assumption as shown in Section V-D. Following the proof of Proposition 1 while substituting n(rδ,t,0)superscript𝑛subscript𝑟𝛿𝑡0\mathcal{B}^{n}(r_{\delta,t},0)caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) by Pn(rδ,t,0)superscriptsubscript𝑃𝑛subscript𝑟𝛿𝑡0\mathcal{B}_{P}^{n}(r_{\delta,t},0)caligraphic_B start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ), where Pn(rδ,t,0)={xn:xPrδ,t}superscriptsubscript𝑃𝑛subscript𝑟𝛿𝑡0conditional-set𝑥superscript𝑛subscriptnorm𝑥𝑃subscript𝑟𝛿𝑡\mathcal{B}_{P}^{n}(r_{\delta,t},0)=\{x\in\mathbb{R}^{n}:~{}\|x\|_{P}\leq r_{% \delta,t}\}caligraphic_B start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : ∥ italic_x ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ≤ italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT } is an ellipsoid, we conclude that

δ,t=¯tPn(rδ,t,0)subscript𝛿𝑡direct-sumsubscript¯𝑡superscriptsubscript𝑃𝑛subscript𝑟𝛿𝑡0\mathcal{R}_{\delta,t}=\overline{\mathcal{R}}_{t}\oplus\mathcal{B}_{P}^{n}(r_{% \delta,t},0)caligraphic_R start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT = over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⊕ caligraphic_B start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 )

is a δ𝛿\deltaitalic_δ-PRS of the system (3).

VII Case study of Probabilistic Reachability

In this section, we present the application of δ𝛿\deltaitalic_δ-PRS derived in Section VI in two case studies where contraction-based and interval-based methods are used to approximate tsubscript𝑡\mathcal{R}_{t}caligraphic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT.

VII-A Contraction-based Probabilistic Reachability

Contraction theory is a classical framework for analyzing the stability of dynamical systems using the incremental distance between their trajectories [29, 31]. Traditionally, it is employed to infer strong robustness properties of dynamical systems. Recently, contraction theory has emerged as a computationally efficient tool for reachability analysis of deterministic systems. The contraction-based method relies on the matrix measure (Definition II.2) and the following assumption.

Assumption 2.

For the deterministic system (4), there exist constants c,ρ𝑐𝜌absentc,\rho\in\realitalic_c , italic_ρ ∈ such that, for every t,x,u×0×n𝒰t,x,u\in{}_{\geq 0}\times{}^{n}\times\mathcal{U}italic_t , italic_x , italic_u ∈ start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT × start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT × caligraphic_U,

  1. (i)

    μ𝕏(Dxf(x,u,t))csubscript𝜇𝕏subscript𝐷𝑥𝑓𝑥𝑢𝑡𝑐\mu_{\mathbb{X}}(D_{x}f(x,u,t))\leq citalic_μ start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ italic_c, and

  2. (ii)

    Duf(x,u,t)𝕏,𝕌ρsubscriptnormsubscript𝐷𝑢𝑓𝑥𝑢𝑡𝕏𝕌𝜌\|D_{u}f(x,u,t)\|_{\mathbb{X},\mathbb{U}}\leq\rho∥ italic_D start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ∥ start_POSTSUBSCRIPT blackboard_X , blackboard_U end_POSTSUBSCRIPT ≤ italic_ρ.

Here μ𝕏subscript𝜇𝕏\mu_{\mathbb{X}}italic_μ start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT is the matrix measure with respect to the norm 𝕏\|\cdot\|_{\mathbb{X}}∥ ⋅ ∥ start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT on n and 𝕏,𝕌\|\cdot\|_{\mathbb{X},\mathbb{U}}∥ ⋅ ∥ start_POSTSUBSCRIPT blackboard_X , blackboard_U end_POSTSUBSCRIPT denotes the induced norm on p×n. The norm 𝕏\|\cdot\|_{\mathbb{X}}∥ ⋅ ∥ start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT can be chosen differently from the Euclidean norm in general to ensure the tightest possible reachable set. Suppose that system (4) satisfies Assumption 2 and let txtmaps-to𝑡subscriptsuperscript𝑥𝑡t\mapsto x^{*}_{t}italic_t ↦ italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be a trajectory of (4) with the input tutmaps-to𝑡subscriptsuperscript𝑢𝑡t\mapsto u^{*}_{t}italic_t ↦ italic_u start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. Given initial configuration x0𝒳0=𝕏(r1,x0)subscript𝑥0subscript𝒳0subscript𝕏subscript𝑟1subscriptsuperscript𝑥0x_{0}\in\mathcal{X}_{0}=\mathcal{B}_{{\mathbb{X}}}(r_{1},x^{*}_{0})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = caligraphic_B start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) for r1>0subscript𝑟10r_{1}>0italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0 and input ut𝕌(r2,ut)𝒰subscript𝑢𝑡subscript𝕌subscript𝑟2subscriptsuperscript𝑢𝑡𝒰u_{t}\in\mathcal{B}_{{\mathbb{U}}}(r_{2},u^{*}_{t})\subset\mathcal{U}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ caligraphic_B start_POSTSUBSCRIPT blackboard_U end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ⊂ caligraphic_U for r2>0subscript𝑟20r_{2}>0italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0, the contraction-based method gives the following over-approximation of reachable sets of (4[28]

¯t=𝕏(ectr1+ρc(ect1)r2,xt).subscript¯𝑡subscript𝕏superscript𝑒𝑐𝑡subscript𝑟1𝜌𝑐superscript𝑒𝑐𝑡1subscript𝑟2subscriptsuperscript𝑥𝑡\overline{\mathcal{R}}_{t}=\mathcal{B}_{{\mathbb{X}}}(e^{ct}r_{1}+\tfrac{\rho}% {c}(e^{ct}-1)r_{2},x^{*}_{t}).over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = caligraphic_B start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_c italic_t end_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + divide start_ARG italic_ρ end_ARG start_ARG italic_c end_ARG ( italic_e start_POSTSUPERSCRIPT italic_c italic_t end_POSTSUPERSCRIPT - 1 ) italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) . (44)

The contraction-based over-approximation of reachable sets in (44) can be combined with Theorem 2 to estimate a δ𝛿\deltaitalic_δ-PRS of the system (3).

Proposition 2 (Contraction-based reachability).

Consider the stochastic system (3) and its associated deterministic system (4) satisfying Assumptions 1 and 2. Let txtmaps-to𝑡subscriptsuperscript𝑥𝑡t\mapsto x^{*}_{t}italic_t ↦ italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be a trajectory of (4) with the input tutmaps-to𝑡subscriptsuperscript𝑢𝑡t\mapsto u^{*}_{t}italic_t ↦ italic_u start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and tXtmaps-to𝑡subscript𝑋𝑡t\mapsto X_{t}italic_t ↦ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be a trajectory of the stochastic system (3) starting from x0𝕏(r1,x0)subscript𝑥0subscript𝕏subscript𝑟1subscriptsuperscript𝑥0x_{0}\in\mathcal{B}_{{\mathbb{X}}}(r_{1},x^{*}_{0})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_B start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) with an input ut:0𝕌(r2,ut)u_{t}:{}_{\geq 0}\to\mathcal{B}_{{\mathbb{U}}}(r_{2},u^{*}_{t})italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT → caligraphic_B start_POSTSUBSCRIPT blackboard_U end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ). Then, for every t0𝑡0t\geq 0italic_t ≥ 0, with probability at least 1δ1𝛿1-\delta1 - italic_δ,

Xt𝕏(ectr1+ρc(ect1)r2,xt)n(rδ,t,0)subscript𝑋𝑡direct-sumsubscript𝕏superscript𝑒𝑐𝑡subscript𝑟1𝜌𝑐superscript𝑒𝑐𝑡1subscript𝑟2subscriptsuperscript𝑥𝑡superscript𝑛subscript𝑟𝛿𝑡0\displaystyle X_{t}\in\mathcal{B}_{{\mathbb{X}}}(e^{ct}r_{1}+\tfrac{\rho}{c}(e% ^{ct}-1)r_{2},x^{*}_{t})\oplus\mathcal{B}^{n}(r_{\delta,t},0)italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ caligraphic_B start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_c italic_t end_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + divide start_ARG italic_ρ end_ARG start_ARG italic_c end_ARG ( italic_e start_POSTSUPERSCRIPT italic_c italic_t end_POSTSUPERSCRIPT - 1 ) italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 )

where rδ,t=Ψt(ε1n+ε2log(1/δ))subscript𝑟𝛿𝑡subscriptΨ𝑡subscript𝜀1𝑛subscript𝜀21𝛿r_{\delta,t}=\sqrt{\Psi_{t}(\varepsilon_{1}n+\varepsilon_{2}\log(1/\delta))}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT = square-root start_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG, ΨtsubscriptΨ𝑡\Psi_{t}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is as in (10), and ε1subscript𝜀1\varepsilon_{1}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT,ε2subscript𝜀2\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are given by (21).

Proof.

The result follows by combining Theorem 2 and the contraction-based over-approximation of the reachable set of system (4) in (44). ∎

VII-B Interval-based Probabilistic Reachability

Interval analysis is a framework for estimating propagation of uncertainties by computing function bounds [49] and has been successfully used for reachability analysis of deterministic systems. The main idea of interval-based reachability is to embed the dynamical system into a higher dimensional space using a suitable inclusion function. The map [𝖥¯𝖥¯]:×2n×2p02n\left[\begin{smallmatrix}\underline{\mathsf{F}}\\ \overline{\mathsf{F}}\end{smallmatrix}\right]:{}^{2n}\times{}^{2p}\times% \mathbb{R}_{\geq 0}\to{}^{2n}[ start_ROW start_CELL under¯ start_ARG sansserif_F end_ARG end_CELL end_ROW start_ROW start_CELL over¯ start_ARG sansserif_F end_ARG end_CELL end_ROW ] : start_FLOATSUPERSCRIPT 2 italic_n end_FLOATSUPERSCRIPT × start_FLOATSUPERSCRIPT 2 italic_p end_FLOATSUPERSCRIPT × blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT → start_FLOATSUPERSCRIPT 2 italic_n end_FLOATSUPERSCRIPT is an inclusion function for f𝑓fitalic_f, if, for every z,w[x¯,x¯]×[u¯,u¯]𝑧𝑤¯𝑥¯𝑥¯𝑢¯𝑢z,w\in[\underline{x},\overline{x}]\times[\underline{u},\overline{u}]italic_z , italic_w ∈ [ under¯ start_ARG italic_x end_ARG , over¯ start_ARG italic_x end_ARG ] × [ under¯ start_ARG italic_u end_ARG , over¯ start_ARG italic_u end_ARG ] and every t0𝑡0t\geq 0italic_t ≥ 0,

𝖥¯(x¯,x¯,u¯,u¯,t)f(z,w,t)𝖥¯(x¯,x¯,u¯,u¯,t).¯𝖥¯𝑥¯𝑥¯𝑢¯𝑢𝑡𝑓𝑧𝑤𝑡¯𝖥¯𝑥¯𝑥¯𝑢¯𝑢𝑡\displaystyle\underline{\mathsf{F}}(\underline{x},\overline{x},\underline{u},% \overline{u},t)\leq f(z,w,t)\leq\overline{\mathsf{F}}(\underline{x},\overline{% x},\underline{u},\overline{u},t).under¯ start_ARG sansserif_F end_ARG ( under¯ start_ARG italic_x end_ARG , over¯ start_ARG italic_x end_ARG , under¯ start_ARG italic_u end_ARG , over¯ start_ARG italic_u end_ARG , italic_t ) ≤ italic_f ( italic_z , italic_w , italic_t ) ≤ over¯ start_ARG sansserif_F end_ARG ( under¯ start_ARG italic_x end_ARG , over¯ start_ARG italic_x end_ARG , under¯ start_ARG italic_u end_ARG , over¯ start_ARG italic_u end_ARG , italic_t ) .

Many automated approaches exist for finding an inclusion function for f𝑓fitalic_f. We refer to [8, Section IV.B] for a detailed discussion on these approaches and to [50] for a toolbox for computing inclusion functions.

Given an interval initial configuration 𝒳0=[x¯0,x¯0]subscript𝒳0subscript¯𝑥0subscript¯𝑥0\mathcal{X}_{0}=[\underline{x}_{0},\overline{x}_{0}]caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] and an interval input set 𝒰=[u¯,u¯]𝒰¯𝑢¯𝑢\mathcal{U}=[\underline{u},\overline{u}]caligraphic_U = [ under¯ start_ARG italic_u end_ARG , over¯ start_ARG italic_u end_ARG ], the embedding system of (4) associated with the inclusion function 𝖥𝖥\mathsf{F}sansserif_F is given by

[x¯˙x¯˙]=[𝖥¯(x¯,x¯,u¯,u¯,t)𝖥¯(x¯,x¯,u¯,u¯,t)].matrix˙¯𝑥˙¯𝑥matrix¯𝖥¯𝑥¯𝑥¯𝑢¯𝑢𝑡¯𝖥¯𝑥¯𝑥¯𝑢¯𝑢𝑡\displaystyle\begin{bmatrix}\dot{\underline{x}}\\ \dot{\overline{x}}\end{bmatrix}=\begin{bmatrix}\underline{\mathsf{F}}(% \underline{x},\overline{x},\underline{u},\overline{u},t)\\ \overline{\mathsf{F}}(\underline{x},\overline{x},\underline{u},\overline{u},t)% \end{bmatrix}.[ start_ARG start_ROW start_CELL over˙ start_ARG under¯ start_ARG italic_x end_ARG end_ARG end_CELL end_ROW start_ROW start_CELL over˙ start_ARG over¯ start_ARG italic_x end_ARG end_ARG end_CELL end_ROW end_ARG ] = [ start_ARG start_ROW start_CELL under¯ start_ARG sansserif_F end_ARG ( under¯ start_ARG italic_x end_ARG , over¯ start_ARG italic_x end_ARG , under¯ start_ARG italic_u end_ARG , over¯ start_ARG italic_u end_ARG , italic_t ) end_CELL end_ROW start_ROW start_CELL over¯ start_ARG sansserif_F end_ARG ( under¯ start_ARG italic_x end_ARG , over¯ start_ARG italic_x end_ARG , under¯ start_ARG italic_u end_ARG , over¯ start_ARG italic_u end_ARG , italic_t ) end_CELL end_ROW end_ARG ] . (45)

Let [x¯tx¯t]delimited-[]subscript¯𝑥𝑡subscript¯𝑥𝑡\left[\begin{smallmatrix}\underline{x}_{t}\\ \overline{x}_{t}\end{smallmatrix}\right][ start_ROW start_CELL under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW ] be the trajectory of the embedding system (45) starting from [x¯0x¯0]delimited-[]subscript¯𝑥0subscript¯𝑥0\left[\begin{smallmatrix}\underline{x}_{0}\\ \overline{x}_{0}\end{smallmatrix}\right][ start_ROW start_CELL under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL end_ROW ]. Then, an over-approximation of the deterministic reachable set of (4) is [8, Proposition 5]

¯t=[x¯t,x¯t].subscript¯𝑡subscript¯𝑥𝑡subscript¯𝑥𝑡\displaystyle\overline{\mathcal{R}}_{t}=[\underline{x}_{t},\overline{x}_{t}].over¯ start_ARG caligraphic_R end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] . (46)

This interval-based over-approximation of reachable sets can be combined with Theorem 2 to estimate a δ𝛿\deltaitalic_δ-PRS of the system (3).

Proposition 3 (Interval-based reachability).

Consider the stochastic system (3) and its associated deterministic system (4) satisfying Assumption 1. Let tXtmaps-to𝑡subscript𝑋𝑡t\mapsto X_{t}italic_t ↦ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be a trajectory of the stochastic system (3) starting from x0[x¯0,x¯0]subscript𝑥0subscript¯𝑥0subscript¯𝑥0x_{0}\in[\underline{x}_{0},\overline{x}_{0}]italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ [ under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] with an input curve ut:0[u¯,u¯]u_{t}:{}_{\geq 0}\to[\underline{u},\overline{u}]italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT → [ under¯ start_ARG italic_u end_ARG , over¯ start_ARG italic_u end_ARG ]. Suppose that 𝖥=[𝖥¯𝖥¯]𝖥delimited-[]¯𝖥¯𝖥\mathsf{F}=\left[\begin{smallmatrix}\underline{\mathsf{F}}\\ \overline{\mathsf{F}}\end{smallmatrix}\right]sansserif_F = [ start_ROW start_CELL under¯ start_ARG sansserif_F end_ARG end_CELL end_ROW start_ROW start_CELL over¯ start_ARG sansserif_F end_ARG end_CELL end_ROW ] is an inclusion function for f𝑓fitalic_f and [x¯tx¯t]delimited-[]subscript¯𝑥𝑡subscript¯𝑥𝑡\left[\begin{smallmatrix}\underline{x}_{t}\\ \overline{x}_{t}\end{smallmatrix}\right][ start_ROW start_CELL under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW ] is the trajectory of the embedding system (45) starting from [x¯0x¯0]delimited-[]subscript¯𝑥0subscript¯𝑥0\left[\begin{smallmatrix}\underline{x}_{0}\\ \overline{x}_{0}\end{smallmatrix}\right][ start_ROW start_CELL under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL end_ROW ]. Then, for every t0𝑡0t\geq 0italic_t ≥ 0, with probability at least 1δ1𝛿1-\delta1 - italic_δ

Xt[x¯t,x¯t]n(rδ,t,0),subscript𝑋𝑡direct-sumsubscript¯𝑥𝑡subscript¯𝑥𝑡superscript𝑛subscript𝑟𝛿𝑡0\displaystyle X_{t}\in[\underline{x}_{t},\overline{x}_{t}]\oplus\mathcal{B}^{n% }(r_{\delta,t},0),italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ [ under¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] ⊕ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT , 0 ) ,

where rδ,t=Ψt(ε1n+ε2log(1/δ))subscript𝑟𝛿𝑡subscriptΨ𝑡subscript𝜀1𝑛subscript𝜀21𝛿r_{\delta,t}=\sqrt{\Psi_{t}(\varepsilon_{1}n+\varepsilon_{2}\log(1/\delta))}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT = square-root start_ARG roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_log ( 1 / italic_δ ) ) end_ARG, ΨtsubscriptΨ𝑡\Psi_{t}roman_Ψ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is as in (10), ε1subscript𝜀1\varepsilon_{1}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT,ε2subscript𝜀2\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are given by (21).

Proof.

The result follows by combining Theorem 2 and the interval over-approximation of the reachable set of the deterministic system (4) in (46). ∎

Refer to caption
((a)) Trajectories of Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
Refer to caption
((b)) Trajectories of Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥
Figure 3: Probabilistic bound of stochastic deviation for a linear system. In Figure 3(a), each curve represents an independent trajectory of Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. In Figure 3(b), each solid curve is an independent trajectory of Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥. The blue envelope and the blue dashed curve correspond to our bound (25).
Refer to caption
((a)) rδ,t2superscriptsubscript𝑟𝛿𝑡2r_{\delta,t}^{2}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT w.r.t. 1/δ1𝛿1/\delta1 / italic_δ
Refer to caption
((b)) rδ,t2superscriptsubscript𝑟𝛿𝑡2r_{\delta,t}^{2}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT w.r.t. n𝑛nitalic_n
Figure 4: Illustration of the tightness of rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT w.r.t. δ,n𝛿𝑛\delta,nitalic_δ , italic_n. In Figure 4(a), the solid line shows the dependence of rδ,t2superscriptsubscript𝑟𝛿𝑡2r_{\delta,t}^{2}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over 1/δ1𝛿1/\delta1 / italic_δ and the dotted line in the same color is the corresponding simulated bound r^δ,t2superscriptsubscript^𝑟𝛿𝑡2\hat{r}_{\delta,t}^{2}over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. In Figure 4(b), the solid line shows the dependence of rδ,t2superscriptsubscript𝑟𝛿𝑡2r_{\delta,t}^{2}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over n𝑛nitalic_n, and the dotted line in the same color is the corresponding simulated r^δ,t2superscriptsubscript^𝑟𝛿𝑡2\hat{r}_{\delta,t}^{2}over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT.

VIII Numerical experiments

In this section, we present several examples to illustrate the efficacy of our framework and the tightness of our results.

VIII-A Linear Example

We first consider a linear example to validate the tightness of our bound (25) on the stochastic deviation. Consider a simple linear dynamics

dXt=0.4InXtdt+2dWt=AXtdt+σdWt,𝑑subscript𝑋𝑡0.4subscript𝐼𝑛subscript𝑋𝑡𝑑𝑡2𝑑subscript𝑊𝑡𝐴subscript𝑋𝑡𝑑𝑡𝜎𝑑subscript𝑊𝑡\begin{split}dX_{t}&=-0.4I_{n}X_{t}dt+\sqrt{2}dW_{t}\\ &=AX_{t}dt+\sigma dW_{t},\end{split}start_ROW start_CELL italic_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL start_CELL = - 0.4 italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_t + square-root start_ARG 2 end_ARG italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_A italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_t + italic_σ italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , end_CELL end_ROW (47)

initialized at X0=0subscript𝑋00X_{0}=0italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0. The system (47) satisfies Assumption 1 with ctc=μ(A)=0.4subscript𝑐𝑡𝑐𝜇𝐴0.4c_{t}\equiv c=\mu(A)=-0.4italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≡ italic_c = italic_μ ( italic_A ) = - 0.4 and σtσ=2subscript𝜎𝑡𝜎2\sigma_{t}\equiv\sigma=\sqrt{2}italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≡ italic_σ = square-root start_ARG 2 end_ARG. By linearity, Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT follows a zero-mean Gaussian distribution whose covariance cov(Xt)covsubscript𝑋𝑡\text{cov}(X_{t})cov ( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) can be computed using (15) in closed-form. The trajectory of the deterministic dynamics associated with (47) starting from x0=0subscript𝑥00x_{0}=0italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 is xt0subscript𝑥𝑡0x_{t}\equiv 0italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≡ 0.

To illustrate the bound (25), we simulate 5000 independent trajectories of (47) with n=2𝑛2n=2italic_n = 2 over a time horizon t[0,1.5]𝑡01.5t\in[0,1.5]italic_t ∈ [ 0 , 1.5 ] and compute the deviation associated with each trajectory, as depicted in Figure 3. These trajectories are compared with our probabilistic bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT with design parameter ε=1/16𝜀116\varepsilon=1/16italic_ε = 1 / 16, δ=103𝛿superscript103\delta=10^{-3}italic_δ = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT. Figure 3 shows that all the trajectories satisfy the bound rδ,tsubscript𝑟𝛿𝑡r_{\delta,t}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT as expected.

By Theorem 1, the square of our bound (25), rδ,t2superscriptsubscript𝑟𝛿𝑡2r_{\delta,t}^{2}italic_r start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, grows linearly with log(1/δ)1𝛿\log(1/\delta)roman_log ( 1 / italic_δ ) and n𝑛nitalic_n, as illustrated in Figure 4. To verify the tightness of these dependencies, we compare them with those obtained through simulation. In particular, for each choice of δ𝛿\deltaitalic_δ and n𝑛nitalic_n, we simulate 107superscript10710^{7}10 start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT independent trajectories of (47) and compute the associated value of Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ for each trajectory. We follow a standard approach [51] and estimate the high probability bound r^δ,tsubscript^𝑟𝛿𝑡\hat{r}_{\delta,t}over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT of the stochastic deviation as the δ𝛿\deltaitalic_δ-th largest Xtxtnormsubscript𝑋𝑡subscript𝑥𝑡\|X_{t}-x_{t}\|∥ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ (e.g., top 1% if δ=102𝛿superscript102\delta=10^{-2}italic_δ = 10 start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT). The results, shown in Figure 4, imply that r^δ,t2superscriptsubscript^𝑟𝛿𝑡2\hat{r}_{\delta,t}^{2}over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_δ , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT also grows linearly with log(1/δ)1𝛿\log(1/\delta)roman_log ( 1 / italic_δ ) and n𝑛nitalic_n, consistent with our theoretical bound (25).

Note that there is a gap between the calculated bounds with ε=1/16𝜀116\varepsilon=1/16italic_ε = 1 / 16 and the simulated bounds in 4. This is due to the choice of parameters ε1subscript𝜀1\varepsilon_{1}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and ε2subscript𝜀2\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. These parameters ε1subscript𝜀1\varepsilon_{1}italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and ε2subscript𝜀2\varepsilon_{2}italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (21) are constructed in the proof for all δ,n𝛿𝑛\delta,nitalic_δ , italic_n and are not optimal for each choice of δ,n𝛿𝑛\delta,nitalic_δ , italic_n, as explained in Section V-A.

VIII-B Inverted Pendulum

Next, we consider an inverted pendulum with a stabilizing state feedback controller, whose state space model is given by

dXt=[θ˙gLsinθ+KXt]dt+gtdWt𝑑subscript𝑋𝑡matrix˙𝜃𝑔𝐿𝜃𝐾subscript𝑋𝑡𝑑𝑡subscript𝑔𝑡𝑑subscript𝑊𝑡\begin{split}dX_{t}&=\begin{bmatrix}\dot{\theta}\\ \frac{g}{L}\sin\theta+KX_{t}\\ \end{bmatrix}dt+g_{t}dW_{t}\\ \end{split}start_ROW start_CELL italic_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL start_CELL = [ start_ARG start_ROW start_CELL over˙ start_ARG italic_θ end_ARG end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_g end_ARG start_ARG italic_L end_ARG roman_sin italic_θ + italic_K italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] italic_d italic_t + italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW (48)

where Xt=[θθ˙]𝖳subscript𝑋𝑡superscriptmatrix𝜃˙𝜃𝖳X_{t}=\begin{bmatrix}\theta&\dot{\theta}\end{bmatrix}^{\mathsf{T}}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_θ end_CELL start_CELL over˙ start_ARG italic_θ end_ARG end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT is the state vector, θ𝜃\thetaitalic_θ is the angle describing the position of the pendulum, θ˙˙𝜃\dot{\theta}over˙ start_ARG italic_θ end_ARG is the angular velocity of the pendulum, KXt=K1θ+K2θ˙𝐾subscript𝑋𝑡subscript𝐾1𝜃subscript𝐾2˙𝜃KX_{t}=K_{1}\theta+K_{2}\dot{\theta}italic_K italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_θ + italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over˙ start_ARG italic_θ end_ARG is a stabilizing linear state feedback controller, and gtdWtsubscript𝑔𝑡𝑑subscript𝑊𝑡g_{t}dW_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the stochastic disturbance on the angular acceleration with Wtsubscript𝑊𝑡W_{t}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT a one-dimensional Wiener process. Set the gravity g=10𝑔10g=10italic_g = 10, the pendulum length L=1𝐿1L=1italic_L = 1, and gt=[00.1]𝖳subscript𝑔𝑡superscriptmatrix00.1𝖳g_{t}=\begin{bmatrix}0&0.1\end{bmatrix}^{\mathsf{T}}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL 0 end_CELL start_CELL 0.1 end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT. The linear state feedback controller KXt𝐾subscript𝑋𝑡KX_{t}italic_K italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is designed with feedback gain K=[K1K2]=[2020]𝐾matrixsubscript𝐾1subscript𝐾2matrix2020K=\begin{bmatrix}K_{1}&K_{2}\end{bmatrix}=\begin{bmatrix}-20&-20\end{bmatrix}italic_K = [ start_ARG start_ROW start_CELL italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] = [ start_ARG start_ROW start_CELL - 20 end_CELL start_CELL - 20 end_CELL end_ROW end_ARG ] to stabilize the equilibrium point x=[θθ˙]𝖳=0superscript𝑥superscriptmatrixsuperscript𝜃superscript˙𝜃𝖳0x^{*}=\begin{bmatrix}\theta^{*}&\dot{\theta}^{*}\end{bmatrix}^{\mathsf{T}}=0italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = [ start_ARG start_ROW start_CELL italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_CELL start_CELL over˙ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT = 0 of the associated deterministic system

x˙t=[θ˙gLsin(θ)+K1θ+K2θ˙]:=f(xt),subscript˙𝑥𝑡matrix˙𝜃𝑔𝐿𝜃subscript𝐾1𝜃subscript𝐾2˙𝜃assign𝑓subscript𝑥𝑡\displaystyle\dot{x}_{t}=\begin{bmatrix}\dot{\theta}\\ \frac{g}{L}\sin(\theta)+K_{1}\theta+K_{2}\dot{\theta}\end{bmatrix}:=f(x_{t}),over˙ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL over˙ start_ARG italic_θ end_ARG end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_g end_ARG start_ARG italic_L end_ARG roman_sin ( italic_θ ) + italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_θ + italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over˙ start_ARG italic_θ end_ARG end_CELL end_ROW end_ARG ] := italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , (49)

where xt=[θθ˙]𝖳subscript𝑥𝑡superscriptmatrix𝜃˙𝜃𝖳x_{t}=\begin{bmatrix}\theta&\dot{\theta}\end{bmatrix}^{\mathsf{T}}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_θ end_CELL start_CELL over˙ start_ARG italic_θ end_ARG end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT.

Our goal is to find a tight δ𝛿\deltaitalic_δ-PRS of the inverted pendulum (48) starting from the initial configuration 𝒳0=[π10,π10]×[0.2,0.2]subscript𝒳0𝜋10𝜋100.20.2\mathcal{X}_{0}=[-\frac{\pi}{10},\frac{\pi}{10}]\times[-0.2,0.2]caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ - divide start_ARG italic_π end_ARG start_ARG 10 end_ARG , divide start_ARG italic_π end_ARG start_ARG 10 end_ARG ] × [ - 0.2 , 0.2 ]. We use Theorem 2 with contraction-based and interval-based deterministic reachability methods to obtain δ𝛿\deltaitalic_δ-PRS of the inverted pendulum (48). We first consider the modified version of Assumption 1 introduced in Section V-D as μP(Dxf(x))ctsubscript𝜇𝑃subscript𝐷𝑥𝑓𝑥subscript𝑐𝑡\mu_{P}(D_{x}f(x))\leq c_{t}italic_μ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x ) ) ≤ italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and P12gtgt𝖳P12σt2Inprecedes-or-equalssuperscript𝑃12subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳superscript𝑃12superscriptsubscript𝜎𝑡2subscript𝐼𝑛P^{\frac{1}{2}}g_{t}g_{t}^{\mathsf{T}}P^{\frac{1}{2}}\preceq\sigma_{t}^{2}I_{n}italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ⪯ italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT for every t0𝑡0t\geq 0italic_t ≥ 0 and xnx\in{}^{n}italic_x ∈ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT. For every x=(θ,θ˙)𝖳2x=(\theta,\dot{\theta})^{\mathsf{T}}\in{}^{2}italic_x = ( italic_θ , over˙ start_ARG italic_θ end_ARG ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ∈ start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT,

Dxf(x)=[01gLcos(θ)+K1K2].subscript𝐷𝑥𝑓𝑥delimited-[]01𝑔𝐿𝜃subscript𝐾1subscript𝐾2\displaystyle D_{x}f(x)=\left[\begin{smallmatrix}0&1\\ \frac{g}{L}\cos(\theta)+K_{1}&K_{2}\end{smallmatrix}\right].italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x ) = [ start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_g end_ARG start_ARG italic_L end_ARG roman_cos ( italic_θ ) + italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW ] .

We define the matrices A1,A22×2A_{1},A_{2}\in{}^{2\times 2}italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ start_FLOATSUPERSCRIPT 2 × 2 end_FLOATSUPERSCRIPT as follows:

A1=[01gL+K1K2],A2=[01gL+K1K2].formulae-sequencesubscript𝐴1delimited-[]01𝑔𝐿subscript𝐾1subscript𝐾2subscript𝐴2delimited-[]01𝑔𝐿subscript𝐾1subscript𝐾2\displaystyle A_{1}=\left[\begin{smallmatrix}0&1\\ \frac{g}{L}+K_{1}&K_{2}\end{smallmatrix}\right],\qquad A_{2}=\left[\begin{% smallmatrix}0&1\\ -\frac{g}{L}+K_{1}&K_{2}\end{smallmatrix}\right].italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = [ start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL divide start_ARG italic_g end_ARG start_ARG italic_L end_ARG + italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW ] , italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = [ start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL - divide start_ARG italic_g end_ARG start_ARG italic_L end_ARG + italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW ] .

Note that cos(θ)[1,1]𝜃11\cos(\theta)\in[-1,1]roman_cos ( italic_θ ) ∈ [ - 1 , 1 ]. This implies that, for every x2x\in{}^{2}italic_x ∈ start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT, we have Dxf(x)conv{A1,A2}subscript𝐷𝑥𝑓𝑥convsubscript𝐴1subscript𝐴2D_{x}f(x)\in\mathrm{conv}\left\{A_{1},A_{2}\right\}italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x ) ∈ roman_conv { italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT }, where convconv\mathrm{conv}roman_conv is the convex hull. Thus, using [52, Lemma 4.1], the minimum constant contraction rate ct=csubscript𝑐𝑡𝑐c_{t}=citalic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_c for the system (49) can be computed using the following optimization problem:

minc,P0subscript𝑐succeeds𝑃0\displaystyle\min_{c\in\real,P\succ 0}roman_min start_POSTSUBSCRIPT italic_c ∈ , italic_P ≻ 0 end_POSTSUBSCRIPT c𝑐\displaystyle\quad citalic_c
s.t.Ai𝖳P+PAi2cP,for i{1,2}.formulae-sequenceprecedes-or-equalss.t.superscriptsubscript𝐴𝑖𝖳𝑃𝑃subscript𝐴𝑖2𝑐𝑃for 𝑖12\displaystyle\mbox{s.t.}\;\;A_{i}^{\mathsf{T}}P+PA_{i}\preceq 2cP,\quad\mbox{% for }i\in\{1,2\}.s.t. italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_P + italic_P italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⪯ 2 italic_c italic_P , for italic_i ∈ { 1 , 2 } . (50)

We solve optimization problem (VIII-B) by successively applying semi-definite programming on P𝑃Pitalic_P and bisection on c𝑐citalic_c. The optimal solution of (VIII-B) is given by the constant contraction rate ct=c=0.5subscript𝑐𝑡𝑐0.5c_{t}=c=-0.5italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_c = - 0.5 and the weight matrix P=[35.682.212.211.27]𝑃delimited-[]35.682.212.211.27P=\left[\begin{smallmatrix}35.68&2.21\\ 2.21&1.27\end{smallmatrix}\right]italic_P = [ start_ROW start_CELL 35.68 end_CELL start_CELL 2.21 end_CELL end_ROW start_ROW start_CELL 2.21 end_CELL start_CELL 1.27 end_CELL end_ROW ]. With this matrix P𝑃Pitalic_P, we compute P12gtgt𝖳P12=[0.00100.00340.00340.0118]0.0128I2superscript𝑃12subscript𝑔𝑡superscriptsubscript𝑔𝑡𝖳superscript𝑃12delimited-[]0.00100.00340.00340.0118precedes-or-equals0.0128subscript𝐼2P^{\frac{1}{2}}g_{t}g_{t}^{\mathsf{T}}P^{\frac{1}{2}}=\left[\begin{smallmatrix% }0.0010&0.0034\\ 0.0034&0.0118\end{smallmatrix}\right]\preceq 0.0128I_{2}italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT = [ start_ROW start_CELL 0.0010 end_CELL start_CELL 0.0034 end_CELL end_ROW start_ROW start_CELL 0.0034 end_CELL start_CELL 0.0118 end_CELL end_ROW ] ⪯ 0.0128 italic_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, and get σt=σ=0.1130subscript𝜎𝑡𝜎0.1130\sigma_{t}=\sigma=0.1130italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_σ = 0.1130.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 5: Left: The solid blue lines show the boundary of δ𝛿\deltaitalic_δ-PRS with δ=103𝛿superscript103\delta=10^{-3}italic_δ = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT at times t=1,2,4𝑡124t=1,2,4italic_t = 1 , 2 , 4 for the stochastic inverted pendulum (48) starting from 𝒳0¯𝒳0subscript𝒳0¯subscript𝒳0\overline{\mathcal{X}_{0}}\supset\mathcal{X}_{0}over¯ start_ARG caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ⊃ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT obtained using Proposition 2. The dashed blue lines are the boundary of the ellipsoids that over-approximate reachable sets of the associated deterministic system (49). The red dots are 2000200020002000 random trajectories of the inverted pendulum (48) starting from 𝒳¯0𝒳0subscript𝒳0subscript¯𝒳0\overline{\mathcal{X}}_{0}\supset\mathcal{X}_{0}over¯ start_ARG caligraphic_X end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊃ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT at times t=1,2,4𝑡124t=1,2,4italic_t = 1 , 2 , 4. Right: The solid blue lines show the boundary of δ𝛿\deltaitalic_δ-PRS with δ=103𝛿superscript103\delta=10^{-3}italic_δ = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT at times t=1,2,4𝑡124t=1,2,4italic_t = 1 , 2 , 4 for the inverted pendulum (48) starting from T1𝒴¯0𝒳0subscript𝒳0superscript𝑇1subscript¯𝒴0T^{-1}\overline{\mathcal{Y}}_{0}\supset\mathcal{X}_{0}italic_T start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG caligraphic_Y end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊃ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT obtained using Theorem 2 and interval-based reachability of the transformed system. The dashed blue lines are the boundary of the parallelotopes obtained from the interval analysis that over-approximates the reachable sets of the associated deterministic system (49). The red dots are 2000200020002000 random trajectories of the inverted pendulum (48) starting from T1𝒴¯0𝒳0subscript𝒳0superscript𝑇1subscript¯𝒴0T^{-1}\overline{\mathcal{Y}}_{0}\supset\mathcal{X}_{0}italic_T start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG caligraphic_Y end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊃ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT at times t=1,2,4𝑡124t=1,2,4italic_t = 1 , 2 , 4.
Contraction-based Reachability

We use Proposition 2 to find a δ𝛿\deltaitalic_δ-PRS of (48). We consider Assumption 2 with 𝕏=P\|\cdot\|_{\mathbb{X}}=\|\cdot\|_{P}∥ ⋅ ∥ start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT = ∥ ⋅ ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT with positive definite matrix P𝑃Pitalic_P as defined above. For every x2x\in{}^{2}italic_x ∈ start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT, we have μP(Dxf(x))c=0.5subscript𝜇𝑃subscript𝐷𝑥𝑓𝑥𝑐0.5\mu_{P}(D_{x}f(x))\leq c=-0.5italic_μ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x ) ) ≤ italic_c = - 0.5. Using Proposition 2 with the initial configuration 𝒳¯0={x|2xP[π100.2]P}𝒳0\overline{\mathcal{X}}_{0}=\{x\in{}^{2}\;|\;\|x\|_{P}\leq\left\|\left[\begin{% smallmatrix}\frac{\pi}{10}\\ 0.2\end{smallmatrix}\right]\right\|_{P}\}\supset\mathcal{X}_{0}over¯ start_ARG caligraphic_X end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = { italic_x ∈ start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT | ∥ italic_x ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ≤ ∥ [ start_ROW start_CELL divide start_ARG italic_π end_ARG start_ARG 10 end_ARG end_CELL end_ROW start_ROW start_CELL 0.2 end_CELL end_ROW ] ∥ start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT } ⊃ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we obtain a δ𝛿\deltaitalic_δ-PRS of (48) with δ=103𝛿superscript103\delta=10^{-3}italic_δ = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT as shown in Figure 5 (left).

Interval-based Reachability

We use Theorem 2 with a modified version of interval-based analysis for the associated deterministic system (49) to find a δ𝛿\deltaitalic_δ-PRS of (48). We consider the coordinate transformation yt=Txtsubscript𝑦𝑡𝑇subscript𝑥𝑡y_{t}=Tx_{t}italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_T italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT with nonsingular matrix T=[10.210]𝑇delimited-[]10.210T=\left[\begin{smallmatrix}1&0.2\\ 1&0\end{smallmatrix}\right]italic_T = [ start_ROW start_CELL 1 end_CELL start_CELL 0.2 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW ] for the associated deterministic system (49) and apply interval-based reachability to the transformed system. We employ Theorem 2 with the initial configuration T1𝒴¯0𝒳0subscript𝒳0superscript𝑇1subscript¯𝒴0T^{-1}\overline{\mathcal{Y}}_{0}\supset\mathcal{X}_{0}italic_T start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over¯ start_ARG caligraphic_Y end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⊃ caligraphic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT where 𝒴¯0=[π10[1.041],π10[1.041]]subscript¯𝒴0𝜋10delimited-[]1.041𝜋10delimited-[]1.041\overline{\mathcal{Y}}_{0}=[{-\tfrac{\pi}{10}\left[\begin{smallmatrix}1.04\\ 1\end{smallmatrix}\right]},{\tfrac{\pi}{10}\left[\begin{smallmatrix}1.04\\ 1\end{smallmatrix}\right]}]over¯ start_ARG caligraphic_Y end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ - divide start_ARG italic_π end_ARG start_ARG 10 end_ARG [ start_ROW start_CELL 1.04 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW ] , divide start_ARG italic_π end_ARG start_ARG 10 end_ARG [ start_ROW start_CELL 1.04 end_CELL end_ROW start_ROW start_CELL 1 end_CELL end_ROW ] ]. The δ𝛿\deltaitalic_δ-PRS of (48) with δ=103𝛿superscript103\delta=10^{-3}italic_δ = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT obtained using this analysis are shown in Figure 5 (right).

VIII-C Nonlinear Unicycle model

Finally, we consider a vehicle moving on a 2222-dimensional plane with obstacles shown in light red in Figure 6. The vehicle is modeled by the unicycle dynamics

dXt=[vtcos(θ)vtsin(θ)wt+ut]dt+gtdWt𝑑subscript𝑋𝑡matrixsubscript𝑣𝑡𝜃subscript𝑣𝑡𝜃subscript𝑤𝑡subscript𝑢𝑡𝑑𝑡subscript𝑔𝑡𝑑subscript𝑊𝑡\displaystyle dX_{t}=\begin{bmatrix}v_{t}\cos(\theta)\\ v_{t}\sin(\theta)\\ w_{t}+u_{t}\end{bmatrix}dt+g_{t}dW_{t}italic_d italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_cos ( italic_θ ) end_CELL end_ROW start_ROW start_CELL italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_sin ( italic_θ ) end_CELL end_ROW start_ROW start_CELL italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] italic_d italic_t + italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (51)

where Xt=[pxpyθ]𝖳subscript𝑋𝑡superscriptmatrixsubscript𝑝𝑥subscript𝑝𝑦𝜃𝖳X_{t}=\begin{bmatrix}p_{x}&p_{y}&\theta\end{bmatrix}^{\mathsf{T}}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_CELL start_CELL italic_θ end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT is the state of the vehicle, (px,py)subscript𝑝𝑥subscript𝑝𝑦(p_{x},p_{y})( italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ) is the position of the center of mass of the vehicle in the plane, θ𝜃\thetaitalic_θ is the heading angle of the vehicle, vtsubscript𝑣𝑡v_{t}italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the linear velocity of the center of mass, wtsubscript𝑤𝑡w_{t}italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the angular velocity of the vehicle, utsubscript𝑢𝑡u_{t}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the deterministic disturbance on the angular velocity, and gtdWtsubscript𝑔𝑡𝑑subscript𝑊𝑡g_{t}dW_{t}italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the stochastic disturbance on the model with Wtsubscript𝑊𝑡W_{t}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT a three-dimensional Wiener process. The associated deterministic unicycle model is given by

x˙t=[vtcos(θ)vtsin(θ)wt]+[00ut]:=f(xt,ut,t)subscript˙𝑥𝑡matrixsubscript𝑣𝑡𝜃subscript𝑣𝑡𝜃subscript𝑤𝑡matrix00subscript𝑢𝑡assign𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡\displaystyle\dot{x}_{t}=\begin{bmatrix}v_{t}\cos(\theta)\\ v_{t}\sin(\theta)\\ w_{t}\end{bmatrix}+\begin{bmatrix}0\\ 0\\ u_{t}\end{bmatrix}:=f(x_{t},u_{t},t)over˙ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_cos ( italic_θ ) end_CELL end_ROW start_ROW start_CELL italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_sin ( italic_θ ) end_CELL end_ROW start_ROW start_CELL italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] + [ start_ARG start_ROW start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL end_ROW start_ROW start_CELL italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] := italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) (52)

where xt=[pxpyθ]𝖳subscript𝑥𝑡superscriptmatrixsubscript𝑝𝑥subscript𝑝𝑦𝜃𝖳x_{t}=\begin{bmatrix}p_{x}&p_{y}&\theta\end{bmatrix}^{\mathsf{T}}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_CELL start_CELL italic_θ end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT. We use Model Predictive Control (MPC) to design an open-loop controller to steer the deterministic system (52) from the initial configuration x0=(5,5,2π3)subscript𝑥0552𝜋3x_{0}=(5,5,-\frac{2\pi}{3})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( 5 , 5 , - divide start_ARG 2 italic_π end_ARG start_ARG 3 end_ARG ) to the origin while avoiding the obstacles in the pxpysubscript𝑝𝑥subscript𝑝𝑦p_{x}-p_{y}italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT plane. The trajectory of the deterministic system (52) with the MPC controller starting from x0=(5,5,2π3)subscript𝑥0552𝜋3x_{0}=(5,5,-\frac{2\pi}{3})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( 5 , 5 , - divide start_ARG 2 italic_π end_ARG start_ARG 3 end_ARG ) is denoted by t(px,py,θ)maps-to𝑡superscriptsubscript𝑝𝑥superscriptsubscript𝑝𝑦superscript𝜃t\mapsto(p_{x}^{*},p_{y}^{*},\theta^{*})italic_t ↦ ( italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ). We consider t(px,py,θ)maps-to𝑡superscriptsubscript𝑝𝑥superscriptsubscript𝑝𝑦superscript𝜃t\mapsto(p_{x}^{*},p_{y}^{*},\theta^{*})italic_t ↦ ( italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) as the reference trajectory for the stochastic vehicle (51). Using the approach in [53], we design the following feedback controller for tracking the reference trajectory t(px,py,θ)maps-to𝑡superscriptsubscript𝑝𝑥superscriptsubscript𝑝𝑦superscript𝜃t\mapsto(p_{x}^{*},p_{y}^{*},\theta^{*})italic_t ↦ ( italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ):

vtsubscript𝑣𝑡\displaystyle v_{t}italic_v start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =Krrtcos(αt),absentsubscript𝐾𝑟subscript𝑟𝑡subscript𝛼𝑡\displaystyle=K_{r}r_{t}\cos(\alpha_{t}),= italic_K start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT roman_cos ( italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ,
wtsubscript𝑤𝑡\displaystyle w_{t}italic_w start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =Kααt+Krsin(αt)cos(αt)αt+βtαt,absentsubscript𝐾𝛼subscript𝛼𝑡subscript𝐾𝑟subscript𝛼𝑡subscript𝛼𝑡subscript𝛼𝑡subscript𝛽𝑡subscript𝛼𝑡\displaystyle=K_{\alpha}\alpha_{t}+K_{r}\sin(\alpha_{t})\cos(\alpha_{t})\tfrac% {\alpha_{t}+\beta_{t}}{\alpha_{t}},= italic_K start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT roman_sin ( italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) roman_cos ( italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) divide start_ARG italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG , (53)

where the variables rt,αt,βtsubscript𝑟𝑡subscript𝛼𝑡subscript𝛽𝑡r_{t},\alpha_{t},\beta_{t}italic_r start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT are defined as

rtsubscript𝑟𝑡\displaystyle r_{t}italic_r start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =(pxpx)2+(pypy)2,absentsuperscriptsubscript𝑝𝑥superscriptsubscript𝑝𝑥2superscriptsubscript𝑝𝑦superscriptsubscript𝑝𝑦2\displaystyle=\sqrt{(p_{x}-p_{x}^{*})^{2}+(p_{y}-p_{y}^{*})^{2}},= square-root start_ARG ( italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ,
αtsubscript𝛼𝑡\displaystyle\alpha_{t}italic_α start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =θatan(pypy,pxpx),absent𝜃atansubscript𝑝𝑦superscriptsubscript𝑝𝑦subscript𝑝𝑥superscriptsubscript𝑝𝑥\displaystyle=\theta-\mathrm{atan}(p_{y}-p_{y}^{*},p_{x}-p_{x}^{*}),= italic_θ - roman_atan ( italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ,
βtsubscript𝛽𝑡\displaystyle\beta_{t}italic_β start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT =atan(pypy,pxpx)θ,absentatansubscript𝑝𝑦superscriptsubscript𝑝𝑦subscript𝑝𝑥superscriptsubscript𝑝𝑥superscript𝜃\displaystyle=\mathrm{atan}(p_{y}-p_{y}^{*},p_{x}-p_{x}^{*})-\theta^{*},= roman_atan ( italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ,

and Kr,Kα0subscript𝐾𝑟subscript𝐾𝛼0K_{r},K_{\alpha}\geq 0italic_K start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ≥ 0 are feedback gains.

We consider the stochastic vehicle (51) with gt=0.1subscript𝑔𝑡0.1g_{t}=0.1italic_g start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 0.1 and ut[0.03,0.03]subscript𝑢𝑡0.030.03u_{t}\in[-0.03,0.03]italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ [ - 0.03 , 0.03 ] with the tracking controller (VIII-C) and feedback gains Kr=0.8subscript𝐾𝑟0.8K_{r}=-0.8italic_K start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = - 0.8 and Kα=1.5subscript𝐾𝛼1.5K_{\alpha}=-1.5italic_K start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT = - 1.5. We assume that this stochastic vehicle starts from x0=(5,5,2π3)subscript𝑥0552𝜋3x_{0}=(5,5,-\frac{2\pi}{3})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( 5 , 5 , - divide start_ARG 2 italic_π end_ARG start_ARG 3 end_ARG ). Our goal is to provide high probability guarantees that the stochastic vehicle (51) with the tracking controller (VIII-C) avoids the obstacles shown in Figure 6, over the time horizon [0,5]05[0,5][ 0 , 5 ]. We use a modified version of Proposition 2 to construct δ𝛿\deltaitalic_δ-PRS of the stochastic vehicle (51) with the tracking controller (VIII-C). We use the strategy in [9] to estimate a time-varying ctsubscript𝑐𝑡c_{t}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in Assumption 1. We also use a generalization of Assumption 2 with 𝕏\|\cdot\|_{\mathbb{X}}∥ ⋅ ∥ start_POSTSUBSCRIPT blackboard_X end_POSTSUBSCRIPT and 𝕌\|\cdot\|_{\mathbb{U}}∥ ⋅ ∥ start_POSTSUBSCRIPT blackboard_U end_POSTSUBSCRIPT defined as standard Euclidean norms and with time-varying contraction rate ctsubscript𝑐𝑡c_{t}italic_c start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. This time-varying contraction rate is then used for contraction-based reachability analysis of the associated deterministic system (52) in Proposition 2. For δ=103𝛿superscript103\delta=10^{-3}italic_δ = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT, the δ𝛿\deltaitalic_δ-PRS of the stochastic vehicle (51) with the tracking controller (VIII-C) starting from x0=(5,5,2π3)subscript𝑥0552𝜋3x_{0}=(5,5,-\frac{2\pi}{3})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( 5 , 5 , - divide start_ARG 2 italic_π end_ARG start_ARG 3 end_ARG ) at times t[0,5]𝑡05t\in[0,5]italic_t ∈ [ 0 , 5 ] are shown in Figure 6 using the green envelope. From Figure 6, it is clear that the green envelope does not intersect any of the obstacles in the pxpysubscript𝑝𝑥subscript𝑝𝑦p_{x}-p_{y}italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT plane. Therefore, with probability at least 99.9%percent99.999.9\%99.9 %, the stochastic vehicle (51) with the tracking controller (VIII-C) starting from x0=(5,5,2π3)subscript𝑥0552𝜋3x_{0}=(5,5,-\frac{2\pi}{3})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( 5 , 5 , - divide start_ARG 2 italic_π end_ARG start_ARG 3 end_ARG ) is safe and does not hit any obstacle for all times t[0,5]𝑡05t\in[0,5]italic_t ∈ [ 0 , 5 ].

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 6: The stochastic vehicle (51) with the tracking feedback controller (VIII-C) in the pxpysubscript𝑝𝑥subscript𝑝𝑦p_{x}-p_{y}italic_p start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT plane with obstacles shown in light red. The reference trajectory designed using MPC for the deterministic system (52) is shown with dashed black line. Top left: Two stochastic trajectories of vehicle (51) with the tracking feedback controller (VIII-C) starting from x0=(5,5,2π3)subscript𝑥0552𝜋3x_{0}=(5,5,-\frac{2\pi}{3})italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( 5 , 5 , - divide start_ARG 2 italic_π end_ARG start_ARG 3 end_ARG ) at time t=1𝑡1t=1italic_t = 1. Top right: The same two stochastic trajectories at time t=2.5𝑡2.5t=2.5italic_t = 2.5. Bottom left: The same two stochastic trajectories at time t=4𝑡4t=4italic_t = 4. Bottom right: 200200200200 sample trajectories of the stochastic vehicle (51) with the tracking feedback controller (VIII-C). The green envelope is the union of δ𝛿\deltaitalic_δ-PRS at times t[0,5]𝑡05t\in[0,5]italic_t ∈ [ 0 , 5 ] obtained using a modified version of Proposition 2 with δ=103𝛿superscript103\delta=10^{-3}italic_δ = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT. Therefore, with probability at least 99.9%percent99.999.9\%99.9 %, the stochastic vehicle (51) with the tracking feedback controller (VIII-C) is safe for all t[0,5]𝑡05t\in[0,5]italic_t ∈ [ 0 , 5 ].

IX Conclusion

We propose an efficient and flexible framework for computing the Probabilistic Reachable Set (PRS) of continuous-time nonlinear stochastic systems. Using a suitable separation strategy, we decouple the effect of deterministic inputs and the effect of stochastic uncertainties on the PRS. This separation strategy is flexible as it allows using any deterministic reachability method to capture the effects of deterministic inputs. It essentially reduce the problem of computing PRS into analyzing the distance between stochastic trajectories and their associated deterministic trajectories termed stochastic deviation. By developing a novel energy function called Averaged Moment Generating Function, we establish a tight high-probability bound on the stochastic deviation of stochastic systems. To the best of our knowledge, our bound is the tightest high-probability bound on stochastic deviation for general nonlinear systems. By combining this probabilistic bound on stochastic deviation with the contraction-based and interval-based reachability of deterministic systems, we provide tight estimates of PRS for stochastic systems. Our separation strategy and tight probabilistic bounds on stochastic deviation can transform many current methods/results in control theory and applications. They will also open new research directions in various fields, such as safety-critical control, estimation, uncertainty quantification, statistics, and machine learning. Additionally, the AMGF leveraged in our theoretical analysis is a powerful mathematical tool, waiting for further exploitation in the future.

References

  • [1] S. Bansal, M. Chen, S. Herbert, and C. J. Tomlin, “Hamilton-jacobi reachability: A brief overview and recent advances,” in IEEE 56th Annual Conference on Decision and Control (CDC), 2017, pp. 2242–2253.
  • [2] I. Mitchell, “A toolbox of level set methods,” UBC Department of Computer Science Technical Report TR-2007-11, vol. 1, p. 6, 2007.
  • [3] J. Maidens and M. Arcak, “Reachability analysis of nonlinear systems using matrix measures,” IEEE Transactions on Automatic Control, vol. 60, no. 1, pp. 265–270, 2015.
  • [4] F. Bullo, Contraction Theory for Dynamical Systems, 1.0 ed.   Kindle Direct Publishing, 2022. [Online]. Available: http://motion.me.ucsb.edu/book-ctds
  • [5] J. K. Scott and P. I. Barton, “Bounds on the reachable sets of nonlinear control systems,” Automatica, vol. 49, no. 1, pp. 93–100, 2013.
  • [6] P.-J. Meyer, A. Devonport, and M. Arcak, “TIRA: Toolbox for interval reachability analysis,” in Proceedings of the 22nd ACM International Conference on Hybrid Systems: Computation and Control, 2019, pp. 224–229.
  • [7] S. Coogan, “Mixed monotonicity for reachability and safety in dynamical systems,” in 2020 59th IEEE Conference on Decision and Control (CDC), 2020, pp. 5074–5085.
  • [8] S. Jafarpour, A. Harapanahalli, and S. Coogan, “Efficient interaction-aware interval analysis of neural network feedback loops,” arXiv preprint, 2023. [Online]. Available: https://arxiv.org/abs/2307.14938
  • [9] C. Fan, J. Kapinski, X. Jin, and S. Mitra, “Simulation-driven reachability using matrix measures,” ACM Trans. Embed. Comput. Syst., vol. 17, no. 1, dec 2017. [Online]. Available: https://doi.org/10.1145/3126685
  • [10] Z. Huang and S. Mitra, “Computing bounded reach sets from sampled simulation traces,” in Proceedings of the 15th ACM International Conference on Hybrid Systems: Computation and Control, ser. HSCC ’12.   Association for Computing Machinery, 2012, p. 291–294. [Online]. Available: https://doi.org/10.1145/2185632.2185676
  • [11] R. K. Cosner, P. Culbertson, and A. D. Ames, “Bounding stochastic safety: Leveraging Freedman’s inequality with discrete-time control barrier functions,” IEEE Control Systems Letters, vol. 8, pp. 1937–1942, 2024.
  • [12] H. M. Soner and N. Touzi, “Dynamic programming for stochastic target problems and geometric flows,” Journal of the European Mathematical Society, vol. 4, no. 3, pp. 201–236, 2002.
  • [13] A. Abate, M. Prandini, J. Lygeros, and S. Sastry, “Probabilistic reachability and safety for controlled discrete time stochastic hybrid systems,” Automatica, vol. 44, no. 11, pp. 2724–2734, 2008.
  • [14] S. Summers and J. Lygeros, “Verification of discrete time stochastic hybrid systems: A stochastic reach-avoid decision problem,” Automatica, vol. 46, no. 12, pp. 1951–1961, 2010.
  • [15] P. Mohajerin Esfahani, D. Chatterjee, and J. Lygeros, “The stochastic reach-avoid problem and set characterization for diffusions,” Automatica, vol. 70, pp. 43–56, 2016.
  • [16] K. Lesser, M. Oishi, and R. S. Erwin, “Stochastic reachability for control of spacecraft relative motion,” in 52nd IEEE Conference on Decision and Control, 2013, pp. 4705–4712.
  • [17] H. Sartipizadeh, A. P. Vinod, B. Acikmese, and M. Oishi, “Voronoi partition-based scenario reduction for fast sampling-based stochastic reachability computation of linear systems,” in 2019 American Control Conference (ACC), 2019, pp. 37–44.
  • [18] N. Hashemi, X. Qin, L. Lindemann, and J. V. Deshmukh, “Data-driven reachability analysis of stochastic dynamical systems with conformal inference,” in 62nd IEEE Conference on Decision and Control (CDC), 2023, pp. 3102–3109.
  • [19] M. Black, G. Fainekos, B. Hoxha, and D. Panagou, “Risk-aware fixed-time stabilization of stochastic systems under measurement uncertainty,” arXiv preprint, 2024. [Online]. Available: https://arxiv.org/abs/2403.20258
  • [20] H. El-Samad, M. Fazel, X. Liu, A. Papachristodoulou, and S. Prajna, “Stochastic reachability analysis in complex biological networks,” in 2006 American Control Conference.   IEEE, 2006, pp. 6–pp.
  • [21] S. Prajna, A. Jadbabaie, and G. J. Pappas, “A framework for worst-case and stochastic safety verification using barrier certificates,” IEEE Transactions on Automatic Control, vol. 52, no. 8, pp. 1415–1428, 2007.
  • [22] C. Santoyo, M. Dutreix, and S. Coogan, “A barrier function approach to finite-time stochastic system verification and control,” Automatica, vol. 125, p. 109439, 2021.
  • [23] M. Anand, A. Lavaei, and M. Zamani, “From small-gain theory to compositional construction of barrier certificates for large-scale stochastic systems,” IEEE Transactions on Automatic Control, vol. 67, no. 10, pp. 5638–5645, 2022.
  • [24] X. Chen and S. Sankaranarayanan, “Reachability analysis for cyber-physical systems: Are we there yet?” in NASA Formal Methods: 14th International Symposium, NFM 2022, Pasadena, CA, USA, May 24–27, 2022, Proceedings.   Springer, 2022, pp. 109–130.
  • [25] C. Moore, “Unpredictability and undecidability in dynamical systems,” Physical Review Letters, vol. 64, pp. 2354–2357, May 1990.
  • [26] C. A. Desoer and H. Haneda, “The measure of a matrix as a tool to analyze computer algorithms for circuit analysis,” IEEE Transactions on Circuit Theory, vol. 19, no. 5, pp. 480–486, 1972.
  • [27] G. Söderlind, “The logarithmic norm. history and modern theory,” BIT Numerical Mathematics, vol. 46, pp. 631–652, 2006.
  • [28] A. Davydov, S. Jafarpour, and F. Bullo, “Non-Euclidean contraction theory for robust nonlinear stability,” IEEE Transactions on Automatic Control, vol. 67, no. 12, pp. 6667–6681, 2022.
  • [29] W. Lohmiller and J.-J. E. Slotine, “On contraction analysis for non-linear systems,” Automatica, vol. 34, no. 6, pp. 683–696, 1998.
  • [30] F. Forni and R. Sepulchre, “A differential Lyapunov framework for contraction analysis,” vol. 59, no. 3, pp. 614–628, 2014.
  • [31] Z. Aminzare and E. D. Sontag, “Contraction methods for nonlinear systems: A brief introduction and some open problems,” in 53rd IEEE Conference on Decision and Control, 2014, pp. 3835–3847.
  • [32] E. M. Aylward, P. A. Parrilo, and J.-J. E. Slotine, “Stability and robustness analysis of nonlinear systems via contraction metrics and sos programming,” Automatica, vol. 44, no. 8, pp. 2163–2170, 2008.
  • [33] Z. Zahreddine, “Matrix measure and application to stability of matrices and interval dynamical systems,” International Journal of Mathematics and Mathematical Sciences, vol. 2003, no. 2, p. 937084, 2003.
  • [34] B. Øksendal, Stochastic differential equations: an introduction with applications, ser. Universitext.   Springer Berlin, Heidelberg, 2013.
  • [35] S. Särkkä and A. Solin, Applied stochastic differential equations.   Cambridge University Press, 2019, vol. 10.
  • [36] R. K. Cosner, P. Culbertson, A. J. Taylor, and A. D. Ames, “Robust safety under stochastic uncertainty with discrete-time control barrier functions,” arXiv preprint, 2023. [Online]. Available: https://arxiv.org/abs/2302.07469
  • [37] E. Fogel and D. Halperin, “Exact and efficient construction of Minkowski sums of convex polyhedra with applications,” Computer-Aided Design, vol. 39, no. 11, pp. 929–940, 2007.
  • [38] Q.-C. Pham, N. Tabareau, and J.-J. Slotine, “A contraction theory approach to stochastic incremental stability,” IEEE Transactions on Automatic Control, vol. 54, no. 4, pp. 816–820, 2009.
  • [39] T. Lorenz, Mutational analysis: a joint framework for Cauchy problems in and beyond vector spaces, ser. Lecture Notes in Mathematics.   Springer-Verlag, Berlin, 2010, vol. 1996.
  • [40] F. Burns, M. Fiedler, and E. Haynsworth, “Polyhedral cones and positive operators,” Linear Algebra and its Applications, vol. 8, no. 6, pp. 547–559, 1974.
  • [41] A. Gittens and J. A. Tropp, “Tail bounds for all eigenvalues of a sum of random matrices,” arXiv preprint, 2011. [Online]. Available: https://arxiv.org/abs/1104.4513
  • [42] P. Rigollet and J.-C. Hütter, “High-dimensional statistics,” arXiv preprint, 2023. [Online]. Available: https://arxiv.org/abs/2310.19244
  • [43] J. M. Altschuler and K. Talwar, “Concentration of the Langevin algorithm’s stationary distribution,” arXiv preprint, 2022. [Online]. Available: https://arxiv.org/abs/2212.12629
  • [44] T. H. Gronwall, “Note on the derivatives with respect to a parameter of the solutions of a system of differential equations,” Annals of Mathematics, vol. 20, no. 4, pp. 292–296, 1919.
  • [45] R. Vershynin, High-Dimensional Probability: An Introduction with Applications in Data Science, ser. Cambridge Series in Statistical and Probabilistic Mathematics.   Cambridge University Press, 2018.
  • [46] P. Gritzmann and B. Sturmfels, “Minkowski addition of polytopes: Computational complexity and applications to gröbner bases,” SIAM Journal on Discrete Mathematics, vol. 6, no. 2, pp. 246–269, 1993.
  • [47] G. Varadhan and D. Manocha, “Accurate Minkowski sum approximation of polyhedral models,” in 12th Pacific Conference on Computer Graphics and Applications, 2004. PG 2004. Proceedings., 2004, pp. 392–401.
  • [48] C. Weibel, “Minkowski sums of polytopes: combinatorics and computation,” EPFL, Tech. Rep., 2007.
  • [49] L. Jaulin, M. Kieffer, O. Didrit, and É. Walter, Applied Interval Analysis.   Springer London, 2001.
  • [50] A. Harapanahalli, S. Jafarpour, and S. Coogan, “A toolbox for fast interval arithmetic in numpy with an application to formal verification of neural network controlled systems,” in 2nd ICML Workshop on Formal Verification of Machine Learning, 2023. [Online]. Available: https://arxiv.org/abs/2306.15340
  • [51] A. Shapiro, “Monte Carlo sampling methods,” Handbooks in operations research and management science, vol. 10, pp. 353–425, 2003.
  • [52] C. Fan, J. Kapinski, X. Jin, and S. Mitra, “Simulation-driven reachability using matrix measures,” ACM Transactions on Embedded Computing Systems, vol. 17, no. 1, dec 2017.
  • [53] M. Aicardi, G. Casalino, A. Bicchi, and A. Balestrino, “Closed loop steering of unicycle like vehicles via Lyapunov techniques,” IEEE Robotics & Automation Magazine, vol. 2, no. 1, pp. 27–35, 1995.

-A Proof of Lemma V.1 (Sub-Gaussian Norm Concentration)

For every ε(0,1)𝜀01\varepsilon\in(0,1)italic_ε ∈ ( 0 , 1 ), we can find a finite set 𝒩n(1,0)𝒩superscript𝑛10\mathcal{N}\subseteq\mathcal{B}^{n}\left(1,0\right)caligraphic_N ⊆ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( 1 , 0 ) such that for x0n(1,0),x𝒩,xx0εformulae-sequencefor-allsubscript𝑥0superscript𝑛10formulae-sequence𝑥𝒩norm𝑥subscript𝑥0𝜀\forall x_{0}\in\mathcal{B}^{n}\left(1,0\right),~{}\exists x\in\mathcal{N},~{}% \|x-x_{0}\|\leq\varepsilon∀ italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_B start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( 1 , 0 ) , ∃ italic_x ∈ caligraphic_N , ∥ italic_x - italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∥ ≤ italic_ε. Let |𝒩|𝒩|\mathcal{N}|| caligraphic_N | denote the number of elements in 𝒩𝒩\mathcal{N}caligraphic_N. By [42, Exercise 4.4.2], there exists such an 𝒩𝒩\mathcal{N}caligraphic_N that |𝒩|(1+2/ε)n𝒩superscript12𝜀𝑛|\mathcal{N}|\leq(1+2/\varepsilon)^{n}| caligraphic_N | ≤ ( 1 + 2 / italic_ε ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and for any vector xn𝑥superscript𝑛x\in\mathbb{R}^{n}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT,

x11εmax𝒩𝖳x.norm𝑥11𝜀subscript𝒩superscript𝖳𝑥\|x\|\leq\frac{1}{1-\varepsilon}\max_{\ell\in\mathcal{N}}\ell^{\mathsf{T}}x.∥ italic_x ∥ ≤ divide start_ARG 1 end_ARG start_ARG 1 - italic_ε end_ARG roman_max start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_N end_POSTSUBSCRIPT roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_x .

It follows that, for any r>0𝑟0r>0italic_r > 0 and any sub-Gaussian vector Xn𝑋superscript𝑛X\in\mathbb{R}^{n}italic_X ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT with variance proxy σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT,

(Xr)(11εmax𝒩𝖳Xr)(𝒩𝖳X1εr).delimited-∥∥𝑋𝑟11𝜀subscript𝒩superscript𝖳𝑋𝑟subscript𝒩superscript𝖳𝑋1𝜀𝑟\begin{split}&\mathbb{P}\left(\|X\|\geq r\right)\leq\mathbb{P}\left(\frac{1}{1% -\varepsilon}\max_{\ell\in\mathcal{N}}\ell^{\mathsf{T}}X\geq r\right)\\ \leq&\mathbb{P}\left(\bigcup_{\ell\in\mathcal{N}}\frac{\ell^{\mathsf{T}}X}{1-% \varepsilon}\geq r\right).\end{split}start_ROW start_CELL end_CELL start_CELL blackboard_P ( ∥ italic_X ∥ ≥ italic_r ) ≤ blackboard_P ( divide start_ARG 1 end_ARG start_ARG 1 - italic_ε end_ARG roman_max start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_N end_POSTSUBSCRIPT roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_X ≥ italic_r ) end_CELL end_ROW start_ROW start_CELL ≤ end_CELL start_CELL blackboard_P ( ⋃ start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_N end_POSTSUBSCRIPT divide start_ARG roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_X end_ARG start_ARG 1 - italic_ε end_ARG ≥ italic_r ) . end_CELL end_ROW (54)

Since 1norm1\|\ell\|\leq 1∥ roman_ℓ ∥ ≤ 1 for 𝒩𝒩\ell\in\mathcal{N}roman_ℓ ∈ caligraphic_N, we have

(𝖳X1εr,𝒩)(𝖳X(1ε)r,𝒩).formulae-sequencesuperscript𝖳𝑋1𝜀𝑟𝒩formulae-sequencesuperscript𝖳𝑋norm1𝜀𝑟𝒩\mathbb{P}\left(\frac{\ell^{\mathsf{T}}X}{1-\varepsilon}\geq r,~{}\ell\in% \mathcal{N}\right)\leq\mathbb{P}\left(\frac{\ell^{\mathsf{T}}X}{\|\ell\|(1-% \varepsilon)}\geq r,~{}\ell\in\mathcal{N}\right).blackboard_P ( divide start_ARG roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_X end_ARG start_ARG 1 - italic_ε end_ARG ≥ italic_r , roman_ℓ ∈ caligraphic_N ) ≤ blackboard_P ( divide start_ARG roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_X end_ARG start_ARG ∥ roman_ℓ ∥ ( 1 - italic_ε ) end_ARG ≥ italic_r , roman_ℓ ∈ caligraphic_N ) . (55)

By the definition of sub-Gaussian vector, we know 𝖳Xsuperscript𝖳𝑋norm\frac{\ell^{\mathsf{T}}X}{\|\ell\|}divide start_ARG roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_X end_ARG start_ARG ∥ roman_ℓ ∥ end_ARG is sub-Gaussian with variance proxy σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for any 𝒩𝒩\ell\in\mathcal{N}roman_ℓ ∈ caligraphic_N. By Hoeffding’s Inequality,

(𝖳X(1ε)r,𝒩)e(1ε)2r22σ2.formulae-sequencesuperscript𝖳𝑋norm1𝜀𝑟𝒩superscript𝑒superscript1𝜀2superscript𝑟22superscript𝜎2\mathbb{P}\left(\frac{\ell^{\mathsf{T}}X}{\|\ell\|(1-\varepsilon)}\geq r,~{}% \ell\in\mathcal{N}\right)\leq e^{-\frac{(1-\varepsilon)^{2}r^{2}}{2\sigma^{2}}}.blackboard_P ( divide start_ARG roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_X end_ARG start_ARG ∥ roman_ℓ ∥ ( 1 - italic_ε ) end_ARG ≥ italic_r , roman_ℓ ∈ caligraphic_N ) ≤ italic_e start_POSTSUPERSCRIPT - divide start_ARG ( 1 - italic_ε ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT . (56)

Combining (54)-(56) and taking union bound over 𝒩𝒩\ell\in\mathcal{N}roman_ℓ ∈ caligraphic_N, we obtain

(Xr)(𝒩𝖳X1εr)|𝒩|e(1ε)2r22σ2(1+2ε)ne(1ε)2r22σ2.delimited-∥∥𝑋𝑟subscript𝒩superscript𝖳𝑋1𝜀𝑟𝒩superscript𝑒superscript1𝜀2superscript𝑟22superscript𝜎2superscript12𝜀𝑛superscript𝑒superscript1𝜀2superscript𝑟22superscript𝜎2\begin{split}&\mathbb{P}\left(\|X\|\geq r\right)\leq\mathbb{P}\left(\bigcup_{% \ell\in\mathcal{N}}\frac{\ell^{\mathsf{T}}X}{1-\varepsilon}\geq r\right)\\ \leq&|\mathcal{N}|e^{-\frac{(1-\varepsilon)^{2}r^{2}}{2\sigma^{2}}}\leq(1+% \frac{2}{\varepsilon})^{n}e^{-\frac{(1-\varepsilon)^{2}r^{2}}{2\sigma^{2}}}.% \end{split}start_ROW start_CELL end_CELL start_CELL blackboard_P ( ∥ italic_X ∥ ≥ italic_r ) ≤ blackboard_P ( ⋃ start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_N end_POSTSUBSCRIPT divide start_ARG roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT italic_X end_ARG start_ARG 1 - italic_ε end_ARG ≥ italic_r ) end_CELL end_ROW start_ROW start_CELL ≤ end_CELL start_CELL | caligraphic_N | italic_e start_POSTSUPERSCRIPT - divide start_ARG ( 1 - italic_ε ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT ≤ ( 1 + divide start_ARG 2 end_ARG start_ARG italic_ε end_ARG ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - divide start_ARG ( 1 - italic_ε ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT . end_CELL end_ROW (57)

To ensure a confidence level δ𝛿\deltaitalic_δ, which means the right-hand side of (57) δabsent𝛿\leq\delta≤ italic_δ, r𝑟ritalic_r should satisfy

r22σ2(1ε)2(nlog(1+2ε)+log1δ).superscript𝑟22superscript𝜎2superscript1𝜀2𝑛12𝜀1𝛿r^{2}\geq\frac{2\sigma^{2}}{(1-\varepsilon)^{2}}(n\log(1+\frac{2}{\varepsilon}% )+\log\frac{1}{\delta}).italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ divide start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( 1 - italic_ε ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ( italic_n roman_log ( 1 + divide start_ARG 2 end_ARG start_ARG italic_ε end_ARG ) + roman_log divide start_ARG 1 end_ARG start_ARG italic_δ end_ARG ) . (58)

Then (20) follows by taking the square root. This completes the proof.

-B Proof of Lemma V.2

(i) For any λ𝜆\lambda\in\mathbb{R}italic_λ ∈ blackboard_R and η1,η2𝒮n1subscript𝜂1subscript𝜂2superscript𝒮𝑛1\eta_{1},\eta_{2}\in\mathcal{S}^{n-1}italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT, we have

𝔼𝒮n1(eλ,η1)=𝔼𝒮n1eλ,η2).\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}(e^{\lambda\langle\ell,\eta_{1}\rangle})% =\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}e^{\lambda\langle\ell,\eta_{2}\rangle}).blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT ) = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT ) . (59)

It follows that, for any xnx\in{}^{n}italic_x ∈ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT and η𝒮n1𝜂superscript𝒮𝑛1\eta\in\mathcal{S}^{n-1}italic_η ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT,

Φn,λ(x)=𝔼𝒮n1(eλ,x)=𝔼𝒮n1(eλx,xx)=𝔼𝒮n1(eλx,η)=Φn,λ(xη).subscriptΦ𝑛𝜆𝑥subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆𝑥subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆norm𝑥𝑥norm𝑥subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆norm𝑥𝜂subscriptΦ𝑛𝜆delimited-∥∥𝑥𝜂\begin{split}\Phi_{n,\lambda}(x)&=\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(% e^{\lambda\langle\ell,x\rangle}\right)\\ &=\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\|x\|\langle\ell,\frac% {x}{\|x\|}\rangle}\right)\\ &=\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\|x\|\langle\ell,\eta% \rangle}\right)\\ &=\Phi_{n,\lambda}(\|x\|\,\eta).\end{split}start_ROW start_CELL roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) end_CELL start_CELL = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , divide start_ARG italic_x end_ARG start_ARG ∥ italic_x ∥ end_ARG ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( ∥ italic_x ∥ italic_η ) . end_CELL end_ROW (60)

(ii) By part (i), Φn,λ(x)=Φn,λ(xη)subscriptΦ𝑛𝜆𝑥subscriptΦ𝑛𝜆norm𝑥𝜂\Phi_{n,\lambda}(x)=\Phi_{n,\lambda}(\|x\|\eta)roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) = roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( ∥ italic_x ∥ italic_η ) for any η𝒮n1𝜂superscript𝒮𝑛1\eta\in\mathcal{S}^{n-1}italic_η ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. Taking the derivative of Φn,λ(x)subscriptΦ𝑛𝜆𝑥\Phi_{n,\lambda}(x)roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) over xnorm𝑥\|x\|∥ italic_x ∥ when x0norm𝑥0\|x\|\neq 0∥ italic_x ∥ ≠ 0:

dΦn,λ(x)dx=ddx𝔼𝒮n1(eλx,η)=𝔼𝒮n1(λ,ηeλx,η)=1x𝔼𝒮n1(λx,ηeλx,η).𝑑subscriptΦ𝑛𝜆𝑥𝑑norm𝑥𝑑𝑑norm𝑥subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆norm𝑥𝜂subscript𝔼similar-tosuperscript𝒮𝑛1𝜆𝜂superscript𝑒𝜆norm𝑥𝜂1norm𝑥subscript𝔼similar-tosuperscript𝒮𝑛1𝜆delimited-∥∥𝑥𝜂superscript𝑒𝜆norm𝑥𝜂\begin{split}&\frac{d\Phi_{n,\lambda}(x)}{d\|x\|}=\frac{d}{d\|x\|}\mathbb{E}_{% \ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\|x\|\langle\ell,\eta\rangle}\right)% \\ &=\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(\lambda\langle\ell,\eta\rangle e% ^{\lambda\|x\|\langle\ell,\eta\rangle}\right)\\ &=\frac{1}{\|x\|}\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(\lambda\|x\|% \langle\ell,\eta\rangle e^{\lambda\|x\|\langle\ell,\eta\rangle}\right).\end{split}start_ROW start_CELL end_CELL start_CELL divide start_ARG italic_d roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) end_ARG start_ARG italic_d ∥ italic_x ∥ end_ARG = divide start_ARG italic_d end_ARG start_ARG italic_d ∥ italic_x ∥ end_ARG blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_λ ⟨ roman_ℓ , italic_η ⟩ italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG ∥ italic_x ∥ end_ARG blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT ) . end_CELL end_ROW (61)

Set y=eλx,η𝑦superscript𝑒𝜆norm𝑥𝜂y=e^{\lambda\|x\|\langle\ell,\eta\rangle}italic_y = italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT. Applying Jensen’s Inequality over the convex function ylogy𝑦𝑦y\log yitalic_y roman_log italic_y, we arrive at

𝔼𝒮n1(λx,ηeλx,η)𝔼𝒮n1(λx,η)𝔼𝒮n1(eλx,η)=0.subscript𝔼similar-tosuperscript𝒮𝑛1𝜆delimited-∥∥𝑥𝜂superscript𝑒𝜆norm𝑥𝜂subscript𝔼similar-tosuperscript𝒮𝑛1𝜆delimited-∥∥𝑥𝜂subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆norm𝑥𝜂0\begin{split}&\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(\lambda\|x\|\langle% \ell,\eta\rangle e^{\lambda\|x\|\langle\ell,\eta\rangle}\right)\\ \geq&\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(\lambda\|x\|\langle\ell,\eta% \rangle\right)\,\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\|x\|% \langle\ell,\eta\rangle}\right)=0.\end{split}start_ROW start_CELL end_CELL start_CELL blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL ≥ end_CELL start_CELL blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ ) blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_x ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT ) = 0 . end_CELL end_ROW

Thus, dΦn,λ(x)dx0𝑑subscriptΦ𝑛𝜆𝑥𝑑norm𝑥0\frac{d\Phi_{n,\lambda}(x)}{d\|x\|}\geq 0divide start_ARG italic_d roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) end_ARG start_ARG italic_d ∥ italic_x ∥ end_ARG ≥ 0 when x0norm𝑥0\|x\|\neq 0∥ italic_x ∥ ≠ 0. When x=0norm𝑥0\|x\|=0∥ italic_x ∥ = 0, obviously Φn,λ(x)=1subscriptΦ𝑛𝜆𝑥1\Phi_{n,\lambda}(x)=1roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) = 1 and dΦn,λ(x)dx=0𝑑subscriptΦ𝑛𝜆𝑥𝑑norm𝑥0\frac{d\Phi_{n,\lambda}(x)}{d\|x\|}=0divide start_ARG italic_d roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_x ) end_ARG start_ARG italic_d ∥ italic_x ∥ end_ARG = 0. This completes the proof.

-C Proof of Lemma V.3

Let τxτmaps-to𝜏subscript𝑥𝜏\tau\mapsto x_{\tau}italic_τ ↦ italic_x start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT and τyτmaps-to𝜏subscript𝑦𝜏\tau\mapsto y_{\tau}italic_τ ↦ italic_y start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT be two trajectories of the system (4). Since μ(Dxf(x,u,t))0𝜇subscript𝐷𝑥𝑓𝑥𝑢𝑡0\mu(D_{x}f(x,u,t))\leq 0italic_μ ( italic_D start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT italic_f ( italic_x , italic_u , italic_t ) ) ≤ 0, for every x,u,t×n𝒰×0x,u,t\in{}^{n}\times\mathcal{U}\times{}_{\geq 0}italic_x , italic_u , italic_t ∈ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT × caligraphic_U × start_FLOATSUBSCRIPT ≥ 0 end_FLOATSUBSCRIPT, we get [28, Theorem 36]

xτyτxtyt, for all τt.formulae-sequencenormsubscript𝑥𝜏subscript𝑦𝜏normsubscript𝑥𝑡subscript𝑦𝑡 for all 𝜏𝑡\displaystyle\|x_{\tau}-y_{\tau}\|\leq\|x_{t}-y_{t}\|,\mbox{ for all }\tau\geq t.∥ italic_x start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ∥ ≤ ∥ italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∥ , for all italic_τ ≥ italic_t .

Using Lemma V.2(ii), for every τt𝜏𝑡\tau\geq titalic_τ ≥ italic_t,

𝔼𝒮n1(eλ,xτyτ)𝔼𝒮n1(eλ,xtyt).subscript𝔼superscript𝒮𝑛1superscript𝑒𝜆subscript𝑥𝜏subscript𝑦𝜏subscript𝔼superscript𝒮𝑛1superscript𝑒𝜆subscript𝑥𝑡subscript𝑦𝑡\displaystyle\mathbb{E}_{\ell\in\mathcal{S}^{n-1}}(e^{\lambda\langle\ell,x_{% \tau}-y_{\tau}\rangle})\leq\mathbb{E}_{\ell\in\mathcal{S}^{n-1}}(e^{\lambda% \langle\ell,x_{t}-y_{t}\rangle}).blackboard_E start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT ) ≤ blackboard_E start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT ) .

This implies that, for every τt𝜏𝑡\tau\geq titalic_τ ≥ italic_t,

𝔼𝒮n1(1τt(eλ,xτyτeλ,xtyt))0.subscript𝔼superscript𝒮𝑛11𝜏𝑡superscript𝑒𝜆subscript𝑥𝜏subscript𝑦𝜏superscript𝑒𝜆subscript𝑥𝑡subscript𝑦𝑡0\displaystyle\mathbb{E}_{\ell\in\mathcal{S}^{n-1}}\left(\tfrac{1}{\tau-t}\left% (e^{\lambda\langle\ell,x_{\tau}-y_{\tau}\rangle}-e^{\lambda\langle\ell,x_{t}-y% _{t}\rangle}\right)\right)\leq 0.blackboard_E start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_τ - italic_t end_ARG ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT ) ) ≤ 0 .

Taking the limit as τt0+𝜏𝑡superscript0\tau-t\to 0^{+}italic_τ - italic_t → 0 start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT, we have

𝔼𝒮n1(eλ,xtytλ𝖳(f(xt,ut,t)f(yt,ut,t)))0.subscript𝔼superscript𝒮𝑛1superscript𝑒𝜆subscript𝑥𝑡subscript𝑦𝑡𝜆superscript𝖳𝑓subscript𝑥𝑡subscript𝑢𝑡𝑡𝑓subscript𝑦𝑡subscript𝑢𝑡𝑡0\displaystyle\mathbb{E}_{\ell\in\mathcal{S}^{n-1}}\left(e^{\lambda\langle\ell,% x_{t}-y_{t}\rangle}\lambda\ell^{\mathsf{T}}(f(x_{t},u_{t},t)-f(y_{t},u_{t},t))% \right)\leq 0.blackboard_E start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ roman_ℓ , italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT italic_λ roman_ℓ start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ( italic_f ( italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) - italic_f ( italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_t ) ) ) ≤ 0 .

The result follows by noting that xt,ytnx_{t},y_{t}\in{}^{n}italic_x start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT and ut𝒰subscript𝑢𝑡𝒰u_{t}\in\mathcal{U}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ caligraphic_U have been chosen arbitrarily.

-D Proof of Lemma V.4

Define random vector X~=QX~𝑋𝑄𝑋\tilde{X}=QXover~ start_ARG italic_X end_ARG = italic_Q italic_X, where Q𝕌nsimilar-to𝑄superscript𝕌𝑛Q\sim\mathbb{U}^{n}italic_Q ∼ blackboard_U start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is a random unitary matrix. By Lemma V.2(i), we have that for any η𝒮n1𝜂superscript𝒮𝑛1\eta\in\mathcal{S}^{n-1}italic_η ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT,

Φn,λ(X)=𝔼𝒮n1(eλX,XX)=𝔼𝒮n1(eλX,η)=𝔼𝒮n1(eλη,X)=𝔼Q𝕌n(eλη,QX),subscriptΦ𝑛𝜆𝑋subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆norm𝑋𝑋norm𝑋subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆norm𝑋𝜂subscript𝔼similar-tosuperscript𝒮𝑛1superscript𝑒𝜆𝜂norm𝑋subscript𝔼similar-to𝑄superscript𝕌𝑛superscript𝑒𝜆𝜂𝑄𝑋\begin{split}\Phi_{n,\lambda}(X)&=\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(% e^{\lambda\|X\|\langle\ell,\frac{X}{\|X\|}\rangle}\right)=\mathbb{E}_{\ell\sim% \mathcal{S}^{n-1}}\left(e^{\lambda\|X\|\langle\ell,\eta\rangle}\right)\\ &=\mathbb{E}_{\ell\sim\mathcal{S}^{n-1}}\left(e^{\lambda\langle\eta,\ell\|X\|% \rangle}\right)=\mathbb{E}_{Q\sim\mathbb{U}^{n}}\left(e^{\lambda\langle\eta,QX% \rangle}\right),\end{split}start_ROW start_CELL roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X ) end_CELL start_CELL = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_X ∥ ⟨ roman_ℓ , divide start_ARG italic_X end_ARG start_ARG ∥ italic_X ∥ end_ARG ⟩ end_POSTSUPERSCRIPT ) = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ∥ italic_X ∥ ⟨ roman_ℓ , italic_η ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = blackboard_E start_POSTSUBSCRIPT roman_ℓ ∼ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ italic_η , roman_ℓ ∥ italic_X ∥ ⟩ end_POSTSUPERSCRIPT ) = blackboard_E start_POSTSUBSCRIPT italic_Q ∼ blackboard_U start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ italic_η , italic_Q italic_X ⟩ end_POSTSUPERSCRIPT ) , end_CELL end_ROW

where the last “===” uses the fact that Q𝒮n1𝑄superscript𝒮𝑛1Q\ell\in\mathcal{S}^{n-1}italic_Q roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT for any 𝒮n1superscript𝒮𝑛1\ell\in\mathcal{S}^{n-1}roman_ℓ ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. By (24), we obtain

𝔼X~(eλη,X~)=𝔼X𝔼Q(eλη,QX)=𝔼X(Φn,λ(X))eλ2σ22,λ,η𝒮n1.\begin{split}&\mathbb{E}_{\tilde{X}}\left(e^{\lambda\langle\eta,\tilde{X}% \rangle}\right)=\mathbb{E}_{X}\mathbb{E}_{Q}\left(e^{\lambda\langle\eta,QX% \rangle}\right)\\ =&\mathbb{E}_{X}\left(\Phi_{n,\lambda}(X)\right)\leq e^{\frac{\lambda^{2}% \sigma^{2}}{2}},\quad\forall\lambda\in\mathbb{R},~{}\forall\eta\in\mathcal{S}^% {n-1}.\end{split}start_ROW start_CELL end_CELL start_CELL blackboard_E start_POSTSUBSCRIPT over~ start_ARG italic_X end_ARG end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ italic_η , over~ start_ARG italic_X end_ARG ⟩ end_POSTSUPERSCRIPT ) = blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT blackboard_E start_POSTSUBSCRIPT italic_Q end_POSTSUBSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_λ ⟨ italic_η , italic_Q italic_X ⟩ end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL = end_CELL start_CELL blackboard_E start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT italic_n , italic_λ end_POSTSUBSCRIPT ( italic_X ) ) ≤ italic_e start_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT , ∀ italic_λ ∈ blackboard_R , ∀ italic_η ∈ caligraphic_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT . end_CELL end_ROW (62)

Therefore, X~~𝑋\tilde{X}over~ start_ARG italic_X end_ARG is sub-Gaussian with variance proxy σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. By Lemma V.1, X~~𝑋\tilde{X}over~ start_ARG italic_X end_ARG satisfies (20).

Finally, since X=QX=X~norm𝑋norm𝑄𝑋norm~𝑋\|X\|=\|QX\|=\|\tilde{X}\|∥ italic_X ∥ = ∥ italic_Q italic_X ∥ = ∥ over~ start_ARG italic_X end_ARG ∥ for any Q𝕌n𝑄superscript𝕌𝑛Q\in\mathbb{U}^{n}italic_Q ∈ blackboard_U start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, we conclude that X𝑋Xitalic_X also satisfies (20). This completes the proof.