A Safe First-Order Method for Pricing-Based Resource Allocation in Safety-Critical Networks

Berkay Turan      Spencer Hutchinson    Mahnoosh Alizadeh B. Turan, S. Hutchinson, and M. Alizadeh are with Dept. of ECE, UCSB, Santa Barbara, CA, USA. This work is supported by NSF grant #1847096. E-mails: [email protected], [email protected], [email protected]
Abstract

We introduce a novel algorithm for solving network utility maximization (NUM) problems that arise in resource allocation schemes over networks with known safety-critical constraints, where the constraints form an arbitrary convex and compact feasible set. Inspired by applications where customers’ demand can only be affected through posted prices and real-time two-way communication with customers is not available, we require an algorithm to generate “safe prices”. This means that at no iteration should the realized demand in response to the posted prices violate the safety constraints of the network. Thus, in contrast to existing distributed first-order methods, our algorithm, called safe pricing for NUM (SPNUM), is guaranteed to produce feasible primal iterates at all iterations. At the heart of the algorithm lie two key steps that must go hand in hand to guarantee safety and convergence: 1) applying a projected gradient method on a shrunk feasible set to get the desired demand, and 2) estimating the price response function of the users and determining the price so that the induced demand is close to the desired demand. We ensure safety by adjusting the shrinkage to account for the error between the induced demand and the desired demand. In addition, by gradually reducing the amount of shrinkage and the step size of the gradient method, we prove that the primal iterates produced by the SPNUM achieve a sublinear static regret of 𝒪(log(T))𝒪𝑇{\cal O}(\log{(T)})caligraphic_O ( roman_log ( italic_T ) ) after T𝑇Titalic_T time steps.

I Introduction

Many applications falling within the scope of resource allocation over networks, e.g., power distribution systems [1], congestion control in data networks [2, 3, 4], wireless cellular networks [5], and congestion control in urban traffic networks [6], deal with a multi-user optimization problem that falls under the general umbrella of network utility maximization (NUM) problems. The shared goal in these problems is to safely and efficiently allocate the shared resources to the users, where safety refers to satisfying the constraints of the system that depend on the resource allocation of all the users, and efficiency refers to the total utility of the users for a given resource allocation.

In NUM problems, the user-specific utility functions are assumed to be private to the users and therefore a centralized solution is not possible. Accordingly, distributed optimization methods have become suitable tools thanks to the separable structure of NUM problems [7, 8]. The idea is to decompose the main problem into sub-problems that can be solved by the individual users. The solutions of the sub-problems are then used to solve the main problem [9, 10], and this has been advocated for use in different applications, e.g., [11, 2]. Among the two main types of decomposition methods, primal decomposition methods correspond to a direct allocation of the resources by a central coordinator and solve the primal problem, whereas dual decomposition methods based on the Lagrangian dual problem [12] correspond to resource allocation via pricing and solve the dual problem [7]. Due to the structure of NUM problems, the latter approach has been widely adopted in the literature [7, 13, 14]. Additionally, it gives users the freedom of determining their own demand based on pricing-type signals.

Although there is extensive literature on pricing algorithms based on dual decomposition, the majority of studies focus on linear constraints [13, 14, 15, 16, 17], or on non-linear constraints with the assumption of separability and full user knowledge of these constraints [18, 19, 20]. Furthermore, none of the aforementioned studies propose an iterative pricing algorithm that induces resource demand satisfying the hard constraints of the problem during the iterative optimization process. Instead, these studies only provide bounds on the infeasibility amount of the resource demand (e.g., [14, 16]). Our preliminary work in [17] is an exception, which is limited to problems with linear inequality constraints characterized by binary matrices. Thus, pricing-based solutions can only be realized after convergence to a near-feasible point for resource allocation systems with safety-critical constraints. Therefore, implementation of such solutions requires a negotiation process through a two-way communication network if the system has hard safety-critical constraints, which can be considered impractical in many applications.

The research presented in this paper is motivated by network resource allocation applications in safety-critical systems, where a real-time two-way communication channel with the users is not available. One particularly relevant example of this type of application can be seen in the context of pricing-based electricity demand response. When attempting to change the users’ demand through posted prices, users determine their own electricity consumption to minimize their electricity bill, and no further control on the users’ demand is feasible. As such, these prices must be set such that the realized demand does not violate the physical constraints of the electric grid [21]. This is necessary to ensure the safe and reliable operation of the grid, as violating these physical constraints could have serious reliability implications. As such, many previous works to determine prices either directly solicit all the users’ preferences and solve for the prices centrally [7, 22], or employ distributed optimization methods that require back-and-forth communication with the users to converge to an optimal and grid-safe price [7, 11]. Both categories of methods have proven to be hard to implement in practical setups, motivating new research on solutions that do not require active customer engagement and still retain safe grid operations [23, 24]. In light of this motivating example, the solution we devise to determine a pricing-based solution for NUM involves a number of key considerations:

  1. 1.

    The users themselves determine their own resource demand in response to the prices, with the actual demand only becoming observable ex-post.

  2. 2.

    No negotiation or back-and-forth communication with the users is allowed, and no adjustment (curtailment) of demand is feasible, rendering existing works based on distributed optimization to determine prices inapplicable.

  3. 3.

    It is essential that the safety-critical hard constraints of the systems must not be violated by users’ resource demand at any time, even when their price response is unknown.

Accordingly, the main challenge this paper aims to overcome is how to determine the prices for resources such that:

  1. 1.

    No preference solicitation or negotiation with the users is required.

  2. 2.

    The induced resource demand of the users at every iteration always satisfies the constraints of the system (i.e., guaranteed primal feasibility).

  3. 3.

    The induced resource demand of the users is efficient, i.e., the total utility earned by the users is maximized (measured through regret bounds).

To this end, in this paper, we develop an iterative pricing algorithm to solve NUM problems with arbitrary convex and compact feasible sets, called safe pricing for NUM (SPNUM). We design our algorithm based solely on the realized demand in response to prices and communicate to the users only the prices for the resources at each iteration. Our contributions can be summarized as follows:

  • We introduce a novel algorithm, the SPNUM, for solving NUM problems with arbitrary convex and compact feasible sets through pricing. Our algorithm iteratively designs prices and allows users the freedom of determining their own decision variable based on prices according to their own profit maximization problem (without imposing any iterative variable update rule on the users).

  • We characterize a principled way to choose algorithm parameters to guarantee feasible primal iterates at all iterations. Furthermore, we prove that the static regret incurred by the feasible primal iterates produced by the SPNUM, i.e., the cumulative gap between the optimal objective value and the objective function evaluated at the primal iterates, up to time T𝑇Titalic_T is bounded by 𝒪(log(T))𝒪𝑇{\cal O}(\log{(T)})caligraphic_O ( roman_log ( italic_T ) ).

  • We numerically evaluate our algorithm to support our theoretical findings and compare its performance to existing first-order distributed methods for NUM problems.

To the best of the authors’ knowledge, no previous work has studied pricing algorithms for NUM problems on arbitrary convex feasible sets that are unknown to the users, even without consideration of safety. While primal-dual algorithms [25, 26, 27, 28] can handle non-separable arbitrary convex feasible sets, they rely on a primal update rule users need to follow in order to converge as opposed to maximizing their own profit based on observed prices. To this end, our contributions extend beyond safety, since SPNUM solves NUM problems on arbitrary convex feasible sets by iteratively designing prices and allowing the users to determine their own resource demand according to their own profit maximization problem.

The primal feasibility and the regret guarantees of the SPNUM result from a combination of two ingredients: 1) given prices and demand at a given instant, we apply a projected gradient method on a shrunk feasible set to get the next desired demand, and 2) we estimate the price response function of the users around the current prices and determine the next prices so that the induced demand is close to the desired demand. To ensure the algorithm behaves as a projected gradient method, the induced demand must be in the strict interior of the feasible set. The algorithm operates on a shrunk feasible set to account for the error between induced and desired demand, and gradually reduces shrinkage and step size to converge to the optimal solution.

Related work: Besides dual (sub)gradient methods, a few other branches of literature study a similar problem to ours. We highlight how those lines of work do not meet our particular design criteria and what differentiates our work from them. Additional details on distributed optimization algorithms and their classifications can be found in the surveys [29, 30].

  1. 1.

    Primal-dual methods: Primal-dual methods tackle multi-user optimization problems with arbitrary convex global constraints by applying a projected gradient descent/ascent on the primal/dual variables of the Lagrangian [25, 26, 27, 28]. The dual variables are updated using the aggregate resource demand information of the users and can be used for pricing of the resources. Therefore the update rule for the dual variables meets our design goals. However, the primal variables, i.e., the resource demand of the users, are updated by applying one step of gradient descent instead of solving for the profit-maximizing optimal demand in response to prices. Accordingly, these algorithms do not resemble the selfish profit-maximizing behavior of the users we adopt in this paper.

  2. 2.

    Projected gradient methods: The main goal of the projected gradient methods is to maintain feasibility by projecting the primal variables on the feasible convex set after each update step. Scholars have extensively studied the convergence properties of the projected gradient methods under different assumptions [9, 31, 32]. On the other hand, the main challenge brought by our setup is that the primal variables are controlled solely by the users and cannot be manipulated (e.g., projected). Even though we can determine a feasible desired resource allocation by means of a projected gradient method, the prices that induce such resource demand are unknown due to the privacy of the utility functions, which brings unique challenges not addressed by the previous literature.

  3. 3.

    Interior point methods: Interior point methods are commonly used to solve inequality-constrained problems by using barrier functions to convert them into a sequence of equality-constrained problems, which are then solved using Newton’s method [33]. While producing feasible iterates, the use of Newton’s method requires the Hessian, which is often not available in practical applications, such as demand response without two-way communications. To address this limitation, previous works such as [34] and [35] have proposed feasible interior point methods that approximate the Hessian using first or second-order information exchange. However, these methods do not match the profit maximization rule we would like to preserve in this paper, which allows users to freely determine their resource consumption in response to posted prices. Closest to our setup and design goals in this paper would be [36, 37], where separable optimization problems with linear constraints are considered. While [36] proposes a Newton-like dual update that approximates the Hessian using first-order information, only the asymptotic convergence of the algorithm is proven and the feasibility of primal iterates is not guaranteed. [37] proposes an interior point method using Lagrangian dual decomposition with theoretical guarantees, but requires the exact Hessian for dual updates.

  4. 4.

    Constrained Online Convex Optimization: The constrained online convex optimization literature (e.g., [38, 39, 40]) aims to minimize regret while establishing bounds on the constraint violation by employing iterative update algorithms on the primal variables. A common method in constrained online convex optimization literature is updating the primal variables (i.e., resource demand of the users) directly using the gradient of the objective function as the feedback. On the contrary, our setup only allows us to update the prices and get the primal variables (i.e., resource demand of the users) as feedback afterward, where the resource demand is determined by the users according to their own profit-maximization problem. This introduces a novel challenge because both the regret and the constraints are evaluated on the primal variables, and we somehow need to set the prices such that 1) the induced demand is in the feasible region and 2) the regret incurred by the induced demand is minimized.

Paper Organization: The remainder of the paper is organized as follows. In Section II, we formalize the problem setup. In Section III, we describe the SPNUM (Algorithm 1) and in Section IV, we prove its feasibility and regret guarantees. In Section V, we provide a numerical study demonstrating the efficacy of the SDGM.

Notation and Basic Definitions: We denote the set of real numbers by {\mathbb{R}}blackboard_R and the set of non-negative real numbers by +subscript{\mathbb{R}}_{+}blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT. For vectors, \|\cdot\|∥ ⋅ ∥ denotes the standard Euclidean norm and p\|\cdot\|_{p}∥ ⋅ ∥ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT denotes the p𝑝pitalic_p-norm. For matrices, \|\cdot\|∥ ⋅ ∥ denotes the matrix norm. Given a positive integer n>0𝑛0n>0italic_n > 0, [n]delimited-[]𝑛[n][ italic_n ] denotes the set of integers {1,2,,n}12𝑛\{1,2,\dots,n\}{ 1 , 2 , … , italic_n }. For two vectors x,yd𝑥𝑦superscript𝑑x,y\in{\mathbb{R}}^{d}italic_x , italic_y ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, x,y𝑥𝑦\langle x,y\rangle⟨ italic_x , italic_y ⟩ denotes the inner product of x𝑥xitalic_x and y𝑦yitalic_y. Given a vector x=[x1,x2,,xn]d𝑥superscriptsuperscriptsubscript𝑥1topsuperscriptsubscript𝑥2topsuperscriptsubscript𝑥𝑛toptopsuperscript𝑑x=[x_{1}^{\top},~{}x_{2}^{\top},~{}\dots,~{}x_{n}^{\top}]^{\top}\in{\mathbb{R}% }^{d}italic_x = [ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, xidisubscript𝑥𝑖superscriptsubscript𝑑𝑖x_{i}\in{\mathbb{R}}^{d_{i}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT denotes the i𝑖iitalic_i’th block of x𝑥xitalic_x. For a matrix Am×n𝐴superscript𝑚𝑛A\in\mathbb{R}^{m\times n}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT, Ajsubscript𝐴𝑗A_{j}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT denotes the j𝑗jitalic_j’th row of A𝐴Aitalic_A, A:,jsubscript𝐴:𝑗A_{:,j}italic_A start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT denotes the j𝑗jitalic_j’th column of A𝐴Aitalic_A. Given a matrix Am×m𝐴superscript𝑚𝑚{A}\in{\mathbb{R}}^{m\times m}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_m end_POSTSUPERSCRIPT, diag(A)mdiag𝐴superscript𝑚\textnormal{diag}(A)\in{\mathbb{R}}^{m}diag ( italic_A ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is the vector of the diagonals of A𝐴Aitalic_A, κ(A)𝜅𝐴\kappa(A)italic_κ ( italic_A ) is the condition number of A𝐴Aitalic_A, and σmin(A)subscript𝜎𝐴\sigma_{\min}(A)italic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_A )/σmax(A)subscript𝜎𝐴\sigma_{\max}(A)italic_σ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_A ) are the minimum/maximum singular values of A𝐴Aitalic_A. Given a function f:𝒳d:𝑓𝒳superscript𝑑f:{\cal X}\subseteq{\mathbb{R}}^{d}\rightarrow{\mathbb{R}}italic_f : caligraphic_X ⊆ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT → blackboard_R, f𝑓\nabla f∇ italic_f denotes the gradient of f𝑓fitalic_f, kfsuperscript𝑘𝑓\nabla^{k}f∇ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_f denotes the k𝑘kitalic_k’th order gradient of f𝑓fitalic_f, and domfdom𝑓\textnormal{dom}fdom italic_f denotes the domain 𝒳𝒳{\cal X}caligraphic_X of f𝑓fitalic_f. Given two vectors x,ym𝑥𝑦superscript𝑚x,y\in{\mathbb{R}}^{m}italic_x , italic_y ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, xy𝑥𝑦x\leq yitalic_x ≤ italic_y implies element-wise inequality. Given a set 𝒳d𝒳superscript𝑑{\cal X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, 𝒳intsuperscript𝒳int{\cal X}^{\textnormal{int}}caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT denotes the interior of 𝒳𝒳{\cal X}caligraphic_X. Given a convex and compact set 𝒳d𝒳superscript𝑑{\cal X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and a point xd𝑥superscript𝑑x\in{\mathbb{R}}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, Π𝒳(x)subscriptΠ𝒳𝑥{\Pi}_{\cal X}(x)roman_Π start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( italic_x ) denotes the Euclidian projection of x𝑥{x}italic_x onto 𝒳𝒳{\cal X}caligraphic_X. We denote the closed and the open Euclidean ball with radius r𝑟ritalic_r centered at origin as ¯(r)¯𝑟\bar{\mathcal{B}}(r)over¯ start_ARG caligraphic_B end_ARG ( italic_r ) and (r)𝑟{\cal B}(r)caligraphic_B ( italic_r ), respectively. Idsubscript𝐼𝑑I_{d}italic_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT denotes the identity matrix of size d𝑑ditalic_d, 𝟏dsubscript1𝑑\bm{1}_{d}bold_1 start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT denotes the vector of all 1’s with dimension d𝑑ditalic_d, and eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denotes the unit vector with 1111 in i𝑖iitalic_i’th dimension and 00 everywhere else. The nomenclature can be found in the Appendix.

Definition 1.

A differentiable function f()𝑓f(\cdot)italic_f ( ⋅ ) is said to be 𝛍𝛍\bm{\mu}bold_italic_μ-strongly concave over the domain 𝒳𝒳{\cal X}caligraphic_X if there exists μ>0𝜇0\mu>0italic_μ > 0 such that

f(x2)f(x1),x1x2μx1x22𝑓subscript𝑥2𝑓subscript𝑥1subscript𝑥1subscript𝑥2𝜇superscriptnormsubscript𝑥1subscript𝑥22\langle\nabla f(x_{2})-\nabla f(x_{1}),x_{1}-x_{2}\rangle\geq\mu\|x_{1}-x_{2}% \|^{2}⟨ ∇ italic_f ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - ∇ italic_f ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⟩ ≥ italic_μ ∥ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (1)

holds for all x1,x2𝒳subscript𝑥1subscript𝑥2𝒳x_{1},x_{2}\in\cal Xitalic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_X.

Definition 2.

A differentiable function f()𝑓f(\cdot)italic_f ( ⋅ ) is said to be 𝐋𝐋\bm{L}bold_italic_L-smooth over the domain 𝒳𝒳{\cal X}caligraphic_X if there exists L>0𝐿0L>0italic_L > 0 such that

f(x1)f(x2)Lx1x2norm𝑓subscript𝑥1𝑓subscript𝑥2𝐿normsubscript𝑥1subscript𝑥2\|\nabla f(x_{1})-\nabla f(x_{2})\|\leq L\|x_{1}-x_{2}\|∥ ∇ italic_f ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - ∇ italic_f ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∥ ≤ italic_L ∥ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ (2)

holds for all x1,x2𝒳subscript𝑥1subscript𝑥2𝒳x_{1},x_{2}\in\cal Xitalic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_X.

Definition 3.

A function f()𝑓f(\cdot)italic_f ( ⋅ ) is said to be 𝐌𝐌\bm{M}bold_italic_M-Lipschitz continuous over the domain 𝒳𝒳{\cal X}caligraphic_X if there exists M>0𝑀0M>0italic_M > 0 such that

f(x1)f(x2)Mx1x2norm𝑓subscript𝑥1𝑓subscript𝑥2𝑀normsubscript𝑥1subscript𝑥2\|f(x_{1})-f(x_{2})\|\leq M\|x_{1}-x_{2}\|∥ italic_f ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - italic_f ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∥ ≤ italic_M ∥ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ (3)

holds for all x1,x2𝒳subscript𝑥1subscript𝑥2𝒳x_{1},x_{2}\in\cal Xitalic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_X.

II Problem Setup

We study the standard NUM problem [2], where the goal is to allocate resources to n𝑛nitalic_n users subject to a set of coupling constraints such that the total utility of the users is maximized. It can be formulated as the following optimization problem:

maxxdomfd𝑥dom𝑓superscript𝑑\displaystyle\underset{x\in\textnormal{dom}f\subseteq\mathbb{R}^{d}}{\max}start_UNDERACCENT italic_x ∈ dom italic_f ⊆ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT end_UNDERACCENT start_ARG roman_max end_ARG f(x)=i=1nfi(xi)𝑓𝑥superscriptsubscript𝑖1𝑛subscript𝑓𝑖subscript𝑥𝑖\displaystyle~{}f(x)=\sum_{i=1}^{n}f_{i}(x_{i})italic_f ( italic_x ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (4a)
s.t. x𝒳,𝑥𝒳\displaystyle~{}x\in{\cal X},italic_x ∈ caligraphic_X , (4b)

where fi()subscript𝑓𝑖f_{i}(\cdot)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ⋅ ) is the concave utility function of user i𝑖iitalic_i that depends on the disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT-dimensional vector of resource consumption, denoted by xidomfidisubscript𝑥𝑖domsubscript𝑓𝑖superscriptsubscript𝑑𝑖x_{i}\in\textnormal{dom}f_{i}\subseteq{\mathbb{R}}^{d_{i}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and 𝒳d𝒳superscript𝑑{\cal X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT is the convex and compact set of feasible resource allocations. We also have i[n]di=dsubscript𝑖delimited-[]𝑛subscript𝑑𝑖𝑑\sum_{i\in[n]}d_{i}=d∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_d, domf=i[n]domfidom𝑓subscriptproduct𝑖delimited-[]𝑛domsubscript𝑓𝑖\textnormal{dom}f=\prod_{i\in[n]}\textnormal{dom}f_{i}dom italic_f = ∏ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and define d¯=maxi[n]di¯𝑑subscript𝑖delimited-[]𝑛subscript𝑑𝑖\bar{d}=\max_{i\in[n]}d_{i}over¯ start_ARG italic_d end_ARG = roman_max start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

For all users i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], we define the set 𝒳i={xidi:x𝒳 s.t. xi is the i’th block of x}subscript𝒳𝑖conditional-setsubscript𝑥𝑖superscriptsubscript𝑑𝑖𝑥𝒳 s.t. subscript𝑥𝑖 is the 𝑖’th block of 𝑥{\cal X}_{i}=\{x_{i}\in{\mathbb{R}}^{d_{i}}:\exists x\in{\cal X}\textnormal{ s% .t. }x_{i}\textnormal{ is the }i\textnormal{'th block of }x\}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT : ∃ italic_x ∈ caligraphic_X s.t. italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the italic_i ’th block of italic_x } as the set of values that user i𝑖iitalic_i’s resource demand vector can take in the aggregate feasible set 𝒳𝒳{\cal X}caligraphic_X. Note that since 𝒳𝒳{\cal X}caligraphic_X is convex and compact, 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is convex and compact, i[n]for-all𝑖delimited-[]𝑛{\forall i\in[n]}∀ italic_i ∈ [ italic_n ]. Furthermore, if x𝒳𝑥𝒳x\in{\cal X}italic_x ∈ caligraphic_X, then xi𝒳isubscript𝑥𝑖subscript𝒳𝑖x_{i}\in{\cal X}_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and if x𝒳int𝑥superscript𝒳intx\in{\cal X}^{\textnormal{int}}italic_x ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, then xi𝒳iintsubscript𝑥𝑖superscriptsubscript𝒳𝑖intx_{i}\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT hold by definition. We make the following assumptions on the feasible set 𝒳𝒳{\cal X}caligraphic_X, and on the utility functions over 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i[n]for-all𝑖delimited-[]𝑛\forall i\in[n]∀ italic_i ∈ [ italic_n ].

Assumption 1.

The feasible set 𝒳𝒳{\cal X}caligraphic_X is a subset of domfdom𝑓\textnormal{dom}fdom italic_f, i.e., 𝒳domf𝒳dom𝑓{\cal X}\subseteq\textnormal{dom}fcaligraphic_X ⊆ dom italic_f. The diameter of the feasible set 𝒳𝒳{\cal X}caligraphic_X is bounded by R𝑅Ritalic_R, i.e., xyRnorm𝑥𝑦𝑅\|x-y\|\leq R∥ italic_x - italic_y ∥ ≤ italic_R, x,y𝒳for-all𝑥𝑦𝒳\forall x,y\in{\cal X}∀ italic_x , italic_y ∈ caligraphic_X. There exists a vector x~~𝑥\tilde{x}over~ start_ARG italic_x end_ARG in the interior of 𝒳𝒳{\cal X}caligraphic_X such that x~𝒳int~𝑥superscript𝒳int\tilde{x}\in{\cal X}^{\textnormal{int}}over~ start_ARG italic_x end_ARG ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT.

Assumption 2.

For all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], the utility function fi()subscript𝑓𝑖f_{i}(\cdot)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ⋅ ) is μ𝜇\muitalic_μ-strongly concave, L𝐿Litalic_L-smooth, M𝑀Mitalic_M-Lipschitz continuous, and has β𝛽\betaitalic_β-smooth gradient over 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

Example 1 (Utility function).

For instance, take fi(xi)=fα(xi)subscript𝑓𝑖subscript𝑥𝑖subscript𝑓𝛼subscript𝑥𝑖f_{i}(x_{i})=f_{\alpha}(x_{i})italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_f start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) to be an α𝛼\alphaitalic_α-fair utility function (see [41]) and let 𝒳i=[\stackunder[1.2pt]x i,x¯i]subscript𝒳𝑖\stackunderdelimited-[]1.2𝑝𝑡𝑥subscript 𝑖subscript¯𝑥𝑖{\cal X}_{i}=[\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}}_{i},\bar{x}% _{i}]caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ [ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] with \stackunder[1.2pt]x i>0\stackunderdelimited-[]1.2𝑝𝑡𝑥subscript 𝑖0\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}}_{i}>0[ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0. We have that fi(xi)1/\stackunder[1.2pt]x iαsubscript𝑓𝑖subscript𝑥𝑖1\stackunderdelimited-[]1.2𝑝𝑡𝑥superscriptsubscript 𝑖𝛼\nabla f_{i}(x_{i})\leq 1/{\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}% }}_{i}^{\alpha}∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ 1 / [ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT, α/\stackunder[1.2pt]x iα+12fi(xi)α/x¯iα+1-\alpha/{\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}}}_{i}^{\alpha+1}% \leq\nabla^{2}f_{i}^{(}x_{i})\leq-\alpha/{\bar{x}}_{i}^{\alpha+1}- italic_α / [ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α + 1 end_POSTSUPERSCRIPT ≤ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ - italic_α / over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α + 1 end_POSTSUPERSCRIPT, and α(α+1)/x¯iα+23fi(xi)α(α+1)/\stackunder[1.2pt]x iα+2𝛼𝛼1superscriptsubscript¯𝑥𝑖𝛼2superscript3subscript𝑓𝑖subscript𝑥𝑖𝛼𝛼1\stackunderdelimited-[]1.2𝑝𝑡𝑥superscriptsubscript 𝑖𝛼2\alpha(\alpha+1)/\bar{x}_{i}^{\alpha+2}\leq\nabla^{3}f_{i}(x_{i})\leq\alpha(% \alpha+1)/\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}}_{i}^{\alpha+2}italic_α ( italic_α + 1 ) / over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α + 2 end_POSTSUPERSCRIPT ≤ ∇ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ italic_α ( italic_α + 1 ) / [ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α + 2 end_POSTSUPERSCRIPT, x𝒳ifor-all𝑥subscript𝒳𝑖\forall x\in{\cal X}_{i}∀ italic_x ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Therefore, fi(xi)subscript𝑓𝑖subscript𝑥𝑖f_{i}(x_{i})italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is α/x¯iα+1𝛼superscriptsubscript¯𝑥𝑖𝛼1\alpha/{\bar{x}}_{i}^{\alpha+1}italic_α / over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α + 1 end_POSTSUPERSCRIPT-strongly concave, α/\stackunder[1.2pt]x iα+1𝛼\stackunderdelimited-[]1.2𝑝𝑡𝑥superscriptsubscript 𝑖𝛼1\alpha/{\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}}}_{i}^{\alpha+1}italic_α / [ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α + 1 end_POSTSUPERSCRIPT-smooth, and 1/\stackunder[1.2pt]x iα1\stackunderdelimited-[]1.2𝑝𝑡𝑥superscriptsubscript 𝑖𝛼1/{\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}}}_{i}^{\alpha}1 / [ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT-Lipschitz continuous, and has α(α+1)/\stackunder[1.2pt]x iα+2𝛼𝛼1\stackunderdelimited-[]1.2𝑝𝑡𝑥superscriptsubscript 𝑖𝛼2\alpha(\alpha+1)/\stackunder[1.2pt]{$x$}{\rule{3.44444pt}{0.32289pt}}_{i}^{% \alpha+2}italic_α ( italic_α + 1 ) / [ 1.2 italic_p italic_t ] italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α + 2 end_POSTSUPERSCRIPT-smooth gradient over 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

Under Assumption 2, the objective function (4a) is strongly concave with coefficient μ𝜇\muitalic_μ. Accordingly, the convex optimization problem (4) has a unique solution denoted by xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT and an optimal objective value denoted by fsuperscript𝑓f^{\star}italic_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT.

Since fi()subscript𝑓𝑖f_{i}(\cdot)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ⋅ ) are private to the users, (4) cannot be solved centrally. Therefore, distributed optimization methods based on the dual decomposition framework have been proposed in the literature (e.g., [7] for the case when 𝒳𝒳{\cal X}caligraphic_X is a polytope) in order to incentivize selfish users with private utility functions to follow the optimal global solution. The common high-level idea is to divide the main problem into subproblems that can be solved by the individual users upon observing a pricing signal, and iteratively design prices {p0,p1,}superscript𝑝0superscript𝑝1\{p^{0},p^{1},\dots\}{ italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_p start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , … } to converge to the optimal resource allocation vector xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT. In this framework, upon observing a price pidisubscript𝑝𝑖superscriptsubscript𝑑𝑖p_{i}\in{\mathbb{R}}^{d_{i}}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, each user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] determines their own decision variable according to their own profit maximization problem:

gi(pi)=argmaxxidomfifi(xi)pi,xi.subscript𝑔𝑖subscript𝑝𝑖subscript𝑥𝑖domsubscript𝑓𝑖argmaxsubscript𝑓𝑖subscript𝑥𝑖subscript𝑝𝑖subscript𝑥𝑖g_{i}(p_{i})=\underset{x_{i}\in\textnormal{dom}f_{i}}{\operatorname*{arg\,max}% }f_{i}(x_{i})-\langle p_{i},x_{i}\rangle.italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = start_UNDERACCENT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_arg roman_max end_ARG italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - ⟨ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ . (5)

We call gi()subscript𝑔𝑖g_{i}(\cdot)italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ⋅ ) the price response function of user i𝑖iitalic_i and let g(p)=[g1(p1),g2(p2),,gn(pn)]𝑔𝑝subscript𝑔1superscriptsubscript𝑝1topsubscript𝑔2superscriptsubscript𝑝2topsubscript𝑔𝑛superscriptsubscript𝑝𝑛topg(p)=[g_{1}(p_{1})^{\top},~{}g_{2}(p_{2})^{\top},\dots,~{}g_{n}(p_{n})^{\top}]italic_g ( italic_p ) = [ italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , … , italic_g start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] be the concatenated vector of price responses given a price vector pd𝑝superscript𝑑p\in{\mathbb{R}}^{d}italic_p ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT.

In the next section, we propose an algorithm to iteratively design pt,t1superscript𝑝𝑡for-all𝑡1p^{t},~{}\forall t\geq 1italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , ∀ italic_t ≥ 1, that produce feasible primal solutions, i.e., xt𝒳,t1formulae-sequencesuperscript𝑥𝑡𝒳for-all𝑡1x^{t}\in{\cal X},~{}\forall t\geq 1italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X , ∀ italic_t ≥ 1, where xit=gi(pit)superscriptsubscript𝑥𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡x_{i}^{t}=g_{i}(p_{i}^{t})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) is determined by user i𝑖iitalic_i through (5). In addition, the algorithm should produce primal iterates that result in a sublinear static regret per user, which is measured by

R(T)=1nt=1Tff(xt).𝑅𝑇1𝑛superscriptsubscript𝑡1𝑇superscript𝑓𝑓superscript𝑥𝑡R(T)=\frac{1}{n}\sum_{t=1}^{T}f^{\star}-f(x^{t}).italic_R ( italic_T ) = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) . (6)

It is worthwhile to highlight that even without the safety criterion, the literature on distributed optimization methods does not provide a distributed solution based on pricing to (4) with any type of convergence guarantees. Existing works in the literature 1) utilize a pricing algorithm based on the dual decomposition framework but consider linear constraints [13, 14, 15, 16, 17] or non-linear and separable constraints known by the users [18, 19, 20], or 2) solve the Lagrangian dual problem by primal-dual methods [25, 26, 27, 28], which restrict the users to follow a primal update method that cannot be enforced in the setting where users only care about maximizing their own profit dictated by (5). Therefore, a pricing algorithm that induces a sequence of primal iterates converging to the optimal solution of (4) with general convex and compact feasible sets 𝒳𝒳{\cal X}caligraphic_X is novel in the distributed optimization literature.

Additionally, we note that the definition of regret in (6) quantifies the difference between the efficiencies of the optimal resource allocation and the proposed algorithm up to time T𝑇Titalic_T. When the primal iterates {xt}t[T]subscriptsuperscript𝑥𝑡𝑡delimited-[]𝑇\{x^{t}\}_{t\in[T]}{ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_t ∈ [ italic_T ] end_POSTSUBSCRIPT are in the feasible set 𝒳𝒳{\cal X}caligraphic_X, users’ resource demand can actually be realized through the posted prices without waiting for the convergence of the algorithm, and therefore regret is a meaningful measure. On the other hand, although the above sum is computable for many of the existing works mentioned earlier (e.g., [14, 15] with linear constraints), they do not guarantee feasible primal iterates but only establish bounds on the amount of constraint violation at a given iteration t𝑡titalic_t. Therefore, solutions are only realizable after convergence to a near-feasible point for resource allocation systems with safety-critical constraints. As such, they can be viewed as complex negotiations with users over what their potential demand would be in response to different prices in order to converge to the optimal price, which renders regret a less meaningful measure. By incorporating primal feasibility into our design goals, we aim to continually allocate resources to the users through posted prices during the iterative optimization process and measure the overall efficiency of this process through regret.

III Safe Pricing Algorithm for NUM

In this section, we describe the price update algorithm we propose, called Safe Pricing for NUM (SPNUM), that produces feasible primal iterates satisfying a sublinear regret. To do so, we will use some definitions and results from [42] regarding the geometric properties of convex and compact sets. While the primary focus of [42] centers on a linear stochastic bandit setup that bears little resemblance to the NUM setup under study, the definitions of the shrunk set outlined in the former are applicable to the present context as well.

III-A Geometric Properties of the Feasible Set

The main ingredient that ensures the safety of SPNUM is that it operates on a shrunk feasible set, which is formally defined as follows:

Definition 4.

For a compact set 𝒳d𝒳superscript𝑑\mathcal{X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and a positive scalar Δ+Δsubscript\Delta\in{\mathbb{R}}_{+}roman_Δ ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT, we define the shrunk version of 𝒳𝒳\mathcal{X}caligraphic_X as 𝒳Δ:={x𝒳:x+v𝒳,v¯(Δ)}assignsubscript𝒳Δconditional-set𝑥𝒳formulae-sequence𝑥𝑣𝒳for-all𝑣¯Δ\mathcal{X}_{\Delta}:=\{x\in\mathcal{X}:x+v\in\mathcal{X},\forall v\in\bar{% \cal B}(\Delta)\}caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT := { italic_x ∈ caligraphic_X : italic_x + italic_v ∈ caligraphic_X , ∀ italic_v ∈ over¯ start_ARG caligraphic_B end_ARG ( roman_Δ ) }.

Example 2.

(Shrunk polytope) Let Am×d𝐴superscript𝑚𝑑A\in{\mathbb{R}}^{m\times d}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_d end_POSTSUPERSCRIPT and 𝒳={xd:Axc}𝒳conditional-set𝑥superscript𝑑𝐴𝑥𝑐\mathcal{X}=\{x\in\mathbb{R}^{d}:Ax\leq c\}caligraphic_X = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT : italic_A italic_x ≤ italic_c } be a polytope. The shrunk version of 𝒳𝒳{\cal X}caligraphic_X is defined as 𝒳Δ={xd:AjxcjΔAj,j[m]}subscript𝒳Δconditional-set𝑥superscript𝑑formulae-sequencesuperscriptsubscript𝐴𝑗top𝑥subscript𝑐𝑗Δnormsubscript𝐴𝑗𝑗delimited-[]𝑚{\cal X}_{\Delta}=\{x\in\mathbb{R}^{d}:A_{j}^{\top}x\leq c_{j}-\Delta\|A_{j}\|% ,~{}j\in[m]\}caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT : italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x ≤ italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - roman_Δ ∥ italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ , italic_j ∈ [ italic_m ] }.

Remark 1.

If 𝒳𝒳{\cal X}caligraphic_X is convex and compact, then 𝒳Δsubscript𝒳Δ{\cal X}_{\Delta}caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT is also convex and compact.111We can equivalently define 𝒳Δsubscript𝒳Δ\mathcal{X}_{\Delta}caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT using Minkowski subtraction. The Minkowski subtraction of sets A,Bd𝐴𝐵superscript𝑑A,B\subseteq\mathbb{R}^{d}italic_A , italic_B ⊆ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT is defined as AB:={ab:aA,bB}assignsymmetric-difference𝐴𝐵conditional-set𝑎𝑏formulae-sequence𝑎𝐴𝑏𝐵A\ominus B:=\{a-b:a\in A,b\in B\}italic_A ⊖ italic_B := { italic_a - italic_b : italic_a ∈ italic_A , italic_b ∈ italic_B }, or equivalently, AB=bB(Ab)symmetric-difference𝐴𝐵subscript𝑏𝐵𝐴𝑏A\ominus B=\bigcap_{b\in{B}}(A-b)italic_A ⊖ italic_B = ⋂ start_POSTSUBSCRIPT italic_b ∈ italic_B end_POSTSUBSCRIPT ( italic_A - italic_b ). Therefore, 𝒳Δ=𝒳(Δ)subscript𝒳Δsymmetric-difference𝒳Δ\mathcal{X}_{\Delta}=\mathcal{X}\ominus\mathcal{B}(\Delta)caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT = caligraphic_X ⊖ caligraphic_B ( roman_Δ ) is an intersection of convex and closed sets and hence is convex and closed [43, Section 3.1]. By Definition 4, 𝒳Δsubscript𝒳Δ{\cal X}_{\Delta}caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT is a subset of 𝒳𝒳{\cal X}caligraphic_X, and therefore bounded. A closed and bounded convex set is convex and compact.

Given the above definition of the shrunk version of a set, one can consider the maximum shrinkage that a set can withstand while still being nonempty. We introduce the maximum shrinkage of a set in the following definition.

Definition 5.

For a compact set 𝒳d𝒳superscript𝑑\mathcal{X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we define the maximum shrinkage of 𝒳𝒳\mathcal{X}caligraphic_X, as H𝒳:=sup{Δ:𝒳Δ}assignsubscript𝐻𝒳supremumconditional-setΔsubscript𝒳ΔH_{\mathcal{X}}:=\sup\{\Delta:\mathcal{X}_{\Delta}\neq\emptyset\}italic_H start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT := roman_sup { roman_Δ : caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ≠ ∅ }.

III-B Description of the Algorithm

1:  Input: p0superscript𝑝0p^{0}italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, γtsuperscript𝛾𝑡\gamma^{t}italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, ηtsuperscript𝜂𝑡\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT.
2:  (Initialization stage):
3:   Each user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] receives pi0superscriptsubscript𝑝𝑖0p_{i}^{0}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT and pit=pi0+η0e1+mod(t,di)superscriptsubscript𝑝𝑖𝑡superscriptsubscript𝑝𝑖0superscript𝜂0subscript𝑒1mod𝑡subscript𝑑𝑖p_{i}^{-t}=p_{i}^{0}+\eta^{0}e_{1+\mathrm{mod}(t,d_{i})}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_t end_POSTSUPERSCRIPT = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT + italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT italic_e start_POSTSUBSCRIPT 1 + roman_mod ( italic_t , italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT, t[di]for-all𝑡delimited-[]subscript𝑑𝑖\forall t\in[d_{i}]∀ italic_t ∈ [ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] and solves
xit=gi(pit),t=di,di+1,,0.formulae-sequencesuperscriptsubscript𝑥𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡𝑡subscript𝑑𝑖subscript𝑑𝑖10x_{i}^{t}=g_{i}(p_{i}^{t}),~{}t=-d_{i},-d_{i}+1,\dots,0.italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) , italic_t = - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 , … , 0 . (7)
4:   For all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], estimate the Jacobian of gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as:
^gi0=[xidixi0η0,,xi1xi0η0]^superscriptsubscript𝑔𝑖0superscriptsubscript𝑥𝑖subscript𝑑𝑖superscriptsubscript𝑥𝑖0superscript𝜂0superscriptsubscript𝑥𝑖1superscriptsubscript𝑥𝑖0superscript𝜂0\hat{\nabla}g_{i}^{0}=\left[\frac{x_{i}^{-d_{i}}-x_{i}^{0}}{\eta^{0}},\dots,~{% }\frac{x_{i}^{-1}-x_{i}^{0}}{\eta^{0}}\right]over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = [ divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_ARG start_ARG italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_ARG , … , divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_ARG start_ARG italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_ARG ] (8)
5:  for t=0,1,𝑡01t=0,1,\dotsitalic_t = 0 , 1 , … do
6:     (Update stage)
7:      Compute x^t+1=Π𝒳Δt(xt+γtpt)superscript^𝑥𝑡1subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥𝑡superscript𝛾𝑡superscript𝑝𝑡\hat{x}^{t+1}=\Pi_{{\cal X}_{\Delta^{t}}}(x^{t}+\gamma^{t}p^{t})over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ).
8:     Set pit+1=pit+[^git]1(x^it+1xit)superscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡superscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡1superscriptsubscript^𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡{p}_{i}^{t+1}=p_{i}^{t}+[\hat{\nabla}g_{i}^{t}]^{-1}(\hat{x}_{i}^{t+1}-x_{i}^{% t})italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + [ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ), for all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ].
9:     Each user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] receives pit+1superscriptsubscript𝑝𝑖𝑡1p_{i}^{t+1}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT and solves
xit+1=gi(pit+1)superscriptsubscript𝑥𝑖𝑡1subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡1x_{i}^{t+1}=g_{i}(p_{i}^{t+1})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) (9)
10:     (Sampling stage)
11:     Each user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] receives pit+1,s=pit+1+ηt+1e1+mod(t,di)superscriptsubscript𝑝𝑖𝑡1𝑠superscriptsubscript𝑝𝑖𝑡1superscript𝜂𝑡1subscript𝑒1mod𝑡subscript𝑑𝑖p_{i}^{t+1,s}=p_{i}^{t+1}+\eta^{t+1}e_{1+\mathrm{mod}(t,d_{i})}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT + italic_η start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT italic_e start_POSTSUBSCRIPT 1 + roman_mod ( italic_t , italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT and solves
xit+1,s=gi(pit+1,s)superscriptsubscript𝑥𝑖𝑡1𝑠subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡1𝑠x_{i}^{t+1,s}=g_{i}(p_{i}^{t+1,s})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ) (10)
12:     For each user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ]
[^git]:,1+mod(t,di)(xit+1,sxit+1)/ηt+1subscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡:1mod𝑡subscript𝑑𝑖superscriptsubscript𝑥𝑖𝑡1𝑠superscriptsubscript𝑥𝑖𝑡1superscript𝜂𝑡1\displaystyle[\hat{\nabla}g_{i}^{t}]_{:,1+\mathrm{mod}(t,d_{i})}\leftarrow(x_{% i}^{t+1,s}-x_{i}^{t+1})/{\eta^{t+1}}[ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT : , 1 + roman_mod ( italic_t , italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ← ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) / italic_η start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT (11)
^git+1=^git^superscriptsubscript𝑔𝑖𝑡1^superscriptsubscript𝑔𝑖𝑡\displaystyle\hat{\nabla}g_{i}^{t+1}=\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT (12)
13:  end for
Algorithm 1 Safe Pricing for NUM

The proposed method, called safe pricing for NUM (SPNUM) and outlined in Algorithm 1, consists of two stages at each iteration: 1) update stage (Step 6) and 2) sampling stage (Step 10). The update stage proceeds similarly to a projected gradient method on the primal iterates while designing prices that induce realized iterates close to a desired iterate. The sampling stage estimates the Jacobians of the price response functions of the users, which are used during the update stage.

In the update stage, the algorithm first determines a desired next iterate x^t+1superscript^𝑥𝑡1\hat{x}^{t+1}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT in Step 7. However, because the primal variables are not directly controllable, prices that induce xt+1superscript𝑥𝑡1x^{t+1}italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT that is close to x^t+1superscript^𝑥𝑡1\hat{x}^{t+1}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT have to be determined at Step 8. Accordingly, at the heart of the update stage lie two key steps:

  1. 1.

    At iteration t𝑡titalic_t, the central coordinator observes xtsuperscript𝑥𝑡x^{t}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and determines the next desired iterate x^t+1superscript^𝑥𝑡1\hat{x}^{t+1}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT by means of a projected gradient ascent step in Step 7. This is because if xt𝒳intsuperscript𝑥𝑡superscript𝒳intx^{t}\in{\cal X}^{\textnormal{int}}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, then xi𝒳iintsubscript𝑥𝑖superscriptsubscript𝒳𝑖intx_{i}\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, which implies that pit=fi(xt)superscriptsubscript𝑝𝑖𝑡subscript𝑓𝑖superscript𝑥𝑡p_{i}^{t}=\nabla f_{i}(x^{t})italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = ∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) by Assumption 2 and the first order optimality condition for (5). Therefore, pt=f(xt)superscript𝑝𝑡𝑓superscript𝑥𝑡p^{t}=\nabla f(x^{t})italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = ∇ italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ). In addition, projection is performed onto a shrunk set 𝒳Δtsubscript𝒳superscriptΔ𝑡{\cal X}_{\Delta^{t}}caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, where ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT controls the amount of shrinkage at time t𝑡titalic_t. This is the key ingredient to ensure the safety of the algorithm because the uncertainty in the price response functions will cause the actual induced iterate xt+1superscript𝑥𝑡1x^{t+1}italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT in response to the price vector pt+1superscript𝑝𝑡1p^{t+1}italic_p start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT to deviate from the desired iterate x^t+1superscript^𝑥𝑡1\hat{x}^{t+1}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT. By adding this safety margin to the constraint, we can ensure safety if xt+1x^t+1¯(Δt)normsuperscript𝑥𝑡1superscript^𝑥𝑡1¯superscriptΔ𝑡\|x^{t+1}-\hat{x}^{t+1}\|\in\bar{\cal B}(\Delta^{t})∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ ∈ over¯ start_ARG caligraphic_B end_ARG ( roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ). Finally, by utilizing a diminishing safety margin sequence {Δt}t0subscriptsuperscriptΔ𝑡𝑡0\{\Delta^{t}\}_{t\geq 0}{ roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT, we can ensure convergence to the optimal solution of (4).

  2. 2.

    Once the desired next iterate x^t+1superscript^𝑥𝑡1\hat{x}^{t+1}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT is determined, the central coordinator has to determine pit+1superscriptsubscript𝑝𝑖𝑡1p_{i}^{t+1}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT that would ideally induce x^it+1superscriptsubscript^𝑥𝑖𝑡1\hat{x}_{i}^{t+1}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT, i[n]for-all𝑖delimited-[]𝑛\forall i\in[n]∀ italic_i ∈ [ italic_n ]. However, the price response function is unknown to the central coordinator, and therefore an exact solution is not possible. Instead, the central coordinator makes a linear approximation of the price response function using the Jacobian estimate of gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i[n]for-all𝑖delimited-[]𝑛\forall i\in[n]∀ italic_i ∈ [ italic_n ]. In particular, the central coordinator keeps an estimate of the Jacobian denoted by ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT initialized in Steps 3 and 4 of the algorithm, which is constructed by varying the price vector along each dimension and estimating the gradient using the difference equation. This results in the following linear approximation of the price response function around pitsuperscriptsubscript𝑝𝑖𝑡p_{i}^{t}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT:

    g^i(p)=xit+^git(ppit).subscript^𝑔𝑖𝑝superscriptsubscript𝑥𝑖𝑡^superscriptsubscript𝑔𝑖𝑡𝑝superscriptsubscript𝑝𝑖𝑡\hat{g}_{i}(p)=x_{i}^{t}+\hat{\nabla}g_{i}^{t}(p-p_{i}^{t}).over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p ) = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ( italic_p - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) . (13)

    By setting p=pit+1𝑝superscriptsubscript𝑝𝑖𝑡1p={p}_{i}^{t+1}italic_p = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT, g^i(pit+1)=x^t+1subscript^𝑔𝑖superscriptsubscript𝑝𝑖𝑡1superscript^𝑥𝑡1\hat{g}_{i}({p}_{i}^{t+1})=\hat{x}^{t+1}over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) = over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT, and rearranging, we get the price update rule in Step 8. This requires that the ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT is an invertible matrix, which will be proven in Section IV.

After determining pt+1superscript𝑝𝑡1p^{t+1}italic_p start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT and xt+1superscript𝑥𝑡1x^{t+1}italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT, the algorithm proceeds to the sampling stage to update the Jacobian estimates. To achieve this, the central coordinator varies the price vector pit+1superscriptsubscript𝑝𝑖𝑡1p_{i}^{t+1}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT along the dimension 1+mod(t,di)1mod𝑡subscript𝑑𝑖1+\mathrm{mod}(t,d_{i})1 + roman_mod ( italic_t , italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) in Step 11 for user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], resulting in a sampling price of pit+1,ssuperscriptsubscript𝑝𝑖𝑡1𝑠{p_{i}^{t+1,s}}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT. The response is observed and denoted as xit+1,ssuperscriptsubscript𝑥𝑖𝑡1𝑠x_{i}^{t+1,s}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT. The difference between xit+1,ssuperscriptsubscript𝑥𝑖𝑡1𝑠x_{i}^{t+1,s}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT and xit+1superscriptsubscript𝑥𝑖𝑡1x_{i}^{t+1}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT divided by the amount of price variation serves as an estimate of the gradient of the price response function along the 1+mod(t,di)1mod𝑡subscript𝑑𝑖1+\mathrm{mod}(t,d_{i})1 + roman_mod ( italic_t , italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )’th principal axis, which becomes the 1+mod(t,di)1mod𝑡subscript𝑑𝑖1+\mathrm{mod}(t,d_{i})1 + roman_mod ( italic_t , italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )’th column of the Jacobian estimate ^git+1^superscriptsubscript𝑔𝑖𝑡1\hat{\nabla}g_{i}^{t+1}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT in Step 12. It is worthwhile to highlight that for a user i𝑖iitalic_i, the error between x^it+1superscriptsubscript^𝑥𝑖𝑡1\hat{x}_{i}^{t+1}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT and xit+1superscriptsubscript𝑥𝑖𝑡1x_{i}^{t+1}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT has two sources: 1) the difference between the estimated Jacobian and the actual Jacobian, i.e., ^gitgi(pit)^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡\hat{\nabla}g_{i}^{t}-\nabla g_{i}(p_{i}^{t})over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ), and 2) the high order terms not captured by the linear approximation, i.e., R1=gi(pt)gi(pt)(ppt)subscript𝑅1subscript𝑔𝑖superscript𝑝𝑡subscript𝑔𝑖superscript𝑝𝑡𝑝superscript𝑝𝑡R_{1}=g_{i}(p^{t})-\nabla g_{i}(p^{t})(p-p^{t})italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ( italic_p - italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ).

It is necessary that there exists an initial price vector p0superscript𝑝0p^{0}italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT such that the demand vectors in response to the initial sampling prices in (7) are in 𝒳intsuperscript𝒳int{\cal X}^{\textnormal{int}}caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT so that the algorithm can proceed as described above. Since this has to hold before getting any feedback from the users, we make the following assumption:

Assumption 3.

There exists a known price vector p0superscript𝑝0p^{0}italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT such that g(p0)𝒳int𝑔superscript𝑝0superscript𝒳intg(p^{0})\in{\cal X}^{\textnormal{int}}italic_g ( italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT and for all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], xidi𝒳iintsuperscriptsubscript𝑥𝑖subscript𝑑𝑖superscriptsubscript𝒳𝑖intx_{i}^{-d_{i}}\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT.

The above assumption guarantees that the initial demand vectors in (7) are in 𝒳iint,i[n]subscriptsuperscript𝒳int𝑖for-all𝑖delimited-[]𝑛{\cal X}^{\textnormal{int}}_{i},~{}\forall i\in[n]caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ∀ italic_i ∈ [ italic_n ] and therefore the initial Jacobian estimation is meaningful.

Remark 2.

One way to satisfy Assumption 3 is to choose η0superscript𝜂0\eta^{0}italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT such that 𝒳nη0μsubscript𝒳𝑛superscript𝜂0𝜇{\cal X}_{\frac{\sqrt{n}\eta^{0}}{\mu}}caligraphic_X start_POSTSUBSCRIPT divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT is non-empty and p0superscript𝑝0p^{0}italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT such that g(p0)𝒳nη0μ𝑔superscript𝑝0subscript𝒳𝑛superscript𝜂0𝜇g(p^{0})\in{\cal X}_{\frac{\sqrt{n}\eta^{0}}{\mu}}italic_g ( italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT, which is proven in Appendix -D.

Remark 3.

For network resource allocation systems, the historical price response of the users can be used to choose a price point in history where the induced demand was in the feasible set. However, if there are additional assumptions that allows us to exploit the structure of the feasible set, we can utilize systematic methods. For instance, if the feasible set 𝒳𝒳{\cal X}caligraphic_X is a polytope of the form 𝒳={x:Axc}𝒳conditional-set𝑥𝐴𝑥𝑐{\cal X}=\{x:Ax\leq c\}caligraphic_X = { italic_x : italic_A italic_x ≤ italic_c }, where Aij0subscript𝐴𝑖𝑗0A_{ij}\geq 0italic_A start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≥ 0, then one way to find safe prices is to set the prices too high and gradually reduce them since low demand promotes safety. Indeed, this is a method we use in our preliminary work below (where A𝐴Aitalic_A was a binary matrix) to determine initial prices [17].

In the next section, we characterize a principled way to choose parameters ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, γtsuperscript𝛾𝑡\gamma^{t}italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, and ηtsuperscript𝜂𝑡\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT in order to produce feasible primal iterates. Additionally, we prove that the regret incurred by the iterates produced by Algorithm 1 is 𝒪(log(T))𝒪𝑇{\cal O}(\log(T))caligraphic_O ( roman_log ( italic_T ) ) after T𝑇Titalic_T iterations, and the last iterate converges to the optimal solution at the rate 𝒪(log(T)/T)𝒪𝑇𝑇{\cal O}(\log(T)/T)caligraphic_O ( roman_log ( italic_T ) / italic_T ).

IV Feasibility and Regret Analysis

In order to prove the safety and the regret guarantees of our algorithm, we will need to bound the distance between a point in x𝒳𝑥𝒳x\in{\cal X}italic_x ∈ caligraphic_X and its projection onto the shrunk set Π𝒳Δ(x)subscriptΠsubscript𝒳Δ𝑥\Pi_{{\cal X}_{\Delta}}(x)roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x ). The following definition from [42] formalizes this notion called the sharpness of a set, which is defined as the maximum distance from any point in a set to the projection of it onto the shrunk version of that set.

Definition 6.

For a convex and compact set 𝒳d𝒳superscript𝑑\mathcal{X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we define the sharpness of 𝒳𝒳\mathcal{X}caligraphic_X as

Sharp𝒳(Δ):=supx𝒳Π𝒳Δ(x)x,assignsubscriptSharp𝒳Δsubscriptsupremum𝑥𝒳normsubscriptΠsubscript𝒳Δ𝑥𝑥\mathrm{Sharp}_{\mathcal{X}}(\Delta)\vcentcolon=\sup_{x\in{\cal X}}\|\Pi_{{% \cal X}_{\Delta}}(x)-x\|,roman_Sharp start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( roman_Δ ) := roman_sup start_POSTSUBSCRIPT italic_x ∈ caligraphic_X end_POSTSUBSCRIPT ∥ roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x ) - italic_x ∥ , (14)

for all non-negative ΔΔ\Deltaroman_Δ such that 𝒳Δsubscript𝒳Δ\mathcal{X}_{\Delta}caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT is nonempty.

The following proposition establishes a bound on the sharpness of convex and compact sets as a linear function of ΔΔ{\Delta}roman_Δ:

Proposition 1.

[42, Corollary 11] For a convex, compact set 𝒳d𝒳superscript𝑑\mathcal{X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT with non-empty interior, we have that Sharp𝒳(Δ)Γ𝒳ΔsubscriptSharp𝒳ΔsubscriptΓ𝒳Δ\mathrm{Sharp}_{\mathcal{X}}(\Delta)\leq\Gamma_{\mathcal{X}}\Deltaroman_Sharp start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( roman_Δ ) ≤ roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT roman_Δ where Γ𝒳1subscriptΓ𝒳1\Gamma_{\mathcal{X}}\geq 1roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ≥ 1 is a constant that depends only on the geometry and the dimension of 𝒳𝒳\mathcal{X}caligraphic_X.

Example 3 (Sharpness of a polytope [42]).

Let 𝒳={xd:Axc}𝒳conditional-set𝑥superscript𝑑𝐴𝑥𝑐\mathcal{X}=\{x\in\mathbb{R}^{d}:Ax\leq c\}caligraphic_X = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT : italic_A italic_x ≤ italic_c } be a polytope with a nonempty interior. Define Asubscript𝐴\mathcal{I}_{A}caligraphic_I start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT to refer to the collection of all sets of d𝑑ditalic_d indices such that for each {i1,i2,,id}Asubscript𝑖1subscript𝑖2subscript𝑖𝑑subscript𝐴\{i_{1},i_{2},...,i_{d}\}\in\mathcal{I}_{A}{ italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_i start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_i start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT } ∈ caligraphic_I start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT the vectors Ai1,Ai2,,Aidsubscript𝐴subscript𝑖1subscript𝐴subscript𝑖2subscript𝐴subscript𝑖𝑑A_{i_{1}},A_{i_{2}},...,A_{i_{d}}italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , … , italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT are linearly independent. For each Asubscript𝐴\ell\in\mathcal{I}_{A}roman_ℓ ∈ caligraphic_I start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT where ={i1,i2,,id}subscript𝑖1subscript𝑖2subscript𝑖𝑑\ell=\{i_{1},i_{2},...,i_{d}\}roman_ℓ = { italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_i start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_i start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT }, we define A=[Ai1Ai2Aid]superscript𝐴superscriptdelimited-[]superscriptsubscript𝐴subscript𝑖1topsuperscriptsubscript𝐴subscript𝑖2topsuperscriptsubscript𝐴subscript𝑖𝑑toptopA^{\ell}=[A_{i_{1}}^{\top}\ A_{i_{2}}^{\top}\ ...\ A_{i_{d}}^{\top}]^{\top}italic_A start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT = [ italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT … italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. We have that Sharp𝒳(Δ)dK𝒳ΔsubscriptSharp𝒳Δ𝑑subscript𝐾𝒳Δ\mathrm{Sharp}_{\mathcal{X}}(\Delta)\leq\sqrt{d}K_{\mathcal{X}}\Deltaroman_Sharp start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( roman_Δ ) ≤ square-root start_ARG italic_d end_ARG italic_K start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT roman_Δ, where K𝒳:=maxAκ(A)assignsubscript𝐾𝒳subscriptsubscript𝐴𝜅superscript𝐴K_{\mathcal{X}}:=\max_{\ell\in\mathcal{I}_{A}}\kappa(A^{\ell})italic_K start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT := roman_max start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_I start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_κ ( italic_A start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT ).

Example 4 (Sharpness of a ball in dsuperscript𝑑{\mathbb{R}}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT).

Let 𝒳={xd:(xx0)(xx0)r2}𝒳conditional-set𝑥superscript𝑑superscript𝑥subscript𝑥0top𝑥subscript𝑥0superscript𝑟2{\cal X}=\{x\in\mathbb{R}^{d}:(x-x_{0})^{\top}(x-x_{0})\leq r^{2}\}caligraphic_X = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT : ( italic_x - italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( italic_x - italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≤ italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT } be a ball in dsuperscript𝑑{\mathbb{R}}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT with radius r𝑟ritalic_r centered at x0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. We have that Sharp𝒳(Δ)=ΔsubscriptSharp𝒳ΔΔ\mathrm{Sharp}_{\mathcal{X}}(\Delta)=\Deltaroman_Sharp start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( roman_Δ ) = roman_Δ.

Although we do not specify a closed-form expression of Γ𝒳subscriptΓ𝒳\Gamma_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT for a general convex and compact set 𝒳𝒳\cal Xcaligraphic_X, it relates to the sharpness of polytopes that are contained in 𝒳𝒳{\cal X}caligraphic_X, which have closed-form bounds as given by Example 3. We refer the reader to [42] (Proposition 10) for a detailed discussion.

The next lemma characterizes the regularity properties of gi(pi)subscript𝑔𝑖subscript𝑝𝑖g_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) over the set of prices that induce a resource demand in 𝒳iintsuperscriptsubscript𝒳𝑖int{\cal X}_{i}^{\textnormal{int}}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT for a user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ]. This property is crucial for our analysis and for the feasibility of the algorithm, as we need to show that the inverse of the matrix ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT for the price update rule in Step 8 is a valid operation.

Lemma 1.

Let 𝒫i={pidi:gi(pi)𝒳iint}subscript𝒫𝑖conditional-setsubscript𝑝𝑖superscriptsubscript𝑑𝑖subscript𝑔𝑖subscript𝑝𝑖superscriptsubscript𝒳𝑖int{\cal P}_{i}=\{p_{i}\in{\mathbb{R}}^{d_{i}}:g_{i}(p_{i})\in{\cal X}_{i}^{% \textnormal{int}}\}caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT : italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT } be the set of prices that induce a resource demand in 𝒳iintsuperscriptsubscript𝒳𝑖int{\cal X}_{i}^{\textnormal{int}}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT for a user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ]. Over 𝒫isubscript𝒫𝑖{\cal P}_{i}caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, gi(pi)subscript𝑔𝑖subscript𝑝𝑖g_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is bijective, 1/μ1𝜇1/\mu1 / italic_μ-Lipschitz continuous, and β/μ3𝛽superscript𝜇3\beta/\mu^{3}italic_β / italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT-smooth. Accordingly, gi(pi)subscript𝑔𝑖subscript𝑝𝑖g_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is invertible and gi(pi)=[2fi(gi(pi))]1subscript𝑔𝑖subscript𝑝𝑖superscriptdelimited-[]superscript2subscript𝑓𝑖subscript𝑔𝑖subscript𝑝𝑖1\nabla g_{i}(p_{i})=[\nabla^{2}f_{i}(g_{i}(p_{i}))]^{-1}∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.

The proof of Lemma 1 can be found in Appendix -E. Lemma 1 establishes that the true Jacobian of the price response function for user i𝑖iitalic_i is invertible because it corresponds to the inverse of the Hessian of the strongly concave utility function of user i𝑖iitalic_i. However, this does not imply that the estimated Jacobian ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT is invertible since it is constructed by finite difference gradient approximation. The next lemma states that the estimated Jacobian ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT is close enough to gi(pt)subscript𝑔𝑖superscript𝑝𝑡\nabla g_{i}(p^{t})∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ), which allows us to bound the minimum singular value of it and therefore guarantees invertibility with the appropriate choice of algorithm parameters.

Lemma 2.

Let γt=1/(μ(t+τ))superscript𝛾𝑡1𝜇𝑡𝜏\gamma^{t}=1/(\mu(t+\tau))italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = 1 / ( italic_μ ( italic_t + italic_τ ) ), Δt=Δ/(t+τ)2superscriptΔ𝑡Δsuperscript𝑡𝜏2\Delta^{t}=\Delta/(t+\tau)^{2}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = roman_Δ / ( italic_t + italic_τ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, and ηt=μΔt1/(4n)superscript𝜂𝑡𝜇superscriptΔ𝑡14𝑛\eta^{t}=\mu\Delta^{t-1}/(4\sqrt{n})italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = italic_μ roman_Δ start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT / ( 4 square-root start_ARG italic_n end_ARG ) for some Δ>0Δ0\Delta>0roman_Δ > 0 and

τ=max{2,2d¯1,1+2μΔΓ𝒳/(Mn),Δ/H𝒳,\displaystyle\tau=\max\Big{\{}2,2\bar{d}-1,1+{2\mu\Delta\Gamma_{\cal X}}/({M% \sqrt{n}}),\sqrt{{\Delta}/{{H}_{\cal X}}},italic_τ = roman_max { 2 , 2 over¯ start_ARG italic_d end_ARG - 1 , 1 + 2 italic_μ roman_Δ roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT / ( italic_M square-root start_ARG italic_n end_ARG ) , square-root start_ARG roman_Δ / italic_H start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT end_ARG ,
LβMd¯(μ+32LΓ𝒳n(d¯1))/(2μ4Γ𝒳)}.\displaystyle\hskip 28.45274pt{L\beta M\sqrt{\bar{d}}\left(\mu+32L\Gamma_{\cal X% }\sqrt{n}(\bar{d}-1)\right)}/({2\mu^{4}\Gamma_{\cal X}})\Big{\}}.italic_L italic_β italic_M square-root start_ARG over¯ start_ARG italic_d end_ARG end_ARG ( italic_μ + 32 italic_L roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT square-root start_ARG italic_n end_ARG ( over¯ start_ARG italic_d end_ARG - 1 ) ) / ( 2 italic_μ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) } . (15)

Suppose that at iteration t𝑡titalic_t, xk𝒳ηknμintsuperscript𝑥𝑘superscriptsubscript𝒳superscript𝜂𝑘𝑛𝜇intx^{k}\in{{\cal X}}_{\frac{\eta^{k}\sqrt{n}}{\mu}}^{\textnormal{int}}italic_x start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT divide start_ARG italic_η start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, k[max{td¯+1,0},t]for-all𝑘𝑡¯𝑑10𝑡\forall k\in[\max\{t-\bar{d}+1,0\},t]∀ italic_k ∈ [ roman_max { italic_t - over¯ start_ARG italic_d end_ARG + 1 , 0 } , italic_t ]. Then, the following holds for all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ]:

^gitgi(pit)eit,norm^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡superscriptsubscript𝑒𝑖𝑡\displaystyle\|\hat{\nabla}g_{i}^{t}-\nabla g_{i}(p_{i}^{t})\|\leq e_{i}^{t},∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ ≤ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , (16)

where

eit=2βdiμ3(ηt+2L(di1)(Mnγt+2ΔtΓ𝒳))12L.superscriptsubscript𝑒𝑖𝑡2𝛽subscript𝑑𝑖superscript𝜇3superscript𝜂𝑡2𝐿subscript𝑑𝑖1𝑀𝑛superscript𝛾𝑡2superscriptΔ𝑡subscriptΓ𝒳12𝐿e_{i}^{t}{=}\frac{2\beta\sqrt{d_{i}}}{\mu^{3}}\left(\eta^{t}{+}2L(d_{i}{-}1)(M% \sqrt{n}\gamma^{t}{+}2\Delta^{t}\Gamma_{\cal X})\right)\leq\frac{1}{2L}.italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = divide start_ARG 2 italic_β square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ( italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + 2 italic_L ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1 ) ( italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + 2 roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) ) ≤ divide start_ARG 1 end_ARG start_ARG 2 italic_L end_ARG . (17)

Accordingly, σmin(^git)12Lsubscript𝜎^superscriptsubscript𝑔𝑖𝑡12𝐿\sigma_{\min}(\hat{\nabla}g_{i}^{t})\geq\frac{1}{2L}italic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ≥ divide start_ARG 1 end_ARG start_ARG 2 italic_L end_ARG and therefore ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT is invertible.

The proof of Lemma 2 can be found in Appendix -A. Lemma 2 characterizes a principled way to choose the algorithm parameters with respect to a free parameter ΔΔ\Deltaroman_Δ in order to bound the difference between ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and gi(pt)subscript𝑔𝑖superscript𝑝𝑡\nabla g_{i}(p^{t})∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ). In the following subsections, we will first characterize the choice of ΔΔ\Deltaroman_Δ that guarantees primal feasibility at all iterations and then prove the regret and convergence guarantees of Algorithm 1 under this choice of parameters.

IV-A Feasibility Analysis

The following proposition characterizes the choice of the parameters ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, γtsuperscript𝛾𝑡\gamma^{t}italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, and ηtsuperscript𝜂𝑡\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT to ensure feasible primal iterates:

Proposition 2.

Let γt=1/(μ(t+τ))superscript𝛾𝑡1𝜇𝑡𝜏\gamma^{t}=1/(\mu(t+\tau))italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = 1 / ( italic_μ ( italic_t + italic_τ ) ) and Δt=Δ/(t+τ)2superscriptΔ𝑡Δsuperscript𝑡𝜏2\Delta^{t}=\Delta/(t+\tau)^{2}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = roman_Δ / ( italic_t + italic_τ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, ηt=μΔt1/(4n)superscript𝜂𝑡𝜇superscriptΔ𝑡14𝑛\eta^{t}=\mu\Delta^{t-1}/(4\sqrt{n})italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = italic_μ roman_Δ start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT / ( 4 square-root start_ARG italic_n end_ARG ), where τ𝜏\tauitalic_τ is given by (2) and

ΔΔ\displaystyle\Deltaroman_Δ =βLMn3/2(6L+d(μ/n+32L(d¯1)))/μ5.absent𝛽𝐿𝑀superscript𝑛326𝐿𝑑𝜇𝑛32𝐿¯𝑑1superscript𝜇5\displaystyle={\beta LMn^{3/2}(6L+\sqrt{d}(\mu/\sqrt{n}+32L(\bar{d}-1)))}/{\mu% ^{5}}.= italic_β italic_L italic_M italic_n start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT ( 6 italic_L + square-root start_ARG italic_d end_ARG ( italic_μ / square-root start_ARG italic_n end_ARG + 32 italic_L ( over¯ start_ARG italic_d end_ARG - 1 ) ) ) / italic_μ start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT . (18)

Then for all t0𝑡0t\geq 0italic_t ≥ 0, x^t+1xt+1<3Δt/4normsuperscript^𝑥𝑡1superscript𝑥𝑡13superscriptΔ𝑡4\|\hat{x}^{t+1}-x^{t+1}\|<3\Delta^{t}/4∥ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ < 3 roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / 4 and xt+1xt+1,sΔt/4normsuperscript𝑥𝑡1superscript𝑥𝑡1𝑠superscriptΔ𝑡4\|x^{t+1}-x^{t+1,s}\|\leq\Delta^{t}/4∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ∥ ≤ roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / 4. Accordingly, for all t0𝑡0t\geq 0italic_t ≥ 0, the iterates xtsuperscript𝑥𝑡x^{t}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and xt,ssuperscript𝑥𝑡𝑠x^{t,s}italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT produced by Algorithm 1 are feasible and in the strict interior of the feasible set, i.e., xt𝒳ηtnμintsuperscript𝑥𝑡superscriptsubscript𝒳superscript𝜂𝑡𝑛𝜇intx^{t}\in{\cal X}_{\frac{\eta^{t}\sqrt{n}}{\mu}}^{\textnormal{int}}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT divide start_ARG italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT and xt,s𝒳int,t1formulae-sequencesuperscript𝑥𝑡𝑠superscript𝒳intfor-all𝑡1x^{t,s}\in{\cal X}^{\textnormal{int}},~{}\forall t\geq 1italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT , ∀ italic_t ≥ 1.

Refer to caption
(a)
Refer to caption
(b)
Figure 1: Results for the benchmarking study. In all plots, SPNUM is shown in blue, SDGM in green, and DG in red. The shaded areas correspond to one standard deviation. In (a), we plot the convergence of the primal variables measured by xtx2superscriptnormsuperscript𝑥𝑡superscript𝑥2\|x^{t}-x^{\star}\|^{2}∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (left) and the infeasibility amount measured by [Axtc]+/cnormsubscriptdelimited-[]𝐴superscript𝑥𝑡𝑐norm𝑐\|[Ax^{t}-c]_{+}\|/\|c\|∥ [ italic_A italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_c ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ∥ / ∥ italic_c ∥ (right) for all three algorithms when A{0,1}m×n𝐴superscript01𝑚𝑛A\in\{0,1\}^{m\times n}italic_A ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT is a binary matrix. In (b), we plot the convergence of the primal variables measured by xtx2superscriptnormsuperscript𝑥𝑡superscript𝑥2\|x^{t}-x^{\star}\|^{2}∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (left) and the infeasibility amount measured by [Axtc]+/cnormsubscriptdelimited-[]𝐴superscript𝑥𝑡𝑐norm𝑐\|[Ax^{t}-c]_{+}\|/\|c\|∥ [ italic_A italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_c ] start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ∥ / ∥ italic_c ∥ (right) for all three algorithms when Am×n𝐴superscript𝑚𝑛A\in{\mathbb{R}}^{m\times n}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT is a real matrix.

The proof of Proposition 2 can be found in Appendix -B. Given that under Proposition 2, xtsuperscript𝑥𝑡x^{t}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT for all t1𝑡1t\geq 1italic_t ≥ 1 are feasible and therefore implementable, the static regret (6) is a valid choice of performance metric. Next, we prove that the regret of Algorithm 1 is 𝒪(log(T))𝒪𝑇{\cal O}(\log(T))caligraphic_O ( roman_log ( italic_T ) ) and the primal variables converge to the optimal solution at the rate 𝒪(log(T)/T)𝒪𝑇𝑇{\cal O}(\log(T)/T)caligraphic_O ( roman_log ( italic_T ) / italic_T ).

IV-B Regret and Convergence Analysis

As our algorithm alternates between executing one update and one sampling stage, after T𝑇Titalic_T iterations it will have executed T/2𝑇2T/2italic_T / 2 update stages and T/2𝑇2T/2italic_T / 2 sampling stages. In this case, the regret per user is fairly calculated as:

R(T)=1nt=1T/2(f(x)f(xt)+f(x)f(xt,s)).𝑅𝑇1𝑛superscriptsubscript𝑡1𝑇2𝑓superscript𝑥𝑓superscript𝑥𝑡𝑓superscript𝑥𝑓superscript𝑥𝑡𝑠R(T)=\frac{1}{n}\sum_{t=1}^{T/2}(f(x^{\star})-f(x^{t})+f(x^{\star})-f(x^{t,s})).italic_R ( italic_T ) = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT ( italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT ) ) . (19)

The following theorem establishes an upper bound on the regret incurred by the primal iterates produced by Algorithm 1, and the squared distance between last iterate xT/2superscript𝑥𝑇2x^{T/2}italic_x start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT and the optimum solution xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT:

Theorem 1.

Let p0superscript𝑝0p^{0}italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, γtsuperscript𝛾𝑡\gamma^{t}italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT, and ηtsuperscript𝜂𝑡\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT be chosen as in Proposition 2. Then for all t0𝑡0t\geq 0italic_t ≥ 0, the iterates produced by Algorithm 1 are feasible. Furthermore, the regret R(T)𝑅𝑇R(T)italic_R ( italic_T ) for T2𝑇2T\geq 2italic_T ≥ 2 satisfies

R(T)𝒪(log(T)(1+ΔΓ𝒳/n)),𝑅𝑇𝒪𝑇1ΔsubscriptΓ𝒳𝑛R(T)\leq{\cal O}(\log(T)(1+\Delta\Gamma_{\cal X}/n)),italic_R ( italic_T ) ≤ caligraphic_O ( roman_log ( italic_T ) ( 1 + roman_Δ roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT / italic_n ) ) , (20)

where 𝒪()𝒪{\cal O}(\cdot)caligraphic_O ( ⋅ ) hides other constants. In addition, the last primal iterate xT/2superscript𝑥𝑇2x^{T/2}italic_x start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT satisfies

xT/2x2𝒪(log(T)/T).superscriptnormsuperscript𝑥𝑇2superscript𝑥2𝒪𝑇𝑇\|x^{T/2}-x^{\star}\|^{2}\leq{\cal O}(\log(T)/T).∥ italic_x start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ caligraphic_O ( roman_log ( italic_T ) / italic_T ) . (21)

Proof outline: Since the update stage of the algorithm proceeds similarly to a projected gradient method, the proof is similar to that of a projected gradient ascent for strongly concave functions. We have an additional error term due to xt+1x^t+1normsuperscript𝑥𝑡1superscript^𝑥𝑡1\|x^{t+1}-\hat{x}^{t+1}\|∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥, which is 𝒪(Δt)𝒪superscriptΔ𝑡{\cal O}(\Delta^{t})caligraphic_O ( roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ). The error term impacts the regret of the update stages as 𝒪(t=1T/2Δt/γt)𝒪superscriptsubscript𝑡1𝑇2superscriptΔ𝑡superscript𝛾𝑡{\cal O}(\sum_{t=1}^{T/2}\Delta^{t}/\gamma^{t})caligraphic_O ( ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ), which results in an additive 𝒪(log(T)ΔΓ𝒳/n)𝒪𝑇ΔsubscriptΓ𝒳𝑛{\cal O}(\log(T)\Delta\Gamma_{\cal X}/n)caligraphic_O ( roman_log ( italic_T ) roman_Δ roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT / italic_n ) term. For the regret of the sampling stages, we note that the prices for the sampling stages are set by varying the prices of the update stages by ηtsuperscript𝜂𝑡\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT. Therefore, we can upper bound the sum of the regret of all sampling stages by the regret of the update stages plus a constant additive term of ΔM/(4n)Δ𝑀4𝑛\Delta M/(4\sqrt{n})roman_Δ italic_M / ( 4 square-root start_ARG italic_n end_ARG ).

The complete proof of Theorem 1 and the explicit constants of (20) can be found in Appendix -C. According to Theorem 1, Algorithm 1 produces feasible solutions that achieve a sublinear regret of 𝒪(log(T))𝒪𝑇{\cal O}(\log(T))caligraphic_O ( roman_log ( italic_T ) ). Furthermore, the primal variables induced by the prices converge to the optimal solution at the rate 𝒪(log(T)/T)𝒪𝑇𝑇{\cal O}(\log(T)/T)caligraphic_O ( roman_log ( italic_T ) / italic_T ).

Remark 4.

When di=1,i[n]formulae-sequencesubscript𝑑𝑖1for-all𝑖delimited-[]𝑛d_{i}=1,~{}\forall i\in[n]italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 , ∀ italic_i ∈ [ italic_n ], Δ=𝒪(βn3/2)Δ𝒪𝛽superscript𝑛32\Delta={\cal O(}\beta n^{3/2})roman_Δ = caligraphic_O ( italic_β italic_n start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT ) and R(T)=𝒪(log(T)(1+nβΓ𝒳))𝑅𝑇𝒪𝑇1𝑛𝛽subscriptΓ𝒳R(T)={\cal O}(\log(T)(1+\sqrt{n}\beta\Gamma_{\cal X}))italic_R ( italic_T ) = caligraphic_O ( roman_log ( italic_T ) ( 1 + square-root start_ARG italic_n end_ARG italic_β roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) ).

In the next section, we numerically demonstrate our theoretical results about the primal variables induced by Algorithm 1 and compare its performance to existing pricing algorithms.

V Numerical Studies

In this section, we demonstrate the efficacy of SPNUM via three numerical studies: 1) a benchmarking study to compare SPNUM’s convergence and feasibility performance to existing pricing methods that solve the NUM problem, 2) a toy NUM problem with a non-linear feasible set to demonstrate the success of SPNUM on non-linear feasible sets, and 3) a parameter study to demonstrate how the regret depends on the second order smoothness parameter β𝛽\betaitalic_β, sharpness parameter Γ𝒳subscriptΓ𝒳{\Gamma}_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT, and the number of users n𝑛nitalic_n.

V-A Benchmarking Study

In this study, our aim is to compare the safety and convergence performance of SPNUM to the existing algorithms on feasible sets characterized by linear inequalities, i.e., 𝒳={xd:Axc}𝒳conditional-set𝑥superscript𝑑𝐴𝑥𝑐{\cal X}=\{x\in{\mathbb{R}}^{d}:Ax\leq c\}caligraphic_X = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT : italic_A italic_x ≤ italic_c }. We compare SPNUM to DG [16], which can achieve a linear convergence rate, and SDGM [17], which can provide safety when A𝐴Aitalic_A is a binary matrix.

We have implemented all algorithms on two types of A𝐴Aitalic_A matrices: 1) A𝐴Aitalic_A is a binary matrix and 2) A𝐴Aitalic_A is a real matrix. For both cases, we randomly generated a collection of 50 networks with a random number of users n𝑛nitalic_n taking (integer) values in range [5,20]520[5,20][ 5 , 20 ], and a random number of constraints m𝑚mitalic_m taking values in the interval [5,10]510[5,10][ 5 , 10 ] (generated independently). Inspired by [16], for all users i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], we let the utility function be fi(xi)=0.5(xi3)2xiθilog(1+exi)subscript𝑓𝑖subscript𝑥𝑖0.5superscriptsubscript𝑥𝑖32subscript𝑥𝑖subscript𝜃𝑖1superscript𝑒subscript𝑥𝑖f_{i}(x_{i})=-0.5(x_{i}-3)^{2}-x_{i}-\theta_{i}\log(1+e^{x_{i}})italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = - 0.5 ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 3 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ( 1 + italic_e start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ), where θisubscript𝜃𝑖\theta_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is sampled uniformly from [0,1]01[0,1][ 0 , 1 ] for each network configuration (we shifted the quadratic term by 3333 to ensure that the optimal solution is on the boundary of the feasible set). We set domfi=[0,1]domsubscript𝑓𝑖01\textnormal{dom}f_{i}=[0,1]dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ 0 , 1 ] for all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ]. For each network configuration, we first randomly generated a matrix A^^𝐴\hat{A}over^ start_ARG italic_A end_ARG by sampling m×n𝑚𝑛m\times nitalic_m × italic_n Bernoulli random variables for the binary matrix case, and by sampling m×n𝑚𝑛m\times nitalic_m × italic_n random variables from the continuous uniform distribution in [1,1]11[-1,1][ - 1 , 1 ] for the real matrix case. We then let A=[A^In]𝐴superscriptdelimited-[]superscript^𝐴topsubscript𝐼𝑛topA=[\hat{A}^{\top}~{}I_{n}]^{\top}italic_A = [ over^ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. For the binary case, we let c=𝟏m+n𝑐subscript1𝑚𝑛c=\bm{1}_{m+n}italic_c = bold_1 start_POSTSUBSCRIPT italic_m + italic_n end_POSTSUBSCRIPT, and for the real case, we let c=[0.1𝟏m𝟏n]𝑐superscriptdelimited-[]0.1superscriptsubscript1𝑚topsuperscriptsubscript1𝑛toptopc=[0.1\bm{1}_{m}^{\top}~{}\bm{1}_{n}^{\top}]^{\top}italic_c = [ 0.1 bold_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT.222For SPNUM, we additionally include the constraints x0𝑥0x\geq 0italic_x ≥ 0 in 𝒳𝒳{\cal X}caligraphic_X to satisfy Assumption 1. For the other algorithms, this is not needed.

We note that 𝒳i[0,1],i[n]formulae-sequencesubscript𝒳𝑖01for-all𝑖delimited-[]𝑛{\cal X}_{i}\subseteq[0,1],~{}\forall i\in[n]caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ [ 0 , 1 ] , ∀ italic_i ∈ [ italic_n ]. Within 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, using bounds on θisubscript𝜃𝑖\theta_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and computing the derivatives of fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we get M=2𝑀2M=2italic_M = 2, L=5/4𝐿54L=5/4italic_L = 5 / 4, μ=1𝜇1\mu=1italic_μ = 1, β=sinh(1)/(2(1+cosh(1))2)0.0909𝛽12superscript1120.0909\beta=\sinh(1)/(2(1+\cosh(1))^{2})\approx 0.0909italic_β = roman_sinh ( 1 ) / ( 2 ( 1 + roman_cosh ( 1 ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ≈ 0.0909. Finally, from Example 3 we have Γ𝒳nκ(A)subscriptΓ𝒳𝑛𝜅𝐴{\Gamma}_{\cal X}\leq\sqrt{n}\kappa(A)roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ≤ square-root start_ARG italic_n end_ARG italic_κ ( italic_A ).

For each configuration, we initialized the dual variables and prices to induce xi0=η0/μ,i[n]formulae-sequencesuperscriptsubscript𝑥𝑖0superscript𝜂0𝜇for-all𝑖delimited-[]𝑛x_{i}^{0}=\eta^{0}/\mu,~{}\forall i\in[n]italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT / italic_μ , ∀ italic_i ∈ [ italic_n ], and ran all three algorithms for a horizon of T=1000𝑇1000T=1000italic_T = 1000. We demonstrate the results for the binary matrix case and the real matrix case in Figure 1(a) and Figure 1(b), respectively. In Figure 1(a) we observe that 1) while DG converges the fastest, it is not safe, 2) SDGM and SPNUM converge slower but are safe, and 3) SDGM converges faster than SPNUM because it is designed specifically for this setting. On the other hand, in Figure 1(b) we observe that 1) SDGM does not provide safety and convergence when A𝐴Aitalic_A is a real matrix, as its assumptions do not hold anymore (note that the plot for xtx2superscriptnormsuperscript𝑥𝑡superscript𝑥2\|x^{t}-x^{\star}\|^{2}∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT flattens for SDGM), 2) SPNUM successfully provides safety and convergence.

Refer to caption
Figure 2: Results for the numerical study on the impact of sharpness on regret. The figures on each row share the same y-axis. The shaded areas correspond to one standard deviation. The title of each subfigure denotes the (β,Γ𝒳)𝛽subscriptΓ𝒳(\beta,\Gamma_{\cal X})( italic_β , roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) configuration, and the regret incurred by different values of n𝑛nitalic_n are plotted for each configuration. We observe that in the top row of figures, i.e., when β𝛽\betaitalic_β is small, both Γ𝒳subscriptΓ𝒳\Gamma_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT and n𝑛nitalic_n have little effect on the regret (e.g., increasing Γ𝒳subscriptΓ𝒳{\Gamma}_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT by 8 times only doubles the regret for all n𝑛nitalic_n). On the other hand, the bottom row of figures demonstrates that when β𝛽\betaitalic_β is larger, then Γ𝒳subscriptΓ𝒳\Gamma_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT and n𝑛nitalic_n have a significant impact.
Refer to caption
Figure 3: Results for the numerical study on SPNUM on non-linear feasible set. In the left figure, the regret divided by log(1+t)1𝑡{\log(1+t)}roman_log ( 1 + italic_t ) is plotted in green, and constraint violation is plotted in blue, where constraint violation is 00 if xt𝒳superscript𝑥𝑡𝒳x^{t}\in{\cal X}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X and 1111 otherwise. In the right figure, we plot the primal convergence measured as xtx2superscriptnormsuperscript𝑥𝑡superscript𝑥2\|x^{t}-x^{\star}\|^{2}∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Shaded areas correspond to one standard deviation.

V-B SPNUM on Non-linear Feasible Set

This study aims to demonstrate numerically the regret and safety guarantees of SPNUM on a problem with a feasible set characterized by non-linear inequalities. We select the feasible set 𝒳={xd:x1}𝒳conditional-set𝑥superscript𝑑norm𝑥1{\cal X}=\{x\in{\mathbb{R}}^{d}:\|x\|\leq 1\}caligraphic_X = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT : ∥ italic_x ∥ ≤ 1 } as the unit ball in dsuperscript𝑑{\mathbb{R}}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT centered at the origin. At the beginning of each run, we sample the number of users n𝑛nitalic_n as an integer from the range [5,20]520[5,20][ 5 , 20 ] uniformly at random. For all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], we let the utility function be fi(xi)=0.5(xiyi)2xiθilog(1+exi)subscript𝑓𝑖subscript𝑥𝑖0.5superscriptsubscript𝑥𝑖subscript𝑦𝑖2subscript𝑥𝑖subscript𝜃𝑖1superscript𝑒subscript𝑥𝑖f_{i}(x_{i})=-0.5(x_{i}-y_{i})^{2}-x_{i}-\theta_{i}\log(1+e^{x_{i}})italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = - 0.5 ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log ( 1 + italic_e start_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ), where θisubscript𝜃𝑖\theta_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is sampled uniformly from [0,1]01[0,1][ 0 , 1 ] and yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is sampled uniformly from [2,2]22[-2,2][ - 2 , 2 ] at random at the beginning of each run.

Noting that 𝒳i=[1,1]subscript𝒳𝑖11{\cal X}_{i}=[-1,1]caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ - 1 , 1 ], using bounds on θisubscript𝜃𝑖\theta_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and computing the derivatives of fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we get M=4+e/(1+e)𝑀4𝑒1𝑒M=4+e/(1+e)italic_M = 4 + italic_e / ( 1 + italic_e ), L=5/4𝐿54L=5/4italic_L = 5 / 4, μ=1𝜇1\mu=1italic_μ = 1, β=sinh(1)/(2(1+cosh(1))2)0.0909𝛽12superscript1120.0909\beta=\sinh(1)/(2(1+\cosh(1))^{2})\approx 0.0909italic_β = roman_sinh ( 1 ) / ( 2 ( 1 + roman_cosh ( 1 ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ≈ 0.0909. Finally, from Example 4 we have Γ𝒳=1subscriptΓ𝒳1{\Gamma}_{\cal X}=1roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT = 1.

We initialize the prices to induce xi0=η0/μ,i[n]formulae-sequencesuperscriptsubscript𝑥𝑖0superscript𝜂0𝜇for-all𝑖delimited-[]𝑛x_{i}^{0}=\eta^{0}/\mu,~{}\forall i\in[n]italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT / italic_μ , ∀ italic_i ∈ [ italic_n ], and ran SPNUM 100 times for a horizon of T=50𝑇50T=50italic_T = 50. The results are illustrated in Figure 3. The figure shows that 1) the regret of SPNUM grows as 𝒪(log(t))𝒪𝑡{\cal O}(\log(t))caligraphic_O ( roman_log ( italic_t ) ), 2) SPNUM guarantees feasible iterates at all iterations, and 3) the primal iterates produced by SPNUM converge to the optimal solution.

V-C Impact of Sharpness on Regret

In this study, our aim is to support our theoretical results about SPNUM with numerical examples. In particular, we study the impact of sharpness parameter Γ𝒳subscriptΓ𝒳{\Gamma}_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT and the number of users n𝑛nitalic_n on regret through β𝛽\betaitalic_β. We set di=1subscript𝑑𝑖1d_{i}=1italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1, in which case R(T)=𝒪(log(T)(1+βnΓ𝒳))𝑅𝑇𝒪𝑇1𝛽𝑛subscriptΓ𝒳R(T)={\cal O}(\log(T)(1+\beta\sqrt{n}\Gamma_{\cal X}))italic_R ( italic_T ) = caligraphic_O ( roman_log ( italic_T ) ( 1 + italic_β square-root start_ARG italic_n end_ARG roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) ) as stated in Remark 4. For each user i𝑖iitalic_i, we set fi(xi)=θi(cos(ω(x1))/ω210(x2)2xsin(ω)/ω)subscript𝑓𝑖subscript𝑥𝑖subscript𝜃𝑖𝜔𝑥1superscript𝜔210superscript𝑥22𝑥𝜔𝜔f_{i}(x_{i})=\theta_{i}(\cos(\omega(x-1))/\omega^{2}-10(x-2)^{2}-x\sin(\omega)% /\omega)italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( roman_cos ( italic_ω ( italic_x - 1 ) ) / italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 10 ( italic_x - 2 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_x roman_sin ( italic_ω ) / italic_ω ), where θisubscript𝜃𝑖\theta_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is sampled uniformly from [1,2]12[1,2][ 1 , 2 ]. This particular choice of fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT allows us to control β𝛽\betaitalic_β while keeping the other parameters constant by simply choosing ω𝜔\omegaitalic_ω. Using the bounds on θisubscript𝜃𝑖\theta_{i}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and computing the derivatives of fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we get M=40𝑀40M=40italic_M = 40, L=42𝐿42L=42italic_L = 42, μ=19𝜇19\mu=19italic_μ = 19, and β=2ω𝛽2𝜔\beta=2\omegaitalic_β = 2 italic_ω.

In order to have control over the sharpness parameter Γ𝒳subscriptΓ𝒳{\Gamma}_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT, we study linear constraints of the form 𝒳={xn:x0,Axc}𝒳conditional-set𝑥superscript𝑛formulae-sequence𝑥0𝐴𝑥𝑐{\cal X}=\{x\in\mathbb{R}^{n}:x\geq 0,Ax\leq c\}caligraphic_X = { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT : italic_x ≥ 0 , italic_A italic_x ≤ italic_c }, where Aij=(1K)/(1+K(n1))subscript𝐴𝑖𝑗1𝐾1𝐾𝑛1A_{ij}=(1-K)/(1+K(n-1))italic_A start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = ( 1 - italic_K ) / ( 1 + italic_K ( italic_n - 1 ) ) if ij𝑖𝑗i\neq jitalic_i ≠ italic_j, and Aii=1subscript𝐴𝑖𝑖1A_{ii}=1italic_A start_POSTSUBSCRIPT italic_i italic_i end_POSTSUBSCRIPT = 1. This choice of A𝐴Aitalic_A allows us to parameterize the feasible set as a function of the condition number K𝐾Kitalic_K, since κ(A)=K𝜅𝐴𝐾\kappa(A)=Kitalic_κ ( italic_A ) = italic_K and Γ𝒳=nκ(A)subscriptΓ𝒳𝑛𝜅𝐴{\Gamma}_{\cal X}=\sqrt{n}\kappa(A)roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT = square-root start_ARG italic_n end_ARG italic_κ ( italic_A ). Finally, since fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is increasing over 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, the optimal solution is given by x=𝟏nsuperscript𝑥subscript1𝑛x^{\star}=\bm{1}_{n}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT = bold_1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

For n={2,4,8,16}𝑛24816n=\{2,4,8,16\}italic_n = { 2 , 4 , 8 , 16 }, ω={0.001,0.1}𝜔0.0010.1\omega=\{0.001,0.1\}italic_ω = { 0.001 , 0.1 }, and K={4/n,8/n,16/n,32/n}𝐾4𝑛8𝑛16𝑛32𝑛K=\{4/\sqrt{n},8/\sqrt{n},16/\sqrt{n},32/\sqrt{n}\}italic_K = { 4 / square-root start_ARG italic_n end_ARG , 8 / square-root start_ARG italic_n end_ARG , 16 / square-root start_ARG italic_n end_ARG , 32 / square-root start_ARG italic_n end_ARG }, we randomly sampled 10 sets of {θi}i[n]subscriptsubscript𝜃𝑖𝑖delimited-[]𝑛\{\theta_{i}\}_{i\in[n]}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT, initialized pi0superscriptsubscript𝑝𝑖0p_{i}^{0}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT so that xi0=η0/μ,i[n]formulae-sequencesuperscriptsubscript𝑥𝑖0superscript𝜂0𝜇for-all𝑖delimited-[]𝑛x_{i}^{0}=\eta^{0}/\mu,~{}\forall i\in[n]italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT / italic_μ , ∀ italic_i ∈ [ italic_n ], and ran SPNUM for a horizon of T=500𝑇500T=500italic_T = 500. Note that this corresponds to configurations of β={0.002,0.2}𝛽0.0020.2\beta=\{0.002,0.2\}italic_β = { 0.002 , 0.2 } and Γ𝒳={4,8,16,32}subscriptΓ𝒳481632\Gamma_{\cal X}=\{4,8,16,32\}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT = { 4 , 8 , 16 , 32 }. We plot the regret for each configuration in Figure 2. The results indicate that 1) when β𝛽\betaitalic_β is small, Γ𝒳subscriptΓ𝒳\Gamma_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT and n𝑛nitalic_n have little impact on the regret, and 2) when β𝛽\betaitalic_β is large, regret grows with Γ𝒳subscriptΓ𝒳{\Gamma}_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT and n𝑛nitalic_n as the term proportional to nβΓ𝒳𝑛𝛽subscriptΓ𝒳\sqrt{n}\beta\Gamma_{\cal X}square-root start_ARG italic_n end_ARG italic_β roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT dominates.

VI Conclusion

In this work, we introduced a novel algorithm, called the safe pricing for NUM (SPNUM), for solving resource allocation problems over networks with arbitrary convex and compact feasible sets in a distributed fashion. Our algorithm iteratively designs prices for resources and allows the users the determine their own resource demand in response to prices according to their own profit maximization problem. The prices produced by SPNUM ensure that the induced demand satisfies the constraints of the system during the optimization process, which promotes safety. This is done by: 1) shrinking the constraint set and applying a projected gradient method to the primal variables to determine the updated desired demand, and 2) determining the prices that would induce the desired demand by estimating the price response function of the users using the historical data. By carefully controlling the amount of shrinkage to account for the error in the estimated price response, we ensure the safety of the algorithm. In addition, we have proven that the regret incurred by the SPNUM is 𝒪(log(T))𝒪𝑇{\cal O}(\log(T))caligraphic_O ( roman_log ( italic_T ) ), and the primal variables converge to the optimal solution at the rate of 𝒪(log(T)/T)𝒪𝑇𝑇{\cal O}(\log(T)/T)caligraphic_O ( roman_log ( italic_T ) / italic_T ).

References

  • [1] P. Samadi, A.-H. Mohsenian-Rad, R. Schober, V. W. Wong, and J. Jatskevich, “Optimal real-time pricing algorithm based on utility maximization for smart grid,” in 2010 First IEEE International Conference on Smart Grid Communications.   IEEE, 2010, pp. 415–420.
  • [2] F. P. Kelly, A. K. Maulloo, and D. K. H. Tan, “Rate control for communication networks: shadow prices, proportional fairness and stability,” Journal of the Operational Research society, vol. 49, no. 3, pp. 237–252, 1998.
  • [3] Y. Li, F. Song, J. Liu, X. Xie, and E. Tian, “Software defined event-triggering control for large-scale networked systems subject to stochastic cyber attacks,” IEEE Transactions on Control of Network Systems, 2023.
  • [4] J. Liu, E. Gong, L. Zha, X. Xie, and E. Tian, “Outlier-resistant recursive security filtering for multirate networked systems under fading measurements and round-robin protocol,” IEEE Transactions on Control of Network Systems, 2023.
  • [5] M. Chiang and J. Bell, “Balancing supply and demand of bandwidth in wireless cellular networks: utility maximization over powers and rates,” in IEEE INFOCOM 2004, vol. 4.   IEEE, 2004, pp. 2800–2811.
  • [6] N. Mehr, J. Lioris, R. Horowitz, and R. Pedarsani, “Joint perimeter and signal control of urban traffic via network utility maximization,” in 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).   IEEE, 2017, pp. 1–6.
  • [7] D. P. Palomar and M. Chiang, “A tutorial on decomposition methods for network utility maximization,” IEEE Journal on Selected Areas in Communications, vol. 24, no. 8, pp. 1439–1451, 2006.
  • [8] I. Necoara, V. Nedelcu, and I. Dumitrache, “Parallel and distributed optimization methods for estimation and control in networks,” Journal of Process Control, vol. 21, no. 5, pp. 756–766, 2011.
  • [9] D. P. Bertsekas, “Nonlinear programming,” Journal of the Operational Research Society, vol. 48, no. 3, pp. 334–334, 1997.
  • [10] D. Bertsekas and J. Tsitsiklis, Parallel and distributed computation: numerical methods.   Athena Scientific, 2015.
  • [11] N. Li, L. Chen, and S. H. Low, “Optimal demand response based on utility maximization in power networks,” in 2011 IEEE power and energy society general meeting.   IEEE, 2011, pp. 1–8.
  • [12] N. Z. Shor, Minimization methods for non-differentiable functions.   Springer Science & Business Media, 2012, vol. 3.
  • [13] A. Nedić and A. Ozdaglar, “Approximate primal solutions and rate analysis for dual subgradient methods,” SIAM Journal on Optimization, vol. 19, no. 4, pp. 1757–1780, 2009.
  • [14] A. Beck, A. Nedić, A. Ozdaglar, and M. Teboulle, “An o(1/k)𝑜1𝑘o(1/k)italic_o ( 1 / italic_k ) gradient method for network resource allocation problems,” IEEE Transactions on Control of Network Systems, vol. 1, no. 1, pp. 64–73, 2014.
  • [15] I. Necoara and V. Nedelcu, “Rate analysis of inexact dual first-order methods application to dual decomposition,” IEEE Transactions on Automatic Control, vol. 59, no. 5, pp. 1232–1243, 2013.
  • [16] ——, “On linear convergence of a distributed dual gradient algorithm for linearly constrained separable convex problems,” Automatica, vol. 55, pp. 209–216, 2015.
  • [17] B. Turan and M. Alizadeh, “Safe dual gradient method for network utility maximization problems,” in 2022 IEEE 61st Conference on Decision and Control (CDC).   IEEE, 2022, pp. 6953–6959.
  • [18] A. Simonetto and H. Jamali-Rad, “Primal recovery from consensus-based dual decomposition for distributed convex optimization,” Journal of Optimization Theory and Applications, vol. 168, pp. 172–197, 2016.
  • [19] A. Falsone, K. Margellos, S. Garatti, and M. Prandini, “Dual decomposition for multi-agent distributed optimization with coupling constraints,” Automatica, vol. 84, pp. 149–158, 2017.
  • [20] I. Notarnicola and G. Notarstefano, “Constraint-coupled distributed optimization: a relaxation and duality approach,” IEEE Transactions on Control of Network Systems, vol. 7, no. 1, pp. 483–492, 2019.
  • [21] J. S. Vardakas, N. Zorba, and C. V. Verikoukis, “A survey on demand response programs in smart grids: Pricing methods and optimization algorithms,” IEEE Communications Surveys & Tutorials, vol. 17, no. 1, pp. 152–178, 2014.
  • [22] K. Christakou, D.-C. Tomozei, J.-Y. Le Boudec, and M. Paolone, “Ac opf in radial distribution networks–part ii: An augmented lagrangian-based opf algorithm, distributable via primal decomposition,” Electric Power Systems Research, vol. 150, pp. 24–35, 2017.
  • [23] N. Tucker, A. Moradipari, and M. Alizadeh, “Constrained thompson sampling for real-time electricity pricing with grid reliability constraints,” IEEE Transactions on Smart Grid, vol. 11, no. 6, pp. 4971–4983, 2020.
  • [24] M. Lubin, Y. Dvorkin, and S. Backhaus, “A robust approach to chance constrained optimal power flow with renewable generation,” IEEE Transactions on Power Systems, vol. 31, no. 5, pp. 3840–3849, 2015.
  • [25] M. Zhu and S. Martinez, “On distributed convex optimization under inequality and equality constraints,” IEEE Transactions on Automatic Control, vol. 57, no. 1, pp. 151–164, 2011.
  • [26] J. Koshal, A. Nedić, and U. V. Shanbhag, “Multiuser optimization: Distributed algorithms and error analysis,” SIAM Journal on Optimization, vol. 21, no. 3, pp. 1046–1081, 2011.
  • [27] K. Sakurama and M. Miura, “Distributed constraint optimization on networked multi-agent systems,” Applied Mathematics and Computation, vol. 292, pp. 272–281, 2017.
  • [28] B. Turan, C. A. Uribe, H.-T. Wai, and M. Alizadeh, “Resilient primal–dual optimization algorithms for distributed resource allocation,” IEEE Transactions on Control of Network Systems, vol. 8, no. 1, pp. 282–294, 2020.
  • [29] T. Yang, X. Yi, J. Wu, Y. Yuan, D. Wu, Z. Meng, Y. Hong, H. Wang, Z. Lin, and K. H. Johansson, “A survey of distributed optimization,” Annual Reviews in Control, vol. 47, pp. 278–305, 2019.
  • [30] Y. Zheng and Q. Liu, “A review of distributed optimization: Problems, models and algorithms,” Neurocomputing, vol. 483, pp. 446–459, 2022.
  • [31] P. H. Calamai and J. J. Moré, “Projected gradient methods for linearly constrained problems,” Mathematical programming, vol. 39, no. 1, pp. 93–116, 1987.
  • [32] E. S. Levitin and B. T. Polyak, “Constrained minimization methods,” USSR Computational mathematics and mathematical physics, vol. 6, no. 5, pp. 1–50, 1966.
  • [33] S. Boyd and L. Vandenberghe, Convex optimization.   Cambridge university press, 2004.
  • [34] P. Armand, J. C. Gilbert, and S. Jan-Jégou, “A feasible bfgs interior point algorithm for solving convex minimization problems,” SIAM Journal on Optimization, vol. 11, no. 1, pp. 199–222, 2000.
  • [35] E. Wei, A. Ozdaglar, and A. Jadbabaie, “A distributed newton method for network utility maximization,” in 49th IEEE Conference on Decision and Control (CDC).   IEEE, 2010, pp. 1816–1821.
  • [36] S. Athuraliya and S. H. Low, “Optimization flow control with newton-like algorithm,” Telecommunication Systems, vol. 15, no. 3, pp. 345–358, 2000.
  • [37] I. Necoara and J. Suykens, “Interior-point lagrangian decomposition method for separable convex optimization,” Journal of Optimization Theory and Applications, vol. 143, no. 3, pp. 567–588, 2009.
  • [38] H. Guo, X. Liu, H. Wei, and L. Ying, “Online convex optimization with hard constraints: Towards the best of two worlds and beyond,” Advances in Neural Information Processing Systems, vol. 35, pp. 36 426–36 439, 2022.
  • [39] S. Mannor, J. N. Tsitsiklis, and J. Y. Yu, “Online learning with sample path constraints.” Journal of Machine Learning Research, vol. 10, no. 3, 2009.
  • [40] H. Yu, M. Neely, and X. Wei, “Online convex optimization with stochastic constraints,” Advances in Neural Information Processing Systems, vol. 30, 2017.
  • [41] J. Mo and J. Walrand, “Fair end-to-end window-based congestion control,” IEEE/ACM Transactions on networking, vol. 8, no. 5, pp. 556–567, 2000.
  • [42] S. Hutchinson, B. Turan, and M. Alizadeh, “The impact of the geometric properties of the constraint set in safe optimization with bandit feedback,” arXiv preprint arXiv:2305.00889, 2023.
  • [43] R. Schneider, Convex bodies: the Brunn–Minkowski theory.   Cambridge university press, 2014, no. 151.
  • [44] Y. Nesterov and B. T. Polyak, “Cubic regularization of newton method and its global performance,” Mathematical Programming, vol. 108, no. 1, pp. 177–205, 2006.

-A Proof of Lemma 2

Firstly we note that by the choice of τΔ/H𝒳𝜏Δsubscript𝐻𝒳\tau\geq\sqrt{\Delta/H_{\cal X}}italic_τ ≥ square-root start_ARG roman_Δ / italic_H start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT end_ARG, we can ensure that ΔtH𝒳superscriptΔ𝑡subscript𝐻𝒳\Delta^{t}\leq H_{\cal X}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ≤ italic_H start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT and that 𝒳Δtsubscript𝒳superscriptΔ𝑡{\cal X}_{\Delta^{t}}caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT is non-empty. Next, we show that eit1/(2L),t0formulae-sequencesuperscriptsubscript𝑒𝑖𝑡12𝐿for-all𝑡0e_{i}^{t}\leq 1/(2L),~{}\forall t\geq 0italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ≤ 1 / ( 2 italic_L ) , ∀ italic_t ≥ 0. Note that eitsuperscriptsubscript𝑒𝑖𝑡e_{i}^{t}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT is decreasing with t𝑡titalic_t, and therefore is maximized for t=0𝑡0t=0italic_t = 0:

eitei0=2βdi(η0+2L(di1)(Mnγ0+2Δ0Γ𝒳))/μ3superscriptsubscript𝑒𝑖𝑡superscriptsubscript𝑒𝑖02𝛽subscript𝑑𝑖superscript𝜂02𝐿subscript𝑑𝑖1𝑀𝑛superscript𝛾02superscriptΔ0subscriptΓ𝒳superscript𝜇3\displaystyle e_{i}^{t}{\leq}e_{i}^{0}{=}{2\beta\sqrt{d_{i}}}\left(\eta^{0}{+}% 2L(d_{i}{-}1)(M\sqrt{n}\gamma^{0}{+}2\Delta^{0}\Gamma_{\cal X})\right)/{\mu^{3}}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ≤ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = 2 italic_β square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ( italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT + 2 italic_L ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1 ) ( italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT + 2 roman_Δ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) ) / italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT (22)

For τ2μΔΓ𝒳/(Mn)𝜏2𝜇ΔsubscriptΓ𝒳𝑀𝑛\tau\geq{2\mu\Delta\Gamma_{\cal X}}/({M\sqrt{n}})italic_τ ≥ 2 italic_μ roman_Δ roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT / ( italic_M square-root start_ARG italic_n end_ARG ) and did¯subscript𝑑𝑖¯𝑑d_{i}\leq\bar{d}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ over¯ start_ARG italic_d end_ARG, we get:

ei0superscriptsubscript𝑒𝑖0\displaystyle e_{i}^{0}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT 2βd¯μ3τ(M8Γ𝒳+4L(d¯1)Mnμ)absent2𝛽¯𝑑superscript𝜇3𝜏𝑀8subscriptΓ𝒳4𝐿¯𝑑1𝑀𝑛𝜇\displaystyle\leq\frac{2\beta\sqrt{\bar{d}}}{\mu^{3}\tau}\left(\frac{M}{8% \Gamma_{\cal X}}+\frac{4L(\bar{d}-1)M\sqrt{n}}{\mu}\right)≤ divide start_ARG 2 italic_β square-root start_ARG over¯ start_ARG italic_d end_ARG end_ARG end_ARG start_ARG italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_τ end_ARG ( divide start_ARG italic_M end_ARG start_ARG 8 roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT end_ARG + divide start_ARG 4 italic_L ( over¯ start_ARG italic_d end_ARG - 1 ) italic_M square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_μ end_ARG ) (23)
=βMd¯(μ+32LΓ𝒳n(d¯1))/(4μ4Γ𝒳τ).absent𝛽𝑀¯𝑑𝜇32𝐿subscriptΓ𝒳𝑛¯𝑑14superscript𝜇4subscriptΓ𝒳𝜏\displaystyle={\beta M\sqrt{\bar{d}}\left(\mu+32L\Gamma_{\cal X}\sqrt{n}(\bar{% d}-1)\right)}/({4\mu^{4}\Gamma_{\cal X}\tau}).= italic_β italic_M square-root start_ARG over¯ start_ARG italic_d end_ARG end_ARG ( italic_μ + 32 italic_L roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT square-root start_ARG italic_n end_ARG ( over¯ start_ARG italic_d end_ARG - 1 ) ) / ( 4 italic_μ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT italic_τ ) . (24)

Next, using τLβMd¯(μ+32LΓ𝒳n(d¯1))/(2μ4Γ𝒳)𝜏𝐿𝛽𝑀¯𝑑𝜇32𝐿subscriptΓ𝒳𝑛¯𝑑12superscript𝜇4subscriptΓ𝒳\tau\geq{L\beta M\sqrt{\bar{d}}\left(\mu+32L\Gamma_{\cal X}\sqrt{n}(\bar{d}-1)% \right)}/({2\mu^{4}\Gamma_{\cal X}})italic_τ ≥ italic_L italic_β italic_M square-root start_ARG over¯ start_ARG italic_d end_ARG end_ARG ( italic_μ + 32 italic_L roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT square-root start_ARG italic_n end_ARG ( over¯ start_ARG italic_d end_ARG - 1 ) ) / ( 2 italic_μ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ):

eitei01/(2L).superscriptsubscript𝑒𝑖𝑡superscriptsubscript𝑒𝑖012𝐿e_{i}^{t}\leq e_{i}^{0}\leq 1/(2L).italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ≤ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ≤ 1 / ( 2 italic_L ) . (25)

We will prove the lemma by induction that if ^gikgi(pik)eiknorm^superscriptsubscript𝑔𝑖𝑘subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑘superscriptsubscript𝑒𝑖𝑘\|\hat{\nabla}g_{i}^{k}-\nabla g_{i}(p_{i}^{k})\|\leq e_{i}^{k}∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) ∥ ≤ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT holds for k[max{0,tdi+1},t1]𝑘0𝑡subscript𝑑𝑖1𝑡1k\in[\max\{0,t-d_{i}+1\},t-1]italic_k ∈ [ roman_max { 0 , italic_t - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 } , italic_t - 1 ], then it holds for k=t𝑘𝑡k=titalic_k = italic_t. Using Cauchy-Schwarz inequality:

^gitgi(pit)dimaxj[di][^git]:,j[gi(pit)]:,j.norm^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡subscript𝑑𝑖subscript𝑗delimited-[]subscript𝑑𝑖normsubscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡:𝑗subscriptdelimited-[]subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡:𝑗\displaystyle\|\hat{\nabla}g_{i}^{t}{-}\nabla g_{i}(p_{i}^{t})\|\leq\sqrt{d_{i% }}\max_{j\in[d_{i}]}\|[\hat{\nabla}g_{i}^{t}]_{:,j}{-}[\nabla g_{i}(p_{i}^{t})% ]_{:,j}\|.∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ ≤ square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG roman_max start_POSTSUBSCRIPT italic_j ∈ [ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] end_POSTSUBSCRIPT ∥ [ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT - [ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT ∥ . (26)

For a given j[di]𝑗delimited-[]subscript𝑑𝑖j\in[d_{i}]italic_j ∈ [ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ], by construction of ^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT we have

[^git]:,j=(gi(pij+ηjej)gi(pij))/ηj,subscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡:𝑗subscript𝑔𝑖superscriptsubscript𝑝𝑖subscript𝑗superscript𝜂subscript𝑗subscript𝑒𝑗subscript𝑔𝑖superscriptsubscript𝑝𝑖subscript𝑗superscript𝜂subscript𝑗[\hat{\nabla}g_{i}^{t}]_{:,j}=({g_{i}(p_{i}^{\ell_{j}}+\eta^{\ell_{j}}e_{j})-g% _{i}(p_{i}^{\ell_{j}})})/{\eta^{\ell_{j}}},[ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT = ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_η start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) - italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ) / italic_η start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , (27)

for some j[max{0,tdi+1},t]subscript𝑗0𝑡subscript𝑑𝑖1𝑡\ell_{j}\in[\max\{0,t-d_{i}+1\},t]roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ [ roman_max { 0 , italic_t - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 } , italic_t ]. Using the Taylor series expansion, we can rewrite the above as:

[^git]:,j=[gi(pij)]:,j+R1/ηj,subscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡:𝑗subscriptdelimited-[]subscript𝑔𝑖superscriptsubscript𝑝𝑖subscript𝑗:𝑗subscript𝑅1superscript𝜂subscript𝑗[\hat{\nabla}g_{i}^{t}]_{:,j}=[\nabla g_{i}(p_{i}^{\ell_{j}})]_{:,j}+R_{1}/% \eta^{\ell_{j}},[ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT = [ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT + italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_η start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , (28)

where R1β(ηj)2/(2μ3)normsubscript𝑅1𝛽superscriptsuperscript𝜂subscript𝑗22superscript𝜇3\|R_{1}\|\leq\beta(\eta^{\ell_{j}})^{2}/(2\mu^{3})∥ italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∥ ≤ italic_β ( italic_η start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 2 italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) follows from [44, Lemma 1] using β/μ3𝛽superscript𝜇3\beta/\mu^{3}italic_β / italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT-smoothness of gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Accordingly,

^gitgi(pit)norm^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡\displaystyle\|\hat{\nabla}g_{i}^{t}-\nabla g_{i}(p_{i}^{t})\|∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ dimaxj[di][gi(pij)]:,j[gi(pit)]:,jabsentsubscript𝑑𝑖subscript𝑗delimited-[]subscript𝑑𝑖normsubscriptdelimited-[]subscript𝑔𝑖superscriptsubscript𝑝𝑖subscript𝑗:𝑗subscriptdelimited-[]subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡:𝑗\displaystyle\leq\sqrt{d_{i}}\max_{j\in[d_{i}]}\|[\nabla g_{i}(p_{i}^{\ell_{j}% })]_{:,j}-[\nabla g_{i}(p_{i}^{t})]_{:,j}\|≤ square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG roman_max start_POSTSUBSCRIPT italic_j ∈ [ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] end_POSTSUBSCRIPT ∥ [ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT - [ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT ∥
+diβηj/(2μ3)subscript𝑑𝑖𝛽superscript𝜂subscript𝑗2superscript𝜇3\displaystyle~{}~{}~{}~{}+\sqrt{d_{i}}\beta\eta^{\ell_{j}}/(2\mu^{3})+ square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_β italic_η start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / ( 2 italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) (29)
maxj[max{0,tdi+1},t]βdiμ3pijpit+diβηj2μ3,absentsubscriptsubscript𝑗0𝑡subscript𝑑𝑖1𝑡𝛽subscript𝑑𝑖superscript𝜇3normsuperscriptsubscript𝑝𝑖subscript𝑗superscriptsubscript𝑝𝑖𝑡subscript𝑑𝑖𝛽superscript𝜂subscript𝑗2superscript𝜇3\displaystyle{\leq}\max_{{\ell_{j}}\in[\max\{0,t-d_{i}+1\},t]}\frac{\beta\sqrt% {d_{i}}}{\mu^{3}}\|p_{i}^{\ell_{j}}-p_{i}^{t}\|+\frac{\sqrt{d_{i}}\beta\eta^{% \ell_{j}}}{2\mu^{3}},≤ roman_max start_POSTSUBSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ [ roman_max { 0 , italic_t - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 } , italic_t ] end_POSTSUBSCRIPT divide start_ARG italic_β square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ + divide start_ARG square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_β italic_η start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG , (30)

where we used

[gi(pij)]:,j[gi(pit)]:,jgi(pij)gi(pit),normsubscriptdelimited-[]subscript𝑔𝑖superscriptsubscript𝑝𝑖subscript𝑗:𝑗subscriptdelimited-[]subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡:𝑗normsubscript𝑔𝑖superscriptsubscript𝑝𝑖subscript𝑗subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡\|[\nabla g_{i}(p_{i}^{\ell_{j}})]_{:,j}-[\nabla g_{i}(p_{i}^{t})]_{:,j}\|\leq% \|\nabla g_{i}(p_{i}^{\ell_{j}})-\nabla g_{i}(p_{i}^{t})\|,∥ [ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT - [ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ] start_POSTSUBSCRIPT : , italic_j end_POSTSUBSCRIPT ∥ ≤ ∥ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ , (31)

for all j[di]𝑗delimited-[]subscript𝑑𝑖j\in[d_{i}]italic_j ∈ [ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ], and smoothness of gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Furthermore, note that for τ2d¯1𝜏2¯𝑑1\tau\geq 2\bar{d}-1italic_τ ≥ 2 over¯ start_ARG italic_d end_ARG - 1, ηtdi+14ηtsuperscript𝜂𝑡subscript𝑑𝑖14superscript𝜂𝑡\eta^{t-d_{i}+1}\leq 4\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT ≤ 4 italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and therefore for t=0𝑡0t=0italic_t = 0 we have

^gi0gi(p0)norm^superscriptsubscript𝑔𝑖0subscript𝑔𝑖superscript𝑝0\displaystyle\|\hat{\nabla}g_{i}^{0}-\nabla g_{i}(p^{0})\|∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) ∥ 2diβη0/μ3ei0.absent2subscript𝑑𝑖𝛽superscript𝜂0superscript𝜇3superscriptsubscript𝑒𝑖0\displaystyle\leq 2\sqrt{d_{i}}\beta\eta^{0}/\mu^{3}\leq e_{i}^{0}.≤ 2 square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_β italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT / italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ≤ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT . (32)

Accordingly, the statement holds for t=0𝑡0t=0italic_t = 0, which covers the base case. For t>0𝑡0t>0italic_t > 0, we continue from (30) and bound pijpitnormsuperscriptsubscript𝑝𝑖subscript𝑗superscriptsubscript𝑝𝑖𝑡\|p_{i}^{\ell_{j}}-p_{i}^{t}\|∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ as

pijpitnormsuperscriptsubscript𝑝𝑖subscript𝑗superscriptsubscript𝑝𝑖𝑡\displaystyle\|p_{i}^{\ell_{j}}-p_{i}^{t}\|∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ k=jt1pikpik+1absentsuperscriptsubscript𝑘subscript𝑗𝑡1normsuperscriptsubscript𝑝𝑖𝑘superscriptsubscript𝑝𝑖𝑘1\displaystyle\leq\sum_{k={\ell_{j}}}^{t-1}\|p_{i}^{k}-p_{i}^{k+1}\|≤ ∑ start_POSTSUBSCRIPT italic_k = roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT ∥ (33)
=k=jt1[^gik]1(x^ik+1xik)absentsuperscriptsubscript𝑘subscript𝑗𝑡1normsuperscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑘1superscriptsubscript^𝑥𝑖𝑘1superscriptsubscript𝑥𝑖𝑘\displaystyle=\sum_{k={\ell_{j}}}^{t-1}\|[\hat{\nabla}g_{i}^{k}]^{-1}(\hat{x}_% {i}^{k+1}-x_{i}^{k})\|= ∑ start_POSTSUBSCRIPT italic_k = roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ∥ [ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ) ∥ (34)
k=jt1[^gik]1x^ik+1xik.absentsuperscriptsubscript𝑘subscript𝑗𝑡1normsuperscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑘1normsuperscriptsubscript^𝑥𝑖𝑘1superscriptsubscript𝑥𝑖𝑘\displaystyle\leq\sum_{k={\ell_{j}}}^{t-1}\|[\hat{\nabla}g_{i}^{k}]^{-1}\|\|% \hat{x}_{i}^{k+1}-x_{i}^{k}\|.≤ ∑ start_POSTSUBSCRIPT italic_k = roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ∥ [ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ ∥ over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∥ . (35)

The following two lemmas, whose proofs can be found in Appendices -F and -G bound each of the terms in the above summation:

Lemma 3.

Suppose that ^gitgi(pit)1/(2L)norm^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡12𝐿\|\hat{\nabla}g_{i}^{t}-\nabla g_{i}(p_{i}^{t})\|\leq 1/(2L)∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ ≤ 1 / ( 2 italic_L ) for some t𝑡titalic_t. Then σmin(^git)1/(2L)subscript𝜎^superscriptsubscript𝑔𝑖𝑡12𝐿\sigma_{\min}(\hat{\nabla}g_{i}^{t})\geq 1/(2L)italic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ≥ 1 / ( 2 italic_L ) and [^git]12Lnormsuperscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡12𝐿\|[\hat{\nabla}g_{i}^{t}]^{-1}\|\leq 2L∥ [ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ ≤ 2 italic_L.

Lemma 4.

For all t0𝑡0t\geq 0italic_t ≥ 0, if xt𝒳intsuperscript𝑥𝑡superscript𝒳intx^{t}\in{\cal X}^{\textnormal{int}}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, then for a user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ] the following holds:

x^it+1xitMnγt+ΔtΓ𝒳.normsuperscriptsubscript^𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡𝑀𝑛superscript𝛾𝑡superscriptΔ𝑡subscriptΓ𝒳\|\hat{x}_{i}^{t+1}-x_{i}^{t}\|\leq M\sqrt{n}\gamma^{t}+\Delta^{t}\Gamma_{\cal X}.∥ over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ ≤ italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT . (36)

Using Lemmas 3 and 4, we get

maxj[max{0,tdi+1},t]pijpitsubscriptsubscript𝑗0𝑡subscript𝑑𝑖1𝑡normsuperscriptsubscript𝑝𝑖subscript𝑗superscriptsubscript𝑝𝑖𝑡\displaystyle\max_{{\ell_{j}}\in[\max\{0,t-d_{i}+1\},t]}\|p_{i}^{{\ell_{j}}}-p% _{i}^{t}\|roman_max start_POSTSUBSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ [ roman_max { 0 , italic_t - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 } , italic_t ] end_POSTSUBSCRIPT ∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ (37)
maxj[max{0,tdi+1},t]2Lk=jt1Mnγk+ΔkΓ𝒳absentsubscriptsubscript𝑗0𝑡subscript𝑑𝑖1𝑡2𝐿superscriptsubscript𝑘subscript𝑗𝑡1𝑀𝑛superscript𝛾𝑘superscriptΔ𝑘subscriptΓ𝒳\displaystyle\leq\max_{{\ell_{j}}\in[\max\{0,t-d_{i}+1\},t]}2L\sum_{k={\ell_{j% }}}^{t-1}M\sqrt{n}\gamma^{k}+\Delta^{k}\Gamma_{\cal X}≤ roman_max start_POSTSUBSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ [ roman_max { 0 , italic_t - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 } , italic_t ] end_POSTSUBSCRIPT 2 italic_L ∑ start_POSTSUBSCRIPT italic_k = roman_ℓ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT + roman_Δ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT (38)
2L(tmin)(Mnγmin+ΔminΓ𝒳),absent2𝐿𝑡subscript𝑀𝑛superscript𝛾subscriptsuperscriptΔsubscriptsubscriptΓ𝒳\displaystyle\leq 2L(t-{\ell}_{\min})(M\sqrt{n}\gamma^{\ell_{\min}}+\Delta^{% \ell_{\min}}\Gamma_{\cal X}),≤ 2 italic_L ( italic_t - roman_ℓ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ) ( italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + roman_Δ start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) , (39)

where min=max{0,tdi+1}subscript0𝑡subscript𝑑𝑖1\ell_{\min}=\max\{0,t-d_{i}+1\}roman_ℓ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = roman_max { 0 , italic_t - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + 1 }. Lastly, note that tmindi1𝑡subscriptsubscript𝑑𝑖1t-\ell_{\min}\leq d_{i}-1italic_t - roman_ℓ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ≤ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1, γmin/γt2superscript𝛾subscriptsuperscript𝛾𝑡2\gamma^{\ell_{\min}}/\gamma^{t}\leq 2italic_γ start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ≤ 2, and Δmin/Δt4superscriptΔsubscriptsuperscriptΔ𝑡4\Delta^{\ell_{\min}}/\Delta^{t}\leq 4roman_Δ start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUPERSCRIPT / roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ≤ 4 for τ2d¯1𝜏2¯𝑑1\tau\geq 2\bar{d}-1italic_τ ≥ 2 over¯ start_ARG italic_d end_ARG - 1, which gives the final result.

-B Proof of Proposition 2

We will prove by induction that if at iteration t𝑡titalic_t, k[max{td¯+1,0},t]for-all𝑘𝑡¯𝑑10𝑡\forall k\in[\max\{t-\bar{d}+1,0\},t]∀ italic_k ∈ [ roman_max { italic_t - over¯ start_ARG italic_d end_ARG + 1 , 0 } , italic_t ], xk𝒳nηkμintsuperscript𝑥𝑘superscriptsubscript𝒳𝑛superscript𝜂𝑘𝜇intx^{k}\in{{\cal X}}_{\frac{\sqrt{n}\eta^{k}}{\mu}}^{\textnormal{int}}italic_x start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, then xt+1𝒳nηt+1μintsuperscript𝑥𝑡1subscriptsuperscript𝒳int𝑛superscript𝜂𝑡1𝜇x^{t+1}\in{\cal X}^{\textnormal{int}}_{\frac{\sqrt{n}\eta^{t+1}}{\mu}}italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT start_POSTSUBSCRIPT divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT and use Assumption 3 that x0𝒳nη0μintsuperscript𝑥0subscriptsuperscript𝒳int𝑛superscript𝜂0𝜇x^{0}\in{\cal X}^{\textnormal{int}}_{\frac{\sqrt{n}\eta^{0}}{\mu}}italic_x start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT start_POSTSUBSCRIPT divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT. This will ensure that xt+1,s𝒳intsuperscript𝑥𝑡1𝑠superscript𝒳intx^{t+1,s}\in{\cal X}^{\textnormal{int}}italic_x start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT as well by choice of ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and ηtsuperscript𝜂𝑡\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT. Therefore, we assume that xk𝒳nηkμintsuperscript𝑥𝑘subscriptsuperscript𝒳int𝑛superscript𝜂𝑘𝜇x^{k}\in{\cal X}^{\textnormal{int}}_{\frac{\sqrt{n}\eta^{k}}{\mu}}italic_x start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT start_POSTSUBSCRIPT divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT. Note that x^t+1𝒳intsuperscript^𝑥𝑡1superscript𝒳int\hat{x}^{t+1}\in{\cal X}^{\textnormal{int}}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT by definition.

For all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], we consider a modified utility function f~i(xi)subscript~𝑓𝑖subscript𝑥𝑖\tilde{f}_{i}(x_{i})over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), which is equal to fi(xi)subscript𝑓𝑖subscript𝑥𝑖f_{i}(x_{i})italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) if xi𝒳isubscript𝑥𝑖subscript𝒳𝑖x_{i}\in{\cal X}_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and an L𝐿Litalic_L-smooth, μ𝜇\muitalic_μ-strongly concave extension with β𝛽\betaitalic_β-smooth gradient beyond the set 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Accordingly, domf~i=didomsubscript~𝑓𝑖superscriptsubscript𝑑𝑖\textnormal{dom}\tilde{f}_{i}=\mathbb{R}^{d_{i}}dom over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and f~isubscript~𝑓𝑖\tilde{f}_{i}over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is L𝐿Litalic_L-smooth and μ𝜇\muitalic_μ-strongly concave over disuperscriptsubscript𝑑𝑖{\mathbb{R}}^{d_{i}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT with β𝛽\betaitalic_β-smooth gradient.

Using the modified utility function, we define the modified price response function

g~i(pi)=argmaxxidif~i(xi)xi,pi.subscript~𝑔𝑖subscript𝑝𝑖subscript𝑥𝑖superscriptsubscript𝑑𝑖argmaxsubscript~𝑓𝑖subscript𝑥𝑖subscript𝑥𝑖subscript𝑝𝑖{\tilde{g}}_{i}(p_{i})=\underset{x_{i}\in{\mathbb{R}}^{d_{i}}}{\operatorname*{% arg\,max}}\tilde{f}_{i}(x_{i})-\langle x_{i},p_{i}\rangle.over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = start_UNDERACCENT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_UNDERACCENT start_ARG roman_arg roman_max end_ARG over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - ⟨ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ . (40)

The following Lemma, whose proof can be found in Appendix -H, characterizes the regularity properties of g~i(pi)subscript~𝑔𝑖subscript𝑝𝑖\tilde{g}_{i}(p_{i})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), i[n]for-all𝑖delimited-[]𝑛\forall i\in[n]∀ italic_i ∈ [ italic_n ], under Assumption 2:

Lemma 5.

For all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], let g~i(pi)subscript~𝑔𝑖subscript𝑝𝑖\tilde{g}_{i}(p_{i})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) be the modified price response function in (40). Then, g~i(pi)subscript~𝑔𝑖subscript𝑝𝑖\tilde{g}_{i}(p_{i})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is bijective, 1/μ1𝜇1/\mu1 / italic_μ-Lipschitz continuous and β/μ3𝛽superscript𝜇3\beta/\mu^{3}italic_β / italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT-smooth over disuperscriptsubscript𝑑𝑖{\mathbb{R}}^{d_{i}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Furthermore, let 𝒫i={pidi:gi(pi)𝒳iint}subscript𝒫𝑖conditional-setsubscript𝑝𝑖superscriptsubscript𝑑𝑖subscript𝑔𝑖subscript𝑝𝑖superscriptsubscript𝒳𝑖int{\cal P}_{i}=\{p_{i}\in{\mathbb{R}}^{d_{i}}:g_{i}(p_{i})\in{\cal X}_{i}^{% \textnormal{int}}\}caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT : italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT }. The following hold true:

  1. 1.

    If g~i(pi)𝒳iintsubscript~𝑔𝑖subscript𝑝𝑖superscriptsubscript𝒳𝑖int\tilde{g}_{i}(p_{i})\in{\cal X}_{i}^{\textnormal{int}}over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, then pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

  2. 2.

    If pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, then g~i(pi)=gi(pi)subscript~𝑔𝑖subscript𝑝𝑖subscript𝑔𝑖subscript𝑝𝑖\tilde{g}_{i}(p_{i})=g_{i}(p_{i})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ).

For each user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], we let x~it+1=g~i(pit+1)superscriptsubscript~𝑥𝑖𝑡1subscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡1\tilde{x}_{i}^{t+1}=\tilde{g}_{i}(p_{i}^{t+1})over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) and we rearrange the price update rule:

x~it+1x^it+1=x~it+1xit^git(pt+1pt).superscriptsubscript~𝑥𝑖𝑡1superscriptsubscript^𝑥𝑖𝑡1superscriptsubscript~𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡^superscriptsubscript𝑔𝑖𝑡superscript𝑝𝑡1superscript𝑝𝑡\tilde{x}_{i}^{t+1}-\hat{x}_{i}^{t+1}=\tilde{x}_{i}^{t+1}-x_{i}^{t}-\hat{% \nabla}g_{i}^{t}(p^{t+1}-p^{t}).over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) . (41)

We can also write the Taylor expansion of the modified price response function g~i(p)subscript~𝑔𝑖𝑝\tilde{g}_{i}(p)over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p ) around pitsuperscriptsubscript𝑝𝑖𝑡p_{i}^{t}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT:

g~i(pit+1)g~i(pit)=g~i(pit)(pit+1pit)+R1.subscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡1subscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡subscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡superscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡subscript𝑅1\tilde{g}_{i}(p_{i}^{t+1})-\tilde{g}_{i}(p_{i}^{t})=\nabla\tilde{g}_{i}(p_{i}^% {t})(p_{i}^{t+1}-p_{i}^{t})+R_{1}.over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) - over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) = ∇ over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT . (42)

We replace g~i(pit)=gi(pit)=xitsubscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡superscriptsubscript𝑥𝑖𝑡\tilde{g}_{i}(p_{i}^{t})=g_{i}(p_{i}^{t})=x_{i}^{t}over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and g~i(pit)=gi(pit)subscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡\nabla\tilde{g}_{i}(p_{i}^{t})=\nabla g_{i}(p_{i}^{t})∇ over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) = ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) (since pit𝒫isuperscriptsubscript𝑝𝑖𝑡subscript𝒫𝑖p_{i}^{t}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) and plug the above equation into (41):

x~it+1x^it+1=(gi(pit)^git)(pit+1pit)+R1.superscriptsubscript~𝑥𝑖𝑡1superscriptsubscript^𝑥𝑖𝑡1subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡^superscriptsubscript𝑔𝑖𝑡superscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡subscript𝑅1\tilde{x}_{i}^{t+1}-\hat{x}_{i}^{t+1}=(\nabla g_{i}(p_{i}^{t})-\hat{\nabla}g_{% i}^{t})(p_{i}^{t+1}-p_{i}^{t})+R_{1}.over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = ( ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT . (43)

To bound the norm of the above equation, we use Lemma 2 to bound the norm of the first term and [44, Lemma 1] to bound the second term:

x~it+1x^it+1eitpit+1pit+β2μ3pit+1pit2.normsuperscriptsubscript~𝑥𝑖𝑡1superscriptsubscript^𝑥𝑖𝑡1superscriptsubscript𝑒𝑖𝑡normsuperscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡𝛽2superscript𝜇3superscriptnormsuperscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡2\|\tilde{x}_{i}^{t+1}-\hat{x}_{i}^{t+1}\|\leq e_{i}^{t}\|p_{i}^{t+1}-p_{i}^{t}% \|+\frac{\beta}{2\mu^{3}}\|p_{i}^{t+1}-p_{i}^{t}\|^{2}.∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ ≤ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ + divide start_ARG italic_β end_ARG start_ARG 2 italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (44)

Rearranging the price update rule and using Lemmas 2 and 4 we can bound the norm of the price change:

pit+1pit[^git]1x^it+1xitdelimited-∥∥superscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡delimited-∥∥superscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡1delimited-∥∥superscriptsubscript^𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡\displaystyle\begin{split}\|p_{i}^{t+1}-p_{i}^{t}\|&\leq\|[\hat{\nabla}g_{i}^{% t}]^{-1}\|\|\hat{x}_{i}^{t+1}-x_{i}^{t}\|\end{split}start_ROW start_CELL ∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ end_CELL start_CELL ≤ ∥ [ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ ∥ over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ end_CELL end_ROW (45)
2L(Mnγt+ΔtΓ𝒳).absent2𝐿𝑀𝑛superscript𝛾𝑡superscriptΔ𝑡subscriptΓ𝒳\displaystyle\leq 2L(M\sqrt{n}\gamma^{t}+\Delta^{t}\Gamma_{\cal X}).≤ 2 italic_L ( italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) . (46)

Note that both upper bounds for eitsuperscriptsubscript𝑒𝑖𝑡e_{i}^{t}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and pit+1pitnormsuperscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡\|p_{i}^{t+1}-p_{i}^{t}\|∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ are decreasing with t𝑡titalic_t. We can bound eitsuperscriptsubscript𝑒𝑖𝑡e_{i}^{t}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT using τ>2μΔΓ𝒳Mn𝜏2𝜇ΔsubscriptΓ𝒳𝑀𝑛\tau>\frac{2\mu\Delta\Gamma_{\cal X}}{M\sqrt{n}}italic_τ > divide start_ARG 2 italic_μ roman_Δ roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT end_ARG start_ARG italic_M square-root start_ARG italic_n end_ARG end_ARG and 1Γ𝒳1subscriptΓ𝒳1\leq\Gamma_{\cal X}1 ≤ roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT as:

eitsuperscriptsubscript𝑒𝑖𝑡\displaystyle e_{i}^{t}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT <βMdin(μ/n+32L(d¯1))/(4μ4(t+τ))absent𝛽𝑀subscript𝑑𝑖𝑛𝜇𝑛32𝐿¯𝑑14superscript𝜇4𝑡𝜏\displaystyle<{\beta M\sqrt{d_{i}n}(\mu/\sqrt{n}+32L(\bar{d}-1))}/({4\mu^{4}(t% +\tau)})< italic_β italic_M square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_n end_ARG ( italic_μ / square-root start_ARG italic_n end_ARG + 32 italic_L ( over¯ start_ARG italic_d end_ARG - 1 ) ) / ( 4 italic_μ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ( italic_t + italic_τ ) ) (47)
=βMdinγt(μ/n+32L(d¯1))/(4μ3),absent𝛽𝑀subscript𝑑𝑖𝑛superscript𝛾𝑡𝜇𝑛32𝐿¯𝑑14superscript𝜇3\displaystyle={\beta M\sqrt{d_{i}n}\gamma^{t}(\mu/\sqrt{n}+32L(\bar{d}-1))}/({% 4\mu^{3}}),= italic_β italic_M square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_n end_ARG italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ( italic_μ / square-root start_ARG italic_n end_ARG + 32 italic_L ( over¯ start_ARG italic_d end_ARG - 1 ) ) / ( 4 italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) , (48)

and further upper bound pit+1pitnormsuperscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡\|p_{i}^{t+1}-p_{i}^{t}\|∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ as

pit+1pit3LMnγt.normsuperscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡3𝐿𝑀𝑛superscript𝛾𝑡\|p_{i}^{t+1}-p_{i}^{t}\|\leq 3LM\sqrt{n}\gamma^{t}.∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ ≤ 3 italic_L italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT . (49)

Plugging the above bounds and γtsuperscript𝛾𝑡\gamma^{t}italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT into (44):

x~it+1x^it+1<3βLM2n4μ5(t+τ)2(6L+di(μ/n+32L(d¯1))).delimited-∥∥superscriptsubscript~𝑥𝑖𝑡1superscriptsubscript^𝑥𝑖𝑡13𝛽𝐿superscript𝑀2𝑛4superscript𝜇5superscript𝑡𝜏26𝐿subscript𝑑𝑖𝜇𝑛32𝐿¯𝑑1\displaystyle\begin{split}\|\tilde{x}_{i}^{t+1}-\hat{x}_{i}^{t+1}\|&<\frac{3% \beta LM^{2}n}{4\mu^{5}(t+\tau)^{2}}\Big{(}6L\\ &+{\sqrt{d_{i}}\left(\mu/\sqrt{n}+32L(\bar{d}-1)\right)}\Big{)}.\end{split}start_ROW start_CELL ∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ end_CELL start_CELL < divide start_ARG 3 italic_β italic_L italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG start_ARG 4 italic_μ start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ( italic_t + italic_τ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ( 6 italic_L end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ( italic_μ / square-root start_ARG italic_n end_ARG + 32 italic_L ( over¯ start_ARG italic_d end_ARG - 1 ) ) ) . end_CELL end_ROW (50)

Next, using Cauchy-Schwarz inequality, we bound x~t+1xt+1normsuperscript~𝑥𝑡1superscript𝑥𝑡1\|\tilde{x}^{t+1}-x^{t+1}\|∥ over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ as

x~t+1x^t+1<3βLM2n3/24μ5(t+τ)2(6L+d(μ/n+32L(d¯1)))delimited-∥∥superscript~𝑥𝑡1superscript^𝑥𝑡13𝛽𝐿superscript𝑀2superscript𝑛324superscript𝜇5superscript𝑡𝜏26𝐿𝑑𝜇𝑛32𝐿¯𝑑1\displaystyle\begin{split}\|\tilde{x}^{t+1}-\hat{x}^{t+1}\|&<\frac{3\beta LM^{% 2}n^{3/2}}{4\mu^{5}(t+\tau)^{2}}\Big{(}6L\\ &+{\sqrt{d}\left(\mu/\sqrt{n}+32L(\bar{d}-1)\right)}\Big{)}\end{split}start_ROW start_CELL ∥ over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ end_CELL start_CELL < divide start_ARG 3 italic_β italic_L italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT end_ARG start_ARG 4 italic_μ start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT ( italic_t + italic_τ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ( 6 italic_L end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + square-root start_ARG italic_d end_ARG ( italic_μ / square-root start_ARG italic_n end_ARG + 32 italic_L ( over¯ start_ARG italic_d end_ARG - 1 ) ) ) end_CELL end_ROW (51)
=3Δt/4,absent3superscriptΔ𝑡4\displaystyle={3\Delta^{t}}/{4},= 3 roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / 4 , (52)

where we used the definition of ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and i[n]didnsubscript𝑖delimited-[]𝑛subscript𝑑𝑖𝑑𝑛\sum_{i\in[n]}\sqrt{d_{i}}\leq\sqrt{dn}∑ start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT square-root start_ARG italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ≤ square-root start_ARG italic_d italic_n end_ARG. This establishes that by definition of a shrunk set and Δt/4=nηt+1μsuperscriptΔ𝑡4𝑛superscript𝜂𝑡1𝜇\Delta^{t}/4=\frac{\sqrt{n}\eta^{t+1}}{\mu}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / 4 = divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG, x~t+1𝒳nηt+1μintsuperscript~𝑥𝑡1subscriptsuperscript𝒳int𝑛superscript𝜂𝑡1𝜇\tilde{x}^{t+1}\in{\cal X}^{\textnormal{int}}_{\frac{\sqrt{n}\eta^{t+1}}{\mu}}over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT start_POSTSUBSCRIPT divide start_ARG square-root start_ARG italic_n end_ARG italic_η start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT. Furthermore, let x~it+1,s=gi~(pit+1,s)superscriptsubscript~𝑥𝑖𝑡1𝑠~subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡1𝑠\tilde{x}_{i}^{t+1,s}=\tilde{g_{i}}(p_{i}^{t+1,s})over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT = over~ start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ). Using 1/μ1𝜇1/\mu1 / italic_μ-Lipschitz continuity of gi~(pi)~subscript𝑔𝑖subscript𝑝𝑖\tilde{g_{i}}(p_{i})over~ start_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ):

x~it+1,sx~it+1Δt/(4n),normsuperscriptsubscript~𝑥𝑖𝑡1𝑠superscriptsubscript~𝑥𝑖𝑡1superscriptΔ𝑡4𝑛\|\tilde{x}_{i}^{t+1,s}-\tilde{x}_{i}^{t+1}\|\leq{\Delta^{t}}/(4\sqrt{n}),∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT - over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ ≤ roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / ( 4 square-root start_ARG italic_n end_ARG ) , (53)

and x~t+1,sx~t+1Δt/4normsuperscript~𝑥𝑡1𝑠superscript~𝑥𝑡1superscriptΔ𝑡4\|\tilde{x}^{t+1,s}-\tilde{x}^{t+1}\|\leq\Delta^{t}/4∥ over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT - over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ ≤ roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / 4. Accordingly, we have

x~t+1,sx^t+1<Δt,normsuperscript~𝑥𝑡1𝑠superscript^𝑥𝑡1superscriptΔ𝑡\|\tilde{x}^{t+1,s}-\hat{x}^{t+1}\|<\Delta^{t},∥ over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ < roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , (54)

which establishes that x~t+1,s𝒳intsuperscript~𝑥𝑡1𝑠superscript𝒳int\tilde{x}^{t+1,s}\in{\cal X}^{\textnormal{int}}over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT. Lastly, note that if x~t+1,x~t+1,s𝒳intsuperscript~𝑥𝑡1superscript~𝑥𝑡1𝑠superscript𝒳int\tilde{x}^{t+1},\tilde{x}^{t+1,s}\in{\cal X}^{\textnormal{int}}over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT , over~ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, then for all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], x~it+1,x~it+1,s𝒳iintsuperscriptsubscript~𝑥𝑖𝑡1superscriptsubscript~𝑥𝑖𝑡1𝑠superscriptsubscript𝒳𝑖int\tilde{x}_{i}^{t+1},\tilde{x}_{i}^{t+1,s}\in{\cal X}_{i}^{\textnormal{int}}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT , over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, or equivalently, g~i(pit+1),g~i(pit+1,s)𝒳iintsubscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡1subscript~𝑔𝑖superscriptsubscript𝑝𝑖𝑡1𝑠superscriptsubscript𝒳𝑖int\tilde{g}_{i}(p_{i}^{t+1}),\tilde{g}_{i}(p_{i}^{t+1,s})\in{\cal X}_{i}^{% \textnormal{int}}over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) , over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT. Using Lemma 5 we have that pit+1,pit+1,s𝒫isuperscriptsubscript𝑝𝑖𝑡1superscriptsubscript𝑝𝑖𝑡1𝑠subscript𝒫𝑖p_{i}^{t+1},p_{i}^{t+1,s}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i[n]for-all𝑖delimited-[]𝑛\forall i\in[n]∀ italic_i ∈ [ italic_n ]. Hence, g~i(pt+1)=gi(pt+1)subscript~𝑔𝑖superscript𝑝𝑡1subscript𝑔𝑖superscript𝑝𝑡1\tilde{g}_{i}(p^{t+1})=g_{i}(p^{t+1})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) and x~it+1=xit+1superscriptsubscript~𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡1\tilde{x}_{i}^{t+1}=x_{i}^{t+1}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT as well as g~i(pt+1,s)=gi(pt+1,s)subscript~𝑔𝑖superscript𝑝𝑡1𝑠subscript𝑔𝑖superscript𝑝𝑡1𝑠\tilde{g}_{i}(p^{t+1,s})=g_{i}(p^{t+1,s})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ) = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT ) and x~it+1,s=xit+1,ssuperscriptsubscript~𝑥𝑖𝑡1𝑠superscriptsubscript𝑥𝑖𝑡1𝑠\tilde{x}_{i}^{t+1,s}=x_{i}^{t+1,s}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 , italic_s end_POSTSUPERSCRIPT for all i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], which proves the proposition.

-C Proof of Theorem 1

We denote the regret incurred by the update stage as Ru(T)=t=1T/2f(x)f(xt)subscript𝑅𝑢𝑇superscriptsubscript𝑡1𝑇2𝑓superscript𝑥𝑓superscript𝑥𝑡R_{u}(T)=\sum_{t=1}^{T/2}f(x^{\star})-f(x^{t})italic_R start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_T ) = ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) and the regret incurred by the sampling stage as Rs(T)=t=1T/2f(x)f(xt,s)subscript𝑅𝑠𝑇superscriptsubscript𝑡1𝑇2𝑓superscript𝑥𝑓superscript𝑥𝑡𝑠R_{s}(T)=\sum_{t=1}^{T/2}f(x^{\star})-f(x^{t,s})italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_T ) = ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT ). Let yt+1=xt+γtptsuperscript𝑦𝑡1superscript𝑥𝑡superscript𝛾𝑡subscript𝑝𝑡y^{t+1}=x^{t}+\gamma^{t}p_{t}italic_y start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT = italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. By Lemma 5, we know that pt=f(xt)superscript𝑝𝑡𝑓superscript𝑥𝑡p^{t}=\nabla f(x^{t})italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = ∇ italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ), t0for-all𝑡0\forall t\geq 0∀ italic_t ≥ 0, since xt𝒳intsuperscript𝑥𝑡superscript𝒳intx^{t}\in{\cal X}^{\textnormal{int}}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT by Proposition 2. For t1𝑡1t\geq 1italic_t ≥ 1, we write using strong concavity:

f(x)f(xt)f(xt),xtxμ2xtx2𝑓superscript𝑥𝑓superscript𝑥𝑡𝑓superscript𝑥𝑡superscript𝑥𝑡superscript𝑥𝜇2superscriptnormsuperscript𝑥𝑡superscript𝑥2\displaystyle f(x^{\star})-f(x^{t})\leq\langle-\nabla f(x^{t}),x^{t}-x^{\star}% \rangle-\frac{\mu}{2}\|x^{t}-x^{\star}\|^{2}italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ≤ ⟨ - ∇ italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) , italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ⟩ - divide start_ARG italic_μ end_ARG start_ARG 2 end_ARG ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (55)
=1γtxtyt+1,xtxμ2xtx2absent1superscript𝛾𝑡superscript𝑥𝑡superscript𝑦𝑡1superscript𝑥𝑡superscript𝑥𝜇2superscriptnormsuperscript𝑥𝑡superscript𝑥2\displaystyle=\frac{1}{\gamma^{t}}\langle x^{t}-y^{t+1},x^{t}-x^{\star}\rangle% -\frac{\mu}{2}\|x^{t}-x^{\star}\|^{2}= divide start_ARG 1 end_ARG start_ARG italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG ⟨ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_y start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ⟩ - divide start_ARG italic_μ end_ARG start_ARG 2 end_ARG ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (56)
=12γt(xtyt+12+xtx2yt+1x2)μ2xtx2.absent12superscript𝛾𝑡superscriptdelimited-∥∥superscript𝑥𝑡superscript𝑦𝑡12superscriptdelimited-∥∥superscript𝑥𝑡superscript𝑥2superscriptdelimited-∥∥superscript𝑦𝑡1superscript𝑥2𝜇2superscriptdelimited-∥∥superscript𝑥𝑡superscript𝑥2\displaystyle\begin{split}&=\frac{1}{2\gamma^{t}}\left(\|x^{t}-y^{t+1}\|^{2}+% \|x^{t}-x^{\star}\|^{2}-\|y^{t+1}-x^{\star}\|^{2}\right)\\ &\hskip 28.45274pt-\frac{\mu}{2}\|x^{t}-x^{\star}\|^{2}.\end{split}start_ROW start_CELL end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG ( ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_y start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ italic_y start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - divide start_ARG italic_μ end_ARG start_ARG 2 end_ARG ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . end_CELL end_ROW (57)

Next, we bound the yt+1x2superscriptnormsuperscript𝑦𝑡1superscript𝑥2\|y^{t+1}-x^{\star}\|^{2}∥ italic_y start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT term using Theorem 2 as follows:

yt+1x2Π𝒳Δt(yt+1)Π𝒳Δt(x)2superscriptnormsuperscript𝑦𝑡1superscript𝑥2superscriptnormsubscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑦𝑡1subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥2\displaystyle\|y^{t+1}-x^{\star}\|^{2}\geq\|\Pi_{{\cal X}_{\Delta^{t}}}(y^{t+1% })-\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star})\|^{2}∥ italic_y start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ ∥ roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_y start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (58)
=x^t+1Π𝒳Δt(x)2absentsuperscriptnormsuperscript^𝑥𝑡1subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥2\displaystyle=\|\hat{x}^{t+1}-\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star})\|^{2}= ∥ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (59)
=x^t+1xt+1+xt+1x+xΠ𝒳Δt(x)2absentsuperscriptnormsuperscript^𝑥𝑡1superscript𝑥𝑡1superscript𝑥𝑡1superscript𝑥superscript𝑥subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥2\displaystyle=\|\hat{x}^{t+1}-x^{t+1}+x^{t+1}-x^{\star}+x^{\star}-\Pi_{{\cal X% }_{\Delta^{t}}}(x^{\star})\|^{2}= ∥ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT + italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT + italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (60)
=x^t+1xt+12+xt+1x2+xΠ𝒳Δt(x)2absentsuperscriptnormsuperscript^𝑥𝑡1superscript𝑥𝑡12superscriptnormsuperscript𝑥𝑡1superscript𝑥2superscriptnormsuperscript𝑥subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥2\displaystyle=\|\hat{x}^{t+1}-x^{t+1}\|^{2}+\|x^{t+1}-x^{\star}\|^{2}+\|x^{% \star}-\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star})\|^{2}= ∥ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
+2x^t+1xt+1,xt+1x+2xt+1x,xΠ𝒳Δt(x)2superscript^𝑥𝑡1superscript𝑥𝑡1superscript𝑥𝑡1superscript𝑥2superscript𝑥𝑡1superscript𝑥superscript𝑥subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥\displaystyle+2\langle\hat{x}^{t+1}{-}x^{t+1},x^{t+1}{-}x^{\star}\rangle{+}2% \langle x^{t+1}{-}x^{\star},x^{\star}{-}\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star})\rangle+ 2 ⟨ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ⟩ + 2 ⟨ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ⟩
+2xΠ𝒳Δt(x),x^t+1xt+12superscript𝑥subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥superscript^𝑥𝑡1superscript𝑥𝑡1\displaystyle+2\langle x^{\star}-\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star}),\hat{x% }^{t+1}-x^{t+1}\rangle+ 2 ⟨ italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) , over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ⟩ (61)
xt+1x22x^t+1xt+1xt+1x2xt+1xxΠ𝒳Δt(x)2xΠ𝒳Δt(x)x^t+1xt+1absentsuperscriptdelimited-∥∥superscript𝑥𝑡1superscript𝑥22delimited-∥∥superscript^𝑥𝑡1superscript𝑥𝑡1delimited-∥∥superscript𝑥𝑡1superscript𝑥2delimited-∥∥superscript𝑥𝑡1superscript𝑥delimited-∥∥superscript𝑥subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥2delimited-∥∥superscript𝑥subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥delimited-∥∥superscript^𝑥𝑡1superscript𝑥𝑡1\displaystyle\begin{split}&\geq\|x^{t+1}-x^{\star}\|^{2}-2\|\hat{x}^{t+1}-x^{t% +1}\|\|x^{t+1}-x^{\star}\|\\ &-2\|x^{t+1}-x^{\star}\|\|x^{\star}-\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star})\|\\ &-2\|x^{\star}-\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star})\|\|\hat{x}^{t+1}-x^{t+1}% \|\end{split}start_ROW start_CELL end_CELL start_CELL ≥ ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 ∥ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - 2 ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ ∥ italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∥ end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - 2 ∥ italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∥ ∥ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ end_CELL end_ROW (62)
xt+1x22ΔtR(Γ𝒳+3/4)3/2(Δt)2Γ𝒳absentsuperscriptdelimited-∥∥superscript𝑥𝑡1superscript𝑥22superscriptΔ𝑡𝑅subscriptΓ𝒳3432superscriptsuperscriptΔ𝑡2subscriptΓ𝒳\displaystyle\begin{split}&\geq\|x^{t+1}-x^{\star}\|^{2}-2\Delta^{t}R(\Gamma_{% \cal X}+3/4)-3/2(\Delta^{t})^{2}\Gamma_{\cal X}\end{split}start_ROW start_CELL end_CELL start_CELL ≥ ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_R ( roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT + 3 / 4 ) - 3 / 2 ( roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT end_CELL end_ROW (63)
:=xt+1x2Ct,assignabsentsuperscriptnormsuperscript𝑥𝑡1superscript𝑥2subscript𝐶𝑡\displaystyle\vcentcolon=\|x^{t+1}-x^{\star}\|^{2}-C_{t},:= ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_C start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , (64)

where the last inequality uses xt+1x^t+1<3Δt/4normsuperscript𝑥𝑡1superscript^𝑥𝑡13superscriptΔ𝑡4\|{x}^{t+1}-\hat{x}^{t+1}\|<3\Delta^{t}/4∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ∥ < 3 roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT / 4 given by Proposition 2 and Proposition 1 to bound xΠ𝒳Δt(x)normsuperscript𝑥subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥\|x^{\star}{-}\Pi_{{\cal X}_{\Delta^{t}}}(x^{\star})\|∥ italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ∥. Plugging this in (LABEL:eq:instaregret):

f(x)f(xt)M2nγt2μ2xtx2+Ct2γt+12γt(xtx2xt+1x2).𝑓superscript𝑥𝑓superscript𝑥𝑡superscript𝑀2𝑛superscript𝛾𝑡2𝜇2superscriptdelimited-∥∥superscript𝑥𝑡superscript𝑥2superscript𝐶𝑡2superscript𝛾𝑡12superscript𝛾𝑡superscriptdelimited-∥∥superscript𝑥𝑡superscript𝑥2superscriptdelimited-∥∥superscript𝑥𝑡1superscript𝑥2\begin{split}f(x^{\star})-f(x^{t})\leq&\frac{M^{2}n\gamma^{t}}{2}-\frac{\mu}{2% }\|x^{t}-x^{\star}\|^{2}+\frac{C^{t}}{2\gamma^{t}}\\ &{+}\frac{1}{2\gamma^{t}}(\|x^{t}{-}x^{\star}\|^{2}-\|x^{t+1}-x^{\star}\|^{2})% .\end{split}start_ROW start_CELL italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ≤ end_CELL start_CELL divide start_ARG italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG - divide start_ARG italic_μ end_ARG start_ARG 2 end_ARG ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + divide start_ARG 1 end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG ( ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) . end_CELL end_ROW (65)

Summing from t=1𝑡1t=1italic_t = 1 to T/2𝑇2T/2italic_T / 2 telescopes the xtx2superscriptnormsuperscript𝑥𝑡superscript𝑥2\|x^{t}-x^{\star}\|^{2}∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT terms:

nRu(T)M2nlog(T/2)2μ+μτ2x1x2+t=2T/2(12γt12γt1μ2)xtx212γT/2xT/2+1x2+t=1T/2Ct2γt𝑛subscript𝑅𝑢𝑇superscript𝑀2𝑛𝑇22𝜇𝜇𝜏2superscriptdelimited-∥∥superscript𝑥1superscript𝑥2superscriptsubscript𝑡2𝑇212superscript𝛾𝑡12superscript𝛾𝑡1𝜇2superscriptdelimited-∥∥superscript𝑥𝑡superscript𝑥212superscript𝛾𝑇2superscriptdelimited-∥∥superscript𝑥𝑇21superscript𝑥2superscriptsubscript𝑡1𝑇2superscript𝐶𝑡2superscript𝛾𝑡\displaystyle\begin{split}&nR_{u}(T)\leq\frac{M^{2}n\log(T/2)}{2\mu}+\frac{\mu% \tau}{2}\|x^{1}-x^{\star}\|^{2}\\ &\hskip 28.45274pt+\sum_{t=2}^{T/2}\left(\frac{1}{2\gamma^{t}}-\frac{1}{2% \gamma^{t-1}}-\frac{\mu}{2}\right)\|x^{t}-x^{\star}\|^{2}\\ &\hskip 28.45274pt-\frac{1}{2\gamma^{T/2}}\|x^{T/2+1}-x^{\star}\|^{2}+\sum_{t=% 1}^{T/2}\frac{C^{t}}{2\gamma^{t}}\end{split}start_ROW start_CELL end_CELL start_CELL italic_n italic_R start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_T ) ≤ divide start_ARG italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n roman_log ( italic_T / 2 ) end_ARG start_ARG 2 italic_μ end_ARG + divide start_ARG italic_μ italic_τ end_ARG start_ARG 2 end_ARG ∥ italic_x start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + ∑ start_POSTSUBSCRIPT italic_t = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG - divide start_ARG 1 end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT end_ARG - divide start_ARG italic_μ end_ARG start_ARG 2 end_ARG ) ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - divide start_ARG 1 end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT end_ARG ∥ italic_x start_POSTSUPERSCRIPT italic_T / 2 + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT divide start_ARG italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG end_CELL end_ROW (66)
M2nlog(T/2)2μ+μτ2x1x2+t=1T/2Ct2γt.absentsuperscript𝑀2𝑛𝑇22𝜇𝜇𝜏2superscriptnormsuperscript𝑥1superscript𝑥2superscriptsubscript𝑡1𝑇2superscript𝐶𝑡2superscript𝛾𝑡\displaystyle\leq\frac{M^{2}n\log(T/2)}{2\mu}+\frac{\mu\tau}{2}\|x^{1}-x^{% \star}\|^{2}+\sum_{t=1}^{T/2}\frac{C^{t}}{2\gamma^{t}}.≤ divide start_ARG italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n roman_log ( italic_T / 2 ) end_ARG start_ARG 2 italic_μ end_ARG + divide start_ARG italic_μ italic_τ end_ARG start_ARG 2 end_ARG ∥ italic_x start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT divide start_ARG italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG . (67)

Finally, note that Ct=𝒪(1/t2)superscript𝐶𝑡𝒪1superscript𝑡2C^{t}={\cal O}(1/t^{2})italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT = caligraphic_O ( 1 / italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) because the it consists of terms ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and (Δt)2superscriptsuperscriptΔ𝑡2(\Delta^{t})^{2}( roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Therefore, we can use the bounds t=1T/21t+τt=1T/21t+2log(T/2)superscriptsubscript𝑡1𝑇21𝑡𝜏superscriptsubscript𝑡1𝑇21𝑡2𝑇2{\sum}_{t=1}^{T/2}\frac{1}{t+\tau}\leq{\sum}_{t=1}^{T/2}\frac{1}{t+2}\leq\log(% T/2)∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_t + italic_τ end_ARG ≤ ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_t + 2 end_ARG ≤ roman_log ( italic_T / 2 ) and for k2𝑘2k\geq 2italic_k ≥ 2, t=1T/21(t+2)k1superscriptsubscript𝑡1𝑇21superscript𝑡2𝑘1\sum_{t=1}^{T/2}\frac{1}{(t+2)^{k}}\leq 1∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG ( italic_t + 2 ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG ≤ 1 to show that:

t=1T/2Ct2γtμΔR(3/4+Γ𝒳)log(T/2)+3μΔ2Γ𝒳/4.superscriptsubscript𝑡1𝑇2superscript𝐶𝑡2superscript𝛾𝑡𝜇Δ𝑅34subscriptΓ𝒳𝑇23𝜇superscriptΔ2subscriptΓ𝒳4\begin{split}\sum_{t=1}^{T/2}\frac{C^{t}}{2\gamma^{t}}\leq&\mu{\Delta R(3/4+% \Gamma_{\cal X})\log(T/2)}+{3\mu\Delta^{2}\Gamma_{\cal X}}/4.\end{split}start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT divide start_ARG italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG ≤ end_CELL start_CELL italic_μ roman_Δ italic_R ( 3 / 4 + roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) roman_log ( italic_T / 2 ) + 3 italic_μ roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT / 4 . end_CELL end_ROW (68)

Plugging (68) into (67) and dividing by both sides by n𝑛nitalic_n, we get the regret incurred by the update stages. For the sampling stages, we note that due to the strong concavity of f𝑓fitalic_f

f(xt)f(xt,s)𝑓superscript𝑥𝑡𝑓superscript𝑥𝑡𝑠\displaystyle f(x^{t}){-}f(x^{t,s})italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT ) f(xt,s),xtxt,sMnΔt14.absent𝑓superscript𝑥𝑡𝑠superscript𝑥𝑡superscript𝑥𝑡𝑠𝑀𝑛superscriptΔ𝑡14\displaystyle\leq\langle\nabla f(x^{t,s}),x^{t}{-}x^{t,s}\rangle\leq M\sqrt{n}% \frac{\Delta^{t-1}}{4}.≤ ⟨ ∇ italic_f ( italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT ) , italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT ⟩ ≤ italic_M square-root start_ARG italic_n end_ARG divide start_ARG roman_Δ start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT end_ARG start_ARG 4 end_ARG . (69)

Accordingly f(x)f(xt,s)f(x)f(xt)+MnΔt1/4𝑓superscript𝑥𝑓superscript𝑥𝑡𝑠𝑓superscript𝑥𝑓superscript𝑥𝑡𝑀𝑛superscriptΔ𝑡14f(x^{\star})-f(x^{t,s})\leq f(x^{\star})-f(x^{t})+M\sqrt{n}\Delta^{t-1}/4italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT ) ≤ italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + italic_M square-root start_ARG italic_n end_ARG roman_Δ start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT / 4. Summing from t=1𝑡1t=1italic_t = 1 to T/2𝑇2T/2italic_T / 2, we get

nRs(T)𝑛subscript𝑅𝑠𝑇\displaystyle nR_{s}(T)italic_n italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_T ) =nRu(T)+M4t=1T/2Δt1nRu(T)+ΔMn4,absent𝑛subscript𝑅𝑢𝑇𝑀4superscriptsubscript𝑡1𝑇2superscriptΔ𝑡1𝑛subscript𝑅𝑢𝑇Δ𝑀𝑛4\displaystyle{=}nR_{u}(T){+}\frac{M}{4}\sum_{t=1}^{T/2}\Delta^{t-1}{\leq}nR_{u% }(T){+}\frac{\Delta M\sqrt{n}}{4},= italic_n italic_R start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_T ) + divide start_ARG italic_M end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t - 1 end_POSTSUPERSCRIPT ≤ italic_n italic_R start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_T ) + divide start_ARG roman_Δ italic_M square-root start_ARG italic_n end_ARG end_ARG start_ARG 4 end_ARG , (70)

which gives the final result as

R(T)2Ru(T)+ΔM/(4n).𝑅𝑇2subscript𝑅𝑢𝑇Δ𝑀4𝑛R(T)\leq 2R_{u}(T)+{\Delta M}/({4\sqrt{n}}).italic_R ( italic_T ) ≤ 2 italic_R start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_T ) + roman_Δ italic_M / ( 4 square-root start_ARG italic_n end_ARG ) . (71)

To get the convergence result, we rearrange (65):

xt+1x2xtx2(1μγt)+M2n(γt)2+Ct+2γt(f(xt)f(x))superscriptdelimited-∥∥superscript𝑥𝑡1superscript𝑥2superscriptdelimited-∥∥superscript𝑥𝑡superscript𝑥21𝜇superscript𝛾𝑡superscript𝑀2𝑛superscriptsuperscript𝛾𝑡2superscript𝐶𝑡2superscript𝛾𝑡𝑓superscript𝑥𝑡𝑓superscript𝑥\displaystyle\begin{split}\|x^{t+1}-x^{\star}\|^{2}&\leq\|x^{t}-x^{\star}\|^{2% }(1-\mu\gamma^{t})+M^{2}n(\gamma^{t})^{2}\\ &+C^{t}+2\gamma^{t}(f(x^{t})-f(x^{\star}))\end{split}start_ROW start_CELL ∥ italic_x start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL ≤ ∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_μ italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ( italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + 2 italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ( italic_f ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - italic_f ( italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ) end_CELL end_ROW (72)
\displaystyle\leq xtx2(1μγt)+M2n(γt)2+Ct.superscriptnormsuperscript𝑥𝑡superscript𝑥21𝜇superscript𝛾𝑡superscript𝑀2𝑛superscriptsuperscript𝛾𝑡2superscript𝐶𝑡\displaystyle\|x^{t}-x^{\star}\|^{2}(1-\mu\gamma^{t})+M^{2}n(\gamma^{t})^{2}+C% ^{t}.∥ italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_μ italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ( italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT . (73)

We get an equation like the above for all t0𝑡0t\geq 0italic_t ≥ 0. We multiply each by (1μγt+1)1𝜇superscript𝛾𝑡1(1-\mu\gamma^{t+1})( 1 - italic_μ italic_γ start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT ) for t<T/21𝑡𝑇21t<T/2-1italic_t < italic_T / 2 - 1 and sum them from t=0𝑡0t=0italic_t = 0 to t=T/21𝑡𝑇21t=T/2-1italic_t = italic_T / 2 - 1 to get:

xT/2x2x0x2t=0T/21(1μγt)+M2nt=0T/21(γt)2i=t+1T/21(1μγi)+t=0T/21Cti=t+1T/21(1μγi)superscriptdelimited-∥∥superscript𝑥𝑇2superscript𝑥2superscriptdelimited-∥∥superscript𝑥0superscript𝑥2superscriptsubscriptproduct𝑡0𝑇211𝜇superscript𝛾𝑡superscript𝑀2𝑛superscriptsubscript𝑡0𝑇21superscriptsuperscript𝛾𝑡2superscriptsubscriptproduct𝑖𝑡1𝑇211𝜇superscript𝛾𝑖superscriptsubscript𝑡0𝑇21superscript𝐶𝑡superscriptsubscriptproduct𝑖𝑡1𝑇211𝜇superscript𝛾𝑖\displaystyle\begin{split}&\|x^{T/2}-x^{\star}\|^{2}\leq\|x^{0}-x^{\star}\|^{2% }\prod_{t=0}^{T/2-1}(1-\mu\gamma^{t})\\ &+M^{2}n\sum_{t=0}^{T/2-1}(\gamma^{t})^{2}\prod_{i=t+1}^{T/2-1}(1-\mu\gamma^{i% })\\ &+\sum_{t=0}^{T/2-1}C^{t}\prod_{i=t+1}^{T/2-1}(1-\mu\gamma^{i})\end{split}start_ROW start_CELL end_CELL start_CELL ∥ italic_x start_POSTSUPERSCRIPT italic_T / 2 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∥ italic_x start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 - 1 end_POSTSUPERSCRIPT ( 1 - italic_μ italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 - 1 end_POSTSUPERSCRIPT ( italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_i = italic_t + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 - 1 end_POSTSUPERSCRIPT ( 1 - italic_μ italic_γ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 - 1 end_POSTSUPERSCRIPT italic_C start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_i = italic_t + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T / 2 - 1 end_POSTSUPERSCRIPT ( 1 - italic_μ italic_γ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) end_CELL end_ROW (74)
x0x2τ1τ1+T/2+M2nlog(T/2)μ2(T/2+τ1)+2R(3/4+Γ𝒳)Δlog(T/2)(T/2+τ1)+3Δ2Γ𝒳2(T/2+τ1).absentsuperscriptdelimited-∥∥superscript𝑥0superscript𝑥2𝜏1𝜏1𝑇2superscript𝑀2𝑛𝑇2superscript𝜇2𝑇2𝜏12𝑅34subscriptΓ𝒳Δ𝑇2𝑇2𝜏13superscriptΔ2subscriptΓ𝒳2𝑇2𝜏1\displaystyle\begin{split}&\leq\|x^{0}-x^{\star}\|^{2}\frac{\tau-1}{\tau-1+T/2% }+\frac{M^{2}n\log(T/2)}{\mu^{2}(T/2+\tau-1)}\\ &+\frac{2R(3/4+\Gamma_{\cal X})\Delta\log(T/2)}{(T/2+\tau-1)}+\frac{3\Delta^{2% }\Gamma_{\cal X}}{2(T/2+\tau-1)}.\end{split}start_ROW start_CELL end_CELL start_CELL ≤ ∥ italic_x start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_τ - 1 end_ARG start_ARG italic_τ - 1 + italic_T / 2 end_ARG + divide start_ARG italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n roman_log ( italic_T / 2 ) end_ARG start_ARG italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_T / 2 + italic_τ - 1 ) end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + divide start_ARG 2 italic_R ( 3 / 4 + roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ) roman_Δ roman_log ( italic_T / 2 ) end_ARG start_ARG ( italic_T / 2 + italic_τ - 1 ) end_ARG + divide start_ARG 3 roman_Δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT end_ARG start_ARG 2 ( italic_T / 2 + italic_τ - 1 ) end_ARG . end_CELL end_ROW (75)

which completes the proof.

-D Proof of Remark 2

For a user i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], using the modified price response function g~i(pi)subscript~𝑔𝑖subscript𝑝𝑖\tilde{g}_{i}(p_{i})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) introduced in the proof of Proposition 2, we have that

x~itxi0η0/μ,t[di,1],formulae-sequencenormsuperscriptsubscript~𝑥𝑖𝑡superscriptsubscript𝑥𝑖0superscript𝜂0𝜇for-all𝑡subscript𝑑𝑖1\|\tilde{x}_{i}^{-t}-x_{i}^{0}\|\leq{\eta^{0}}/{\mu},~{}\forall t\in[-d_{i},-1],∥ over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_t end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ∥ ≤ italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT / italic_μ , ∀ italic_t ∈ [ - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , - 1 ] , (76)

which implies that x~it𝒳iintsuperscriptsubscript~𝑥𝑖𝑡superscriptsubscript𝒳𝑖int\tilde{x}_{i}^{-t}\in{\cal X}_{i}^{\textnormal{int}}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT because x0𝒳η0nμintsuperscript𝑥0subscriptsuperscript𝒳intsuperscript𝜂0𝑛𝜇x^{0}\in{\cal X}^{\textnormal{int}}_{\frac{\eta^{0}\sqrt{n}}{\mu}}italic_x start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT start_POSTSUBSCRIPT divide start_ARG italic_η start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT square-root start_ARG italic_n end_ARG end_ARG start_ARG italic_μ end_ARG end_POSTSUBSCRIPT. As such, x~it=xitsuperscriptsubscript~𝑥𝑖𝑡superscriptsubscript𝑥𝑖𝑡\tilde{x}_{i}^{-t}=x_{i}^{-t}over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_t end_POSTSUPERSCRIPT = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_t end_POSTSUPERSCRIPT and pit=fi(xit)superscriptsubscript𝑝𝑖𝑡subscript𝑓𝑖superscriptsubscript𝑥𝑖𝑡p_{i}^{-t}=\nabla f_{i}(x_{i}^{-t})italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_t end_POSTSUPERSCRIPT = ∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_t end_POSTSUPERSCRIPT ).

BERKAY TURAN is pursuing the Ph.D. degree in Electrical and Computer Engineering at the University of California, Santa Barbara. He received the B.Sc. degree in Electrical and Electronics Engineering as well as the B.Sc. degree in Physics degree from  Boğaziçi University, Istanbul, Turkey, in 2018. The overarching goal of his research is to design network control, optimization, and learning frameworks to promote efficiency and resiliency in societal-scale cyber-physical systems.
SPENCER HUTCHINSON received the B.S. degree in electrical engineering from Colorado School of Mines in 2021. He is currently pursuing the Ph.D. degree in electrical and computer engineering from the University of California, Santa Barbara in Santa Barbara, CA, USA. His research interests include the design and analysis of optimization and learning algorithms for the control of human-cyber-physical systems.
MAHNOOSH ALIZADEH is an associate professor of Electrical and Computer Engineering at the University of California Santa Barbara. She received the B.Sc. degree (’09) in Electrical Engineering from Sharif University of Technology and the M.Sc. (’13) and Ph.D. (’14) degrees in Electrical and Computer Engineering from the University of California Davis. From 2014 to 2016, she was a postdoctoral scholar at Stanford University. Her research interests are focused on designing network control, optimization, and learning frameworks to promote efficiency and resiliency in societal-scale cyber-physical systems. Dr. Alizadeh is a recipient of the NSF CAREER award.

-E Proof of Lemma 1

By definition, fi(xi)subscript𝑓𝑖subscript𝑥𝑖f_{i}(x_{i})italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is strongly concave over 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, therefore the optimization problem maxxdomfifi(xi)xi,pisubscript𝑥domsubscript𝑓𝑖subscript𝑓𝑖subscript𝑥𝑖subscript𝑥𝑖subscript𝑝𝑖{\max}_{x\in{\textnormal{dom}f_{i}}}f_{i}(x_{i})-\langle x_{i},p_{i}\rangleroman_max start_POSTSUBSCRIPT italic_x ∈ dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - ⟨ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ is strongly concave and has a unique solution for pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Since 𝒳idomfisubscript𝒳𝑖domsubscript𝑓𝑖{\cal X}_{i}\subseteq\textnormal{dom}f_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT by Assumption 1, the optimal solution is in the interior of domfidomsubscript𝑓𝑖\textnormal{dom}f_{i}dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Therefore the first-order optimality condition implies that the optimal solution gi(pi)subscript𝑔𝑖subscript𝑝𝑖g_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) satisfies

pi=fi(gi(pi)),subscript𝑝𝑖subscript𝑓𝑖subscript𝑔𝑖subscript𝑝𝑖p_{i}=\nabla f_{i}(g_{i}(p_{i})),italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) , (77)

which implies that fisubscript𝑓𝑖\nabla{f}_{i}∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is surjective for pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We also know that the gradient of a strongly concave function is injective333To see this, suppose that x1x2subscript𝑥1subscript𝑥2x_{1}\neq x_{2}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≠ italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and therefore x1x2>0normsubscript𝑥1subscript𝑥20\|x_{1}-x_{2}\|>0∥ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ > 0. If f(x1)=f(x2)𝑓subscript𝑥1𝑓subscript𝑥2\nabla f(x_{1})=\nabla f(x_{2})∇ italic_f ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = ∇ italic_f ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), (1) results in 0μx1x220𝜇superscriptnormsubscript𝑥1subscript𝑥220\geq\mu\|x_{1}-x_{2}\|^{2}0 ≥ italic_μ ∥ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, which is a contradiction and x1=x2subscript𝑥1subscript𝑥2x_{1}=x_{2}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT must hold., therefore, fisubscript𝑓𝑖\nabla{f}_{i}∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is bijective and invertible and gi(pi)=fi1(pi)subscript𝑔𝑖subscript𝑝𝑖superscriptsubscript𝑓𝑖1subscript𝑝𝑖{g}_{i}(p_{i})=\nabla{f}_{i}^{-1}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), which also proves that gi(pi)subscript𝑔𝑖subscript𝑝𝑖g_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is bijective. By the inverse function theorem, we get that:

gi(pi)=[2fi(gi(pi))]1.subscript𝑔𝑖subscript𝑝𝑖superscriptdelimited-[]superscript2subscript𝑓𝑖subscript𝑔𝑖subscript𝑝𝑖1\nabla{g}_{i}(p_{i})=[\nabla^{2}{f}_{i}({g}_{i}(p_{i}))]^{-1}.∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (78)

Since fisubscript𝑓𝑖{f}_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is L𝐿Litalic_L-smooth and μ𝜇\muitalic_μ-strongly concave, inverse of it’s Hessian has eigenvalues in [1/μ,1/L]1𝜇1𝐿[-1/\mu,-1/L][ - 1 / italic_μ , - 1 / italic_L ], which results in

gi(pi)=[2fi(gi(pi))]11/μ,normsubscript𝑔𝑖subscript𝑝𝑖normsuperscriptdelimited-[]superscript2subscript𝑓𝑖subscript𝑔𝑖subscript𝑝𝑖11𝜇\|\nabla{g}_{i}(p_{i})\|=\|[\nabla^{2}{f}_{i}({g}_{i}(p_{i}))]^{-1}\|\leq 1/\mu,∥ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ = ∥ [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ ≤ 1 / italic_μ , (79)

proving the Lipschitz property of gi(pi)subscript𝑔𝑖subscript𝑝𝑖{g}_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). To show smoothness, we let xi1=gi(pi1)superscriptsubscript𝑥𝑖1subscript𝑔𝑖superscriptsubscript𝑝𝑖1x_{i}^{1}={g}_{i}(p_{i}^{1})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) and xi2=gi(pi2)superscriptsubscript𝑥𝑖2subscript𝑔𝑖superscriptsubscript𝑝𝑖2x_{i}^{2}={g}_{i}(p_{i}^{2})italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) and write:

gi(pi1)gi(pi2)=[2fi(xi1)]1[2fi(xi2)]1normsubscript𝑔𝑖superscriptsubscript𝑝𝑖1subscript𝑔𝑖superscriptsubscript𝑝𝑖2normsuperscriptdelimited-[]superscript2subscript𝑓𝑖superscriptsubscript𝑥𝑖11superscriptdelimited-[]superscript2subscript𝑓𝑖superscriptsubscript𝑥𝑖21\displaystyle\|\nabla{g}_{i}(p_{i}^{1}){-}\nabla{g}_{i}(p_{i}^{2})\|=\|[\nabla% ^{2}{f}_{i}(x_{i}^{1})]^{-1}{-}[\nabla^{2}{f}_{i}(x_{i}^{2})]^{-1}\|∥ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ∥ = ∥ [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ (80)
=[2fi(xi1)]1(2fi(xi2)2fi(xi1))[2fi(xi2)]1absentnormsuperscriptdelimited-[]superscript2subscript𝑓𝑖superscriptsubscript𝑥𝑖11superscript2subscript𝑓𝑖superscriptsubscript𝑥𝑖2superscript2subscript𝑓𝑖superscriptsubscript𝑥𝑖1superscriptdelimited-[]superscript2subscript𝑓𝑖superscriptsubscript𝑥𝑖21\displaystyle=\|[\nabla^{2}{f}_{i}(x_{i}^{1})]^{-1}(\nabla^{2}{f}_{i}(x_{i}^{2% }){-}\nabla^{2}{f}_{i}(x_{i}^{1}))[\nabla^{2}{f}_{i}(x_{i}^{2})]^{-1}\|= ∥ [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) - ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) ) [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ (81)
βxi1xi2/μ2βpi1pi2/μ3,absent𝛽normsuperscriptsubscript𝑥𝑖1superscriptsubscript𝑥𝑖2superscript𝜇2𝛽normsuperscriptsubscript𝑝𝑖1superscriptsubscript𝑝𝑖2superscript𝜇3\displaystyle\leq{\beta}\|x_{i}^{1}-x_{i}^{2}\|/{\mu^{2}}\leq{\beta}\|p_{i}^{1% }-p_{i}^{2}\|/{\mu^{3}},≤ italic_β ∥ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ / italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_β ∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ / italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , (82)

where the last inequality uses 1/μ1𝜇1/\mu1 / italic_μ-Lipschitz continuity of gi(pi)subscript𝑔𝑖subscript𝑝𝑖{g}_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), which proves β/μ3𝛽superscript𝜇3\beta/\mu^{3}italic_β / italic_μ start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT-smoothness of gi(pi)subscript𝑔𝑖subscript𝑝𝑖{g}_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ).

-F Proof of Lemma 3

Note that for pit𝒫isuperscriptsubscript𝑝𝑖𝑡subscript𝒫𝑖p_{i}^{t}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, gi(pt)=[2fi(gi(pt))]1subscript𝑔𝑖superscript𝑝𝑡superscriptdelimited-[]superscript2subscript𝑓𝑖subscript𝑔𝑖superscript𝑝𝑡1\nabla g_{i}(p^{t})=[\nabla^{2}f_{i}(g_{i}(p^{t}))]^{-1}∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) = [ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT is symmetric by Schwarz’s theorem, since 2fi(gi(pi))superscript2subscript𝑓𝑖subscript𝑔𝑖subscript𝑝𝑖\nabla^{2}f_{i}(g_{i}(p_{i}))∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) is β𝛽\betaitalic_β-Lipschitz continuous for pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Accordingly, the minimum singular value of gi(pit)subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡\nabla g_{i}(p_{i}^{t})∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) is equal to smallest absolute eigenvalue of [2fi(gi(pt))]1superscriptdelimited-[]superscript2subscript𝑓𝑖subscript𝑔𝑖superscript𝑝𝑡1[\nabla^{2}f_{i}(g_{i}(p^{t}))]^{-1}[ ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, i.e., σmin(gi(pit))=1/Lsubscript𝜎subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡1𝐿\sigma_{\min}(\nabla g_{i}(p_{i}^{t}))=1/Litalic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) = 1 / italic_L. This implies that if ^gitgi(pit)1/(2L)norm^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡12𝐿\|\hat{\nabla}g_{i}^{t}-\nabla g_{i}(p_{i}^{t})\|\leq 1/(2L)∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ ≤ 1 / ( 2 italic_L ) holds, then

σmin(^git)=minx=1^gitxsubscript𝜎^superscriptsubscript𝑔𝑖𝑡norm𝑥1norm^superscriptsubscript𝑔𝑖𝑡𝑥\displaystyle\sigma_{\min}(\hat{\nabla}g_{i}^{t})=\underset{\|x\|=1}{\min}\|% \hat{\nabla}g_{i}^{t}x\|italic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) = start_UNDERACCENT ∥ italic_x ∥ = 1 end_UNDERACCENT start_ARG roman_min end_ARG ∥ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_x ∥ (83)
=minx=1gi(pit)x+(^gitgi(pit))xabsentnorm𝑥1normsubscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡𝑥^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡𝑥\displaystyle=\underset{\|x\|=1}{\min}\|\nabla g_{i}(p_{i}^{t})x+(\hat{\nabla}% g_{i}^{t}-\nabla g_{i}(p_{i}^{t}))x\|= start_UNDERACCENT ∥ italic_x ∥ = 1 end_UNDERACCENT start_ARG roman_min end_ARG ∥ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) italic_x + ( over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) italic_x ∥ (84)
minx=1gi(pit)xmaxx=1(^gitgi(pit))xabsentnorm𝑥1normsubscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡𝑥norm𝑥1norm^superscriptsubscript𝑔𝑖𝑡subscript𝑔𝑖superscriptsubscript𝑝𝑖𝑡𝑥\displaystyle\geq\underset{\|x\|=1}{\min}\|\nabla g_{i}(p_{i}^{t})x\|-% \underset{\|x\|=1}{\max}\|(\hat{\nabla}g_{i}^{t}-\nabla g_{i}(p_{i}^{t}))x\|≥ start_UNDERACCENT ∥ italic_x ∥ = 1 end_UNDERACCENT start_ARG roman_min end_ARG ∥ ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) italic_x ∥ - start_UNDERACCENT ∥ italic_x ∥ = 1 end_UNDERACCENT start_ARG roman_max end_ARG ∥ ( over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT - ∇ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ) italic_x ∥ (85)
=1/L1/(2L)1/(2L),absent1𝐿12𝐿12𝐿\displaystyle=1/L-1/(2L)\geq 1/(2L),= 1 / italic_L - 1 / ( 2 italic_L ) ≥ 1 / ( 2 italic_L ) , (86)

which implies that [^git]1=1/σmin(^git)2Lnormsuperscriptdelimited-[]^superscriptsubscript𝑔𝑖𝑡11subscript𝜎^superscriptsubscript𝑔𝑖𝑡2𝐿\|[\hat{\nabla}g_{i}^{t}]^{-1}\|=1/\sigma_{\min}(\hat{\nabla}g_{i}^{t})\leq 2L∥ [ over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ = 1 / italic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ≤ 2 italic_L.

-G Proof of Lemma 4

To bound x^it+1xitnormsuperscriptsubscript^𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡\|\hat{x}_{i}^{t+1}-x_{i}^{t}\|∥ over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥, we will use the following as an auxiliary result:

Theorem 2.

[43, Theorem 1.2.1] Let 𝒳𝒳{\cal X}caligraphic_X be a convex and compact set in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. Then, the metric projection onto 𝒳𝒳{\cal X}caligraphic_X is contracting, that is,

Π𝒳(x)Π𝒳(y)xy,x,y,d.\|\Pi_{\cal X}(x)-\Pi_{\cal X}(y)\|\leq\|x-y\|,~{}\forall x,y,\in{\mathbb{R}}^% {d}.∥ roman_Π start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( italic_x ) - roman_Π start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( italic_y ) ∥ ≤ ∥ italic_x - italic_y ∥ , ∀ italic_x , italic_y , ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT .

Using the above result, we bound x^it+1xitnormsuperscriptsubscript^𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡\|\hat{x}_{i}^{t+1}-x_{i}^{t}\|∥ over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ as:

x^it+1xitx^t+1xtnormsuperscriptsubscript^𝑥𝑖𝑡1superscriptsubscript𝑥𝑖𝑡normsuperscript^𝑥𝑡1superscript𝑥𝑡\displaystyle\|\hat{x}_{i}^{t+1}-x_{i}^{t}\|\leq\|\hat{x}^{t+1}-x^{t}\|∥ over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ ≤ ∥ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t + 1 end_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ (87)
=Π𝒳Δt(xt+ptγt)Π𝒳Δt(xt)+Π𝒳Δt(xt)xtabsentnormsubscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥𝑡superscript𝑝𝑡superscript𝛾𝑡subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥𝑡subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥𝑡superscript𝑥𝑡\displaystyle=\|\Pi_{{\cal X}_{\Delta^{t}}}(x^{t}+p^{t}\gamma^{t})-\Pi_{{\cal X% }_{\Delta^{t}}}(x^{t})+\Pi_{{\cal X}_{\Delta^{t}}}(x^{t})-x^{t}\|= ∥ roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) + roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ (88)
Π𝒳Δt(xt+ptγt)Π𝒳Δt(xt)+Π𝒳Δt(xt)xtabsentnormsubscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥𝑡superscript𝑝𝑡superscript𝛾𝑡subscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥𝑡normsubscriptΠsubscript𝒳superscriptΔ𝑡superscript𝑥𝑡superscript𝑥𝑡\displaystyle\leq\|\Pi_{{\cal X}_{\Delta^{t}}}(x^{t}+p^{t}\gamma^{t})-\Pi_{{% \cal X}_{\Delta^{t}}}(x^{t})\|{+}\|\Pi_{{\cal X}_{\Delta^{t}}}(x^{t})-x^{t}\|≤ ∥ roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ + ∥ roman_Π start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) - italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ (89)
ptγt+ΔtΓ𝒳Mnγt+ΔtΓ𝒳,absentnormsuperscript𝑝𝑡superscript𝛾𝑡superscriptΔ𝑡subscriptΓ𝒳𝑀𝑛superscript𝛾𝑡superscriptΔ𝑡subscriptΓ𝒳\displaystyle\leq\|p^{t}\gamma^{t}\|+\Delta^{t}\Gamma_{\cal X}\leq M\sqrt{n}% \gamma^{t}+\Delta^{t}\Gamma_{\cal X},≤ ∥ italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ + roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ≤ italic_M square-root start_ARG italic_n end_ARG italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT + roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT , (90)

where we used pit=fi(xit)Mnormsuperscriptsubscript𝑝𝑖𝑡normsubscript𝑓𝑖superscriptsubscript𝑥𝑖𝑡𝑀\|p_{i}^{t}\|=\|\nabla f_{i}(x_{i}^{t})\|\leq M∥ italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∥ = ∥ ∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) ∥ ≤ italic_M since xit𝒳iintsuperscriptsubscript𝑥𝑖𝑡superscriptsubscript𝒳𝑖intx_{i}^{t}\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, and Proposition 1.

-H Proof of Lemma 5

The first part of the lemma follows from the same steps as in Lemma 1 for pidisubscript𝑝𝑖superscriptsubscript𝑑𝑖p_{i}\in{\mathbb{R}}^{d_{i}}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT instead of pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and using f~isubscript~𝑓𝑖\tilde{f}_{i}over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and g~isubscript~𝑔𝑖\tilde{g}_{i}over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT instead of fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

Next, we prove the second part of the lemma. For the first statement, given a pidisubscript𝑝𝑖superscriptsubscript𝑑𝑖p_{i}\in{\mathbb{R}}^{d_{i}}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, suppose that g~i(pi)𝒳iintsubscript~𝑔𝑖subscript𝑝𝑖superscriptsubscript𝒳𝑖int\tilde{g}_{i}(p_{i})\in{\cal X}_{i}^{\textnormal{int}}over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT. This implies that there exists xi𝒳iintsubscript𝑥𝑖superscriptsubscript𝒳𝑖intx_{i}\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT that satisfies f~i(xi)=pisubscript~𝑓𝑖subscript𝑥𝑖subscript𝑝𝑖\nabla\tilde{f}_{i}(x_{i})=p_{i}∇ over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Since f~i(xi)=fi(xi)subscript~𝑓𝑖subscript𝑥𝑖subscript𝑓𝑖subscript𝑥𝑖\tilde{f}_{i}(x_{i})=f_{i}(x_{i})over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for xi𝒳iintsubscript𝑥𝑖superscriptsubscript𝒳𝑖intx_{i}\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, the same xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT solves the optimization problem in (5), which implies gi(pi)=g~i(pi)subscript𝑔𝑖subscript𝑝𝑖subscript~𝑔𝑖subscript𝑝𝑖g_{i}(p_{i})=\tilde{g}_{i}(p_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). Therefore, gi(pi)𝒳iintsubscript𝑔𝑖subscript𝑝𝑖superscriptsubscript𝒳𝑖intg_{i}(p_{i})\in{\cal X}_{i}^{\textnormal{int}}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT, which proves pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT by definition.

To prove the second statement, note that if pi𝒫isubscript𝑝𝑖subscript𝒫𝑖p_{i}\in{\cal P}_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, then gi(pi)𝒳iintsubscript𝑔𝑖subscript𝑝𝑖superscriptsubscript𝒳𝑖intg_{i}(p_{i})\in{\cal X}_{i}^{\textnormal{int}}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT. Since 𝒳idomfisubscript𝒳𝑖domsubscript𝑓𝑖{\cal X}_{i}\subseteq\textnormal{dom}f_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT by Assumption 1, the first order optimality condition of (5) implies that there exists xi=gi(pi)𝒳iintsubscript𝑥𝑖subscript𝑔𝑖subscript𝑝𝑖superscriptsubscript𝒳𝑖intx_{i}=g_{i}(p_{i})\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT such that fi(xi)=pisubscript𝑓𝑖subscript𝑥𝑖subscript𝑝𝑖\nabla f_{i}(x_{i})=p_{i}∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The same xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT solves the optimization problem (40), since fi(xi)=f~i(xi)subscript𝑓𝑖subscript𝑥𝑖subscript~𝑓𝑖subscript𝑥𝑖f_{i}(x_{i})=\tilde{f}_{i}(x_{i})italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = over~ start_ARG italic_f end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for xi𝒳iintsubscript𝑥𝑖superscriptsubscript𝒳𝑖intx_{i}\in{\cal X}_{i}^{\textnormal{int}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT. The optimal solution to (40) has to be unique due to strong concavity, therefore it must hold true that g~i(pi)=gi(pi)subscript~𝑔𝑖subscript𝑝𝑖subscript𝑔𝑖subscript𝑝𝑖\tilde{g}_{i}(p_{i})=g_{i}(p_{i})over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ).

Nomenclature

n𝑛nitalic_n Number of users
fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Utility function of user i𝑖iitalic_i
fisubscript𝑓𝑖\nabla f_{i}∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Gradient of fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
f𝑓fitalic_f Sum of the n𝑛nitalic_n fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s
f𝑓\nabla f∇ italic_f Gradient of f𝑓fitalic_f
xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Resource demand vector of user i𝑖iitalic_i
x𝑥xitalic_x Concatenated resource demand of n𝑛nitalic_n xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s
xsuperscript𝑥x^{\star}italic_x start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT Optimal solution
fsuperscript𝑓f^{\star}italic_f start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT Optimal objective value
disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Dimension of xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
d¯¯𝑑\bar{d}over¯ start_ARG italic_d end_ARG Highest dimension of xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT among the users
domfidomsubscript𝑓𝑖\textnormal{dom}f_{i}dom italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Domain of fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
𝒳𝒳\cal Xcaligraphic_X Feasible set
𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT The set of values that user i𝑖iitalic_i’s resource demand vector can take in the feasible set 𝒳𝒳{\cal X}caligraphic_X
𝒳int,𝒳iintsuperscript𝒳intsuperscriptsubscript𝒳𝑖int{\cal X}^{\textnormal{int}},{\cal X}_{i}^{\textnormal{int}}caligraphic_X start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT , caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT int end_POSTSUPERSCRIPT Interiors of sets 𝒳𝒳{\cal X}caligraphic_X, 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
R𝑅Ritalic_R Upper bound on the diameter of 𝒳𝒳{\cal X}caligraphic_X
μ𝜇\muitalic_μ Strong concavity constant for all fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
L𝐿Litalic_L Smootness constant for all fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
M𝑀Mitalic_M Lipschitz constant for all fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
β𝛽\betaitalic_β Smoothness constant for all fisubscript𝑓𝑖\nabla f_{i}∇ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
pisubscript𝑝𝑖p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Resource price vector for user i𝑖iitalic_i
p𝑝pitalic_p Concatenated resource price vector of n𝑛nitalic_n pisubscript𝑝𝑖p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s
gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Price response function of user i𝑖iitalic_i
g𝑔gitalic_g Concatenated price response function of n𝑛nitalic_n gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s
R(T)𝑅𝑇R(T)italic_R ( italic_T ) Regret incurred after T𝑇Titalic_T iterations
t𝑡titalic_t Iteration index
pitsuperscriptsubscript𝑝𝑖𝑡p_{i}^{t}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Price vector of user i𝑖iitalic_i at iteration t𝑡titalic_t
ptsuperscript𝑝𝑡p^{t}italic_p start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Concatenated price vector of n𝑛nitalic_n pitsuperscriptsubscript𝑝𝑖𝑡p_{i}^{t}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT’s
xitsuperscriptsubscript𝑥𝑖𝑡x_{i}^{t}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Resource demand vector of user i𝑖iitalic_i at iteration t𝑡titalic_t
xtsuperscript𝑥𝑡x^{t}italic_x start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Concatenated resource demand vector of n𝑛nitalic_n xitsuperscriptsubscript𝑥𝑖𝑡x_{i}^{t}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT’s
¯(r)¯𝑟\bar{\mathcal{B}}(r)over¯ start_ARG caligraphic_B end_ARG ( italic_r ) Closed Euclidean ball with radius r𝑟ritalic_r centered at origin
(r)𝑟{\cal B}(r)caligraphic_B ( italic_r ) Open Euclidean ball with radius r𝑟ritalic_r centered at origin
𝒳Δsubscript𝒳Δ{\cal X}_{\Delta}caligraphic_X start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT Shrunk version of 𝒳𝒳{\cal X}caligraphic_X by an amount ΔΔ{\Delta}roman_Δ
H𝒳subscript𝐻𝒳H_{\cal X}italic_H start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT Maximum shrinkage of 𝒳𝒳{\cal X}caligraphic_X
x^itsuperscriptsubscript^𝑥𝑖𝑡\hat{x}_{i}^{t}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Desired resource demand vector of user i𝑖iitalic_i at iteration t𝑡titalic_t
x^tsuperscript^𝑥𝑡\hat{x}^{t}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Desired concatenated resource demand vector of n𝑛nitalic_n x^itsuperscriptsubscript^𝑥𝑖𝑡\hat{x}_{i}^{t}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT’s
^git^superscriptsubscript𝑔𝑖𝑡\hat{\nabla}g_{i}^{t}over^ start_ARG ∇ end_ARG italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Jacobian estimate of user i𝑖iitalic_i’s price response function at iteration t𝑡titalic_t
pit,ssuperscriptsubscript𝑝𝑖𝑡𝑠p_{i}^{t,s}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT Resource price for user i𝑖iitalic_i at sampling stage of iteration t𝑡titalic_t
xit,ssuperscriptsubscript𝑥𝑖𝑡𝑠x_{i}^{t,s}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t , italic_s end_POSTSUPERSCRIPT Resource demand of user i𝑖iitalic_i at sampling stage of iteration t𝑡titalic_t
ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Amount of shrinkage of the feasible set 𝒳𝒳{\cal X}caligraphic_X at iteration t𝑡titalic_t
γtsuperscript𝛾𝑡\gamma^{t}italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT Step-size of the algorithm at the update stage
ηtsuperscript𝜂𝑡\eta^{t}italic_η start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT The amount of price variation at the sampling stage
τ𝜏\tauitalic_τ Constant shift in the denominator of γtsuperscript𝛾𝑡\gamma^{t}italic_γ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT and ΔtsuperscriptΔ𝑡\Delta^{t}roman_Δ start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT
Sharp𝒳subscriptSharp𝒳\mathrm{Sharp}_{\mathcal{X}}roman_Sharp start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT Sharpness of 𝒳𝒳{\cal X}caligraphic_X
Γ𝒳subscriptΓ𝒳\Gamma_{\cal X}roman_Γ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT Upper bound on the sharpness constant of 𝒳𝒳{\cal X}caligraphic_X
𝒫isubscript𝒫𝑖{\cal P}_{i}caligraphic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Set of prices that induce a resource demand in 𝒳isubscript𝒳𝑖{\cal X}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for user i𝑖iitalic_i