Safe Online Convex Optimization with Multi-Point Feedback

Hutchinson, Spencer; Alizadeh, Mahnoosh

arXiv:2407.11471 (cs)

[Submitted on 16 Jul 2024]

Abstract: Motivated by the stringent safety requirements that are often present in real-world applications, we study a safe online convex optimization setting where the player needs to simultaneously achieve sublinear regret and zero constraint violation while only using zero-order information. In particular, we consider a multi-point feedback setting, where the player chooses $d + 1$ points in each round (where $d$ is the problem dimension) and then receives the value of the constraint function and cost function at each of these points. To address this problem, we propose an algorithm that leverages forward-difference gradient estimation as well as optimistic and pessimistic action sets to achieve $\mathcal{O}(d \sqrt{T})$ regret and zero constraint violation under the assumption that the constraint function is smooth and strongly convex. We then perform a numerical study to investigate the impacts of the unknown constraint and zero-order feedback on empirical performance.
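
As an illustration of the forward-difference idea mentioned in the abstract, the sketch below estimates a gradient from $d + 1$ function evaluations: one at the base point and one per coordinate step. It is a generic textbook construction and not the authors' algorithm; it does not model the constraint function, the safety requirement on query points, or the optimistic and pessimistic action sets. The names `forward_difference_gradient`, `quadratic`, and the step size `delta` are assumptions made for this example.

```python
from typing import Callable, List, Sequence


def forward_difference_gradient(
    f: Callable[[Sequence[float]], float],
    x: Sequence[float],
    delta: float = 1e-6,
) -> List[float]:
    """Estimate the gradient of f at x using d + 1 evaluations of f.

    Hypothetical helper for illustration only; it is not taken from the paper.
    """
    base = f(x)  # evaluation 1: the base point itself
    grad = []
    for i in range(len(x)):
        shifted = list(x)
        shifted[i] += delta  # evaluations 2..d+1: step along coordinate i
        grad.append((f(shifted) - base) / delta)
    return grad


def quadratic(x: Sequence[float]) -> float:
    """Simple smooth, strongly convex test function: sum of squares."""
    return sum(v * v for v in x)


# The exact gradient of the sum of squares at x is 2 * x.
point = [1.0, -2.0, 0.5]
estimate = forward_difference_gradient(quadratic, point, delta=1e-6)
exact = [2.0 * v for v in point]
```

For a smooth function, each coordinate of the estimate differs from the true partial derivative by an amount proportional to `delta`, so a smaller step trades bias against floating-point cancellation error.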

Comments:	20 pages, 1 figure. Published in the proceedings of the Learning for Dynamics and Control Conference (L4DC) 2024
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2407.11471 [cs.LG]
	(or arXiv:2407.11471v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.11471

Submission history

From: Spencer Hutchinson
[v1] Tue, 16 Jul 2024 08:09:26 UTC (112 KB)