Non-stationary Online Learning with Memory and Non-stochastic Control

Zhao, Peng; Yan, Yu-Hu; Wang, Yu-Xiang; Zhou, Zhi-Hua

Computer Science > Machine Learning

arXiv:2102.03758 (cs)

[Submitted on 7 Feb 2021 (v1), last revised 15 Aug 2023 (this version, v4)]

Title:Non-stationary Online Learning with Memory and Non-stochastic Control

Authors:Peng Zhao, Yu-Hu Yan, Yu-Xiang Wang, Zhi-Hua Zhou

View PDF

Abstract:We study the problem of Online Convex Optimization (OCO) with memory, which allows loss functions to depend on past decisions and thus captures temporal effects of learning problems. In this paper, we introduce dynamic policy regret as the performance measure to design algorithms robust to non-stationary environments, which competes algorithms' decisions with a sequence of changing comparators. We propose a novel algorithm for OCO with memory that provably enjoys an optimal dynamic policy regret in terms of time horizon, non-stationarity measure, and memory length. The key technical challenge is how to control the switching cost, the cumulative movements of player's decisions, which is neatly addressed by a novel switching-cost-aware online ensemble approach equipped with a new meta-base decomposition of dynamic policy regret and a careful design of meta-learner and base-learner that explicitly regularizes the switching cost. The results are further applied to tackle non-stationarity in online non-stochastic control (Agarwal et al., 2019), i.e., controlling a linear dynamical system with adversarial disturbance and convex cost functions. We derive a novel gradient-based controller with dynamic policy regret guarantees, which is the first controller provably competitive to a sequence of changing policies for online non-stochastic control.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2102.03758 [cs.LG]
	(or arXiv:2102.03758v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2102.03758
Journal reference:	Journal of Machine Learning Research, 2023

Submission history

From: Peng Zhao [view email]
[v1] Sun, 7 Feb 2021 09:45:15 UTC (925 KB)
[v2] Fri, 25 Jun 2021 09:47:15 UTC (4,989 KB)
[v3] Mon, 23 May 2022 11:30:10 UTC (467 KB)
[v4] Tue, 15 Aug 2023 02:31:59 UTC (465 KB)

Computer Science > Machine Learning

Title:Non-stationary Online Learning with Memory and Non-stochastic Control

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Non-stationary Online Learning with Memory and Non-stochastic Control

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators