Tracking Most Significant Arm Switches in Bandits

Suk, Joe; Kpotufe, Samory

Computer Science > Machine Learning

arXiv:2112.13838 (cs)

[Submitted on 27 Dec 2021 (v1), last revised 16 Jun 2022 (this version, v6)]

Title:Tracking Most Significant Arm Switches in Bandits

Authors:Joe Suk, Samory Kpotufe

View PDF

Abstract:In bandit with distribution shifts, one aims to automatically adapt to unknown changes in reward distribution, and restart exploration when necessary. While this problem has been studied for many years, a recent breakthrough of Auer et al. (2018, 2019) provides the first adaptive procedure to guarantee an optimal (dynamic) regret $\sqrt{LT}$, for $T$ rounds, and an unknown number $L$ of changes. However, while this rate is tight in the worst case, it remained open whether faster rates are possible, without prior knowledge, if few changes in distribution are actually severe.
To resolve this question, we propose a new notion of significant shift, which only counts very severe changes that clearly necessitate a restart: roughly, these are changes involving not only best arm switches, but also involving large aggregate differences in reward overtime. Thus, our resulting procedure adaptively achieves rates always faster (sometimes significantly) than $O(\sqrt{ST})$, where $S\ll L$ only counts best arm switches, while at the same time, always faster than the optimal $O(V^{\frac{1}{3}}T^{\frac{2}{3}})$ when expressed in terms of total variation $V$ (which aggregates differences overtime). Our results are expressed in enough generality to also capture non-stochastic adversarial settings.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2112.13838 [cs.LG]
	(or arXiv:2112.13838v6 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.13838

Submission history

From: Joe Suk [view email]
[v1] Mon, 27 Dec 2021 18:59:05 UTC (40 KB)
[v2] Wed, 5 Jan 2022 00:53:53 UTC (41 KB)
[v3] Tue, 11 Jan 2022 18:52:37 UTC (41 KB)
[v4] Wed, 16 Feb 2022 18:34:36 UTC (77 KB)
[v5] Thu, 28 Apr 2022 15:56:25 UTC (82 KB)
[v6] Thu, 16 Jun 2022 15:38:39 UTC (121 KB)

Computer Science > Machine Learning

Title:Tracking Most Significant Arm Switches in Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Tracking Most Significant Arm Switches in Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators