Identifying the Best Arm in the Presence of Global Environment Shifts

Srisawad, Phurinut; Branke, Juergen; Tran-Thanh, Long

Computer Science > Machine Learning

arXiv:2408.12581 (cs)

[Submitted on 22 Aug 2024]

Title:Identifying the Best Arm in the Presence of Global Environment Shifts

Authors:Phurinut Srisawad, Juergen Branke, Long Tran-Thanh

View PDF HTML (experimental)

Abstract:This paper formulates a new Best-Arm Identification problem in the non-stationary stochastic bandits setting, where the means of all arms are shifted in the same way due to a global influence of the environment. The aim is to identify the unique best arm across environmental change given a fixed total budget. While this setting can be regarded as a special case of Adversarial Bandits or Corrupted Bandits, we demonstrate that existing solutions tailored to those settings do not fully utilise the nature of this global influence, and thus, do not work well in practice (despite their theoretical guarantees). To overcome this issue, in this paper we develop a novel selection policy that is consistent and robust in dealing with global environmental shifts. We then propose an allocation policy, LinLUCB, which exploits information about global shifts across all arms in each environment. Empirical tests depict a significant improvement in our policies against other existing methods.

Comments:	Extended version of the paper accepted at the 27th European Conference on Artificial Intelligence (ECAI 2024); Paper ID: M1125
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.12581 [cs.LG]
	(or arXiv:2408.12581v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.12581

Submission history

From: Phurinut Srisawad [view email]
[v1] Thu, 22 Aug 2024 17:47:01 UTC (6,320 KB)

Computer Science > Machine Learning

Title:Identifying the Best Arm in the Presence of Global Environment Shifts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Identifying the Best Arm in the Presence of Global Environment Shifts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators