Nested bandits

M Martin, P Mertikopoulos, T Rahier… - … on Machine Learning, 2022 - proceedings.mlr.press
… the nested bandit model we study assumes an additional layer of information relative to
standard bandit … , similar to the semi-bandit in the study of combinatorial bandit algorithms [12…

The Pareto frontier of model selection for general contextual bandits

TV Marinov, J Zimmert - Advances in Neural Information …, 2021 - proceedings.neurips.cc
… related to contextual bandits with finite policy classes are linear contextual bandits. Model
bandit problem with general policy classes of finite size. There are K arms and nested policy …

Dynamic balancing for model selection in bandits and RL

A Cutkosky, C Dann, A Das, C Gentile… - International …, 2021 - proceedings.mlr.press
… 2019), linear bandits with unknown misspecification, and confidence-parameter tuning for
contextual linear bandits. For the case of linear bandits with nested model classes, we show …

Model selection for contextual bandits

DJ Foster, A Krishnamurthy… - Advances in Neural …, 2019 - proceedings.neurips.cc
… Our main result is a new model selection guarantee for linear contextual bandits. We
work in the stochastic realizable setting with a sequence of nested linear policy classes of …

Regret bound balancing and elimination for model selection in bandits and RL

A Pacchiano, C Dann, C Gentile, P Bartlett - arXiv preprint arXiv …, 2020 - arxiv.org
… model classes (Section 6) that consider linear contextual bandits or linear … bandit algorithms
like OFUL (Section 6.5). Finally, we specifically focus on the nested linear contextual bandit

Two-stage multiarmed bandit for reconfigurable intelligent surface aided millimeter wave communications

EM Mohamed, S Hashima, K Hatano, SA Aldossari - Sensors, 2022 - mdpi.com
… in the form of a multiarmed bandit (MAB) game is suggested, where a nested two-stage
stochastic … of the proposed nested two-stage MAB strategy; in particular, the nested two-stage TS …

Universal and data-adaptive algorithms for model selection in linear contextual bandits

VK Muthukumar… - … Conference on Machine …, 2022 - proceedings.mlr.press
… Model selection in contextual bandits is an important … multi-armed bandit problem from a
linear contextual bandit problem. Even … selection among nested linear contextual bandits under …

OSOM: A simultaneously optimal algorithm for multi-armed and linear contextual bandits

N Chatterji, V Muthukumar… - … Conference on Artificial …, 2020 - proceedings.mlr.press
… to arbitrary policy classes that are nested: we will see that the OSOM algorithm critically
exploits the nested structure of the simple bandit model within the linear contextual model. …

[BOOK] Bandit Nation: A History of Outlaws and Cultural Struggle in Mexico, 1810-1920

C Frazer - 2006 - books.google.com
… is or is not a bandit. Most often, the Mexican and foreign elites pinned the label of “bandit” on
… and oppression—not merely because most bandits emerged from among the poor, but also …

Automated quantum circuit design with nested Monte Carlo tree search

P Wang, M Usman, U Parampalli… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
… In this work, we report an algorithmic framework based on nested Monte-Carlo Tree Search
(MCTS) coupled with the combinatorial multi-armed bandit (CMAB) model for the automated …