The KL-UCB algorithm for bounded stochastic bandits and beyond

A Garivier, O Cappé - … of the 24th annual conference on …, 2011 - proceedings.mlr.press
This paper presents a finite-time analysis of the KL-UCB algorithm, an online, horizon-free
index policy for stochastic bandit problems. We prove two distinct results: first, for arbitrary …

UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem

P Auer, R Ortner - Periodica Mathematica Hungarica, 2010 - akjournals.com
… of the UCB algorithm of Auer et al. [4]. For this modified algorithm we give an improved
bound on the regret with respect to the optimal reward. While for the original UCB algorithm the …

Seletracetam (UCB 44212)

B Bennett, A Matagne, P Michel, M Leonard, M Cornet… - …, 2007 - Springer
Better pharmacotherapies for epilepsy are needed for patients who are refractory to or have
tolerability difficulties with current treatments. Seletracetam, a new drug in epilepsy …

Neural contextual bandits with ucb-based exploration

D Zhou, L Li, Q Gu - International Conference on Machine …, 2020 - proceedings.mlr.press
We study the stochastic contextual bandit problem, where the reward is generated from an
unknown function with additive noise. No assumption is made about the reward function other …

lil'ucb: An optimal exploration algorithm for multi-armed bandits

K Jamieson, M Malloy, R Nowak… - … on Learning Theory, 2014 - proceedings.mlr.press
The paper proposes a novel upper confidence bound (UCB) procedure for identifying the arm
with the largest mean in a multi-armed bandit game in the fixed confidence setting using a …

ucb L059, a novel anti-convulsant drug: pharmacological profile in animais

AJ Gower, M Noyer, R Verloes, J Gobert… - European journal of …, 1992 - Elsevier
ucb LOS9 in the audiogenic seizure-prone mouse showed potent anticonvulsant activity. The
effects of ucb … the effects of the R enantiomcr, ucb LOhO, wcrc cvafuatcd in a Iimitcd range …

Discounted ucb

L Kocsis, C Szepesvári - 2nd PASCAL Challenges Workshop, 2006 - lri.fr
Discounted UCB … Discounted UCB … Discounted UCB

Ucb exploration via q-ensembles

RY Chen, S Sidor, P Abbeel, J Schulman - arXiv preprint arXiv …, 2017 - arxiv.org
… We extend the intuition of UCB algorithms to the RL setting. Using the outputs of the {Qk}
functions, we construct a UCB by adding the empirical standard deviation σ(st,a) of {Qk(st,a)}K …

Augmentation of umbilical cord blood (UCB) transplantation with ex vivo–expanded UCB cells: results of a phase 1 trial using the AastromReplicell System

J Jaroscak, K Goltry, A Smith, B Waters-Pick, PL Martin… - Blood, 2003 - ashpublications.org
… augmenting UCB transplants with ex vivo–expanded UCB cells is beneficial. … UCB units
and 1 patient receiving a related UCB graft. On day 0, 26 patients received unmanipulated UCB

[HTML][HTML] Brivaracetam (ucb 34714)

P von Rosenstiel - Neurotherapeutics, 2007 - Elsevier
… In an open-label study, subjects with chronic liver disease were compared to healthy subjects
(UCB, data on file). The apparent total body clearance of brivaracetam was reduced by 24…