An Optimal Algorithm for Linear Bandits

Cesa-Bianchi, Nicolò; Kakade, Sham

arXiv:1110.4322 (cs)

This paper has been withdrawn by Nicolò Cesa-Bianchi

[Submitted on 19 Oct 2011 (v1), last revised 14 Feb 2012 (this version, v3)]

Abstract: We provide the first algorithm for online bandit linear optimization whose regret after T rounds is of order $\sqrt{Td \ln N}$ on any finite class X of N actions in d dimensions, and of order $d\sqrt{T}$ (up to log factors) when X is infinite. These bounds are not improvable in general. The basic idea utilizes tools from convex geometry to construct what is essentially an optimal exploration basis. We also present an application to a model of linear bandits with expert advice. Interestingly, these results show that bandit linear optimization with expert advice in d dimensions is no more difficult (in terms of the achievable regret) than the online d-armed bandit problem with expert advice (where EXP4 is optimal).
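
As a rough numerical illustration of the two rates quoted in the abstract, the sketch below evaluates sqrt(T d ln N) and d sqrt(T) for a few values of N. Constants and logarithmic factors are dropped, so the numbers are orders of growth rather than regret values, and the function names are made up for this example; nothing here comes from the withdrawn paper itself.

```python
import math


def finite_class_rate(T: int, d: int, N: float) -> float:
    """Order sqrt(T * d * ln N) for a finite class of N actions (constants omitted)."""
    return math.sqrt(T * d * math.log(N))


def infinite_class_rate(T: int, d: int) -> float:
    """Order d * sqrt(T) for an infinite action set (constants and log factors omitted)."""
    return d * math.sqrt(T)


if __name__ == "__main__":
    T, d = 10_000, 10  # hypothetical horizon and dimension
    for N in (100, 10_000, 10**8):
        print(N, round(finite_class_rate(T, d, N)), round(infinite_class_rate(T, d)))
```

With constants ignored, the finite-class rate is the smaller of the two exactly when ln N is below d, since sqrt(T d ln N) <= d sqrt(T) is equivalent to ln N <= d.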

Comments: This paper is superseded by S. Bubeck, N. Cesa-Bianchi, and S.M. Kakade, "Towards minimax policies for online linear optimization with bandit feedback"
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1110.4322 [cs.LG] (or arXiv:1110.4322v3 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.1110.4322

Submission history

From: Nicolò Cesa-Bianchi
[v1] Wed, 19 Oct 2011 15:57:27 UTC (11 KB)
[v2] Tue, 20 Dec 2011 14:30:27 UTC (11 KB)
[v3] Tue, 14 Feb 2012 16:14:39 UTC (1 KB) (withdrawn)
