Tractable contextual bandits beyond realizability

Krishnamurthy, Sanath Kumar; Hadad, Vitor; Athey, Susan

Computer Science > Machine Learning

arXiv:2010.13013 (cs)

[Submitted on 25 Oct 2020 (v1), last revised 26 Feb 2021 (this version, v2)]

Title:Tractable contextual bandits beyond realizability

Authors:Sanath Kumar Krishnamurthy, Vitor Hadad, Susan Athey

View PDF

Abstract:Tractable contextual bandit algorithms often rely on the realizability assumption - i.e., that the true expected reward model belongs to a known class, such as linear functions. In this work, we present a tractable bandit algorithm that is not sensitive to the realizability assumption and computationally reduces to solving a constrained regression problem in every epoch. When realizability does not hold, our algorithm ensures the same guarantees on regret achieved by realizability-based algorithms under realizability, up to an additive term that accounts for the misspecification error. This extra term is proportional to T times a function of the mean squared error between the best model in the class and the true model, where T is the total number of time-steps. Our work sheds light on the bias-variance trade-off for tractable contextual bandits. This trade-off is not captured by algorithms that assume realizability, since under this assumption there exists an estimator in the class that attains zero bias.

Comments:	35 pages, 6 figures
Subjects:	Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as:	arXiv:2010.13013 [cs.LG]
	(or arXiv:2010.13013v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2010.13013

Submission history

From: Sanath Kumar Krishnamurthy [view email]
[v1] Sun, 25 Oct 2020 01:36:04 UTC (300 KB)
[v2] Fri, 26 Feb 2021 00:08:36 UTC (515 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-10

Change to browse by:

cs
math
math.ST
stat
stat.ML
stat.TH

References & Citations

DBLP - CS Bibliography

listing | bibtex

Sanath Kumar Krishnamurthy
Susan Athey

export BibTeX citation

Computer Science > Machine Learning

Title:Tractable contextual bandits beyond realizability

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Tractable contextual bandits beyond realizability

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators