Federated Offline Policy Learning

Carranza, Aldo Gael; Athey, Susan


arXiv:2305.12407 (cs)

[Submitted on 21 May 2023 (v1), last revised 11 Oct 2024 (this version, v2)]

Abstract: We consider the problem of learning personalized decision policies from observational bandit feedback data across multiple heterogeneous data sources. In our approach, we introduce a novel regret analysis that establishes finite-sample upper bounds on two distinct notions of regret: global regret, for all data sources in aggregate, and local regret, for any given data source. We characterize these regret bounds by expressions of source heterogeneity and distribution shift. Moreover, we examine the practical considerations of this problem in the federated setting, where a central server aims to train a policy on data distributed across the heterogeneous sources without collecting any of their raw data. We present a policy learning algorithm amenable to federation based on the aggregation of local policies trained with doubly robust offline policy evaluation strategies. Our analysis and supporting experimental results provide insights into tradeoffs in the participation of heterogeneous data sources in offline policy learning.
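The two ingredients named in the abstract, doubly robust offline policy evaluation on each source and server-side aggregation of local policies, can be illustrated with a minimal sketch. It is not the paper's algorithm: the abstract does not specify the estimator's exact form or the aggregation rule, so the standard AIPW (augmented inverse propensity weighted) score and a sample-size-weighted parameter average are assumed here, and every name (`aipw_scores`, `local_policy_value`, `aggregate`, `reward_model`) is hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

# One logged bandit sample (assumed layout):
# (context, action taken, observed reward, logging propensity of that action)
Sample = Tuple[float, int, float, float]


def aipw_scores(sample: Sample,
                reward_model: Callable[[float, int], float],
                n_actions: int) -> List[float]:
    """Doubly robust (AIPW) score for every action, for one logged sample.

    Starts from the model's predicted reward for each action, then corrects
    the logged action with the propensity-weighted residual.
    """
    x, a, r, p = sample
    scores = [reward_model(x, k) for k in range(n_actions)]
    scores[a] += (r - reward_model(x, a)) / p
    return scores


def local_policy_value(data: Sequence[Sample],
                       policy: Callable[[float], int],
                       reward_model: Callable[[float, int], float],
                       n_actions: int) -> float:
    """Doubly robust estimate of a policy's value on one source's data."""
    total = 0.0
    for s in data:
        total += aipw_scores(s, reward_model, n_actions)[policy(s[0])]
    return total / len(data)


def aggregate(local_params: Sequence[Sequence[float]],
              sizes: Sequence[int]) -> List[float]:
    """Sample-size-weighted average of local policy parameters.

    Assumption: the server sees only parameters and sample counts,
    never the raw logged data.
    """
    total = sum(sizes)
    dim = len(local_params[0])
    return [sum(n * p[j] for p, n in zip(local_params, sizes)) / total
            for j in range(dim)]
```

In this sketch each source would use `local_policy_value` as its training objective for a parametric policy and send only the resulting parameters and its sample count to the server, which combines them with `aggregate`. Weighting by sample size is one plausible choice; the abstract's discussion of source heterogeneity suggests that other weightings may trade off global and local regret differently.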

Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Econometrics (econ.EM); Machine Learning (stat.ML)
Cite as:	arXiv:2305.12407 [cs.LG]
	(or arXiv:2305.12407v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.12407

Submission history

From: Aldo Carranza
[v1] Sun, 21 May 2023 09:08:09 UTC (874 KB)
[v2] Fri, 11 Oct 2024 05:46:36 UTC (2,037 KB)
