Reinforcement Learning, Bit by Bit

Lu, Xiuyuan; Van Roy, Benjamin; Dwaracherla, Vikranth; Ibrahimi, Morteza; Osband, Ian; Wen, Zheng

Computer Science > Machine Learning

arXiv:2103.04047 (cs)

[Submitted on 6 Mar 2021 (v1), last revised 4 May 2023 (this version, v8)]

Title:Reinforcement Learning, Bit by Bit

Authors:Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen

View PDF

Abstract:Reinforcement learning agents have demonstrated remarkable achievements in simulated environments. Data efficiency poses an impediment to carrying this success over to real environments. The design of data-efficient agents calls for a deeper understanding of information acquisition and representation. We discuss concepts and regret analysis that together offer principled guidance. This line of thinking sheds light on questions of what information to seek, how to seek that information, and what information to retain. To illustrate concepts, we design simple agents that build on them and present computational results that highlight data efficiency.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2103.04047 [cs.LG]
	(or arXiv:2103.04047v8 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2103.04047

Submission history

From: Xiuyuan Lu [view email]
[v1] Sat, 6 Mar 2021 06:37:46 UTC (3,074 KB)
[v2] Sun, 14 Mar 2021 05:58:17 UTC (2,420 KB)
[v3] Mon, 12 Apr 2021 18:42:28 UTC (2,421 KB)
[v4] Tue, 11 May 2021 01:03:05 UTC (4,849 KB)
[v5] Mon, 23 Aug 2021 04:56:18 UTC (5,078 KB)
[v6] Mon, 7 Feb 2022 22:13:26 UTC (5,079 KB)
[v7] Fri, 25 Mar 2022 15:58:11 UTC (5,442 KB)
[v8] Thu, 4 May 2023 20:53:30 UTC (2,851 KB)

Computer Science > Machine Learning

Title:Reinforcement Learning, Bit by Bit

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Reinforcement Learning, Bit by Bit

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators