Replay across Experiments: A Natural Extension of Off-Policy RL

Tirumala, Dhruva; Lampe, Thomas; Chen, Jose Enrique; Haarnoja, Tuomas; Huang, Sandy; Lever, Guy; Moran, Ben; Hertweck, Tim; Hasenclever, Leonard; Riedmiller, Martin; Heess, Nicolas; Wulfmeier, Markus

Computer Science > Machine Learning

arXiv:2311.15951 (cs)

[Submitted on 27 Nov 2023 (v1), last revised 28 Nov 2023 (this version, v2)]

Title:Replay across Experiments: A Natural Extension of Off-Policy RL

Authors:Dhruva Tirumala, Thomas Lampe, Jose Enrique Chen, Tuomas Haarnoja, Sandy Huang, Guy Lever, Ben Moran, Tim Hertweck, Leonard Hasenclever, Martin Riedmiller, Nicolas Heess, Markus Wulfmeier

View PDF

Abstract:Replaying data is a principal mechanism underlying the stability and data efficiency of off-policy reinforcement learning (RL). We present an effective yet simple framework to extend the use of replays across multiple experiments, minimally adapting the RL workflow for sizeable improvements in controller performance and research iteration times. At its core, Replay Across Experiments (RaE) involves reusing experience from previous experiments to improve exploration and bootstrap learning while reducing required changes to a minimum in comparison to prior work. We empirically show benefits across a number of RL algorithms and challenging control domains spanning both locomotion and manipulation, including hard exploration tasks from egocentric vision. Through comprehensive ablations, we demonstrate robustness to the quality and amount of data available and various hyperparameter choices. Finally, we discuss how our approach can be applied more broadly across research life cycles and can increase resilience by reloading data across random seeds or hyperparameter variations.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2311.15951 [cs.LG]
	(or arXiv:2311.15951v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.15951

Submission history

From: Dhruva Tirumala [view email]
[v1] Mon, 27 Nov 2023 15:57:11 UTC (1,485 KB)
[v2] Tue, 28 Nov 2023 15:18:43 UTC (1,589 KB)

Computer Science > Machine Learning

Title:Replay across Experiments: A Natural Extension of Off-Policy RL

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Replay across Experiments: A Natural Extension of Off-Policy RL

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators