An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Vemgal, Nikhil; Lau, Elaine; Precup, Doina

Computer Science > Machine Learning

arXiv:2307.07674 (cs)

[Submitted on 15 Jul 2023 (v1), last revised 18 Jul 2023 (this version, v2)]

Title:An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Authors:Nikhil Vemgal, Elaine Lau, Doina Precup

View PDF

Abstract:Reinforcement Learning (RL) algorithms aim to learn an optimal policy by iteratively sampling actions to learn how to maximize the total expected return, $R(x)$. GFlowNets are a special class of algorithms designed to generate diverse candidates, $x$, from a discrete set, by learning a policy that approximates the proportional sampling of $R(x)$. GFlowNets exhibit improved mode discovery compared to conventional RL algorithms, which is very useful for applications such as drug discovery and combinatorial search. However, since GFlowNets are a relatively recent class of algorithms, many techniques which are useful in RL have not yet been associated with them. In this paper, we study the utilization of a replay buffer for GFlowNets. We explore empirically various replay buffer sampling techniques and assess the impact on the speed of mode discovery and the quality of the modes discovered. Our experimental results in the Hypergrid toy domain and a molecule synthesis environment demonstrate significant improvements in mode discovery when training with a replay buffer, compared to training only with trajectories generated on-policy.

Comments:	Accepted to ICML 2023 workshop on Structured Probabilistic Inference & Generative Modeling
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2307.07674 [cs.LG]
	(or arXiv:2307.07674v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.07674

Submission history

From: Nikhil Vemgal [view email]
[v1] Sat, 15 Jul 2023 01:17:14 UTC (17,585 KB)
[v2] Tue, 18 Jul 2023 01:11:01 UTC (17,585 KB)

Computer Science > Machine Learning

Title:An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators