Policy Manifold Search for Improving Diversity-based Neuroevolution

Rakicevic, Nemanja; Cully, Antoine; Kormushev, Petar

Computer Science > Machine Learning

arXiv:2012.08676 (cs)

[Submitted on 15 Dec 2020]

Title:Policy Manifold Search for Improving Diversity-based Neuroevolution

Authors:Nemanja Rakicevic, Antoine Cully, Petar Kormushev

View PDF

Abstract:Diversity-based approaches have recently gained popularity as an alternative paradigm to performance-based policy search. A popular approach from this family, Quality-Diversity (QD), maintains a collection of high-performing policies separated in the diversity-metric space, defined based on policies' rollout behaviours. When policies are parameterised as neural networks, i.e. Neuroevolution, QD tends to not scale well with parameter space dimensionality. Our hypothesis is that there exists a low-dimensional manifold embedded in the policy parameter space, containing a high density of diverse and feasible policies. We propose a novel approach to diversity-based policy search via Neuroevolution, that leverages learned latent representations of the policy parameters which capture the local structure of the data. Our approach iteratively collects policies according to the QD framework, in order to (i) build a collection of diverse policies, (ii) use it to learn a latent representation of the policy parameters, (iii) perform policy search in the learned latent space. We use the Jacobian of the inverse transformation (this http URL function) to guide the search in the latent space. This ensures that the generated samples remain in the high-density regions of the original space, after reconstruction. We evaluate our contributions on three continuous control tasks in simulated environments, and compare to diversity-based baselines. The findings suggest that our approach yields a more efficient and robust policy search process.

Comments:	Paper accepted as oral (8% acceptance rate) at Beyond Backpropagation: Novel Ideas for Training Neural Architectures Workshop at NeurIPS 2020
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2012.08676 [cs.LG]
	(or arXiv:2012.08676v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2012.08676

Submission history

From: Nemanja Rakicevic [view email]
[v1] Tue, 15 Dec 2020 23:59:49 UTC (21,410 KB)

Computer Science > Machine Learning

Title:Policy Manifold Search for Improving Diversity-based Neuroevolution

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Policy Manifold Search for Improving Diversity-based Neuroevolution

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators