Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

Byun, Ju-Seung; Perrault, Andrew

Computer Science > Machine Learning

arXiv:2405.17618 (cs)

[Submitted on 27 May 2024 (v1), last revised 29 May 2024 (this version, v2)]

Title:Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

Authors:Ju-Seung Byun, Andrew Perrault

View PDF HTML (experimental)

Abstract:Reinforcement learning (RL) training is inherently unstable due to factors such as moving targets and high gradient variance. Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from AI Feedback (RLAIF) can introduce additional difficulty. Differing preferences can complicate the alignment process, and prediction errors in a trained reward model can become more severe as the LLM generates unseen outputs. To enhance training robustness, RL has adopted techniques from supervised learning, such as ensembles and layer normalization. In this work, we improve the stability of RL training by adapting the reverse cross entropy (RCE) from supervised learning for noisy data to define a symmetric RL loss. We demonstrate performance improvements across various tasks and scales. We conduct experiments in discrete action tasks (Atari games) and continuous action space tasks (MuJoCo benchmark and Box2D) using Symmetric A2C (SA2C) and Symmetric PPO (SPPO), with and without added noise with especially notable performance in SPPO across different hyperparameters. Furthermore, we validate the benefits of the symmetric RL loss when using SPPO for large language models through improved performance in RLHF tasks, such as IMDB positive sentiment sentiment and TL;DR summarization tasks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.17618 [cs.LG]
	(or arXiv:2405.17618v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.17618

Submission history

From: Ju-Seung Byun [view email]
[v1] Mon, 27 May 2024 19:28:33 UTC (245 KB)
[v2] Wed, 29 May 2024 04:19:00 UTC (245 KB)

Computer Science > Machine Learning

Title:Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators