Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

Kim, Dohyeong; Hong, Mineui; Park, Jeongho; Oh, Songhwai

Computer Science > Machine Learning

arXiv:2403.00282 (cs)

[Submitted on 1 Mar 2024 (v1), last revised 31 May 2024 (this version, v2)]

Title:Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

Authors:Dohyeong Kim, Mineui Hong, Jeongho Park, Songhwai Oh

View PDF HTML (experimental)

Abstract:In many real-world applications, a reinforcement learning (RL) agent should consider multiple objectives and adhere to safety guidelines. To address these considerations, we propose a constrained multi-objective RL algorithm named Constrained Multi-Objective Gradient Aggregator (CoMOGA). In the field of multi-objective optimization, managing conflicts between the gradients of the multiple objectives is crucial to prevent policies from converging to local optima. It is also essential to efficiently handle safety constraints for stable training and constraint satisfaction. We address these challenges straightforwardly by treating the maximization of multiple objectives as a constrained optimization problem (COP), where the constraints are defined to improve the original objectives. Existing safety constraints are then integrated into the COP, and the policy is updated using a linear approximation, which ensures the avoidance of gradient conflicts. Despite its simplicity, CoMOGA guarantees optimal convergence in tabular settings. Through various experiments, we have confirmed that preventing gradient conflicts is critical, and the proposed method achieves constraint satisfaction across all tasks.

Comments:	25 pages
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2403.00282 [cs.LG]
	(or arXiv:2403.00282v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.00282

Submission history

From: Dohyeong Kim [view email]
[v1] Fri, 1 Mar 2024 04:57:13 UTC (1,900 KB)
[v2] Fri, 31 May 2024 07:19:03 UTC (5,571 KB)

Computer Science > Machine Learning

Title:Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators