Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation

Zhang, Wanpeng; Li, Yilin; Yang, Boyu; Lu, Zongqing

arXiv:2306.02747v3 (cs)

[Submitted on 5 Jun 2023 (v1), last revised 2 Jun 2024 (this version, v3)]

Abstract: In real-world scenarios, the application of reinforcement learning is significantly challenged by complex non-stationarity. Most existing methods attempt to model changes in the environment explicitly, often requiring impractical prior knowledge of environments. In this paper, we propose a new perspective, positing that non-stationarity can propagate and accumulate through complex causal relationships during state transitions, thereby compounding its complexity and affecting policy learning. We believe that this challenge can be more effectively addressed by implicitly tracing the causal origin of non-stationarity. To this end, we introduce the Causal-Origin REPresentation (COREP) algorithm. COREP primarily employs a guided updating mechanism to learn a stable graph representation for the state, termed the causal-origin representation. By leveraging this representation, the learned policy exhibits impressive resilience to non-stationarity. We supplement our approach with a theoretical analysis grounded in the causal interpretation for non-stationary reinforcement learning, supporting the validity of the causal-origin representation. Experimental results further demonstrate the superior performance of COREP over existing methods in tackling non-stationarity problems.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2306.02747 [cs.LG]
	(or arXiv:2306.02747v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.02747

Submission history

From: Wanpeng Zhang
[v1] Mon, 5 Jun 2023 10:05:43 UTC (1,313 KB)
[v2] Fri, 29 Sep 2023 12:07:36 UTC (1,359 KB)
[v3] Sun, 2 Jun 2024 06:32:12 UTC (1,345 KB)
