Time-Variant Variational Transfer for Value Functions

Canonaco, Giuseppe; Soprani, Andrea; Roveri, Manuel; Restelli, Marcello

Computer Science > Machine Learning

arXiv:2005.12864 (cs)

[Submitted on 26 May 2020 (v1), last revised 18 Jun 2020 (this version, v2)]

Title:Time-Variant Variational Transfer for Value Functions

Authors:Giuseppe Canonaco, Andrea Soprani, Manuel Roveri, Marcello Restelli

View PDF

Abstract:In most of the transfer learning approaches to reinforcement learning (RL) the distribution over the tasks is assumed to be stationary. Therefore, the target and source tasks are i.i.d. samples of the same distribution. In the context of this work, we consider the problem of transferring value functions through a variational method when the distribution that generates the tasks is time-variant, proposing a solution that leverages this temporal structure inherent in the task generating process. Furthermore, by means of a finite-sample analysis, the previously mentioned solution is theoretically compared to its time-invariant version. Finally, we will provide an experimental evaluation of the proposed technique with three distinct temporal dynamics in three different RL environments.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2005.12864 [cs.LG]
	(or arXiv:2005.12864v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2005.12864

Submission history

From: Giuseppe Canonaco [view email]
[v1] Tue, 26 May 2020 16:52:26 UTC (542 KB)
[v2] Thu, 18 Jun 2020 13:13:12 UTC (989 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-05

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Giuseppe Canonaco
Manuel Roveri
Marcello Restelli

export BibTeX citation

Computer Science > Machine Learning

Title:Time-Variant Variational Transfer for Value Functions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Time-Variant Variational Transfer for Value Functions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators