Mark Rowland 0001
Person information
- affiliation: Google DeepMind, London, UK
Other persons with the same name
- Mark Rowland 0002 — NUI, Galway, Ireland
- Mark Rowland 0003 — Icergi Ltd., Dublin, Ireland
- Mark Rowland 0004 — PWC
2020 – today
- 2024
- [j5] Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney: An Analysis of Quantile Temporal-Difference Learning. J. Mach. Learn. Res. 25: 163:1-163:47 (2024)
- [c48] Mohammad Gheshlaghi Azar, Zhaohan Daniel Guo, Bilal Piot, Rémi Munos, Mark Rowland, Michal Valko, Daniele Calandriello: A General Theoretical Paradigm to Understand Learning from Human Preferences. AISTATS 2024: 4447-4455
- [c47] Daniele Calandriello, Zhaohan Daniel Guo, Rémi Munos, Mark Rowland, Yunhao Tang, Bernardo Ávila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu, Rishabh Joshi, Zeyu Zheng, Bilal Piot: Human Alignment of Large Language Models through Online Preference Optimisation. ICML 2024
- [c46] Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Côme Fiegel, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot: Nash Learning from Human Feedback. ICML 2024
- [c45] Yunhao Tang, Zhaohan Daniel Guo, Zeyu Zheng, Daniele Calandriello, Rémi Munos, Mark Rowland, Pierre Harvey Richemond, Michal Valko, Bernardo Ávila Pires, Bilal Piot: Generalized Preference Optimization: A Unified Approach to Offline Alignment. ICML 2024
- [c44] Li Kevin Wenliang, Grégoire Delétang, Matthew Aitchison, Marcus Hutter, Anian Ruoss, Arthur Gretton, Mark Rowland: Distributional Bellman Operators over Mean Embeddings. ICML 2024
- [c43] Harley Wiltzer, Jesse Farebrother, Arthur Gretton, Yunhao Tang, André Barreto, Will Dabney, Marc G. Bellemare, Mark Rowland: A Distributional Analogue to the Successor Representation. ICML 2024
- [i50] Yunhao Tang, Zhaohan Daniel Guo, Zeyu Zheng, Daniele Calandriello, Rémi Munos, Mark Rowland, Pierre Harvey Richemond, Michal Valko, Bernardo Ávila Pires, Bilal Piot: Generalized Preference Optimization: A Unified Approach to Offline Alignment. CoRR abs/2402.05749 (2024)
- [i49] Yunhao Tang, Mark Rowland, Rémi Munos, Bernardo Ávila Pires, Will Dabney: Off-policy Distributional Q(λ): Distributional RL without Importance Sampling. CoRR abs/2402.05766 (2024)
- [i48] Mark Rowland, Li Kevin Wenliang, Rémi Munos, Clare Lyle, Yunhao Tang, Will Dabney: Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model. CoRR abs/2402.07598 (2024)
- [i47] Harley Wiltzer, Jesse Farebrother, Arthur Gretton, Yunhao Tang, André Barreto, Will Dabney, Marc G. Bellemare, Mark Rowland: A Distributional Analogue to the Successor Representation. CoRR abs/2402.08530 (2024)
- [i46] Daniele Calandriello, Daniel Guo, Rémi Munos, Mark Rowland, Yunhao Tang, Bernardo Ávila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu, Rishabh Joshi, Zeyu Zheng, Bilal Piot: Human Alignment of Large Language Models through Online Preference Optimisation. CoRR abs/2403.08635 (2024)
- [i45] Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Ávila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana Borsa, Arthur Guez, Will Dabney: A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning. CoRR abs/2406.02035 (2024)
- [i44] Harley Wiltzer, Jesse Farebrother, Arthur Gretton, Mark Rowland: Foundations of Multivariate Distributional Reinforcement Learning. CoRR abs/2409.00328 (2024)
- 2023
- [j4] Pablo Samuel Castro, Tyler Kastner, Prakash Panangaden, Mark Rowland: A Kernel Perspective on Behavioural Metrics for Markov Decision Processes. Trans. Mach. Learn. Res. 2023 (2023)
- [c42] Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare: A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces. AISTATS 2023: 1703-1718
- [c41] Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc G. Bellemare, Will Dabney: Bootstrapped Representations in Reinforcement Learning. ICML 2023: 18686-18713
- [c40] Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, Rémi Munos: Quantile Credit Assignment. ICML 2023: 24517-24531
- [c39] Mark Rowland, Yunhao Tang, Clare Lyle, Rémi Munos, Marc G. Bellemare, Will Dabney: The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation. ICML 2023: 29210-29231
- [c38] Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo Ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko: Understanding Self-Predictive Learning for Reinforcement Learning. ICML 2023: 33632-33656
- [c37] Yunhao Tang, Tadashi Kozuno, Mark Rowland, Anna Harutyunyan, Rémi Munos, Bernardo Ávila Pires, Michal Valko: DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm. ICML 2023: 33657-33673
- [c36] Yunhao Tang, Rémi Munos, Mark Rowland, Michal Valko: VA-learning as a more efficient alternative to Q-learning. ICML 2023: 33739-33757
- [i43] Mark Rowland, Rémi Munos, Mohammad Gheshlaghi Azar, Yunhao Tang, Georg Ostrovski, Anna Harutyunyan, Karl Tuyls, Marc G. Bellemare, Will Dabney: An Analysis of Quantile Temporal-Difference Learning. CoRR abs/2301.04462 (2023)
- [i42] Yunhao Tang, Rémi Munos, Mark Rowland, Michal Valko: VA-learning as a more efficient alternative to Q-learning. CoRR abs/2305.18161 (2023)
- [i41] Mark Rowland, Yunhao Tang, Clare Lyle, Rémi Munos, Marc G. Bellemare, Will Dabney: The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation. CoRR abs/2305.18388 (2023)
- [i40] Yunhao Tang, Tadashi Kozuno, Mark Rowland, Anna Harutyunyan, Rémi Munos, Bernardo Ávila Pires, Michal Valko: DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm. CoRR abs/2305.18501 (2023)
- [i39] Charline Le Lan, Stephen Tu, Mark Rowland, Anna Harutyunyan, Rishabh Agarwal, Marc G. Bellemare, Will Dabney: Bootstrapped Representations in Reinforcement Learning. CoRR abs/2306.10171 (2023)
- [i38] Mohammad Gheshlaghi Azar, Mark Rowland, Bilal Piot, Daniel Guo, Daniele Calandriello, Michal Valko, Rémi Munos: A General Theoretical Paradigm to Understand Learning from Human Preferences. CoRR abs/2310.12036 (2023)
- [i37] Pablo Samuel Castro, Tyler Kastner, Prakash Panangaden, Mark Rowland: A Kernel Perspective on Behavioural Metrics for Markov Decision Processes. CoRR abs/2310.19804 (2023)
- [i36] Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot: Nash Learning from Human Feedback. CoRR abs/2312.00886 (2023)
- [i35] Li Kevin Wenliang, Grégoire Delétang, Matthew Aitchison, Marcus Hutter, Anian Ruoss, Arthur Gretton, Mark Rowland: Distributional Bellman Operators over Mean Embeddings. CoRR abs/2312.07358 (2023)
- 2022
- [j3] Georgios Piliouras, Mark Rowland, Shayegan Omidshafiei, Romuald Elie, Daniel Hennes, Jerome T. Connor, Karl Tuyls: Evolutionary Dynamics and Phi-Regret Minimization in Games. J. Artif. Intell. Res. 74: 1125-1158 (2022)
- [c35] Yunhao Tang, Mark Rowland, Rémi Munos, Michal Valko: Marginalized Operators for Off-policy Reinforcement Learning. AISTATS 2022: 655-679
- [c34] Paul Muller, Mark Rowland, Romuald Elie, Georgios Piliouras, Julien Pérolat, Mathieu Laurière, Raphaël Marinier, Olivier Pietquin, Karl Tuyls: Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO. AAMAS 2022: 926-934
- [c33] Clare Lyle, Mark Rowland, Will Dabney: Understanding and Preventing Capacity Loss in Reinforcement Learning. ICLR 2022
- [c32] Clare Lyle, Mark Rowland, Will Dabney, Marta Kwiatkowska, Yarin Gal: Learning Dynamics and Generalization in Deep Reinforcement Learning. ICML 2022: 14560-14581
- [c31] Shantanu Thakoor, Mark Rowland, Diana Borsa, Will Dabney, Rémi Munos, André Barreto: Generalised Policy Improvement with Geometric Policy Composition. ICML 2022: 21272-21307
- [c30] Yunhao Tang, Rémi Munos, Mark Rowland, Bernardo Ávila Pires, Will Dabney, Marc G. Bellemare: The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning. NeurIPS 2022
- [c29] Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Rémi Munos, Alexey Naumov, Mark Rowland, Michal Valko, Pierre Ménard: Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees. NeurIPS 2022
- [d1] Julien Pérolat, Bart De Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksandra Malysheva, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot, Shayegan Omidshafiei, Edward Lockhart, Laurent Sifre, Nathalie Beauguerlange, Rémi Munos, David Silver, Satinder Singh, Demis Hassabis, Karl Tuyls: Figure Data for the paper "Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning". Zenodo, 2022
- [i34] Yunhao Tang, Mark Rowland, Rémi Munos, Michal Valko: Marginalized Operators for Off-policy Reinforcement Learning. CoRR abs/2203.16177 (2022)
- [i33] Clare Lyle, Mark Rowland, Will Dabney: Understanding and Preventing Capacity Loss in Reinforcement Learning. CoRR abs/2204.09560 (2022)
- [i32] Clare Lyle, Mark Rowland, Will Dabney, Marta Kwiatkowska, Yarin Gal: Learning Dynamics and Generalization in Reinforcement Learning. CoRR abs/2206.02126 (2022)
- [i31] Shantanu Thakoor, Mark Rowland, Diana Borsa, Will Dabney, Rémi Munos, André Barreto: Generalised Policy Improvement with Geometric Policy Composition. CoRR abs/2206.08736 (2022)
- [i30] Julien Pérolat, Bart De Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas W. Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksandra Malysheva, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot, Shayegan Omidshafiei, Edward Lockhart, Laurent Sifre, Nathalie Beauguerlange, Rémi Munos, David Silver, Satinder Singh, Demis Hassabis, Karl Tuyls: Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning. CoRR abs/2206.15378 (2022)
- [i29] Yunhao Tang, Mark Rowland, Rémi Munos, Bernardo Ávila Pires, Will Dabney, Marc G. Bellemare: The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning. CoRR abs/2207.07570 (2022)
- [i28] Paul Muller, Romuald Elie, Mark Rowland, Mathieu Laurière, Julien Pérolat, Sarah Perrin, Matthieu Geist, Georgios Piliouras, Olivier Pietquin, Karl Tuyls: Learning Correlated Equilibria in Mean-Field Games. CoRR abs/2208.10138 (2022)
- [i27] Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Rémi Munos, Alexey Naumov, Mark Rowland, Michal Valko, Pierre Ménard: Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees. CoRR abs/2209.14414 (2022)
- [i26] Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo Ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, Michal Valko: Understanding Self-Predictive Learning for Reinforcement Learning. CoRR abs/2212.03319 (2022)
- [i25] Charline Le Lan, Joshua Greaves, Jesse Farebrother, Mark Rowland, Fabian Pedregosa, Rishabh Agarwal, Marc G. Bellemare: A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces. CoRR abs/2212.04025 (2022)
- 2021
- [j2] Karl Tuyls, Shayegan Omidshafiei, Paul Muller, Zhe Wang, Jerome T. Connor, Daniel Hennes, Ian Graham, William Spearman, Tim Waskett, Dafydd Steele, Pauline Luc, Adrià Recasens, Alexandre Galashov, Gregory Thornton, Romuald Elie, Pablo Sprechmann, Pol Moreno, Kris Cao, Marta Garnelo, Praneet Dutta, Michal Valko, Nicolas Heess, Alex Bridgland, Julien Pérolat, Bart De Vylder, S. M. Ali Eslami, Mark Rowland, Andrew Jaegle, Rémi Munos, Trevor Back, Razia Ahamed, Simon Bouton, Nathalie Beauguerlange, Jackson Broshear, Thore Graepel, Demis Hassabis: Game Plan: What AI can do for Football, and What Football can do for AI. J. Artif. Intell. Res. 71: 41-88 (2021)
- [c28] Will Dabney, André Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, David Silver: The Value-Improvement Path: Towards Better Representations for Reinforcement Learning. AAAI 2021: 7160-7168
- [c27] Clare Lyle, Mark Rowland, Georg Ostrovski, Will Dabney: On the Effect of Auxiliary Tasks on Representation Dynamics. AISTATS 2021: 1-9
- [c26] Tadashi Kozuno, Yunhao Tang, Mark Rowland, Rémi Munos, Steven Kapturowski, Will Dabney, Michal Valko, David Abel: Revisiting Peng's Q(λ) for Modern Reinforcement Learning. ICML 2021: 5794-5804
- [c25] Julien Pérolat, Rémi Munos, Jean-Baptiste Lespiau, Shayegan Omidshafiei, Mark Rowland, Pedro A. Ortega, Neil Burch, Thomas W. Anthony, David Balduzzi, Bart De Vylder, Georgios Piliouras, Marc Lanctot, Karl Tuyls: From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization. ICML 2021: 8525-8535
- [c24] Yunhao Tang, Mark Rowland, Rémi Munos, Michal Valko: Taylor Expansion of Discount Factors. ICML 2021: 10130-10140
- [c23] Yunhao Tang, Tadashi Kozuno, Mark Rowland, Rémi Munos, Michal Valko: Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation. NeurIPS 2021: 5303-5315
- [c22] Pablo Samuel Castro, Tyler Kastner, Prakash Panangaden, Mark Rowland: MICo: Improved representations via sampling-based state similarity for Markov decision processes. NeurIPS 2021: 30113-30126
- [i24] Clare Lyle, Mark Rowland, Georg Ostrovski, Will Dabney: On The Effect of Auxiliary Tasks on Representation Dynamics. CoRR abs/2102.13089 (2021)
- [i23] Tadashi Kozuno, Yunhao Tang, Mark Rowland, Rémi Munos, Steven Kapturowski, Will Dabney, Michal Valko, David Abel: Revisiting Peng's Q(λ) for Modern Reinforcement Learning. CoRR abs/2103.00107 (2021)
- [i22] Yunhao Tang, Mark Rowland, Rémi Munos, Michal Valko: Taylor Expansion of Discount Factors. CoRR abs/2106.06170 (2021)
- [i21] Pablo Samuel Castro, Tyler Kastner, Prakash Panangaden, Mark Rowland: MICo: Learning improved representations via sampling-based state similarity for Markov decision processes. CoRR abs/2106.08229 (2021)
- [i20] Yunhao Tang, Tadashi Kozuno, Mark Rowland, Rémi Munos, Michal Valko: Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation. CoRR abs/2106.13125 (2021)
- [i19] Georgios Piliouras, Mark Rowland, Shayegan Omidshafiei, Romuald Elie, Daniel Hennes, Jerome T. Connor, Karl Tuyls: Evolutionary Dynamics and Φ-Regret Minimization in Games. CoRR abs/2106.14668 (2021)
- [i18] Paul Muller, Mark Rowland, Romuald Elie, Georgios Piliouras, Julien Pérolat, Mathieu Laurière, Raphaël Marinier, Olivier Pietquin, Karl Tuyls: Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO. CoRR abs/2111.08350 (2021)
- 2020
- [c21] Mark Rowland, Will Dabney, Rémi Munos: Adaptive Trade-Offs in Off-Policy Learning. AISTATS 2020: 34-44
- [c20] Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney: Conditional Importance Sampling for Off-Policy Learning. AISTATS 2020: 45-55
- [c19] Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Pérolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Rémi Munos: A Generalized Training Approach for Multiagent Learning. ICLR 2020
- [c18] William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney: Revisiting Fundamentals of Experience Replay. ICML 2020: 3061-3071
- [c17] Rémi Munos, Julien Pérolat, Jean-Baptiste Lespiau, Mark Rowland, Bart De Vylder, Marc Lanctot, Finbarr Timbers, Daniel Hennes, Shayegan Omidshafiei, Audrunas Gruslys, Mohammad Gheshlaghi Azar, Edward Lockhart, Karl Tuyls: Fast computation of Nash Equilibria in Imperfect Information Games. ICML 2020: 7119-7129
- [i17] Julien Pérolat, Rémi Munos, Jean-Baptiste Lespiau, Shayegan Omidshafiei, Mark Rowland, Pedro A. Ortega, Neil Burch, Thomas W. Anthony, David Balduzzi, Bart De Vylder, Georgios Piliouras, Marc Lanctot, Karl Tuyls: From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization. CoRR abs/2002.08456 (2020)
- [i16] Shayegan Omidshafiei, Karl Tuyls, Wojciech M. Czarnecki, Francisco C. Santos, Mark Rowland, Jerome T. Connor, Daniel Hennes, Paul Muller, Julien Pérolat, Bart De Vylder, Audrunas Gruslys, Rémi Munos: Navigating the Landscape of Games. CoRR abs/2005.01642 (2020)
- [i15] Will Dabney, André Barreto, Mark Rowland, Robert Dadashi, John Quan, Marc G. Bellemare, David Silver: The Value-Improvement Path: Towards Better Representations for Reinforcement Learning. CoRR abs/2006.02243 (2020)
- [i14] William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney: Revisiting Fundamentals of Experience Replay. CoRR abs/2007.06700 (2020)
- [i13] Karl Tuyls, Shayegan Omidshafiei, Paul Muller, Zhe Wang, Jerome T. Connor, Daniel Hennes, Ian Graham, William Spearman, Tim Waskett, Dafydd Steele, Pauline Luc, Adrià Recasens, Alexandre Galashov, Gregory Thornton, Romuald Elie, Pablo Sprechmann, Pol Moreno, Kris Cao, Marta Garnelo, Praneet Dutta, Michal Valko, Nicolas Heess, Alex Bridgland, Julien Pérolat, Bart De Vylder, S. M. Ali Eslami, Mark Rowland, Andrew Jaegle, Rémi Munos, Trevor Back, Razia Ahamed, Simon Bouton, Nathalie Beauguerlange, Jackson Broshear, Thore Graepel, Demis Hassabis: Game Plan: What AI can do for Football, and What Football can do for AI. CoRR abs/2011.09192 (2020)
2010 – 2019
- 2019
- [j1] Maria Lomeli, Mark Rowland, Arthur Gretton, Zoubin Ghahramani: Antithetic and Monte Carlo kernel estimators for partial rankings. Stat. Comput. 29(5): 1127-1147 (2019)
- [c16] Mark Rowland, Jiri Hron, Yunhao Tang, Krzysztof Choromanski, Tamás Sarlós, Adrian Weller: Orthogonal Estimation of Wasserstein Distances. AISTATS 2019: 186-195
- [c15] Krzysztof Choromanski, Mark Rowland, Wenyu Chen, Adrian Weller: Unifying Orthogonal Monte Carlo Methods. ICML 2019: 1203-1212
- [c14] Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney: Statistics and Samples in Distributional Reinforcement Learning. ICML 2019: 5528-5536
- [c13] Mark Rowland, Shayegan Omidshafiei, Karl Tuyls, Julien Pérolat, Michal Valko, Georgios Piliouras, Rémi Munos: Multiagent Evaluation under Incomplete Information. NeurIPS 2019: 12270-12282
- [i12] Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney: Statistics and Samples in Distributional Reinforcement Learning. CoRR abs/1902.08102 (2019)
- [i11] Shayegan Omidshafiei, Christos H. Papadimitriou, Georgios Piliouras, Karl Tuyls, Mark Rowland, Jean-Baptiste Lespiau, Wojciech M. Czarnecki, Marc Lanctot, Julien Pérolat, Rémi Munos: α-Rank: Multi-Agent Evaluation by Evolution. CoRR abs/1903.01373 (2019)
- [i10] Mark Rowland, Jiri Hron, Yunhao Tang, Krzysztof Choromanski, Tamás Sarlós, Adrian Weller: Orthogonal Estimation of Wasserstein Distances. CoRR abs/1903.03784 (2019)
- [i9] Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alexander Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin J. Miller, Mohammad Gheshlaghi Azar, Ian Osband, Neil C. Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew M. Botvinick, Shane Legg: Meta-learning of Sequential Strategies. CoRR abs/1905.03030 (2019)
- [i8] Mark Rowland, Shayegan Omidshafiei, Karl Tuyls, Julien Pérolat, Michal Valko, Georgios Piliouras, Rémi Munos: Multiagent Evaluation under Incomplete Information. CoRR abs/1909.09849 (2019)
- [i7] Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Pérolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Rémi Munos: A Generalized Training Approach for Multiagent Learning. CoRR abs/1909.12823 (2019)
- [i6] Mark Rowland, Will Dabney, Rémi Munos: Adaptive Trade-Offs in Off-Policy Learning. CoRR abs/1910.07478 (2019)
- [i5] Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney: Conditional Importance Sampling for Off-Policy Learning. CoRR abs/1910.07479 (2019)
- 2018
- [c12] Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos: Distributional Reinforcement Learning With Quantile Regression. AAAI 2018: 2892-2901
- [c11] Krzysztof Choromanski, Mark Rowland, Tamás Sarlós, Vikas Sindhwani, Richard E. Turner, Adrian Weller: The Geometry of Random Features. AISTATS 2018: 1-9
- [c10] Mark Rowland, Marc G. Bellemare, Will Dabney, Rémi Munos, Yee Whye Teh: An Analysis of Categorical Distributional Reinforcement Learning. AISTATS 2018: 29-37
- [c9] Alexander G. de G. Matthews, Jiri Hron, Mark Rowland, Richard E. Turner, Zoubin Ghahramani: Gaussian Process Behaviour in Wide Deep Neural Networks. ICLR (Poster) 2018
- [c8] Krzysztof Choromanski, Mark Rowland, Vikas Sindhwani, Richard E. Turner, Adrian Weller: Structured Evolution with Compact Architectures for Scalable Policy Optimization. ICML 2018: 969-977
- [c7] Mark Rowland, Krzysztof Choromanski, François Chalus, Aldo Pacchiano, Tamás Sarlós, Richard E. Turner, Adrian Weller: Geometrically Coupled Monte Carlo Sampling. NeurIPS 2018: 195-205
- [i4] Krzysztof Choromanski, Mark Rowland, Vikas Sindhwani, Richard E. Turner, Adrian Weller: Structured Evolution with Compact Architectures for Scalable Policy Optimization. CoRR abs/1804.02395 (2018)
- [i3] Alexander G. de G. Matthews, Mark Rowland, Jiri Hron, Richard E. Turner, Zoubin Ghahramani: Gaussian Process Behaviour in Wide Deep Neural Networks. CoRR abs/1804.11271 (2018)
- [i2] Maria Lomeli, Mark Rowland, Arthur Gretton, Zoubin Ghahramani: Antithetic and Monte Carlo kernel estimators for partial rankings. CoRR abs/1807.00400 (2018)
- 2017
- [c6] Mark Rowland, Aldo Pacchiano, Adrian Weller: Conditions beyond treewidth for tightness of higher-order LP relaxations. AISTATS 2017: 10-18
- [c5] Nilesh Tripuraneni, Mark Rowland, Zoubin Ghahramani, Richard E. Turner: Magnetic Hamiltonian Monte Carlo. ICML 2017: 3453-3461
- [c4] Mark Rowland, Adrian Weller: Uprooting and Rerooting Higher-Order Graphical Models. NIPS 2017: 209-218
- [c3] Krzysztof Marcin Choromanski, Mark Rowland, Adrian Weller: The Unreasonable Effectiveness of Structured Random Orthogonal Embeddings. NIPS 2017: 219-228
- [i1] Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos: Distributional Reinforcement Learning with Quantile Regression. CoRR abs/1710.10044 (2017)
- 2016
- [c2] Adrian Weller, Mark Rowland, David A. Sontag: Tightness of LP Relaxations for Almost Balanced Models. AISTATS 2016: 47-55
- [c1] José Miguel Hernández-Lobato, Yingzhen Li, Mark Rowland, Thang D. Bui, Daniel Hernández-Lobato, Richard E. Turner: Black-Box Alpha Divergence Minimization. ICML 2016: 1511-1520
last updated on 2024-10-30 20:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license