Ian Osband
2020 – today
- 2023
- [j4] Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen: Reinforcement Learning, Bit by Bit. Found. Trends Mach. Learn. 16(6): 733-865 (2023)
- [j3] Vikranth Dwaracherla, Zheng Wen, Ian Osband, Xiuyuan Lu, Seyed Mohammad Asghari, Benjamin Van Roy: Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping. Trans. Mach. Learn. Res. 2023 (2023)
- [c20] Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy: Epistemic Neural Networks. NeurIPS 2023
- [c19] Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy: Approximate Thompson Sampling via Epistemic Neural Networks. UAI 2023: 1586-1595
- [i32] Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy: Approximate Thompson Sampling via Epistemic Neural Networks. CoRR abs/2302.09205 (2023)
- 2022
- [c18] Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Dieterich Lawson, Botao Hao, Brendan O'Donoghue, Benjamin Van Roy: The Neural Testbed: Evaluating Joint Predictions. NeurIPS 2022
- [c17] Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Benjamin Van Roy: Evaluating high-order predictive distributions in deep learning. UAI 2022: 1552-1560
- [i31] Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Benjamin Van Roy: Evaluating High-Order Predictive Distributions in Deep Learning. CoRR abs/2202.13509 (2022)
- [i30] Vikranth Dwaracherla, Zheng Wen, Ian Osband, Xiuyuan Lu, Seyed Mohammad Asghari, Benjamin Van Roy: Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping. CoRR abs/2206.03633 (2022)
- [i29] Xiuyuan Lu, Ian Osband, Seyed Mohammad Asghari, Sven Gowal, Vikranth Dwaracherla, Zheng Wen, Benjamin Van Roy: Robustness of Epinets against Distributional Shifts. CoRR abs/2207.00137 (2022)
- [i28] Ian Osband, Seyed Mohammad Asghari, Benjamin Van Roy, Nat McAleese, John Aslanides, Geoffrey Irving: Fine-Tuning Language Models via Epistemic Neural Networks. CoRR abs/2211.01568 (2022)
- 2021
- [c16] Brendan O'Donoghue, Tor Lattimore, Ian Osband: Matrix games with bandit feedback. UAI 2021: 279-289
- [i27] Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen: Reinforcement Learning, Bit by Bit. CoRR abs/2103.04047 (2021)
- [i26] Ian Osband, Zheng Wen, Mohammad Asghari, Morteza Ibrahimi, Xiyuan Lu, Benjamin Van Roy: Epistemic Neural Networks. CoRR abs/2107.08924 (2021)
- [i25] Xiuyuan Lu, Ian Osband, Benjamin Van Roy, Zheng Wen: Evaluating Probabilistic Inference in Deep Learning: Beyond Marginal Predictions. CoRR abs/2107.09224 (2021)
- [i24] Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Botao Hao, Morteza Ibrahimi, Dieterich Lawson, Xiuyuan Lu, Brendan O'Donoghue, Benjamin Van Roy: Evaluating Predictive Distributions: Does Bayesian Deep Learning Work? CoRR abs/2110.04629 (2021)
- 2020
- [c15] Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy: Hypermodels for Exploration. ICLR 2020
- [c14] Brendan O'Donoghue, Ian Osband, Catalin Ionescu: Making Sense of Reinforcement Learning and Probabilistic Inference. ICLR 2020
- [c13] Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard S. Sutton, David Silver, Hado van Hasselt: Behaviour Suite for Reinforcement Learning. ICLR 2020
- [i23] Brendan O'Donoghue, Ian Osband, Catalin Ionescu: Making Sense of Reinforcement Learning and Probabilistic Inference. CoRR abs/2001.00805 (2020)
- [i22] Brendan O'Donoghue, Tor Lattimore, Ian Osband: Stochastic matrix games with bandit feedback. CoRR abs/2006.05145 (2020)
- [i21] Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy: Hypermodels for Exploration. CoRR abs/2006.07464 (2020)
2010 – 2019
- 2019
- [j2] Ian Osband, Benjamin Van Roy, Daniel J. Russo, Zheng Wen: Deep Exploration via Randomized Value Functions. J. Mach. Learn. Res. 20: 124:1-124:62 (2019)
- [i20] Pedro A. Ortega, Jane X. Wang, Mark Rowland, Tim Genewein, Zeb Kurth-Nelson, Razvan Pascanu, Nicolas Heess, Joel Veness, Alexander Pritzel, Pablo Sprechmann, Siddhant M. Jayakumar, Tom McGrath, Kevin J. Miller, Mohammad Gheshlaghi Azar, Ian Osband, Neil C. Rabinowitz, András György, Silvia Chiappa, Simon Osindero, Yee Whye Teh, Hado van Hasselt, Nando de Freitas, Matthew M. Botvinick, Shane Legg: Meta-learning of Sequential Strategies. CoRR abs/1905.03030 (2019)
- [i19] Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepesvári, Satinder Singh, Benjamin Van Roy, Richard S. Sutton, David Silver, Hado van Hasselt: Behaviour Suite for Reinforcement Learning. CoRR abs/1908.03568 (2019)
- 2018
- [j1] Daniel Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen: A Tutorial on Thompson Sampling. Found. Trends Mach. Learn. 11(1): 1-96 (2018)
- [c12] Todd Hester, Matej Vecerík, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Ian Osband, Gabriel Dulac-Arnold, John P. Agapiou, Joel Z. Leibo, Audrunas Gruslys: Deep Q-learning From Demonstrations. AAAI 2018: 3223-3230
- [c11] Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Matteo Hessel, Ian Osband, Alex Graves, Volodymyr Mnih, Rémi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg: Noisy Networks For Exploration. ICLR (Poster) 2018
- [c10] Brendan O'Donoghue, Ian Osband, Rémi Munos, Volodymyr Mnih: The Uncertainty Bellman Equation and Exploration. ICML 2018: 3836-3845
- [c9] Maria Dimakopoulou, Ian Osband, Benjamin Van Roy: Scalable Coordinated Exploration in Concurrent Reinforcement Learning. NeurIPS 2018: 4223-4232
- [c8] Ian Osband, John Aslanides, Albin Cassirer: Randomized Prior Functions for Deep Reinforcement Learning. NeurIPS 2018: 8626-8638
- [i18] Maria Dimakopoulou, Ian Osband, Benjamin Van Roy: Scalable Coordinated Exploration in Concurrent Reinforcement Learning. CoRR abs/1805.08948 (2018)
- [i17] Ian Osband, John Aslanides, Albin Cassirer: Randomized Prior Functions for Deep Reinforcement Learning. CoRR abs/1806.03335 (2018)
- 2017
- [c7] Mohammad Gheshlaghi Azar, Ian Osband, Rémi Munos: Minimax Regret Bounds for Reinforcement Learning. ICML 2017: 263-272
- [c6] Ian Osband, Benjamin Van Roy: Why is Posterior Sampling Better than Optimism for Reinforcement Learning? ICML 2017: 2701-2710
- [i16] Ian Osband, Benjamin Van Roy: Gaussian-Dirichlet Posterior Dominance in Sequential Learning. CoRR abs/1702.04126 (2017)
- [i15] Mohammad Gheshlaghi Azar, Ian Osband, Rémi Munos: Minimax Regret Bounds for Reinforcement Learning. CoRR abs/1703.05449 (2017)
- [i14] Ian Osband, Daniel Russo, Zheng Wen, Benjamin Van Roy: Deep Exploration via Randomized Value Functions. CoRR abs/1703.07608 (2017)
- [i13] Todd Hester, Matej Vecerík, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John P. Agapiou, Joel Z. Leibo, Audrunas Gruslys: Learning from Demonstrations for Real World Reinforcement Learning. CoRR abs/1704.03732 (2017)
- [i12] Ian Osband, Benjamin Van Roy: On Optimistic versus Randomized Exploration in Reinforcement Learning. CoRR abs/1706.04241 (2017)
- [i11] Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Rémi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg: Noisy Networks for Exploration. CoRR abs/1706.10295 (2017)
- [i10] Daniel Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband: A Tutorial on Thompson Sampling. CoRR abs/1707.02038 (2017)
- [i9] Brendan O'Donoghue, Ian Osband, Rémi Munos, Volodymyr Mnih: The Uncertainty Bellman Equation and Exploration. CoRR abs/1709.05380 (2017)
- 2016
- [c5] Ian Osband, Benjamin Van Roy, Zheng Wen: Generalization and Exploration via Randomized Value Functions. ICML 2016: 2377-2386
- [c4] Ian Osband, Charles Blundell, Alexander Pritzel, Benjamin Van Roy: Deep Exploration via Bootstrapped DQN. NIPS 2016: 4026-4034
- [i8] Ian Osband, Charles Blundell, Alexander Pritzel, Benjamin Van Roy: Deep Exploration via Bootstrapped DQN. CoRR abs/1602.04621 (2016)
- [i7] Ian Osband, Benjamin Van Roy: Why is Posterior Sampling Better than Optimism for Reinforcement Learning. CoRR abs/1607.00215 (2016)
- [i6] Ian Osband, Benjamin Van Roy: Posterior Sampling for Reinforcement Learning Without Episodes. CoRR abs/1608.02731 (2016)
- [i5] Ian Osband, Benjamin Van Roy: On Lower Bounds for Regret in Reinforcement Learning. CoRR abs/1608.02732 (2016)
- 2015
- [i4] Ian Osband, Benjamin Van Roy: Bootstrapped Thompson Sampling and Deep Exploration. CoRR abs/1507.00300 (2015)
- 2014
- [c3] Ian Osband, Benjamin Van Roy: Near-optimal Reinforcement Learning in Factored MDPs. NIPS 2014: 604-612
- [c2] Ian Osband, Benjamin Van Roy: Model-based Reinforcement Learning and the Eluder Dimension. NIPS 2014: 1466-1474
- [i3] Ian Osband, Benjamin Van Roy: Near-optimal Regret Bounds for Reinforcement Learning in Factored MDPs. CoRR abs/1403.3741 (2014)
- [i2] Ian Osband, Benjamin Van Roy: Model-based Reinforcement Learning and the Eluder Dimension. CoRR abs/1406.1853 (2014)
- 2013
- [c1] Ian Osband, Daniel Russo, Benjamin Van Roy: (More) Efficient Reinforcement Learning via Posterior Sampling. NIPS 2013: 3003-3011
- [i1] Ian Osband, Daniel Russo, Benjamin Van Roy: (More) Efficient Reinforcement Learning via Posterior Sampling. CoRR abs/1306.0940 (2013)
last updated on 2024-10-04 21:00 CEST by the dblp team
all metadata released as open data under CC0 1.0 license