default search action
Kamil Ciosek
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j3]Nicolò Felicioni, Lucas Maystre, Sina Ghiassian, Kamil Ciosek:
On the Importance of Uncertainty in Decision-Making with Large Language Models. Trans. Mach. Learn. Res. 2024 (2024) - [i22]Nicolò Felicioni, Lucas Maystre, Sina Ghiassian, Kamil Ciosek:
On the Importance of Uncertainty in Decision-Making with Large Language Models. CoRR abs/2404.02649 (2024) - [i21]Sergio Calvo-Ordoñez, Konstantina Palla, Kamil Ciosek:
Epistemic Uncertainty and Observation Noise with the Neural Tangent Kernel. CoRR abs/2409.03953 (2024) - 2023
- [c20]Thomas M. McDonald, Lucas Maystre, Mounia Lalmas, Daniel Russo, Kamil Ciosek:
Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay. KDD 2023: 1687-1697 - [c19]Federico Tomasi, Joseph Cauteruccio, Surya Kanoria, Kamil Ciosek, Matteo Rinaldi, Zhenwen Dai:
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning. KDD 2023: 4948-4957 - [i20]Matthew Smith, Lucas Maystre, Zhenwen Dai, Kamil Ciosek:
A Strong Baseline for Batch Imitation Learning. CoRR abs/2302.02788 (2023) - [i19]Thomas M. McDonald, Lucas Maystre, Mounia Lalmas, Daniel Russo, Kamil Ciosek:
Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay. CoRR abs/2307.09943 (2023) - [i18]Federico Tomasi, Joseph Cauteruccio, Surya Kanoria, Kamil Ciosek, Matteo Rinaldi, Zhenwen Dai:
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning. CoRR abs/2310.09123 (2023) - 2022
- [c18]Kamil Ciosek:
Imitation Learning by Reinforcement Learning. ICLR 2022 - 2021
- [c17]Tabish Rashid, Cheng Zhang, Kamil Ciosek:
Estimating α-Rank by Maximizing Information Gain. AAAI 2021: 5673-5681 - [c16]Hisham Husain, Kamil Ciosek, Ryota Tomioka:
Regularized Policies are Reward Robust. AISTATS 2021: 64-72 - [c15]Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, Anca D. Dragan, Rohin Shah:
Evaluating the Robustness of Collaborative Agents. AAMAS 2021: 1560-1562 - [c14]Luisa M. Zintgraf, Sam Devlin, Kamil Ciosek, Shimon Whiteson, Katja Hofmann:
Deep Interactive Bayesian Reinforcement Learning via Meta-Learning. AAMAS 2021: 1712-1714 - [c13]David Lindner, Matteo Turchetta, Sebastian Tschiatschek, Kamil Ciosek, Andreas Krause:
Information Directed Reward Learning for Reinforcement Learning. NeurIPS 2021: 3850-3862 - [i17]Luisa M. Zintgraf, Sam Devlin, Kamil Ciosek, Shimon Whiteson, Katja Hofmann:
Deep Interactive Bayesian Reinforcement Learning via Meta-Learning. CoRR abs/2101.03864 (2021) - [i16]Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, Anca D. Dragan, Rohin Shah:
Evaluating the Robustness of Collaborative Agents. CoRR abs/2101.05507 (2021) - [i15]Hisham Husain, Kamil Ciosek, Ryota Tomioka:
Regularized Policies are Reward Robust. CoRR abs/2101.07012 (2021) - [i14]Tabish Rashid, Cheng Zhang, Kamil Ciosek:
Estimating α-Rank by Maximizing Information Gain. CoRR abs/2101.09178 (2021) - [i13]David Lindner, Matteo Turchetta, Sebastian Tschiatschek, Kamil Ciosek, Andreas Krause:
Information Directed Reward Learning for Reinforcement Learning. CoRR abs/2102.12466 (2021) - [i12]Kamil Ciosek:
Imitation Learning by Reinforcement Learning. CoRR abs/2108.04763 (2021) - 2020
- [j2]Kamil Ciosek, Shimon Whiteson:
Expected Policy Gradients for Reinforcement Learning. J. Mach. Learn. Res. 21: 52:1-52:51 (2020) - [j1]Supratik Paul, Konstantinos I. Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A. Osborne, Shimon Whiteson:
Robust Reinforcement Learning with Bayesian Optimisation and Quadrature. J. Mach. Learn. Res. 21: 151:1-151:31 (2020) - [c12]Jacob Beck, Kamil Ciosek, Sam Devlin, Sebastian Tschiatschek, Cheng Zhang, Katja Hofmann:
AMRL: Aggregated Memory For Reinforcement Learning. ICLR 2020 - [c11]Kamil Ciosek, Vincent Fortuin, Ryota Tomioka, Katja Hofmann, Richard E. Turner:
Conservative Uncertainty Estimation By Fitting Prior Networks. ICLR 2020 - [c10]Ron Amit, Ron Meir, Kamil Ciosek:
Discount Factor as a Regularizer in Reinforcement Learning. ICML 2020: 269-278 - [c9]Jiachen Li, Quan Vuong, Shuang Liu, Minghua Liu, Kamil Ciosek, Henrik I. Christensen, Hao Su:
Multi-task Batch Reinforcement Learning with Metric Learning. NeurIPS 2020 - [i11]Ron Amit, Ron Meir, Kamil Ciosek:
Discount Factor as a Regularizer in Reinforcement Learning. CoRR abs/2007.02040 (2020) - [i10]Luke Harries, Rebekah Storan Clarke, Timothy Chapman, Swamy V. P. L. N. Nallamalli, Levent Özgür, Shuktika Jain, Alex Leung, Steve Lim, Aaron Dietrich, José Miguel Hernández-Lobato, Tom Ellis, Cheng Zhang, Kamil Ciosek:
DRIFT: Deep Reinforcement Learning for Functional Software Testing. CoRR abs/2007.08220 (2020)
2010 – 2019
- 2019
- [c8]Kamil Ciosek, Quan Vuong, Robert Tyler Loftin, Katja Hofmann:
Better Exploration with Optimistic Actor Critic. NeurIPS 2019: 1785-1796 - [c7]Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin, Katja Hofmann:
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck. NeurIPS 2019: 13956-13968 - [i9]Quan Vuong, Shuang Liu, Minghua Liu, Kamil Ciosek, Hao Su, Henrik Iskov Christensen:
Pre-training as Batch Meta Reinforcement Learning with tiMe. CoRR abs/1909.11373 (2019) - [i8]Kamil Ciosek, Quan Vuong, Robert Tyler Loftin, Katja Hofmann:
Better Exploration with Optimistic Actor-Critic. CoRR abs/1910.12807 (2019) - [i7]Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin, Katja Hofmann:
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck. CoRR abs/1910.12911 (2019) - 2018
- [c6]Kamil Ciosek, Shimon Whiteson:
Expected Policy Gradients. AAAI 2018: 2868-2875 - [c5]Supratik Paul, Konstantinos I. Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A. Osborne, Shimon Whiteson:
Alternating Optimisation and Quadrature for Robust Control. AAAI 2018: 3925-3933 - [c4]Matthew Fellows, Kamil Ciosek, Shimon Whiteson:
Fourier Policy Gradients. ICML 2018: 1485-1494 - [i6]Kamil Ciosek, Shimon Whiteson:
Expected Policy Gradients for Reinforcement Learning. CoRR abs/1801.03326 (2018) - [i5]Matthew Fellows, Kamil Ciosek, Shimon Whiteson:
Fourier Policy Gradients. CoRR abs/1802.06891 (2018) - 2017
- [c3]Kamil Andrzej Ciosek, Shimon Whiteson:
OFFER: Off-Environment Reinforcement Learning. AAAI 2017: 1819-1825 - [i4]Kamil Ciosek, Shimon Whiteson:
Expected Policy Gradients. CoRR abs/1706.05374 (2017) - 2016
- [i3]Supratik Paul, Kamil Ciosek, Michael A. Osborne, Shimon Whiteson:
Alternating Optimisation and Quadrature for Robust Reinforcement Learning. CoRR abs/1605.07496 (2016) - 2015
- [b1]Kamil Andrzej Ciosek:
Linear reinforcement learning with options. University College London, UK, 2015 - [i2]Kamil Ciosek, David Silver:
Value Iteration with Options and State Aggregation. CoRR abs/1501.03959 (2015) - 2013
- [i1]Kamil Ciosek:
Properties of the Least Squares Temporal Difference learning algorithm. CoRR abs/1301.5220 (2013) - 2012
- [c2]David Silver, Kamil Ciosek:
Compositional Planning Using Optimal Option Models. ICML 2012
2000 – 2009
- 2009
- [c1]Kamil Ciosek, Pawel Kotowski:
Generating 3D Plants using Lindenmayer System. GRAPP 2009: 76-81
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-10 21:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint