default search action
Abbas Abdolmaleki
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j8]Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Manon Devin, Alex X. Lee, Maria Bauzá Villalonga, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Fernandes Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Zolna, Scott E. Reed, Sergio Gómez Colmenarejo, Jon Scholz, Abbas Abdolmaleki, Oliver Groth, Jean-Baptiste Regli, Oleg Sushkov, Thomas Rothörl, José Enrique Chen, Yusuf Aytar, Dave Barker, Joy Ortiz, Martin A. Riedmiller, Jost Tobias Springenberg, Raia Hadsell, Francesco Nori, Nicolas Heess:
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation. Trans. Mach. Learn. Res. 2024 (2024) - [c36]Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin A. Riedmiller:
Offline Actor-Critic Reinforcement Learning Scales to Large Models. ICML 2024 - [c35]Thomas Lampe, Abbas Abdolmaleki, Sarah Bechtle, Sandy H. Huang, Jost Tobias Springenberg, Michael Bloesch, Oliver Groth, Roland Hafner, Tim Hertweck, Michael Neunert, Markus Wulfmeier, Jingwei Zhang, Francesco Nori, Nicolas Heess, Martin A. Riedmiller:
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots. ICRA 2024: 7772-7779 - [c34]Mohak Bhardwaj, Thomas Lampe, Michael Neunert, Francesco Romano, Abbas Abdolmaleki, Arunkumar Byravan, Markus Wulfmeier, Martin A. Riedmiller, Jonas Buchli:
Real-world fluid directed rigid body control via deep reinforcement learning. L4DC 2024: 414-427 - [i37]Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin A. Riedmiller:
Offline Actor-Critic Reinforcement Learning Scales to Large Models. CoRR abs/2402.05546 (2024) - [i36]Mohak Bhardwaj, Thomas Lampe, Michael Neunert, Francesco Romano, Abbas Abdolmaleki, Arunkumar Byravan, Markus Wulfmeier, Martin A. Riedmiller, Jonas Buchli:
Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning. CoRR abs/2402.06102 (2024) - [i35]Jingwei Zhang, Thomas Lampe, Abbas Abdolmaleki, Jost Tobias Springenberg, Martin A. Riedmiller:
Game On: Towards Language Models as RL Experimenters. CoRR abs/2409.03402 (2024) - [i34]Abbas Abdolmaleki, Bilal Piot, Bobak Shahriari, Jost Tobias Springenberg, Tim Hertweck, Rishabh Joshi, Junhyuk Oh, Michael Bloesch, Thomas Lampe, Nicolas Heess, Jonas Buchli, Martin A. Riedmiller:
Preference Optimization as Probabilistic Inference. CoRR abs/2410.04166 (2024) - 2023
- [j7]Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin A. Riedmiller:
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration. Trans. Mach. Learn. Res. 2023 (2023) - [i33]Jingwei Zhang, Jost Tobias Springenberg, Arunkumar Byravan, Leonard Hasenclever, Abbas Abdolmaleki, Dushyant Rao, Nicolas Heess, Martin A. Riedmiller:
Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains. CoRR abs/2302.12617 (2023) - [i32]Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauzá, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo F. Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Zolna, Scott E. Reed, Sergio Gómez Colmenarejo, Jon Scholz, Abbas Abdolmaleki, Oliver Groth, Jean-Baptiste Regli, Oleg Sushkov, Thomas Rothörl, José Enrique Chen, Yusuf Aytar, Dave Barker, Joy Ortiz, Martin A. Riedmiller, Jost Tobias Springenberg, Raia Hadsell, Francesco Nori, Nicolas Heess:
RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation. CoRR abs/2306.11706 (2023) - [i31]Shruti Mishra, Ankit Anand, Jordan Hoffmann, Nicolas Heess, Martin A. Riedmiller, Abbas Abdolmaleki, Doina Precup:
Policy composition in reinforcement learning via multi-objective policy optimization. CoRR abs/2308.15470 (2023) - [i30]Thomas Lampe, Abbas Abdolmaleki, Sarah Bechtle, Sandy H. Huang, Jost Tobias Springenberg, Michael Bloesch, Oliver Groth, Roland Hafner, Tim Hertweck, Michael Neunert, Markus Wulfmeier, Jingwei Zhang, Francesco Nori, Nicolas Heess, Martin A. Riedmiller:
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots. CoRR abs/2312.11374 (2023) - 2022
- [j6]Jonas Degrave, Federico Felici, Jonas Buchli, Michael Neunert, Brendan D. Tracey, Francesco Carpanese, Timo Ewalds, Roland Hafner, Abbas Abdolmaleki, Diego de Las Casas, Craig Donner, Leslie Fritz, Cristian Galperti, Andrea Huber, James Keeling, Maria Tsimpoukelli, Jackie Kay, Antoine Merle, Jean-Marc Moret, Seb Noury, Federico Pesamosca, David Pfau, Olivier Sauter, Cristian Sommariva, Stefano Coda, Basil Duval, Ambrogio Fasoli, Pushmeet Kohli, Koray Kavukcuoglu, Demis Hassabis, Martin A. Riedmiller:
Magnetic control of tokamak plasmas through deep reinforcement learning. Nat. 602(7897): 414-419 (2022) - [j5]Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, S. M. Ali Eslami, Daniel Hennes, Wojciech M. Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Y. Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, H. Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan D. Tracey, Karl Tuyls, Thore Graepel, Nicolas Heess:
From motor control to team play in simulated humanoid football. Sci. Robotics 7(69) (2022) - [c33]Wenxuan Zhou, Steven Bohez, Jan Humplik, Nicolas Heess, Abbas Abdolmaleki, Dushyant Rao, Markus Wulfmeier, Tuomas Haarnoja:
Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data. CoLLAs 2022: 294-309 - [c32]Arunkumar Byravan, Leonard Hasenclever, Piotr Trochim, Mehdi Mirza, Alessandro Davide Ialongo, Yuval Tassa, Jost Tobias Springenberg, Abbas Abdolmaleki, Nicolas Heess, Josh Merel, Martin A. Riedmiller:
Evaluating Model-Based Planning and Planner Amortization for Continuous Control. ICLR 2022 - [c31]Alex X. Lee, Coline Devin, Jost Tobias Springenberg, Yuxiang Zhou, Thomas Lampe, Abbas Abdolmaleki, Konstantinos Bousmalis:
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation. IROS 2022: 2468-2475 - [d1]Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, S. M. Ali Eslami, Daniel Hennes, Wojciech Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Y. Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, H. Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan D. Tracey, Karl Tuyls, Thore Graepel, Nicolas Heess:
Figure Data for the paper "From Motor Control to Team Play in Simulated Humanoid Football". Zenodo, 2022 - [i29]Wenxuan Zhou, Steven Bohez, Jan Humplik, Abbas Abdolmaleki, Dushyant Rao, Markus Wulfmeier, Tuomas Haarnoja, Nicolas Heess:
Offline Distillation for Robot Lifelong Learning with Imbalanced Experience. CoRR abs/2204.05893 (2022) - [i28]Bobak Shahriari, Abbas Abdolmaleki, Arunkumar Byravan, Abe Friesen, Siqi Liu, Jost Tobias Springenberg, Nicolas Heess, Matt Hoffman, Martin A. Riedmiller:
Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach. CoRR abs/2204.10256 (2022) - [i27]Alex X. Lee, Coline Devin, Jost Tobias Springenberg, Yuxiang Zhou, Thomas Lampe, Abbas Abdolmaleki, Konstantinos Bousmalis:
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation. CoRR abs/2205.03353 (2022) - [i26]Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin A. Riedmiller:
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration. CoRR abs/2211.13743 (2022) - 2021
- [c30]Sandy H. Huang, Abbas Abdolmaleki, Giulia Vezzani, Philemon Brakel, Daniel J. Mankowitz, Michael Neunert, Steven Bohez, Yuval Tassa, Nicolas Heess, Martin A. Riedmiller, Raia Hadsell:
A Constrained Multi-Objective Reinforcement Learning Framework. CoRL 2021: 883-893 - [c29]Alex X. Lee, Coline Manon Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, José Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin A. Riedmiller, Raia Hadsell, Francesco Nori:
Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes. CoRL 2021: 1089-1131 - [c28]Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala, Noah Y. Siegel, Nicolas Heess, Martin A. Riedmiller:
Data-efficient Hindsight Off-policy Option Learning. ICML 2021: 11340-11350 - [i25]William F. Whitney, Michael Bloesch, Jost Tobias Springenberg, Abbas Abdolmaleki, Martin A. Riedmiller:
Rethinking Exploration for Sample-Efficient Policy Learning. CoRR abs/2101.09458 (2021) - [i24]Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, S. M. Ali Eslami, Daniel Hennes, Wojciech M. Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Y. Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, H. Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan D. Tracey, Karl Tuyls, Thore Graepel, Nicolas Heess:
From Motor Control to Team Play in Simulated Humanoid Football. CoRR abs/2105.12196 (2021) - [i23]Abbas Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, Shruti Mishra, Dhruva TB, Arunkumar Byravan, Konstantinos Bousmalis, András György, Csaba Szepesvári, Raia Hadsell, Nicolas Heess, Martin A. Riedmiller:
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning. CoRR abs/2106.08199 (2021) - [i22]Arunkumar Byravan, Leonard Hasenclever, Piotr Trochim, Mehdi Mirza, Alessandro Davide Ialongo, Yuval Tassa, Jost Tobias Springenberg, Abbas Abdolmaleki, Nicolas Heess, Josh Merel, Martin A. Riedmiller:
Evaluating model-based planning and planner amortization for continuous control. CoRR abs/2110.03363 (2021) - [i21]Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, José Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin A. Riedmiller, Raia Hadsell, Francesco Nori:
Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes. CoRR abs/2110.06192 (2021) - 2020
- [c27]Daniel J. Mankowitz, Nir Levine, Rae Jeong, Abbas Abdolmaleki, Jost Tobias Springenberg, Yuanyuan Shi, Jackie Kay, Todd Hester, Timothy A. Mann, Martin A. Riedmiller:
Robust Reinforcement Learning for Continuous Control with Model Misspecification. ICLR 2020 - [c26]Noah Y. Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin A. Riedmiller:
Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning. ICLR 2020 - [c25]H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin A. Riedmiller, Matthew M. Botvinick:
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control. ICLR 2020 - [c24]Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin A. Riedmiller:
A distributional view on multi-objective policy optimization. ICML 2020: 11-22 - [c23]Markus Wulfmeier, Abbas Abdolmaleki, Roland Hafner, Jost Tobias Springenberg, Michael Neunert, Noah Y. Siegel, Tim Hertweck, Thomas Lampe, Nicolas Heess, Martin A. Riedmiller:
Compositional Transfer in Hierarchical Reinforcement Learning. Robotics: Science and Systems 2020 - [i20]Michael Neunert, Abbas Abdolmaleki, Markus Wulfmeier, Thomas Lampe, Jost Tobias Springenberg, Roland Hafner, Francesco Romano, Jonas Buchli, Nicolas Heess, Martin A. Riedmiller:
Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics. CoRR abs/2001.00449 (2020) - [i19]Noah Y. Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin A. Riedmiller:
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning. CoRR abs/2002.08396 (2020) - [i18]Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin A. Riedmiller:
A Distributional View on Multi-Objective Policy Optimization. CoRR abs/2005.07513 (2020) - [i17]Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal M. P. Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alexander Novikov, Sergio Gómez Colmenarejo, Serkan Cabi, Çaglar Gülçehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas:
Acme: A Research Framework for Distributed Reinforcement Learning. CoRR abs/2006.00979 (2020) - [i16]Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala, Noah Y. Siegel, Nicolas Heess, Martin A. Riedmiller:
Data-efficient Hindsight Off-policy Option Learning. CoRR abs/2007.15588 (2020) - [i15]Jost Tobias Springenberg, Nicolas Heess, Daniel J. Mankowitz, Josh Merel, Arunkumar Byravan, Abbas Abdolmaleki, Jackie Kay, Jonas Degrave, Julian Schrittwieser, Yuval Tassa, Jonas Buchli, Dan Belov, Martin A. Riedmiller:
Local Search for Policy Iteration in Continuous Control. CoRR abs/2010.05545 (2020) - [i14]Giulia Vezzani, Michael Neunert, Markus Wulfmeier, Rae Jeong, Thomas Lampe, Noah Y. Siegel, Roland Hafner, Abbas Abdolmaleki, Martin A. Riedmiller, Francesco Nori:
"What, not how": Solving an under-actuated insertion task from scratch. CoRR abs/2010.15492 (2020)
2010 – 2019
- 2019
- [j4]Abbas Abdolmaleki, David Simões, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Contextual Direct Policy Search - With Regularized Covariance Matrix Estimation. J. Intell. Robotic Syst. 96(2): 141-157 (2019) - [c22]Arunkumar Byravan, Jost Tobias Springenberg, Abbas Abdolmaleki, Roland Hafner, Michael Neunert, Thomas Lampe, Noah Y. Siegel, Nicolas Heess, Martin A. Riedmiller:
Imagined Value Gradients: Model-Based Policy Optimization with Tranferable Latent Dynamics Models. CoRL 2019: 566-589 - [c21]Michael Neunert, Abbas Abdolmaleki, Markus Wulfmeier, Thomas Lampe, Jost Tobias Springenberg, Roland Hafner, Francesco Romano, Jonas Buchli, Nicolas Heess, Martin A. Riedmiller:
Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics. CoRL 2019: 735-751 - [c20]Devin Schwab, Jost Tobias Springenberg, Murilo Fernandes Martins, Michael Neunert, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin A. Riedmiller:
Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup. Robotics: Science and Systems 2019 - [i13]Steven Bohez, Abbas Abdolmaleki, Michael Neunert, Jonas Buchli, Nicolas Heess, Raia Hadsell:
Value constrained model-free continuous control. CoRR abs/1902.04623 (2019) - [i12]Devin Schwab, Jost Tobias Springenberg, Murilo F. Martins, Thomas Lampe, Michael Neunert, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin A. Riedmiller:
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup. CoRR abs/1902.04706 (2019) - [i11]Daniel J. Mankowitz, Nir Levine, Rae Jeong, Abbas Abdolmaleki, Jost Tobias Springenberg, Timothy A. Mann, Todd Hester, Martin A. Riedmiller:
Robust Reinforcement Learning for Continuous Control with Model Misspecification. CoRR abs/1906.07516 (2019) - [i10]Markus Wulfmeier, Abbas Abdolmaleki, Roland Hafner, Jost Tobias Springenberg, Michael Neunert, Tim Hertweck, Thomas Lampe, Noah Y. Siegel, Nicolas Heess, Martin A. Riedmiller:
Regularized Hierarchical Policies for Compositional Transfer in Robotics. CoRR abs/1906.11228 (2019) - [i9]H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin A. Riedmiller, Matthew M. Botvinick:
V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control. CoRR abs/1909.12238 (2019) - [i8]Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim, Doina Precup:
Augmenting learning using symmetry in a biologically-inspired domain. CoRR abs/1910.00528 (2019) - [i7]Arunkumar Byravan, Jost Tobias Springenberg, Abbas Abdolmaleki, Roland Hafner, Michael Neunert, Thomas Lampe, Noah Y. Siegel, Nicolas Heess, Martin A. Riedmiller:
Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models. CoRR abs/1910.04142 (2019) - [i6]Rae Jeong, Jackie Kay, Francesco Romano, Thomas Lampe, Thomas Rothörl, Abbas Abdolmaleki, Tom Erez, Yuval Tassa, Francesco Nori:
Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer. CoRR abs/1910.09471 (2019) - [i5]Jonas Degrave, Abbas Abdolmaleki, Jost Tobias Springenberg, Nicolas Heess, Martin A. Riedmiller:
Quinoa: a Q-function You Infer Normalized Over Actions. CoRR abs/1911.01831 (2019) - 2018
- [b1]Abbas Abdolmaleki:
Information theoretic stochastic search. University of Minho, Portugal, 2018 - [j3]Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann:
Model-Free Trajectory-based Policy Optimization with Monotonic Improvement. J. Mach. Learn. Res. 19: 14:1-14:25 (2018) - [c19]Abbas Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Rémi Munos, Nicolas Heess, Martin A. Riedmiller:
Maximum a Posteriori Policy Optimisation. ICLR (Poster) 2018 - [c18]Voot Tangkaratt, Abbas Abdolmaleki, Masashi Sugiyama:
Guide Actor-Critic for Continuous Control. ICLR (Poster) 2018 - [c17]Victor Barbaros, Herke van Hoof, Abbas Abdolmaleki, David Meger:
Eager and Memory-Based Non-Parametric Stochastic Search Methods for Learning Control. ICRA 2018: 1-9 - [i4]Yuval Tassa, Yotam Doron, Alistair Muldal, Tom Erez, Yazhe Li, Diego de Las Casas, David Budden, Abbas Abdolmaleki, Josh Merel, Andrew Lefrancq, Timothy P. Lillicrap, Martin A. Riedmiller:
DeepMind Control Suite. CoRR abs/1801.00690 (2018) - [i3]Abbas Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Rémi Munos, Nicolas Heess, Martin A. Riedmiller:
Maximum a Posteriori Policy Optimisation. CoRR abs/1806.06920 (2018) - [i2]Abbas Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, Nicolas Heess, Martin A. Riedmiller:
Relative Entropy Regularized Policy Iteration. CoRR abs/1812.02256 (2018) - 2017
- [c16]Abbas Abdolmaleki, David Simões, Nuno Lau, Luís Paulo Reis, Bob Price, Gerhard Neumann:
Stochastic Search In Changing Situations. AAAI Workshops 2017 - [c15]Abbas Abdolmaleki, Bob Price, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Deriving and improving CMA-ES with information geometric trust regions. GECCO 2017: 657-664 - [c14]Abbas Abdolmaleki, Bob Price, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Contextual Covariance Matrix Adaptation Evolutionary Strategies. IJCAI 2017: 1378-1385 - 2016
- [j2]Abbas Abdolmaleki, Nuno Lau, Luís Paulo Reis, Jan Peters, Gerhard Neumann:
Contextual Policy Search for Linear and Nonlinear Generalization of a Humanoid Walking Controller. J. Intell. Robotic Syst. 83(3-4): 393-408 (2016) - [c13]Abbas Abdolmaleki, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Contextual Stochastic Search. GECCO (Companion) 2016: 29-30 - [c12]Abbas Abdolmaleki, Rudolf Lioutikov, Nuno Lau, Luís Paulo Reis, Jan Peters, Gerhard Neumann:
Model-Based Relative Entropy Stochastic Search. GECCO (Companion) 2016: 153-154 - [c11]Abbas Abdolmaleki, David Simões, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Contextual Relative Entropy Policy Search with Covariance Matrix Adaptation. ICARSC 2016: 94-99 - [c10]Riad Akrour, Gerhard Neumann, Hany Abdulsamad, Abbas Abdolmaleki:
Model-Free Trajectory Optimization for Reinforcement Learning. ICML 2016: 2961-2970 - [c9]Abbas Abdolmaleki, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Non-parametric contextual stochastic search. IROS 2016: 2643-2648 - [c8]Abbas Abdolmaleki, David Simões, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Learning a Humanoid Kick with Controlled Distance. RoboCup 2016: 45-57 - [i1]Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Gerhard Neumann:
Model-free Trajectory Optimization for Reinforcement Learning. CoRR abs/1606.09197 (2016) - 2015
- [c7]Abbas Abdolmaleki, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Regularized covariance estimation for weighted maximum likelihood policy search methods. Humanoids 2015: 154-159 - [c6]Abbas Abdolmaleki, Nuno Lau, Luís Paulo Reis, Jan Peters, Gerhard Neumann:
Contextual Policy Search for Generalizing a Parameterized Biped Walking Controller. ICARSC 2015: 17-22 - [c5]Abbas Abdolmaleki, Rudolf Lioutikov, Jan Peters, Nuno Lau, Luís Paulo Reis, Gerhard Neumann:
Model-Based Relative Entropy Stochastic Search. NIPS 2015: 3537-3545 - 2014
- [c4]Abbas Abdolmaleki, Nima Shafii, Luís Paulo Reis, Nuno Lau, Jan Peters, Gerhard Neumann:
Omnidirectional Walking with a Compliant Inverted Pendulum Model. IBERAMIA 2014: 481-493 - 2013
- [c3]Nima Shafii, Abbas Abdolmaleki, Rui Ferreira, Nuno Lau, Luís Paulo Reis:
Omnidirectional Walking and Active Balance for Soccer Humanoid Robot. EPIA 2013: 283-294 - 2012
- [j1]Leila Abedi, Mohammad Ali Nematbakhsh, Abbas Abdolmaleki:
A Model for Context Aware Mobile Payment. J. Theor. Appl. Electron. Commer. Res. 7(3): 1-10 (2012) - [c2]Abbas Abdolmaleki, Mostafa Movahedi, Nuno Lau, Luís Paulo Reis:
A Distributed Cooperative Reinforcement Learning Method for Decision Making in Fire Brigade Teams. RoboCup 2012: 237-248 - 2011
- [c1]Abbas Abdolmaleki, Mostafa Movahedi, Sajjad Salehi, Nuno Lau, Luís Paulo Reis:
A Reinforcement Learning Based Method for Optimizing the Process of Decision Making in Fire Brigade Agents. EPIA 2011: 340-351
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-13 23:51 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint