Saurabh Kumar 0004

Person information

affiliation: Stanford University, CA, USA
affiliation: Google Brain, USA

2020 – today

2024
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-06811
Alex Lewandowski, Saurabh Kumar, Dale Schuurmans, András György, Marlos C. Machado:
Learning Continually by Spectral Regularization. CoRR abs/2406.06811 (2024)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-12185
Dilip Arumugam, Saurabh Kumar, Ramki Gummadi, Benjamin Van Roy:
Satisficing Exploration for Deep Reinforcement Learning. CoRR abs/2407.12185 (2024)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-02930
Saurabh Kumar, Hong Jun Jeon, Alex Lewandowski, Benjamin Van Roy:
The Need for a Big World Simulator: A Scientific Challenge for Continual Learning. CoRR abs/2408.02930 (2024)
2023
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2307-04345
Saurabh Kumar, Henrik Marklund, Ashish Rao, Yifan Zhu, Hong Jun Jeon, Yueyang Liu, Benjamin Van Roy:
Continual Learning as Computationally Constrained Reinforcement Learning. CoRR abs/2307.04345 (2023)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-11958
Saurabh Kumar, Henrik Marklund, Benjamin Van Roy:
Maintaining Plasticity via Regenerative Regularization. CoRR abs/2308.11958 (2023)
2022
[c6]
persistent URL: https://dblp.org/rec/conf/icml/Gummadi0WS22
Ramki Gummadi, Saurabh Kumar, Junfeng Wen, Dale Schuurmans:
A Parametric Class of Approximate Gradient Updates for Policy Optimization. ICML 2022: 7998-8015
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-08499
Ramki Gummadi, Saurabh Kumar, Junfeng Wen, Dale Schuurmans:
A Parametric Class of Approximate Gradient Updates for Policy Optimization. CoRR abs/2206.08499 (2022)
2021
[c5]
persistent URL: https://dblp.org/rec/conf/icml/WenKGS21
Junfeng Wen, Saurabh Kumar, Ramki Gummadi, Dale Schuurmans:
Characterizing the Gap Between Actor-Critic and Policy Gradient. ICML 2021: 11101-11111
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-06932
Junfeng Wen, Saurabh Kumar, Ramki Gummadi, Dale Schuurmans:
Characterizing the Gap Between Actor-Critic and Policy Gradient. CoRR abs/2106.06932 (2021)
2020
[c4]
persistent URL: https://dblp.org/rec/conf/nips/KumarKLF20
Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn:
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL. NeurIPS 2020
[c3]
persistent URL: https://dblp.org/rec/conf/nips/YuK0LHF20
Tianhe Yu, Saurabh Kumar, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn:
Gradient Surgery for Multi-Task Learning. NeurIPS 2020
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2001-06782
Tianhe Yu, Saurabh Kumar, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn:
Gradient Surgery for Multi-Task Learning. CoRR abs/2001.06782 (2020)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-14484
Saurabh Kumar, Aviral Kumar, Sergey Levine, Chelsea Finn:
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL. CoRR abs/2010.14484 (2020)

2010 – 2019

2019
[c2]
persistent URL: https://dblp.org/rec/conf/icml/GeladaKBNB19
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare:
DeepMDP: Learning Continuous Latent Space Models for Representation Learning. ICML 2019: 2170-2179
[c1]
persistent URL: https://dblp.org/rec/conf/icml/RowlandDKMBD19
Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney:
Statistics and Samples in Distributional Reinforcement Learning. ICML 2019: 5528-5536
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-1902-08102
Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney:
Statistics and Samples in Distributional Reinforcement Learning. CoRR abs/1902.08102 (2019)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-1906-02736
Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare:
DeepMDP: Learning Continuous Latent Space Models for Representation Learning. CoRR abs/1906.02736 (2019)
2018
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-1812-06110
Pablo Samuel Castro, Subhodeep Moitra, Carles Gelada, Saurabh Kumar, Marc G. Bellemare:
Dopamine: A Research Framework for Deep Reinforcement Learning. CoRR abs/1812.06110 (2018)
