default search action

combined dblp search
author search
venue search
publication search

ask others

Joar Skalse

Joar Max Viktor Skalse

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/KarwowskiHBKGS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KarwowskiHBKGS24
Jacek Karwowski, Oliver Hayman, Xingjian Bai, Klaus Kiendlhofer, Charlie Griffin, Joar Max Viktor Skalse:
Goodhart's Law in Reinforcement Learning. ICLR 2024
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/SkalseA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SkalseA24
Joar Max Viktor Skalse, Alessandro Abate:
Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification. ICLR 2024
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/SkalseFMJGA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SkalseFMJGA24
Joar Max Viktor Skalse, Lucy Farnik, Sumeet Ramesh Motwani, Erik Jenner, Adam Gleave, Alessandro Abate:
STARC: A General Framework For Quantifying Differences Between Reward Functions. ICLR 2024
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/SubramaniWHHGS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SubramaniWHHGS24
Rohan Subramani, Marcus Williams, Max Heitmann, Halfdan Holm, Charlie Griffin, Joar Max Viktor Skalse:
On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning. ICLR 2024
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-14811
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-14811
Joar Skalse, Alessandro Abate:
On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks. CoRR abs/2401.14811 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-06854
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-06854
Joar Skalse, Alessandro Abate:
Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification. CoRR abs/2403.06854 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-06624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-06624
David Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark W. Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua B. Tenenbaum:
Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems. CoRR abs/2405.06624 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-15753
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-15753
Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse:
The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret. CoRR abs/2406.15753 (2024)
2023
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/SkalseA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/SkalseA23
Joar Skalse, Alessandro Abate:
Misspecification in Inverse Reinforcement Learning. AAAI 2023: 15136-15143
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/SkalseF0AG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SkalseF0AG23
Joar Max Viktor Skalse, Matthew Farrugia-Roberts, Stuart Russell, Alessandro Abate, Adam Gleave:
Invariance in Policy Optimisation and Partial Identifiability in Reward Learning. ICML 2023: 32033-32058
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/SkalseA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/SkalseA23
Joar Skalse, Alessandro Abate:
On the limitations of Markovian rewards to express multi-objective, risk-sensitive, and modal tasks. UAI 2023: 1974-1984
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-15257
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15257
Joar Skalse, Lucy Farnik, Sumeet Ramesh Motwani, Erik Jenner, Adam Gleave, Alessandro Abate:
STARC: A General Framework For Quantifying Differences Between Reward Functions. CoRR abs/2309.15257 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-09144
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-09144
Jacek Karwowski, Oliver Hayman, Xingjian Bai, Klaus Kiendlhofer, Charlie Griffin, Joar Skalse:
Goodhart's Law in Reinforcement Learning. CoRR abs/2310.09144 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11840
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11840
Rohan Subramani, Marcus Williams, Max Heitmann, Halfdan Holm, Charlie Griffin, Joar Skalse:
On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning. CoRR abs/2310.11840 (2023)
2022
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/SkalseHGA22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/SkalseHGA22
Joar Skalse, Lewis Hammond, Charlie Griffin, Alessandro Abate:
Lexicographic Multi-Objective Reinforcement Learning. IJCAI 2022: 3430-3436
[c3]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/SkalseHKK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SkalseHKK22
Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger:
Defining and Characterizing Reward Gaming. NeurIPS 2022
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-07475
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-07475
Joar Skalse, Matthew Farrugia-Roberts, Stuart Russell, Alessandro Abate, Adam Gleave:
Invariance in Policy Optimisation and Partial Identifiability in Reward Learning. CoRR abs/2203.07475 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-13085
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-13085
Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger:
Defining and Characterizing Reward Hacking. CoRR abs/2209.13085 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-03201
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-03201
Joar Skalse, Alessandro Abate:
Misspecification in Inverse Reinforcement Learning. CoRR abs/2212.03201 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-13769
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-13769
Joar Skalse, Lewis Hammond, Charlie Griffin, Alessandro Abate:
Lexicographic Multi-Objective Reinforcement Learning. CoRR abs/2212.13769 (2022)
2021
[j1]
- view
  - electronic edition @ jmlr.org (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/jmlr/MingardPSL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/MingardPSL21
Chris Mingard, Guillermo Valle Pérez, Joar Skalse, Ard A. Louis:
Is SGD a Bayesian sampler? Well, almost. J. Mach. Learn. Res. 22: 79:1-79:64 (2021)
[c2]
- view
  - electronic edition @ ceur-ws.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aaai/LeechSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LeechSS21
Gavin Leech, Nandi Schoots, Joar Skalse:
Safety Properties of Inductive Logic Programming. SafeAI@AAAI 2021
[c1]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/BellLOS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BellLOS21
James Bell, Linda Linsefors, Caspar Oesterheld, Joar Skalse:
Reinforcement Learning in Newcomblike Environments. NeurIPS 2021: 22146-22157
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2101-00280
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-00280
Joar Skalse:
A General Counterexample to Any Decision Theory and Some Responses. CoRR abs/2101.00280 (2021)
2020
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-15191
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-15191
Chris Mingard, Guillermo Valle Pérez, Joar Skalse, Ard A. Louis:
Is SGD a Bayesian sampler? Well, almost. CoRR abs/2006.15191 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1906-01820
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-01820
Evan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, Scott Garrabrant:
Risks from Learned Optimization in Advanced Machine Learning Systems. CoRR abs/1906.01820 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-11522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11522
Chris Mingard, Joar Skalse, Guillermo Valle Pérez, David Martínez-Rubio, Vladimir Mikulik, Ard A. Louis:
Neural networks are a priori biased towards Boolean functions with low entropy. CoRR abs/1909.11522 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.