default search action

combined dblp search
author search
venue search
publication search

ask others

Lior Shani

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/TennenholtzCHJS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TennenholtzCHJS24
Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier:
Demystifying Embedding Spaces using Large Language Models. ICLR 2024
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-14655
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14655
Lior Shani, Aviv Rosenberg, Asaf B. Cassel, Oran Lang, Daniele Calandriello, Avital Zipori, Hila Noga, Orgad Keller, Bilal Piot, Idan Szpektor, Avinatan Hassidim, Yossi Matias, Rémi Munos:
Multi-turn Reinforcement Learning from Preference Human Feedback. CoRR abs/2405.14655 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-19107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19107
Pierre Harvey Richemond, Yunhao Tang, Daniel Guo, Daniele Calandriello, Mohammad Gheshlaghi Azar, Rafael Rafailov, Bernardo Ávila Pires, Eugene Tarassov, Lucas Spangher, Will Ellsworth, Aliaksei Severyn, Jonathan Mallinson, Lior Shani, Gil Shamir, Rishabh Joshi, Tianqi Liu, Rémi Munos, Bilal Piot:
Offline Regularised Reinforcement Learning for Large Language Models Alignment. CoRR abs/2405.19107 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00024
Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Lior Shani, Ethan Liang, Craig Boutilier:
Embedding-Aligned Language Models. CoRR abs/2406.00024 (2024)
2023
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/RoitFSACDGGHKMG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/RoitFSACDGGHKMG23
Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos Garea, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. ACL (1) 2023: 6252-6272
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/TennenholtzMSMB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/TennenholtzMSMB23
Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier:
Reinforcement Learning with History Dependent Dynamic Contexts. ICML 2023: 34011-34053
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-02061
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-02061
Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier:
Reinforcement Learning with History-Dependent Dynamic Contexts. CoRR abs/2302.02061 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-00186
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00186
Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor:
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback. CoRR abs/2306.00186 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-04475
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-04475
Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Jihwan Jeong, Lior Shani, Azamat Tulepbergenov, Deepak Ramachandran, Martin Mladenov, Craig Boutilier:
Demystifying Embedding Spaces using Large Language Models. CoRR abs/2310.04475 (2023)
2022
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ShaniZM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ShaniZM22
Lior Shani, Tom Zahavy, Shie Mannor:
Online Apprenticeship Learning. AAAI 2022: 8240-8248
[c5]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/TomarSEG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TomarSEG22
Manan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh:
Mirror Descent Policy Optimization. ICLR 2022
[c4]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/TennenholtzMSMS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/TennenholtzMSMS22
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
Reinforcement Learning with a Terminator. NeurIPS 2022
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-15376
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-15376
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal:
Reinforcement Learning with a Terminator. CoRR abs/2205.15376 (2022)
2021
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-06924
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-06924
Lior Shani, Tom Zahavy, Shie Mannor:
Online Apprenticeship Learning. CoRR abs/2102.06924 (2021)
2020
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ShaniEM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ShaniEM20
Lior Shani, Yonathan Efroni, Shie Mannor:
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs. AAAI 2020: 5668-5675
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ShaniE0M20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ShaniE0M20
Lior Shani, Yonathan Efroni, Aviv Rosenberg, Shie Mannor:
Optimistic Policy Optimization with Bandit Feedback. ICML 2020: 8604-8613
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-08243
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-08243
Yonathan Efroni, Lior Shani, Aviv Rosenberg, Shie Mannor:
Optimistic Policy Optimization with Bandit Feedback. CoRR abs/2002.08243 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09814
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09814
Manan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh:
Mirror Descent Policy Optimization. CoRR abs/2005.09814 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ShaniEM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ShaniEM19
Lior Shani, Yonathan Efroni, Shie Mannor:
Exploration Conscious Reinforcement Learning Revisited. ICML 2019: 5680-5689
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-02769
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-02769
Lior Shani, Yonathan Efroni, Shie Mannor:
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs. CoRR abs/1909.02769 (2019)
2018
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1812-05551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-05551
Lior Shani, Yonathan Efroni, Shie Mannor:
Revisiting Exploration-Conscious Reinforcement Learning. CoRR abs/1812.05551 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1812-07010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-07010
Mark Kozdoba, Edward Moroshko, Lior Shani, Takuya Takagi, Takashi Katoh, Shie Mannor, Koby Crammer:
Multi Instance Learning For Unbalanced Data. CoRR abs/1812.07010 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.