
Javier Rando


2020 – today


2024
[c4]
Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He:
Personas as a Way to Model Truthfulness in Language Models. EMNLP 2024: 6346-6359
[c3]
Javier Rando, Florian Tramèr:
Universal Jailbreak Backdoors from Poisoned Human Feedback. ICLR 2024
[i14]
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, José Hernández-Orallo, Lewis Hammond, Eric J. Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob N. Foerster, Florian Tramèr, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger:
Foundational Challenges in Assuring Alignment and Safety of Large Language Models. CoRR abs/2404.09932 (2024)
[i13]
Javier Rando, Francesco Croce, Krystof Mitka, Stepan Shabalin, Maksym Andriushchenko, Nicolas Flammarion, Florian Tramèr:
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs. CoRR abs/2404.14461 (2024)
[i12]
Edoardo Debenedetti, Javier Rando, Daniel Paleka, Silaghi Fineas Florin, Dragos Albastroiu, Niv Cohen, Yuval Lemberg, Reshmi Ghosh, Rui Wen, Ahmed Salem, Giovanni Cherubin, Santiago Zanella Béguelin, Robin Schmid, Victor Klemm, Takahiro Miki, Chenhao Li, Stefan Kraft, Mario Fritz, Florian Tramèr, Sahar Abdelnabi, Lea Schönherr:
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition. CoRR abs/2406.07954 (2024)
[i11]
Eyal Aharoni, Sharlene Fernandes, Daniel J. Brady, Caelan Alexander, Michael Criner, Kara Queen, Javier Rando, Eddy Nahmias, Victor Crespo:
Attributions toward Artificial Agents in a modified Moral Turing Test. CoRR abs/2406.11854 (2024)
[i10]
Robert Hönig, Javier Rando, Nicholas Carlini, Florian Tramèr:
Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI. CoRR abs/2406.12027 (2024)
[i9]
Javier Rando, Hannah Korevaar, Erik Brinkman, Ivan Evtimov, Florian Tramèr:
Gradient-based Jailbreak Images for Multimodal Fusion Models. CoRR abs/2410.03489 (2024)
2023
[j1]
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca D. Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell:
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. Trans. Mach. Learn. Res. 2023 (2023)
[c2]
Javier Rando, Fernando Pérez-Cruz, Briland Hitaj:
PassGPT: Password Modeling and (Guided) Generation with Large Language Models. ESORICS (4) 2023: 164-183
[i8]
Javier Rando, Fernando Pérez-Cruz, Briland Hitaj:
PassGPT: Password Modeling and (Guided) Generation with Large Language Models. CoRR abs/2306.01545 (2023)
[i7]
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca D. Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell:
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. CoRR abs/2307.15217 (2023)
[i6]
Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He:
Personas as a Way to Model Truthfulness in Language Models. CoRR abs/2310.18168 (2023)
[i5]
Rusheb Shah, Quentin Feuillade-Montixi, Soroush Pour, Arush Tagade, Stephen Casper, Javier Rando:
Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation. CoRR abs/2311.03348 (2023)
[i4]
Javier Rando, Florian Tramèr:
Universal Jailbreak Backdoors from Poisoned Human Feedback. CoRR abs/2311.14455 (2023)
2022
[i3]
Javier Rando, Nasib Naimi, Thomas Baumann, Max Mathys:
Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINO. CoRR abs/2206.06761 (2022)
[i2]
Javier Rando, Daniel Paleka, David Lindner, Lennart Heim, Florian Tramèr:
Red-Teaming the Stable Diffusion Safety Filter. CoRR abs/2210.04610 (2022)
2020
[c1]
Valerio Lorini, Javier Rando, Diego Sáez-Trumper, Carlos Castillo:
Uneven Coverage of Natural Disasters in Wikipedia: The Case of Floods. ISCRAM 2020: 688-703
[i1]
Valerio Lorini, Javier Rando, Diego Sáez-Trumper, Carlos Castillo:
Uneven Coverage of Natural Disasters in Wikipedia: the Case of Flood. CoRR abs/2001.08810 (2020)
