default search action
Tristan Thrush
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c17]Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush:
ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation. ACL (Findings) 2024: 1716-1726 - [c16]Tristan Thrush, Jared Moore, Miguel Monares, Christopher Potts, Douwe Kiela:
I am a Strange Dataset: Metalinguistic Tests for Language Models. ACL (1) 2024: 8888-8907 - [i20]Tristan Thrush, Jared Moore, Miguel Monares, Christopher Potts, Douwe Kiela:
I am a Strange Dataset: Metalinguistic Tests for Language Models. CoRR abs/2401.05300 (2024) - [i19]Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush:
ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation. CoRR abs/2402.04492 (2024) - [i18]Tristan Thrush, Christopher Potts, Tatsunori Hashimoto:
Improving Pretraining Data Using Perplexity Correlations. CoRR abs/2409.05816 (2024) - 2023
- [c15]Mark Mazumder, Colby R. Banbury, Xiaozhe Yao, Bojan Karlas, William Gaviria Rojas, Sudnya Frederick Diamos, Greg Diamos, Lynn He, Alicia Parrish, Hannah Rose Kirk, Jessica Quaye, Charvi Rastogi, Douwe Kiela, David Jurado, David Kanter, Rafael Mosquera, Will Cukierski, Juan Ciro, Lora Aroyo, Bilge Acun, Lingjiao Chen, Mehul Raje, Max Bartolo, Evan Sabri Eyuboglu, Amirata Ghorbani, Emmett D. Goodman, Addison Howard, Oana Inel, Tariq Kane, Christine R. Kirkpatrick, D. Sculley, Tzu-Sheng Kuo, Jonas W. Mueller, Tristan Thrush, Joaquin Vanschoren, Margaret Warren, Adina Williams, Serena Yeung, Newsha Ardalani, Praveen K. Paritosh, Ce Zhang, James Y. Zou, Carole-Jean Wu, Cody Coleman, Andrew Y. Ng, Peter Mattson, Vijay Janapa Reddi:
DataPerf: Benchmarks for Data-Centric AI Development. NeurIPS 2023 - [i17]Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Sasko, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa, Paulo Villegas, Tristan Thrush, Shayne Longpre, Sebastian Nagel, Leon Weber, Manuel Muñoz, Jian Zhu, Daniel van Strien, Zaid Alyafeai, Khalid Almubarak, Minh Chien Vu, Itziar Gonzalez-Dios, Aitor Soroa, Kyle Lo, Manan Dey, Pedro Ortiz Suarez, Aaron Gokaslan, Shamik Bose, David Ifeoluwa Adelani, Long Phan, Hieu Tran, Ian Yu, Suhas Pai, Jenny Chim, Violette Lepercq, Suzana Ilic, Margaret Mitchell, Sasha Luccioni, Yacine Jernite:
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset. CoRR abs/2303.03915 (2023) - [i16]William Berrios, Gautam Mittal, Tristan Thrush, Douwe Kiela, Amanpreet Singh:
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language. CoRR abs/2306.16410 (2023) - 2022
- [c14]Tristan Thrush, Kushal Tirumala, Anmol Gupta, Max Bartolo, Pedro Rodriguez, Tariq Kane, William Gaviria Rojas, Peter Mattson, Adina Williams, Douwe Kiela:
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks. ACL (demo) 2022: 174-181 - [c13]Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross:
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality. CVPR 2022: 5228-5238 - [c12]Leandro von Werra, Lewis Tunstall, Abhishek Thakur, Sasha Luccioni, Tristan Thrush, Aleksandra Piktus, Felix Marty, Nazneen Rajani, Victor Mustar, Helen Ngo:
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements. EMNLP (Demos) 2022: 128-136 - [c11]Hannah Kirk, Bertie Vidgen, Paul Röttger, Tristan Thrush, Scott A. Hale:
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-Based Hate. NAACL-HLT 2022: 1352-1368 - [c10]Max Bartolo, Tristan Thrush, Sebastian Riedel, Pontus Stenetorp, Robin Jia, Douwe Kiela:
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants. NAACL-HLT 2022: 3754-3767 - [c9]Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki, Albert Villanova del Moral, Teven Le Scao, Leandro von Werra, Chenghao Mou, Eduardo González Ponferrada, Huu Nguyen, Jörg Frohberg, Mario Sasko, Quentin Lhoest, Angelina McMillan-Major, Gérard Dupont, Stella Biderman, Anna Rogers, Loubna Ben Allal, Francesco De Toni, Giada Pistilli, Olivier Nguyen, Somaieh Nikpoor, Maraim Masoud, Pierre Colombo, Javier de la Rosa, Paulo Villegas, Tristan Thrush, Shayne Longpre, Sebastian Nagel, Leon Weber, Manuel Muñoz, Jian Zhu, Daniel van Strien, Zaid Alyafeai, Khalid Almubarak, Minh Chien Vu, Itziar Gonzalez-Dios, Aitor Soroa, Kyle Lo, Manan Dey, Pedro Ortiz Suarez, Aaron Gokaslan, Shamik Bose, David Ifeoluwa Adelani, Long Phan, Hieu Tran, Ian Yu, Suhas Pai, Jenny Chim, Violette Lepercq, Suzana Ilic, Margaret Mitchell, Alexandra Sasha Luccioni, Yacine Jernite:
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset. NeurIPS 2022 - [i15]Tristan Thrush, Kushal Tirumala, Anmol Gupta, Max Bartolo, Pedro Rodriguez, Tariq Kane, William Gaviria Rojas, Peter Mattson, Adina Williams, Douwe Kiela:
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks. CoRR abs/2204.01906 (2022) - [i14]Tristan Thrush, Ryan Jiang, Max Bartolo, Amanpreet Singh, Adina Williams, Douwe Kiela, Candace Ross:
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality. CoRR abs/2204.03162 (2022) - [i13]Mark Mazumder, Colby R. Banbury, Xiaozhe Yao, Bojan Karlas, William Gaviria Rojas, Sudnya Frederick Diamos, Greg Diamos, Lynn He, Douwe Kiela, David Jurado, David Kanter, Rafael Mosquera, Juan Ciro, Lora Aroyo, Bilge Acun, Sabri Eyuboglu, Amirata Ghorbani, Emmett D. Goodman, Tariq Kane, Christine R. Kirkpatrick, Tzu-Sheng Kuo, Jonas Mueller, Tristan Thrush, Joaquin Vanschoren, Margaret Warren, Adina Williams, Serena Yeung, Newsha Ardalani, Praveen K. Paritosh, Ce Zhang, James Zou, Carole-Jean Wu, Cody Coleman, Andrew Y. Ng, Peter Mattson, Vijay Janapa Reddi:
DataPerf: Benchmarks for Data-Centric AI Development. CoRR abs/2207.10062 (2022) - [i12]Leandro von Werra, Lewis Tunstall, Abhishek Thakur, Alexandra Sasha Luccioni, Tristan Thrush, Aleksandra Piktus, Felix Marty, Nazneen Rajani, Victor Mustar, Helen Ngo, Omar Sanseviero, Mario Sasko, Albert Villanova del Moral, Quentin Lhoest, Julien Chaumond, Margaret Mitchell, Alexander M. Rush, Thomas Wolf, Douwe Kiela:
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements. CoRR abs/2210.01970 (2022) - [i11]Margaret Mitchell, Alexandra Sasha Luccioni, Nathan Lambert, Marissa Gerchick, Angelina McMillan-Major, Ezinwanne Ozoani, Nazneen Rajani, Tristan Thrush, Yacine Jernite, Douwe Kiela:
Measuring Data. CoRR abs/2212.05129 (2022) - 2021
- [j1]Tu-Hoa Pham, William Seto, Shreyansh Daftry, Barry Ridge, Johanna Hansen, Tristan Thrush, Mark Van der Merwe, Gerard Maggiolino, Alexander Brinkman, John Mayo, Yang Cheng, Curtis Padgett, Eric A. Kulczycki, Renaud Detry:
Rover Relocalization for Mars Sample Return by Virtual Template Synthesis and Matching. IEEE Robotics Autom. Lett. 6(2): 4009-4016 (2021) - [c8]Bertie Vidgen, Tristan Thrush, Zeerak Waseem, Douwe Kiela:
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection. ACL/IJCNLP (1) 2021: 1667-1682 - [c7]Max Bartolo, Tristan Thrush, Robin Jia, Sebastian Riedel, Pontus Stenetorp, Douwe Kiela:
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation. EMNLP (1) 2021: 8830-8848 - [c6]Douwe Kiela, Max Bartolo, Yixin Nie, Divyansh Kaushik, Atticus Geiger, Zhengxuan Wu, Bertie Vidgen, Grusha Prasad, Amanpreet Singh, Pratik Ringshia, Zhiyi Ma, Tristan Thrush, Sebastian Riedel, Zeerak Waseem, Pontus Stenetorp, Robin Jia, Mohit Bansal, Christopher Potts, Adina Williams:
Dynabench: Rethinking Benchmarking in NLP. NAACL-HLT 2021: 4110-4124 - [c5]Zhiyi Ma, Kawin Ethayarajh, Tristan Thrush, Somya Jain, Ledell Wu, Robin Jia, Christopher Potts, Adina Williams, Douwe Kiela:
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking. NeurIPS 2021: 10351-10367 - [c4]Sasha Sheng, Amanpreet Singh, Vedanuj Goswami, Jose Alberto Lopez Magana, Tristan Thrush, Wojciech Galuba, Devi Parikh, Douwe Kiela:
Human-Adversarial Visual Question Answering. NeurIPS 2021: 20346-20359 - [c3]Guillaume Wenzek, Vishrav Chaudhary, Angela Fan, Sahir Gomez, Naman Goyal, Somya Jain, Douwe Kiela, Tristan Thrush, Francisco Guzmán:
Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation. WMT@EMNLP 2021: 89-99 - [i10]Tu-Hoa Pham, William Seto, Shreyansh Daftry, Barry Ridge, Johanna Hansen, Tristan Thrush, Mark Van der Merwe, Gerard Maggiolino, Alexander Brinkman, John Mayo, Yang Cheng, Curtis Padgett, Eric A. Kulczycki, Renaud Detry:
Rover Relocalization for Mars Sample Return by Virtual Template Synthesis and Matching. CoRR abs/2103.03395 (2021) - [i9]Max Bartolo, Tristan Thrush, Robin Jia, Sebastian Riedel, Pontus Stenetorp, Douwe Kiela:
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation. CoRR abs/2104.08678 (2021) - [i8]Douwe Kiela, Max Bartolo, Yixin Nie, Divyansh Kaushik, Atticus Geiger, Zhengxuan Wu, Bertie Vidgen, Grusha Prasad, Amanpreet Singh, Pratik Ringshia, Zhiyi Ma, Tristan Thrush, Sebastian Riedel, Zeerak Waseem, Pontus Stenetorp, Robin Jia, Mohit Bansal, Christopher Potts, Adina Williams:
Dynabench: Rethinking Benchmarking in NLP. CoRR abs/2104.14337 (2021) - [i7]Zhiyi Ma, Kawin Ethayarajh, Tristan Thrush, Somya Jain, Ledell Wu, Robin Jia, Christopher Potts, Adina Williams, Douwe Kiela:
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking. CoRR abs/2106.06052 (2021) - [i6]Hannah Rose Kirk, Bertram Vidgen, Paul Röttger, Tristan Thrush, Scott A. Hale:
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate. CoRR abs/2108.05921 (2021) - [i5]Max Bartolo, Tristan Thrush, Sebastian Riedel, Pontus Stenetorp, Robin Jia, Douwe Kiela:
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants. CoRR abs/2112.09062 (2021) - 2020
- [c2]Tristan Thrush, Ethan Wilcox, Roger Levy:
Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization. BlackboxNLP@EMNLP 2020: 265-275 - [c1]Tristan Thrush:
Compositional Neural Machine Translation by Removing the Lexicon from Syntax. CogSci 2020 - [i4]Tristan Thrush:
Compositional Neural Machine Translation by Removing the Lexicon from Syntax. CoRR abs/2002.08899 (2020) - [i3]Adina Williams, Tristan Thrush, Douwe Kiela:
ANLIzing the Adversarial Natural Language Inference Dataset. CoRR abs/2010.12729 (2020) - [i2]Tristan Thrush, Ethan Wilcox, Roger Levy:
Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization. CoRR abs/2011.02417 (2020) - [i1]Bertie Vidgen, Tristan Thrush, Zeerak Waseem, Douwe Kiela:
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection. CoRR abs/2012.15761 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-10 22:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint