default search action
Yonatan Belinkov
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j11]Edo Dotan, Gal Jaschek, Tal Pupko, Yonatan Belinkov:
Effect of tokenization on transformers for biological sequences. Bioinform. 40(4) (2024) - [c80]Alon Mor, Yonatan Belinkov, Benny Kimelfeld:
Accelerating the Global Aggregation of Local Explanations. AAAI 2024: 18807-18814 - [c79]Boaz Carmeli, Yonatan Belinkov, Ron Meir:
Concept-Best-Matching: Evaluating Compositionality In Emergent Communication. ACL (Findings) 2024: 3186-3194 - [c78]Michael Toker, Hadas Orgad, Mor Ventura, Dana Arad, Yonatan Belinkov:
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines. ACL (1) 2024: 9713-9728 - [c77]Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham:
Generating Benchmarks for Factuality Evaluation of Language Models. EACL (1) 2024: 49-66 - [c76]Michael Toker, Oren Mishali, Ophir Münz-Manor, Benny Kimelfeld, Yonatan Belinkov:
A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry. EACL (2) 2024: 443-453 - [c75]Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf:
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space. EMNLP 2024: 2390-2422 - [c74]Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov:
Fast Forwarding Low-Rank Training. EMNLP 2024: 9553-9562 - [c73]Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau:
Linearity of Relation Decoding in Transformer Language Models. ICLR 2024 - [c72]Nikhil Prakash, Tamar Rott Shaham, Tal Haklay, Yonatan Belinkov, David Bau:
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking. ICLR 2024 - [c71]Shadi Iskander, Kira Radinsky, Yonatan Belinkov:
Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information. NAACL (Short Papers) 2024: 379-390 - [c70]Dana Arad, Hadas Orgad, Yonatan Belinkov:
ReFACT: Updating Text-to-Image Models by Editing the Text Encoder. NAACL-HLT 2024: 2537-2558 - [c69]Adir Rahamim, Yonatan Belinkov:
ContraSim - Analyzing Neural Representations Based on Contrastive Learning. NAACL-HLT 2024: 6325-6339 - [c68]Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzynska, David Bau:
Unified Concept Editing in Diffusion Models. WACV 2024: 5099-5108 - [i93]Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf:
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space. CoRR abs/2402.12865 (2024) - [i92]Nikhil Prakash, Tamar Rott Shaham, Tal Haklay, Yonatan Belinkov, David Bau:
Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking. CoRR abs/2402.14811 (2024) - [i91]Michael Toker, Oren Mishali, Ophir Münz-Manor, Benny Kimelfeld, Yonatan Belinkov:
A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry. CoRR abs/2402.17371 (2024) - [i90]Michael Toker, Hadas Orgad, Mor Ventura, Dana Arad, Yonatan Belinkov:
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines. CoRR abs/2403.05846 (2024) - [i89]Shadi Iskander, Kira Radinsky, Yonatan Belinkov:
Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information. CoRR abs/2403.09516 (2024) - [i88]Boaz Carmeli, Yonatan Belinkov, Ron Meir:
Concept-Best-Matching: Evaluating Compositionality in Emergent Communication. CoRR abs/2403.14705 (2024) - [i87]Michael Hanna, Sandro Pezzelle, Yonatan Belinkov:
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms. CoRR abs/2403.17806 (2024) - [i86]Samuel Marks, Can Rager, Eric J. Michaud, Yonatan Belinkov, David Bau, Aaron Mueller:
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models. CoRR abs/2403.19647 (2024) - [i85]Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-Shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham:
Jamba: A Hybrid Transformer-Mamba Language Model. CoRR abs/2403.19887 (2024) - [i84]Adi Simhi, Jonathan Herzig, Idan Szpektor, Yonatan Belinkov:
Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs. CoRR abs/2404.09971 (2024) - [i83]Zachary Bamberger, Ofek Glick, Chaim Baskin, Yonatan Belinkov:
DEPTH: Discourse Education through Pre-Training Hierarchically. CoRR abs/2405.07788 (2024) - [i82]Tomer Ashuach, Martin Tutek, Yonatan Belinkov:
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space. CoRR abs/2406.09325 (2024) - [i81]Alessandro Stolfo, Ben Wu, Wes Gurnee, Yonatan Belinkov, Xingyi Song, Mrinmaya Sachan, Neel Nanda:
Confidence Regulation Neurons in Language Models. CoRR abs/2406.16254 (2024) - [i80]Sarah Wiegreffe, Oyvind Tafjord, Yonatan Belinkov, Hannaneh Hajishirzi, Ashish Sabharwal:
Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions. CoRR abs/2407.15018 (2024) - [i79]Aaron Mueller, Jannik Brinkmann, Millicent L. Li, Samuel Marks, Koyena Pal, Nikhil Prakash, Can Rager, Aruna Sankaranarayanan, Arnab Sen Sharma, Jiuding Sun, Eric Todd, David Bau, Yonatan Belinkov:
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability. CoRR abs/2408.01416 (2024) - [i78]Barak Lenz, Alan Arazi, Amir Bergman, Avshalom Manevich, Barak Peleg, Ben Aviram, Chen Almagor, Clara Fridman, Dan Padnos, Daniel Gissin, Daniel Jannai, Dor Muhlgay, Dor Zimberg, Edden M. Gerber, Elad Dolev, Eran Krakovsky, Erez Safahi, Erez Schwartz, Gal Cohen, Gal Shachaf, Haim Rozenblum, Hofit Bata, Ido Blass, Inbal Magar, Itay Dalmedigos, Jhonathan Osin, Julie Fadlon, Maria Rozman, Matan Danos, Michael Gokhman, Mor Zusman, Naama Gidron, Nir Ratner, Noam Gat, Noam Rozen, Oded Fried, Ohad Leshno, Omer Antverg, Omri Abend, Opher Lieber, Or Dagan, Orit Cohavi, Raz Alon, Ro'i Belson, Roi Cohen, Rom Gilad, Roman Glozman, Shahar Lev, Shaked Meirom, Tal Delbari, Tal Ness, Tomer Asida, Tom Ben Gal, Tom Braude, Uriya Pumerantz, Yehoshua Cohen, Yonatan Belinkov, Yuval Globerson, Yuval Peleg Levy, Yoav Shoham:
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale. CoRR abs/2408.12570 (2024) - [i77]Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov:
Fast Forwarding Low-Rank Training. CoRR abs/2409.04206 (2024) - [i76]Hadas Orgad, Michael Toker, Zorik Gekhman, Roi Reichart, Idan Szpektor, Hadas Kotek, Yonatan Belinkov:
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations. CoRR abs/2410.02707 (2024) - [i75]Tsachi Blau, Moshe Kimhi, Yonatan Belinkov, Alexander M. Bronstein, Chaim Baskin:
Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods. CoRR abs/2410.17222 (2024) - [i74]Yaniv Nikankin, Anja Reusch, Aaron Mueller, Yonatan Belinkov:
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics. CoRR abs/2410.21272 (2024) - [i73]Adi Simhi, Jonathan Herzig, Idan Szpektor, Yonatan Belinkov:
Distinguishing Ignorance from Error in LLM Hallucinations. CoRR abs/2410.22071 (2024) - 2023
- [j10]Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Trans. Mach. Learn. Res. 2023 (2023) - [c67]Boaz Carmeli, Ron Meir, Yonatan Belinkov:
Emergent Quantized Communication. AAAI 2023: 11533-11541 - [c66]Ori Ram, Liat Bezalel, Adi Zicher, Yonatan Belinkov, Jonathan Berant, Amir Globerson:
What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary. ACL (1) 2023: 2481-2498 - [c65]Shadi Iskander, Kira Radinsky, Yonatan Belinkov:
Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection. ACL (Findings) 2023: 5961-5977 - [c64]Nir Ratner, Yoav Levine, Yonatan Belinkov, Ori Ram, Inbal Magar, Omri Abend, Ehud Karpas, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham:
Parallel Context Windows for Large Language Models. ACL (1) 2023: 6383-6402 - [c63]Hadas Orgad, Yonatan Belinkov:
BLIND: Bias Removal With No Demographics. ACL (1) 2023: 8801-8821 - [c62]Ophir Münz-Manor, Michael Toker, Oren Mishali, Benny Kimelfeld, Yonatan Belinkov, Adir Cohen:
FigureOut - Automatic Detection of Metaphors in Hebrew Across the Eras. DH 2023 - [c61]Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan:
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis. EMNLP 2023: 7035-7052 - [c60]Michael Hanna, Yonatan Belinkov, Sandro Pezzelle:
When Language Models Fall in Love: Animacy Processing in Transformer Language Models. EMNLP 2023: 12120-12135 - [c59]Shahar Katz, Yonatan Belinkov:
VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers. EMNLP (Findings) 2023: 14094-14113 - [c58]Hadas Orgad, Bahjat Kawar, Yonatan Belinkov:
Editing Implicit Assumptions in Text-to-Image Diffusion Models. ICCV 2023: 7030-7038 - [c57]Edo Dotan, Yonatan Belinkov, Oren Avram, Elya Wygoda, Noa Ecker, Michael Alburquerque, Omri Keren, Gil Loewenthal, Tal Pupko:
Multiple sequence alignment as a sequence-to-sequence learning problem. ICLR 2023 - [c56]Kevin Meng, Arnab Sen Sharma, Alex J. Andonian, Yonatan Belinkov, David Bau:
Mass-Editing Memory in a Transformer. ICLR 2023 - [e5]Yonatan Belinkov, Sophie Hao, Jaap Jumelet, Najoung Kim, Arya McCarthy, Hosein Mohebbi:
Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@EMNLP 2023, Singapore, December 7, 2023. Association for Computational Linguistics 2023, ISBN 979-8-89176-052-3 [contents] - [i72]Hadas Orgad, Bahjat Kawar, Yonatan Belinkov:
Editing Implicit Assumptions in Text-to-Image Diffusion Models. CoRR abs/2303.08084 (2023) - [i71]Adir Rahamim, Yonatan Belinkov:
ContraSim - A Similarity Measure Based on Contrastive Learning. CoRR abs/2303.16992 (2023) - [i70]Shadi Iskander, Kira Radinsky, Yonatan Belinkov:
Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection. CoRR abs/2305.10204 (2023) - [i69]Shahar Katz, Yonatan Belinkov:
Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT. CoRR abs/2305.13417 (2023) - [i68]Alessandro Stolfo, Yonatan Belinkov, Mrinmaya Sachan:
Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis. CoRR abs/2305.15054 (2023) - [i67]Dana Arad, Hadas Orgad, Yonatan Belinkov:
ReFACT: Updating Text-to-Image Models by Editing the Text Encoder. CoRR abs/2306.00738 (2023) - [i66]Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham:
Generating Benchmarks for Factuality Evaluation of Language Models. CoRR abs/2307.06908 (2023) - [i65]Itay Itzhak, Gabriel Stanovsky, Nir Rosenfeld, Yonatan Belinkov:
Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias. CoRR abs/2308.00225 (2023) - [i64]Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau:
Linearity of Relation Decoding in Transformer Language Models. CoRR abs/2308.09124 (2023) - [i63]Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzynska, David Bau:
Unified Concept Editing in Diffusion Models. CoRR abs/2308.14761 (2023) - [i62]Michael Hanna, Yonatan Belinkov, Sandro Pezzelle:
When Language Models Fall in Love: Animacy Processing in Transformer Language Models. CoRR abs/2310.15004 (2023) - [i61]Alon Mor, Yonatan Belinkov, Benny Kimelfeld:
Accelerating the Global Aggregation of Local Explanations. CoRR abs/2312.07991 (2023) - 2022
- [j9]Yonatan Belinkov:
Probing Classifiers: Promises, Shortcomings, and Advances. Comput. Linguistics 48(1): 207-219 (2022) - [c55]Joe Stacey, Yonatan Belinkov, Marek Rei:
Supervising Model Attention with Human Explanations for Robust Natural Language Inference. AAAI 2022: 11349-11357 - [c54]Kerem Zaman, Yonatan Belinkov:
A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference. EMNLP 2022: 1556-1576 - [c53]Omer Antverg, Yonatan Belinkov:
On the Pitfalls of Analyzing Individual Neurons in Language Models. ICLR 2022 - [c52]Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov:
How Gender Debiasing Affects Internal Model Representations, and Why It Matters. NAACL-HLT 2022: 2602-2628 - [c51]Rachit Bansal, Danish Pruthi, Yonatan Belinkov:
Measures of Information Reflect Memorization Patterns. NeurIPS 2022 - [c50]Kevin Meng, David Bau, Alex Andonian, Yonatan Belinkov:
Locating and Editing Factual Associations in GPT. NeurIPS 2022 - [c49]Dimion Asael, Zachary M. Ziegler, Yonatan Belinkov:
A Generative Approach for Mitigating Structural Biases in Natural Language Inference. *SEM@NAACL-HLT 2022: 186-199 - [e4]Jasmijn Bastings, Yonatan Belinkov, Yanai Elazar, Dieuwke Hupkes, Naomi Saphra, Sarah Wiegreffe:
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@EMNLP 2022, Abu Dhabi, United Arab Emirates (Hybrid), December 8, 2022. Association for Computational Linguistics 2022, ISBN 978-1-959429-05-0 [contents] - [i60]Kevin Meng, David Bau, Alex Andonian, Yonatan Belinkov:
Locating and Editing Factual Knowledge in GPT. CoRR abs/2202.05262 (2022) - [i59]Kerem Zaman, Yonatan Belinkov:
A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference. CoRR abs/2204.05428 (2022) - [i58]Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov:
How Gender Debiasing Affects Internal Model Representations, and Why It Matters. CoRR abs/2204.06827 (2022) - [i57]Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, Dor Muhlgay, Noam Rozen, Erez Schwartz, Gal Shachaf, Shai Shalev-Shwartz, Amnon Shashua, Moshe Tennenholtz:
MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning. CoRR abs/2205.00445 (2022) - [i56]Omer Antverg, Eyal Ben-David, Yonatan Belinkov:
IDANI: Inference-time Domain Adaptation via Neuron-level Interventions. CoRR abs/2206.00259 (2022) - [i55]Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew M. Dai, Andrew La, Andrew K. Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakas, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartlomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, Cèsar Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodolà, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan J. Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, François Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocon, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse H. Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, José Hernández-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Senel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, María José Ramírez-Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael I. Ivanitskiy, Michael Starritt, Michael Strube, Michal Swedrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T., Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Milkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima (Shammie) Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay V. Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, Zirui Wang, Ziyi Wu:
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. CoRR abs/2206.04615 (2022) - [i54]Yanai Elazar, Nora Kassner, Shauli Ravfogel, Amir Feder, Abhilasha Ravichander, Marius Mosbach, Yonatan Belinkov, Hinrich Schütze, Yoav Goldberg:
Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions. CoRR abs/2207.14251 (2022) - [i53]Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, David Bau:
Mass-Editing Memory in a Transformer. CoRR abs/2210.07229 (2022) - [i52]Rachit Bansal, Danish Pruthi, Yonatan Belinkov:
Measures of Information Reflect Memorization Patterns. CoRR abs/2210.09404 (2022) - [i51]Hadas Orgad, Yonatan Belinkov:
Choose Your Lenses: Flaws in Gender Bias Evaluation. CoRR abs/2210.11471 (2022) - [i50]Boaz Carmeli, Ron Meir, Yonatan Belinkov:
Emergent Quantized Communication. CoRR abs/2211.02412 (2022) - [i49]Ori Ram, Liat Bezalel, Adi Zicher, Yonatan Belinkov, Jonathan Berant, Amir Globerson:
What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary. CoRR abs/2212.10380 (2022) - [i48]Hadas Orgad, Yonatan Belinkov:
Debiasing NLP Models Without Demographic Information. CoRR abs/2212.10563 (2022) - [i47]Nir Ratner, Yoav Levine, Yonatan Belinkov, Ori Ram, Omri Abend, Ehud Karpas, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham:
Parallel Context Windows Improve In-Context Learning of Large Language Models. CoRR abs/2212.10947 (2022) - 2021
- [c48]Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart M. Shieber, Tal Linzen, Yonatan Belinkov:
Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models. ACL/IJCNLP (1) 2021: 1828-1843 - [c47]Abhilasha Ravichander, Yonatan Belinkov, Eduard H. Hovy:
Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance? EACL 2021: 3363-3377 - [c46]Michael Mendelson, Yonatan Belinkov:
Debiasing Methods in Natural Language Understanding Make Bias More Accessible. EMNLP (1) 2021: 1545-1557 - [c45]Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. ICASSP 2021: 3040-3044 - [c44]Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson:
Variational Information Bottleneck for Effective Low-Resource Fine-Tuning. ICLR 2021 - [c43]Victor Sanh, Thomas Wolf, Yonatan Belinkov, Alexander M. Rush:
Learning from others' mistakes: Avoiding dataset biases without modeling them. ICLR 2021 - [c42]Yana Dranker, He He, Yonatan Belinkov:
IRM - when it works and when it doesn't: A test case of natural language inference. NeurIPS 2021: 18212-18224 - [e3]Jasmijn Bastings, Yonatan Belinkov, Emmanuel Dupoux, Mario Giulianelli, Dieuwke Hupkes, Yuval Pinter, Hassan Sajjad:
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@EMNLP 2021, Punta Cana, Dominican Republic, November 11, 2021. Association for Computational Linguistics 2021, ISBN 978-1-955917-06-3 [contents] - [i46]Yonatan Belinkov:
Probing Classifiers: Promises, Shortcomings, and Alternatives. CoRR abs/2102.12452 (2021) - [i45]Joe Stacey, Yonatan Belinkov, Marek Rei:
Natural Language Inference with a Human Touch: Using Human Explanations to Guide Model Attention. CoRR abs/2104.08142 (2021) - [i44]Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson:
Variational Information Bottleneck for Effective Low-Resource Fine-Tuning. CoRR abs/2106.05469 (2021) - [i43]Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart M. Shieber, Tal Linzen, Yonatan Belinkov:
Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models. CoRR abs/2106.06087 (2021) - [i42]Dimion Asael, Zachary M. Ziegler, Yonatan Belinkov:
A Generative Approach for Mitigating Structural Biases in Natural Language Inference. CoRR abs/2108.14006 (2021) - [i41]Michael Mendelson, Yonatan Belinkov:
Debiasing Methods in Natural Language Understanding Make Bias More Accessible. CoRR abs/2109.04095 (2021) - [i40]Omer Antverg, Yonatan Belinkov:
On the Pitfalls of Analyzing Individual Neurons in Language Models. CoRR abs/2110.07483 (2021) - 2020
- [j8]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
On the Linguistic Representational Power of Neural Machine Translation Models. Comput. Linguistics 46(1): 1-52 (2020) - [c41]Yonatan Belinkov, Sebastian Gehrmann, Ellie Pavlick:
Interpretability and Analysis in Neural NLP. ACL (tutorial) 2020: 1-5 - [c40]John M. Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Similarity Analysis of Contextual Word Representation Models. ACL 2020: 4638-4655 - [c39]Mostafa Abdou, Vinit Ravishankar, Maria Barrett, Yonatan Belinkov, Desmond Elliott, Anders Søgaard:
The Sensitivity of Language Models and Humans to Winograd Schema Perturbations. ACL 2020: 7590-7604 - [c38]Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson:
End-to-End Bias Mitigation by Modelling Biases in Corpora. ACL 2020: 8706-8716 - [c37]Nadir Durrani, Hassan Sajjad, Fahim Dalvi, Yonatan Belinkov:
Analyzing Individual Neurons in Pre-trained Language Models. EMNLP (1) 2020: 4865-4880 - [c36]Fahim Dalvi, Hassan Sajjad, Nadir Durrani, Yonatan Belinkov:
Analyzing Redundancy in Pretrained Transformer Models. EMNLP (1) 2020: 4908-4926 - [c35]Jonathan S. Rosenfeld, Amir Rosenfeld, Yonatan Belinkov, Nir Shavit:
A Constructive Prediction of the Generalization Error Across Scales. ICLR 2020 - [c34]Jesse Vig, Sebastian Gehrmann, Yonatan Belinkov, Sharon Qian, Daniel Nevo, Yaron Singer, Stuart M. Shieber:
Investigating Gender Bias in Language Models Using Causal Mediation Analysis. NeurIPS 2020 - [c33]Lucia Specia, Zhenhao Li, Juan Miguel Pino, Vishrav Chaudhary, Francisco Guzmán, Graham Neubig, Nadir Durrani, Yonatan Belinkov, Philipp Koehn, Hassan Sajjad, Paul Michel, Xian Li:
Findings of the WMT 2020 Shared Task on Machine Translation Robustness. WMT@EMNLP 2020: 76-91 - [e2]Afra Alishahi, Yonatan Belinkov, Grzegorz Chrupala, Dieuwke Hupkes, Yuval Pinter, Hassan Sajjad:
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@EMNLP 2020, Online, November 2020. Association for Computational Linguistics 2020, ISBN 978-1-952148-86-6 [contents] - [i39]Fahim Dalvi, Hassan Sajjad, Nadir Durrani, Yonatan Belinkov:
Exploiting Redundancy in Pre-trained Language Models for Efficient Transfer Learning. CoRR abs/2004.04010 (2020) - [i38]Jesse Vig, Sebastian Gehrmann, Yonatan Belinkov, Sharon Qian, Daniel Nevo, Yaron Singer, Stuart M. Shieber:
Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias. CoRR abs/2004.12265 (2020) - [i37]Abhilasha Ravichander, Yonatan Belinkov, Eduard H. Hovy:
Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance? CoRR abs/2005.00719 (2020) - [i36]John M. Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Similarity Analysis of Contextual Word Representation Models. CoRR abs/2005.01172 (2020) - [i35]Mostafa Abdou, Vinit Ravishankar, Maria Barrett, Yonatan Belinkov, Desmond Elliott, Anders Søgaard:
The Sensitivity of Language Models and Humans to Winograd Schema Perturbations. CoRR abs/2005.01348 (2020) - [i34]Abdelrhman Saleh, Tovly Deutsch, Stephen Casper, Yonatan Belinkov, Stuart M. Shieber:
Probing Neural Dialog Models for Conversational Understanding. CoRR abs/2006.08331 (2020) - [i33]Nadir Durrani, Hassan Sajjad, Fahim Dalvi, Yonatan Belinkov:
Analyzing Individual Neurons in Pre-trained Language Models. CoRR abs/2010.02695 (2020) - [i32]Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. CoRR abs/2010.11481 (2020) - [i31]Victor Sanh, Thomas Wolf, Yonatan Belinkov, Alexander M. Rush:
Learning from others' mistakes: Avoiding dataset biases without modeling them. CoRR abs/2012.01300 (2020)
2010 – 2019
- 2019
- [j7]Salvatore Romeo, Giovanni Da San Martino, Yonatan Belinkov, Alberto Barrón-Cedeño, Mohamed Eldesouki, Kareem Darwish, Hamdy Mubarak, James R. Glass, Alessandro Moschitti:
Language processing and learning models for community question answering in Arabic. Inf. Process. Manag. 56(2): 274-290 (2019) - [j6]Yonatan Belinkov, Alexander Magidow, Alberto Barrón-Cedeño, Avi Shmidman, Maxim Romanov:
Studying the history of the Arabic language: language technology and a large-scale historical corpus. Lang. Resour. Evaluation 53(4): 771-805 (2019) - [j5]Yonatan Belinkov, James R. Glass:
Analysis Methods in Neural Language Processing: A Survey. Trans. Assoc. Comput. Linguistics 7: 49-72 (2019) - [c32]Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Anthony Bau, James R. Glass:
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models. AAAI 2019: 6309-6317 - [c31]Fahim Dalvi, Avery Nortonsmith, Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James R. Glass:
NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks. AAAI 2019: 9851-9852 - [c30]Yonatan Belinkov, Adam Poliak, Stuart M. Shieber, Benjamin Van Durme, Alexander M. Rush:
Don't Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference. ACL (1) 2019: 877-891 - [c29]Hongyin Luo, Lan Jiang, Yonatan Belinkov, James R. Glass:
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future. ACL (1) 2019: 1483-1493 - [c28]Jesse Vig, Yonatan Belinkov:
Analyzing the Structure of Attention in a Transformer Language Model. BlackboxNLP@ACL 2019: 63-76 - [c27]Michael Hahn, Frank Keller, Yonatan Bisk, Yonatan Belinkov:
Character-based Surprisal as a Model of Reading Difficulty in the Presence of Errors. CogSci 2019: 401-407 - [c26]Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Identifying and Controlling Important Neurons in Neural Machine Translation. ICLR (Poster) 2019 - [c25]Yonatan Belinkov, Ahmed Ali, James R. Glass:
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition. INTERSPEECH 2019: 81-85 - [c24]Nelson F. Liu, Matt Gardner, Yonatan Belinkov, Matthew E. Peters, Noah A. Smith:
Linguistic Knowledge and Transferability of Contextual Representations. NAACL-HLT (1) 2019: 1073-1094 - [c23]Nadir Durrani, Fahim Dalvi, Hassan Sajjad, Yonatan Belinkov, Preslav Nakov:
One Size Does Not Fit All: Comparing NMT Representations of Different Granularities. NAACL-HLT (1) 2019: 1504-1516 - [c22]Yonatan Belinkov, James R. Glass:
Analysis Methods in Neural Language Processing: A Survey. NAACL-HLT (1) 2019: 3348-3354 - [c21]Yonatan Belinkov, Adam Poliak, Stuart M. Shieber, Benjamin Van Durme, Alexander M. Rush:
On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference. *SEM@NAACL-HLT 2019: 256-262 - [c20]Xian Li, Paul Michel, Antonios Anastasopoulos, Yonatan Belinkov, Nadir Durrani, Orhan Firat, Philipp Koehn, Graham Neubig, Juan Miguel Pino, Hassan Sajjad:
Findings of the First Shared Task on Machine Translation Robustness. WMT (2) 2019: 91-102 - [e1]Tal Linzen, Grzegorz Chrupala, Yonatan Belinkov, Dieuwke Hupkes:
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@ACL 2019, Florence, Italy, August 1, 2019. Association for Computational Linguistics 2019, ISBN 978-1-950737-30-7 [contents] - [i30]Michael Hahn, Frank Keller, Yonatan Bisk, Yonatan Belinkov:
Character-based Surprisal as a Model of Human Reading in the Presence of Errors. CoRR abs/1902.00595 (2019) - [i29]Nelson F. Liu, Matt Gardner, Yonatan Belinkov, Matthew E. Peters, Noah A. Smith:
Linguistic Knowledge and Transferability of Contextual Representations. CoRR abs/1903.08855 (2019) - [i28]Hongyin Luo, Lan Jiang, Yonatan Belinkov, James R. Glass:
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future. CoRR abs/1906.01702 (2019) - [i27]Mirac Suzgun, Sebastian Gehrmann, Yonatan Belinkov, Stuart M. Shieber:
LSTM Networks Can Perform Dynamic Counting. CoRR abs/1906.03648 (2019) - [i26]Jesse Vig, Yonatan Belinkov:
Analyzing the Structure of Attention in a Transformer Language Model. CoRR abs/1906.04284 (2019) - [i25]Gabriel Grand, Yonatan Belinkov:
Adversarial Regularization for Visual Question Answering: Strengths, Shortcomings, and Side Effects. CoRR abs/1906.08430 (2019) - [i24]Xian Li, Paul Michel, Antonios Anastasopoulos, Yonatan Belinkov, Nadir Durrani, Orhan Firat, Philipp Koehn, Graham Neubig, Juan Miguel Pino, Hassan Sajjad:
Findings of the First Shared Task on Machine Translation Robustness. CoRR abs/1906.11943 (2019) - [i23]Yonatan Belinkov, Ahmed Ali, James R. Glass:
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition. CoRR abs/1907.04224 (2019) - [i22]Yonatan Belinkov, Adam Poliak, Stuart M. Shieber, Benjamin Van Durme, Alexander M. Rush:
Don't Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference. CoRR abs/1907.04380 (2019) - [i21]Yonatan Belinkov, Adam Poliak, Stuart M. Shieber, Benjamin Van Durme, Alexander M. Rush:
On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference. CoRR abs/1907.04389 (2019) - [i20]Jonathan S. Rosenfeld, Amir Rosenfeld, Yonatan Belinkov, Nir Shavit:
A Constructive Prediction of the Generalization Error Across Scales. CoRR abs/1909.12673 (2019) - [i19]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
On the Linguistic Representational Power of Neural Machine Translation Models. CoRR abs/1911.00317 (2019) - [i18]Mirac Suzgun, Sebastian Gehrmann, Yonatan Belinkov, Stuart M. Shieber:
Memory-Augmented Recurrent Neural Networks Can Learn Generalized Dyck Languages. CoRR abs/1911.03329 (2019) - 2018
- [b1]Yonatan Belinkov:
On internal language representations in deep learning: an analysis of machine translation and speech recognition. Massachusetts Institute of Technology, Cambridge, USA, 2018 - [c19]Yonatan Belinkov, Yonatan Bisk:
Synthetic and Natural Noise Both Break Neural Machine Translation. ICLR 2018 - [c18]Adam Poliak, Yonatan Belinkov, James R. Glass, Benjamin Van Durme:
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference. NAACL-HLT (2) 2018: 513-523 - [i17]Yonatan Belinkov, Lluís Màrquez, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks. CoRR abs/1801.07772 (2018) - [i16]Adam Poliak, Yonatan Belinkov, James R. Glass, Benjamin Van Durme:
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference. CoRR abs/1804.09779 (2018) - [i15]Yonatan Belinkov, Alexander Magidow, Alberto Barrón-Cedeño, Avi Shmidman, Maxim Romanov:
Studying the History of the Arabic Language: Language Technology and a Large-Scale Historical Corpus. CoRR abs/1809.03891 (2018) - [i14]Mirac Suzgun, Yonatan Belinkov, Stuart M. Shieber:
On Evaluating the Generalization of LSTM Models in Formal Languages. CoRR abs/1811.01001 (2018) - [i13]Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Identifying and Controlling Important Neurons in Neural Machine Translation. CoRR abs/1811.01157 (2018) - [i12]Yonatan Belinkov, James R. Glass:
Analysis Methods in Neural Language Processing: A Survey. CoRR abs/1812.08951 (2018) - [i11]Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Anthony Bau, James R. Glass:
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models. CoRR abs/1812.09355 (2018) - [i10]Fahim Dalvi, Avery Nortonsmith, Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James R. Glass:
NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks. CoRR abs/1812.09359 (2018) - 2017
- [j4]Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg:
Analysis of sentence embedding models using prediction tasks in natural language processing. IBM J. Res. Dev. 61(4-5): 3:1-3:9 (2017) - [c17]Hassan Sajjad, Fahim Dalvi, Nadir Durrani, Ahmed Abdelali, Yonatan Belinkov, Stephan Vogel:
Challenging Language-Dependent Segmentation for Arabic: An Application to Machine Translation and Part-of-Speech Tagging. ACL (2) 2017: 601-607 - [c16]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
What do Neural Machine Translation Models Learn about Morphology? ACL (1) 2017: 861-872 - [c15]Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg:
Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks. ICLR (Poster) 2017 - [c14]Yonatan Belinkov, Lluís Màrquez, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks. IJCNLP(1) 2017: 1-10 - [c13]Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Stephan Vogel:
Understanding and Improving Morphological Learning in the Neural Machine Translation Decoder. IJCNLP(1) 2017: 142-151 - [c12]Sameer Khurana, Maryam Najafian, Ahmed Ali, Tuka Al Hanai, Yonatan Belinkov, James R. Glass:
QMDIS: QCRI-MIT Advanced Dialect Identification System. INTERSPEECH 2017: 2591-2595 - [c11]Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Yonatan Belinkov, Stephan Vogel:
Neural Machine Translation Training in a Multi-Domain Scenario. IWSLT 2017: 66-73 - [c10]Yonatan Belinkov, James R. Glass:
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems. NIPS 2017: 2441-2451 - [i9]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
What do Neural Machine Translation Models Learn about Morphology? CoRR abs/1704.03471 (2017) - [i8]Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Yonatan Belinkov, Stephan Vogel:
Neural Machine Translation Training in a Multi-Domain Scenario. CoRR abs/1708.08712 (2017) - [i7]Hassan Sajjad, Fahim Dalvi, Nadir Durrani, Ahmed Abdelali, Yonatan Belinkov, Stephan Vogel:
Challenging Language-Dependent Segmentation for Arabic: An Application to Machine Translation and Part-of-Speech Tagging. CoRR abs/1709.00616 (2017) - [i6]Yonatan Belinkov, James R. Glass:
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems. CoRR abs/1709.04482 (2017) - [i5]Yonatan Belinkov, Yonatan Bisk:
Synthetic and Natural Noise Both Break Neural Machine Translation. CoRR abs/1711.02173 (2017) - 2016
- [c9]Salvatore Romeo, Giovanni Da San Martino, Alberto Barrón-Cedeño, Alessandro Moschitti, Yonatan Belinkov, Wei-Ning Hsu, Yu Zhang, Mitra Mohtarami, James R. Glass:
Neural Attention for Learning to Rank Questions in Community Question Answering. COLING 2016: 1734-1745 - [c8]Yonatan Belinkov, Alexander Magidow, Maxim Romanov, Avi Shmidman, Moshe Koppel:
Shamela: A Large-Scale Historical Arabic Corpus. LT4DH@COLING 2016: 45-53 - [c7]Mitra Mohtarami, Yonatan Belinkov, Wei-Ning Hsu, Yu Zhang, Tao Lei, Kfir Bar, Scott Cyphers, James R. Glass:
SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering. SemEval@NAACL-HLT 2016: 828-835 - [c6]Roee Aharoni, Yoav Goldberg, Yonatan Belinkov:
Improving Sequence to Sequence Learning for Morphological Inflection Generation: The BIU-MIT Systems for the SIGMORPHON 2016 Shared Task for Morphological Reinflection. SIGMORPHON 2016: 41-48 - [c5]Yonatan Belinkov, James R. Glass:
A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects. VarDial@COLING 2016: 145-152 - [i4]Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg:
Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks. CoRR abs/1608.04207 (2016) - [i3]Yonatan Belinkov, James R. Glass:
A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects. CoRR abs/1609.07568 (2016) - [i2]Yonatan Belinkov, James R. Glass:
Large-Scale Machine Translation between Arabic and Hebrew: Available Corpora and Initial Results. CoRR abs/1609.07701 (2016) - [i1]Yonatan Belinkov, Alexander Magidow, Maxim Romanov, Avi Shmidman, Moshe Koppel:
Shamela: A Large-Scale Historical Arabic Corpus. CoRR abs/1612.08989 (2016) - 2015
- [j3]Yonatan Belinkov, Tao Lei, Regina Barzilay, Amir Globerson:
Erratum: "Exploring Compositional Architectures and Word Vector Representations for Prepositional Phrase Attachment". Trans. Assoc. Comput. Linguistics 3: 101 (2015) - [c4]Yonatan Belinkov, James R. Glass:
Arabic Diacritization with Recurrent Neural Networks. EMNLP 2015: 2281-2285 - [c3]Yonatan Belinkov, Mitra Mohtarami, Scott Cyphers, James R. Glass:
VectorSLU: A Continuous Word Vector Approach to Answer Selection in Community Question Answering Systems. SemEval@NAACL-HLT 2015: 282-287 - [c2]Yonatan Belinkov, Alberto Barrón-Cedeño, Hamdy Mubarak:
Answer Selection in Arabic Community Question Answering: A Feature-Rich Approach. ANLP@ACL 2015: 183-190 - 2014
- [j2]Tressy Arts, Yonatan Belinkov, Nizar Habash, Adam Kilgarriff, Vit Suchomel:
arTenTen: Arabic Corpus and Word Sketches. J. King Saud Univ. Comput. Inf. Sci. 26(4): 357-371 (2014) - [j1]Yonatan Belinkov, Tao Lei, Regina Barzilay, Amir Globerson:
Exploring Compositional Architectures and Word Vector Representations for Prepositional Phrase Attachment. Trans. Assoc. Comput. Linguistics 2: 561-572 (2014) - 2013
- [c1]Hassan Sajjad, Kareem Darwish, Yonatan Belinkov:
Translating Dialectal Arabic to English. ACL (2) 2013: 1-6
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-01 00:15 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint