Daan van Esch

Multimodal Modeling for Spoken Language Identification

Shikhar Bharadwaj

Min Ma

Shikhar Vashishth

Ankur Bapna

Sriram (Sri) Ganapathy

Vera Axelrod

Sid Dalmia

Wei Han

Yu Zhang

Daan van Esch

Sandy Ritchie

Partha Talukdar

Jason Riesa

Proceedings of 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024) (2024)

Now You See Me, Now You Don't: 'Poverty of the Stimulus' Problems and Arbitrary Correspondences in End-to-End Speech Models

Daan van Esch

Proceedings of the Second Workshop on Computation and Written Language (CAWL) 2024

Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages

Daan van Esch

Sandy Ritchie

Sebastian Ruder

Julia Kreutzer

Clara Rivera

Ishank Saxena

Isaac Caswell

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

LinguaMeta: Unified Metadata for Thousands of Languages

Sandy Ritchie

Daan van Esch

Uche Okonkwo

Shikhar Vashishth

Emily Drummond

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

XTREME-S: Evaluating Cross-lingual Speech Representations

Ankur Bapna

Clara E. Rivera

Daan van Esch

Jason Riesa

Jon Clark

Melvin Johnson

Mihir Sanjay Kale

Min Ma

Orhan Firat

Sandy Ritchie

Sebastian Ruder

Simran Khanuja

Ye Jia

Yu Zhang

Proc. Interspeech 2022

Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning

Sandy Ritchie

You-Chi Cheng

Mingqing Chen

Rajiv Mathews

Daan van Esch

Bo Li

Khe Chai Sim

(2022)

Handling Compounding in Mobile Keyboard Input

Andreas Christian Kabel

Keith B. Hall

Tom Ouyang

David Rybach

Daan van Esch

Françoise Simone Beaufays

arXiv cs.CL (2022)

Managing Transcription Data for Automatic Speech Recognition with Elpis

Ben Foley

Daan van Esch

Nay San

The Open Handbook of Linguistic Data Management, The MIT Press (2022)

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

Alëna Aksënova

Zhehuai Chen

Chung-Cheng Chiu

Daan van Esch

Pavel Golik

Wei Han

Levi King

Bhuvana Ramabhadran

Andrew Rosenberg

Suzan Schwartz

Gary Wang

(2022)

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

Julia Kreutzer

Isaac Caswell

Lisa Wang

Ahsan Wahab

Daan van Esch

Nasanbayar Ulzii-Orshikh

Allahsera Auguste Tapo

Nishant Subramani

Artem Sokolov

Claytone Sikasote

Monang Setyawan

Supheakmungkol Sarin

Sokhar Samb

Benoît Sagot

Clara E. Rivera

Annette Rios

Isabel Papadimitriou

Salomey Osei

Pedro Javier Ortiz Suárez

Iroro Fred Ọ̀nọ̀mẹ̀ Orife

Kelechi Ogueji

Rubungo Andre Niyongabo

Toan Nguyen

Mathias Müller

André Müller

Shamsuddeen Hassan Muhammad

Nanda Muhammad

Ayanda Mnyakeni

Jamshidbek Mirzakhalov

Tapiwanashe Matangira

Colin Leong

Nze Lawson

Sneha Kudugunta

Yacine Jernite

Mathias Jenny

Orhan Firat

Bonaventure F. P. Dossou

Sakhile Dlamini

Nisansa de Silva

Sakine Çabuk Ballı

Stella Biderman

Alessia Battisti

Ahmed Baruwa

Ankur Bapna

Pallavi Baljekar

Israel Abebe Azime

Ayodele Awokoya

Duygu Ataman

Orevaoghene Ahia

Oghenefego Ahia

Sweta Agrawal

Mofetoluwa Adeyemi

TACL (2022)

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Daan van Esch

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Daan van Esch

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us