User profiles for Sneha Reddy Kudugunta

Sneha Kudugunta

Google DeepMind
Verified email at google.com
Cited by 3322

Investigating multilingual NMT representations at scale

SR Kudugunta, A Bapna, I Caswell… - arXiv preprint arXiv …, 2019 - arxiv.org
Multilingual Neural Machine Translation (NMT) models have yielded large empirical
success in transfer learning settings. However, these black-box representations are poorly …

Beyond distillation: Task-level mixture-of-experts for efficient inference

S Kudugunta, Y Huang, A Bapna, M Krikun… - arXiv preprint arXiv …, 2021 - arxiv.org
Sparse Mixture-of-Experts (MoE) has been a successful approach for scaling multilingual
translation models to billions of parameters without a proportional increase in training …

MiTTenS: A Dataset for Evaluating Gender Mistranslation

K Robinson, S Kudugunta, R Stella… - Proceedings of the …, 2024 - aclanthology.org
Translation systems, including foundation models capable of translation, can produce errors
that result in gender mistranslation, and such errors can be especially harmful. To measure …

Exploring routing strategies for multilingual mixture-of-experts models

S Kudugunta, Y Huang, A Bapna, M Krikun, D Lepikhin… - 2021 - openreview.net
Sparsely-Gated Mixture-of-Experts (MoE) has been a successful approach for scaling multilingual
translation models to billions of parameters without a proportional increase in training …

Buffet: Benchmarking large language models for few-shot cross-lingual transfer

A Asai, S Kudugunta, XV Yu, T Blevins… - arXiv preprint arXiv …, 2023 - arxiv.org
Despite remarkable advancements in few-shot generalization in natural language processing,
most models are developed and evaluated primarily in English. To facilitate research on …

Gradient vaccine: Investigating and improving multi-task optimization in massively multilingual models

Z Wang, Y Tsvetkov, O Firat, Y Cao - arXiv preprint arXiv:2010.05874, 2020 - arxiv.org
Massively multilingual models subsuming tens or even hundreds of languages pose great
challenges to multi-task optimization. While it is a common practice to apply a language-…

BERT is not an interlingua and the bias of tokenization

J Singh, B McCann, R Socher… - Proceedings of the 2nd …, 2019 - aclanthology.org
Multilingual transfer learning can benefit both high- and low-resource languages, but the
source of these improvements is not well understood. Canonical Correlation Analysis (CCA) of …