default search action

combined dblp search
author search
venue search
publication search

ask others

Ramon Sanabria

> Home > Persons

Person information

affiliation: Carnegie Mellon University, Language Technology Institute, Pittsburgh, PA, USA

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SalibaLSL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SalibaLSL24
Alexandra Saliba, Yuanchao Li, Ramon Sanabria, Catherine Lai:
Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition. ICASSP Workshops 2024: 590-594
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-02617
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02617
Alexandra Saliba, Yuanchao Li, Ramon Sanabria, Catherine Lai:
Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition. CoRR abs/2402.02617 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-01616
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01616
Frank Palma Gomez, Ramon Sanabria, Yun-Hsuan Sung, Daniel Cer, Siddharth Dalmia, Gustavo Hernández Ábrego:
Transforming LLMs into Cross-modal and Cross-lingual Retrieval Systems. CoRR abs/2404.01616 (2024)
2023
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SanabriaBMCKB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SanabriaBMCKB23
Ramon Sanabria, Nikolay Bogoychev, Nina Markl, Andrea Carmantini, Ondrej Klejch, Peter Bell:
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR. ICASSP 2023: 1-5
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SanabriaHBA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SanabriaHBA23
Ramon Sanabria, Wei-Ning Hsu, Alexei Baevski, Michael Auli:
Measuring the Impact of Domain Factors in Self-Supervised Pre-Training. ICASSP Workshops 2023: 1-5
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SanabriaTG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SanabriaTG23
Ramon Sanabria, Hao Tang, Sharon Goldwater:
Analyzing Acoustic Word Embeddings from Pre-Trained Self-Supervised Speech Models. ICASSP 2023: 1-5
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SanabriaKTG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SanabriaKTG23
Ramon Sanabria, Ondrej Klejch, Hao Tang, Sharon Goldwater:
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling. INTERSPEECH 2023: 406-410
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-18110
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-18110
Ramon Sanabria, Nikolay Bogoychev, Nina Markl, Andrea Carmantini, Ondrej Klejch, Peter Bell:
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR. CoRR abs/2303.18110 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02153
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02153
Ramon Sanabria, Ondrej Klejch, Hao Tang, Sharon Goldwater:
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling. CoRR abs/2306.02153 (2023)
2022
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-00648
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-00648
Ramon Sanabria, Wei-Ning Hsu, Alexei Baevski, Michael Auli:
Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training. CoRR abs/2203.00648 (2022)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-16043
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16043
Ramon Sanabria, Hao Tang, Sharon Goldwater:
Analyzing Acoustic Word Embeddings from Pre-trained Self-supervised Speech Models. CoRR abs/2210.16043 (2022)
2021
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SanabriaWB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SanabriaWB21
Ramon Sanabria, Austin Waters, Jason Baldridge:
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval. Interspeech 2021: 2976-2980
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-01894
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-01894
Ramon Sanabria, Austin Waters, Jason Baldridge:
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval. CoRR abs/2104.01894 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-10107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-10107
Ramon Sanabria, Hao Tang, Sharon Goldwater:
On the Difficulty of Segmenting Words with Attention. CoRR abs/2109.10107 (2021)
2020
[j2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/csl/PalaskarSM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/PalaskarSM20
Shruti Palaskar, Ramon Sanabria, Florian Metze:
Transfer learning for multimodal dialog. Comput. Speech Lang. 64: 101093 (2020)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/SpeciaBCDEGHLLL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/SpeciaBCDEGHLLL20
Lucia Specia, Loïc Barrault, Ozan Caglayan, Amanda Cardoso Duarte, Desmond Elliott, Spandana Gella, Nils Holzenberger, Chiraag Lala, Sun Jae Lee, Jindrich Libovický, Pranava Madhyastha, Florian Metze, Karl Mulligan, Alissa Ostapenko, Shruti Palaskar, Ramon Sanabria, Josiah Wang, Raman Arora:
Grounded Sequence to Sequence Transduction. IEEE J. Sel. Top. Signal Process. 14(3): 577-591 (2020)
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/SrinivasanSME20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/SrinivasanSME20
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott:
Fine-Grained Grounding for Multimodal Speech Recognition. EMNLP (Findings) 2020: 2667-2677
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SrinivasanSM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SrinivasanSM20
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Looking Enhances Listening: Recovering Missing Speech Using Images. ICASSP 2020: 6304-6308
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-05639
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-05639
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Looking Enhances Listening: Recovering Missing Speech Using Images. CoRR abs/2002.05639 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-02384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02384
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott:
Fine-Grained Grounding for Multimodal Speech Recognition. CoRR abs/2010.02384 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-08642
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-08642
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott:
Multimodal Speech Recognition with Unstructured Audio Masking. CoRR abs/2010.08642 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/CaglayanSPBM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/CaglayanSPBM19
Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze:
Multimodal Grounding for Sequence-to-sequence Speech Recognition. ICASSP 2019: 8648-8652
[c10]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/NiehuesCSNTHSSB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/NiehuesCSNTHSSB19
Jan Niehues, Roldano Cattoni, Sebastian Stüker, Matteo Negri, Marco Turchi, Thanh-Le Ha, Elizabeth Salesky, Ramon Sanabria, Loïc Barrault, Lucia Specia, Marcello Federico:
The IWSLT 2019 Evaluation Campaign. IWSLT 2019
[c9]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/SrinivasanSM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/SrinivasanSM19
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
CMU's Machine Translation System for IWSLT 2019. IWSLT 2019
[c8]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/iwslt/SrinivasanSM19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/SrinivasanSM19a
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Multitask Learning For Different Subword Segmentations In Neural Machine Translation. IWSLT 2019
[c7]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/MoriyaSMJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/MoriyaSMJ19
Yasufumi Moriya, Ramon Sanabria, Florian Metze, Gareth J. F. Jones:
MediaEval 2019: Eyes and Ears Together. MediaEval 2019
[e1]
- view
- export record
  dblp key:
  - conf/iwslt/2019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/2019
Jan Niehues, Roldano Cattoni, Sebastian Stüker, Matteo Negri, Marco Turchi, Thanh-Le Ha, Elizabeth Salesky, Ramon Sanabria, Loïc Barrault, Lucia Specia, Marcello Federico:
Proceedings of the 16th International Conference on Spoken Language Translation, IWSLT 2019, Hong Kong, November 2-3, 2019. Association for Computational Linguistics 2019 [contents]
[i13]
- view
  - electronic edition @ nist.gov (open access)
  - details & citations
- export record
  dblp key:
  - conf/tac/HovyCCGHMMSDCCK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tac/HovyCCGHMMSDCCK19
Eduard H. Hovy, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alex Hauptmann, Florian Metze, Teruko Mitamura, Zaid Sheikh, Ankit Dangi, Aditi Chaudhary, Xianyang Chen, Xiang Kong, Bernie Huang, Salvador Medina, Hector Liu, Xuezhe Ma, Maria Ryskina, Ramon Sanabria, Varun Gangal:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2019
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-06147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-06147
Yasufumi Moriya, Ramon Sanabria, Florian Metze, Gareth J. F. Jones:
Grounding Object Detections With Transcriptions. CoRR abs/1906.06147 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1907-00477
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-00477
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions. CoRR abs/1907.00477 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-12368
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-12368
Tejas Srinivasan, Ramon Sanabria, Florian Metze:
Multitask Learning For Different Subword Segmentations In Neural Machine Translation. CoRR abs/1910.12368 (2019)
2018
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DalmiaSMB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DalmiaSMB18
Siddharth Dalmia, Ramon Sanabria, Florian Metze, Alan W. Black:
Sequence-Based Multi-Lingual Low Resource Speech Recognition. ICASSP 2018: 4909-4913
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PalaskarSM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PalaskarSM18
Shruti Palaskar, Ramon Sanabria, Florian Metze:
End-to-end Multimodal Speech Recognition. ICASSP 2018: 5774-5778
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZenkelSMW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZenkelSMW18
Thomas Zenkel, Ramon Sanabria, Florian Metze, Alex Waibel:
Subword and Crossword Units for CTC Acoustic Models. INTERSPEECH 2018: 396-400
[c3]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/mediaeval/MoriyaSMJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mediaeval/MoriyaSMJ18
Yasufumi Moriya, Ramon Sanabria, Florian Metze, Gareth J. F. Jones:
Eyes and Ears Together: New Task for Multimodal Spoken Content Analysis. MediaEval 2018
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SanabriaM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SanabriaM18
Ramon Sanabria, Florian Metze:
Hierarchical Multitask Learning With CTC. SLT 2018: 485-490
[i9]
- view
  - electronic edition @ nist.gov (open access)
  - details & citations
- export record
  dblp key:
  - conf/tac/HovyBCCGHMMCCHL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tac/HovyBCCGHMMCCHL18
Eduard H. Hovy, Taylor Berg-Kirkpatrick, Jaime G. Carbonell, Hans Chalupsky, Anatole Gershman, Alexander G. Hauptmann, Florian Metze, Teruko Mitamura, Aditi Chaudhary, Xianyang Chen, Bernie Po-Yao Huang, Hector Zhengzhong Liu, Xuezhe Ma, Shruti Palaskar, Dheeraj Rajagopal, Maria Ryskina, Ramon Sanabria:
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC 2018
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-07420
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-07420
Siddharth Dalmia, Ramon Sanabria, Florian Metze, Alan W. Black:
Sequence-based Multi-lingual Low Resource Speech Recognition. CoRR abs/1802.07420 (2018)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-09713
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-09713
Shruti Palaskar, Ramon Sanabria, Florian Metze:
End-to-End Multimodal Speech Recognition. CoRR abs/1804.09713 (2018)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-07104
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-07104
Ramon Sanabria, Florian Metze:
Hierarchical Multi Task Learning With CTC. CoRR abs/1807.07104 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-00347
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-00347
Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze:
How2: A Large-scale Dataset for Multimodal Language Understanding. CoRR abs/1811.00347 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-03865
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-03865
Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze:
Multimodal Grounding for Sequence-to-Sequence Speech Recognition. CoRR abs/1811.03865 (2018)
2017
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZenkelSMNSSW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZenkelSMNSSW17
Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber, Sebastian Stüker, Alex Waibel:
Comparison of Decoding Strategies for CTC Acoustic Models. INTERSPEECH 2017: 513-517
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-04469
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-04469
Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber, Sebastian Stüker, Alex Waibel:
Comparison of Decoding Strategies for CTC Acoustic Models. CoRR abs/1708.04469 (2017)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1712-06855
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-06855
Thomas Zenkel, Ramon Sanabria, Florian Metze, Alex Waibel:
Subword and Crossword Units for CTC Acoustic Models. CoRR abs/1712.06855 (2017)
2016
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SanabriaMT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SanabriaMT16
Ramon Sanabria, Florian Metze, Fernando De la Torre:
Robust end-to-end deep audiovisual speech recognition. CoRR abs/1611.06986 (2016)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.