default search action

combined dblp search
author search
venue search
publication search

ask others

Hirofumi Inaguma

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c40]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/ShiIMKS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ShiIMKS24
Jiatong Shi, Hirofumi Inaguma, Xutai Ma, Ilia Kulikov, Anna Y. Sun:
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction. ICLR 2024
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-09869
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-09869
Jiatong Shi, Xutai Ma, Hirofumi Inaguma, Anna Y. Sun, Shinji Watanabe:
MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model. CoRR abs/2406.09869 (2024)
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-03169
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-03169
Chao-Wei Huang, Hui Lu, Hongyu Gong, Hirofumi Inaguma, Ilia Kulikov, Ruslan Mavlyutov, Sravya Popuri:
Investigating Decoder-only Large Language Models for Speech-to-text Translation. CoRR abs/2407.03169 (2024)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-00168
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-00168
Weiting Tan, Hirofumi Inaguma, Ning Dong, Paden Tomasello, Xutai Ma:
SSR: Alignment-Aware Modality Connector for Speech Language Models. CoRR abs/2410.00168 (2024)
2023
[j1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/InagumaK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/InagumaK23
Hirofumi Inaguma, Tatsuya Kawahara:
Alignment Knowledge Distillation for Online Streaming Attention-Based Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1371-1385 (2023)
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YanS0IPDPFBHZNH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YanS0IPDPFBHZNH23
Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polak, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe:
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit. ACL (demo) 2023: 400-411
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ChenTYDKCTDSGIP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ChenTYDKCTDSGIP23
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation for a Real-world Unwritten Language. ACL (Findings) 2023: 4969-4983
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/WangICK0HA023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangICK0HA023
Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino:
Simple and Effective Unsupervised Speech Translation. ACL (1) 2023: 10771-10784
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/0002SICDMT023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/0002SICDMT023
Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden Tomasello, Juan Pino:
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks. ACL (1) 2023: 12441-12455
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/InagumaPKCWC00023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/InagumaPKCWC00023
Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. ACL (1) 2023: 15655-15680
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaidoTKHGI23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaidoTKHGI23
Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma:
Named Entity Detection and Injection for Direct Speech Translation. ICASSP 2023: 1-5
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiTLIWPW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiTLIWPW23
Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-To-Speech Translation with Multiple TTS Targets. ICASSP 2023: 1-5
[c32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Shi0IG0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Shi0IG0023
Jiatong Shi, Yun Tang, Hirofumi Inaguma, Hongyu Gong, Juan Pino, Shinji Watanabe:
Exploration on HuBERT with Multiple Resolution. INTERSPEECH 2023: 3287-3291
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/AgrawalABBBCCCC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/AgrawalABBBCCCC23
Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondrej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polák, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibel, Mingxuan Wang, Shinji Watanabe, Rodolfo Zevallos:
Findings of the IWSLT 2023 Evaluation Campaign. IWSLT@ACL 2023: 1-61
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-04618
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-04618
Jiatong Shi, Yun Tang, Ann Lee, Hirofumi Inaguma, Changhan Wang, Juan Pino, Shinji Watanabe:
Enhancing Speech-to-Speech Translation with Multiple TTS Targets. CoRR abs/2304.04618 (2023)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-03101
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-03101
Yun Tang, Anna Y. Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden D. Tomasello, Juan Pino:
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks. CoRR abs/2305.03101 (2023)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-01084
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01084
Jiatong Shi, Yun Tang, Hirofumi Inaguma, Hongyu Gong, Juan Pino, Shinji Watanabe:
Exploration on HuBERT with Multiple Resolutions. CoRR abs/2306.01084 (2023)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-11596
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11596
Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, Prangthip Hansanti, Russ Howes, Bernie Huang, Min-Jae Hwang, Hirofumi Inaguma, Somya Jain, Elahe Kalbassi, Amanda Kallet, Ilia Kulikov, Janice Lam, Daniel Li, Xutai Ma, Ruslan Mavlyutov, Benjamin N. Peloquin, Mohamed Ramadan, Abinesh Ramakrishnan, Anna Y. Sun, Kevin Tran, Tuan Tran, Igor Tufanov, Vish Vogeti, Carleigh Wood, Yilin Yang, Bokai Yu, Pierre Andrews, Can Balioglu, Marta R. Costa-jussà, Onur Celebi, Maha Elbayad, Cynthia Gao, Francisco Guzmán, Justine Kao, Ann Lee, Alexandre Mourachko, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang:
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation. CoRR abs/2308.11596 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-02720
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02720
Jiatong Shi, Hirofumi Inaguma, Xutai Ma, Ilia Kulikov, Anna Y. Sun:
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction. CoRR abs/2310.02720 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-04515
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-04515
Xutai Ma, Anna Y. Sun, Siqi Ouyang, Hirofumi Inaguma, Paden Tomasello:
Efficient Monotonic Multihead Attention. CoRR abs/2312.04515 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-05187
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-05187
Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, Yilin Yang, Ethan Ye, Ivan Evtimov, Pierre Fernandez, Cynthia Gao, Prangthip Hansanti, Elahe Kalbassi, Amanda Kallet, Artyom Kozhevnikov, Gabriel Mejia Gonzalez, Robin San Roman, Christophe Touret, Corinne Wong, Carleigh Wood, Bokai Yu, Pierre Andrews, Can Balioglu, Peng-Jen Chen, Marta R. Costa-jussà, Maha Elbayad, Hongyu Gong, Francisco Guzmán, Kevin Heffernan, Somya Jain, Justine Kao, Ann Lee, Xutai Ma, Alexandre Mourachko, Benjamin N. Peloquin, Juan Pino, Sravya Popuri, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Anna Y. Sun, Paden Tomasello, Changhan Wang, Jeff Wang, Skyler Wang, Mary Williamson:
Seamless: Multilingual Expressive and Streaming Speech Translation. CoRR abs/2312.05187 (2023)
2022
[c30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FutamiIUMSK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FutamiIUMSK22
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM. INTERSPEECH 2022: 3889-3893
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2201-05420
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-05420
Florian Boyer, Yusuke Shinohara, Takaaki Ishii, Hirofumi Inaguma, Shinji Watanabe:
A Study of Transducer based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding Strategies. CoRR abs/2201.05420 (2022)
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-02030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-02030
Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Distilling the Knowledge of BERT for CTC-based ASR. CoRR abs/2209.02030 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-04062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-04062
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM. CoRR abs/2209.04062 (2022)
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-10191
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10191
Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino:
Simple and Effective Unsupervised Speech Translation. CoRR abs/2210.10191 (2022)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-11981
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11981
Marco Gaido, Yun Tang, Ilia Kulikov, Rongqing Huang, Hongyu Gong, Hirofumi Inaguma:
Named Entity Detection and Injection for Direct Speech Translation. CoRR abs/2210.11981 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-06474
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-06474
Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Miguel Pino, Wei-Ning Hsu, Ann Lee:
Speech-to-Speech Translation For A Real-world Unwritten Language. CoRR abs/2211.06474 (2022)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-08055
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08055
Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino:
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units. CoRR abs/2212.08055 (2022)
2021
[b1]
- view
  authority control:
- export record
  dblp key:
  - phd/jp/Inaguma21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/jp/Inaguma21
Hirofumi Inaguma:
Fast and Low-Latency End-to-End Speech Recognition and Translation. Kyoto University, Japan, 2021
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/BoyerSIIW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/BoyerSIIW21
Florian Boyer, Yusuke Shinohara, Takaaki Ishii, Hirofumi Inaguma, Shinji Watanabe:
A Study of Transducer Based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding Strategies. ASRU 2021: 16-23
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HiguchiCFIKLNWW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HiguchiCFIKLNWW21
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe:
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation. ASRU 2021: 47-54
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/FutamiIMSK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/FutamiIMSK21
Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
ASR Rescoring and Confidence Estimation with Electra. ASRU 2021: 380-387
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/InagumaDYW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/InagumaDYW21
Hirofumi Inaguma, Siddharth Dalmia, Brian Yan, Shinji Watanabe:
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates. ASRU 2021: 922-929
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuoBCHHIKLGSSWW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoBCHHIKLGSSWW21
Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang:
Recent Developments on Espnet Toolkit Boosted By Conformer. ICASSP 2021: 5874-5878
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InagumaHDK021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InagumaHDK021
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
ORTHROS: non-autoregressive end-to-end speech translation With dual-decoder. ICASSP 2021: 7503-7507
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HiguchiI0OK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HiguchiI0OK21
Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi:
Improved Mask-CTC for Non-Autoregressive End-to-End ASR. ICASSP 2021: 8363-8367
[c22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/InagumaK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/InagumaK21
Hirofumi Inaguma, Tatsuya Kawahara:
StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR. Interspeech 2021: 1817-1821
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/InagumaK21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/InagumaK21a
Hirofumi Inaguma, Tatsuya Kawahara:
VAD-Free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording. Interspeech 2021: 4049-4053
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/InagumaYDGSDW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/InagumaYDGSDW21
Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe:
ESPnet-ST IWSLT 2021 Offline Speech Translation System. IWSLT 2021: 100-109
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/InagumaKW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/InagumaKW21
Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe:
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation. NAACL-HLT 2021: 1872-1881
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-00422
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-00422
Hirofumi Inaguma, Tatsuya Kawahara:
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition. CoRR abs/2103.00422 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-06457
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06457
Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe:
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation. CoRR abs/2104.06457 (2021)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-00635
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00635
Hirofumi Inaguma, Tatsuya Kawahara:
StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR. CoRR abs/2107.00635 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-00636
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-00636
Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe:
ESPnet-ST IWSLT 2021 Offline Speech Translation System. CoRR abs/2107.00636 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-07509
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-07509
Hirofumi Inaguma, Tatsuya Kawahara:
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording. CoRR abs/2107.07509 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-04411
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-04411
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring. CoRR abs/2109.04411 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-12804
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-12804
Hirofumi Inaguma, Siddharth Dalmia, Brian Yan, Shinji Watanabe:
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates. CoRR abs/2109.12804 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-01857
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-01857
Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
ASR Rescoring and Confidence Estimation with ELECTRA. CoRR abs/2110.01857 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05249
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05249
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe:
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation. CoRR abs/2110.05249 (2021)
2020
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/InagumaKDKYHW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/InagumaKDKYHW20
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Yalta, Tomoki Hayashi, Shinji Watanabe:
ESPnet-ST: All-in-One Speech Translation Toolkit. ACL (demo) 2020: 302-311
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InagumaGLLG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InagumaGLLG20
Hirofumi Inaguma, Yashesh Gaur, Liang Lu, Jinyu Li, Yifan Gong:
Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR. ICASSP 2020: 6064-6068
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/InagumaMK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/InagumaMK20
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
CTC-Synchronous Training for Monotonic Attention Model. INTERSPEECH 2020: 571-575
[c15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/InagumaMK20a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/InagumaMK20a
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
Enhancing Monotonic Multihead Attention for Streaming ASR. INTERSPEECH 2020: 2137-2141
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FutamiIUMSK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FutamiIUMSK20
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR. INTERSPEECH 2020: 3635-3639
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DangZUIK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DangZUIK20
Viet-Trung Dang, Tianyu Zhao, Sei Ueno, Hirofumi Inaguma, Tatsuya Kawahara:
End-to-End Speech-to-Dialog-Act Recognition. INTERSPEECH 2020: 3910-3914
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2004-05009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-05009
Hirofumi Inaguma, Yashesh Gaur, Liang Lu, Jinyu Li, Yifan Gong:
Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR. CoRR abs/2004.05009 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2004-10234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-10234
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Enrique Yalta Soplin, Tomoki Hayashi, Shinji Watanabe:
ESPnet-ST: All-in-One Speech Translation Toolkit. CoRR abs/2004.10234 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2004-11419
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-11419
Viet-Trung Dang, Tianyu Zhao, Sei Ueno, Hirofumi Inaguma, Tatsuya Kawahara:
End-to-end speech-to-dialog-act recognition. CoRR abs/2004.11419 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-04712
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-04712
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
CTC-synchronous Training for Monotonic Attention Model. CoRR abs/2005.04712 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09394
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09394
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
Enhancing Monotonic Multihead Attention for Streaming ASR. CoRR abs/2005.09394 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-03822
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-03822
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR. CoRR abs/2008.03822 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-13047
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-13047
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder. CoRR abs/2010.13047 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-13270
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-13270
Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi:
Improved Mask-CTC for Non-Autoregressive End-to-End ASR. CoRR abs/2010.13270 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-13956
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-13956
Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang:
Recent Developments on ESPnet Toolkit Boosted by Conformer. CoRR abs/2010.13956 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-13006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-13006
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang:
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans. CoRR abs/2012.13006 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KaritaWWYZCHHIJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KaritaWWYZCHHIJ19
Shigeki Karita, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto:
A Comparative Study on Transformer vs RNN in Speech Applications. ASRU 2019: 449-456
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/InagumaDKW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/InagumaDKW19
Hirofumi Inaguma, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
Multilingual End-to-End Speech Translation. ASRU 2019: 570-577
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InagumaCBKW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InagumaCBKW19
Hirofumi Inaguma, Jaejin Cho, Murali Karthick Baskar, Tatsuya Kawahara, Shinji Watanabe:
Transfer Learning of Language-independent End-to-end ASR with Language Model Fusion. ICASSP 2019: 6096-6100
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChoWHBIVD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChoWHBIVD19
Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. ICASSP 2019: 6191-6195
[c8]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iwslt/InagumaKSSD019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/InagumaKSSD019
Hirofumi Inaguma, Shun Kiyono, Nelson Enrique Yalta Soplin, Jun Suzuki, Kevin Duh, Shinji Watanabe:
ESPnet How2 Speech Translation System for IWSLT 2019: Pre-training, Knowledge Distillation, and Going Deeper. IWSLT 2019
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-06317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-06317
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang:
A Comparative Study on Transformer vs RNN in Speech Applications. CoRR abs/1909.06317 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-09993
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-09993
Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR. CoRR abs/1909.09993 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-00254
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-00254
Hirofumi Inaguma, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe:
Multilingual End-to-End Speech Translation. CoRR abs/1910.00254 (2019)
2018
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/UenoIMK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/UenoIMK18
Sei Ueno, Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara:
Acoustic-to-Word Attention-Based Model Complemented with Character-Level CTC-Based Model. ICASSP 2018: 5804-5808
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InagumaMIYK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InagumaMIYK18
Hirofumi Inaguma, Masato Mimura, Koji Inoue, Kazuyoshi Yoshii, Tatsuya Kawahara:
An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition. ICASSP 2018: 6214-6218
[c5]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iwslt/InagumaZWR0D18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/InagumaZWR0D18
Hirofumi Inaguma, Xuan Zhang, Zhiqi Wang, Adithya Renduchintala, Shinji Watanabe, Kevin Duh:
The JHU/KyotoU Speech Translation System for IWSLT 2018. IWSLT 2018: 153-159
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/InagumaMSK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/InagumaMSK18
Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR. SLT 2018: 212-218
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MimuraUISK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MimuraUISK18
Masato Mimura, Sei Ueno, Hirofumi Inaguma, Shinsuke Sakai, Tatsuya Kawahara:
Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition. SLT 2018: 477-484
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02134
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02134
Hirofumi Inaguma, Jaejin Cho, Murali Karthick Baskar, Tatsuya Kawahara, Shinji Watanabe:
Transfer learning of language-independent end-to-end ASR with language model fusion. CoRR abs/1811.02134 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02162
Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language model integration based on memory control for sequence to sequence speech recognition. CoRR abs/1811.02162 (2018)
2017
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/InagumaIMK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/InagumaIMK17
Hirofumi Inaguma, Koji Inoue, Masato Mimura, Tatsuya Kawahara:
Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC. INTERSPEECH 2017: 1691-1695
2016
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/InagumaINTK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/InagumaINTK16
Hirofumi Inaguma, Koji Inoue, Shizuka Nakamura, Katsuya Takanashi, Tatsuya Kawahara:
Prediction of ice-breaking between participants using prosodic features in the first meeting dialogue. ASSP4MI@ICMI 2016: 11-15

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.