default search action

combined dblp search
author search
venue search
publication search

ask others

Tomoki Hayashi

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2023
[c67]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YanS0IPDPFBHZNH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YanS0IPDPFBHZNH23
Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polak, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe:
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit. ACL (demo) 2023: 400-411
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KobayashiHT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KobayashiHT23
Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda:
Low-Latency Electrolaryngeal Speech Enhancement Based on Fastspeech2-Based Voice Conversion and Self-Supervised Speech Representation. ICASSP 2023: 1-5
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-09099
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-09099
Massa Baali, Tomoki Hayashi, Hamdy Mubarak, Soumi Maiti, Shinji Watanabe, Wassim El-Hajj, Ahmed Ali:
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study. CoRR abs/2301.09099 (2023)
2022
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/HuangYHT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/HuangYHT22
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Tomoki Toda:
A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion. IEEE J. Sel. Top. Signal Process. 16(6): 1308-1318 (2022)
[c65]
- view
  - electronic edition @ mpg.de
  - no references & citations available
- export record
  dblp key:
  - conf/bmvc/KarlssonH0COT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bmvc/KarlssonH0COT22
Robin Karlsson, Tomoki Hayashi, Keisuke Fujii, Alexander Carballo, Kento Ohtani, Kazuya Takeda:
Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment. BMVC 2022: 699
[c64]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/eusipco/KimHT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KimHT22
Sehun Kim, Tomoki Hayashi, Tomoki Toda:
Note-level Automatic Guitar Transcription Using Attention Mechanism. EUSIPCO 2022: 229-233
[c63]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/eusipco/KuroyanagiHTT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KuroyanagiHTT22
Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda:
Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure. EUSIPCO 2022: 294-298
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangYHLWT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangYHLWT22
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Hung-Yi Lee, Shinji Watanabe, Tomoki Toda:
S3PRL-VC: Open-Source Voice Conversion Framework with Self-Supervised Speech Representations. ICASSP 2022: 6552-6556
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HayashiKT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HayashiKT22
Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
An Investigation of Streaming Non-Autoregressive sequence-to-sequence Voice Conversion. ICASSP 2022: 6802-6806
[c60]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShiGQHWXCLW0J22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiGQHWXCLW0J22
Jiatong Shi, Shuai Guo, Tao Qian, Tomoki Hayashi, Yuning Wu, Fangzheng Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe, Qin Jin:
Muskits: an End-to-end Music Processing Toolkit for Singing Voice Synthesis. INTERSPEECH 2022: 4277-4281
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/rita/YamamotoOHCT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rita/YamamotoOHCT22
Takumi Yamamoto, Kento Ohtani, Tomoki Hayashi, Alexander Carballo, Kazuya Takeda:
Efficient Training Method for Point Cloud-Based Object Detection Models by Combining Environmental Transitions and Active Learning. RiTA 2022: 292-303
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-08470
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-08470
Tatsuya Komatsu, Shinji Watanabe, Koichi Miyazaki, Tomoki Hayashi:
Acoustic Event Detection with Classifier Chains. CoRR abs/2202.08470 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-04029
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-04029
Jiatong Shi, Shuai Guo, Tao Qian, Nan Huo, Tomoki Hayashi, Yuning Wu, Frank Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe, Qin Jin:
Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis. CoRR abs/2205.04029 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-05929
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-05929
Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda:
Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure. CoRR abs/2206.05929 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-04356
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04356
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Tomoki Toda:
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion. CoRR abs/2207.04356 (2022)
2021
[j8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/HuangHWKT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HuangHWKT21
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda:
Pretraining Techniques for Sequence-to-Sequence Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 29: 745-755 (2021)
[j7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/WuHOKT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuHOKT21
Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda:
Quasi-Periodic Parallel WaveGAN: A Non-Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network. IEEE ACM Trans. Audio Speech Lang. Process. 29: 792-806 (2021)
[j6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/WuHTKT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuHTKT21
Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda:
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1134-1148 (2021)
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HuangHLWT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HuangHLWT21
Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe, Tomoki Toda:
On Prosody Modeling for ASR+TTS Based Voice Conversion. ASRU 2021: 642-649
[c57]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/KuroyanagiHAYTT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/KuroyanagiHAYTT21
Ibuki Kuroyanagi, Tomoki Hayashi, Yusuke Adachi, Takenori Yoshimura, Kazuya Takeda, Tomoki Toda:
An Ensemble Approach to Anomalous Sound Detection Based on Conformer-Based Autoencoder and Binary Classifier Incorporated with Metric Learning. DCASE 2021: 110-114
[c56]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/NarisettyHIWT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/NarisettyHIWT21
Chaitanya Prasad Narisetty, Tomoki Hayashi, Ryunosuke Ishizaki, Shinji Watanabe, Kazuya Takeda:
Leveraging State-of-the-art ASR Techniques to Audio Captioning. DCASE 2021: 160-164
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/HayashiYIKS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HayashiYIKS21
Tomoki Hayashi, Takenori Yoshimura, Masaya Inuzuka, Ibuki Kuroyanagi, Osamu Segawa:
Spontaneous Speech Summarization: Transformers All The Way Through. EUSIPCO 2021: 456-460
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/KuroyanagiHTT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/KuroyanagiHTT21
Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda:
Anomalous Sound Detection Using a Binary Classification Model and Class Centroids. EUSIPCO 2021: 1995-1999
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuoBCHHIKLGSSWW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuoBCHHIKLGSSWW21
Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang:
Recent Developments on Espnet Toolkit Boosted By Conformer. ICASSP 2021: 5874-5878
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KobayashiHWTHT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KobayashiHWTHT21
Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda:
Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder. ICASSP 2021: 5934-5938
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangWH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangWH21
Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi:
Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations. ICASSP 2021: 5944-5948
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HayashiHKT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HayashiHKT21
Tomoki Hayashi, Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda:
Non-Autoregressive Sequence-To-Sequence Voice Conversion. ICASSP 2021: 7068-7072
[c49]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Komatsu0MH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Komatsu0MH21
Tatsuya Komatsu, Shinji Watanabe, Koichi Miyazaki, Tomoki Hayashi:
Acoustic Event Detection with Classifier Chains. Interspeech 2021: 601-605
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Li0ZSCKHHBC021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Li0ZSCKHHBC021
Chenda Li, Jing Shi, Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Naoyuki Kamo, Moto Hira, Tomoki Hayashi, Christoph Böddeker, Zhuo Chen, Shinji Watanabe:
ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration. SLT 2021: 785-792
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-02858
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-02858
Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda:
crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder. CoRR abs/2103.02858 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-06793
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06793
Tomoki Hayashi, Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda:
Non-autoregressive sequence-to-sequence voice conversion. CoRR abs/2104.06793 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-06151
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-06151
Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda:
Anomalous Sound Detection Using a Binary Classification Model and Class Centroids. CoRR abs/2106.06151 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-09477
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-09477
Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe, Tomoki Toda:
On Prosody Modeling for ASR+TTS based Voice Conversion. CoRR abs/2107.09477 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06280
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06280
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Hung-Yi Lee, Shinji Watanabe, Tomoki Toda:
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations. CoRR abs/2110.06280 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-07840
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07840
Tomoki Hayashi, Ryuichi Yamamoto, Takenori Yoshimura, Peter Wu, Jiatong Shi, Takaaki Saeki, Yooncheol Ju, Yusuke Yasuda, Shinnosuke Takamichi, Shinji Watanabe:
ESPnet2-TTS: Extending the Edge of TTS Research. CoRR abs/2110.07840 (2021)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-12460
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-12460
Robin Karlsson, Tomoki Hayashi, Keisuke Fujii, Alexander Carballo, Kento Ohtani, Kazuya Takeda:
ViCE: Self-Supervised Visual Concept Embeddings as Contextual and Pixel Appearance Invariant Semantic Representations. CoRR abs/2111.12460 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2112-09382
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-09382
Jing Shi, Xuankai Chang, Tomoki Hayashi, Yen-Ju Lu, Shinji Watanabe, Bo Xu:
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem. CoRR abs/2112.09382 (2021)
2020
[j5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/WuTKHT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/WuTKHT20
Yi-Chiao Wu, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda:
Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression. IEEE Access 8: 62094-62106 (2020)
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/InagumaKDKYHW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/InagumaKDKYHW20
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Yalta, Tomoki Hayashi, Shinji Watanabe:
ESPnet-ST: All-in-One Speech Translation Toolkit. ACL (demo) 2020: 302-311
[c46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/HuangH0T20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/HuangH0T20
Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda:
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS. Blizzard Challenge / Voice Conversion Challenge 2020
[c45]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/MiyazakiKHWTT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/MiyazakiKHWTT20
Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda:
Conformer-Based Sound Event Detection with Semi-Supervised Learning and Data Augmentation. DCASE 2020: 100-104
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MiyazakiKH0TT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MiyazakiKH0TT20
Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda:
Weakly-Supervised Sound Event Detection with Self-Attention. ICASSP 2020: 66-70
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YoshimuraHT020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YoshimuraHT020
Takenori Yoshimura, Tomoki Hayashi, Kazuya Takeda, Shinji Watanabe:
End-to-End Automatic Speech Recognition Integrated with CTC-Based Voice Activity Detection. ICASSP 2020: 6999-7003
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TobingWHKT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TobingWHKT20
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Efficient Shallow Wavenet Vocoder Using Multiple Samples Output Based on Laplacian Distribution and Linear Prediction. ICASSP 2020: 7204-7208
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InoueHAHY020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InoueHAHY020
Katsuki Inoue, Sunao Hara, Masanobu Abe, Tomoki Hayashi, Ryuichi Yamamoto, Shinji Watanabe:
Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models. ICASSP 2020: 7634-7638
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HayashiYIY0TTZT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HayashiYIY0TTZT20
Tomoki Hayashi, Ryuichi Yamamoto, Katsuki Inoue, Takenori Yoshimura, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Yu Zhang, Xu Tan:
Espnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit. ICASSP 2020: 7654-7658
[c39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuHOKT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuHOKT20
Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda:
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-Autoregressive Pitch-Dependent Dilated Convolution Model for Parametric Speech Generation. INTERSPEECH 2020: 3535-3539
[c38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HikosakaSHKTBT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HikosakaSHKTBT20
Shu Hikosaka, Shogo Seki, Tomoki Hayashi, Kazuhiro Kobayashi, Kazuya Takeda, Hideki Banno, Tomoki Toda:
Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment. INTERSPEECH 2020: 4059-4063
[c37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangHWKT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangHWKT20
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda:
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining. INTERSPEECH 2020: 4676-4680
[c36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TobingHWKT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TobingHWKT20
Patrick Lumban Tobing, Tomoki Hayashi, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda:
Cyclic Spectral Modeling for Unsupervised Unit Discovery into Voice Conversion with Excitation and Waveform Modeling. INTERSPEECH 2020: 4861-4865
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-00551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-00551
Takenori Yoshimura, Tomoki Hayashi, Kazuya Takeda, Shinji Watanabe:
End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection. CoRR abs/2002.00551 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2003-11750
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-11750
Yi-Chiao Wu, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda:
Non-parallel Voice Conversion System with WaveNet Vocoder and Collapsed Speech Suppression. CoRR abs/2003.11750 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2004-10234
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-10234
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Enrique Yalta Soplin, Tomoki Hayashi, Shinji Watanabe:
ESPnet-ST: All-in-One Speech Translation Toolkit. CoRR abs/2004.10234 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-05525
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-05525
Tomoki Hayashi, Shinji Watanabe:
DiscreTalk: Text-to-Speech as a Machine Translation Problem. CoRR abs/2005.05525 (2020)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08654
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08654
Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda:
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation. CoRR abs/2005.08654 (2020)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-05663
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-05663
Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda:
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network. CoRR abs/2007.05663 (2020)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-12955
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-12955
Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda:
Quasi-Periodic Parallel WaveGAN: A Non-autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network. CoRR abs/2007.12955 (2020)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-03088
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-03088
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda:
Pretraining Techniques for Sequence-to-Sequence Voice Conversion. CoRR abs/2008.03088 (2020)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-02434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02434
Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda:
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS. CoRR abs/2010.02434 (2020)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12231
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12231
Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi, Tomoki Toda:
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations. CoRR abs/2010.12231 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-13956
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-13956
Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang:
Recent Developments on ESPnet Toolkit Boosted by Conformer. CoRR abs/2010.13956 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-03706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-03706
Chenda Li, Jing Shi, Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Naoyuki Kamo, Moto Hira, Tomoki Hayashi, Christoph Böddeker, Zhuo Chen, Shinji Watanabe:
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration. CoRR abs/2011.03706 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-13006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-13006
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang:
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans. CoRR abs/2012.13006 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/TobingWHKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/TobingWHKT19
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder. IEEE Access 7: 171114-171125 (2019)
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/TobingHT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/TobingHT19
Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda:
Investigation of Shallow Wavenet Vocoder with Laplacian Distribution Output. ASRU 2019: 176-183
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KaritaWWYZCHHIJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KaritaWWYZCHHIJ19
Shigeki Karita, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto:
A Comparative Study on Transformer vs RNN in Speech Applications. ASRU 2019: 449-456
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SegawaHT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SegawaHT19
Osamu Segawa, Tomoki Hayashi, Kazuya Takeda:
Attention-Based Speech Recognition Using Gaze Information. ASRU 2019: 465-470
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/HuangWHTHKTTW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HuangWHTHKTTW19
Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. EUSIPCO 2019: 1-5
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KomatsuHKTT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KomatsuHKTT19
Tatsuya Komatsu, Tomoki Hayashi, Reishi Kondo, Tomoki Toda, Kazuya Takeda:
Scene-dependent Anomalous Acoustic-event Detection Based on Conditional Wavenet and I-vector. ICASSP 2019: 870-874
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriAHZWR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriAHZWR19
Takaaki Hori, Ramón Fernandez Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux:
Cycle-consistency Training for End-to-end Speech Recognition. ICASSP 2019: 6271-6275
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TobingWHKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TobingWHKT19
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder. ICASSP 2019: 6815-6819
[c28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuHTKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuHTKT19
Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda:
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation. INTERSPEECH 2019: 196-200
[c27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TobingWHKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TobingWHKT19
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder. INTERSPEECH 2019: 674-678
[c26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangWLTHKT0W19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangWLTHKT0W19
Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion. INTERSPEECH 2019: 709-713
[c25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HayashiWTTTL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HayashiWTTTL19
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Shubham Toshniwal, Karen Livescu:
Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis. INTERSPEECH 2019: 4430-4434
[c24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/WuTHKT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/WuTHKT19
Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Statistical Voice Conversion with Quasi-periodic WaveNet Vocoder. SSW 2019: 63-68
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-00615
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-00615
Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion. CoRR abs/1905.00615 (2019)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-00797
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-00797
Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda:
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation. CoRR abs/1907.00797 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-08940
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-08940
Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Statistical Voice Conversion with Quasi-Periodic WaveNet Vocoder. CoRR abs/1907.08940 (2019)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-10185
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-10185
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder. CoRR abs/1907.10185 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-06317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-06317
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang:
A Comparative Study on Transformer vs RNN in Speech Applications. CoRR abs/1909.06317 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10909
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10909
Tomoki Hayashi, Ryuichi Yamamoto, Katsuki Inoue, Takenori Yoshimura, Shinji Watanabe, Tomoki Toda, Kazuya Takeda, Yu Zhang, Xu Tan:
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit. CoRR abs/1910.10909 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1912-06813
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-06813
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda:
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining. CoRR abs/1912.06813 (2019)
2018
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/HayashiNKTT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/HayashiNKTT18
Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Tomoki Toda, Kazuya Takeda:
Daily Activity Recognition with Large-Scaled Real-Life Recording Datasets Based on Deep Neural Network Using Multi-Modal Signals. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 101-A(1): 199-210 (2018)
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/MiyazakiHTT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/MiyazakiHTT18
Koichi Miyazaki, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda:
Connectionist Temporal Classification-based Sound Event Encoder for Converting Sound Events into Onomatopoeic Representations. EUSIPCO 2018: 852-856
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/HayashiKKTT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HayashiKKTT18
Tomoki Hayashi, Tatsuya Komatsu, Reishi Kondo, Tomoki Toda, Kazuya Takeda:
Anomalous Sound Event Detection Based on WaveNet. EUSIPCO 2018: 2494-2498
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HayashiWTT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HayashiWTT18
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda:
Multi-Head Decoder for End-to-End Speech Recognition. INTERSPEECH 2018: 801-805
[c20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuKHTT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuKHTT18
Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Hayashi, Patrick Lumban Tobing, Tomoki Toda:
Collapsed Speech Segment Detection and Suppression for WaveNet Vocoder. INTERSPEECH 2018: 1988-1992
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WatanabeHKHNUSH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WatanabeHKHNUSH18
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. INTERSPEECH 2018: 2207-2211
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/WuTHKT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/WuTHKT18
Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018. Odyssey 2018: 211-218
[c17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/odyssey/TobingWHKT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/TobingWHKT18
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
NU Voice Conversion System for the Voice Conversion Challenge 2018. Odyssey 2018: 219-226
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TobingHWKT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TobingHWKT18
Patrick Lumban Tobing, Tomoki Hayashi, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda:
An Evaluation of Deep Spectral Mappings and WaveNet Vocoder for Voice Conversion. SLT 2018: 297-303
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/HayashiWZTHAT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/HayashiWZTHAT18
Tomoki Hayashi, Shinji Watanabe, Yu Zhang, Tomoki Toda, Takaaki Hori, Ramón Fernandez Astudillo, Kazuya Takeda:
Back-Translation-Style Data Augmentation for end-to-end ASR. SLT 2018: 426-433
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00015
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. CoRR abs/1804.00015 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-08050
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-08050
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda:
Multi-Head Decoder for End-to-End Speech Recognition. CoRR abs/1804.08050 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-11055
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-11055
Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Hayashi, Patrick Lumban Tobing, Tomoki Toda:
Collapsed speech segment detection and suppression for WaveNet vocoder. CoRR abs/1804.11055 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1807-10893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-10893
Tomoki Hayashi, Shinji Watanabe, Yu Zhang, Tomoki Toda, Takaaki Hori, Ramón Fernandez Astudillo, Kazuya Takeda:
Back-Translation-Style Data Augmentation for End-to-End ASR. CoRR abs/1807.10893 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-01690
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-01690
Takaaki Hori, Ramón Fernandez Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux:
Cycle-consistency training for end-to-end speech recognition. CoRR abs/1811.01690 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-11078
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-11078
Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. CoRR abs/1811.11078 (2018)
2017
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/WatanabeHKHH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/WatanabeHKHH17
Shinji Watanabe, Takaaki Hori, Suyoun Kim, John R. Hershey, Tomoki Hayashi:
Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. IEEE J. Sel. Top. Signal Process. 11(8): 1240-1253 (2017)
[j1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/HayashiWTHRT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HayashiWTHRT17
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda:
Duration-Controlled LSTM for Polyphonic Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(11): 2059-2070 (2017)
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TamamoriHTT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TamamoriHTT17
Akira Tamamori, Tomoki Hayashi, Tomoki Toda, Kazuya Takeda:
An investigation of recurrent neural network for daily activity recognition using multi-modal signals. APSIPA 2017: 1334-1340
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HayashiTKTT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HayashiTKTT17
Tomoki Hayashi, Akira Tamamori, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda:
An investigation of multi-speaker training for wavenet vocoder. ASRU 2017: 712-718
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HayashiWTHRT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HayashiWTHRT17
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda:
BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection. ICASSP 2017: 766-770
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TamamoriHKTT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TamamoriHKTT17
Akira Tamamori, Tomoki Hayashi, Kazuhiro Kobayashi, Kazuya Takeda, Tomoki Toda:
Speaker-Dependent WaveNet Vocoder. INTERSPEECH 2017: 1118-1122
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KobayashiHTT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KobayashiHTT17
Kazuhiro Kobayashi, Tomoki Hayashi, Akira Tamamori, Tomoki Toda:
Statistical Voice Conversion with WaveNet-Based Waveform Generation. INTERSPEECH 2017: 1138-1142
2016
[c9]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/HayashiWTHRT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/HayashiWTHRT16
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda:
Bidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection. DCASE 2016: 35-39
2015
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/HayashiNKT15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/HayashiNKT15
Tomoki Hayashi, Masafumi Nishida, Norihide Kitaoka, Kazuya Takeda:
Daily activity recognition based on DNN using environmental sound and acceleration signals. EUSIPCO 2015: 2306-2310
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ArakiHDFTN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ArakiHDFTN15
Shoko Araki, Tomoki Hayashi, Marc Delcroix, Masakiyo Fujimoto, Kazuya Takeda, Tomohiro Nakatani:
Exploring multi-channel features for denoising-autoencoder-based speech enhancement. ICASSP 2015: 116-120
2014
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/KitaokaHT14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/KitaokaHT14
Norihide Kitaoka, Tomoki Hayashi, Kazuya Takeda:
Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features. APSIPA 2014: 1-5
2013
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/siggraph/HatanakaHSSTK13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/siggraph/HatanakaHSSTK13
Tomomi Hatanaka, Tomoki Hayashi, Keita Suzuki, Hiroaki Sawano, Takeshi Tsuchiya, Kei'ichi Koyanagi:
Dream board: a visualization system by handwriting recognition. SIGGRAPH ASIA Posters 2013: 22
[c4]
- no documents available
  - no references & citations available
- export record
  dblp key:
  - conf/visapp/ShimizuYHSS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/visapp/ShimizuYHSS13
Naoki Shimizu, Takumi Yoshida, Tomoki Hayashi, François de Sorbier, Hideo Saito:
Non-rigid Surface Tracking for Virtual Fitting System. VISAPP (2) 2013: 12-18
2012
[c3]
- no documents available
  - no references & citations available
- export record
  dblp key:
  - conf/visapp/HayashiSS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/visapp/HayashiSS12
Tomoki Hayashi, François de Sorbier, Hideo Saito:
Texture Overlay onto Non-rigid Surface using Commodity Depth Camera. VISAPP (2) 2012: 66-71
2011
[c2]
- view
  - electronic edition @ mva-org.jp (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/mva/HayashiRNS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mva/HayashiRNS11
Tomoki Hayashi, Benjamin Raynal, Vincent Nozick, Hideo Saito:
Skeleton Features Distribution for 3D Object Retrieval. MVA 2011: 377-380
2010
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/HayashiUPS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/HayashiUPS10
Tomoki Hayashi, Hideaki Uchiyama, Julien Pilet, Hideo Saito:
An Augmented Reality Setup with an Omnidirectional Camera Based on Multiple Object Detection. ICPR 2010: 3171-3174

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.