Yushi Aono
2020 – today
- 2024
- [j4] Fanglu Xie, Motohiro Takagi, Hitoshi Seshimo, Yushi Aono:
Refining Line Art From Stroke Style Disentanglement With Diffusion Models. IEEE Access 12: 9526-9535 (2024)
- [c47] Tsukasa Shiota, Motohiro Takagi, Kaori Kumagai, Hitoshi Seshimo, Yushi Aono:
Egocentric Action Recognition by Capturing Hand-Object Contact and Object State. WACV 2024: 6527-6537
- 2022
- [c46] Shoichiro Takeda, Kenta Niwa, Mariko Isogawa, Shinya Shimizu, Kazuki Okami, Yushi Aono:
Bilateral Video Magnification Filter. CVPR 2022: 17348-17357
- 2021
- [j3] Ryo Ishii, Ryuichiro Higashinaka, Koh Mitsuda, Taichi Katayama, Masahiro Mizukami, Junji Tomita, Hidetoshi Kawabata, Emi Yamaguchi, Noritake Adachi, Yushi Aono:
Methods for Efficiently Constructing Text-dialogue-agent System using Existing Anime Characters. J. Inf. Process. 29: 30-44 (2021)
- 2020
- [j2] Atsushi Ando, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono, Tomoki Toda:
Customer Satisfaction Estimation in Contact Center Calls Based on a Hierarchical Multi-Task Model. IEEE ACM Trans. Audio Speech Lang. Process. 28: 715-728 (2020)
- [c45] Ryo Ishii, Ryuichiro Higashinaka, Koh Mitsuda, Taichi Katayama, Masahiro Mizukami, Junji Tomita, Hidetoshi Kawabata, Emi Yamaguchi, Noritake Adachi, Yushi Aono:
Methods of Efficiently Constructing Text-Dialogue-Agent System Using Existing Anime Character. HCI (45) 2020: 328-347
- [c44] Toshiki Onishi, Arisa Yamauchi, Ryo Ishii, Yushi Aono, Akihiro Miyata:
Analyzing Nonverbal Behaviors along with Praising. ICMI 2020: 609-613
- [c43] Takashi Kodama, Ryuichiro Higashinaka, Koh Mitsuda, Ryo Masumura, Yushi Aono, Ryuta Nakamura, Noritake Adachi, Hidetoshi Kawabata:
Generating Responses that Reflect Meta Information in User-Generated Question Answer Pairs. LREC 2020: 5433-5441
- [c42] Hideharu Nakajima, Yushi Aono:
Collection and Analyses of Exemplary Speech Data to Establish Easy-to-Understand Speech Synthesis for Japanese Elderly Adults. O-COCOSDA 2020: 145-150
2010 – 2019
- 2019
- [j1] Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent out-of-vocabulary word detection based on distribution of features. Comput. Speech Lang. 58: 247-259 (2019)
- [c41] Hiroshi Sato, Takafumi Moriya, Yusuke Shinohara, Ryo Masumura, Takaaki Fukutomi, Kiyoaki Matsui, Takanori Ashihara, Yoshikazu Yamaguchi, Yushi Aono:
Revisiting Dynamic Adjustment of Language Model Scaling Factor for Automatic Speech Recognition. APSIPA 2019: 186-191
- [c40] Ryo Masumura, Yusuke Ijima, Satoshi Kobashikawa, Takanobu Oba, Yushi Aono:
Can We Simulate Generative Process of Acoustic Modeling Data? Towards Data Restoration for Acoustic Modeling. APSIPA 2019: 655-661
- [c39] Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:
Likability Estimation of Call-center Agents by Suppressing Annotator Variability. APSIPA 2019: 911-916
- [c38] Hosana Kamiyama, Atsushi Ando, Ryo Masumura, Satoshi Kobashikawa, Yushi Aono:
Urgent Voicemail Detection Focused on Long-term Temporal Variation. APSIPA 2019: 917-921
- [c37] Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Takanobu Oba, Yushi Aono:
Disfluency Detection Based on Speech-Aware Token-by-Token Sequence Labeling with BLSTM-CRFs and Attention Mechanisms. APSIPA 2019: 1009-1013
- [c36] Ryo Masumura, Kiyoaki Matsui, Yuma Koizumi, Takaaki Fukutomi, Takanobu Oba, Yushi Aono:
Context-Aware Neural Voice Activity Detection Using Auxiliary Networks for Phoneme Recognition, Speech Enhancement and Acoustic Scene Classification. EUSIPCO 2019: 1-5
- [c35] Ryo Masumura, Tomohiro Tanaka, Takafumi Moriya, Yusuke Shinohara, Takanobu Oba, Yushi Aono:
Large Context End-to-end Automatic Speech Recognition via Extension of Hierarchical Recurrent Encoder-decoder Models. ICASSP 2019: 5661-5665
- [c34] Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Hosana Kamiyama, Takanobu Oba, Satoshi Kobashikawa, Yushi Aono:
Improving Conversation-Context Language Models with Multiple Spoken Language Understanding Models. INTERSPEECH 2019: 834-838
- [c33] Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Takanobu Oba, Yushi Aono:
A Joint End-to-End and DNN-HMM Hybrid Automatic Speech Recognition System with Transferring Sharable Knowledge. INTERSPEECH 2019: 2210-2214
- [c32] Atsushi Ando, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono:
Speech Emotion Recognition Based on Multi-Label Emotion Existence Model. INTERSPEECH 2019: 2818-2822
- [c31] Takanori Ashihara, Yusuke Shinohara, Hiroshi Sato, Takafumi Moriya, Kiyoaki Matsui, Takaaki Fukutomi, Yoshikazu Yamaguchi, Yushi Aono:
Neural Whispered Speech Detection with Imbalanced Learning. INTERSPEECH 2019: 3352-3356
- [c30] Takafumi Moriya, Jian Wang, Tomohiro Tanaka, Ryo Masumura, Yusuke Shinohara, Yoshikazu Yamaguchi, Yushi Aono:
Joint Maximization Decoder with Neural Converters for Fully Neural Network-Based Japanese Speech Recognition. INTERSPEECH 2019: 4410-4414
- [c29] Satoshi Kobashikawa, Atushi Odakura, Takao Nakamura, Takeshi Mori, Kimitaka Endo, Takafumi Moriya, Ryo Masumura, Yushi Aono, Nobuaki Minematsu:
Does Speaking Training Application with Speech Recognition Motivate Junior High School Students in Actual Classroom? - A Case Study. SLaTE 2019: 119-123
- 2018
- [c28] Tomohiro Tanaka, Ryo Masumura, Takafumi Moriya, Yushi Aono:
Neural Speech-to-Text Language Models for Rescoring Hypotheses of DNN-HMM Hybrid Automatic Speech Recognition Systems. APSIPA 2018: 196-200
- [c27] Ryo Masumura, Setsuo Yamada, Tomohiro Tanaka, Atsushi Ando, Hosana Kamiyama, Yushi Aono:
Online Call Scene Segmentation of Contact Center Dialogues based on Role Aware Hierarchical LSTM-RNNs. APSIPA 2018: 811-815
- [c26] Takafumi Moriya, Ryo Masumura, Taichi Asami, Yusuke Shinohara, Marc Delcroix, Yoshikazu Yamaguchi, Yushi Aono:
Progressive Neural Network-based Knowledge Transfer in Acoustic Models. APSIPA 2018: 998-1002
- [c25] Ryo Masumura, Suguru Kabashima, Takafumi Moriya, Satoshi Kobashikawa, Yoshikazu Yamaguchi, Yushi Aono:
Relevant Phonetic-aware Neural Acoustic Models using Native English and Japanese Speech for Japanese-English Automatic Speech Recognition. APSIPA 2018: 1435-1439
- [c24] Ryo Masumura, Tomohiro Tanaka, Ryuichiro Higashinaka, Hirokazu Masataki, Yushi Aono:
Multi-task and Multi-lingual Joint Learning of Neural Lexical Utterance Classification based on Partially-shared Modeling. COLING 2018: 3586-3596
- [c23] Ryo Masumura, Yusuke Shinohara, Ryuichiro Higashinaka, Yushi Aono:
Adversarial Training for Multi-task and Multi-lingual Joint Modeling of Utterance Intent Classification. EMNLP 2018: 633-639
- [c22] Atsushi Ando, Satoshi Kobashikawa, Hosana Kamiyama, Ryo Masumura, Yusuke Ijima, Yushi Aono:
Soft-Target Training with Ambiguous Emotional Utterances for DNN-Based Speech Emotion Classification. ICASSP 2018: 4964-4968
- [c21] Tomohiro Tanaka, Ryo Masumura, Hirokazu Masataki, Yushi Aono:
Neural Error Corrective Language Models for Automatic Speech Recognition. INTERSPEECH 2018: 401-405
- [c20] Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Hirokazu Masataki, Yushi Aono:
Role Play Dialogue Aware Language Models Based on Conditional Hierarchical Recurrent Encoder-Decoder. INTERSPEECH 2018: 1259-1263
- [c19] Tsukasa Yoshida, Takafumi Moriya, Kazuho Watanabe, Yusuke Shinohara, Yoshikazu Yamaguchi, Yushi Aono:
Automatic DNN Node Pruning Using Mixture Distribution-based Group Regularization. INTERSPEECH 2018: 1269-1273
- [c18] Atsushi Ando, Reine Asakawa, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono:
Automatic Question Detection from Acoustic and Phonetic Features Using Feature-wise Pre-training. INTERSPEECH 2018: 1731-1735
- [c17] Takafumi Moriya, Sei Ueno, Yusuke Shinohara, Marc Delcroix, Yoshikazu Yamaguchi, Yushi Aono:
Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition. INTERSPEECH 2018: 2399-2403
- [c16] Sei Ueno, Takafumi Moriya, Masato Mimura, Shinsuke Sakai, Yusuke Shinohara, Yoshikazu Yamaguchi, Yushi Aono, Tatsuya Kawahara:
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition. INTERSPEECH 2018: 2424-2428
- [c15] Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Ryo Ishii, Ryuichiro Higashinaka, Yushi Aono:
Neural Dialogue Context Online End-of-Turn Detection. SIGDIAL Conference 2018: 224-228
- [c14] Takafumi Moriya, Hiroki Kanagawa, Kiyoaki Matsui, Takaaki Fukutomi, Yusuke Shinohara, Yoshikazu Yamaguchi, Manabu Okamoto, Yushi Aono:
Efficient Building Strategy with Knowledge Distillation for Small-Footprint Acoustic Models. SLT 2018: 21-28
- 2017
- [c13] Hosana Kamiyama, Atsushi Ando, Satoshi Kobashikawa, Yushi Aono:
Robust children and adults speech identification and confidence measure based on DNN posteriorgram. APSIPA 2017: 502-505
- [c12] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:
Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling. APSIPA 2017: 1588-1591
- [c11] Taichi Asami, Ryo Masumura, Yoshikazu Yamaguchi, Hirokazu Masataki, Yushi Aono:
Domain adaptation of DNN acoustic models using knowledge distillation. ICASSP 2017: 5185-5189
- [c10] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:
Parallel phonetically aware DNNs and LSTM-RNNS for frame-by-frame discriminative modeling of spoken language identification. ICASSP 2017: 5260-5264
- [c9] Ruo Zhang, Atsushi Ando, Satoshi Kobashikawa, Yushi Aono:
Interaction and Transition Model for Speech Emotion Recognition in Dialogue. INTERSPEECH 2017: 1094-1097
- [c8] Atsushi Ando, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono:
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls. INTERSPEECH 2017: 1716-1720
- 2016
- [c7] Atsushi Ando, Taichi Asami, Yoshikazu Yamaguchi, Yushi Aono:
Speaker recognition in duration-mismatched condition using bootstrapped i-vectors. APSIPA 2016: 1-4
- [c6] Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent Out-of-Vocabulary Word Detection Using Distribution of Features. INTERSPEECH 2016: 1320-1324
- [c5] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono, Sumitaka Sakauchi:
Language Identification Based on Generative Modeling of Posteriorgram Sequences Extracted from Frame-by-Frame DNNs and LSTM-RNNs. INTERSPEECH 2016: 3275-3279
2000 – 2009
- 2000
- [c4]Osamu Ishikawa, Yushi Aono, Haruhiro Katayose, Seiji Inokuchi:
Extraction of Musical Performance Rules Using a Modified Algorithm of Multiple Regression Analysis. ICMC 2000
1990 – 1999
- 1998
- [c3] Yushi Aono, Haruhiro Katayose, Seiji Inokuchi:
A Real-time Session Composer with Acoustic Polyphonic Instruments. ICMC 1998
- 1995
- [c2] Yushi Aono, Haruhiro Katayose, Seiji Inokuchi:
An Improvisational Accompaniment System Observing Performer's Musical Gesture. ICMC 1995
- [c1] Tsutomu Kanamori, Haruhiro Katayose, Yushi Aono, Seiji Inokuchi, Takashi Sakaguchi:
Sensor Integration for Interactive Digital Art. ICMC 1995
last updated on 2024-08-05 20:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license