Sakriani Sakti
2020 – today
- 2024
- [j53]Bimasena Putra, Kurniawati Azizah, Candy Olivia Mawalim, Ikhlasul Akmal Hanif, Sakriani Sakti, Chee Wee Leong, Shogo Okada:
MAG-BERT-ARL for Fair Automated Video Interview Assessment. IEEE Access 12: 145188-145205 (2024) - [j52]Kei Furukawa, Takeshi Kishiyama, Satoshi Nakamura, Sakriani Sakti:
Applying Syntax-Prosody Mapping Hypothesis and Boundary-Driven Theory to Neural Sequence-to-Sequence Speech Synthesis. IEEE Access 12: 160896-160917 (2024) - [j51]Yuka Ko, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura:
Neural End-To-End Speech Translation Leveraged by ASR Posterior Distribution. IEICE Trans. Inf. Syst. 107(10): 1322-1331 (2024) - [c215]Mushaffa Rasyid Ridha, Sakriani Sakti:
Refining rtMRI Landmark-Based Vocal Tract Contour Labels with FCN-Based Smoothing and Point-to-Curve Projection. LREC/COLING 2024: 13796-13802 - [e4]Nicoletta Calzolari, Min-Yen Kan, Véronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC/COLING 2024, 20-25 May, 2024, Torino, Italy. ELRA and ICCL 2024, ISBN 978-2-493814-10-4 [contents] - [i36]Yuka Ko, Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Haotian Tan, Makoto Sakai, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech Translation System for IWSLT 2024. CoRR abs/2407.00826 (2024) - [i35]Haotian Tan, Sakriani Sakti:
Contrastive Feedback Mechanism for Simultaneous Speech Translation. CoRR abs/2407.20524 (2024) - [i34]Nick Rossenbach, Ralf Schlüter, Sakriani Sakti:
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition. CoRR abs/2407.21476 (2024) - 2023
- [j50]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input. IEEE Access 11: 22355-22363 (2023) - [c214]Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Fajri Koto, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Muhammad Satrio Wicaksono, Ivan Halim Parmonangan, Ika Alfina, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Hadiwijaya, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Haryo Akbarianto Wibowo, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Fatyanosa, Ziwei Ji, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Pascale Fung, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti:
NusaCrowd: Open Source Initiative for Indonesian NLP Resources. ACL (Findings) 2023: 13745-13818 - [c213]Sakriani Sakti, Benita Angela Titalim:
Leveraging the Multilingual Indonesian Ethnic Languages Dataset In Self-Supervised Models for Low-Resource ASR Task. ASRU 2023: 1-8 - [c212]Ruhiyah Widiaputri, Ayu Purwarianti, Dessi Puji Lestari, Kurniawati Azizah, Dipta Tanaya, Sakriani Sakti:
Speech Recognition and Meaning Interpretation: Towards Disambiguation of Structurally Ambiguous Spoken Utterances in Indonesian. EMNLP 2023: 16813-16824 - [c211]Jianan Chen, Sakriani Sakti:
An Isotropy Analysis for Self-Supervised Acoustic Unit Embeddings on the Zero Resource Speech Challenge 2021 Framework. ICASSP 2023: 1-5 - [c210]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition. ICASSP 2023: 1-5 - [c209]Shun Takahashi, Sakriani Sakti:
Unsupervised Learning of Discrete Latent Representations with Data-Adaptive Dimensionality from Continuous Speech Streams. INTERSPEECH 2023: 416-420 - [c208]Chung Tran, Chi Mai Luong, Sakriani Sakti:
STEN-TTS: Improving Zero-shot Cross-Lingual Transfer for Multi-Lingual TTS with Style-Enhanced Normalization Diffusion Framework. INTERSPEECH 2023: 4464-4468 - [c207]Ryo Fukuda, Yuta Nishikawa, Yasumasa Kano, Yuka Ko, Tomoya Yanagita, Kosuke Doi, Mana Makinae, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech-to-speech Translation System for IWSLT 2023. IWSLT@ACL 2023: 330-340 - [c206]Bella Septina Ika Hartanti, Dipta Tanaya, Kurniawati Azizah, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti:
Generating Speech with Prosodic Prominence based on SSL-Visually Grounded Models. O-COCOSDA 2023: 1-6 - [c205]Hang Xi, Sakriani Sakti:
Exploring Difficulties Encountered by Professional Interpreters in Japanese-to-English and English-to-Japanese Simultaneous Translation. O-COCOSDA 2023: 1-6 - [i33]Heli Qi, Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
SpeeChain: A Speech Toolkit for Large-Scale Machine Speech Chain. CoRR abs/2301.02966 (2023) - 2022
- [j49]Fan Yang, Zheng Wang, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Tackling multiple object tracking with complicated motions - Re-designing the integration of motion and appearance. Image Vis. Comput. 124: 104514 (2022) - [j48]Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura:
Modeling Unsupervised Empirical Adaptation by DPGMM and DPGMM-RNN Hybrid Model to Extract Perceptual Features for Low-Resource ASR. IEEE ACM Trans. Audio Speech Lang. Process. 30: 901-916 (2022) - [j47]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
A Machine Speech Chain Approach for Dynamically Adaptive Lombard TTS in Static and Dynamic Noise Environments. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2673-2688 (2022) - [c204]Heli Qi, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing. INTERSPEECH 2022: 3413-3417 - [c203]Ryo Fukuda, Yuka Ko, Yasumasa Kano, Kosuke Doi, Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST Simultaneous Speech-to-Text Translation System for IWSLT 2022. IWSLT@ACL 2022: 286-292 - [c202]Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji, Andros Tjandra, Sakriani Sakti:
NIX-TTS: Lightweight and End-to-End Text-to-Speech Via Module-Wise Distillation. SLT 2022: 970-976 - [i32]Heli Qi, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing. CoRR abs/2205.06963 (2022) - [i31]Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition. CoRR abs/2206.00635 (2022) - [i30]Fan Yang, Norimichi Ukita, Sakriani Sakti, Satoshi Nakamura:
Actor-identified Spatiotemporal Action Detection - Detecting Who Is Doing What in Videos. CoRR abs/2208.12940 (2022) - [i29]Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura:
Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval. CoRR abs/2211.14515 (2022) - [i28]Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Noor Fatyanosa, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti:
NusaCrowd: Open Source Initiative for Indonesian NLP Resources. CoRR abs/2212.09648 (2022) - 2021
- [j46]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
End-to-End Image-to-Speech Generation for Untranscribed Unknown Languages. IEEE Access 9: 55144-55154 (2021) - [j45]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing. IEEE Access 9: 70286-70299 (2021) - [j44]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Code-Switching ASR and TTS Using Semisupervised Learning with Machine Speech Chain. IEICE Trans. Inf. Syst. 104-D(10): 1661-1677 (2021) - [j43]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Neural Incremental Speech Recognition Toward Real-Time Machine Speech Translation. IEICE Trans. Inf. Syst. 104-D(12): 2195-2208 (2021) - [j42]Fan Yang, Xin Chang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
ReMOT: A model-agnostic refinement for multiple object tracking. Image Vis. Comput. 106: 104091 (2021) - [j41]Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura:
Tackling Perception Bias in Unsupervised Phoneme Discovery Using DPGMM-RNN Hybrid Model and Functional Load. IEEE ACM Trans. Audio Speech Lang. Process. 29: 348-362 (2021) - [j40]Fan Yang, Yang Wu, Zheng Wang, Xiang Li, Sakriani Sakti, Satoshi Nakamura:
Instance-Level Heterogeneous Domain Adaptation for Limited-Labeled Sketch-to-Photo Retrieval. IEEE Trans. Multim. 23: 2347-2360 (2021) - [c201]Shun Takahashi, Sakriani Sakti, Satoshi Nakamura:
Unsupervised Neural-Based Graph Clustering for Variable-Length Speech Representation Discovery of Zero-Resource Languages. Interspeech 2021: 1559-1563 - [c200]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Weakly-Supervised Speech-to-Text Mapping with Visually Connected Non-Parallel Speech-Text Data Using Cyclic Partially-Aligned Transformer. Interspeech 2021: 2257-2261 - [c199]Hirotaka Tokuyama, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Transcribing Paralinguistic Acoustic Cues to Target Language Text in Transformer-Based Speech-to-Text Translation. Interspeech 2021: 2262-2266 - [c198]Yuka Ko, Katsuhito Sudoh, Sakriani Sakti, Satoshi Nakamura:
ASR Posterior-Based Loss for Multi-Task End-to-End Speech Translation. Interspeech 2021: 2272-2276 - [c197]Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura:
Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder. Interspeech 2021: 4124-4128 - [c196]Sara Asai, Koichiro Yoshino, Seitaro Shinagawa, Sakriani Sakti, Satoshi Nakamura:
Eliciting Cooperative Persuasive Dialogue by Multimodal Emotional Robot. IWSDS 2021: 143-158 - [c195]Ryo Fukuda, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task. IWSLT 2021: 39-45 - [c194]Nobuya Tachimori, Sakriani Sakti, Satoshi Nakamura:
Multi-Encoder Sequential Attention Network for Context-Aware Speech Recognition in Japanese Dialog Conversation. O-COCOSDA 2021: 1-6 - [c193]Ryo Fukuda, Sashi Novitasari, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Tomoya Yanagita, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTS. O-COCOSDA 2021: 186-192 - [c192]Nobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura:
Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech Synthesis. O-COCOSDA 2021: 206-211 - [c191]Bin Wu, Sakriani Sakti, Satoshi Nakamura:
Incorporating Discriminative DPGMM Posteriorgrams for Low-Resource ASR. SLT 2021: 201-208 - [c190]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Transformer-Based Direct Speech-To-Speech Translation with Transcoder. SLT 2021: 958-965 - 2020
- [j39]Seitaro Shinagawa, Koichiro Yoshino, Seyed Hossein Alavi, Kallirroi Georgila, David R. Traum, Sakriani Sakti, Satoshi Nakamura:
An Interactive Image Editing System Using an Uncertainty-Based Confirmation Strategy. IEEE Access 8: 98471-98480 (2020) - [j38]The Tung Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura:
Policy Reuse for Dialog Management Using Action-Relation Probability. IEEE Access 8: 159639-159649 (2020) - [j37]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recurrent Neural Network Compression Based on Low-Rank Tensor Representation. IEICE Trans. Inf. Syst. 103-D(2): 435-449 (2020) - [j36]Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Leveraging Neural Caption Translation with Visually Grounded Paraphrase Augmentation. IEICE Trans. Inf. Syst. 103-D(3): 674-683 (2020) - [j35]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain. IEEE ACM Trans. Audio Speech Lang. Process. 28: 976-989 (2020) - [j34]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1342-1355 (2020) - [j33]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Corrections to "Machine Speech Chain". IEEE ACM Trans. Audio Speech Lang. Process. 28: 1706 (2020) - [c189]Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Using Panoramic Videos for Multi-Person Localization and Tracking In A 3D Panoramic Coordinate. ICASSP 2020: 1863-1867 - [c188]Kazuki Tsunematsu, Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Neural Speech Completion. INTERSPEECH 2020: 2742-2746 - [c187]Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Combining Audio and Brain Activity for Predicting Speech Quality. INTERSPEECH 2020: 2762-2766 - [c186]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening While Speaking in Real-Time. INTERSPEECH 2020: 4372-4376 - [c185]Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2020: Discovering Discrete Subword and Word Units. INTERSPEECH 2020: 4831-4835 - [c184]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. INTERSPEECH 2020: 4851-4855 - [c183]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework. INTERSPEECH 2020: 4901-4905 - [c182]Sara Asai, Koichiro Yoshino, Seitaro Shinagawa, Sakriani Sakti, Satoshi Nakamura:
Emotional Speech Corpus for Persuasive Dialogue System. LREC 2020: 491-497 - [c181]Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura:
Towards Speech Entrainment: Considering ASR Information in Speaking Rate Variation of TTS Waveform Generation. O-COCOSDA 2020: 139-144 - [c180]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. SLTU-CCURL@LREC 2020: 131-138 - [e3]Dorothee Beermann, Laurent Besacier, Sakriani Sakti, Claudia Soria:
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, SLTU-CCURL@LREC 2020, Marseille, France, May 2020. European Language Resources association 2020, ISBN 979-10-95546-35-1 [contents] - [i27]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge. CoRR abs/2005.11676 (2020) - [i26]Fan Yang, Xin Chang, Chenyu Dang, Ziqiang Zheng, Sakriani Sakti, Satoshi Nakamura, Yang Wu:
ReMOTS: Self-Supervised Refining Multi-Object Tracking and Segmentation. CoRR abs/2007.03200 (2020) - [i25]Ewan Dunbar, Julien Karadayi, Mathieu Bernard, Xuan-Nga Cao, Robin Algayres, Lucas Ondel, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units. CoRR abs/2010.05967 (2020) - [i24]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework. CoRR abs/2011.02099 (2020) - [i23]Sashi Novitasari, Andros Tjandra, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time. CoRR abs/2011.02126 (2020) - [i22]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. CoRR abs/2011.02127 (2020) - [i21]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis. CoRR abs/2011.02128 (2020) - [i20]Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS. CoRR abs/2011.04845 (2020)
2010 – 2019
- 2019
- [j32]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-End Speech Recognition Sequence Training With Reinforcement Learning. IEEE Access 7: 79758-79769 (2019) - [j31]Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
A Framework for Knowing Who is Doing What in Aerial Surveillance Videos. IEEE Access 7: 93315-93325 (2019) - [j30]Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura:
Electroencephalogram-Based Single-Trial Detection of Language Expectation Violations in Listening to Speech. Frontiers Comput. Neurosci. 13: 15 (2019) - [j29]Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Neural Oscillation-Based Classification of Japanese Spoken Sentences During Speech Perception. IEICE Trans. Inf. Syst. 102-D(2): 383-391 (2019) - [j28]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Positive Emotion Elicitation in Chat-Based Dialogue Systems. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 866-877 (2019) - [c179]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening While Speaking and Visualizing: Improving ASR Through Multimodal Chain. ASRU 2019: 471-478 - [c178]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Neural Machine Translation with Acoustic Embedding. ASRU 2019: 578-584 - [c177]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech-to-Speech Translation Between Untranscribed Unknown Languages. ASRU 2019: 593-600 - [c176]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Zero-Shot Code-Switching ASR and TTS with Multilingual Machine Speech Chain. ASRU 2019: 964-971 - [c175]Holy Lovenia, Hiroki Tanaka, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Speech Artifact Removal from EEG Recordings of Spoken Word Production with Tensor Decomposition. ICASSP 2019: 1115-1119 - [c174]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-end Feedback Loss in Speech Chain Framework via Straight-through Estimator. ICASSP 2019: 6281-6285 - [c173]Marco Vetter, Sakriani Sakti, Satoshi Nakamura:
Cross-lingual Speech-based ToBI Label Generation Using Bidirectional LSTM. ICASSP 2019: 6620-6624 - [c172]Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2019: TTS Without T. INTERSPEECH 2019: 1088-1092 - [c171]Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019. INTERSPEECH 2019: 1118-1122 - [c170]Ivan Halim Parmonangan, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
Speech Quality Evaluation of Synthesized Japanese Speech Using EEG. INTERSPEECH 2019: 1228-1232 - [c169]Sashi Novitasari, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition. INTERSPEECH 2019: 3835-3839 - [c168]Koichiro Yoshino, Yukitoshi Murase, Nurul Lubis, Kyoshiro Sugiyama, Hiroki Tanaka, Sakriani Sakti, Shinnosuke Takamichi, Satoshi Nakamura:
Spoken Dialogue Robot for Watching Daily Life of Elderly People. IWSDS 2019: 141-146 - [c167]Fan Yang, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Make Skeleton-based Action Recognition Model Smaller, Faster and Better. MMAsia 2019: 31:1-31:6 - [c166]Sahoko Nakayama, Takatomo Kano, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Recognition and translation of code-switching speech utterances. O-COCOSDA 2019: 1-6 - [c165]Mayuko Okamato, Sakriani Sakti, Satoshi Nakamura:
Phoneme-level speaking rate variation on waveform generation using GAN-TTS. O-COCOSDA 2019: 1-7 - [c164]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Neural iTTS: Toward Synthesizing Speech in Real-time with End-to-end Neural Text-to-Speech Framework. SSW 2019: 183-188 - [i19]Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux:
The Zero Resource Speech Challenge 2019: TTS without T. CoRR abs/1904.11469 (2019) - [i18]Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019. CoRR abs/1905.11449 (2019) - [i17]Johanes Effendi, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
From Speech Chain to Multimodal Chain: Leveraging Cross-modal Data Augmentation for Semi-supervised Learning. CoRR abs/1906.00579 (2019) - [i16]Fan Yang, Sakriani Sakti, Yang Wu, Satoshi Nakamura:
Make Skeleton-based Action Recognition Model Smaller, Faster and Better. CoRR abs/1907.09658 (2019) - [i15]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech-to-speech Translation between Untranscribed Unknown Languages. CoRR abs/1910.00795 (2019) - [i14]Fan Yang, Feiran Li, Yang Wu, Sakriani Sakti, Satoshi Nakamura:
Using panoramic videos for multi-person localization and tracking in a 3D panoramic coordinate. CoRR abs/1911.10535 (2019) - 2018
- [j27]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Learning Supervised Feature Transformations on Zero Resources for Improved Acoustic Unit Discovery. IEICE Trans. Inf. Syst. 101-D(1): 205-214 (2018) - [j26]Nurul Lubis, Dessi Puji Lestari, Sakriani Sakti, Ayu Purwarianti, Satoshi Nakamura:
Construction of Spontaneous Emotion Corpus from Indonesian TV Talk Shows and Its Application on Multimodal Emotion Recognition. IEICE Trans. Inf. Syst. 101-D(8): 2092-2100 (2018) - [j25]Takatomo Kano, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
An end-to-end model for cross-lingual transformation of paralinguistic information. Mach. Transl. 32(4): 353-368 (2018) - [j24]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence Models for Emphasis Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 26(10): 1873-1883 (2018) - [j23]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Dirichlet Process Mixture of Mixtures Model for Unsupervised Subword Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 2027-2042 (2018) - [c163]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Eliciting Positive Emotion through Affect-Sensitive Dialogue Response Generation: A Neural Network Approach. AAAI 2018: 5293-5300 - [c162]Naoki Hosomi, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Deception Detection and Analysis in Spoken Dialogues based on FastText. APSIPA 2018: 139-142 - [c161]Masahiro Honda, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Detecting suppression of negative emotion by time series change of cerebral blood flow using fNIRS. BHI 2018: 398-401 - [c160]Hiroki Tanaka, Hiroki Watanabe, Hayato Maki, Sakriani Sakti, Satoshi Nakamura:
Single-Trial Detection of Semantic Anomalies From EEG During Listening to Spoken Sentences. EMBC 2018: 977-980 - [c159]Sashi Novitasari, Dessi Puji Lestari, Sakriani Sakti, Ayu Purwarianti:
Rude-Words Detection for Indonesian Speech Using Support Vector Machine. IALP 2018: 19-24 - [c158]Hayato Maki, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Graph Regularized Tensor Factorization for Single-Trial EEG Analysis. ICASSP 2018: 846-850 - [c157]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence ASR Optimization Via Reinforcement Learning. ICASSP 2018: 5829-5833 - [c156]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Tensor Decomposition for Compressing Recurrent Neural Network. IJCNN 2018: 1-8 - [c155]Takuma Mori, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing End-to-end ASR Networks by Tensor-Train Decomposition. INTERSPEECH 2018: 806-810 - [c154]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain with One-shot Speaker Adaptation. INTERSPEECH 2018: 887-891 - [c153]Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental TTS for Japanese Language. INTERSPEECH 2018: 902-906 - [c152]The Tung Nguyen, Koichiro Yoshino, Sakriani Sakti, Satoshi Nakamura:
Impact of Deception Information on Negotiation Dialog Management: A Case Study on Doctor-Patient Conversations. IWSDS 2018: 199-206 - [c151]Johanes Effendi, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Multi-paraphrase Augmentation to Leverage Neural Caption Translation. IWSLT 2018: 181-188 - [c150]Kaho Osamura, Takatomo Kano, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura:
Using Spoken Word Posterior Features in Neural Machine Translation. IWSLT 2018: 189-195 - [c149]Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura:
Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas. LREC 2018 - [c148]Koichiro Yoshino, Yoko Ishikawa, Masahiro Mizukami, Yu Suzuki, Sakriani Sakti, Satoshi Nakamura:
Dialogue Scenario Collection of Persuasive Dialogue with Emotional Expressions via Crowdsourcing. LREC 2018 - [c147]Sashi Novitasari, Quoc Truong Do, Sakriani Sakti, Dessi Puji Lestari, Satoshi Nakamura:
Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data. O-COCOSDA 2018: 37-42 - [c146]Sahoko Nakayama, Takatomo Kano, Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Japanese-English Code-Switching Speech Data Construction. O-COCOSDA 2018: 67-71 - [c145]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Unsupervised Counselor Dialogue Clustering for Positive Emotion Elicitation in Neural Dialogue System. SIGDIAL Conference 2018: 161-170 - [c144]Sahoko Nakayama, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech Chain for Semi-Supervised Learning of Japanese-English Code-Switching ASR and TTS. SLT 2018: 182-189 - [c143]Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
Adaptive Wavenet Vocoder for Residual Compensation in GAN-Based Voice Conversion. SLT 2018: 282-289 - [c142]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multi-Scale Alignment and Contextual History for Attention Mechanism in Sequence-to-Sequence Model. SLT 2018: 648-655 - [c141]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues. SLT 2018: 700-706 - [c140]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Optimizing Neural Response Generator with Emotional Impact Information. SLT 2018: 876-883 - [c139]Bin Wu, Sakriani Sakti, Jinsong Zhang, Satoshi Nakamura:
Optimizing DPGMM Clustering in Zero Resource Setting Based on Functional Load. SLTU 2018: 1-5 - [c138]Khumaisa Nur'Aini, Johanes Effendi, Sakriani Sakti, Mirna Adriani, Satoshi Nakamura:
Corpus Construction and Semantic Analysis of Indonesian Image Description. SLTU 2018: 42-46 - [i13]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation. CoRR abs/1802.06003 (2018) - [i12]Seitaro Shinagawa, Koichiro Yoshino, Sakriani Sakti, Yu Suzuki, Satoshi Nakamura:
Interactive Image Manipulation with Natural Language Instruction Commands. CoRR abs/1802.08645 (2018) - [i11]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Tensor Decomposition for Compressing Recurrent Neural Network. CoRR abs/1802.10410 (2018) - [i10]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain with One-shot Speaker Adaptation. CoRR abs/1803.10525 (2018) - [i9]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model. CoRR abs/1807.08280 (2018) - [i8]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator. CoRR abs/1810.13107 (2018) - 2017
- [j22]Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Preserving Word-Level Emphasis in Speech-to-Speech Translation. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 544-556 (2017) - [c137]Nurul Lubis, Michael Heck, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Processing negative emotions through social communication: Multimodal database construction and analysis. ACII 2017: 79-85 - [c136]Kazutaka Kubo, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of how to design control parameters for statistical voice timbre control. APSIPA 2017: 1520-1523 - [c135]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening while speaking: Speech chain by deep learning. ASRU 2017: 301-308 - [c134]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Attention-based Wav2Text with feature transfer learning. ASRU 2017: 309-315 - [c133]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Feature optimized DPGMM clustering for unsupervised subword modeling: A contribution to zerospeech 2017. ASRU 2017: 740-746 - [c132]Naoto Terasawa, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Tracking liking state in brain activity while watching multiple movies. ICMI 2017: 321-325 - [c131]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing. IJCNLP(1) 2017: 431-440 - [c130]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing recurrent neural network with tensor train. IJCNN 2017: 4451-4458 - [c129]Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti, Satoshi Nakamura:
Subject-Independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response During Speech Perception. INTERSPEECH 2017: 2431-2435 - [c128]Takatomo Kano, Sakriani Sakti, Satoshi Nakamura:
Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation. INTERSPEECH 2017: 2630-2634 - [c127]Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura:
Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis. INTERSPEECH 2017: 2640-2644 - [c126]Nurul Lubis, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura:
Eliciting Positive Emotional Impact in Dialogue Response Selection. IWSDS 2017: 135-148 - [c125]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Speech recognition features based on deep latent Gaussian models. MLSP 2017: 1-6 - [c124]Johanes Effendi, Sakriani Sakti, Satoshi Nakamura:
Creation of a multi-paraphrase corpus based on various elementary operations. O-COCOSDA 2017: 1-6 - [c123]Kohei Mukaihara, Sakriani Sakti, Satoshi Nakamura:
Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck Features. SPECOM 2017: 632-641 - [e2]Sakriani Sakti, Masao Utiyama:
Proceedings of the 14th International Conference on Spoken Language Translation, IWSLT 2017, Tokyo, Japan, December 14-15, 2017. International Workshop on Spoken Language Translation 2017 [contents] - [i7]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing Recurrent Neural Network with Tensor Train. CoRR abs/1705.08052 (2017) - [i6]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Local Monotonic Attention Mechanism for End-to-End Speech Recognition. CoRR abs/1705.08091 (2017) - [i5]Andros Tjandra, Sakriani Sakti, Ruli Manurung, Mirna Adriani, Satoshi Nakamura:
Gated Recurrent Neural Tensor Network. CoRR abs/1706.02222 (2017) - [i4]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Listening while Speaking: Speech Chain by Deep Learning. CoRR abs/1707.04879 (2017) - [i3]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Attention-based Wav2Text with Feature Transfer Learning. CoRR abs/1709.07814 (2017) - [i2]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Sequence-to-Sequence ASR Optimization via Reinforcement Learning. CoRR abs/1710.10774 (2017) - 2016
- [j21]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Enhancing Event-Related Potentials Based on Maximum a Posteriori Estimation with a Spatial Correlation Prior. IEICE Trans. Inf. Syst. 99-D(6): 1437-1446 (2016) - [j20]Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models. IEICE Trans. Inf. Syst. 99-D(10): 2490-2498 (2016) - [j19]Lasguido Nio, Sakriani Sakti, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Neural Network Approaches to Dialog Response Retrieval and Generation. IEICE Trans. Inf. Syst. 99-D(10): 2508-2517 (2016) - [j18]Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-Native Text-to-Speech Preserving Speaker Individuality Based on Partial Correction of Prosodic and Phonetic Characteristics. IEICE Trans. Inf. Syst. 99-D(12): 3132-3139 (2016) - [j17]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Learning cooperative persuasive dialogue policies using framing. Speech Commun. 84: 83-96 (2016) - [j16]Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 755-767 (2016) - [j15]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Teaching Social Communication Skills Through Human-Agent Interaction. ACM Trans. Interact. Intell. Syst. 6(2): 18:1-18:26 (2016) - [c122]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Automated social skills training with audiovisual information. EMBC 2016: 2262-2265 - [c121]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Removing noise from event-related potentials using a probabilistic generative model with grouped covariance matrices. EMBC 2016: 3728-3731 - [c120]Rui Hiraoka, Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Personalized unknown word detection in non-native language reading using eye gaze. ICMI 2016: 66-70 - [c119]Andros Tjandra, Sakriani Sakti, Ruli Manurung, Mirna Adriani, Satoshi Nakamura:
Gated Recurrent Neural Tensor Network. IJCNN 2016: 448-455 - [c118]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Supervised Learning of Acoustic Models in a Zero Resource Setting to Improve DPGMM Clustering. INTERSPEECH 2016: 1310-1314 - [c117]Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models. INTERSPEECH 2016: 2533-2537 - [c116]Satoshi Tsujioka, Sakriani Sakti, Koichiro Yoshino, Graham Neubig, Satoshi Nakamura:
Unsupervised Joint Estimation of Grapheme-to-Phoneme Conversion Systems and Acoustic Model Adaptation for Non-Native Speech Recognition. INTERSPEECH 2016: 3091-3095 - [c115]Quoc Truong Do, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training. INTERSPEECH 2016: 3196-3200 - [c114]Nurul Lubis, Randy Gomez, Sakriani Sakti, Keisuke Nakamura, Koichiro Yoshino, Satoshi Nakamura, Kazuhiro Nakadai:
Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition. LREC 2016 - [c113]Sakriani Sakti, Seiji Kawanishi, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura:
Deep bottleneck features and sound-dependent i-vectors for simultaneous recognition of speech and environmental sounds. SLT 2016: 35-42 - [c112]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Iterative training of a DPGMM-HMM acoustic unit recognizer in a zero resource scenario. SLT 2016: 57-63 - [c111]Michael Heck, Sakriani Sakti, Satoshi Nakamura:
Unsupervised Linear Discriminant Analysis for Supporting DPGMM Clustering in the Zero Resource Scenario. SLTU 2016: 73-79 - [e1]Sakriani Sakti, Mirna Adriani, Ayu Purwarianti, Laurent Besacier, Eric Castelli, Pascal Nocera:
SLTU-2016, 5th Workshop on Spoken Language Technologies for Under-resourced languages, 9-12 May 2016, Yogyakarta, Indonesia. Procedia Computer Science 81, Elsevier 2016 [contents] - 2015
- [j14]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
NOCOA+: Multimodal Computer-Based Training for Social and Communication Skills. IEICE Trans. Inf. Syst. 98-D(8): 1536-1544 (2015) - [j13]Philip Arthur, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Semantic Parsing of Ambiguous Input through Paraphrasing and Verification. Trans. Assoc. Comput. Linguistics 3: 571-584 (2015) - [c110]Masahiro Mizukami, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Linguistic Individuality Transformation for Spoken Language. IWSDS 2015: 129-143 - [c109]Fajri Koto, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
A Study on Natural Expressive Speech: Automatic Memorable Spoken Quote Detection. IWSDS 2015: 145-152 - [c108]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Evaluation of a Fully Automatic Cooperative Persuasive Dialogue System. IWSDS 2015: 153-167 - [c107]Takafumi Sasakura, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Unknown Word Detection Based on Event-Related Brain Desynchronization Responses. IWSDS 2015: 169-175 - [c106]Yuiko Tsunomori, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
An Analysis Towards Dialogue-Based Deception Detection. IWSDS 2015: 177-187 - [c105]Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Syntax-based Simultaneous Translation through Prediction of Unseen Syntactic Constituents. ACL (1) 2015: 198-207 - [c104]Akiva Miura, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Improving Pivot Translation by Remembering the Pivot. ACL (2) 2015: 573-577 - [c103]Andros Tjandra, Sakriani Sakti, Satoshi Nakamura, Mirna Adriani:
Stochastic Gradient Variational Bayes for deep learning-based ASR. ASRU 2015: 175-180 - [c102]Sakriani Sakti, Faiz Ilham, Graham Neubig, Tomoki Toda, Ayu Purwarianti, Satoshi Nakamura:
Incremental sentence compression using LSTM recurrent networks. ASRU 2015: 252-258 - [c101]Quoc Truong Do, Michael Heck, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function. ASRU 2015: 654-659 - [c100]Nurul Lubis, Sakriani Sakti, Graham Neubig, Koichiro Yoshino, Tomoki Toda, Satoshi Nakamura:
A study of social-affective communication: Automatic prediction of emotion triggers and responses in television talk shows. ASRU 2015: 777-783 - [c99]Masahiro Mizukami, Hideaki Kizuki, Toshio Nomura, Graham Neubig, Koichiro Yoshino, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Adaptive selection from multiple response candidates in example-based dialogue. ASRU 2015: 784-790 - [c98]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An Enhanced Electrolarynx with Automatic Fundamental Frequency Control based on Statistical Prediction. ASSETS 2015: 435-436 - [c97]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
An evaluation of EEG ocular artifact removal with a multi-channel wiener filter based on probabilistic generative model. EMBC 2015: 2775-2778 - [c96]Hayato Maki, Tomoki Toda, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
EEG signal enhancement using multi-channel wiener filter with a spatial correlation prior. ICASSP 2015: 2639-2643 - [c95]Andros Tjandra, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR. ICASSP 2015: 4525-4529 - [c94]Yuji Oshima, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-native speech synthesis preserving speaker individuality based on partial correction of prosodic and phonetic characteristics. INTERSPEECH 2015: 299-303 - [c93]Takashi Mieno, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Speed or accuracy? a study in evaluation of simultaneous speech translation. INTERSPEECH 2015: 2267-2271 - [c92]The Tung Nguyen, Graham Neubig, Hiroyuki Shindo, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
A latent variable model for joint pause prediction and dependency parsing. INTERSPEECH 2015: 2719-2723 - [c91]Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Statistical singing voice conversion based on direct waveform modification with global variance. INTERSPEECH 2015: 2754-2758 - [c90]Yusuke Tajiri, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Non-audible murmur enhancement based on statistical conversion using air- and body-conductive microphones in noisy environments. INTERSPEECH 2015: 2769-2773 - [c89]Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Articulatory controllable speech modification based on Gaussian mixture models with direct waveform modification using spectrum differential. INTERSPEECH 2015: 3350-3354 - [c88]Quoc Truong Do, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs. INTERSPEECH 2015: 3665-3669 - [c87]Sakriani Sakti, Oyunchimeg Shagdar, Fawzi Nashashibi, Satoshi Nakamura:
Context awareness and priority control for ITS based on automatic speech recognition. ITST 2015: 17-21 - [c86]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Hideki Negoro, Hidemi Iwasaka, Satoshi Nakamura:
Automated Social Skills Trainer. IUI 2015: 17-27 - [c85]Quoc Truong Do, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Improving translation of emphasis with pause prediction in speech-to-speech translation systems. IWSLT 2015 - [c84]Michael Heck, Quoc Truong Do, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
The NAIST English speech recognition system for IWSLT 2015. IWSLT (Evaluation Campaign) 2015 - [c83]Yusuke Oda, Hiroyuki Fudaba, Graham Neubig, Hideaki Hata, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Learning to Generate Pseudo-Code from Source Code Using Statistical Machine Translation (T). ASE 2015: 574-584 - [c82]Hiroyuki Fudaba, Yusuke Oda, Koichi Akabe, Graham Neubig, Hideaki Hata, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Pseudogen: A Tool to Automatically Generate Pseudo-Code from Source Code. ASE 2015: 824-829 - [c81]Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Ckylark: A More Robust PCFG-LA Parser. HLT-NAACL 2015: 41-45 - [c80]Nurul Lubis, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Construction and analysis of social-affective interaction corpus in English and Indonesian. O-COCOSDA/CASLRE 2015: 202-206 - [c79]Kyoshiro Sugiyama, Masahiro Mizukami, Graham Neubig, Koichiro Yoshino, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering. WMT@EMNLP 2015: 442-449 - [i1]Rafael E. Banchs, Sakriani Sakti, Etsuo Mizukami:
The Future of Human-Robot Spoken Dialogue: from Information Services to Virtual Assistants (NII Shonan Meeting 2015-7). NII Shonan Meet. Rep. 2015 (2015) - 2014
- [j12]Kazuhiro Kobayashi, Tomoki Toda, Hironori Doi, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Voice Timbre Control Based on Perceived Age in Singing Voice Conversion. IEICE Trans. Inf. Syst. 97-D(6): 1419-1428 (2014) - [j11]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation. IEICE Trans. Inf. Syst. 97-D(6): 1429-1437 (2014) - [j10]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Structured Adaptive Regularization of Weight Vectors for a Robust Grapheme-to-Phoneme Conversion Model. IEICE Trans. Inf. Syst. 97-D(6): 1468-1476 (2014) - [j9]Yu Tsao, Ting-Yao Hu, Sakriani Sakti, Satoshi Nakamura, Lin-Shan Lee:
Variable Selection Linear Regression for Robust Speech Recognition. IEICE Trans. Inf. Syst. 97-D(6): 1477-1487 (2014) - [j8]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Utilizing Human-to-Human Conversation Examples for a Multi Domain Chat-Oriented Dialog System. IEICE Trans. Inf. Syst. 97-D(6): 1497-1505 (2014) - [j7]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis. IEEE J. Sel. Top. Signal Process. 8(2): 239-250 (2014) - [c78]Yusuke Oda, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Optimizing Segmentation Strategies for Simultaneous Speech Translation. ACL (2) 2014: 551-556 - [c77]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Linguistic and Acoustic Features for Automatic Identification of Autism Spectrum Disorders in Children's Narrative. CLPsych@ACL 2014: 88-96 - [c76]Kazuhiro Kobayashi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Gender-dependent spectrum differential models for perceived age control based on direct waveform modification in singing voice conversion. APSIPA 2014: 1-4 - [c75]Fajri Koto, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
The use of semantic and acoustic features for open-domain TED talk summarization. APSIPA 2014: 1-4 - [c74]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Recursive neural network paraphrase identification for example-based dialog retrieval. APSIPA 2014: 1-4 - [c73]Sakriani Sakti, Yu Odagaki, Takafumi Sasakura, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
An event-related brain potential study on the impact of speech recognition errors. APSIPA 2014: 1-4 - [c72]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An inter-speaker evaluation through simulation of electrolarynx control based on statistical F0 prediction. APSIPA 2014: 1-4 - [c71]Sakura Tsuruta, Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An evaluation of target speech for a nonaudible murmur enhancement system in noisy environments. APSIPA 2014: 1-4 - [c70]Riki Yoshida, Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Unnecessary utterance detection for avoiding digressions in discussion. APSIPA 2014: 1-4 - [c69]Koichi Akabe, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Discriminative Language Models as a Tool for Machine Translation Error Analysis. COLING 2014: 1124-1132 - [c68]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Reinforcement Learning of Cooperative Persuasive Dialogue Policies using Framing. COLING 2014: 1706-1717 - [c67]Hoa Trong Vu, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Acquiring a Dictionary of Emotion-Provoking Events. EACL 2014: 128-132 - [c66]Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A postfilter to modify the modulation spectrum in HMM-based speech synthesis. ICASSP 2014: 290-294 - [c65]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Narrow Adaptive Regularization of weights for grapheme-to-phoneme conversion. ICASSP 2014: 2589-2593 - [c64]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement. ICASSP 2014: 4488-4492 - [c63]Kazuhiro Kobayashi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Regression approaches to perceptual age control in singing voice conversion. ICASSP 2014: 7904-7908 - [c62]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Direct F0 control of an electrolarynx based on statistical excitation feature prediction and its evaluation through simulation. INTERSPEECH 2014: 31-35 - [c61]Nozomi Jinbo, Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A hearing impairment simulation method using audiogram-based approximation of auditory characteristics. INTERSPEECH 2014: 490-494 - [c60]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Structured soft margin confidence weighted learning for grapheme-to-phoneme conversion. INTERSPEECH 2014: 1263-1267 - [c59]Sho Matsumiya, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Data-driven generation of text balloons based on linguistic and acoustic features of a comics-anime corpus. INTERSPEECH 2014: 1801-1805 - [c58]Patrick Lumban Tobing, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura, Ayu Purwarianti:
Articulatory controllable speech modification based on statistical feature mapping with Gaussian mixture models. INTERSPEECH 2014: 2298-2302 - [c57]Kazuhiro Kobayashi, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Statistical singing voice conversion with direct waveform modification based on the spectrum differential. INTERSPEECH 2014: 2514-2518 - [c56]Nurul Lubis, Sakriani Sakti, Graham Neubig, Tomoki Toda, Ayu Purwarianti, Satoshi Nakamura:
Emotion and Its Triggers in Human Spoken Dialogue: Recognition and Analysis. IWSDS 2014: 103-110 - [c55]Takuya Hiraoka, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Construction and Analysis of a Persuasive Dialogue Corpus. IWSDS 2014: 125-138 - [c54]Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Collection of a Simultaneous Translation Corpus for Comparative Analysis. LREC 2014: 670-673 - [c53]Sakriani Sakti, Keigo Kubo, Sho Matsumiya, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Fumihiro Adachi, Ryosuke Isotani:
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System. LREC 2014: 2639-2643 - [c52]Quoc Truong Do, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Collection and analysis of a Japanese-English emphasized speech corpora. O-COCOSDA 2014: 1-5 - [c51]Fajri Koto, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
Memorable spoken quote corpora of TED public speaking. O-COCOSDA 2014: 1-4 - [c50]Nurul Lubis, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti, Satoshi Nakamura:
Construction and analysis of Indonesian Emotional Speech Corpus. O-COCOSDA 2014: 1-5 - [c49]Masahiro Mizukami, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Building a free, general-domain paraphrase database for Japanese. O-COCOSDA 2014: 1-4 - [c48]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Conversation dialog corpora from television and movie scripts. O-COCOSDA 2014: 1-4 - [c47]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification. SLT 2014: 306-311 - [c46]Nurul Lubis, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti, Satoshi Nakamura:
Emotion recognition on Indonesian television talk shows. SLT 2014: 466-471 - [c45]Sakriani Sakti, Satoshi Nakamura:
Recent progress in developing grapheme-based speech recognition for Indonesian ethnic languages: Javanese, Sundanese, Balinese and Bataks. SLTU 2014: 46-52 - [c44]Yuto Hatakoshi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Rule-based Syntactic Preprocessing for Syntax-based Machine Translation. SSST@EMNLP 2014: 34-42 - 2013
- [j6]Sakriani Sakti, Michael Paul, Andrew M. Finch, Shinsuke Sakai, Thang Tat Vu, Noriyuki Kimura, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
A-STAR: Toward translating Asian spoken languages. Comput. Speech Lang. 27(2): 509-527 (2013) - [c43]Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura, Yuji Matsumoto, Ryosuke Isotani, Yukichi Ikeda:
Towards High-Reliability Speech Translation in the Medical Domain. NLPHealthcare@IJCNLP 2013: 22-29 - [c42]Takuya Hiraoka, Yuki Yamauchi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Dialogue management for leading the conversation in persuasive dialogue systems. ASRU 2013: 114-119 - [c41]Philip Arthur, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Inter-Sentence Features and Thresholded Minimum Error Rate Training: NAIST at CLEF 2013 QA4MRE. CLEF (Working Notes) 2013 - [c40]Philip Arthur, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
NAIST at the CLEF 2013 QA4MRE Pilot Task. CLEF (Working Notes) 2013 - [c39]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Modality and contextual differences in computer based non-verbal communication training. CogInfoCom 2013: 127-132 - [c38]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura:
Improvements to HMM-based speech synthesis based on parameter generation with rich context models. INTERSPEECH 2013: 364-368 - [c37]Kazuhiro Kobayashi, Hironori Doi, Tomoki Toda, Tomoyasu Nakano, Masataka Goto, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
An investigation of acoustic features for singing voice conversion based on perceptual age. INTERSPEECH 2013: 1057-1061 - [c36]Keigo Kubo, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Grapheme-to-phoneme conversion based on adaptive regularization of weight vectors. INTERSPEECH 2013: 1946-1950 - [c35]Takatomo Kano, Shinnosuke Takamichi, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Generalizing continuous-space translation of paralinguistic information. INTERSPEECH 2013: 2614-2618 - [c34]Masaya Ohgushi, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
An empirical comparison of joint optimization techniques for speech translation. INTERSPEECH 2013: 2619-2623 - [c33]Kou Tanaka, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion. INTERSPEECH 2013: 3067-3071 - [c32]Takuto Moriguchi, Tomoki Toda, Motoaki Sano, Hiroshi Sato, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion. INTERSPEECH 2013: 3072-3076 - [c31]Tomoki Fujita, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Simple, lexicalized choice of translation timing for simultaneous speech translation. INTERSPEECH 2013: 3487-3491 - [c30]Michael Heck, Sebastian Stüker, Sakriani Sakti, Alex Waibel, Satoshi Nakamura:
Incremental unsupervised training for university lecture recognition. IWSLT 2013 - [c29]Sakriani Sakti, Keigo Kubo, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
The NAIST English speech recognition system for IWSLT 2013. IWSLT (Evaluation Campaign) 2013 - [c28]Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
Constructing a speech translation system using simultaneous interpretation data. IWSLT 2013 - [c27]Sakriani Sakti, Satoshi Nakamura:
Towards language preservation: Design and collection of graphemically balanced and parallel speech corpora of Indonesian ethnic languages. O-COCOSDA/CASLRE 2013: 1-5 - [c26]Tatsuo Inukai, Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura:
Investigation of intra-speaker spectral parameter variation and its prediction towards improvement of spectral conversion metric. SSW 2013: 89-94 - 2012
- [j5]Hansjörg Hofmann, Sakriani Sakti, Chiori Hori, Hideki Kashioka, Satoshi Nakamura, Wolfgang Minker:
Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach. IEICE Trans. Inf. Syst. 95-D(8): 2084-2093 (2012) - [j4]Sakriani Sakti, Michael Paul, Andrew M. Finch, Xinhui Hu, Jinfu Ni, Noriyuki Kimura, Shigeki Matsuda, Chiori Hori, Yutaka Ashikari, Hisashi Kawai, Hideki Kashioka, Eiichiro Sumita, Satoshi Nakamura:
Distributed speech translation technologies for multiparty multilingual communication. ACM Trans. Speech Lang. Process. 9(2): 4:1-4:27 (2012) - [c25]Hiroki Tanaka, Sakriani Sakti, Graham Neubig, Tomoki Toda, Nick Campbell, Satoshi Nakamura:
Non-verbal cognitive skills and autistic conditions: An analysis and training tool. CogInfoCom 2012: 41-46 - [c24]Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai, Sakriani Sakti, Satoshi Nakamura:
An Evaluation of Parameter Generation Methods with Rich Context Models in HMM-Based Speech Synthesis. INTERSPEECH 2012: 1139-1142 - [c23]Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Mirna Adriani, Satoshi Nakamura:
Developing Non-goal Dialog System Based on Examples of Drama Television. IWSDS 2012: 355-361 - [c22]Graham Neubig, Kevin Duh, Masaya Ogushi, Takatomo Kano, Tetsuo Kiso, Sakriani Sakti, Tomoki Toda, Satoshi Nakamura:
The NAIST machine translation system for IWSLT2012. IWSLT 2012: 54-60 - [c21]Christian Saam, Christian Mohr, Kevin Kilgour, Michael Heck, Matthias Sperber, Keigo Kubo, Sebastian Stüker, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel:
The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation. IWSLT 2012: 87-90 - [c20]Michael Heck, Keigo Kubo, Matthias Sperber, Sakriani Sakti, Sebastian Stüker, Christian Saam, Kevin Kilgour, Christian Mohr, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel:
The KIT-NAIST (contrastive) English ASR system for IWSLT 2012. IWSLT 2012: 91-95 - [c19]Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
A method for translation of paralinguistic information. IWSLT 2012: 158-163 - 2011
- [c18]Shunta Ishii, Tomoki Toda, Hiroshi Saruwatari, Sakriani Sakti, Satoshi Nakamura:
Blind noise suppression for Non-Audible Murmur recognition with stereo signal processing. ASRU 2011: 494-499 - [c17]Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model. ICASSP 2011: 4664-4667 - [c16]Sakriani Sakti, Andrew M. Finch, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Conditional Random Fields for Modeling Korean Pronunciation Variation. IWSDS 2011: 49-55 - 2010
- [c15]Kazuhiko Abe, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Brazilian Portuguese acoustic model training based on data borrowing from other language. INTERSPEECH 2010: 861-864 - [c14]Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Utilizing a noisy-channel approach for Korean LVCSR. INTERSPEECH 2010: 1513-1516 - [c13]Sakriani Sakti, Andrew M. Finch, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
Korean pronunciation variation modeling with probabilistic Bayesian networks. IUCS 2010: 52-57 - [c12]Hansjörg Hofmann, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura, Wolfgang Minker:
Improving spontaneous English ASR using a joint-sequence pronunciation model. IUCS 2010: 58-61 - [c11]Hansjörg Hofmann, Sakriani Sakti, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura, Wolfgang Minker:
Sequence-Based Pronunciation Modeling Using a Noisy-Channel Approach. IWSDS 2010: 156-162
2000 – 2009
- 2009
- [b2]Sakriani Sakti, Satoshi Nakamura, Konstantin Markov, Wolfgang Minker:
Incorporating Knowledge Sources into Statistical Speech Recognition. Lecture Notes in Electrical Engineering 42, Springer 2009, ISBN 978-0-387-85829-6, pp. 1-59 [contents] - [c10]Sakriani Sakti, Noriyuki Kimura, Michael Paul, Chiori Hori, Eiichiro Sumita, Satoshi Nakamura, Jun Park, Chai Wutiwiwatchai, Bo Xu, Hammam Riza, Karunesh Arora, Chi Mai Luong, Haizhou Li:
The Asian network-based speech-to-speech translation system. ASRU 2009: 507-512 - [c9]Chiori Hori, Sakriani Sakti, Michael Paul, Noriyuki Kimura, Yutaka Ashikari, Ryosuke Isotani, Eiichiro Sumita, Satoshi Nakamura:
Network-based speech-to-speech translation. IWSLT 2009: 168 - 2008
- [b1]Sakriani Watiasri Sakti:
Incorporating knowledge into statistical acoustic models for spoken language dialog systems. University of Ulm, 2008, pp. 1-182 - [c8]Sakriani Sakti, Eka Kelana, Hammam Riza, Shinsuke Sakai, Konstantin Markov, Satoshi Nakamura:
Development of Indonesian Large Vocabulary Continuous Speech Recognition System within A-STAR Project. IJCNLP 2008: 19-24 - [c7]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
Probabilistic Pronunciation Variation Model Based on Bayesian Network for Conversational Speech Recognition. ISUC 2008: 405-410 - 2007
- [j3]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
Incorporating Knowledge Sources Into a Statistical Acoustic Model for Spoken Language Communication Systems. IEEE Trans. Computers 56(9): 1199-1211 (2007) - [c6]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
A method to integrate additional knowledge sources into HMM based on junction tree decomposition. EUSIPCO 2007: 2404-2408 - [c5]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
An HMM acoustic model incorporating various additional knowledge sources. INTERSPEECH 2007: 2117-2120 - 2006
- [j2]Sakriani Sakti, Satoshi Nakamura, Konstantin Markov:
Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework. IEICE Trans. Inf. Syst. 89-D(3): 946-953 (2006) - [j1]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
A Hybrid HMM/BN Acoustic Model Utilizing Pentaphone-Context Dependency. IEICE Trans. Inf. Syst. 89-D(3): 954-961 (2006) - [c4]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
Incorporation of Pentaphone-Context Dependency Based on Hybrid Hmm/Bn Acoustic Modeling Framework. ICASSP (1) 2006: 1177-1180 - [c3]Sakriani Sakti, Konstantin Markov, Satoshi Nakamura:
The use of Bayesian network for incorporating accent, gender and wide-context dependency information. INTERSPEECH 2006 - 2005
- [c2]Sakriani Sakti, Satoshi Nakamura, Konstantin Markov:
Incorporating a Bayesian wide phonetic context model for acoustic rescoring. INTERSPEECH 2005: 1629-1632 - 2004
- [c1]Sakriani Sakti, Arry Akhmad Arman, Satoshi Nakamura, Paulus Hutagaol:
Indonesian speech recognition for hearing and speaking impaired people. INTERSPEECH 2004: 1037-1040
last updated on 2024-11-18 20:46 CET by the dblp team
all metadata released as open data under CC0 1.0 license