18. SPECOM 2016: Budapest, Hungary
- Andrey Ronzhin, Rodmonga Potapova, Géza Németh:
Speech and Computer - 18th International Conference, SPECOM 2016, Budapest, Hungary, August 23-27, 2016, Proceedings. Lecture Notes in Computer Science 9811, Springer 2016, ISBN 978-3-319-43957-0
Invited Talks
- Ralf Schlüter, Patrick Doetsch, Pavel Golik, Markus Kitza, Tobias Menne, Kazuki Irie, Zoltán Tüske, Albert Zeyer:
Automatic Speech Recognition Based on Neural Networks. 3-17
- Nick Campbell:
Machine Processing of Dialogue States; Speculations on Conversational Entropy. 18-25
- Attila Vékony:
Speech Recognition Challenges in the Car Navigation Industry. 26-40
Conference Papers
- Elena E. Lyakso, Olga V. Frolova, Aleksey Grigorev:
A Comparison of Acoustic Features of Speech of Typically Developing Children and Children with Autism Spectrum Disorders. 43-50
- Mohamed S. Elaraby, Mustafa Abdallah, Sherif M. Abdou, Mohsen A. Rashwan:
A Deep Neural Networks (DNN) Based Models for a Computer Aided Pronunciation Learning System. 51-58
- Tijana Delic, Branislav Gerazov, Branislav M. Popovic, Milan Secujski:
A Linguistic Interpretation of the Atom Decomposition of Fundamental Frequency Contour for American English. 59-66
- Edvin Pakoci, Branislav M. Popovic, Niksa Jakovljevic, Darko Pekar, Fathy Yassa:
A Phonetic Segmentation Procedure Based on Hidden Markov Models. 67-74
- Yuyun Huang, Emer Gilmartin, Benjamin R. Cowan, Nick Campbell:
A Preliminary Exploration of Group Social Engagement Level Recognition in Multiparty Casual Conversation. 75-83
- Branislav Gerazov, Philip N. Garner:
An Agonist-Antagonist Pitch Production Model. 84-91
- Darko Pekar, Sinisa Suzic, Robert Mak, Meir Friedlander, Milan Secujski:
An Algorithm for Phase Manipulation in a Speech Signal. 92-99
- Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Gregory Y. Martynenko:
An Exploratory Study on Sociolinguistic Variation of Russian Everyday Speech. 100-107
- László Tóth, Gábor Gosztolya:
Adaptation of DNN Acoustic Models Using KL-divergence Regularization and Multi-task Training. 108-115
- Ivan Medennikov, Alexey Prudnikov:
Advances in STC Russian Spontaneous Speech Recognition System. 116-123
- Andrey Shulipa, Sergey Novoselov, Aleksandr Melnikov:
Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance. 124-130
- Alexander Sepúlveda-Sepúlveda, Germán Castellanos-Domínguez:
Assessment of the Relation Between Low-Frequency Features and Velum Opening by Using Real Articulatory Data. 131-139
- András Beke, György Szaszák:
Automatic Summarization of Highly Spontaneous Speech. 140-147
- Michimasa Inaba, Kenichi Takahashi:
Backchanneling via Twitter Data for Conversational Dialogue Systems. 148-155
- Alexey A. Petrovsky, Vadzim Herasimovich, Alexander A. Petrovsky:
Bio-Inspired Sparse Representation of Speech and Audio Using Psychoacoustic Adaptive Matching Pursuit. 156-164
- György Szaszák, Máté Ákos Tündik, Branislav Gerazov, Aleksandar Gjoreski:
Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech. 165-173
- Konstantin Simonchik, Sergey Novoselov, Galina Lavrentyeva:
Comparative Analysis of Classifiers for Automatic Language Recognition in Spontaneous Speech. 174-181
- Lucie Skorkovská:
Comparison of Retrieval Approaches and Blind Relevance Feedback Methods Within the Czech Speech Information Retrieval. 182-190
- Marek Hrúz, Marie Kunesová:
Convolutional Neural Network in the Task of Speaker Change Detection. 191-198
- Milan Secujski, Branislav Gerazov, Tamás Gábor Csapó, Vlado Delic, Philip N. Garner, Aleksandar Gjoreski, David Guennec, Zoran A. Ivanovski, Aleksandar Melov, Géza Németh, Ana Stojkovic, György Szaszák:
Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer. 199-206
- Markéta Juzová, Daniel Tihelka, Jindrich Matousek:
Designing High-Coverage Multi-level Text Corpus for Non-professional-voice Conservation. 207-215
- Kseniya Proença, Kris Demuynck, Dirk Van Compernolle:
Designing Syllable Models for an HMM Based Speech Recognition System. 216-223
- Vasilisa Verkhodanova, Vladimir Shapranov:
Detecting Filled Pauses and Lengthenings in Russian Spontaneous Speech Using SVM. 224-231
- Gábor Gosztolya:
Detecting Laughter and Filler Events by Time Series Smoothing with Genetic Algorithms. 232-239
- Denis Gordeev:
Detecting State of Aggression in Sentences Using CNN. 240-245
- Irina S. Kipyatkova, Alexey Karpov:
DNN-Based Acoustic Modeling for Russian Speech Recognition Using Kaldi. 246-253
- Péter Nagy, Géza Németh:
DNN-Based Duration Modeling for Synthesizing Short Sentences. 254-261
- Olga V. Frolova, Elena E. Lyakso:
Emotional Speech of 3-Years Old Children: Norm-Risk-Deprivation. 262-270
- Bálint Pál Tóth, Kornél István Kis, György Szaszák, Géza Németh:
Ensemble Deep Neural Network Based Waveform-Driven Stress Model for Speech Synthesis. 271-278
- Hunor Nagy, György Wersényi:
Evaluation of Response Times on a Touch Screen Using Stereo Panned Speech Command Auditory Feedback. 279-286
- Evgeny Kostyuchenko, Roman V. Mescheryakov, Dariya Ignatieva, Alexander Pyatkov, Evgeny L. Choinzonov, Lidiya N. Balatskaya:
Evaluation of the Speech Quality During Rehabilitation After Surgical Treatment of the Cancer of Oral Cavity and Oropharynx Based on a Comparison of the Fourier Spectra. 287-295
- Daniel Tihelka, Martin Gruber, Markéta Juzová:
Experiments with One-Class Classifier as a Predictor of Spectral Discontinuities in Unit Concatenation. 296-303
- Natalia A. Tomashenko, Yuri Y. Khokhlov, Anthony Larcher, Yannick Estève:
Exploring GMM-derived Features for Unsupervised Adaptation of Deep Neural Network Acoustic Models. 304-311
- Maxim Korenevsky, Aleksei Romanenko:
Feature Space VTS with Phase Term Modeling. 312-320
- Evgeniy Shuranov, Aleksandr Lavrentyev, Alexey Kozlyaev, Galina Lavrentyeva, Valeriya Volkovaya:
Finding Speaker Position Under Difficult Acoustic Conditions. 321-327
- Evaldas Vaiciukynas, Antanas Verikas, Adas Gelzinis, Marija Bacauskiene, Kestutis Vaskevicius, Virgilijus Uloza, Evaldas Padervinskis, Jolita Ciceliene:
Fusing Various Audio Feature Sets for Detection of Parkinson's Disease from Sustained Voice and Speech Recordings. 328-337
- Vasilisa Verkhodanova, Alexander L. Ronzhin, Irina S. Kipyatkova, Denis Ivanko, Alexey Karpov, Milos Zelezný:
HAVRUS Corpus: High-Speed Recordings of Audio-Visual Russian Speech. 338-345
- Alexander V. Smirnov, Alexey M. Kashevnik, Igor Lashkov:
Human-Smartphone Interaction for Dangerous Situation Detection and Recommendation Generation While Driving. 346-353
- Marvin Coto-Jiménez, John Goddard Close, Fabiola Martínez Licona:
Improving Automatic Speech Recognition Containing Additive Noise Using Deep Denoising Autoencoders of LSTM Networks. 354-361
- Maxim Korenevsky, Ivan Medennikov, Vadim Shchemelinin:
Improving the Quality of Automatic Speech Recognition in Trucks. 362-369
- Chitralekha Bhat, Bhavik Vachhani, Sunil Kumar Kopparapu:
Improving Recognition of Dysarthric Speech Using Severity Based Tempo Adaptation. 370-377
- Iosif Mporas, Saeid Safavi, Reza Sotudeh:
Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities. 378-385
- Bálint Pál Tóth, Balázs Szórádi, Géza Németh:
Improvements to Prosodic Variation in Long Short-Term Memory Based Intonation Models Using Random Forest. 386-394
- André Mansikkaniemi, Mikko Kurimo, Krister Lindén:
In-Document Adaptation for a Human Guided Automatic Transcription Service. 395-402
- Anastasiia Spirina, Olesia Vaskovskaia, Maxim Sidorov, Alexander Schmitt:
Interaction Quality as a Human-Human Task-Oriented Conversation Performance. 403-410
- Zbynek Zajíc, Marie Kunesová, Vlasta Radová:
Investigation of Segmentation in i-Vector Based Speaker Diarization of Telephone Speech. 411-418
- Victor Budkov, Irina V. Vatamaniuk, Vladimir V. Basov, Daniyar Volf:
Investigation of Speech Signal Parameters Reflecting the Truth of Transmitted Information. 419-426
- Sai Sirisha Rallabandi, Sai Krishna Rallabandi, Naina Teertha, R. Kumaraswamy, Suryakanth V. Gangashetty:
Investigating Signal Correlation as Continuity Metric in a Syllable Based Unit Selection Synthesis System. 427-434
- Andrei Smirnov, Valentin Mendelev:
Knowledge Transfer for Utterance Classification in Low-Resource Languages. 435-442
- Maxim Tkachenko, Alexander Yamshinin, Nikolay Lyubimov, Mikhail Kotov, Marina Nastasenko:
Language Identification Using Time Delay Neural Network D-Vector on Short Utterances. 443-449
- Swaran Lata, Swati Arora, Simerjeet Kaur:
Lexical Stress in Punjabi and Its Representation in PLS. 450-460
- Anton Stepikhov, Anastassia Loukina:
Low Inter-Annotator Agreement in Sentence Boundary Detection and Annotator Personality. 461-468
- Ivan Medennikov, Anna Bulusheva:
LSTM-Based Language Models for Spontaneous Speech Recognition. 469-475
- Michelina Savino, Loredana Lapertosa, Alessandro O. Caffò, Mario Refice:
Measuring Prosodic Entrainment in Italian Collaborative Game-Based Dialogues. 476-483
- Mikhail Stolbov, Sergei Aleinik:
Microphone Array Directivity Improvement in Low-Frequency Band for Speech Processing. 484-490
- Olga Blinova:
Modeling Imperative Utterances in Russian Spoken Dialogue: Verb-Central Quantitative Approach. 491-498
- Rodmonga Potapova, Liliya Komalova:
Multimodal Perception of Aggressive Behavior. 499-506
- Rodmonga Potapova, Vsevolod Potapov:
On Individual Polyinformativity of Speech and Voice Regarding Speakers Auditive Attribution (Forensic Phonetic Aspect). 507-514
- Gerasimos Arvanitis, Konstantinos Moustakas, Nikos Fakotakis:
Online Biometric Identification with Face Analysis in Web Applications. 515-522
- Sergei Aleinik:
Optimization of Zelinski Post-filtering Calculation. 523-530
- Vera Evdokimova, Pavel A. Skrelin, Andrey Barabanov, Karina Evgrafova:
Phonetic Aspects of High Level of Naturalness in Speech Synthesis. 531-538
- Rodmonga Potapova, Vsevolod Potapov:
Polybasic Attribution of Social Network Discourse. 539-546
- Andrey Barabanov, Valentin V. Magerkin, Evgenij Vikulov:
Precise Estimation of Harmonic Parameter Trend and Modification of a Speech Signal. 547-554
- Tatiana Litvinova, Olga Zagorovskaya, Olga Litvinova, Pavel Seredin:
Profiling a Set of Personality Traits of a Text's Author: A Corpus-Based Approach. 555-562
- Izzad Ramli, Noraini Seman, Norizah Ardi, Nursuriati Jamil:
Prosody Analysis of Malay Language Storytelling Corpus. 563-570
- Michael Maruschke, Oliver Jokisch, Martin Meszaros, Franziska Trojahn, M. Hoffmann:
Quality Assessment of Two Fullband Audio Codecs Supporting Real-Time Communication. 571-579
- Surasak Boonkla, Masashi Unoki, Stanislav S. Makhanov:
Robust Speech Analysis Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Noisy Environments. 580-587
- Irina V. Vatamaniuk, Dmitriy Levonevskiy, Anton I. Saveliev, Alexander Denisov:
Scenarios of Multimodal Information Navigation Services for Users in Cyberphysical Environment. 588-595
- Andrey Shulipa, Sergey Novoselov, Yuri Matveev:
Scores Calibration in Speaker Recognition Systems. 596-603
- Lukás Bures, Ludek Müller:
Selecting Keypoint Detector and Descriptor Combination for Augmented Reality Application. 604-612
- Elena Bulgakova, Aleksey Sholohov:
Semi-automatic Speaker Verification System Based on Analysis of Formant, Durational and Pitch Characteristics. 613-619
- Aleksei Romanenko, Valentin Mendelev:
Speaker-Dependent Bottleneck Features for Egyptian Arabic Speech Recognition. 620-626
- Tatiana Y. Sherstinova:
Speech Acts Annotation of Everyday Conversations in the ORD Corpus of Spoken Russian. 627-635
- Mikhail Stolbov, Alexander Lavrentyev:
Speech Enhancement with Microphone Array Using a Multi Beam Adaptive Noise Suppressor. 636-644
- Ivan Rakhmanenko, Roman V. Meshcheryakov:
Speech Features Evaluation for Small Set Automatic Speaker Verification Using GMM-UBM System. 645-650
- Stamatis Karlos, Nikos Fazakis, Katerina Karanikola, Sotiris B. Kotsiantis, Kyriakos N. Sgarbas:
Speech Recognition Combining MFCCs and Image Features. 651-658
- Natalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Olga Ermolova, Ekaterina Baeva, Gregory Y. Martynenko, Anastassia Ryko:
Sociolinguistic Extension of the ORD Corpus of Russian Everyday Speech. 659-666
- Miklós Gábriel Tulics, Ferenc Kazinczi, Klára Vicsi:
Statistical Analysis of Acoustical Parameters in the Voice of Children with Juvenile Dysphonia. 667-674
- Róbert Sabo, Milan Rusko, Andrej Ridzik, Jakub Rajcáni:
Stress, Arousal, and Stress Detector Trained on Acted Speech Database. 675-682
- Yuto Tanaka, Mitsunori Mizumachi, Yoshihisa Nakatoh:
Study on the Improvement of Intelligibility for Elderly Speech Using Formant Frequency Shift Method. 683-690
- Ksenia Oskina:
Text Classification in the Domain of Applied Linguistics as Part of a Pre-editing Module for Machine Translation Systems. 691-698
- Nina B. Volskaya, Tatiana Kachkovskaia:
Tonal Specification of Perceptually Prominent Non-nuclear Pitch Accents in Russian. 699-705
- Zdenek Krnoul, Pavel Jedlicka, Jakub Kanis, Milos Zelezný:
Toward Sign Language Motion Capture Dataset Building. 706-713
- Andrey Barabanov, Aleksandr Melnikov:
Trade-Off Between Speed and Accuracy for Noise Variance Minimization (NVM) Pitch Estimation Algorithm. 714-721
- Varvara Krayvanova, Svetlana Duka:
Unsupervised Trained Functional Discourse Parser for e-Learning Materials Scaffolding. 722-728