default search action
Reinhold Häb-Umbach
Person information
- affiliation: University of Paderborn, Department of Electrical Engineering and Information Technology, Germany
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j41]Christoph Böddeker, Aswin Shanmugam Subramanian, Gordon Wichern, Reinhold Haeb-Umbach, Jonathan Le Roux:
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1185-1197 (2024) - [c183]Alexander Werning, Reinhold Haeb-Umbach:
Target-Specific Dataset Pruning for Compression of Audio Tagging Models. EUSIPCO 2024: 61-65 - [c182]Yuying Xie, Michael Kuhlmann, Frederik Rautenberg, Zheng-Hua Tan, Reinhold Haeb-Umbach:
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder. EUSIPCO 2024: 436-440 - [c181]Thilo von Neumann, Christoph Böddeker, Tobias Cord-Landwehr, Marc Delcroix, Reinhold Haeb-Umbach:
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. ICASSP Workshops 2024: 775-779 - [c180]Tobias Cord-Landwehr, Christoph Böddeker, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
Geodesic Interpolation of Frame-Wise Speaker Embeddings for the Diarization of Meeting Scenarios. ICASSP 2024: 11886-11890 - [c179]Tobias Gburrek, Adrian Meise, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. IWAENC 2024: 279-283 - [i47]Tobias Gburrek, Adrian Meise, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. CoRR abs/2408.14213 (2024) - 2023
- [j40]Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE ACM Trans. Audio Speech Lang. Process. 31: 576-589 (2023) - [c178]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. ACSSC 2023: 1399-1403 - [c177]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
On the Integration of Sampling Rate Synchronization and Acoustic Beamforming. EUSIPCO 2023: 11-15 - [c176]Tobias Cord-Landwehr, Christoph Böddeker, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
Frame-Wise and Overlap-Robust Speaker Embeddings for Meeting Diarization. ICASSP 2023: 1-5 - [c175]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023: 1-5 - [c174]Simon Berger, Peter Vieting, Christoph Böddeker, Ralf Schlüter, Reinhold Haeb-Umbach:
Mixture Encoder for Joint Speech Separation and Recognition. INTERSPEECH 2023: 3527-3531 - [c173]Tobias Cord-Landwehr, Christoph Böddeker, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
A Teacher-Student Approach for Extracting Informative Speaker Embeddings From Speech Mixtures. INTERSPEECH 2023: 4703-4707 - [c172]Fritz Seebauer, Michael Kuhlmann, Reinhold Haeb-Umbach, Petra Wagner:
Re-examining the quality dimensions of synthetic speech. SSW 2023: 34-40 - [i46]Christoph Böddeker, Aswin Shanmugam Subramanian, Gordon Wichern, Reinhold Haeb-Umbach, Jonathan Le Roux:
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. CoRR abs/2303.03849 (2023) - [i45]Tobias Cord-Landwehr, Christoph Böddeker, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
A Teacher-Student approach for extracting informative speaker embeddings from speech mixtures. CoRR abs/2306.00634 (2023) - [i44]Simon Berger, Peter Vieting, Christoph Böddeker, Ralf Schlüter, Reinhold Haeb-Umbach:
Mixture Encoder for Joint Speech Separation and Recognition. CoRR abs/2306.12173 (2023) - [i43]Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel:
Post-Processing Independent Evaluation of Sound Event Detection Systems. CoRR abs/2306.15440 (2023) - [i42]Thilo von Neumann, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. CoRR abs/2307.11394 (2023) - [i41]Joerg Schmalenstroeer, Tobias Gburrek, Reinhold Haeb-Umbach:
LibriWASN: A Data Set for Meeting Separation, Diarization, and Recognition with Asynchronous Recording Devices. CoRR abs/2308.10682 (2023) - [i40]Peter Vieting, Simon Berger, Thilo von Neumann, Christoph Böddeker, Ralf Schlüter, Reinhold Haeb-Umbach:
Mixture Encoder Supporting Continuous Speech Separation for Meeting Recognition. CoRR abs/2309.08454 (2023) - [i39]Thilo von Neumann, Christoph Böddeker, Tobias Cord-Landwehr, Marc Delcroix, Reinhold Haeb-Umbach:
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. CoRR abs/2309.16482 (2023) - [i38]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Spatial Diarization for Meeting Transcription with Ad-Hoc Acoustic Sensor Networks. CoRR abs/2311.15597 (2023) - 2022
- [j39]Christopher Grimm, Tai Fei, Ernst Warsitz, Ridha Farhoud, Tobias Breddermann, Reinhold Haeb-Umbach:
Warping of Radar Data Into Camera Image for Cross-Modal Supervision in Automotive Applications. IEEE Trans. Veh. Technol. 71(9): 9435-9449 (2022) - [c171]Jens Heitkaemper, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. EUSIPCO 2022: 289-293 - [c170]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-Varying Sampling Rate Offsets and Speaker Changes. ICASSP 2022: 916-920 - [c169]Janek Ebbers, Reinhold Haeb-Umbach, Romain Serizel:
Threshold Independent Evaluation of Sound Event Detection Scores. ICASSP 2022: 1021-1025 - [c168]Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022: 6022-6026 - [c167]Christoph Böddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach:
An Initialization Scheme for Meeting Separation with Spatial Mixture Models. INTERSPEECH 2022: 271-275 - [c166]Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Christoph Böddeker, Reinhold Haeb-Umbach:
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. INTERSPEECH 2022: 1486-1490 - [c165]Michael Kuhlmann, Fritz Seebauer, Janek Ebbers, Petra Wagner, Reinhold Haeb-Umbach:
Investigation into Target Speaking Rate Adaptation for Voice Conversion. INTERSPEECH 2022: 4930-4934 - [c164]Tobias Cord-Landwehr, Christoph Böddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
Monaural Source Separation: From Anechoic To Reverberant Environments. IWAENC 2022: 1-5 - [c163]Tobias Cord-Landwehr, Thilo von Neumann, Christoph Böddeker, Reinhold Haeb-Umbach:
MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator. IWAENC 2022: 1-5 - [c162]Tobias Gburrek, Joerg Schmalenstroeer, Jens Heitkaemper, Reinhold Haeb-Umbach:
Informed vs. Blind beamforming in AD-HOC Acoustic Sensor Networks for Meeting Transcription. IWAENC 2022: 1-5 - [i37]Janek Ebbers, Romain Serizel, Reinhold Haeb-Umbach:
Threshold Independent Evaluation of Sound Event Detection Scores. CoRR abs/2201.13148 (2022) - [i36]Christoph Böddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach:
An Initialization Scheme for Meeting Separation with Spatial Mixture Models. CoRR abs/2204.01338 (2022) - [i35]Tobias Gburrek, Christoph Böddeker, Thilo von Neumann, Tobias Cord-Landwehr, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. CoRR abs/2205.00944 (2022) - [i34]Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Christoph Böddeker, Reinhold Haeb-Umbach:
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. CoRR abs/2207.13888 (2022) - [i33]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems. CoRR abs/2211.16112 (2022) - 2021
- [j38]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Geometry calibration in wireless acoustic sensor networks utilizing DoA and distance information. EURASIP J. Audio Speech Music. Process. 2021(1): 25 (2021) - [j37]Reinhold Haeb-Umbach, Jahn Heymann, Lukas Drude, Shinji Watanabe, Marc Delcroix, Tomohiro Nakatani:
Far-Field Automatic Speech Recognition. Proc. IEEE 109(2): 124-148 (2021) - [j36]Katharina J. Rohlfing, Philipp Cimiano, Ingrid Scharlau, Tobias Matzner, Heike M. Buhl, Hendrik Buschmeier, Elena Esposito, Angela Grimminger, Barbara Hammer, Reinhold Häb-Umbach, Ilona Horwath, Eyke Hüllermeier, Friederike Kern, Stefan Kopp, Kirsten Thommes, Axel-Cyrille Ngonga Ngomo, Carsten Schulte, Henning Wachsmuth, Petra Wagner, Britta Wrede:
Explanation as a Social Practice: Toward a Conceptual Framework for the Social Design of AI Systems. IEEE Trans. Cogn. Dev. Syst. 13(3): 717-728 (2021) - [c161]Christoph Böddeker, Frederik Rautenberg, Reinhold Haeb-Umbach:
A Comparison and Combination of Unsupervised Blind Source Separation Techniques. ITG Conference on Speech Communication 2021: 1-5 - [c160]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
On Source-Microphone Distance Estimation Using Convolutional Recurrent Neural Networks. ITG Conference on Speech Communication 2021: 1-5 - [c159]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
Speeding Up Permutation Invariant Training for Source Separation. ITG Conference on Speech Communication 2021: 1-5 - [c158]Janek Ebbers, Reinhold Haeb-Umbach:
Self-Trained Audio Tagging and Sound Event Detection in Domestic Environments. DCASE 2021: 226-230 - [c157]Joerg Schmalenstroeer, Jens Heitkaemper, Joerg Ullmann, Reinhold Haeb-Umbach:
Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech. EUSIPCO 2021: 1-5 - [c156]Janek Ebbers, Moritz Curt Keyser, Reinhold Haeb-Umbach:
Adapting Sound Recognition to A New Environment Via Self-Training. EUSIPCO 2021: 1135-1139 - [c155]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. ICASSP 2021: 741-745 - [c154]Janek Ebbers, Michael Kuhlmann, Tobias Cord-Landwehr, Reinhold Haeb-Umbach:
Contrastive Predictive Coding Supported Factorized Variational Autoencoder For Unsupervised Learning Of Disentangled Speech Representations. ICASSP 2021: 3860-3864 - [c153]Wangyou Zhang, Christoph Böddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021: 6898-6902 - [c152]Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021: 8428-8432 - [c151]Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021: 3490-3494 - [i32]Wangyou Zhang, Christoph Böddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. CoRR abs/2102.11525 (2021) - [i31]Joerg Schmalenstroeer, Jens Heitkaemper, Joerg Ullmann, Reinhold Haeb-Umbach:
Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech. CoRR abs/2103.01599 (2021) - [i30]Janek Ebbers, Reinhold Haeb-Umbach:
Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-supervised Sound Event Detection. CoRR abs/2103.06581 (2021) - [i29]Thomas Glarner, Janek Ebbers, Reinhold Häb-Umbach:
Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery. CoRR abs/2105.01786 (2021) - [i28]Jens Heitkaemper, Joerg Schmalenstroeer, Joerg Ullmann, Valentin Ion, Reinhold Haeb-Umbach:
A Database for Research on Detection and Enhancement of Speech Transmitted over HF links. CoRR abs/2106.02472 (2021) - [i27]Christoph Böddeker, Frederik Rautenberg, Reinhold Haeb-Umbach:
A Comparison and Combination of Unsupervised Blind Source Separation Techniques. CoRR abs/2106.05627 (2021) - [i26]Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
Speeding Up Permutation Invariant Training for Source Separation. CoRR abs/2107.14445 (2021) - [i25]Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers. CoRR abs/2107.14446 (2021) - [i24]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
On Synchronization of Wireless Acoustic Sensor Networks in the Presence of Time-varying Sampling Rate Offsets and Speaker Changes. CoRR abs/2110.12820 (2021) - [i23]Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
SA-SDR: A novel loss function for separation of meeting style data. CoRR abs/2110.15581 (2021) - [i22]Tobias Cord-Landwehr, Christoph Böddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
Monaural source separation: From anechoic to reverberant environments. CoRR abs/2111.07578 (2021) - 2020
- [j35]Tomohiro Nakatani, Christoph Böddeker, Keisuke Kinoshita, Rintaro Ikeshita, Marc Delcroix, Reinhold Haeb-Umbach:
Jointly Optimal Denoising, Dereverberation, and Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2267-2282 (2020) - [c150]Janek Ebbers, Reinhold Haeb-Umbach:
Forward-Backward Convolutional Recurrent Neural Networks and Tag-Conditioned Convolutional Neural Networks for Weakly Labeled Semi-Supervised Sound Event Detection. DCASE 2020: 41-45 - [c149]Tobias Gburrek, Joerg Schmalenstroeer, Andreas Brendel, Walter Kellermann, Reinhold Haeb-Umbach:
Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Networks. EUSIPCO 2020: 196-200 - [c148]Christoph Böddeker, Tomohiro Nakatani, Keisuke Kinoshita, Reinhold Haeb-Umbach:
Jointly Optimal Dereverberation and Beamforming. ICASSP 2020: 216-220 - [c147]Jens Heitkaemper, Darius Jakobeit, Christoph Böddeker, Lukas Drude, Reinhold Haeb-Umbach:
Demystifying TasNet: A Dissecting Approach. ICASSP 2020: 6359-6363 - [c146]Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Böddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020: 7004-7008 - [c145]Jens Heitkaemper, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. INTERSPEECH 2020: 2597-2601 - [c144]Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. INTERSPEECH 2020: 2652-2656 - [c143]Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. INTERSPEECH 2020: 3097-3101 - [i21]Tomohiro Nakatani, Christoph Böddeker, Keisuke Kinoshita, Rintaro Ikeshita, Marc Delcroix, Reinhold Haeb-Umbach:
Jointly optimal denoising, dereverberation, and source separation. CoRR abs/2005.09843 (2020) - [i20]Jens Heitkaemper, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments. CoRR abs/2005.09913 (2020) - [i19]Janek Ebbers, Michael Kuhlmann, Reinhold Haeb-Umbach:
Adversarial Contrastive Predictive Coding for Unsupervised Learning of Disentangled Representations. CoRR abs/2005.12963 (2020) - [i18]Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR. CoRR abs/2006.02786 (2020) - [i17]Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-path RNN for hierarchical modeling of long sequential data and its application to speaker stream separation. CoRR abs/2006.13579 (2020) - [i16]Tobias Gburrek, Joerg Schmalenstroeer, Andreas Brendel, Walter Kellermann, Reinhold Haeb-Umbach:
Deep Neural Network based Distance Estimation for Geometry Calibration in Acoustic Sensor Networks. CoRR abs/2006.13769 (2020) - [i15]Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Shinji Watanabe, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation. CoRR abs/2011.15003 (2020) - [i14]Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks. CoRR abs/2012.06142 (2020) - [i13]Christopher Grimm, Tai Fei, Ernst Warsitz, Ridha Farhoud, Tobias Breddermann, Reinhold Haeb-Umbach:
Warping of Radar Data into Camera Image for Cross-Modal Supervision in Automotive Applications. CoRR abs/2012.12809 (2020)
2010 – 2019
- 2019
- [j34]Shinji Watanabe, Shoko Araki, Michiel Bacchiani, Reinhold Haeb-Umbach, Michael L. Seltzer:
Introduction to the Issue on Far-Field Speech Processing in the Era of Deep Learning: Speech Enhancement, Separation, and Recognition. IEEE J. Sel. Top. Signal Process. 13(4): 785-786 (2019) - [j33]Lukas Drude, Reinhold Haeb-Umbach:
Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation. IEEE J. Sel. Top. Signal Process. 13(4): 815-826 (2019) - [j32]Reinhold Haeb-Umbach, Shinji Watanabe, Tomohiro Nakatani, Michiel Bacchiani, Björn Hoffmeister, Michael L. Seltzer, Heiga Zen, Mehrez Souden:
Speech Processing for Digital Home Assistants: Combining signal processing with deep-learning techniques. IEEE Signal Process. Mag. 36(6): 111-124 (2019) - [c142]Catalin Zorila, Christoph Böddeker, Rama Doddipatla, Reinhold Haeb-Umbach:
An Investigation into the Effectiveness of Enhancement in ASR Training and Test for Chime-5 Dinner Party Transcription. ASRU 2019: 47-53 - [c141]Janek Ebbers, Lukas Drude, Reinhold Haeb-Umbach, Andreas Brendel, Walter Kellermann:
Weakly Supervised Sound Activity Detection and Event Classification in Acoustic Sensor Networks. CAMSAP 2019: 301-305 - [c140]Janek Ebbers, Reinhold Häb-Umbach:
Convolutional Recurrent Neural Network and Data Augmentation for Audio Tagging with Noisy Labels and Minimal Supervision. DCASE 2019: 64-68 - [c139]Thilo von Neumann, Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani, Reinhold Haeb-Umbach:
All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. ICASSP 2019: 91-95 - [c138]Lukas Drude, Daniel Hasenklever, Reinhold Haeb-Umbach:
Unsupervised Training of a Deep Clustering Model for Multichannel Blind Source Separation. ICASSP 2019: 695-699 - [c137]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, Tomohiro Nakatani:
Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. ICASSP 2019: 6655-6659 - [c136]Juan M. Martín-Doñas, Jens Heitkaemper, Reinhold Haeb-Umbach, Angel M. Gomez, Antonio M. Peinado:
Multi-Channel Block-Online Source Extraction Based on Utterance Adaptation. INTERSPEECH 2019: 96-100 - [c135]Naoyuki Kanda, Christoph Böddeker, Jens Heitkaemper, Yusuke Fujita, Shota Horiguchi, Kenji Nagamatsu, Reinhold Haeb-Umbach:
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. INTERSPEECH 2019: 1248-1252 - [c134]Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Unsupervised Training of Neural Mask-Based Beamforming. INTERSPEECH 2019: 1253-1257 - [c133]Alexandru Nelus, Janek Ebbers, Reinhold Haeb-Umbach, Rainer Martin:
Privacy-Preserving Variational Information Feature Extraction for Domestic Activity Monitoring versus Speaker Identification. INTERSPEECH 2019: 3710-3714 - [c132]Jens Heitkaemper, Thomas Fehér, Michael Freitag, Reinhold Haeb-Umbach:
A Study on Online Source Extraction in the Presence of Changing Speaker Positions. SLSP 2019: 198-209 - [c131]Tobias Gburrek, Thomas Glarner, Janek Ebbers, Reinhold Haeb-Umbach, Petra Wagner:
Unsupervised Learning of a Disentangled Speech Representation for Voice Conversion. SSW 2019: 81-86 - [i12]Thilo von Neumann, Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani, Reinhold Haeb-Umbach:
All-neural online source separation, counting, and diarization for meeting analysis. CoRR abs/1902.07881 (2019) - [i11]Lukas Drude, Daniel Hasenklever, Reinhold Haeb-Umbach:
Unsupervised training of a deep clustering model for multichannel blind source separation. CoRR abs/1904.01340 (2019) - [i10]Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Unsupervised training of neural mask-based beamforming. CoRR abs/1904.01578 (2019) - [i9]Naoyuki Kanda, Christoph Böddeker, Jens Heitkaemper, Yusuke Fujita, Shota Horiguchi, Kenji Nagamatsu, Reinhold Haeb-Umbach:
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. CoRR abs/1905.12230 (2019) - [i8]Catalin Zorila, Christoph Böddeker, Rama Doddipatla, Reinhold Haeb-Umbach:
An Investigation into the Effectiveness of Enhancement in ASR Training and Test for CHiME-5 Dinner Party Transcription. CoRR abs/1909.12208 (2019) - [i7]Christoph Böddeker, Tomohiro Nakatani, Keisuke Kinoshita, Reinhold Haeb-Umbach:
Jointly optimal dereverberation and beamforming. CoRR abs/1910.13707 (2019) - [i6]Lukas Drude, Jens Heitkaemper, Christoph Böddeker, Reinhold Haeb-Umbach:
SMS-WSJ: Database, performance measures, and baseline recipe for multi-channel source separation and recognition. CoRR abs/1910.13934 (2019) - [i5]Jens Heitkaemper, Darius Jakobeit, Christoph Böddeker, Lukas Drude, Reinhold Haeb-Umbach:
Demystifying TasNet: A Dissecting Approach. CoRR abs/1911.08895 (2019) - [i4]Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Böddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
End-to-end training of time domain audio separation and recognition. CoRR abs/1912.08462 (2019) - 2018
- [j31]Vladimir Despotovic, Oliver Walter, Reinhold Haeb-Umbach:
Machine learning techniques for semantic analysis of dysarthric speech: An experimental study. Speech Commun. 99: 242-251 (2018) - [c130]Haitham Afifi, Joerg Schmalenstroeer, Joerg Ullmann, Reinhold Haeb-Umbach, Holger Karl:
MARVELO - A Framework for Signal Processing in Wireless Acoustic Sensor Networks. ITG Symposium on Speech Communication 2018: 1-5 - [c129]Lukas Drude, Jahn Heymann, Christoph Böddeker, Reinhold Haeb-Umbach:
NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. ITG Symposium on Speech Communication 2018: 1-5 - [c128]Janek Ebbers, Jens Heitkaemper, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Benchmarking Neural Network Architectures for Acoustic Sensor Networks. ITG Symposium on Speech Communication 2018: 1-5 - [c127]Jens Heitkaemper, Jahn Heymann, Reinhold Haeb-Umbach:
Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. ITG Symposium on Speech Communication 2018: 1-5 - [c126]Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Insights into the Interplay of Sampling Rate Offsets and MVDR Beamforming. ITG Symposium on Speech Communication 2018: 1-5 - [c125]Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Efficient Sampling Rate Offset Compensation - an Overlap-Save Based Approach. EUSIPCO 2018: 499-503 - [c124]Lukas Drude, Thilo von Neumann, Reinhold Haeb-Umbach:
Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation. ICASSP 2018: 11-15 - [c123]Lukas Drude, Takuya Higuchi, Keisuke Kinoshita, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation. ICASSP 2018: 691-695 - [c122]Christoph Böddeker, Hakan Erdogan, Takuya Yoshioka, Reinhold Haeb-Umbach:
Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition. ICASSP 2018: 6697-6701 - [c121]Thomas Glarner, Patrick Hanebrink, Janek Ebbers, Reinhold Haeb-Umbach:
Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2018: 2688-2692 - [c120]Lukas Drude, Christoph Böddeker, Jahn Heymann, Reinhold Haeb-Umbach, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani:
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation. INTERSPEECH 2018: 3043-3047 - [c119]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, Tomohiro Nakatani:
Frame-Online DNN-WPE Dereverberation. IWAENC 2018: 466-470 - 2017
- [j30]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
A generic neural acoustic beamforming architecture for robust multi-channel speech processing. Comput. Speech Lang. 46: 374-385 (2017) - [c118]Christoph Böddeker, Patrick Hanebrink, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Optimizing neural-network supported acoustic beamforming by algorithmic differentiation. ICASSP 2017: 171-175 - [c117]Aleksej Chinaev, Reinhold Haeb-Umbach:
A generalized log-spectral amplitude estimator for single-channel speech enhancement. ICASSP 2017: 4980-4984 - [c116]Jahn Heymann, Lukas Drude, Christoph Böddeker, Patrick Hanebrink, Reinhold Haeb-Umbach:
Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system. ICASSP 2017: 5325-5329 - [c115]Janek Ebbers, Jahn Heymann, Lukas Drude, Thomas Glarner, Reinhold Haeb-Umbach, Bhiksha Raj:
Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2017: 488-492 - [c114]Thomas Glarner, Benedikt T. Boenninghoff, Oliver Walter, Reinhold Haeb-Umbach:
Leveraging Text Data for Word Segmentation for Underresourced Languages. INTERSPEECH 2017: 2143-2147 - [c113]Lukas Drude, Reinhold Haeb-Umbach:
Tight Integration of Spatial and Spectral Features for BSS with Deep Clustering Embeddings. INTERSPEECH 2017: 2650-2654 - [c112]Prerna Arora, Reinhold Haeb-Umbach:
A study on transfer learning for acoustic event detection in a real life scenario. MMSP 2017: 1-6 - [c111]Joerg Schmalenstroeer, Jahn Heymann, Lukas Drude, Christoph Böddeker, Reinhold Haeb-Umbach:
Multi-stage coherence drift based sampling rate synchronization for acoustic beamforming. MMSP 2017: 1-6 - [p6]Keisuke Kinoshita, Marc Delcroix, Sharon Gannot, Emanuël A. P. Habets, Reinhold Haeb-Umbach, Walter Kellermann, Volker Leutnant, Roland Maas, Tomohiro Nakatani, Bhiksha Raj, Armin Sehr, Takuya Yoshioka:
The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 345-354 - [i3]Christoph Böddeker, Patrick Hanebrink, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming. CoRR abs/1701.00392 (2017) - [i2]Gerhard Kurz, Igor Gilitschenski, Florian Pfaff, Lukas Drude, Uwe D. Hanebeck, Reinhold Haeb-Umbach, Roland Yves Siegwart:
Directional Statistics and Filtering Using libDirectional. CoRR abs/1712.09718 (2017) - 2016
- [j29]Keisuke Kinoshita, Marc Delcroix, Sharon Gannot, Emanuël A. P. Habets, Reinhold Haeb-Umbach, Walter Kellermann, Volker Leutnant, Roland Maas, Tomohiro Nakatani, Bhiksha Raj, Armin Sehr, Takuya Yoshioka:
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research. EURASIP J. Adv. Signal Process. 2016: 7 (2016) - [j28]Axel Plinge, Florian Jacob, Reinhold Haeb-Umbach, Gernot A. Fink:
Acoustic Microphone Geometry Calibration: An overview and experimental evaluation of state-of-the-art algorithms. IEEE Signal Process. Mag. 33(4): 14-29 (2016) - [c110]Aleksej Chinaev, Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs. ITG Symposium on Speech Communication 2016: 1-5 - [c109]Aleksej Chinaev, Jens Heitkaemper, Reinhold Haeb-Umbach:
A Priori SNR Estimation Using Weibull Mixture Model. ITG Symposium on Speech Communication 2016: 1-5 - [c108]Thomas Glarner, Mohammad Mahdi Momenzadeh, Lukas Drude, Reinhold Haeb-Umbach:
Factor Graph Decoding for Speech Presence Probability Estimation. ITG Symposium on Speech Communication 2016: 1-5 - [c107]Florian Jacob, Reinhold Haeb-Umbach:
On the Bias of Direction of Arrival Estimation Using Linear Microphone Arrays. ITG Symposium on Speech Communication 2016: 1-5 - [c106]Markus Kitza, Albert Zeyer, Ralf Schlüter, Jahn Heymann, Reinhold Haeb-Umbach:
Robust Online Multi-Channel Speech Recognition. ITG Symposium on Speech Communication 2016: 1-5 - [c105]Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Investigations into Bluetooth low energy localization precision limits. EUSIPCO 2016: 652-656 - [c104]Lukas Drude, Christoph Böddeker, Reinhold Haeb-Umbach:
Blind speech separation based on complex spherical k-mode clustering. ICASSP 2016: 141-145 - [c103]Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
Neural network based spectral mask estimation for acoustic beamforming. ICASSP 2016: 196-200 - [c102]Lukas Drude, Bhiksha Raj, Reinhold Haeb-Umbach:
On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement. INTERSPEECH 2016: 1745-1749 - [c101]Aleksej Chinaev, Reinhold Haeb-Umbach:
A priori SNR Estimation Using a Generalized Decision Directed Approach. INTERSPEECH 2016: 3758-3762 - 2015
- [j27]Oliver Walter, Reinhold Haeb-Umbach, Bassam Mokbel, Benjamin Paaßen, Barbara Hammer:
Autonomous Learning of Representations. Künstliche Intell. 29(4): 339-351 (2015) - [j26]Joerg Schmalenstroeer, Patrick Jebramcik, Reinhold Haeb-Umbach:
A combined hardware-software approach for acoustic sensor network synchronization. Signal Process. 107: 171-184 (2015) - [c100]Jahn Heymann, Lukas Drude, Aleksej Chinaev, Reinhold Haeb-Umbach:
BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge. ASRU 2015: 444-451 - [c99]Lukas Drude, Florian Jacob, Reinhold Haeb-Umbach:
DOA-estimation based on a complex Watson kernel method. EUSIPCO 2015: 255-259 - [c98]Oliver Walter, Lukas Drude, Reinhold Haeb-Umbach:
Source counting in speech mixtures by nonparametric Bayesian estimation of an infinite Gaussian mixture model. ICASSP 2015: 459-463 - [c97]Manh Kha Hoang, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Aligning training modelswith smartphone properties in WiFi fingerprinting based indoor localization. ICASSP 2015: 1981-1985 - [c96]Jahn Heymann, Reinhold Haeb-Umbach, Pavel Golik, Ralf Schlüter:
Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions. ICASSP 2015: 5053-5057 - [c95]Erik Marchi, Björn W. Schuller, Simon Baron-Cohen, Ofer Golan, Sven Bölte, Prerna Arora, Reinhold Häb-Umbach:
Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages. INTERSPEECH 2015: 115-119 - [c94]Aleksej Chinaev, Reinhold Haeb-Umbach:
On optimal smoothing in minimum statistics based noise tracking. INTERSPEECH 2015: 1785-1789 - [c93]Vladimir Despotovic, Oliver Walter, Reinhold Haeb-Umbach:
Semantic analysis of spoken input using Markov logic networks. INTERSPEECH 2015: 1859-1863 - [i1]Florian Jacob, Reinhold Haeb-Umbach:
Absolute Geometry Calibration of Distributed Microphone Arrays in an Audio-Visual Sensor Network. CoRR abs/1504.03128 (2015) - 2014
- [j25]Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach:
A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 95-109 (2014) - [j24]Jinyu Li, Li Deng, Yifan Gong, Reinhold Haeb-Umbach:
An Overview of Noise-Robust Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(4): 745-777 (2014) - [c92]Aleksej Chinaev, Marc Püls, Reinhold Haeb-Umbach:
Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR. ITG Symposium on Speech Communication 2014: 1-4 - [c91]Florian Jacob, Reinhold Haeb-Umbach:
Coordinate Mapping Between an Acoustic and Visual Sensor Network in the Shape Domain for a Joint Self-Calibrating Speaker Tracking. ITG Symposium on Speech Communication 2014: 1-4 - [c90]Joerg Schmalenstroeer, Weile Zhao, Reinhold Haeb-Umbach:
Online Observation ErrorModel Estimation for Acoustic Sensor Network Synchronization. ITG Symposium on Speech Communication 2014: 1-4 - [c89]Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj:
Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices. ICASSP 2014: 4057-4061 - [c88]Lukas Drude, Aleksej Chinaev, Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Source counting in speech mixtures using a variational EM approach for complex WATSON mixture models. ICASSP 2014: 6834-6838 - [c87]Joerg Schmalenstroeer, Patrick Jebramcik, Reinhold Haeb-Umbach:
A gossiping approach to sampling clock synchronization in wireless acoustic sensor networks. ICASSP 2014: 7575-7579 - [c86]Oliver Walter, Vladimir Despotovic, Reinhold Haeb-Umbach, Jort F. Gemmeke, Bart Ons, Hugo Van hamme:
An evaluation of unsupervised acoustic model training for a dysarthric speech interface. INTERSPEECH 2014: 1013-1017 - [c85]Lukas Drude, Aleksej Chinaev, Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Towards online source counting in speech mixtures applying a variational EM for complex Watson mixture models. IWAENC 2014: 213-217 - 2013
- [j23]Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach:
Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition. IEEE Trans. Speech Audio Process. 21(8): 1640-1652 (2013) - [c84]Oliver Walter, Timo Korthals, Reinhold Haeb-Umbach, Bhiksha Raj:
A hierarchical system for word discovery exploiting DTW-based initialization. ASRU 2013: 386-391 - [c83]Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj:
Unsupervised word segmentation from noisy input. ASRU 2013: 458-463 - [c82]Gerald Enzner, Dominic Schmid, Reinhold Haeb-Umbach:
On acoustic channel identification in multi-microphone systems via adaptive blind signal enhancement techniques. EUSIPCO 2013: 1-5 - [c81]Manh Kha Hoang, Joerg Schmalenstroeer, Christian Drueke, Dang Hai Tran Vu, Reinhold Haeb-Umbach:
A hidden Markov model for indoor user tracking based on WiFi fingerprinting and step detection. EUSIPCO 2013: 1-5 - [c80]Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Sampling rate synchronisation in acoustic sensor networks with a pre-trained clock skew error model. EUSIPCO 2013: 1-5 - [c79]Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Blind speech separation exploiting temporal and spectral correlations using 2D-HMMs. EUSIPCO 2013: 1-5 - [c78]Florian Jacob, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
DOA-based microphone array postion self-calibration using circular statistics. ICASSP 2013: 116-120 - [c77]Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Using the turbo principle for exploiting temporal and spectral correlations in speech presence probability estimation. ICASSP 2013: 863-867 - [c76]Aleksej Chinaev, Reinhold Haeb-Umbach:
Map-based estimation of the parameters of a Gaussian Mixture Model in the presence of noisy observations. ICASSP 2013: 3352-3356 - [c75]Manh Kha Hoang, Reinhold Haeb-Umbach:
Parameter estimation and classification of censored Gaussian data with application to WiFi indoor positioning. ICASSP 2013: 3721-3725 - [c74]Ahmed Hussen Abdelaziz, Steffen Zeiler, Dorothea Kolossa, Volker Leutnant, Reinhold Haeb-Umbach:
GMM-based significance decoding. ICASSP 2013: 6827-6831 - [c73]Aleksej Chinaev, Reinhold Haeb-Umbach, Jalal Taghia, Rainer Martin:
Improved single-channel nonstationary noise tracking by an optimized MAP-based postprocessor. ICASSP 2013: 7477-7481 - [c72]Manh Kha Hoang, Sarah Schmitz, Christian Drueke, Dang Hai Tran Vu, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Server based indoor navigation using RSSI and inertial sensor information. WPNC 2013: 1-6 - [c71]Oliver Walter, Joerg Schmalenstroeer, Andreas Engler, Reinhold Haeb-Umbach:
Smartphone-based sensor fusion for improved vehicular navigation. WPNC 2013: 1-6 - 2012
- [c70]Aleksej Chinaev, Reinhold Haeb-Umbach:
Quality Analysis and Optimization of the MAP-based Noise Power Spectral Density Tracker. ITG Conference on Speech Communication 2012: 1-4 - [c69]Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach:
Investigations Into a Statistical Observation Model for Logarithmic Mel Power Spectral Density Features of Noisy Reverberant Speech. ITG Conference on Speech Communication 2012: 1-4 - [c68]Aleksej Chinaev, Alexander Krueger, Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Improved noise power spectral density tracking by a MAP-based postprocessor. ICASSP 2012: 4041-4044 - [c67]Alexander Krueger, Oliver Walter, Volker Leutnant, Reinhold Haeb-Umbach:
Bayesian Feature Enhancement for ASR of Noisy Reverberant Real-World Data. INTERSPEECH 2012: 807-810 - [c66]Florian Jacob, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Microphone Array Position Self-Calibration from Reverberant Speech Input. IWAENC 2012 - [c65]Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Exploiting Temporal Correlations in Joint Multichannel Speech Separation and Noise Suppression Using Hidden Markov Models. IWAENC 2012 - [p5]Reinhold Haeb-Umbach, Alexander Krueger:
Reverberant Speech Recognition. Techniques for Noise Robustness in Automatic Speech Recognition 2012: 251-281 - 2011
- [j22]Tobias Herbig, Franz Gerl, Wolfgang Minker, Reinhold Haeb-Umbach:
Adaptive systems for unsupervised speaker tracking and speech recognition. Evol. Syst. 2(3): 199-214 (2011) - [j21]Alexander Krueger, Ernst Warsitz, Reinhold Haeb-Umbach:
Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation. IEEE Trans. Speech Audio Process. 19(1): 206-219 (2011) - [c64]Alexander Krueger, Reinhold Haeb-Umbach:
MAP-based estimation of the parameters of non-stationary Gaussian processes from noisy observations. ICASSP 2011: 3596-3599 - [c63]Joerg Schmalenstroeer, Markus Bartek, Reinhold Haeb-Umbach:
Unsupervised Learning of Acoustic Events Using Dynamic Time Warping and Hierarchical K-Means++ Clustering. INTERSPEECH 2011: 305-308 - [c62]Joerg Schmalenstroeer, Florian Jacob, Reinhold Haeb-Umbach, Marius H. Hennecke, Gernot A. Fink:
Unsupervised Geometry Calibration of Acoustic Sensor Networks Using Source Correspondences. INTERSPEECH 2011: 597-600 - [c61]Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach:
A Versatile Gaussian Splitting Approach to Non-Linear State Estimation and its Application to Noise-Robust ASR. INTERSPEECH 2011: 1641-1644 - [c60]Dang Hai Tran Vu, Reinhold Haeb-Umbach:
On Initial Seed Selection for Frequency Domain Blind Speech Separation. INTERSPEECH 2011: 1757-1760 - [p4]Reinhold Haeb-Umbach, Dorothea Kolossa:
Introduction. Robust Speech Recognition of Uncertain or Missing Data 2011: 1-5 - [p3]Reinhold Haeb-Umbach:
Uncertainty Decoding and Conditional Bayesian Estimation. Robust Speech Recognition of Uncertain or Missing Data 2011: 9-33 - [p2]Volker Leutnant, Reinhold Haeb-Umbach:
Conditional Bayesian Estimation Employing a Phase-Sensitive Observation Model for Noise Robust Speech Recognition. Robust Speech Recognition of Uncertain or Missing Data 2011: 187-221 - [p1]Alexander Krueger, Reinhold Haeb-Umbach:
A Model-Based Approach to Joint Compensation of Noise and Reverberation for Speech Recognition. Robust Speech Recognition of Uncertain or Missing Data 2011: 257-290 - [e1]Dorothea Kolossa, Reinhold Haeb-Umbach:
Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications. Springer 2011, ISBN 978-3-642-21316-8 [contents] - 2010
- [j20]Zheng-Hua Tan, Reinhold Haeb-Umbach, Sadaoki Furui, James R. Glass, Maurizio Omologo:
Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments. IEEE J. Sel. Top. Signal Process. 4(5): 769-771 (2010) - [j19]Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Online Diarization of Streaming Audio-Visual Data for Smart Environments. IEEE J. Sel. Top. Signal Process. 4(5): 845-856 (2010) - [j18]Alexander Krueger, Reinhold Haeb-Umbach:
Model-Based Feature Enhancement for Reverberant Speech Recognition. IEEE Trans. Speech Audio Process. 18(7): 1692-1707 (2010) - [c59]Alexander Krueger, Volker Leutnant, Reinhold Haeb-Umbach, Marcel R. Ackermann, Johannes Blömer:
On the Initialization of Dynamic Models for Speech Features. Sprachkommunikation 2010: 1-4 - [c58]Dang Hai Tran Vu, Reinhold Haeb-Umbach:
Blind speech separation employing directional statistics in an Expectation Maximization framework. ICASSP 2010: 241-244 - [c57]Bhiksha Raj, Kevin W. Wilson, Alexander Krueger, Reinhold Haeb-Umbach:
Ungrounded independent non-negative factor analysis. INTERSPEECH 2010: 330-333 - [c56]Volker Leutnant, Reinhold Haeb-Umbach:
On the exploitation of hidden Markov models and linear dynamic models in a hybrid decoder architecture for continuous speech recognition. INTERSPEECH 2010: 2946-2949 - [c55]Maik Bevermeier, Oliver Walter, Sven Peschke, Reinhold Haeb-Umbach:
Barometric height estimation combined with map-matching in a loosely-coupled Kalman-filter. WPNC 2010: 128-134
2000 – 2009
- 2009
- [j17]Stefan Windmann, Reinhold Haeb-Umbach:
Approaches to Iterative Speech Feature Enhancement and Recognition. IEEE Trans. Speech Audio Process. 17(5): 974-984 (2009) - [j16]Stefan Windmann, Reinhold Haeb-Umbach:
Parameter Estimation of a State-Space Model of Noise for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 17(8): 1577-1590 (2009) - [c54]Joerg Schmalenstroeer, Martin Kelling, Volker Leutnant, Reinhold Haeb-Umbach:
Fusing audio and video information for online speaker diarization. INTERSPEECH 2009: 1163-1166 - [c53]Alexander Krueger, Reinhold Haeb-Umbach:
Model based feature enhancement for automatic speech recognition in reverberant environments. INTERSPEECH 2009: 1231-1234 - [c52]Volker Leutnant, Reinhold Haeb-Umbach:
An analytic derivation of a phase-sensitive observation model for noise robust speech recognition. INTERSPEECH 2009: 2395-2398 - [c51]Maik Bevermeier, Sven Peschke, Reinhold Haeb-Umbach:
Joint Parameter Estimation and Tracking in a Multi-Stage Kalman Filter for Vehicle Positioning. VTC Spring 2009 - [c50]Sven Peschke, Maik Bevermeier, Reinhold Haeb-Umbach:
A GPS positioning approach exploiting GSM velocity estimates. WPNC 2009: 195-202 - [c49]Maik Bevermeier, Sven Peschke, Reinhold Haeb-Umbach:
Robust vehicle localization based on multi-level sensor fusion and online parameter estimation. WPNC 2009: 235-242 - 2008
- [j15]Valentin Ion, Reinhold Haeb-Umbach:
A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. IEEE Trans. Speech Audio Process. 16(5): 1047-1060 (2008) - [c48]Ernst Warsitz, Alexander Krueger, Reinhold Haeb-Umbach:
Speech enhancement with a new generalized eigenvector blocking matrix for application in a generalized sidelobe canceller. ICASSP 2008: 73-76 - [c47]Stefan Windmann, Reinhold Haeb-Umbach:
Modeling the dynamics of speech and noise for speech feature enhancement in ASR. ICASSP 2008: 4409-4412 - 2007
- [j14]Ernst Warsitz, Reinhold Haeb-Umbach:
Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition. IEEE Trans. Speech Audio Process. 15(5): 1529-1539 (2007) - [j13]Reinhold Haeb-Umbach, Sven Peschke:
A Novel Similarity Measure for Positioning Cellular Phones by a Comparison With a Database of Signal Power Levels. IEEE Trans. Veh. Technol. 56(1): 368-372 (2007) - [c46]Joerg Schmalenstroeer, Volker Leutnant, Reinhold Haeb-Umbach:
Amigo Context Management Service with Applications in Ambient Communication Scenarios. AmI Workshops 2007: 397-402 - [c45]Reinhold Haeb-Umbach, Maik Bevermeier:
OFDM Channel Estimation Based on Combined Estimation in Time and Frequency Domain. ICASSP (3) 2007: 277-280 - [c44]Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Joint speaker segmentation, localization and identification for streaming audio. INTERSPEECH 2007: 570-573 - [c43]Valentin Ion, Reinhold Haeb-Umbach:
Multi-resolution soft features for channel-robust distributed speech recognition. INTERSPEECH 2007: 594-597 - [c42]Ernst Warsitz, Reinhold Haeb-Umbach, Dang Hai Tran Vu:
Blind adaptive principal eigenvector beamforming for acoustical source separation. INTERSPEECH 2007: 842-845 - [c41]Stefan Windmann, Reinhold Haeb-Umbach:
An approach to iterative speech feature enhancement and recognition. INTERSPEECH 2007: 1086-1089 - [c40]Maik Bevermeier, Reinhold Haeb-Umbach:
Combined time and frequency domain OFDM channel estimation. MCSS 2007: 317-326 - [c39]Maik Bevermeier, Tobias Ebel, Reinhold Haeb-Umbach:
Channel estimation by exploiting sublayer information in OFDM systems. MCSS 2007: 387-396 - [c38]Sven Peschke, Reinhold Haeb-Umbach:
Velocity Estimation of Mobile Terminals by Exploiting GSM Downlink Signalling. WPNC 2007: 217-222 - 2006
- [j12]Valentin Ion, Reinhold Haeb-Umbach:
Uncertainty decoding for distributed speech recognition over error-prone networks. Speech Commun. 48(11): 1435-1446 (2006) - [c37]Valentin Ion, Reinhold Haeb-Umbach:
An Inexpensive Packet Loss Compensation Scheme for Distributed Speech Recognition Based on Soft-Features. ICASSP (1) 2006: 169-172 - [c36]Stefan Windmann, Reinhold Haeb-Umbach:
Iterative Speech Enhancement using a Non-Linear Dynamic State Model of Speech and its Parameters. ICASSP (1) 2006: 465-468 - [c35]Valentin Ion, Reinhold Haeb-Umbach:
Improved source modeling and predictive classification for channel robust speech recognition. INTERSPEECH 2006 - [c34]Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Online speaker change detection by combining BIC with microphone array beamforming. INTERSPEECH 2006 - 2005
- [c33]Valentin Ion, Reinhold Haeb-Umbach:
A Comparison of Soft-Feature Distributed Speech Recognition with Candidate Codecs for Speech Enabled Mobile Services. ICASSP (1) 2005: 333-336 - [c32]Ernst Warsitz, Reinhold Haeb-Umbach:
Acoustic filter-and-sum beamforming by adaptive principal component analysis. ICASSP (4) 2005: 797-800 - [c31]Reinhold Haeb-Umbach, Basilis Kladis, Joerg Schmalenstroeer:
Speech processing in the networked home environment - a view on the amigo project. INTERSPEECH 2005: 121-124 - [c30]Reinhold Haeb-Umbach, Joerg Schmalenstroeer:
A comparison of particle filtering variants for speech feature enhancement. INTERSPEECH 2005: 913-916 - [c29]Valentin Ion, Reinhold Haeb-Umbach:
Unified probabilistic approach to error concealment for distributed speech recognition. INTERSPEECH 2005: 2853-2856 - 2004
- [c28]Reinhold Haeb-Umbach, Valentin Ion:
Soft features for improved distributed speech recognition over wireless networks. INTERSPEECH 2004: 2125-2128 - [c27]Reinhold Haeb-Umbach, Sven Peschke, Ernst Warsitz:
Adaptive beamforming combined with particle filtering for acoustic source localization. INTERSPEECH 2004: 2849-2852 - [c26]Ernst Warsitz, Reinhold Haeb-Umbach:
Robust speaker direction estimation with particle filtering. MMSP 2004: 367-370 - 2002
- [j11]Peter Beyerlein, Xavier L. Aubert, Reinhold Haeb-Umbach, Matthew Harris, Dietrich Klakow, Andreas Wendemuth, Sirko Molau, Hermann Ney, Michael Pitz, Achim Sixtus:
Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach. Speech Commun. 37(1-2): 109-131 (2002) - [c25]Renke Bischoff, Reinhold Häb-Umbach, Wolfgang Schulz, Günter Heinrichs:
Employment of a multipath receiver structure in a combined GALILEO/UMTS receiver. VTC Spring 2002: 1844-1848 - 2001
- [j10]Marco Loog, Robert P. W. Duin, Reinhold Haeb-Umbach:
Multiclass Linear Dimension Reduction by Weighted Pairwise Fisher Criteria. IEEE Trans. Pattern Anal. Mach. Intell. 23(7): 762-766 (2001) - [j9]Reinhold Haeb-Umbach:
Automatic generation of phonetic regression class trees for MLLR adaptation. IEEE Trans. Speech Audio Process. 9(3): 299-302 (2001) - 2000
- [c24]Markus Lieb, Reinhold Haeb-Umbach:
LDA derived cepstral trajectory filters in adverse environmental conditions. ICASSP 2000: 1105-1108 - [c23]Robert P. W. Duin, Marco Loog, Reinhold Haeb-Umbach:
Multi-Class Linear Feature Extraction by Nonlinear PCA. ICPR 2000: 2398-2401 - [c22]Reinhold Haeb-Umbach:
Data-driven phonetic regression class tree estimation for MLLR adaptation. INTERSPEECH 2000: 857-860 - [c21]Marco Loog, Reinhold Haeb-Umbach:
Multi-class linear dimension reduction by generalized Fisher criteria. INTERSPEECH 2000: 1069-1072
1990 – 1999
- 1999
- [c20]Reinhold Haeb-Umbach:
Investigations on inter-speaker variability in the feature space. ICASSP 1999: 397-400 - [c19]Peter Beyerlein, Xavier L. Aubert, Reinhold Haeb-Umbach, Matthew Harris, Dietrich Klakow, Andreas Wendemuth, Sirko Molau, Michael Pitz, Achim Sixtus:
The philips/RWTH system for transcription of broadcast news. EUROSPEECH 1999 - [c18]Reinhold Haeb-Umbach, Marco Loog:
An investigation of cepstral parameterisations for large vocabulary speech recognition. EUROSPEECH 1999 - [c17]Matthew Harris, Xavier L. Aubert, Reinhold Haeb-Umbach, Peter Beyerlein:
A study of broadcast news audio stream segmentation and segment clustering. EUROSPEECH 1999: 1027-1030 - 1998
- [c16]Lutz Welling, Reinhold Haeb-Umbach, X. Zubert, N. Haberland:
A study on speaker normalization using vocal tract normalization and speaker adaptive training. ICASSP 1998: 797-800 - 1997
- [j8]Stephan Gamm, Reinhold Haeb-Umbach, Detlev Langmann:
The development of a command-based speech interface for a telephone answering machine. Speech Commun. 23(1-2): 161-171 (1997) - [c15]Harald Höge, Herbert S. Tropf, Richard Winski, Henk van den Heuvel, Reinhold Haeb-Umbach, Khalid Choukri:
European speech databases for telephone applications. ICASSP 1997: 1771-1774 - [c14]Hans J. G. A. Dolfing, Reinhold Haeb-Umbach:
Signal representations for hidden Markov model based online handwriting recognition. ICASSP 1997: 3385-3388 - [c13]Reinhold Haeb-Umbach:
Robust speech recognition for wireless networks and mobile telephony. EUROSPEECH 1997: 2427-2430 - [c12]Detlev Langmann, Alexander Fischer, Friedhelm Wuppermann, Reinhold Haeb-Umbach, Thomas Eisele:
Acoustic front ends for speaker-independent digit recognition in car environments. EUROSPEECH 1997: 2571-2574 - 1996
- [c11]Thomas Eisele, Reinhold Haeb-Umbach, Detlev Langmann:
A comparative study of linear feature transformation techniques for automatic speech recognition. ICSLP 1996: 252-255 - [c10]Detlev Langmann, Reinhold Haeb-Umbach, Lou Boves, Els den Os:
FRESCO: the French telephone speech data collection - part of the european Speechdat(m) project. ICSLP 1996: 1918-1921 - 1995
- [j7]Volker Steinbiss, Hermann Ney, Ute Essen, Bach-Hiep Tran, Xavier L. Aubert, Christian Dugast, Reinhard Kneser, Hans-Günter Meier, Martin Oerder, Reinhold Haeb-Umbach, Dieter Geller, W. Höllerbauer, H. Bartosik:
Continuous speech dictation - From theory to practice. Speech Commun. 17(1-2): 19-38 (1995) - [c9]Christian Dugast, Peter Beyerlein, Reinhold Haeb-Umbach:
Application of clustering techniques to mixture density modelling for continuous-speech recognition. ICASSP 1995: 524-527 - [c8]Reinhold Haeb-Umbach, Peter Beyerlein, Eric Thelen:
Automatic transcription of unknown words in a speech recognition system. ICASSP 1995: 840-843 - [c7]Reinhold Haeb-Umbach, Stephan Gamm:
Human factors of a voice-controlled car stereo. EUROSPEECH 1995: 1453-1456 - 1994
- [j6]Hermann Ney, Volker Steinbiss, Reinhold Haeb-Umbach, Bach-Hiep Tran, Ute Essen:
An Overview of the Philips Research System for Large Vocabulary Continuous Speech Recognition. Int. J. Pattern Recognit. Artif. Intell. 8(1): 33-70 (1994) - [j5]Reinhold Haeb-Umbach, Hermann Ney:
Improvements in beam search for 10000-word continuous-speech recognition. IEEE Trans. Speech Audio Process. 2(2): 353-356 (1994) - 1993
- [j4]Stefan Dobler, Dieter Geller, Reinhold Haeb-Umbach, Peter Meyer, Hermann Ney, Hans-Wilhelm Rühl:
Design and use of speech recognition algorithms for a mobile radio telephone. Speech Commun. 12(3): 221-229 (1993) - [c6]Reinhold Haeb-Umbach, Dieter Geller, Hermann Ney:
Improvements in connected digit recognition using linear discriminant analysis and mixture densities. ICASSP (2) 1993: 239-242 - [c5]Xavier L. Aubert, Reinhold Haeb-Umbach, Hermann Ney:
Continuous mixture densities and linear discriminant analysis for improved context-dependent acoustic models. ICASSP (2) 1993: 648-651 - [c4]Volker Steinbiss, Hermann Ney, Reinhold Haeb-Umbach, B.-H. Iran, Ute Essen, Reinhard Kneser, Martin Oerder, Hans-Günter Meier, Xavier L. Aubert, Christian Dugast, Dieter Geller, W. Höllerbauer, H. Bartosik:
The Philips research system for large-vocabulary continuous-speech recognition. EUROSPEECH 1993: 2125-2128 - 1992
- [j3]Reinhold Haeb, Robert T. Lynch Jr.:
Trellis Codes for Partial-Response Magnetooptical Direct Overwrite Recording. IEEE J. Sel. Areas Commun. 10(1): 182-190 (1992) - [j2]Reinhold Haeb:
A modified trellis coding technique for partial response channels. IEEE Trans. Commun. 40(3): 513-520 (1992) - [c3]Hermann Ney, Reinhold Häb-Umbach, Bach-Hiep Tran, Martin Oerder:
Improvements in beam search for 10000-word continuous speech recognition. ICASSP 1992: 9-12 - [c2]Reinhold Häb-Umbach, Hermann Ney:
Linear discriminant analysis for improved large vocabulary continuous speech recognition. ICASSP 1992: 13-16 - 1991
- [c1]Reinhold Haeb-Umbach, Hermann Ney:
A look-ahead search technique for large vocabulary continuous speech recognition. EUROSPEECH 1991: 495-498
1980 – 1989
- 1989
- [j1]Reinhold Haeb, Heinrich Meyr:
A systematic approach to carrier recovery and detection of digitally phase modulated signals of fading channels. IEEE Trans. Commun. 37(7): 748-754 (1989)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-20 22:51 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint