Jahn Heymann
2020 – today
- 2023
- [c27] Yifeng Fan, Colin Vaz, Di He, Jahn Heymann, Viet Anh Trinh, Zhe Zhang, Venkatesh Ravichandran:
Towards Accurate and Real-Time End-of-Speech Estimation. ICASSP 2023: 1-5
- [c26] Martin Radfar, Paulina Lyskawa, Brandon Trujillo, Yi Xie, Kai Zhen, Jahn Heymann, Denis Filimonov, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris:
Conmer: Streaming Conformer Without Self-attention for Interactive Voice Assistants. INTERSPEECH 2023: 2198-2202
- [c25] Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler:
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition. INTERSPEECH 2023: 4973-4977
- [i6] Belen Alastruey, Lukas Drude, Jahn Heymann, Simon Wiesler:
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition. CoRR abs/2306.06954 (2023)
- 2022
- [c24] Jahn Heymann, Egor Lakomkin, Leif Rädel:
Being Greedy Does Not Hurt: Sampling Strategies for End-To-End Speech Recognition. ICASSP 2022: 7787-7791
- 2021
- [j2] Reinhold Haeb-Umbach, Jahn Heymann, Lukas Drude, Shinji Watanabe, Marc Delcroix, Tomohiro Nakatani:
Far-Field Automatic Speech Recognition. Proc. IEEE 109(2): 124-148 (2021)
- [c23] Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin:
Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget. Interspeech 2021: 1669-1673
- [i5] Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin:
Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget. CoRR abs/2106.07994 (2021)
- 2020
- [b1] Jahn Heymann:
Robust multi-channel speech recognition with neural network supported statistical beamforming. University of Paderborn, Germany, 2020
- [c22] Egor Lakomkin, Jahn Heymann, Ilya Sklyar, Simon Wiesler:
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition. INTERSPEECH 2020: 3600-3604
- [c21] Prabhat Pandey, Volker Leutnant, Simon Wiesler, Jahn Heymann, Daniel Willett:
Improving Speech Recognition of Compound-Rich Languages. INTERSPEECH 2020: 4936-4940
- [i4] Egor Lakomkin, Jahn Heymann, Ilya Sklyar, Simon Wiesler:
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition. CoRR abs/2008.04034 (2020)
2010 – 2019
- 2019
- [c20] Jahn Heymann, Khe Chai Sim, Bo Li:
Improving CTC Using Stimulated Learning for Sequence Modeling. ICASSP 2019: 5701-5705
- [c19] Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, Tomohiro Nakatani:
Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR. ICASSP 2019: 6655-6659
- [c18] Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Unsupervised Training of Neural Mask-Based Beamforming. INTERSPEECH 2019: 1253-1257
- [i3] Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Unsupervised training of neural mask-based beamforming. CoRR abs/1904.01578 (2019)
- 2018
- [c17] Lukas Drude, Jahn Heymann, Christoph Böddeker, Reinhold Haeb-Umbach:
NARA-WPE: A Python package for weighted prediction error dereverberation in Numpy and Tensorflow for online and offline processing. ITG Symposium on Speech Communication 2018: 1-5
- [c16] Jens Heitkaemper, Jahn Heymann, Reinhold Haeb-Umbach:
Smoothing along Frequency in Online Neural Network Supported Acoustic Beamforming. ITG Symposium on Speech Communication 2018: 1-5
- [c15] Jahn Heymann, Michiel Bacchiani, Tara N. Sainath:
Performance of Mask Based Statistical Beamforming in a Smart Home Scenario. ICASSP 2018: 6722-6726
- [c14] Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. INTERSPEECH 2018: 2207-2211
- [c13] Lukas Drude, Christoph Böddeker, Jahn Heymann, Reinhold Haeb-Umbach, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani:
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation. INTERSPEECH 2018: 3043-3047
- [c12] Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach, Keisuke Kinoshita, Tomohiro Nakatani:
Frame-Online DNN-WPE Dereverberation. IWAENC 2018: 466-470
- [i2] Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. CoRR abs/1804.00015 (2018)
- 2017
- [j1] Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
A generic neural acoustic beamforming architecture for robust multi-channel speech processing. Comput. Speech Lang. 46: 374-385 (2017)
- [c11] Christoph Böddeker, Patrick Hanebrink, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
Optimizing neural-network supported acoustic beamforming by algorithmic differentiation. ICASSP 2017: 171-175
- [c10] Jahn Heymann, Lukas Drude, Christoph Böddeker, Patrick Hanebrink, Reinhold Haeb-Umbach:
Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system. ICASSP 2017: 5325-5329
- [c9] Janek Ebbers, Jahn Heymann, Lukas Drude, Thomas Glarner, Reinhold Haeb-Umbach, Bhiksha Raj:
Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. INTERSPEECH 2017: 488-492
- [c8] Joerg Schmalenstroeer, Jahn Heymann, Lukas Drude, Christoph Böddeker, Reinhold Haeb-Umbach:
Multi-stage coherence drift based sampling rate synchronization for acoustic beamforming. MMSP 2017: 1-6
- [i1] Christoph Böddeker, Patrick Hanebrink, Lukas Drude, Jahn Heymann, Reinhold Haeb-Umbach:
On the Computation of Complex-valued Gradients with Application to Statistically Optimum Beamforming. CoRR abs/1701.00392 (2017)
- 2016
- [c7] Aleksej Chinaev, Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs. ITG Symposium on Speech Communication 2016: 1-5
- [c6] Markus Kitza, Albert Zeyer, Ralf Schlüter, Jahn Heymann, Reinhold Haeb-Umbach:
Robust Online Multi-Channel Speech Recognition. ITG Symposium on Speech Communication 2016: 1-5
- [c5] Jahn Heymann, Lukas Drude, Reinhold Haeb-Umbach:
Neural network based spectral mask estimation for acoustic beamforming. ICASSP 2016: 196-200
- 2015
- [c4] Jahn Heymann, Lukas Drude, Aleksej Chinaev, Reinhold Haeb-Umbach:
BLSTM supported GEV beamformer front-end for the 3RD CHiME challenge. ASRU 2015: 444-451
- [c3] Jahn Heymann, Reinhold Haeb-Umbach, Pavel Golik, Ralf Schlüter:
Unsupervised adaptation of a denoising autoencoder by Bayesian Feature Enhancement for reverberant asr under mismatch conditions. ICASSP 2015: 5053-5057
- 2014
- [c2] Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj:
Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices. ICASSP 2014: 4057-4061
- 2013
- [c1] Jahn Heymann, Oliver Walter, Reinhold Haeb-Umbach, Bhiksha Raj:
Unsupervised word segmentation from noisy input. ASRU 2013: 458-463
last updated on 2024-10-07 22:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license