Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR

MFR Chowdhury, SA Selouani… - International Journal of …, 2012 - Springer
International Journal of Speech Technology, 2012, Springer
Abstract
Current automatic speech recognition (ASR) systems work in off-line mode and need prior knowledge of stationary or quasi-stationary test conditions to reach the expected word recognition accuracy. These requirements limit the use of ASR in real-world applications, where test conditions are highly non-stationary and not known a priori. This paper presents an innovative frame-dynamic rapid adaptation and noise compensation technique for tracking highly non-stationary noises, and applies it to on-line ASR. The proposed algorithm is based on a soft computing model that uses Bayesian on-line inference for spectral change point detection (BOSCPD) in unknown non-stationary noises. BOSCPD is tested with the MCRA noise tracking technique for on-line rapid learning of environmental changes in different non-stationary noise scenarios. The test results show that the proposed BOSCPD technique significantly reduces the delay in spectral change point detection compared to the baseline MCRA and its derivatives. The BOSCPD soft computing model is also tested for on-line ASR based on joint additive and channel distortion compensation (JAC) in unknown test conditions, using non-stationary noisy speech samples from the Aurora 2 speech database. The simulation results for on-line ASR show a significant improvement in recognition accuracy compared to the baseline Aurora 2 distributed speech recognition (DSR) in batch mode.
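As a rough illustration of the idea behind Bayesian on-line change point detection, the sketch below tracks a posterior over the "run length" (the number of frames since the last change) for a single scalar feature. It is not the paper's BOSCPD algorithm: the Gaussian observation model with known variance, the constant hazard rate, the toy signal, and all names (`map_run_lengths`, `hazard`, `obs_var`, `mu0`, `var0`) are assumptions made for this example only.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a normal distribution with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def map_run_lengths(xs, hazard=0.01, obs_var=0.25, mu0=0.0, var0=100.0):
    """Return the most probable run length after each observation.

    Assumed model (hypothetical, for illustration): each segment has an
    unknown mean with prior N(mu0, var0), observations have known variance
    obs_var, and a change occurs at each step with probability `hazard`.
    """
    run_probs = [1.0]        # posterior over run lengths 0, 1, 2, ...
    means = [mu0]            # posterior mean of the segment mean, per run length
    precs = [1.0 / var0]     # posterior precision of the segment mean
    result = []
    for x in xs:
        # Predictive density of x under each run-length hypothesis.
        pred = [gaussian_pdf(x, m, obs_var + 1.0 / p) for m, p in zip(means, precs)]
        joint = [r * q for r, q in zip(run_probs, pred)]
        # Either a change occurs (run length resets) or every run grows by one.
        change = hazard * sum(joint)
        growth = [(1.0 - hazard) * j for j in joint]
        new_probs = [change] + growth
        total = sum(new_probs)
        run_probs = [p / total for p in new_probs]
        # Conjugate Gaussian update for each grown run; a fresh run uses the prior.
        new_precs = [p + 1.0 / obs_var for p in precs]
        new_means = [(m * p + x / obs_var) / q
                     for m, p, q in zip(means, precs, new_precs)]
        means = [mu0] + new_means
        precs = [1.0 / var0] + new_precs
        result.append(max(range(len(run_probs)), key=run_probs.__getitem__))
    return result


# Toy feature track: 50 frames around 0.0, then 50 frames around 5.0.
signal = ([0.3 * (-1) ** i for i in range(50)]
          + [5.0 + 0.3 * (-1) ** i for i in range(50)])
run_lengths = map_run_lengths(signal)
# Flag a change wherever the most probable run length drops.
change_points = [t for t in range(1, len(run_lengths))
                 if run_lengths[t] < run_lengths[t - 1]]
```

On this toy signal the most probable run length grows by one per frame and resets at the first frame of the shifted segment, so the change is flagged without any look-ahead. This is the property that makes such a recursion attractive for on-line use. A spectral detector would presumably run a comparable update per frequency band, but that extension is outside this sketch.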