Phase-sensitive decision-directed snr estimator for single-channel speech enhancement
S Ou, P Song, Y Gao - … Journal of Pattern Recognition and Artificial …, 2017 - World Scientific
S Ou, P Song, Y Gao
International Journal of Pattern Recognition and Artificial Intelligence, 2017•World ScientificThe a priori signal-to-noise ratio (SNR) plays an essential role in many speech
enhancement systems. Most of the existing approaches to estimate the a priori SNR only
exploit the amplitude spectra while making the phase neglected. Considering the fact that
incorporating phase information into a speech processing system can significantly improve
the speech quality, this paper proposes a phase-sensitive decision-directed (DD) approach
for the a priori SNR estimate. By representing the short-time discrete Fourier transform …
enhancement systems. Most of the existing approaches to estimate the a priori SNR only
exploit the amplitude spectra while making the phase neglected. Considering the fact that
incorporating phase information into a speech processing system can significantly improve
the speech quality, this paper proposes a phase-sensitive decision-directed (DD) approach
for the a priori SNR estimate. By representing the short-time discrete Fourier transform …
The a priori signal-to-noise ratio (SNR) plays an essential role in many speech enhancement systems. Most of the existing approaches to estimate the a priori SNR only exploit the amplitude spectra while making the phase neglected. Considering the fact that incorporating phase information into a speech processing system can significantly improve the speech quality, this paper proposes a phase-sensitive decision-directed (DD) approach for the a priori SNR estimate. By representing the short-time discrete Fourier transform (STFT) signal spectra geometrically in a complex plane, the proposed approach estimates the a priori SNR using both the magnitude and phase information while making no assumptions about the phase difference between clean speech and noise spectra. Objective evaluations in terms of the spectrograms, segmental SNR, log-spectral distance (LSD) and short-time objective intelligibility (STOI) measures are presented to demonstrate the superiority of the proposed approach compared to several competitive methods at different noise conditions and input SNR levels.
World Scientific
Showing the best result for this search. See all results