CLC number: TN912
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 2009-04-27
Cited: 2
Clicked: 5901
Mohsen RAHMANI, Ahmad AKBARI, Beghdad AYAD, Nima DERAKHSHAN. A noise cross PSD estimator for dual-microphone speech enhancement based on minimum statistics[J]. Journal of Zhejiang University Science A, 2009, 10(6): 805-809.
@article{title="A noise cross PSD estimator for dual-microphone speech enhancement based on minimum statistics",
author="Mohsen RAHMANI, Ahmad AKBARI, Beghdad AYAD, Nima DERAKHSHAN",
journal="Journal of Zhejiang University Science A",
volume="10",
number="6",
pages="805-809",
year="2009",
publisher="Zhejiang University Press & Springer",
doi="10.1631/jzus.A0820390"
}
%0 Journal Article
%T A noise cross PSD estimator for dual-microphone speech enhancement based on minimum statistics
%A Mohsen RAHMANI
%A Ahmad AKBARI
%A Beghdad AYAD
%A Nima DERAKHSHAN
%J Journal of Zhejiang University SCIENCE A
%V 10
%N 6
%P 805-809
%@ 1673-565X
%D 2009
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.A0820390
TY - JOUR
T1 - A noise cross PSD estimator for dual-microphone speech enhancement based on minimum statistics
A1 - Mohsen RAHMANI
A1 - Ahmad AKBARI
A1 - Beghdad AYAD
A1 - Nima DERAKHSHAN
J0 - Journal of Zhejiang University Science A
VL - 10
IS - 6
SP - 805
EP - 809
%@ 1673-565X
Y1 - 2009
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/jzus.A0820390
Abstract: Some two-microphone noise reduction techniques that work in the frequency domain exploit coherence function between two noisy signals. They have shown good results when noise signals on two sensors are uncorrelated, but their performance decreases with correlated noises. Coherence based methods can be improved when the cross power spectral density (CPSD) of correlated noise signals is available. In this paper, we propose a new method for estimation of the CPSD of the noise, which is based on the minimum tracking technique. Despite the fact that the proposed estimator does not need to implement a voice activity detector (VAD), its performance is comparable to a CPSD estimator that uses an ideal VAD.
[1] Cohen, I., 2002. Noise estimation by minima controlled recursive averaging for robust speech enhancement. IEEE Signal Processing Lett., 9(1):12-15.
[2] Deller, J.R., Hansen, J.H.L., Proakis, J.G., 2000. Discrete-time Processing of Speech Signals (2nd Ed.). IEEE Press, New York, USA.
[3] Derakhshan, N., Ayatollahi, A., Akbari, A., Rahmani, M., 2007. Noise Power Spectrum Estimation Using Time-variant Spectral Smoothing and Low-delay Minima Tracking. SPECOM, p.542-548.
[4] Guerin, A., La Bouquin-Jeannes, R., Faucon, G., 2003. A two-sensor noise reduction system: applications for hands-free car kit. EURASIP J. Appl. Signal Processing, 2003(11):1125-1134.
[5] ITU-T P.862, 2001. Perceptual Evaluation of Speech Quality (PESQ): An Objective Method for End-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs. International Telecommunication Union, Geneva.
[6] La Bouquin-Jeannes, R., Azirani, A.A., Faucon, G., 1997. Enhancement of speech degraded by coherent and incoherent noise using a cross-spectral estimator. IEEE Trans. Speech Audio Processing, 5(5):484-487.
[7] Martin, R., 1994. Spectral Subtraction Based on Minimum Statistics. 7th European Signal Processing Conf., p.1182-1185.
[8] Martin, R., 2001. Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Trans. Speech Audio Processing, 9(5):504-512.
[9] Martin, R., 2006. Bias compensation methods for minimum statistics noise power spectral density estimation. Signal Processing, 86(6):1215-1229.
[10] Zhang, X., Jia, Y., 2005. A Soft Decision Based Noise Cross Power Spectral Density Estimation for Two-microphone Speech Enhancement Systems. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, p.813-816.
Open peer comments: Debate/Discuss/Question/Opinion
<1>