Home   >   CSC-OpenAccess Library   >    Manuscript Information
Improvement of minimum tracking in Minimum Statistics noise estimation method
Hassan Farsi
Pages - 17 - 22     |    Revised - 25-02-2010     |    Published - 26-03-2010
Volume - 4   Issue - 1    |    Publication Date - March 2010  Table of Contents
MORE INFORMATION
KEYWORDS
End point Detection, Distributed Speech Recognition , Voice activity detection , Zero Cross Rating , Robust speech recognition , Volume
ABSTRACT
Noise spectrum estimation is a fundamental component of speech enhancement and speech recognition systems. In this paper we propose a new method for minimum tracking in Minimum Statistics (MS) noise estimation method. This noise estimation algorithm is proposed for highly nonstationary noise environments. This was confirmed with formal listening tests which indicated that the proposed noise estimation algorithm when integrated in speech enhancement was preferred over other noise estimation algorithms.
CITED BY (7)  
1 Li, J., Li, Z., Zhu, W., Chen, X., & Cheng, L. (2014). A new frequency-domain background noise power estimation algorithm. Simulation and Modelling Methodologies, Technologies and Applications, 60, 125.
2 Yu, Y., & Zhao, H. (2013). Improved of noise estimation algorithm based on minimum statistic. Jisuanji Gongcheng yu Yingyong(Computer Engineering and Applications), 49(4), 134-137.
3 Yu Yao, & Zhao Heming. (2013). An improved minimum statistical noise power spectrum estimation algorithm. Computer Engineering and Applications, 49 (4).
4 Kallel, F., Ghorbel, M., Frikha, M., Berger-Vachon, C., & Hamida, A. B. (2012). A noise cross PSD estimator based on improved minimum statistics method for two-microphone speech enhancement dedicated to a bilateral cochlear implant. Applied Acoustics, 73(3), 256-264.
5 Yu Yao, & Zhao Heming. (2012). Noise power non-stationary noise environments spectral estimation method of data acquisition and processing, (4), 486-489.
6 Yu, Y., & Zhao, H. (2012). New noise estimation method for highly non-stationary noise environments. Journal of Data Acquisition & Processing, 27(4), 486-489.
7 Yu, Y., & Zhao, H. (2011, January). A new method for noise power spectrum estimation. In 4th IET International Conference on Wireless, Mobile & Multimedia Networks (ICWMMN 2011).
1 Google Scholar 
2 ScientificCommons 
3 Academic Index 
4 CiteSeerX 
5 refSeek 
6 iSEEK 
7 Socol@r  
8 ResearchGATE 
9 Bielefeld Academic Search Engine (BASE) 
10 Scribd 
11 WorldCat 
12 SlideShare 
13 PDFCAST 
14 PdfSR 
A. Varga and H. J. M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effiect of additive noise on speech recognition systems," Speech Communication, 12(3): 247-251, July 1993.
B. L. McKinley and G. H. Whipple, "Model based speech pause detection," Proc. 22th IEEE Internat. Conf. Acoust. Speech Signal Process., ICASSP-97, Munich, Germany, 20-24 April 1997, pp. 1179-1182.
C. Ris and S. Dupont, "Assessing local noise level estimation methods: application to noise robust ASR," Speech Communication, 34(1): 141-158, April 2001.
Doblinger, G., 1995. "Computationally efficient speech enhancement by spectral minima tracking in subbands," in Proc. Eurospeech’ 2002, 1513–1516.
G. Doblinger, "Computationally efficient speech enhancement by spectral minima tracking in subbands," Proc. 4th EUROSPEECH'95, Madrid, Spain, 18-21 September 1995, pp. 1513-1516.
H. G. Hirsch and C. Ehrlicher, "Noise estimation techniques for robust speech recognition," Proc. 20th IEEE Inter. Conf. Acoust. Speech Signal Process., ICASSP-95, Detroit, Michigan, 8-12 May 1995, pp. 153-156.
I. Cohen and B. Berdugo, "Speech Enhancement for Non-Stationary Noise Environments," Signal Processing, 81(11): 2403-2418, November 2001.
I. Cohen and B. Berdugo, "Speech Enhancement for Non-Stationary Noise Environments," Signal Processing, 81(11): 2403-2418, November 2001.
I. Cohen, "On speech enhancement under signal presence uncertainty," Proc. 26th IEEE Internat. Conf. Acoust. Speech Signal Process., ICASSP-2001, 7-11 May 2001, pp. 167-170.
I. Cohen, “Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging,” IEEE Trans. Speech Audio Process. 11 (5): 466–475, 2003.
J. Ghasemi, K. Mollaei, “A new approach for speech enhancement based on eigenvalue spectral subtraction,” in Signal Processing: An International Journal (SPIJ), 3(4): 34-41, Sep. 2009.
J. Meyer, K. U. Simmer and K. D. Kammeyer "Comparison of one- and two-channel noiseestimation techniques," Proc. 5th International Workshop on Acoustic Echo and Noise Control, IWAENC-97, London, UK, 11-12 September 1997, pp. 137-145.
J. Sohn, N. S Kim and W. Sung, "A statistical model-based voice activity detector," IEEE Signal Processing Letters, 6(1): 1-3, January 1999.
M. Satya Sai Ram, P. Siddaiah, M. M. Latha, ” Usefullness of speech coding in voice banking,” in Signal Processing: An International Journal (SPIJ), 3(4): 42-54, Sep. 2009.
M.S. Salam, D. Mohammad, S-H Salleh, “ Segmentation of Malay Syllables in connected digit speech using statistical approach,” in Signal Processing: An International Journal (SPIJ), 2(1): 23- 33, February 2008.
R. J. McAulay and M. L. Malpass "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoustics, Speech and Signal Processing, 28(2): 137-145, April 1980.
R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech and Audio Processing, 9(5): 504-512, July 2001.
R. Martin, "Spectral subtraction based on minimum statistics," Proc. 7th European Signal Processing Conf., EUSIPCO-94, Edinburgh, Scotland, 13-16 September 1994, pp. 1182-1185.
R. Martin: “An Efficient Algorithm to Estimate the instantaneous SNR of Speech Signals,” Proc. EUROSPEECH ‘93, pp. 1093-1096, Berlin, September 21-23, 1993.
S. Quackenbush, T. Barnwell and M. Clements, “Objective Measures of Speech Quality,” Englewood Cliffs, NJ: Prentice-Hall, 1988.
Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoustics, Speech and Signal Processing, 33(2): 443-455, April 1985.
Dr. Hassan Farsi
University of Birjand - Iran
hfarsi@birjand.ac.ir


CREATE AUTHOR ACCOUNT
 
LAUNCH YOUR SPECIAL ISSUE
View all special issues >>
 
PUBLICATION VIDEOS