全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

Research on Speech Endpoint Detection Algorithm with Low SNR

DOI: 10.4236/oalib.1103487, PP. 1-8

Subject Areas: Multimedia/Signal processing

Keywords: Endpoint Detection, Multitaper Spectral Estimation, Improved Spectral Subtraction, BARK Subband Variance

Full-Text   Cite this paper   Add to My Lib

Abstract

The detection of speech endpoint is an important application for speech signal processing. Although there are variance methods, the endpoint can’t be detected accurately in low SNR (Signal to Noise Ratio). The paper pointes out an endpoint detection algorithm combining two methods together: the one is improved spectral subtraction based on multitaper spectral estimation, and the other is BARK subband variance in frequency domain. Firstly, the noisy speech signal is processed though the improved spectral subtraction based on multitaper spectral estimation. It can achieve the purpose of noise reduction through this step. Then the noisy speech signal is detected using the method of BARK subband variance in frequency domain. Compared with the common endpoint detection algorithm, it is concluded that endpoint detection accuracy by new method can be improved in low SNR.

Cite this paper

Wei, J. and Sun, X. (2017). Research on Speech Endpoint Detection Algorithm with Low SNR. Open Access Library Journal, 4, e3487. doi: http://dx.doi.org/10.4236/oalib.1103487.

References

[1]  Zhuo, G. and Bian-Ba, W.-D. (2015) A Study of Tibetan Speech Pitch Detection Algorithm Based on Matlab. Modern Electronics Technique, 10, 20-22.
[2]  Zhao, L. (2016) Speech Signal Processing. 3rd Edition, China Machine Press, Beijing.
[3]  Han, L.H., Wang, B. and Duan, S.F. (2010) Development of Voice Activity Detection Technology. Application Research of Computers, 4, 1220-1226.
[4]  Sun, Y.M., Wu, Y.Y. and Li, P. (2016) Research on Speech Endpoint Detection Based on the Improved Dual-Threshold. Journal of Changchun University of Science and Technology, 1, 92-95.
[5]  Wang, L.L., Xia, X., Feng, L. and Liu, G.C. (2014) New Speech Endpoint Detection Algorithm Based on Spectrum Variance and Spectral Subtraction. Computer Engineering and Applications, 8, 194-197.
[6]  Haigh, J.A. and Mason, J.S. (1993) Robust Voice Activity Detection Using Cepstral Features. Proceedings of Computer, Communication, Control and Power Engineering, Vol. 3, Beijing, 19-21 October 1993, 321-324.
https://doi.org/10.1109/tencon.1993.327987
[7]  Zhao, H., Wang, G.J. and Zhao, L.X. (2010) A New Voice Activity Detection Using Logarithmic Energy Spectral Entropy. Journal of Hunan University, 7, 72-77.
[8]  En, D., Zhang, F.L., Zhang, Z. and Hu, S.Q. (2016) Application of Fuzzy Entropy in Speech Endpoint Detection in Car Environments. Computer Engineering and Applications, 10, 147-150.
[9]  Zheng, J.H., Huang, H.M., Zhong, M.H., Cao, N.W. and Chen, Y.L. (2007) Comparative Study of Several Speech Signal Endpoint Detection Methods. Guanagxi Wuli, 4, 20-23.
[10]  Yin, X.C., Guo, Y., Zhang, B.F. and Liu, X. (2011) Voice Activity Detection Algorithm Based on Bark Wavelet. Computer Engineering, 12, 276-278.
[11]  Wang, Y., Feng, Y., Ding, X.B. and Chen, D.Y. (2016) Endpoint Detection Algorithm for Noisy Speech Based on Time-Frequency Combination. Journal of Natural Science of Heilonjiang University, 3, 410-415.
[12]  Wu, P.P., Zhao, G. and Zou, M. (2008) An Improved Spectral Subtraction Method Based on Multitaper Estimation. Modern Electronics Technique, 12, 150-152.
[13]  Thomson, D.J. (1982) Spectrum Estimation and Harmonic Analysis. Proceedings of the IEEE, 70, 1055-1096.
https://doi.org/10.1109/PROC.1982.12433
[14]  Han, F. and Jin, Z.X. (2016) Study of Endpoint Detection Algorithm in Low SNR. Journal of Northwest Normal University, 5, 55-59.
[15]  Hu, Y. and Loizou, P.C. (2004) Incorporating a Psycho Acoustical Model in Frequency Domain Speech Enhancement. IEEE Signal Processing Letters, 11, 270-273.
https://doi.org/10.1109/LSP.2003.821714
[16]  Dong, H. (2016) Improved Speech Endpoint Detection under Low SNR Environment. Computer Technology and Development, 3, 71-74.
[17]  Zhang, C.L., Zeng, X.Y. and Wang, S.G. (2012) A Voice Activity Detection Algorithm Based on the Variance of Critical Band Power Spectrum. Technical Acoustics, 2, 204-208.
[18]  Gao, M.M., Chang, T.H., Yang, G.T. and Li, M. (2009) Speech Feature Extraction Algorithm Based on Subband Dominant Frequency Information. Computer Engineering, 18, 161-163.
[19]  Wang, X.H., Qu, L., Zhang, C. and Jian, X.W. (2016) Speech Feature Extraction Algorithm Based on the Bark Wavelet Packet Transform with Fisher. Journal of Xian Polytechnic University, 4, 453-457.
[20]  Wang, W., Hu, G.M., Yang, L., Huang, D.F. and Zhou, Y. (2016) Research of Endpoint Detection Based on Spectral Subtraction and Uniform Subband Spectrum Variance. Audio Engineering, 5, 40-43.

Full-Text


comments powered by Disqus

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

WeChat 1538708413