Computer Science ›› 2021, Vol. 48 ›› Issue (6A): 33-37.doi: 10.11896/jsjkx.200700135

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Speech Endpoint Detection Based on Bayesian Decision of Logarithmic Power Spectrum Ratio in High and Low Frequency Band

ZHANG Zi-cheng, TAN Zhi-wei, ZHANG Chen-rui, WANG Xuan, LIU Xiao-xuan, YU Yi-biao   

  1. School of Electronic and Information Engineering,Soochow University,Suzhou,Jiangsu 215006,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:ZHANG Zi-cheng,born in 1999,undergraduate.His main research interests include digital signal processing and speech processing.
    YU Yi-biao,born in 1962,Ph.D,professor.His main research interests include speech and image processing,pattern recognition and multimedia system.

Abstract: Based on the analysis of the power spectrum of speech signal and noise in high and low frequency band,a speech endpoint detection method under low SNR based on Bayesian decision of logarithmic power spectrum ratio in high and low frequency band is proposed.Firstly,the logarithm power spectrum ratio of speech signal and background noise in two different frequency bands is calculated respectively,and the statistical distribution is obtained according to the maximum likelihood estimation,and the optimal decision threshold is derived based on Bayesian decision criterion.When the signal is input,the log energy spectrum ratio of high and low frequency bands is calculated frame by frame and it is compared with the decision threshold to classify the speech and background noise,so as to realize the endpoint detection of speech signal.The experimental results show that,compared with the traditional double threshold detection method and spectral entropy detection method,the proposed method can detect speech endpoint more accurately under the condition of low SNR,and significantly improve the accuracy and speed of endpoint detection.

Key words: Bayesian decision, Logarithmic power spectrum ratio, Low SNR, Speech endpoint detection

CLC Number: 

  • TN912.34
[1] CHEN Y Y,JIAN L.Speech endpoint detection based on maximum entropy spectral estimation and time-frequency characteris-tics[J].Computer Applications and Software,2017,34 (11):91-96.
[2] SUN H F,LONG H,SHAO Y B,et al.Speech music classification algorithm based on zero crossing rate and spectrum [J].Journal of Yunnan University (Natural Science Edition),2019,41(5):925-931.
[3] CHEN H H,XU P.Sound endpoint detection method based on spectral subtraction and short-term energy [J].Mechanical Manufacturing and Automation,2016,45(3):191-192,209.
[4] CHEN Z W,ZENG Q N,XIE X M,et al.Speech endpoint detection method based on autocorrelation function [J].Computer Engineering and Application,2018,54(6):216-221,256.
[5] WANG R D,CHAI P Q.An improved speech endpoint detection method based on spectral entropy [J].Information and Control,2004(1):77-81.
[6] RABINER L R,SAMBUR M R.An Algorithm for Determining the Endpoints of Isolated Utterances[M].John Wiley & Sons,1975.
[7] FEI Y Q,WANG Y J,XIA Y L.Research on speech endpoint detection algorithm [J].Automation Technology and Application,2017,36(8):98-102.
[8] LI Y,CHENG L F,ZHANG P L.A speech endpoint detection method based on improved spectral entropy [J].Computer Science,2016,43(S2):233-236.
[9] WANG L,LI C R.An improved endpoint detection methodbased on adaptive spectral entropy [J].Computer Simulation,2010,27(12):373-375,395.
[10] WANG S,PU B M,LI X Z,et al.Wiener filter speech enhancement algorithm based on improved energy entropy ratio [J].Computer System Applications,2017,26(11):124-131.
[11] WU P,ZHANG X B,DING W.Speech endpoint detection basedon spectral subtraction and Short-term zero entropy method [J].Electroacoustic Technology,2018,42(3):55-59.
[1] XU Pu-le, WANG Yang, HUANG Ya-kun, HUANG Shao-fen, ZHAO Chuan-xin and CHEN Fu-long. Chinese Place-name Address Matching Method Based on Large Data Analysis and Bayesian Decision [J]. Computer Science, 2017, 44(9): 266-271.
[2] SANG Yan-li and QIAN Yu-hua. Granular Structure Reduction Approach to Multigranulation Decision-theoretic Rough Sets [J]. Computer Science, 2017, 44(5): 199-205.
[3] ZHONG Jin-yi and YE Dong-yi. Extended Decision-theoretic Rough Set Models Based on Fuzzy Minimum Cost [J]. Computer Science, 2014, 41(3): 50-54.
[4] . Fuzzy Decision-theoretic Rough Sets [J]. Computer Science, 2012, 39(12): 25-29.
[5] LIU Dun,YAO Yi-yu,LI Tian-rui. Three-way Decision-theoretic Rough Sets [J]. Computer Science, 2011, 38(1): 246-250.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!