一个面向短波通信的LHOG话音检测方法

doi:10.11896/jsjkx.230600115

Abstract

Abstract: The low accuracy of voice detection in noisy environment is an open challenge for short wave communication.The application of existing methods is limited,because it is difficult to reliably extract accurate and efficient voice features in the noise environment.To solve the above problem,a Low-rank histogram of oriented gradient(LHOG) voice detection method for short wave communication is proposed in this paper.Firstly,target audio source data is preprocessed to realize visual representation of voice information in noisy environment.Then,a low-rank structure is embedded in the HOG feature extractor to alleviate redundant information and reduce noise interference,so as to obtain accurate and efficient features.Finally,the common SVM classification model can be used to reliably distinguish voice from noise in noisy environment.The test results show that the accuracy of this method is 95.12%,the false positive rate is 0.96%,and false negative rate is 13.14%.Compared with the existing mainstream methods,the experiment shows that the average detection accuracy of this method is higher,and resource occupation is less.Therefore,this method can effectively improve the detection and control efficiency of short-wave communication.

Key words: Pattern recognition, Spectrogram, HOG, Low-rank structure, SVM

CLC Number:

TP391.4

BAI Jie, TIAN Ruili, REN Yifu, YUAN Jianxia. Low-rank HOG Voice Detection Method for Short-wave Communication[J].Computer Science, 2024, 51(6A): 230600115-5.

References

[1]WANG J R,LI Y B.Design on all-digital demodulation algo-rithm for HF multitone parallel signal[J].Radio Engineering,2016,46(1):76-79.
[2]WAN L,WANG Q,LI J.End-to-End Speech Recognition with Recurrent Neural Networks for Mandarin Chinese[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2017,25(10):1974-1983.
[3]LI B.Speech Activity Detection Based on Deep Neural Networks Trained with Noise-Robust Features[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2017,25(11):2193-2203.
[4]ALDARMAKI H,ULLAH A,RAM S,et al.Unsupervised automatic speech recognition:A review[J].Speech Communication,2022,139:76-91
[5]DONG B H,LI S Q.Current status and developing tendency for high frequency communications[J].Information and Electronic Engineering,2007,5(1):1-5.
[6]YIN F M,WANG S J,ZHAO L.Environmental sound classification using DeepESC convolutional neural networks[J].Technical Acoustics.2019,38(5):590-593.
[7]CHEN D,HUANG Z P.Car honking recognition based on mel frequency cepstrum coefficient and support vector machine[J].Science Technology and Engineering,2021,21(11):4486-4491.
[8]SAILOR H B,AGRAWAL D M,PATIL H A.Unsupervised filterbank learning using convolutional restricted boltzmann machine for environmental sound classification[C]//Proceedings of Conference on the International Voice Communication Association,2017:3107-3111.
[9]CHEN H T,LIU Z Z,LIU Z M,et al.Integrating the data augmentation scheme with various classifiers for acoustic scene modeling[J].arXiv:1907.006639,2019.
[10]CHOI Y,ATIF O,LEE J,et al.Noise-robust sound-event classification system with texture analysis[J].Symmetry,2018,10(9):402.
[11]QIU Y,JIA G M,YANG J F,et al.Voice recognition model of civil aviation radiotelephony communication based on BiLSTM[J].Journal of Signal Processing,2019,35(2):293-300.
[12]YU Q Q,LI Y,LI Y.Eco-environmental sounds classificationunder noise conditions[J].Journal of Chinese Computer Systems,2011,32(8):1689-1693.
[13]YANG L D,HU J T.Audio scene recognition of deep neural network under multiple optimization mechanisms[J].Journal of Signal Processing,2021,37(10):1969-1976.
[14]DALAL N,TRIGGS B.Histograms of briented gradients forhuman detection[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR 2005).IEEE,2005:886-893.
[15]GENG Y N,LIU S S,LIU T T,et al.Survey of pedestrian detection technology based on computer vision[J].Journal of Computer Applications,2021,41(S1):43-50.
[16]LE V,ZHU Y,NGUYEN A.Research on depth image gesture segmentation and HOG-SVM gesture recognition method[J].Computer Applictions and Software,2016,33(12):122-126.
[17]ALBIOL A,MONZO D,MARTIN A,et al.Face recognitionusing HOG-EB-GM[J].Pattern Recognition Letters,2008,29(10):1537-1543.
[18]BAO X M,REN W J,LV W T.A novet algorithm for Pedestrian recognition based on gabor wavelet and HOG feature[J].Radio Engineering,2017,47(10):25-29,48.
[19]ZHANG L,ZHANG Y,CHEN L L.A method of low illumination image target recognition[J].Radio Engineering,2020,50(8):656-660.
[20]CORTES C,VAPNIK V.Support vector networks[J].Machine Learning,1995,20:273-297.
[21]XU X Y,YAO P.Palm vein recognition algorithm based onHOG and improved SVM[J].Computer Engineering and Applications,2016,52(11):175-180.
[22]SRIVASTAVA R K,PANDEY D.Speech recognition usingHMM and Soft Computing[J].Materials Today:Proceedings,2022,51:1878-1883.

Related Articles 15

[1]	SUN Ziwen, YUAN Guanglin, LI Congli, QIN Xiaoyan, ZHU Hong. Object Tracking of Structured SVM Based on DIoU Loss and Smoothness Constraints [J]. Computer Science, 2024, 51(6A): 230700113-8.
[2]	GAO Jianlei, LUO Minxia. Similarity Measure Between Picture Fuzzy Sets and Its Application in Pattern Recognition [J]. Computer Science, 2024, 51(6A): 230500153-5.
[3]	XUE Jianbin, DOU Jun, WANG Tao, MA Yuling. Scheme for Maximizing Secure Communication Capacity in UAV-assisted Edge Computing Networks [J]. Computer Science, 2024, 51(6A): 230800032-7.
[4]	ZHAO Geng, WU Rui, MA Yingjie, HUANG Sijie, DONG Youheng. Three-dimensional OFDM Constellation Encryption Scheme Based on Perturbed Spatiotemporal Chaos [J]. Computer Science, 2024, 51(5): 390-399.
[5]	YANG Xing, SONG Lingling, WANG Shihui. Remote Sensing Image Classification Based on Improved ResNeXt Network Structure [J]. Computer Science, 2023, 50(6A): 220100158-6.
[6]	LI Yuge, WANG Tianjing, SHEN Hang, LUO Xiaokang, BAI Guangwei. Anti-interference Multiuser Detection Algorithm Based on Variable Step Size Adaptive Matching Pursuit in Grant-free NOMA System [J]. Computer Science, 2023, 50(5): 322-328.
[7]	LUO Jinyan, CHANG Jun, WU Peng, XU Yan, LU Zhongkui. FMCW Radar Human Behavior Recognition Based on Residual Network [J]. Computer Science, 2023, 50(11A): 220800247-6.
[8]	XUE Jianbin, AN Na, WANG Qi, ZHANG Han. Study on NOMA-MEC System Based on JTORATPAIA in Emergency Communication Scenarios [J]. Computer Science, 2023, 50(11A): 221000240-8.
[9]	JIN Jianguo. Study on Decomposition of Two-dimensional Polygonal Objects [J]. Computer Science, 2023, 50(11A): 230300237-5.
[10]	ZHU Jianyong, LI Zhaoxiang, XU Bin, YANG Hui, NIE Feiping. Orthogonal Locality Preserving Projection Unsupervised Feature Selection Based on Graph Embedding [J]. Computer Science, 2023, 50(11A): 220900003-9.
[11]	LIU Wei-ming, AN Ran, MAO Yi-min. Parallel Support Vector Machine Algorithm Based on Clustering and WOA [J]. Computer Science, 2022, 49(7): 64-72.
[12]	ZHOU Zhi-hao, CHEN Lei, WU Xiang, QIU Dong-liang, LIANG Guang-sheng, ZENG Fan-qiao. SMOTE-SDSAE-SVM Based Vehicle CAN Bus Intrusion Detection Algorithm [J]. Computer Science, 2022, 49(6A): 562-570.
[13]	HU Cong, HE Xiao-hui, SHAO Fa-ming, ZHANG Yan-wu, LU Guan-lin, WANG Jin-kang. Traffic Sign Detection Based on MSERs and SVM [J]. Computer Science, 2022, 49(6A): 325-330.
[14]	ZHAO Geng, WANG Chao, MA Ying-jie. Study on PAPR Reduction Based on Correlation of Chaotic Sequences [J]. Computer Science, 2022, 49(5): 250-255.
[15]	SHEN Jia-fang, QIAN Li-ping, YANG Chao. Non-orthogonal Multiple Access and Multi-dimension Resource Optimization in EH Relay NB-IoT Networks [J]. Computer Science, 2022, 49(5): 279-286.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Low-rank HOG Voice Detection Method for Short-wave Communication

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0