Computer Science ›› 2020, Vol. 47 ›› Issue (6A): 187-195.doi: 10.11896/JsJkx.190900064

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Automatic Voice Detection Algorithm for Schizophrenic Combining EHHT and CI

TIAN Wei-wei1, ZHOU Yue1, YIN Wang1, HE Ling1, DENG Li-hua1 and LI Yuan-yuan2   

  1. 1 College of Electrical Engineering,Sichuan University,Chengdu 610065,China
    2 Mental Health Center of West China Hospital,Sichuan University,Chengdu 610041,China
  • Published:2020-07-07
  • About author:TIAN Wei-wei, born in 1998, postgradua-te.Her main research interests include speech signal processing and so on.
    LI Yuan-yuan, born in 1984, Ph.D.attending doctor.Her main research interests include psychiatry and mental health.
  • Supported by:
    This work was supported by the Chengdu Science and Technology Benefiting People Technology Research and Development ProJect,Sichuan Province,China (2015-HM01-00430-SF),National Natural Science Foundation of China (61503264), Sichuan University Innovation SparkBank ProJect,China(2082604401189) and Science and Technology Department ProJect of Sichuan Province,China (2019YFS0236).

Abstract: Through studying the clinical characteristics of schizophrenic speech,the experiment collected 686 vowel data samples from 14 schizophrenic patients and 793 vowel data samples from 14 healthy controls matched with gender,age and education level to establish a pathological voice database.Using the improved formant extraction algorithm combining Ensemble Hilbert-Huang Transform (EHHT) and Cepstrum Interpolation (CI) to obtain the acoustic feature parameter set reflecting emotion change of schizophrenic voice quality,then combined with the Support Vector Machine (SVM) classifier for classification,automatic voice detection of schizophrenic patients and the healthy controls is achieved.Besides,it designed experiments to discuss the influence of the four factors,that is,the number and variance of white noise,the number of IMF components and the window length,on the detection effect,and compared with the classical formant estimation methods.Experimental results show that the detection accuracy of the proposed algorithm can reach 98.8%.The patients with schizophrenia have a significant difference in the acoustical parameters of the formants represent the sound quality feature with the healthy controls,and it may provide a new obJective,quantitative and efficient indicator for the clinical assistant diagnostic research of schizophrenia.

Key words: Cepstral interpolation, Ensemble hilbert-huang transform, Formant, Schizophrenic voice, Sound quality feature

CLC Number: 

  • TP391.9
[1] QIAO Y S,ZHANG S Q,CUI W D,et al.The Effect of Mental Health Education Diverse Family Therapy on Social and Family Functioning in Patients with Schizophrenia.China Journal of Health Psychology,2019,27(2):11-15.
[2] DELATTRE P.The Physiological Interpretation of Sound Spectrograms.Publications of the Modern Language Association of America,1951,66(5): 864-875.
[3] JIN Y F,XIONG L N,CAI C L,et al.Research Progress on Psychotic Symptom Assessment Tools in Patients with Schizophrenia.Journal of Nursing,2015,22(16):17-21.
[4] WANG X H.Diagnostic Research Method of Schizophrenia Based on fMRI Technology.Imaging Research and Medical Applications,2019,3(2):59-60.
[5] WANG J J,WANG P F,QUAN W X,et al.The Characteristics of Language Cognition and Its Neural Basis in Schizophrenia .Progress in Biochemistry and Biophysics,2015,42(1):49-55.
[6] BREAKSPEAR M.The Nonlinear Theory of Schizophrenia.Australian and New Zealand Journal of Psychiatry,2006,40(1):20-35.
[7] CHAKRABORTY D,TAHIR Y,YANG Z,et al.Assessment and Prediction of Negative Symptoms of Schizophrenia from RGB+D Movement Signals//2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP).New York:IEEE Press,2017:1-6.
[8] XUE C B,CHEN Q L,ZHU J,et al.Progress in Schizophrenic Genomics Research.International Journal of Psychiatry,2014,41(1):17-20.
[9] RAPCAN V,D’ARCY S,YEAP S,et al.Acoustic and Temporal Analysis of Speech:A Potential Biomarker for Schizophrenia.Medical Engineering and Physics,2010,32(9):1074-1079.
[10] COMPTON,MICHAEL T,ANYA L,et al.The Aprosody of Schizophrenia:Computationally Derived Acoustic Phonetic Underpinnings of Monotone Speech.Schizophrenic Research,2018,197:392-399.
[11] GOLD R,BUTLER P,REVHEIM N,et al.Auditory Emotion Recognition Impairments in Schizophrenia:Relationship to Acoustic Features and Cognition.American Journal of Psychia-try,2012,169(4):424-432.
[12] XU S,YANG Z,CHAKRABORTY D,et al.Automatic Verbal Analysis of Interviews with Schizophrenic Patients//2018 IEEE 23rd International Conference on Digital Signal Processing (DSP).New York:IEEE Press,2018:1-5.
[13] COHEN A S,MITCHELL K R,DOCHERTY N M,et al.Vocal Expression in Schizophrenia:Less Than Meets the Ear.Journal of Abnormal Psychology,2016,125(2):299-309.
[14] ZHANG J,PAN Z D,GUI C,et al.Clinical Investigation of Speech Signal Features Among Patients with Schizophrenia.Shanghai Archives of Psychiatry,2016,28(2):95-102.
[15] HAN W J,LI H F,RUAN H B,et al.Review on Speech Emotion Recognition.Journal of Software,2014,25(1):37-50.
[16] GOBL C,CHASAIDE A N.The Role of Voice Quality in Communicating Emotion,Mood and Attitude.Speech Communication,2003,40(1/2):189-212.
[17] JIANG H H,HU B.Speech Emotion Recognition in Mandarin Based on PCA and SVM.Computer Science,2015,42(11):270-273.
[18] SONG Z Y.Application of MATLAB in Speech Signal Analysis and Synthesis.BeiJing:BeiJing University of Aeronautics and Astronautics Press,2013:1-344.
[19] YANG J X,SHE Y M,FU M J,et al.Formant Estimation of Isolated Words in the Wa Language Based on Adaptive Variational Mode Decomposition.Journal of Yunnan Minzu University (Natural Science Edition),2019,28(3):83-91.
[20] FRANCES M A,PINCUS H A,FIRST M B.Diagnostic and Statistical Manual of Mental Disorders.BMC Med,2013,17:133-137.
[21] COVINGTON M A,LUNDEN S L A,CRISTOFARO S L,et al.Phonetic Measures of Reduced Tongue Movement Correlate with Negative Symptom Severity in Hospitalized Patients with First-Episode Schizophrenia-Spectrum Disorders.Schizophrenia Research,2012,142(1/2/3):93-95.
[22] BERNARDINI F,LUNDEN A,COVINGTON M,et al.Associations of Acoustically Measured Tongue/Jaw Movements and Portion of Time Speaking with Negative Symptom Severity in Patients with Schizophrenia in Italy and the United States.Psychiatry Research,2016,239:253-258.
[23] HAN Z Y,WANG J,WANG D,et al.Dynamic Feature Extraction for Speech Signal Based on Formant Curve.Computer Technology and Development,2017,27(6):72-80.
[24] FENG J C,PAN S Y.Extraction Algorithm of Vital Signals Based on Empirical Mode Decomposition.Journal of South China University of Technology (Natural Science Edition),2010,38(10):1-6.
[25] WU Z,HUANG N E.Ensemble Empirical Mode Decomposi tion:A Noise-Assisted Data Analysis Method.Advances in Adaptive Data Analysis,2009,1(1):1-41.
[26] WANG Y H,YEH C H,YOUNG H W V,et al.On the Computational Complexity of the Empirical Mode Decomposition Algorithm.Physica A:Statistical Mechanics and Its Applications,2014,400:159-167.
[27] FU K.Research on Feature Extraction Methods and Its Application to Nonstationary Signal Based on HHT.Chongqing:Chongqing University,2015.
[28] MA T L.Improvement of HHT Method and Its Application in Ground Tilting Tide Signal.Kunming:Kunming University of Technology,2013.
[29] BAO H Q,LIN M C.Experimental Phonetics Summary (Updated Vision) .BeiJing:Peking University Press,2014:107-113.
[30] ZHAO T T,YANG H W.Formant Extraction Algorithm of Speech Signal by Combining EMD and WMCEP.Computer Engineering and Application,2015,51(9):207-212.
[31] ZHUO J W,WANG H J.MATLAB Mathematical Modeling Method and Practice (3rd Edition) .BeiJing:BeiJing University of Aeronautics and Astronautics Press,2018:66-69.
[32] ZHAO Y,YIN X F,CHEN K A.A New Formant Detection Algorithm Based on Cepstrum.Applied Acoustics,2010,29(6):416-424.
[33] HAN F,ZHENG J J.Improved Resonance Peak Detection Algorithm Based on LPC.Electronic Design Engineering,2017,25(17):85-89.
[34] HUANG H,CHEN X X.Speech Formant Frequency Estimation Based on Hilbert Huang Transform.Journal of ZheJiang University (Engineering Science),2006,40(11):1926-1930.
[35] YU F Q,XIAO Z.Finding Speech Formant by Using the Chara-cter of Hilbert-Huang Transform as An Adaptive Band-Filter.Technical Acoustics,2008,27(2):266-270.
[1] CHAI Hui-min, ZHANG Yong, FANG Min. Aerial Target Grouping Method Based on Feature Similarity Clustering [J]. Computer Science, 2022, 49(9): 70-75.
[2] WANG Jin, LIU Jiang. GPU-based Parallel DILU Preconditioning Technique [J]. Computer Science, 2022, 49(6): 108-118.
[3] SHAO Xin-xin. TI-FastText Automatic Goods Classification Algorithm [J]. Computer Science, 2022, 49(6A): 206-210.
[4] MAO Sen-lin, XIA Zhen, GENG Xin-yu, CHEN Jian-hui, JIANG Hong-xia. FCM Algorithm Based on Density Sensitive Distance and Fuzzy Partition [J]. Computer Science, 2022, 49(6A): 285-290.
[5] YAO Ye, ZHU Yi-an, QIAN Liang, JIA Yao, ZHANG Li-xiang, LIU Rui-liang. Android Malware Detection Method Based on Heterogeneous Model Fusion [J]. Computer Science, 2022, 49(6A): 508-515.
[6] MAO Dian-hui, HUANG Hui-yu, ZHAO Shuang. Study on Automatic Synthetic News Detection Method Complying with Regulatory Compliance [J]. Computer Science, 2022, 49(6A): 523-530.
[7] ZHOU Chu-lin, CHEN Jing-dong, HUANG Fan. WiFi-PDR Fusion Indoor Positioning Technology Based on Unscented Particle Filter [J]. Computer Science, 2022, 49(6A): 606-611.
[8] XU Jia-nan, ZHANG Tian-rui, ZHAO Wei-bo, JIA Ze-xuan. Study on Improved BP Wavelet Neural Network for Supply Chain Risk Assessment [J]. Computer Science, 2022, 49(6A): 654-660.
[9] CHEN Yu-si, AI Zhi-hua, ZHANG Qing-hua. Efficient Neighborhood Covering Model Based on Triangle Inequality Checkand Local Strategy [J]. Computer Science, 2022, 49(5): 152-158.
[10] ZHAO Geng, WANG Chao, MA Ying-jie. Study on PAPR Reduction Based on Correlation of Chaotic Sequences [J]. Computer Science, 2022, 49(5): 250-255.
[11] LIN Jin-cheng, JI Qing-ge, ZHONG Zhen-wei. Modified Social Force Model Considering Pedestrian Characteristics and Leaders [J]. Computer Science, 2022, 49(5): 347-354.
[12] JIANG Hua-nan, ZHANG Shuai, LIN Yu-fei, LI Hao. Simulation Optimization and Testing Based on Gazebo of MPI Distributed Parallelism [J]. Computer Science, 2021, 48(11A): 672-677.
[13] SHAO Xin-xin. Service Recommendation Algorithm Based on Canopy and Shared Nearest Neighbor [J]. Computer Science, 2020, 47(11A): 479-481.
[14] CHEN Pei, ZHENG Wan-bo, LIU Wen-qi, XIAO Min, ZHANG Ling-xiao. Analysis and Forecast of Some Climate Indexes in Main Producing Areas of Yunnan Province Based on Multiple Models [J]. Computer Science, 2020, 47(11A): 496-503.
[15] ZENG Lei, LI Hao, LIN Yu-fei, ZHANG Shuai. Study on Simulation Optimization of Gazebo Based on Asynchronous Mechanism [J]. Computer Science, 2020, 47(11A): 593-598.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!