Computer Science ›› 2020, Vol. 47 ›› Issue (11A): 638-641.doi: 10.11896/jsjkx.200500097

• Interdiscipline & Application • Previous Articles     Next Articles

PIFA-based Evaluation Platform for Speech Recognition System

CUI Yang1, LIU Chang-hong2   

  1. 1 College of Applied Technology,China University of Labor Relations,Beijing 100048,China
    2 College of Computer Information Engineering,Jiangxi Normal University,Nanchang 330022,China
  • Online:2020-11-15 Published:2020-11-17
  • About author:CUI Yang,born in 1979,Ph.D,lecturer.His main research interests include knowledge engineering and knowledge discovery.
  • Supported by:
    This work was supported by the Research Project of China University of Labor Relations (20XYJS004) and National Natural Science Foundation of China (61662030).

Abstract: There are many application fields of speech recognition technology,and the performance evaluation of the speech recognition system plays an important role in promoting the development of speech recognition technology.PIFA (PerformanceInfluen-cing Factor Analysis) based architecture of evaluation platform for speech recognition system is proposed by summarizing va-rious existing speech recognition evaluation methods to compare the performance of various speech systems better,and a platform with PIFA is implemented.The platform involves two key concepts,evaluation database and evaluation project,and includes mo-dules of evaluation data generation,data analysis,performance evaluation index calculation and performance influencing factors analysis.It can deal with multiple recognition tasks and many kinds of data,especially for speech recognition with large vocabulary and continuity.The evaluation results can be statistically analyzed by the platform to reveal the influence of various data attri-butes on the performance of the recognition system,and help the improvement of the speech recognition system.

Key words: Evaluation items, Evaluation library, Performance influencing factor analysis, PIFA, Speech recognition

CLC Number: 

  • TP311.1
[1] LIU J,ZHANG W Q.Research Progress on Key Technologies of Low Resource Speech Recognition[J].Journal of Data Acquisition and Processing,2017,32(2):205-220.
[2] ZHAO J H,GAO H B,LIU Y C,et al.Speech Recognition Algorithm Based on Neural Network and Hidden Markov Model[J].The Journal of China Universities of Posts and Telecommunications,August,2018,25(4):28-37.
[3] YANG Y,WANG Y D.Speech Recognition Based on Improved Convolutional Neural Netword Algorithm[J].Journal of Applied Acoustics,2018,37(6):940-946.
[4] CHEN Z H,ZHENG W L,YOU Y B,et al.Label Synchronous Decoding for Speech Recognition[J].Chinese Journal of Computers,2019,42:1-15.
[5] WEI G W,FENG Z Y.Design of DSP Speech Recognition Sys-tem Based on Denoising Technology[J].Transducer and Mic-rosystem Technologies,2017,36(1):108-118.
[6] JIN C,GONG C,LI H.Speaker Adaptation Research of Neural Network Acoustic Model in Speech Recognition[J].Computer Applications and Software,2018,35(2):200-205.
[7] HU D,ZENG Q N,LONG C,et al.Front-end Robust Study forContinuous Speech Recognition[J].Video Engineering,2015,39(24):43-46.
[8] WANG X D,XIE F,LIN S X,et al.DOE and ANOVA basedPerformance Influencing Factor Analysis for Evaluation of Speech Recognition Systems[C]//International Conference on Industrial.Singapore:ISCSLP,2013:431-442.
[9] WANG B C,WEI Y Y,DAI N.Pivoting and Approximate Pivo-ting of Bootstrap Statistics[J].Statistics and Decision,2016,16:17-20.
[10] BAO Y B,HU Y,LIU C,et al.Phomeme Modeling Units Desing for Mandarin LVCSR Systems[J].Journal of Tsinghua University(Sci & Tech),2011,51(9):1288-1292.
[1] XU Ming-ke, ZHANG Fan. Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition [J]. Computer Science, 2022, 49(7): 132-141.
[2] CHENG Gao-feng, YAN Yong-hong. Latest Development of Multilingual Speech Recognition Acoustic Model Modeling Methods [J]. Computer Science, 2022, 49(1): 47-52.
[3] YANG Run-yan, CHENG Gao-feng, LIU Jian. Study on Keyword Search Framework Based on End-to-End Automatic Speech Recognition [J]. Computer Science, 2022, 49(1): 53-58.
[4] ZHENG Chun-jun, WANG Chun-li, JIA Ning. Survey of Acoustic Feature Extraction in Speech Tasks [J]. Computer Science, 2020, 47(5): 110-119.
[5] ZHANG Jing, YANG Jian, SU Peng. Survey of Monosyllable Recognition in Speech Recognition [J]. Computer Science, 2020, 47(11A): 172-174.
[6] SHI Yan-yan, BAI Jing. Speech Recognition Combining CFCC and Teager Energy Operators Cepstral Coefficients [J]. Computer Science, 2019, 46(5): 286-289.
[7] WEI Ying, WANG Shuang-wei, PAN Di, ZHANG Ling, XU Ting-fa and LIANG Shi-li. Specific Two Words Chinese Lexical Recognition Based on Broadband and Narrowband Spectrogram Feature Fusion with Zoning Projection [J]. Computer Science, 2016, 43(Z11): 215-219.
[8] LI Wei-lin, WEN Jian and MA Wen-kai. Speech Recognition System Based on Deep Neural Network [J]. Computer Science, 2016, 43(Z11): 45-49.
[9] SUN Zhi-yuan, LU Cheng-xiang, SHI Zhong-zhi and MA Gang. Research and Advances on Deep Learning [J]. Computer Science, 2016, 43(2): 1-8.
[10] DONG Jun-jian,MAO Qi-rong,HU Su-li and ZHAN Yong-zhao. Sub-coding and Entire-coding Jointly Penalty Based Sparse Representation Dictionary Learning [J]. Computer Science, 2014, 41(10): 122-127.
[11] LIU Wan-feng,HU Jun and YUAN Wei-wei. Research on Technology of Voice Instruction Recognition for Air Traffic Control Communication [J]. Computer Science, 2013, 40(7): 131-137.
[12] CHAO Hao,YANG Zhan-lei and LIU Wen-ju. Landmark Guided Segmental Speech Decoding Algorithm for Continuous Mandarin Speech Recognition [J]. Computer Science, 2013, 40(10): 208-212.
[13] WANG Hui,Li Sheng-hua. Feature Extraction of Speech Signal Based on Empirical Mode Decomposition [J]. Computer Science, 2011, 38(Z10): 434-436.
[14] HAN Zhi-yan,WANG Jian,LUN Shu-xian. Design of Speech Recognition Classifier Based on Genetic Wavelet Neural Network [J]. Computer Science, 2010, 37(11): 243-246.
[15] HAN Zhi-yan,WANG Jian,WANG Xu. Parameter Optimization Based on Orthogonal Test Design in Speech Recognition System [J]. Computer Science, 2010, 37(1): 214-216.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!