Computer Science ›› 2022, Vol. 49 ›› Issue (11A): 210800246-6.doi: 10.11896/jsjkx.210800246

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Study on Quality Evaluation Method of Speech Datasets for Algorithm Model

LI Sun, CAO Feng, LIU Zi-shan   

  1. China Academy of Information and Communications Technology,Beijing 100191,China
  • Online:2022-11-10 Published:2022-11-21
  • About author:LI Sun,born in 1988,postgraduate,engineer.Her main research interests include machine learning,perceptual cognitive technology and data governance,etc.
    LIU Zi-shan,born in 1992,Ph.D.Her main research interests include network intelligence,federated learning,data security and privacy,etc.

Abstract: With the maturity of intelligent voice technology and product application,the demand for high-quality voice datasets is increasing.There have been some researchers put effort on the quality evaluation of the structured data,but there are few stan-dards appeared for the unstructured voice dataset.By analyzing the construction principle of speech algorithm model and analyzing the construction demand of voice dataset,a unified quality assessment framework for the voice dataset is presented.The framework proposes to evaluate the dataset in terms of four dimensions,each of which subsumes a set of criteria:breadth coverage,anthology distinction,field depth and accuracy completeness.The criteria that are suitable to evaluate the quality dimensions are presented,each with the definition,measurement method,and the evaluation process for the voice dataset quality measurement.Experimental assessment and analysis results of the voice datasets in the vehicular application field are presented as the reference for evaluating the voice dataset quality,and promoting the construction of the voice dataset.Considering the diversified applicabi-lity,privacy issues,efficiency requirements,automation requirements and other aspects of the construction of voice data sets,the development suggestions for building high-quality voice datasets are proposed.

Key words: Artificial intelligence, Speech dataset, Quality assessment, Algorithm, Model, Intelligent speech

CLC Number: 

  • TN912.34
[1]WANG R Y,STOREY V C,FIRTH C P.A framework for ana-lysis of data quality research[J].IEEE Transactions on Know-ledge and Data Engineering,1995,7(4):623-640.
[2]LEE Y W,STRONG D M,KAHN B K,et al.AIMQ:a metho-dology for information quality assessment[J].Information & Management,2002,40(2):133-146.
[3]PIPINO L L,LEE Y W,WANG R Y.Data quality assessment[J].Communications of the ACM,2002,45(4):211-218.
[4]YANG Q Y,ZHAO P Y,YANG D Q,et al.Research on data quality evaluation method[J].Computer Engineering and Application,2004,40(9):3-4,15.
[5]HUANG G,YUAN M,WU X Y,et al.Research on metadata driven data quality evaluation architecture[J].Computer Engineering and Application,2013(8):114-119,181.
[6]SHAN Y H,LI J,WANG X R,et al.The generation method of speech recognition training data and the training method of speech recognition model:CN111402865A[P].2020.
[7]ZU Y Q.Corpus design of Chinese continuous speech database[J].Journal of Acoustics,1999(3):236-247.
[8]WU H,XU B,HUANG T Y.Automatic corpus selection algorithm based on triphone model[J].Journal of Software,2000,11(2):271-276.
[9]ZHUANG J L.Research and application of quantitative analysis of data quality[D].Shanghai:Donghua University,2019.
[10]JIN J.Research on the value evaluation of information entropy in the era of big data[D].Changchun:Jilin University,2019.
[11]LIU L Y.Development of modern Chinese Corpus[J].Language Application,1996(3):3-9.
[12]GHEITH M,ABOUL-ELA M,ARAFA W.Learning WordGraph Representation for Document Classification[C]//27th Conference for Computer Science,Statistics and Operation Research.Egyptian Computer Society,2002.
[13]GOU H W,GOU X T.Analysis of word separation and sentence similarity based on word vector[J].Scientific and Technological Innovation,2018(33):55-56.
[14]GU B,LI J H,LIU K Y.Chinese text clustering based on COSA algorithm[J].Chinese Journal of Information,2007,21(6):65-70.
[15]LIU P,WANG Z Y.Multimodal speech endpoint detection[J].Journal of Tsinghua University(Natural Science Edition),2005(7):896-899.
[16]WANG T Q,LI A J.Design of continuous Chinese Speech Re-cognition Corpus[C]//National Conference on Modern Phone-tics.2003.
[1] CHAI Hui-min, ZHANG Yong, FANG Min. Aerial Target Grouping Method Based on Feature Similarity Clustering [J]. Computer Science, 2022, 49(9): 70-75.
[2] ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[3] WU Zi-yi, LI Shao-mei, JIANG Meng-han, ZHANG Jian-peng. Ontology Alignment Method Based on Self-attention [J]. Computer Science, 2022, 49(9): 215-220.
[4] HU Yu-jiao, JIA Qing-min, SUN Qing-shuang, XIE Ren-chao, HUANG Tao. Functional Architecture to Intelligent Computing Power Network [J]. Computer Science, 2022, 49(9): 249-259.
[5] LIU Xin, WANG Jun, SONG Qiao-feng, LIU Jia-hao. Collaborative Multicast Proactive Caching Scheme Based on AAE [J]. Computer Science, 2022, 49(9): 260-267.
[6] NING Han-yang, MA Miao, YANG Bo, LIU Shi-chang. Research Progress and Analysis on Intelligent Cryptology [J]. Computer Science, 2022, 49(9): 288-296.
[7] WANG Zi-kai, ZHU Jian, ZHANG Bo-jun, HU Kai. Research and Implementation of Parallel Method in Blockchain and Smart Contract [J]. Computer Science, 2022, 49(9): 312-317.
[8] JIANG Yang-yang, SONG Li-hua, XING Chang-you, ZHANG Guo-min, ZENG Qing-wei. Belief Driven Attack and Defense Policy Optimization Mechanism in Honeypot Game [J]. Computer Science, 2022, 49(9): 333-339.
[9] DOU Jia-wei. Privacy-preserving Hamming and Edit Distance Computation and Applications [J]. Computer Science, 2022, 49(9): 355-360.
[10] CHEN Jun, HE Qing, LI Shou-yu. Archimedes Optimization Algorithm Based on Adaptive Feedback Adjustment Factor [J]. Computer Science, 2022, 49(8): 237-246.
[11] YANG Bing-xin, GUO Yan-rong, HAO Shi-jie, Hong Ri-chang. Application of Graph Neural Network Based on Data Augmentation and Model Ensemble in Depression Recognition [J]. Computer Science, 2022, 49(7): 57-63.
[12] LIU Wei-ming, AN Ran, MAO Yi-min. Parallel Support Vector Machine Algorithm Based on Clustering and WOA [J]. Computer Science, 2022, 49(7): 64-72.
[13] ZHANG Hong-bo, DONG Li-jia, PAN Yu-biao, HSIAO Tsung-chih, ZHANG Hui-zhen, DU Ji-xiang. Survey on Action Quality Assessment Methods in Video Understanding [J]. Computer Science, 2022, 49(7): 79-88.
[14] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[15] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!