Computer Science ›› 2013, Vol. 40 ›› Issue (9): 208-211.

Previous Articles     Next Articles

Research on Mongolian Spoken Term Detection Method Based on Segmentation Recognition

BAO Fei-long,GAO Guang-lai,YAN Xue-liang and WANG Wei-hua   

  • Online:2018-11-16 Published:2018-11-16

Abstract: Mongolian is an agglutinative language.This special formation rule results in an amount of probably millions of words which are far beyond the coverage of the pronunciation dictionary of any current Mongolian Large Vocabulary Continuous Speech Recognition(LVCSR)System.On the other hand,even if the pronunciation dictionary is large enough to cover most of the Mongolian words,the recognition system still won’t perform well due to the sparseness of training data.To avoid the poor coverage problem of pronunciation Dictionary,we proposed a segmentation-based LVCSR approach and trained its acoustic model and language model.This approach is integrated into Mongolian Spoken Term Detection(STD)system and tested on Mongolian speech data.Experimental results show that our segmentation-based LVCSR approach can recognize most of the Mongolian words successfully and both the precision and recall of the Mongolian STD system are greatly improved by converting most of the out-of-vocabulary words into their in-vocabulary form.

Key words: Mongolian,Stem,Ending suffix,Spoken term detection,Out-of-vocabulary word,Confusion network

[1] Bao Fei-long,Gao Guang-lai.The Research on Mongolian Spo-ken Term Detection Based on Confusion Network[C]∥Procee-dings of The Chinese Conference on Pattern Recognition(CCPR2012).Beijing,2012:606-612
[2] Gao Guang-lai,Biligetu,Nabuqing,et al.A Mongolian speechrecognition system based on HMM[C]∥Proceedings of International Conference on Intelligent Computing(ICIC2006).Kunming,2006:667-676
[3] Qilao H S,Gao Guang-lai.Researching of Speech Recognition Oriented Mongolian Acoustic Model[C]∥Proceedings of The Chinese Conference on Pattern Recognition(CCPR2008).Beijing,2008:406-411
[4] Bao Fei-long,Gao Guang-lai.Improving of Acoustic Model forthe Mongolian Speech Recognition System[C]∥Proceedings of The Chinese Conference on Pattern Recognition(CCPR2009).Nanjing,2009:616-620
[5] 清格尔泰.蒙古语语法 [M].呼和浩特:内蒙古人民出版社,1991:77-133
[6] Mangu L,Brill E,Stolcke A.Finding consensus in speech recognition:word error minimization and other applications of confusion networks[J].Computer Speech and Language,2000,14(4):373-400
[7] Mamou J,Carmel D,Hoory R.Spoken document retrieval from call-center conversations[C]∥Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval.New York,NY,USA,2006:51-58
[8] Mamou J,Ramabhadran B,Siohan O.Vocabulary independentspoken term detection[C]∥Proc.ACM-SIGIR’07.Amsterdam,2007:615-622
[9] Young S,et al.The HTK book(Revised for HTK version 3.4.1)[M].Cambridge University,2009
[10] Stolcke A.SRILM-An Extensible Language Modeling Toolkit[C]∥Proc.Intl.Conf.Spoken Language Processing.Denver,Colorado,2002

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!