Computer Science ›› 2024, Vol. 51 ›› Issue (6A): 230900073-6.doi: 10.11896/jsjkx.230900073

• Interdiscipline & Application • Previous Articles     Next Articles

Implementation and Application of Chinese Grammatical Error Diagnosis System Based on CRF

LI Bin1, WANG Haochang2   

  1. 1 School of Computer Science and Engineering,Hunan University of Science and Technology,Xiangtan,Hunan 411201,China
    2 School of Computer and Information Technology,Northeast Petroleum University,Daqing,Heilongjiang 163318,China
  • Published:2024-06-06
  • About author:LI Bin,born in 1993,master,lecturer.His main research interests include na-tural language processing and intelligence education.
  • Supported by:
    National Natural Science Foundation of China(61402099) and Natural Science Foundation of Heilongjiang Pro-vince of China(LH2021F004).

Abstract: With the improvement of China’s international influence and the worldwide status of Chinese,the number of foreigners who learn Chinese as a second language increases year by year,and Chinese has become one of the most popular languages in the world.Based on this,the research of Chinese grammatical error diagnosis has attracted much attention.This paper first summarizes the current research status from the definition of Chinese grammatical error diagnosis.Secondly,through the analysis of various Chinese grammatical error diagnosis methods,a Chinese grammatical error diagnosis system based on conditional random field (CRF) is constructed to explore the Chinese grammar automatic error detection system and its specific application process,so as to assist Chinese learners in improving their learning efficiency.Experimental results on the CGED2016 dataset show that the system performs well in the detection and identification levels and needs to be improved in the position level.

Key words: Chinese grammatical error diagnosis, Sequence annotation, Conditional random field, Natural language processing

CLC Number: 

  • TP391.1
[1]YU L C,LEE L H,CHANG L P.Overview of grammatical error diagnosis for learning Chinese as a foreign language[C]//Proceedings of the 1st Workshop on Natural Language Processing Techniques for Educational Applications.Nara,Japan:Asia Pacific Society for Computers in Education,2014:42-47.
[2]CHENG S M,YU C H,CHEN H H.Chinese Word Ordering Errors Detection and Correction for Non-Native Chinese Language Learners[C]//Proceedings of COLING 2014,the 25th International Conference on Computational Linguistics:Technical Papers.2014:279-289.
[3]LAFFERTY J,MCCALLUM A,PEREIRA F.Conditional Random Fields:Probabilistic Models for Segmenting and Labeling Sequence Data[C]//Proceedings of the 18th International Conference on Machine Learning.2001:282-289.
[4]YEH J F,YEH C K,YU K H,et al.Condition Random Fields-based Grammatical Error Detection for Chinese as Second Language[C]//Proceedings of the 2nd Workshop on Natural Language Processing Techniques for Educational Applications.Beijing,China:Association for Computational Linguistics and Asian Federation of Natural Language Processing,2015:105-110.
[5]ZHENG B,CHE W,GUO J,et al.Chinese Grammatical Error Diagnosis with Long Short-TermMemory Networks[C]//Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications.Osaka,Japan:Natural Language Processing Techniques for Educational Applications,2016:49-56.
[6]HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[7]YANG Y,XIE P,TAO J,et al.Alibaba at IJCNLP-2017 Task 1:Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task[C]//Proceedings of the IJCNLP:Shared Tasks,2017:41-46.
[8]FU K,HUANG J,DUAN Y.Youdao’s Winning solution to the NLPCC-2018 Task 2 challenge:A neural machine translation approach to Chinese grammatical error correction[C]//Proceedings of the CCF International Conference on Natural Language Processing and Chinese Computing.Cham:Springer,2018:341-350.
[9]LI C,ZHOU J,BAO Z,et al.A Hybrid System for ChineseGrammatical Error Diagnosis and Correction[C]//Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications.Stroudsburg,PA,USA:Association for Computational Linguistics,2018:60-69.
[10]FU R,PEI Z,GONG J,et al.Chinese Grammatical Error Diagnosis using Statistical and Prior Knowledge driven Features with Probabilistic Ensemble Enhancement[C]//Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications.Stroudsburg,PA,USA:Association for Computational Linguistics,2018:52-59.
[11]DEVLIN J,CHANG M W,LEE K,et al.BERT:Pre-training of Deep Bidirectional Transformers for Language Understanding[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies. Minneapolis:ACL,2019:4171-4186.
[12]WANG S,WANG B,GONG J,et al.Combining Resnet andTransformer for Chinese Grammatical Error Diagnosis[C]//Proceedings of the 6th Workshop on Natural Language Proces-sing Techniques for Educational Applications.Suzhou,China:Association for Computational Linguistics,2020:36-43.
[13]LUO Y,BAO Z,LI C,et al.Chinese Grammatical Error Diagnosis with Graph Convolution Network and Multi-task Learning[C]//Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications.Suzhou,China:Association for Computational Linguistics,2020:44-48.
[14]SUTSKEVER I,VINYALS O,LE Q V.Sequence to sequencelearning with neural networks[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems.Cambridge,USA:MIT Press,2014:3104-3112.
[15]CAO Y,HE L,RIDLEY R,et al.Integrating BERT and Score-based Feature Gates for Chinese Grammatical Error Diagnosis[C]//Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications.Suzhou,China:Association for Computational Linguistics,2020:49-56.
[16]CHEN C.Research on the Diagnosis and Repair of Short Text Error Based on Web[D].Wuhan:Central China Normal University,2018.
[17]ZAREMBA W,SUTSKEVER I,VINYALS O.Recurrent Neural Network Regularization[J].arXiv:1409.2329,2014.
[18]ZHAO Y N.The Study of Automatic Function Information Extraction and Classification Approach for Chinese Patent[D].Tianjin:Hebei University of Technology,2016.
[19]WANG S S.Research on Construction Technology of Question Bank Based on Knowledge Graph[D].Chongqing:Chongqing University of Posts and Telecommunications,2021.
[20]LIN Q M,QI Z Z.Research on Speech Emotion RecognitionBased on HMM and ANN Mixed Model[J].Computer Techno-logy and Development,2018,28(10):74-78.
[21]LANG B,FAN Y N.Personalized Learning Behavior Evaluation Method Based on Deep Neural Network[J].Computer Techno-logy and Development,2019,29(7):6-10.
[22]ZHANG Y P.Research on the Construction Technology of Military Equipment Knowledge Graph[D].Xi’an:Xidian University,2021.
[23]HAN W Y.Question Answering Oriented Automatic Chinese Grammatical Error Diagnosis Method[D].Harbin:Harbin Institute of Technology,2015.
[24]LEE L H,RAO G Q,YU L C,et al.Overview of NLP-TEA 2016 Shared Task for Chinese Grammatical Error Diagnosis[C]//Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications.Osaka,Japan:Natural Language Processing Techniques for Educational Applications,2016:40-48.
[25]LI B,WANG H C.Design and Research of Intelligent Foreign Chinese Learning System[J].Computer Technology and Deve-lopment,2022,32(3):15-20.
[26]LI B.Research and Implementation of Teaching Chinese as Fo-reign Language System Based on Chatbot[D].Daqing:Northeast Petroleum University,2019.
[1] YANG Binxia, LUO Xudong, SUN Kaili. Recent Progress on Machine Translation Based on Pre-trained Language Models [J]. Computer Science, 2024, 51(6A): 230700112-8.
[2] WANG Yingjie, ZHANG Chengye, BAI Fengbo, WANG Zumin. Named Entity Recognition Approach of Judicial Documents Based on Transformer [J]. Computer Science, 2024, 51(6A): 230500164-9.
[3] LI Minzhe, YIN Jibin. TCM Named Entity Recognition Model Combining BERT Model and Lexical Enhancement [J]. Computer Science, 2024, 51(6A): 230900030-6.
[4] PENG Bo, LI Yaodong, GONG Xianfu, LI Hao. Method for Entity Relation Extraction Based on Heterogeneous Graph Neural Networks and TextSemantic Enhancement [J]. Computer Science, 2024, 51(6A): 230700071-5.
[5] LAI Xin, LI Sining, LIANG Changsheng, ZHANG Hengyan. Ontology-driven Study on Information Structuring of Aeronautical Information Tables [J]. Computer Science, 2024, 51(6A): 230800150-7.
[6] ZHANG Mingdao, ZHOU Xin, WU Xiaohong, QING Linbo, HE Xiaohai. Unified Fake News Detection Based on Semantic Expansion and HDGCN [J]. Computer Science, 2024, 51(4): 299-306.
[7] TU Xin, ZHANG Wei, LI Jidong, LI Meijiao , LONG Xiangbo. Study on Automatic Classification of English Tense Exercises for Intelligent Online Teaching [J]. Computer Science, 2024, 51(4): 353-358.
[8] ZHENG Cheng, SHI Jingwei, WEI Suhua, CHENG Jiaming. Dual Feature Adaptive Fusion Network Based on Dependency Type Pruning for Aspect-basedSentiment Analysis [J]. Computer Science, 2024, 51(3): 205-213.
[9] GE Huibin, WANG Dexin, ZHENG Tao, ZHANG Ting, XIONG Deyi. Study on Model Migration of Natural Language Processing for Domestic Deep Learning Platform [J]. Computer Science, 2024, 51(1): 50-59.
[10] GU Shiwei, LIU Jing, LI Bingchun, XIONG Deyi. Survey of Unsupervised Sentence Alignment [J]. Computer Science, 2024, 51(1): 60-67.
[11] MAO Xin, LEI Zhanyao, QI Zhengwei. Automated Kaomoji Extraction Based on Large-scale Danmaku Texts [J]. Computer Science, 2024, 51(1): 284-294.
[12] ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[13] ZHOU Ziyi, XIONG Hailing. Image Captioning Optimization Strategy Based on Deep Learning [J]. Computer Science, 2023, 50(8): 99-110.
[14] ZHU Xiubao, ZHOU Gang, CHEN Jing, LU Jicang, XIANG Yixin. Single-stage Joint Entity and Relation Extraction Method Based on Enhanced Sequence Annotation Strategy [J]. Computer Science, 2023, 50(8): 184-192.
[15] ZHU Yuying, GUO Yan, WAN Yizhao, TIAN Kai. New Word Detection Based on Branch Entropy-Segmentation Probability Model [J]. Computer Science, 2023, 50(7): 221-228.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!