计算机科学 ›› 2022, Vol. 49 ›› Issue (6A): 66-73.doi: 10.11896/jsjkx.210600134
李小伟, 舒辉, 光焱, 翟懿, 杨资集
LI Xiao-wei, SHU Hui, GUANG Yan, ZHAI Yi, YANG Zi-ji
摘要: 随着信息技术的快速发展,数据产生了爆炸式的增长,互联网上每天都会新增大量的简历数据。对求职者的简历进行分析,从中获取候选人的各类人员信息、所属行业类别和进一步的工作职位推荐是学者们所关注的问题。人工分析简历效率低下的问题,推动了自然语言处理(Natural Language Processing,NLP)技术在简历分析中的广泛应用。NLP利用人工智能和计算机技术来分析、理解和处理自然语言,可实现简历的自动化分析。文中调研了近10年来的相关文献,对NLP在简历分析中的应用环节及常用方法进行了梳理。首先,对自然语言处理进行了介绍;接着,从简历信息抽取、简历分类和简历推荐3个方面分析和归纳了自然语言处理在简历分析技术中的研究进展;最后,对简历分析的研究趋势作了预测并总结了全文。
中图分类号:
[1] ZU S,WANG X.Resume Information Extraction with a Novel Text Block Segmentation Algorithm[J].International Journal on Natural Language Computing,2019,8(5):29-48. [2] CHOU Y,YU H.Based on the Application of Ai Technology in Resume Analysis and Job Recommendation[C]//2020 IEEE International Conference on Computational Electromagnetics(ICCEM).IEEE,2020:291-296. [3] MATHEW L,GEORGE NC,LINET N,et al.“ats Breaker”-a System for Comparing Candidate Resume and Company Requirements[J].International Journal of Engineering Research & Technology,2020,9(6):591-594. [4] DESHPANDE A,KHATRI D,DESHPANDE D,et al.Proposed System for Resume Analytics[J].International Journal of Engineering Research & Technology,2016,5(11):468-471 [5] MITTAL V,MEHTA P,RELAN D,et al.Methodology for Resume Parsing and Job Domain Prediction.[J].Journal of Statistics & Management Systems,2020,23(7):1265-1274. [6] QIU X P,SUN T X,XU Y G,et al.Pre-trained Models for Natural Language Processing:A Survey[J].Science China(Technological Sciences),2020,63(10):1872-1897. [7] ZHOU M,DUAN N,LIU S.Progress in Neural Nlp:Modeling,Learning,and Reasoning[J].Engineering,2020,6(3):275-290. [8] CHEN D G,MA J L,MA Z P,et al.Review of Pre-trainingTechniques for Natural Language Processing[J].Journal of Frontiers of Computer Science and Technology,2021,15(8):1359-1389. [9] WANG Y,DENG H,LI X Y,et al.A review of natural language processing application in construction engineering[J].Journal ofGraphics,2020,41(4):501-511. [10] DEEPAK G,TEJA V,SANTHANAVIJAYAN A.A NovelFirefly Driven Scheme for Resume Parsing and Matching Based on Entity Linking Paradigm[J].Journal of Discrete Mathematical Sciences and Cryptography,2020,23(1):157-165. [11] GUO X Y,HE T T.Survey about Research on Information Extraction[J].Computer Science,2015,42(2):14-17,38. [12] OTTER D,MEDINA J,KALITA J.A Survey of the Usages of Deep Learning for Natural Language Processing[J].IEEE Transactions on Neural Networks and Learning Systems,Neural Networks and Learning Systems,2021,32(2):604-624. [13] CHEN C B.Curriculum Vitae Recognition System base on Identification OF Semi-structured Text [D].Beijing:Beijing University of Post and Telecommunications,2008. [14] JIANG Z,ZHANG C,XIAO B,et al.Research and Implementation of Intelligent Chinese Resume Parsing[C]//2009 Wri International Conference on Communications and Mobile Computing.IEEE,2009:588-593. [15] ZHANG C,WU M,LI C G,et al.Resume Parser:Semi-structured Chinese Document Analysis[C]//2009 Wri World Congress on Computer Science and Information Engineering.IEEE,2009:12-16. [16] MU Y,WANG Y,GUO J.Extracting Software Functional Requirements From Free Text Documents[C]//2009 International Conference on Information and Multimedia Technology.IEEE,2009:194-198. [17] DARSHAN P M.Ontology Based Information Extraction From Resume[C]//2017 International Conference on Trends in Electronics and Informatics(ICEI).2017:43-47. [18] TOBING B C L,SUHENDRA I R,HALIM C.Catapa Resume Parser:End to End Indonesian Resume Extraction[C]//Proceedings of the 2019 3rd International Conference on Natural Language Processing and Information Retrieval.2019:68-74. [19] PAWAR S,SRIVASTAVA R,PALSHIKAR G K.AutomaticGazette Creation for Named Entity Recognition and Application to Resume Processing[C]//Proceedings of the 5th Acm Compute Conference:Intelligent & Scalable System Technologies.2012:1-7. [20] YAN W T,QIAO Y P.Chinese Resume Information Extraction Based on Semi-structured Text[C]//2017 36th Chinese Control Conference(CCC).2017:11177-11182. [21] YU K,GUAN G,ZHOU M.Resume Information Extractionwith Cascaded Hybrid Model[C]//Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics(ACĹ05).2005:499-506. [22] GU N N,FENG J,SUN X,et al.Chinese resume information automatic extraction and recommendation algorithm[J].Computer Engineering and Applications,2017,53(18):141-148,270. [23] YADAV V,BETHARD S.A Survey on Recent Advances inNamed Entity Recognition From Deep Learning Models[J].arXiv:1910.1147v1,2019. [24] LI J,ZHANG H,GU X W.Framework of Vita Event Extraction and Retrieval[J].Computer Science,2012,39(7):154-160,174. [25] CHEN J,GAO L,TANG Z.Information Extraction From Resume Documents in Pdf Format[J].Electronic Imaging,2016,2016(17):1-8. [26] MU R,ZENG X,HAN L.A Survey of Recommender Systems Based on Deep Learning[J].IEEE Access,2019,6:69009-69022. [27] HUANG S,LI W,ZAHNG J.Entity extraction method of re-sume information based on deep learning[J].Computer Engineering and Design,2018,39(12):3873-3878. [28] CHEN Y,FU L,DAI Y X,et al.Research of Chinese Resume Analysis Based on Feature Fusion[J].Computer Engineering and Applications,2019,55(10):244-249. [29] PHAM VAN L,VU NGOC S,NGUYEN VAN V.Study of Information Extraction in Resume[R].Vienna:VNU University of Engineering and Technology,2018. [30] AYISHATHAHIRA C,SREEJITH C,RASEEK C.Combina-tion of Neural Networks and Conditional Random Fields for Efficient Resume Parsing[C]//2018 International Cet Conference on Control,Communication,and Computing(IC4).2018:388-393. [31] KATSUTA A,H ANJAYA HA,ASATI S,et al.InfomationExtraction From English & Japanese Résumé with Neural Sequence Labelling Methods[C]//2018 The Association for Natural Language Processing.2018:1007-1010. [32] DERNONCOURT F,LEE J Y,SZOLOVITS P.Neuroner:anEasy-to-use Program for Named-entity Recognition Based on Neural Networks[J].arXiv:1705.05487,2017. [33] SU Y,ZHANG J,LU J.The Resume Corpus:a Large Dataset for Research in Information Extraction Systems[C]//2019 15th International Conference on Computational Intelligence and Security(CIS).2019:375-378. [34] WANG K,LIU B S.A survey of text classification[J].DataCommunication,2019,190(3):37-47. [35] ZAROOR A,MAREE M,SABHA M.Jrc:a Job Post and Resume Classification System for Online Recruitment[C]//2017 IEEE 29th International Conference on Tools with Artificial Intelligence(ICTAI).2017:780-787. [36] FENG L N,ZHOU L H.Application of K-Means in the Curriculum Vitae Data[J].Computer Science,2009,36(8):226-278. [37] HUANG X Y,ZHOU W M.A Study of Job Description Clustering Based on Latent Semantics Index[J].Network New Media Technology,2017,6(3):33-37,64. [38] WAN J,CHEN B,SI H.Mining and Measurement of Vocational Skills and Their Association Rules Based on Big Data[C]//Proceedings of the International Conference on Digital Technology in Education.2017:59-63. [39] XU X,QIAN H,GE C,et al.Industry classification with online resume big data:A design science approach[J].Information & Management,2020,57(5):103-182. [40] LAMBA D,GOYAL S,CHITRESH V,et al.An IntegratedSystem for Occupational Category Classification based on Resume and Job Matching[C]//Proceedings of the International Conference on Innovative Computing & Communications(ICICC).2020. [41] JAVED F,MCNAIR M,JACOB F,et al.Towards a job title classification system[J].arXiv:1606.00917,2016. [42] GOPALAKRISHNA S T,VIJAYARAGHAVAN V.Automa-ted Tool for Resume Classification Using Sementic Analysis[J].International Journal of Artificial Intelligence and Applications(IJAIA),2019,10(1):11-23. [43] SAYFULLINA L,MALMI E,LIAO Y,et al.Domain adaptation for resume classification using convolutional neural networks[C]//International Conference on Analysis of Images,Social Networks and Texts.Cham:Springer,2017:82-93. [44] NASSER S,SREEJITH C,IRSHAD M.Convolutional NeuralNetwork with Word Embedding Based Approach for Resume Classification[C]//2018 International Conference on Emerging Trends and Innovations In Engineering And Technological Research(ICETIETR).IEEE,2018:1-6. [45] JIECHIEU K F F,TSOPZE N.Skills prediction based on multi-label resume classification using CNN with model predictions explanation[J].Neural Computing and Applications,2021,33(10):5069-5087. [46] HAO K.Design and Implementation of Job RecommendationSystem Based on Resume Data [D].Nanjing:Southeast University,2018. [47] CERNIAN A,CARSTOIU D,MARTIN B.Semi-automatic Tool for Parsing Cvs and Identifying Candidates' Abilities and Competencies[C]//EMASS 2016.2016. [48] CHANDOLA D,GARG A,MAURYA A,et al.Online Resume Parsing System Using Text Analytics[J].Journal of Multi Disciplinary Engineering Technologies(JMDET),2015,9(1):1-5. [49] GAJALAKSHMI P,RAMESH V.Design of Automated Re-sume Extraction System Using Horspool and Karp-rabin Algorithms in Text Mining[J].International Journal of Research in Engineering Technology,2016,1(6):9-16. [50] ROY P K,CHOWDHARY S S,BHATIA R.A Machine Learning Approach for Automation of Resume Recommendation System[J].Procedia Computer Science,2020,167:2318-2327. [51] WANG Z,TANG X,CHEN D.A Resume RecommendationModel for Online Recruitment[C]//2015 11th International Conference on Semantics,Knowledge and Grids(SKG).IEEE,2015:256-259. [52] CUI Y.Research on Two-way Recommendation of GraduateEmployment Based on Collaborative Filtering and Thematic Model [D].Beijing:Beijing Jiaotong University,2018. [53] GUO S,ALAMUDUN F,HAMMOND T.Résumatcher:a Personalized Résumé-job Matching System[J].Expert Systems with Applications,2016,60:169-182. [54] MOHAMED A,BAGAWATHINATHAN W,IQBAL U,et al.Smart Talents Recruiter-resume Ranking and Recommendation System[C]//2018 IEEE International Conference on Information and Automation for Sustainability(ICIAFS).2018:1-5. [55] SIVARAMAKRISHNAN N,SUBRAMANIYASWAMY V,ARUNKUMAR S,et al.Validating Effective Resume Based on Employer's Interest with Recommendation System[J].International Journal of Pure and Applied Mathematics,2018,119(12):13261-13272. [56] FU Z Y.Research on Computing the Fitness of Person-PostBased on Ability Evaluation and Semantic Similarity [D].Shenzhen:Shenzhen University,2018. [57] CAO J.Design and implementation of enterprise recruitmentsystem based on text semantic similarity [D].Qufu:Qufu Normal University,2020. [58] LI C,FISHER E,THOMAS R,et al.Competence-Level Prediction and Resume & Job Description Matching Using Context-Aware Transformer Models[J].arXiv:2011.02998,2020. [59] WANG S,JIANG W,ZHENG L B,et al.Resume ScreeningBased on Discrete Selection Model[J].Value Engineering,2019,38(20):191-194. [60] QIN S,XU X L,WANG C L.The Study of Resume Information Selection Basing on the Discrete Choice Model[C]//Proceedings of the 14th Annual Conference of China Management Science.2012:143-147. [61] HONG H Y.Screening of Resumes Based on Bayesian Classifier[J].Computer Technology and Development,2012,22(7):85-87. [62] QIU X Y.Text Classification Algorithm and its Realization inthe Campus Recruitment Management System [D].Wuhan:Wuhan University of Technology,2015. [63] ZHANG B,CUI C Y,CUI J Z.An enterprise resume screening method based on BPNN and its application[J].Intelligent City,2016,2(5):49-52. [64] DENG Y,LEI H,LI X,et al.An Improved Deep Neural Network Model for Job Matching[C]//2018 International Confe-rence on Artificial Intelligence and Big Data(ICAIBD).2018:106-112. [65] YAN R,LE R,SONG Y,et al.Interview Choice Reveals Your Preference on the Market:to Improve Job-resume Matching Through Profiling Memories[C]//Proceedings of the 25th Acm Sigkdd International Conference on Knowledge Discovery & Data Mining.2019:914-922. [66] ZHU C,ZHU H,XIONG H,et al.Person-job Fit:Adapting the Right Talent for the Right Job with Joint Representation Lear-ning[J].ACM Transactions on Management Information Systems(TMIS),2018,9(3):1-17. [67] QIN C,ZHU H,XU T,et al.Enhancing Person-job Fit for Ta-lent Recruitment:an Ability-aware Neural Network Approach[C]//The 41st International Acm Sigir Conference on Research &Development in Information Retrieval.2018:25-34. [68] BIAN S,ZHAO W X,SONG Y,et al.Domain Adaptation forPerson-job Fit with Transferable Deep Global Match Network[C]//Proceedings of the 2019 Conference on Empirical Me-thods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(emnlp-ijcnlp).2019:4812-4822. [69] DEVLIN J,CHANG M,LEE K,et al.Bert:Pre-training of Deep Bidirectional Transformers for Language Understanding[J].arXiv:1810.04805,2018. [70] BHATIA V,RAWAT P,KUMAR A,et al.End-to-end Resume Parsing and Finding Candidates for a Job Description Using Bert[J].arXiv:1910.03089,2019. [71] BIAN S,CHEN X,ZHAO W X,et al.Learning to Match Jobs with Resumes from Sparse Interaction Data using Multi-View Co-Teaching Network[C]//Proceedings of the 29th ACM International Conference on Information & Knowledge Management.2020:65-74. |
[1] | 闫佳丹, 贾彩燕. 基于双图神经网络信息融合的文本分类方法 Text Classification Method Based on Information Fusion of Dual-graph Neural Network 计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042 |
[2] | 王剑, 彭雨琦, 赵宇斐, 杨健. 基于深度学习的社交网络舆情信息抽取方法综述 Survey of Social Network Public Opinion Information Extraction Based on Deep Learning 计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099 |
[3] | 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木. 中文预训练模型研究进展 Advances in Chinese Pre-training Models 计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018 |
[4] | 张虎, 柏萍. 融入句子中远距离词语依赖的图卷积短文本分类方法 Graph Convolutional Networks with Long-distance Words Dependency in Sentences for Short Text Classification 计算机科学, 2022, 49(2): 279-284. https://doi.org/10.11896/jsjkx.201200062 |
[5] | 陈志毅, 隋杰. 基于DeepFM和卷积神经网络的集成式多模态谣言检测方法 DeepFM and Convolutional Neural Networks Ensembles for Multimodal Rumor Detection 计算机科学, 2022, 49(1): 101-107. https://doi.org/10.11896/jsjkx.201200007 |
[6] | 王立梅, 朱旭光, 汪德嘉, 张勇, 邢春晓. 基于深度学习的民事案件判决结果分类方法研究 Study on Judicial Data Classification Method Based on Natural Language Processing Technologies 计算机科学, 2021, 48(8): 80-85. https://doi.org/10.11896/jsjkx.210300130 |
[7] | 裴莹, 李天祥, 王鏖清, 付加胜, 韩霄松. 基于新闻的国际天然气价格趋势预测方法 Prediction Method of International Natural Gas Price Trends Based on News 计算机科学, 2021, 48(6A): 235-239. https://doi.org/10.11896/jsjkx.201000056 |
[8] | 丁玲, 向阳. 基于分层次多粒度语义融合的中文事件检测 Chinese Event Detection with Hierarchical and Multi-granularity Semantic Fusion 计算机科学, 2021, 48(5): 202-208. https://doi.org/10.11896/jsjkx.200800038 |
[9] | 吴俣, 李舟军. 检索式聊天机器人技术综述 Survey on Retrieval-based Chatbots 计算机科学, 2021, 48(12): 278-285. https://doi.org/10.11896/jsjkx.210900250 |
[10] | 仝鑫, 王斌君, 王润正, 潘孝勤. 面向自然语言处理的深度学习对抗样本综述 Survey on Adversarial Sample of Deep Learning Towards Natural Language Processing 计算机科学, 2021, 48(1): 258-267. https://doi.org/10.11896/jsjkx.200500078 |
[11] | 田野, 寿黎但, 陈珂, 骆歆远, 陈刚. 基于字段嵌入的数据库自然语言查询接口 Natural Language Interface for Databases with Content-based Table Column Embeddings 计算机科学, 2020, 47(9): 60-66. https://doi.org/10.11896/jsjkx.190800138 |
[12] | 陆龙龙, 陈统, 潘敏学, 张天. CodeSearcher:基于自然语言功能描述的代码查询 CodeSearcher:Code Query Using Functional Descriptions in Natural Languages 计算机科学, 2020, 47(9): 1-9. https://doi.org/10.11896/jsjkx.191200170 |
[13] | 张浩洋, 周良. 改进的GHSOM算法在民航航空法规知识地图构建中的应用 Application of Improved GHSOM Algorithm in Civil Aviation Regulation Knowledge Map Construction 计算机科学, 2020, 47(6A): 429-435. https://doi.org/10.11896/JsJkx.190700161 |
[14] | 张迎, 张宜飞, 王中卿, 王红玲. 基于主次关系特征的自动文摘方法 Automatic Summarization Method Based on Primary and Secondary Relation Feature 计算机科学, 2020, 47(6A): 6-11. https://doi.org/10.11896/JsJkx.191000007 |
[15] | 吴小坤, 赵甜芳. 自然语言处理技术在社会传播学中的应用研究和前景展望 Application of Natural Language Processing in Social Communication:A Review and Future Perspectives 计算机科学, 2020, 47(6): 184-193. https://doi.org/10.11896/jsjkx.191200151 |
|