计算机科学 ›› 2018, Vol. 45 ›› Issue (9): 237-242.doi: 10.11896/j.issn.1002-137X.2018.09.039
杨开平, 李明奇, 覃思义
YANG Kai-ping, LI Ming-qi, QIN Si-yi
摘要: 随着社会与互联网的不断发展,公民的法律意识越来越强,传统的律师业务流程与发展模式已经不能满足客户和行业的需求。根据已有的专业律师咨询回复规范,文中建立了判定回复信息质量优劣的准则,并从5个方面对回复文本进行了量化描述。利用word2vec算法对律师问答系统的历史数据库进行训练,得到该数据库的词向量和对应词语的相似度。基于词语相似度和文本长度,构造文本间相似度。由此,建立了律师回复信息质量评价模型。对数据库中各个律师的问答文本进行了量化分析,结果表明,该模型能够很好地评估律师的回复质量。
中图分类号:
[1]TANG Y,LI F,HUANG M,et al.Summarizing similar questions for chinese community question answering portals[C]∥2010 Second International Conference on Information Technology and Computer Science(ITCS).IEEE,2010:36-39. [2]CAO Z J,LI Z S,LIU C T.Study of Question Analysis in Question-Answering System[J].Computer Science,2005,32(11):158-160.(in Chinese) 曹志娟,李祖枢,刘朝涛.自动问答系统中的问题理解研究[J].计算机科学,2005,32(11):158-160. [3]BRIN S,PAGE L.Reprint of:The anatomy of a large-scale hypertextual web search engine[J].Computer Networks,2012,56(18):3825-3833. [4]ZHENG Z.Answer Bus question answering system[C]∥Pro-ceedings of the Second International Conference on Human Language Technology Research.Morgan Kaufmann Publishers Inc.,2002:399-404. [5]CONG G,WANG L,LIN C Y,et al.Finding question-answer pairs from online forums[C]∥Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Deve-lopment in Information Retrieval.ACM,2008:467-474. [6]SU Q,PAVLOV D,CHOW J H,et al.Internet-scale collection of human-reviewed data[C]∥Proceedings of the 16th International Conference on World Wide Web.ACM,2007:231-240. [7]KONG W Z,LIU Y Q,ZHANG M,et al.Answer Quality Analysis on Community Question Answering[J].Journal of Chinese Information Processing,2011,25(1):3-9.(in Chinese) 孔维泽,刘奕群,张敏,等.问答社区中回答质量的评价方法研究[J].中文信息学报,2011,25(1):3-9. [8]BLOOMA M J,CHUA A Y K,GOH D H L.A predictive framework for retrieving the best answer[C]∥Proceedings of the 2008 ACM Symposium on Applied Computing.ACM,2008:1107-1111. [9]BIAN J,LIU Y,AGICHTEIN E,et al.Finding the right facts in the crowd:factoid question answering over social media[C]∥Proceedings of the 17th International Conference on World Wide Web.ACM,2008:467-476. [10]TANG M,ZHU L,ZOU X C.Document Vector Representation Based on Word2Vec[J].Computer Science,2016,43(6):214-217,269.(in Chinese) 唐明,朱磊,邹显春.基于 Word2Vec 的一种文档向量表示[J].计算机科学,2016,43(6):214-217,269. [11]MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space[J].arXiv preprint arXiv:1301.3781,2013. [12]MIKOLOV T,SUTSKEVER I,CHEN K,et al.Distributed representations of words and phrases and their compositionality[C]∥Advances in Neural Information Processing Systems.2013:3111-3119. [13]LI Y P,JIN C,JI J C.A Keyword Extraction Algorithm Based on Word2vec[J].E-science Technology & Application,2015,6(4):54-59.(in Chinese) 李跃鹏,金翠,及俊川.基于 word2vec 的关键词提取算法[J].科研信息化技术与应用,2015,6(4):54-59. [14]LIAN X.Research on some key questions in community question answering system [D].Tianjin:Nankai University,2014.(in Chinese) 廉鑫.社区问答系统中若干关键问题研究 [D].天津:南开大学,2014. |
[1] | 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨. 基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨 Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism 计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224 |
[2] | 熊罗庚, 郑尚, 邹海涛, 于化龙, 高尚. 融合双向门控循环单元和注意力机制的软件自承认技术债识别方法 Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism 计算机科学, 2022, 49(7): 212-219. https://doi.org/10.11896/jsjkx.210500075 |
[3] | 王胜, 张仰森, 陈若愚, 向尕. 基于细粒度差异特征的文本匹配方法 Text Matching Method Based on Fine-grained Difference Features 计算机科学, 2021, 48(8): 60-65. https://doi.org/10.11896/jsjkx.200700008 |
[4] | 张浩洋, 周良. 改进的GHSOM算法在民航航空法规知识地图构建中的应用 Application of Improved GHSOM Algorithm in Civil Aviation Regulation Knowledge Map Construction 计算机科学, 2020, 47(6A): 429-435. https://doi.org/10.11896/JsJkx.190700161 |
[5] | 张云帆,周宇,黄志球. 基于语义相似度的API使用模式推荐 Semantic Similarity Based API Usage Pattern Recommendation 计算机科学, 2020, 47(3): 34-40. https://doi.org/10.11896/jsjkx.190300053 |
[6] | 刘宇东, 孙豪, 蒋运承. 融合内容相似度与多特征计算的个性化微博推荐模型 Personalized Microblog Recommendation Model Integrating Content Similarity and Multi-feature Computing 计算机科学, 2020, 47(10): 97-101. https://doi.org/10.11896/jsjkx.190700073 |
[7] | 许飞翔,叶霞,李琳琳,曹军博,王馨. 基于SA-BP算法的本体概念语义相似度综合计算 Comprehensive Calculation of Semantic Similarity of Ontology Concept Based on SA-BP Algorithm 计算机科学, 2020, 47(1): 199-204. https://doi.org/10.11896/jsjkx.181202351 |
[8] | 邓珍荣, 张宝军, 蒋周琴, 黄文明. 融合word2vec和注意力机制的图像描述模型 Image Description Model Fusing Word2vec and Attention Mechanism 计算机科学, 2019, 46(4): 268-273. https://doi.org/10.11896/j.issn.1002-137X.2019.04.042 |
[9] | 唐家琪, 吴璟莉, 廖元秀, 王金艳. 基于双加权投票的蛋白质功能预测 Prediction of Protein Functions Based on Bi-weighted Vote 计算机科学, 2019, 46(4): 222-227. https://doi.org/10.11896/j.issn.1002-137X.2019.04.035 |
[10] | 杨进才, 杨璐璐, 汪燕燕, 沈显君. 基于神经网络的关系词非充盈态复句层次的自动识别 Hierarchy Division of Compound Sentence with Non-saturated Relation Word via Neural Network 计算机科学, 2019, 46(11A): 103-107. |
[11] | 温雯, 林泽钿, 蔡瑞初, 郝志峰, 王丽娟. 基于嵌入学习的用户动态偏好预测 Predicting User’s Dynamic Preference Based on Embedding Learning 计算机科学, 2019, 46(10): 32-38. https://doi.org/10.11896/jsjkx.180901801 |
[12] | 孙昭颖,刘功申. 面向短文本的神经网络聚类算法研究 Research on Neural Network Clustering Algorithm for Short Text 计算机科学, 2018, 45(6A): 392-395. |
[13] | 程宏兵, 王珂, 李兵, 钱漫匀. 一种高效的社交网络朋友推荐方案 Efficient Friend Recommendation Scheme for Social Networks 计算机科学, 2018, 45(6A): 433-436. |
[14] | 李颖,郝晓燕,王勇. 中文开放式多元实体关系抽取 N-ary Chinese Open Entity-relation Extraction 计算机科学, 2017, 44(Z6): 80-83. https://doi.org/10.11896/j.issn.1002-137X.2017.6A.016 |
[15] | 李晓,解辉,李立杰. 基于Word2vec的句子语义相似度计算研究 Research on Sentence Semantic Similarity Calculation Based on Word2vec 计算机科学, 2017, 44(9): 256-260. https://doi.org/10.11896/j.issn.1002-137X.2017.09.048 |
|