计算机科学 ›› 2021, Vol. 48 ›› Issue (3): 227-232.doi: 10.11896/jsjkx.200700056

• 人工智能 • 上一篇    下一篇

基于过程监督的序列多任务法律判决预测方法

张春云1, 曲浩2, 崔超然1, 孙皓亮2, 尹义龙2   

  1. 1 山东财经大学计算机科学与技术学院 济南250014
    2 山东大学软件学院 济南250101
  • 收稿日期:2020-07-09 修回日期:2020-09-18 出版日期:2021-03-15 发布日期:2021-03-05
  • 通讯作者: 尹义龙(ylyin@su.edu.cn)
  • 作者简介:zhangchunyunl009@126.com
  • 基金资助:
    国家自然科学基金项目(61703234);国家重点研发计划(2018YFC0830102)

Process Supervision Based Sequence Multi-task Method for Legal Judgement Prediction

ZHANG Chun-yun1, QU Hao2, CUI Chao-ran1, SUN Hao-liang2, YIN Yi-long2   

  1. 1 School of Computer Science and Technology,Shandong University of Finance and Economics,Jinan 250014,China
    2 School of Software,Shandong University,Jinan 250101,China
  • Received:2020-07-09 Revised:2020-09-18 Online:2021-03-15 Published:2021-03-05
  • About author:ZHANG Chun-yun,born in 1986,Ph.D,assistant professor,is a member of China Computer Federation.Her main research interests include machine lear-ning,natural language processing and information extraction.
    YIN Yi-long,born in 1972,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.His main research interests include machine learning,data mining and biometrics.
  • Supported by:
    National Natural Science Foundation of China (61703234) and National Key R & D Plan Project (2018YFC0830102).

摘要: 法律判决预测是人工智能技术在法律领域的应用,因此对法律判决预测方法的研究对于实现智慧司法具有重要的理论价值和实际意义。传统的法律判决预测方法大都是只进行单一任务的预测或仅基于参数共享的多任务预测,并未考虑各子任务之间的序列依存关系,因此预测性能难以得到进一步的提升。文中提出了一个端到端的基于过程监督的序列多任务法律判决预测模型,在建模各子任务之间的依存关系时,通过引入过程监督来确保依赖信息的准确性,从而提升序列子任务的预测性能。将所提模型应用到CAIL2018数据集上,取得了较好的分类效果,平均分类准确率比现有的state-of-the-art方法的准确率提升了2%。

关键词: 多任务学习, 法律判决预测, 过程监督, 深度学习

Abstract: Legal judgment prediction is an application of artificial intelligence technology in legal field.Hence,the research on the legal judgment prediction method has important theoretical value and practical significance for the realization of intelligent justice.Traditional legal judgment prediction methods only make single task prediction or just use multi-task prediction based on parameter sharing,without considering the sequence dependence among subtasks,so the prediction performance is difficult to be further improved.This paper proposes a process supervision based sequence multi-task framework (PS-SMTL) by encoding sequence dependency of subtasks in legal judgement.It is an end to end legal judgement prediction method without any external features.By introducing process supervision,the proposed model ensures the accuracy of the obtained dependent prior information from advance tasks.The proposed model is applied to CAIL2018 dataset and a good classification result is achieved.The average classification accuracy is 2% higher than that of the existing state-of-the-art method.

Key words: Deep learning, Legal judgement prediction, Multi-task learning, Process supervision

中图分类号: 

  • TP391
[1]FRED K.Predicting Supreme Court decisions mathematically:Aquantitative analysis of the “right to counsel” cases [R].American Political Science Review,1957.
[2]STUART S.Applying correlation analysis to case prediction[R].Texas Law Review,1963.
[3]KEOWN R.Mathematical Models For Legal Prediction[J].Computer Journal,1980(1):829-831.
[4]JEFFREY A.Predicting Supreme Court cases probabilistically:The search and seizure cases(1962-1981)[R].American Political Science Review,1984,78(4):891-900.
[5]BENJAMIN E L,TOM S C.The Supreme Court’s many median justices [J].American Political Science Review,2012,106(4):847-866.
[6]CHEN H J,CAI D,DAI W,et al.Charge-Based Prison TermPrediction with Deep Gating Network[C]//Proceedings of EMNLP-IJCNLP.2019:6362-6367.
[7]YE H,JIANG X,LUO Z C,et al.Interpretable charge predictions for criminal cases:Learning to generate court views from fact descriptions[C]//Proceedings of NAACL.2018:1854-1864.
[8]JIANG X,YE H,LUO Z C,et al.Interpretable Rationale Augmented Charge Prediction System[C]//Proceedings of COLING.2018:146-151.
[9]LIU C L,CHENG T C,JIM H H.Case instance generation and refinement for case-based criminal summary judgments in Chinese [J].Journal of Information Science & Engineering,2004,20(4):783-800.
[10]LIU C L,CHWEN D H.Exploring phrase-based classification of judicial documents for criminal charges in Chinese[C]//Proceedings of ISMIS.2006,681-690.
[11]DANNIEL M K,MICHAEL J B II,JOSH B.A general ap-proach for predicting the behavior of the supreme court of the united states [J].Plos One,2017,12(4).
[12]LUO B F,FENG Y S,XU J B,et al.Learning to predict charges for criminal cases with legal basis[C]//Proceedings of EMNLP.2017:1-10.
[13]ZHONG H X,GUO Z P,TU C C,et al.Legal judgment prediction via topological learning[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing.2018:3540-3549.
[14]BAHARUM B,LAM H L,KHAIRULLAH K.A review of machine learning algorithms for text-documents classification [J].Journal of Advances in Information Technology,2010,1(1):4-20.
[15]YOON K.Convolutional neural networks for sentence classification[C]//Proceedings of EMNLP.2014.
[16]TANG D Y,QIN B,LIU T.Document modeling with gated recurrent neural network for sentiment classification [C]//Proceedings of EMNLP.2015:1422-1432.
[17]LIN W C,TSUNG T K,TUNG J C.Exploiting machine lear-ning models for chinese legal documents labeling,case classification,and sentencing prediction[C]//Proceedings of ROCLING.2012:140.
[18]YANG W,JIA W,ZHOU X,et al.Legal Judgment Predictionvia Multi-Perspective Bi-Feedback Network[C]//Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19).2019.
[19]NIKOLAOS A,DIMITRIOS T,DANIEL P,et al.Predicting judicial decisions of the European court of human rights:A natural language processing perspective [J].PeerJ Computer Science,2016(2):e93.
[20]OCTAVIA M S,MARCOS Z,MIHAELA V,et al.Exploring the use of text classification in the legal domain[C]//Procee-dings of ASAIL Workshop.2017.
[21]HU Z K,LI X,TU C C,et al.Few-shot charge prediction with discriminative legal attributes[C]//Proceedings of the 27th International Conference on Computational Linguistics.2018:487-498.
[22]RONAN C,JASON W.A unified architecture for natural language processing:Deep neural networks with multitask learning[C]//Proceedings of ICML.2008:160-167.
[23]LONG S,TU C,LIU Z,et al.Automatic Judgment Prediction via Legal Reading Comprehension [J].arXiv:1809.06537v1,2018.
[24]MAX R S M,TOMMASO B,MAXIME R,et al.Machine lear-ning for explaining and ranking the most influential matters of law[C]//Proceeding of ICAIL.2019:239-243.
[25]SANDEEP S,ADAM T,YOSHUA B,et al.Learning GeneralPurpose Distributed Sentence Representations via Large Scale Multi-task Learning[C]//Proceeding of ICLR.2018.
[26]WANG A,AMANPREET S,JULIAN M,et al.GLUE:AMulti-Task Benchmark and Analysis Platform for Natural Language Understanding[C]//Proceeding of ICLR.2019.
[27]WU Z,VALENTINI B C,WATTS O,et al.Deep neural networks employing Multi-Task Learning and stacked bottleneck features for speech synthesis[C]//IEEE International Confe-rence on Acoustics.2015:4460-4464.
[28]RAVANELLI M,ZHONG J,PASCUAL S,et al.Multi-TaskSelf-Supervised Learning for Robust Speech Recognition[C]//Proceeding of ICASSP.2020.
[29]TREVOR S,AMIR R Z,DAWN C,et al.Which Tasks Should Be Learned Together in Multi-task Learning?[C]//Proceeding of ICCV.2019.
[30]LIU S K,EDWARD J,ANDREW J D.End-to-End Multi-TaskLearning with Attention[C]//Proceeding of CVPR.2019.
[31]RONAN C,JASON W.A unified architecture for natural lan-guage processing:Deep neural networks with multitask learning[C]//Proceedings of ICML.2008:160-167.
[32]DONG D X,WU H,HE W,et al.Multi-task learning for multiple language translation[C]//Proceedings of ACL.2015:1723-1732.
[33]MINH T L,QUOC V L,ILYA S,et al.Multi-task sequence to sequence learning[C]//Proceedings of ICLR.2016.
[34]ORHAN F,KYUNGHYUN C,YOSHUA B.Multi-way,multilingual neural machine translation with a shared attention mecha-nism[C]//Proceedings of NAACL.2016:866-875.
[35]LONG D,TREVOR C,STEVEN B,et al.Low resource dependency parsing:Cross-lingual parameter sharing in a neural network parser[C]//Proceedings of ACL.2015:845-850.
[36]YANG Y X,TIMOTHY H.Deep multi-task representationlearning:A tensor factorization approach[C]//Proceedings of ICLR.2017.
[37]LIU P F,QIU X P,HUANG X J.Recurrent neural network for text classification with multi-task learning[C]//Proceedings of IJCAI.2016.
[38]SEBASTIAN R,JOACHIM B,ISABELLE,et al.Learning what to share between loosely related tasks [J].arXiv:1705.08142,2017.
[39]POORYA Z,GHOLAMREZA H.Neural Machine Translation for Bilingually Scarce Scenarios:A Deep Multi-task Learning Approach[C]//Proceeding of NAACL.2018.
[40]YANG Z C,YANG D Y,DYER C,et al,Hierarchical attention networks for document classification[C]//Proceedings of NAACL-HLT.2016:1480-1489.
[41]ZHANG S,ZHENG D Q,HU X C,et al.Bidirectional longshort-term memory networks for relation classification[C]//Proceeding of PACLIC.2015:1480-1489.
[42]DIEDERIK K,JIMMY B.Adam:A method for stochastic optimization[C]//Proceedings of ICLR.2015.
[43]NITISH S,GEOFFREY E H,ALEX K,et al.Dropout:a simple way to prevent neural networks from overfitting [J].Journal of Machine Learning Research (JMLR),2014,15(1):1929-1958.
[1] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[2] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[3] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[4] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[5] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[6] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[7] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[8] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[9] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[10] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[11] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[12] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[13] 王君锋, 刘凡, 杨赛, 吕坦悦, 陈峙宇, 许峰.
基于多源迁移学习的大坝裂缝检测
Dam Crack Detection Based on Multi-source Transfer Learning
计算机科学, 2022, 49(6A): 319-324. https://doi.org/10.11896/jsjkx.210500124
[14] 楚玉春, 龚航, 王学芳, 刘培顺.
基于YOLOv4的目标检测知识蒸馏算法研究
Study on Knowledge Distillation of Target Detection Algorithm Based on YOLOv4
计算机科学, 2022, 49(6A): 337-344. https://doi.org/10.11896/jsjkx.210600204
[15] 祝文韬, 兰先超, 罗唤霖, 岳彬, 汪洋.
改进Faster R-CNN的光学遥感飞机目标检测
Remote Sensing Aircraft Target Detection Based on Improved Faster R-CNN
计算机科学, 2022, 49(6A): 378-383. https://doi.org/10.11896/jsjkx.210300121
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!