计算机科学 ›› 2026, Vol. 53 ›› Issue (5): 286-298.doi: 10.11896/jsjkx.251000076
韩林睿1,2, 郑日1,2, 丛颖男3
HAN Linrui1,2, ZHENG Ri1,2, CONG Yingnan3
摘要: 刑期预测是法律人工智能赋能刑事司法的核心任务之一,对克服量刑偏见、提升司法质效与保障公平正义具有重要意义。针对传统机器学习模型预测准确率低、可解释性不足的瓶颈问题,提出一种基于量刑规则知识图谱驱动的可解释刑期预测方法。该方法创新性地设计了知识图谱与大语言模型融合架构,其技术路线为:首先,采用BERT-BiLSTM-CRF模型自顶向下构建结构化量刑规则知识图谱;其次,基于《量刑指导意见》提炼量刑思维链,利用图谱结构化数据设计提示工程,对LLaMA-3-8B-Chinese-Chat,Qwen-2-7B,Baichuan2-7B-Chat,GLM-4-9B-Chat大语言模型进行监督式指令微调,引导其学习规范化量刑推理过程;最后,在预测阶段,通过图谱实体识别与检索机制对微调后模型实现检索增强生成,输出刑期预测结果及符合量刑规则的步骤化分析。实验表明:1)BERT-BiLSTM-CRF模型在实体关系抽取任务上F1值达0.953 8,优于传统模型;2)GLM-4-9B-Chat模型在测试集生成质量与下游任务综合表现上最优;3)最终刑期预测模型的F1值达0.627 6,显著优于MTL-Fusion,Lawformer及BERT等基线模型,同时,生成遵循“确定量刑起点-确定基准刑-调节基准刑-确定宣告刑”规范化量刑逻辑的说明文本,显著提升了用户对结果的理解与接受度;4)消融实验与人工评测共同验证模型在量刑准确性、规则援引精准度、说理逻辑性与流畅性及量刑步骤规范性方面均显著优于基线。该研究为法律人工智能提供了知识驱动与数据驱动深度融合的新范式。
中图分类号:
| [1]WANG Y.Substantive Approaches to Sentencing Standardization in the AI Era[J].Law and Economy,2025(3):148-164. [2]ZHAN G C,MENG Y.Bridging the Divide:Technical Research and Application on Legal Judgment Prediction[EB/OL].https://doi.org/10.1007/s10506-025-09473-7. [3]FAN A M,WANG Y C.Multi-Task Intelligent Legal Judgment Method Based on BERT Model[J].Microelectronics & Compu-ter,2022,39(9):107-114. [4]XIAO C,HU X,LIU Z,et al.Lawformer:A Pre-trained Lan-guage Model for Chinese Legal Long Documents[J].AI Open,2021,2:79-84. [5]WEI B.Judicial Applications and Regulation of Large LegalLanguage Models[J].Oriental Law,2024(5):57-73. [6]WANG L S.On Predictive Justice[J].Social Science in China,2024(6):80-99. [7]HUANG B,WU S A,WANG W G,et al.KG-LLM-MCom:A Survey on Integration of Knowledge Graph and Large Language Model[J].Journal of Wuhan University(Natural Science Edition),2024,70(4):397-412. [8]LIU S,YU S,ZHANG C,et al.Deep Learning-Based LegalJudgment Prediction:A Research Review[J].Computer Engineering and Applications,2026,62(1):68-86. [9]WEI J,WANG X,SCHUURMANS D,et al.Chain-of-Thought Prompting Elicits Reasoning in Large Language Models[J].Advances in Neural Information Processing Systems,2022,35:24824-24837. [10]LEWIS P,PEREZ E,PIKTUS A,et al.Retrieval-AugmentedGeneration for Knowledge-Intensive NLP Tasks[J].Advances in Neural Information Processing Systems,2020,33:9459-9474. [11]CUI J,SHEN X,WEN S.A Survey on Legal Judgment Prediction:Datasets,Metrics,Models and Challenges[J].IEEE Access,2023,11:102050-102071. [12]KORT F.Predicting Supreme Court Decisions Mathematically:A Quantitative Analysis of the “Right to Counsel” Cases[J].American Political Science Review,1957,51(1):1-12. [13]SERGOT M J,SADRI F,KOWALSKI R A,et al.The British Nationality Act as a logic program[J].Communications of the ACM,1986,29(5):370-386. [14]ZHANG H,PAN B,TAN H,et al.Judgment Prediction Based on Legal Judgment Documents[J].Big Data Research,2021,7(5):164-175. [15]KATZ D M,BOMMARITO II M J,BLACKMAN J.Predicting the Behavior of the Supreme Court of the United States:A Ge-neral Approach[J].arXiv:1407.6333,2014. [16]SULEA O M,ZAMPIERI M,MALMASI S,et al.Exploring the use of Text Classification in the Legal Domain[J].arXiv:1710.09306,2017. [17]LUO B,FENG Y,XU J,et al.Learning to Predict Charges for Criminal Cases with Legal Basis[J].arXiv:1707.09168,2017. [18]LIU Z L,ZHANG M S,ZHENG R R,et al.Multi-task Learning Model for Legal Judgment Predictions with Charge Keywords[J].Journal of Tsinghua University(Science and Technology),2019,59(7):497-504. [19]ZHANG H,ZHENG W,DOU Z,et al.Integrating Multi-Layer Structure Information of Law for Legal Judgement Prediction[J].Computer Engineering and Applications,2023,59(3):253-263. [20]YIN M,LI X,LI C,et al.An Analysis Method of Influencing Factors of Legal Judgment Prediction[J].Computer and Mo-dernization,2021(4):37-41. [21]CHEN Y G,LIU H,LI C,et al.Knowledge Graph Construction Techniques Based on Criminal Cases[J].Journal of Zhengzhou University(Natural Science Edition),2019,51(3):85-90. [22]WANG N,LIU W,LAN J.Construction and Completion of Legal Knowledge Graph Based on Court Judgment Documents[J].Journal of Zhengzhou University(Natural Science Edition),2021,53(3):23-29. [23]HONG W,HU Z,WENG Y,et al.Automated KnowledgeGraph Construction for Judicial Case Facts[J].Journal of Chinese Information Processing,2020,34(1):34-44. [24]LI Z Y,ZHANG Y J,HUANG Y X,et al.Legal Judgment Prediction Using Case Feature Enhancement Based on Knowledge Graph[J].Application Research of Computers,2024,41(7):2153-2159. [25]HONG L X.Misunderstandings and Correct Ways:the Dilem-ma,Causes and Improvements of Algorithm Problems in Legal Artificial Intelligence[J].Journal of Sichuan Normal University(Social Sciences Edition),2020,47(1):58-70. [26]CONG Y N,HAN L R,MA J Y,et al.Research on Intelligent Judgment of Criminal Cases Based on Large Language Models[J].Computer Science,2025,52(5):58-69. [27]CHEN X H,CHENG J L,HU D B,et al.Application of Artificial Intelligence Large Language Model for Smart Environmental Judicial Adjudication[J].Strategic Study of CAE,2024,26(1):190-201. [28]WANG D S.Multi-defendant Legal Judgment Prediction withMulti-turn LLM and Criminal Knowledge Graph[J].Computer Science,2025,52(8):308-316. [29]JING H F,WANG D S.A Sentencing Normativity Prediction Method Based on Multi-Source Knowledge Aggregation[EB/OL].https://doi.org/10.13209/j.0479-8023.2025.098. [30]VALVODA J,COTTERELL R.Towards Explainability in Legal Outcome Prediction Models[J].arXiv:2403.16852,2024. [31]PENG X,CHEN L.Athena:Retrieval-augmented Legal Judgment Prediction with Large Language Models[J].arXiv:2410.11195,2024. [32]WANG X,ZHANG X,HOO V,et al.LegalReasoner:A Multi-Stage Framework for Legal Judgment Prediction via Large Language Models and Knowledge Integration[J].IEEE Access,2024,12:166843-166854. [33]HU E J,SHEN Y,WALLIS P,et al.LoRA:Low-Rank Adaptation of Large Language Models[J].arXiv:2106.09685,2021. [34]DEVLIN J,CHANG M,LEE K,et al.BERT:Pre-training ofDeep Bidirectional Transformers for Language Understanding[C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.Minneapolis,Minnesota:Association for Computational Linguistics,2019:4171-4186. [35]VASWANI A,SHAZEER N,PARMAR N,et al.AttentionIs All You Need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems.2017:6000-6010. [36]TAM S,SAID R B,TANRÖIVER Ö Ö.A ConvBiLSTM Deep Learning Model-based Approach for Twitter Sentiment Classification[J].IEEE Access,2021,9:41283-41293. [37]KUMAR H,AGARWAL A,DASGUPTA R,et al.Dialogue Act Sequence Labeling Using Hierarchical Encoder With CRF[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2018:3440-3447. [38]LI X,ZHANG F,LI Q,et al.Exploiting a Knowledge Hypergraph for Modeling Multi-nary Relations in Fault Diagnosis Reports[J].Advanced Engineering Informatics,2023,57:102084. [39]XIE T,YANG J A,LIU H.Chinese Entity Recognition Based on BERT-BiLSTM-CRF Model[J].Computer Systems & Applications,2020,29(7):48-55. [40]ZHANG F,YANG L Y,LI J W,et al.An Overview of EntityAlignment Methods[J].Chinese Journal of Computers,2022,45(6):1195-1225. [41]PAPINENI K,ROUKOS S,WARD T,et al.BLEU:A Method for Automatic Evaluation of Machine Translation[C]//Procee-dings of the 40th Annual Meeting of the Association for Computational Linguistics.ACL,2002:311-318. [42]LIN C Y.ROUGE:A Package for Automatic Evaluation ofSummaries[C]//Text Summarization Branches Out.Association for Computational Linguistics,2004:74-81. [43]XU L,LI A,ZHU L,et al.SuperCLUE:A Comprehensive Chinese Large Language Model Benchmark[J].arXiv:2307.15020,2023. [44]ZHAO X,ZHANG H,WANG D S.Review of Large Language Model Evaluation Studies:Current Status,Applications,Challenges,and Trends[J].Journal of the China Society for Scientific and Technical Information,2025,44(8):1058-1074. [45]ZHAO Y,HE J W,ZHU S C,et al.Security of Large Language Models:Current Status and Challenges[J].Computer Science,2024,51(1):68-71. [46]HAO Y,DONG L,WEI F,et al.Self-Attention Attribution:Interpreting Information Interactions inside Transformer[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021,35(14):12963-12971. [47]VELIČKOVIĆ P,CUCURULL G,CASANOVA A,et al.Graph Attention Networks[J].arXiv:1710.10903,2017. [48]JIA Y.On Digital Courts[J].Chinese Journal of Law,2024,46(4):3-20. |
|
||