计算机科学 ›› 2026, Vol. 53 ›› Issue (5): 299-308.doi: 10.11896/jsjkx.250600023
刘旭凯1, 刘洋1,2, 黄浩桢3
LIU Xukai1, LIU Yang1,2, HUANG Haozhen3
摘要: 近年来,大语言模型发展迅速,在多个自然语言处理任务上展现了出色的能力,也为智慧司法领域提供了强大的技术支持。对此,结合模型预训练和微调技术,构建了MIPLD(Micro-model Intellectual Property Learning Direction)知识产权数据库,并根据法学学科特性和知识产权门类特性,搭建分布预训练的算法框架。随后基于MIPLD数据库,构建了知识产权领域多方向的高质量微调问答对,实现了在小参数下拥有高能力密度的知识产权问题分析大语言模型EC-MIIP,该模型适用于知识产权的学理问答、行为性质分析、司法案例解析以及法律文书撰写等任务。实验结果表明,与Qwen3-4B、Qwen3全参和Deepseek-R1全参模型相比,EC-MIIP的性能更优。该研究不仅探索了大语言模型在知识产权领域中的应用,还为实现小参数模型在司法领域的适用提供了参考。
中图分类号:
| [1]LI Y,YAN Y,TONG Z,et al.Efficient fine-tuning of small-parameter large language models for biomedical bilingual multi-task applications[J].Applied Soft Computing,2025,175:113084. [2]NGUYEN C,SHEN X,APONTE R,et al.Asurvey of small language models[J].arXiv:2410.2001,2024. [3]DEEPSEEK-AI,GUO D,YANG D,et al.DeepSeek-R1:In-centivizingreasoning capability in LLMs via reinforcement learning[J].arXiv:2501.12948,2025. [4]YANG A,YANG B,HUI B,et al.Qwen2technical report[J].arXiv:2407.10671,2024. [5]YANG A,LI A,YANG B,et al.Qwen technical report[J].ar-Xiv:2505.09388,2025. [6]COLOMBO P,PIRES T,BOUDIAF M,et al.SaulLM-7B:Apioneeringlarge language model for law[J].arXiv:2403.03883,2024. [7]CUI J,NING M,LI Z,et al.Chatlaw:Amulti-agent collaborative legal assistant with knowledge graph enhanced mixture-of-experts large language model[J].arXiv:2306.16092,2023. [8]LEE J S.Evaluating generative patent language models[J].World Patent Information,2023,72:102173. [9]NI S,TAN M,BAI Y,et al.MoZIP:A multilingual benchmark to evaluate large language models in intellectual property[J].arXiv:2402.16389,2024. [10]LEE J S.Instruct Patent GPT:training patent language models to follow instructions with human feedback[J].Artificial Intelligence and Law,2024,33:739-782. [11]BAI Z,ZHANG R,CHEN L,et al.Patent GPT:a large language model for intellectual property[J].arXiv:2404.18255,2024. [12]LV K,YANG Y,LIU T,et al.Full Parameter Fine-tuning for Large Language Models with Limited Resources[C]//Procee-dings of the 62nd Annual Meeting of the Association for Computational Linguistics.2024:8187-8198. [13]HAN Z,GAO C,LIU J,et al.Parameter-efficient fine-tuning for large models:A Comprehensive Survey[J].arXiv:2403.14608,2024. [14]HU E,SHEN Y,WALLIS P,et al.LoRA:Low-rank adaptation of large language models[C]//ICLR.2022. [15]DETTMERS T,PAGNONI A,HOLTZMAN A,et al.QLoRA:Efficientfinetuning of quantized LLMs[J].Advances in Neural Information Processing Systems,2023,36:10088-10115. [16]LAMY-POIRIER J.Layered gradient accumulation and modular pipeline parallelism:fast and efficient training of large language models[J].arXiv:2106.02679,2021. [17]JOHNSON O,XINGYING C,KHAW K,et al.Ps-CALR:Perio-dic-shift cosine annealing learning rate for deep neural networks[J].IEEE Access,2023,11:139171-139186,. [18]DONG Q,LI L,DAI D,Asurvey on in-context learning[J].ar-Xiv:2301.00234,2024. [19]GUNASERKAR S,ZHANG Y,ANEJA J,et al.Textbooksare all you need[J].arXiv:2306.11644,2023. [20]LZACARD G,LEWIS P,LOMELI M,et al.Atlas:Few-shotlearning with retrieval augmented language models[J].Journal of Machine Learning Research,2023,24(251):1-43. [21]WEI J,WANG X,SCHUURMANS D,et al.Chain-of-thoughtprompting elicits reasoning in large language models[C]//NeurlPS 2022.2022 [22]FEI Z,SGEN X,ZHU D,et al.Lawbench:Benchmarking legal knowledge of large language models[J].arXiv:2309.16289,2023. [23]DAI Y,FENG D,HUANG J,et al.Laiw:A chinese legal large language models benchmark[J].arXiv:2310.05620,2023. [24]LI H,CHEN Y,AI Q,et al.LexEval:Acomprehensive Chinese legal benchmark for evaluating large language models[J].arXiv:2409.20288,2024. [25]CHALKIDIS I,FERGADIOTIS M,PRODROMOS M P,et al.LEGAL-BERT:Themuppets straight out of Law School[C]//Findings of the Association for Computational Linguistics(EMNLP2020).2020:2898-2904. [26]ZHENG L,GUHA N,ANDERSON B,et al.When does pre-training help? Assessing self-supervised learning for law and the CaseHOLD dataset of 53 000+Legal Holdings[C]//Procee-dings of the Eighteenth International Conference on Artificial Intelligence and Law(ICAIL’21).2021:159-168. |
|
||