Computer Science ›› 2025, Vol. 52 ›› Issue (6A): 240900022-4.doi: 10.11896/jsjkx.240900022

• Large Language Model Technology and Its Application • Previous Articles     Next Articles

Large Model Driven AI Application Service Platform

LIANG Binghao, ZHANG Chuangang, YUAN Mingming   

  1. Inspur Communication Information System Co.,Ltd.,Jinan 250013,China
  • Online:2025-06-16 Published:2025-06-12
  • About author:LIANG Binghao,born in 1991,Ph.D.His main research interests include artificial intelligence and computing power network application.
  • Supported by:
    Taishan Industrial Leading Talent Project(tscx202312006) and Shandong Postdoctoral Innovation Project(SDCX-ZG-202400307).

Abstract: With the continuous advancement of the transformation of enterprise data intelligence,artificial intelligence technology has begun to be applied to various aspects of enterprise internal management,operation analysis and production efficiency improvement.However,the traditional AI application research and development process involves data acquisition,data cleaning,feature extraction,algorithm modeling,and application research and development.The overall technical threshold is high,the collaboration of team members is difficult,the utilization rate of hardware resources is low,and it is difficult to support the agile landing of digital intelligent business requirements.To solve these problems,a set of AI application service platform based on pre-trained large model is proposed.The platform is mainly designed for AI application research and development and operation management,which greatly reduces the difficulty of team collaboration and asset management.For the core processes in the preparation state,design state and running state,the pre-trained large model and low-code technology are introduced.By constructing the labeled large model,the test large model and the operation large model,the research and development efficiency of AI application is improved.Meanwhile,the real-time analysis of operational data is realized,the user experience is guaranteed and the utilization rate of hardware resources is greatly improved.

Key words: Artificial intelligence application, Research and development services, Intelligent annotation, Automatic test, Operation service

CLC Number: 

  • TP311
[1]BERNSTEIN D.Containers and Cloud:From LXC to Docker to Kubernetes[J].Cloud Computing,IEEE,2014,1(3):81-84.
[2]NVIDIA.NVIDIA Virtual GPU Software Documentation v18.0[EB/OL].https://docs.nvidia.com/vgpu/18.0/index.html.
[3]GU J,SONG S,LI Y,et al.GaiaGPU:Sharing GPUs in Container Clouds[C]//2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications,Ubiquitous Computing & Communications,Big Data & Cloud Computing,Social Computing & Networking,Sustainable Computing & Communications.2018,469-476.
[4]TAN Z,LI D W,WANG S,et al.Large Language Models for Data Annotation:A Survey [J].arXiv:2402.13446,2024.
[5]JING Z,SU Y Y,HAN Y K,et al.When Large Language Mo-dels Meet Vector Databases:A Survey[J].arXiv:2402.01763,2024.
[6]PASZKE A,GROSS S,MASSAF,et al.PyTorch:An Imperative Style,High-Performance Deep Learning Library[J].arXiv:2209.15428,2019.
[7]ABADI M,AGARWAL A,BARHAM P,et al.TensorFlow:Large-Scale Machine Learning on Heterogeneous Distributed Systems [J].arXiv:1605.08695,2016.
[8]ZHOU Y,YANG K.Exploring TensorRT to Improve Real-Time Inference for Deep Learning[C]//2022,IEEE 24th Int Conf on High Performance Computing & Communications;8th Int Conf on Data Science & Systems;20th Int Conf on Smart City;8th Int Conf on Dependability in Sensor,Cloud & Big Data Systems & Application(HPCC/DSS/SmartCity/DependSys).2022.
[9]KWON W,LI Z H,ZHUANG S Y,et al.Efficient MemoryManagement for Large Language Model Serving with PagedAttention[J].arXiv:2309.06180,2023.
[10]BEN W.MLflow:A Tool for Managing the Machine Learning Lifecycle[EB/OL].https://mlflow.org/docs/latest/index.html.
[11]YANG A,YANG B S,HUI B Y,et al.Qwen2 Technical Report[J].arXiv:2407.10671,2024.
[12]CHENG T H,SONG L,GE Y X,et al.YOLO-World:Real-Time Open-Vocabulary Object Detection[J].arXiv:2401.17270v2,2024.
[13]GAO L,MADAAN A,ZHOU S,et al.PAL:Program-aidedLanguage Models[J].arXiv:2211.10435,2022.
[14]YANG L,YANG C,GAO S T,et al.An Empirical Study of Unit Test Generation with Large Language Models[J].arXiv:2406.18181v1,2024.
[15]NI C,WANG X Y,CHEN L S,et al.CasModaTest:A Cascaded and Model-agnostic Self-directed Framework for Unit Test Gene-ration[J].arXiv:2406.15743,2024.
[16]XIA Y H,CHEN Y Y,SHI T Y,et al.AICoderEval:Improving AI Domain Code Generation of Large Language Models[J].ar-Xiv:2406.04712,2024.
[17]MENG W,LIU Y,ZHU Y,et al.LogAnomaly:UnsupervisedDetection of Sequential and Quantitative Anomalies in Unstructured Logs[C]//Twenty-Eighth International Joint Conference on Artificial Intelligence(IJCAI-19).2019.
[18]TAO S,MENG W,CHENG Y,et al.LogStamp:Automatic Online Log Parsing Based on Sequence Labelling[J].Performance Evaluation Review,2022(4):49.
[19]ZHOU X H,LI G L,LIU Z Y.LLM As DBA [J].arXiv:2308.05481,2024.
[1] SHAO Wenxin, YANG Zhibin, LI Wei, ZHOU Yong. Natural Language Requirements Based Approach for Automatic Test Cases Generation of SCADE Models [J]. Computer Science, 2024, 51(7): 29-39.
[2] HUANG Shuang-qin, LIU Ying-bo, HUANG Xiang-sheng. Research on Automatic Testing Technology of Model Driven Development Tools [J]. Computer Science, 2021, 48(6A): 568-571.
[3] HUANG Zhao,HUANG Shu-guang,DENG Zhao-kun,HUANG Hui. Automatic Vulnerability Detection and Test Cases Generation Method for Vulnerabilities Caused by SEH [J]. Computer Science, 2019, 46(7): 133-138.
[4] HU Jing-hui, WU Ji, MENG Han, LIU Chao, YANG Hai-yan. Signal Model of Automatic Testing Technology for Airborne Equipment [J]. Computer Science, 2018, 45(9): 94-98.
[5] GAO Xue-di, ZHOU Li-juan, ZHANG Shu-dong and LIU Hao-ming. Research on Test Data Automatic Generation Based on Improved Genetic Algorithm [J]. Computer Science, 2017, 44(3): 209-214.
[6] JIAO Chong-yang, ZHOU Qing-lei and ZHANG Wen-ning. MPSO and Its Application in Test Data Automatic Generation [J]. Computer Science, 2017, 44(12): 249-254.
[7] JIANG Peng,CHEN Xin and LI Xuan-dong. Method to Automatic Testcase Generation toward Safety Critical Scenarios of Cyber-physical Systems [J]. Computer Science, 2014, 41(11): 124-127.
[8] . Automatic Test Data Generation Tool of Dynamic Variable Parameters Based on Genetic Algorithm [J]. Computer Science, 2012, 39(5): 124-127.
[9] . Automatic Test for Web Services [J]. Computer Science, 2012, 39(2): 170-174.
[10] HAN Wei, JIANG Chuan (Microelectronic Center, CSIC Wuhan Digital Engineering Institute,Wuhan 430074,China). [J]. Computer Science, 2009, 36(4): 289-292.
[11] . [J]. Computer Science, 2009, 36(1): 263-266.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!