Computer Science ›› 2023, Vol. 50 ›› Issue (11): 348-355.doi: 10.11896/jsjkx.230300171

• Information Security • Previous Articles     Next Articles

USPS:User-space Cross Protocol Proxy System for Efficient Collaboration of Computing Power Resources

XIA Jingxuan, SHEN Guowei, GUO Chun, CUI Yunhe   

  1. Engineering Research Center for Text Computing, Cognitive Intelligence, Ministry of Education, School of Computer Science, Technology, Guizhou University, Guiyang 550025, China
    State Key Laboratory of Public Big Data,School of Computer Science and Technology,Guizhou University,Guiyang 550025,China
  • Received:2023-03-21 Revised:2023-07-29 Online:2023-11-15 Published:2023-11-06
  • About author:XIA Jingxuan,born in 1998,postgra-duate.His main research interests include high performance network communication and RDMA.SHEN Guowei,born in 1986,Ph.D,professor,is a member of China Computer Federation.His main research interests include big data,network and information security.
  • Supported by:
    National Natural Science Foundation of China(62062022) and Natural Science Foundation of Guizhou Province,China([2023]011).

Abstract: With the rapid development of computing power network,computing power resources such as general computing po-wer,artificial intelligence computing power,and supercomputing are widely distributed.Collaborative service of computing power resources is a key issue in computing power network research.In the process of computing power resource collaboration,on the one hand,it faces the high concurrent requests and low latency response requirements of massive terminal computing power ser-vices,on the other hand,it is difficult to give full play to the high throughput and low latency advantages of computing power resources in data center,and then it is difficult to provide efficient computing power services for users.Aiming at the above challenges,a user-space proxy system(USPS) based on the user-space protocol stack and remote direct memory access(RDMA) techno-logy is proposed.The user space protocol stack is used to respond to client's for high concurrent computing power requests,and the high throughput and low latency services of data center computing power based on RDMA is realized under dynamic batch processing strategy coordination.In terms of communication,USPS has implemented an efficient remote procedure call(RPC) communication mechanism,which can make full use of RDMA NIC bandwidth and provide high-speed message communication.In terms of request processing,a dynamic batch processing scheduling method is proposed,which can maximize the batch processing efficiency while meeting the user's delay requirements.Experiment shows that the service response latency of USPS is only 7.8%~23.1% of that of the traditional kernel-space Nginx proxy system,and 17.3%~24.7% of that of other user-space proxy systems.The throughput is 3.4~8.9 times higher than that of the traditional kernel-space Nginx agent system,and 3.2~4.2 times higher than that of other user-space proxy systems.

Key words: Efficient collaboration of computing power resources, User-space proxy, Remote direct memory access, Data center, Batch processing scheduling

CLC Number: 

  • TP393
[1]JIA Q M,HU Y J,ZHANG H Y,et al.Research on deterministic computing power network[J].Journal on Communications,2022,43(10):55-64.
[2]ZHANG H K,YU C X,QUAN W,et al.Fundamental Research on Computing Integration Networking[J].Acta Electronica Si-nica,2022,50(12):2928-2934.
[3]CHEN X Y,ZHANG X S,XIE Z L,et al.A Computing andTransmission Integrated Optimization Method for Cloud-Edge-End Computing First System[J].Journal of Computer Research and Development,2023(4):719-734.
[4]ZHONG L J,WANG M.Blockchain-enabled Cooperative Resource Allocation Scheme for Computing First Networking[J].Journal of Computer Research and Development,2023,60(4):750-762.
[5]TENCENT CLOUD.F-stack:An high performant networkframework based on DPDK[EB/OL].http://www.f-stack.org/.
[6]INTEL.Data Plane Development Kit[EB/OL].http://dpdk.org.
[7]JEONG E Y,WOO S,JAMSHED M,et al.mtcp:a highly scala-ble user-level TCP stack for multicore systems[C]//11th USENIX Symposium on Networked Systems Design and Implementation(NSDI 14).2014:489-502.
[8]JAMSHED M A,MOON Y G,KIM D,et al.mOS:A reusable networking stack for flow monitoring middleboxes[C]//14th USENIX Symposium on Networked Systems Design and Implementation(NSDI 17).2017:113-129.
[9]WANG S,LOU C,CHEN R,et al.Fast and Concurrent RDFQueries using RDMA-assisted GPU Graph Exploration[C]//2018 USENIX Annual TechnicalConference(USENIX ATC 18).2018:651-664.
[10]XUE J,MIAO Y,CHEN C,et al.Fast distributed deep learning over rdma[C]//Proceedings of the Fourteenth EuroSys Conference 2019.2019:1-14.
[11]ZHANG J,LU X,CHU C H,et al.C-GDR:High-Performance Container-aware GPUDirect MPI Communication Schemes on RDMA Networks[C]//2019 IEEE International Parallel and Distributed Processing Symposium(IPDPS).IEEE,2019:242-251.
[12]ZHANG R,SHEN G,GONG L,et al.DSANA:A distributed machine learning acceleration solution based on dynamic scheduling and network acceleration[C]//2020 IEEE 22nd International Conference on High Performance Computing and Communications;IEEE 18th International Conference on Smart City;IEEE 6th International Conference on Data Science and Systems(HPCC/SmartCity/DSS).IEEE,2020:302-311.
[13]DRAGOJEVIC' A,NARAYANAN D,CASTRO M,et al.FaRM:Fast remote memory[C]//11th {USENIX} Symposium on Networked Systems Design and Implementation({NSDI} 14).2014:401-414.
[14]TSAI S Y,ZHANG Y.Lite kernel rdma support for datacenter applications[C]//Proceedings of the 26th Symposium on Ope-rating Systems Principles.2017:306-324.
[15]CHEN Y,LU Y,SHU J.Scalable RDMA RPC on reliable connection with efficient resource sharing[C]//Proceedings of the Fourteenth EuroSys Conference 2019.2019:1-14.
[16]MONGA S K,KASHYAP S,MIN C.Birds of a Feather Flock Together:Scaling RDMA RPCs with Flock[C]//Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles.2021:212-227.
[17]JONATHAN C.Batch processing of network packets[DB/OL].https://lwn.net/Articles/763056/.
[18]LANGE S,LINGUAGLOSSA L,GEISSLER S,et al.Discrete-time modeling ofnfv accelerators that exploit batched processing[C]//IEEE INFOCOM 2019-IEEE Conference on Computer Communications.IEEE,2019:64-72.
[19]LINGUAGLOSSA L,LANGE S,PONTARELLI S,et al.Sur-vey of performance acceleration techniques for network function virtualization[J].Proceedings of the IEEE,2019,107(4):746-764.
[20]LÉVAI T,NÉMETH F,RAGHAVAN B,et al.Batchy:Batch-scheduling data flow graphs with service-level objectives[C]//17th USENIX Symposium on Networked Systems Design and Implementation(NSDI 20).2020:633-649.
[21]LI M Q.Research on cross-protocol user-space proxy technology for data center network[D].Guiyang:Guizhou University.2021.
[22]WILL G.wrk:Modern HTTP benchmarking tool[EB/OL].https://github.com/wg/wrk.
[1] LIU Chenwei, SUN Jian, LEI Bingbing, XU Tao, WU Zhuiwei. Task Scheduling Strategy for Energy Consumption Optimization of Cloud Data Center Based on Improved Particle Swarm Algorithm [J]. Computer Science, 2023, 50(7): 246-253.
[2] PAN Zhi-yong, CHENG Bao-lei, FAN Jian-xi, BIAN Qing-rong. Algorithm to Construct Node-independent Spanning Trees in Data Center Network BCDC [J]. Computer Science, 2022, 49(7): 287-296.
[3] OU Dong-yang, ZHANG Kai-qiang, CHEN Sheng-lei, JIANG Cong-feng, YAN Long-chuan. Data Center Power Attack Defense Strategy Based on PCPEC [J]. Computer Science, 2022, 49(12): 374-380.
[4] YI Yi, FAN Jian-xi, WANG Yan, LIU Zhao, DONG Hui. Fault-tolerant Routing Algorithm in BCube Under 2-restricted Connectivity [J]. Computer Science, 2021, 48(6): 253-260.
[5] ZHANG Deng-ke, WANG Xing-wei, HE Qiang, ZENG Rong-fei, YI bo. State-of-the-art Survey on Reconfigurable Data Center Networks [J]. Computer Science, 2021, 48(3): 246-258.
[6] LI Xiao-guang, SHAO Chao. Density Peak Clustering Algorithm Based on Grid Data Center [J]. Computer Science, 2019, 46(6A): 457-460.
[7] JIN Yong, LIU Yi-xing, WANG Xin-xin. SDN-based Multipath Traffic Scheduling Algorithm for Data Center Network [J]. Computer Science, 2019, 46(6): 90-94.
[8] FAN Zi-fu, LI Shu and ZHANG Dan. Traffic Scheduling Based Congestion Control Algorithm for Data Center Network on Software Defined Network [J]. Computer Science, 2017, 44(Z6): 266-269.
[9] QIAO Yan, JIAO Jun and RAO Yuan. Traffic Estimation for Data Center Network Based on Traffic Characteristics [J]. Computer Science, 2017, 44(2): 171-175.
[10] MENG Fei, LAN Ju-long and HU Yu-xiang. Richards Model Based Data Center Backbone Network Bandwidth Allocation Policy [J]. Computer Science, 2016, 43(1): 133-136.
[11] ZHENG Jian, CAI Ting and DU Xing. Workload Scheduling for Minimizing Electricity Cost of Data Center [J]. Computer Science, 2015, 42(Z11): 542-543.
[12] HUANG Zhao-nian, LI Hai-shan and ZHAO Jun. Virtual Machine Placement Algorithm Based on Improved Genetic Algorithm [J]. Computer Science, 2015, 42(Z11): 406-407.
[13] ZHOU Dong-qing and SI Qing-qian. Energy-efficient Virtual Machine Placement for Heterogeneous Cloud Platform [J]. Computer Science, 2015, 42(3): 81-84.
[14] LONG Yu-jiang and ZHU Zhou. Research on Configuration Management Method of Data Center Based on Ontology [J]. Computer Science, 2014, 41(Z6): 494-498.
[15] XU Guan-jun. Exploratory on Virtualization-based Application Disaster Recovery Platform [J]. Computer Science, 2014, 41(Z11): 426-429.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!