Computer Science ›› 2026, Vol. 53 ›› Issue (6A): 250900126-6.doi: 10.11896/jsjkx.250900126

• Computer Software & Architecture • Previous Articles     Next Articles

Conjugate Gradient Preconditioner Adaptive Selection Algorithm via Deep Learning

LI Qin1, WU Siyuan2, YANG Haoyuan2, DU Qin2, LING Xu1, XIAO Guoqing3   

  1. 1 College of Mechanical Engineering,Hunan Chemical Vocational Technology College,Zhuzhou,Hunan 412000,China
    2 College of Computer Science and Electronic Engineering,Hunan University,Changsha 410082,China
    3 Research Institute of Hunan University in Chongqing,Chongqing 401135,China
  • Online:2026-06-16 Published:2026-06-12
  • About author:LI Qin,born in 1968,professor.Her main research interests include digital design and manufacturing,and so on.
    DU Qin,born in 2002,Ph.D.His main research interests include high-perfor-mance computing and AI computing.
  • Supported by:
    Natural Science Foundation of Hunan Province,China(2023JJ60002).

Abstract: Precondition Conjugate Gradient(PCG) algorithm is an iterativesolving algorithm for solving large-scale sparse matrices which iswidely used in fields such as scientific engineering computing and artificial intelligence.Existing research focuses on using deep learning to generate pre condition operators to improve solving speed.However,fixed preconditioning operators lack generality and are difficult to apply to all sparse matrices because of the spatial complexity of sparse matrices.To address this issue,a preconditioner operator adaptive selection algorithm based on deep learning and its optimization method are proposed.Firstly,a convolutional neural network(PCNN) is designed to capture the spatial structural characteristics of sparse matrices.Secondly,an adaptive classification prediction model combining multi-layer perceptronsis constructed to select the optimal precondition operator.Finally,experimental results on the publicly available dataset in Florida show that the proposed method has a better classification accuracy than deep learning methods such as MLP and SVM,reaching 70.49%;Compared with the PCG algorithm based on Jacobi,ICCG,and SSOR,the proposed method improves performance by 5.5,4.3,and 6.2 times,respectively.

Key words: Conjugate gradient, Precondition method, Deep learning, Large scale sparse matrix, Iterative solution method

CLC Number: 

  • O241.6
[1] GAIHRE A,LI X S,LIU H.Gsofa:Scalable sparse symbolic lu factorization on gpus[J].IEEE Transactions on Parallel and Distributed Systems,2021,33(4):1015-1026.
[2] KARSAVURAN M O,NG E G,PEYTON B W.GPU Accelerated Sparse Cholesky Factorization[C]//SC24-W:Workshops of the International Conference for High Performance Computing,Networking,Storage and Analysis.IEEE,2024:703-707.
[3] LIU R Y,XU Z Q,SHANG H H,et al.Study of Iterative Solution Algorithm of Response Density Matrix in Density Functio-nal Perturbation Theory[J].Computer Science,2023,50(6):81-85.
[4] HESTENES M R,STIEFEL E.Methods of conjugate gradients for solving linear systems[J].Journal of research of the National Bureau of Standards,1952,49(6):409-436.
[5] YE Y,GUO H,WANG B,et al.Coupled Incomplete Cholesky and Jacobi Preconditioned Conjugate Gradient on the New Generation of Sunway Many-Core Architecture[J].IEEE Transactions on Computers,2023,72(11):3326-3339.
[6] NAPOV A.An incomplete Cholesky preconditioner based on orthogonal approximations[J].SIAM Journal on Scientific Computing,2023,45(2):A729-A752.
[7] HUANG G H,XU Y Z,YI X W,et al.Highly efficient iterative methods for solving linear equations of three-dimensional sphere discontinuous deformation analysis[J].International Journal for Numerical and Analytical Methods in Geomechanics,2020,44(9):1301-1314.
[8] MENG X F,HAO X L,MA C H,et al.Research on Machine Learning for Scientific Discovery[J].Chinese Journal of Computers,2023,46(5):877-895.
[9] MA H J.Recommendation Algorithm Based on Convolutional Neural Network and Constrained Probability Matrix Factorization[J].Computer Science,2020,47(S1):540-545.
[10] LOCKHART S,BIENZ A,GROPP W,et al.Performance analysis and optimal node-aware communication for enlarged conjugate gradient methods[J].ACM Transactions on Parallel Computing,2023,10(1):1-25.
[11] TIWARI M,VADHIYAR S.Pipelined Preconditioned Conjugate Gradient Methods for real and complex linear systems for distributed memory architectures[J].Journal of Parallel and Distributed Computing,2022,163:147-155.
[12] YUAN F,YANG X,LI S,et al.Optimizing multi-grid preconditioned conjugate gradient method on multi-cores[J].IEEE Transactions on Parallel and Distributed Systems,2024,35(5):768-779.
[13] GAO J,ZHOU Y,HE G,et al.A multi-GPU parallel optimization model for the preconditioned conjugate gradient algorithm[J].Parallel Computing,2017,63:1-16.
[14] ZHANG T,XUE F.A New Preconditioned Nonlinear Conjugate Gradient Method in Real Arithmetic for Computing the Ground States of Rotational Bose-Einstein Condensate[J].SIAM Journal on Scientific Computing,2024,46(3):A1764-A1792.
[15] LI Y,CHEN P Y,DU T,et al.Learning preconditioners for conjugate gradient PDE solvers[C]//International Conference on Machine Learning.PMLR,2023:19425-19439.
[16] TRIFONOV V,RUDIKOV A,ILIEV O,et al.Learning from linear algebra:A graph neural network approach to preconditioner design for conjugate gradient solvers[J].arXiv:2405.15557,2024.
[17] LIN S,YANG W,WANG H,et al.STM-multifrontal QR:streaming task mapping multifrontal QR factorization empowered by GCN[C]//Proceedings of the International Conference for High Performance Computing,Networking,Storage and Analysis.2021:1-14.
[18] WANG H,YANG W,OUYANG R,et al.A heterogeneous parallel computing approach optimizing SpTTM on CPU-GPU via GCN[J].ACM Transactions on Parallel Computing,2023,10(2):1-23.
[19] SUN Q,LIU Y,DUN M,et al.SpTFS:Sparse tensor format selection for MTTKRP via deep learning[C]//SC20:International Conference for High Performance Computing,Networking,Stora-ge and Analysis.IEEE,2020:1-14.
[20] HESTENES M R,STIEFEL E.Methods of conjugate gradients for solving linear systems[J].Journal of research of the National Bureau of Standards,1952,49(6):409-436.
[21] CHEN R S,YANG L,ZHAO L,et al.SSOR preconditioned CG method for the linear interference cancellation of asynchronous CDMA systems[J].International Journal of Electronics,2004,91(12):745-757.
[22] ZHENG C,ZHANG J H.SSOR-PCG method used in simulation of geotechnical engineering with finite element method [J].Chinese Journal of Rock Mechanics and Engineering,2007,26(S1):2820-2826.
[23] MUKUNOKI D,OZAKI K,OGITA T,et al.Conjugate gradient solvers with high accuracy and bit-wise reproducibility between CPU and GPU using Ozaki scheme[C]//The International Conference on High Performance Computing in Asia-Pacific Region.2021:100-109
[24] VAPNIK V,IZMAILOV R.Reinforced SVM method and memorization mechanisms[J].Pattern Recognition,2021,119:108018.
[25] ZHAO Y,LI J,LIAO C,et al.Bridging the gap between deep learning and sparse matrix format selection[C]//Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming.2018:94-108.
[1] LI Siyu, QIAN Wenhua. HCKD:Lightweight Skin Lesion Classification Method Based on Dermoscopic Images [J]. Computer Science, 2026, 53(6A): 250600143-9.
[2] CHEN Nuo, ZHAO Peng, HUAN Haisheng. Review of Small Object Detection Based on Deep Learning [J]. Computer Science, 2026, 53(6A): 250700022-9.
[3] LIU Zixuan, TANG Xiaoyong. PID-Dynamic LSTM Generation Model for MCU Driver Code Based on Dynamically-tuned Cross-entropy Loss [J]. Computer Science, 2026, 53(6A): 250800005-9.
[4] ZHANG Xiaozhu, CHEN Hongyou, QU Lingfeng, WANG Yuechenjia, TIAN Baodan, FAN Yong. Carbon Emission Prediction Algorithm Based on TransLSTM-GAN Model [J]. Computer Science, 2026, 53(6A): 250400146-11.
[5] FU Yue, SHI Wei. Social Text MBTI Personality Feature Recognition Method Based on Data Fusion and Deep Learning [J]. Computer Science, 2026, 53(6A): 250500101-8.
[6] SU Ye, XU Xin, ZHAO Longlong, LI Xiaoli, CHEN Pan, CHEN Jinsong. LitchiNet:Lightweight Litchi Variety Recognition Network with Fused Multi-scale Gated Attention and Class Imbalance Awareness [J]. Computer Science, 2026, 53(6A): 250600127-8.
[7] CHEN Di, YIN Jibin. Dynamic Adjustment Technology of Eye Movement Input Based on TCN-AttnRNN Model [J]. Computer Science, 2026, 53(6A): 250300095-7.
[8] WANG Baohui, TAN Yingjie , CHEN Jixuan. Occlusion Head Pose Estimation Algorithm Based on Riemann Optimization [J]. Computer Science, 2026, 53(6A): 250300109-9.
[9] CHU Chunyu, JIANG Feilong. Water Meter Reading Recognition Based on Deep Learning and Prior Correction [J]. Computer Science, 2026, 53(6A): 250300143-7.
[10] WU Xiaoxiao, WU Xinglong. Prenatal Diagnosis of Fetal Cerebellum Based on Brain Anatomical Structures [J]. Computer Science, 2026, 53(6A): 250400049-7.
[11] WANG Yipin, CAI Chenghuan, XU Jiabin, ZHOU Xuegong, ZHANG Fengzhe, CAO Wei, ZHANG Fan, YU Xinsheng. Study on Compilation Technology of Neural Network Accelerator Based on RISC-V InstructionExtension [J]. Computer Science, 2026, 53(6): 128-136.
[12] LI Xiuying, CHEN Xuesong, LI Haoze, LIAO Hongwei, HAN Jiameng, DUAN Xiaoyi. MambaCS:Mamba-based Image Compressed Sensing Algorithm [J]. Computer Science, 2026, 53(6): 232-241.
[13] MA Ning, CHANG Xia, YUAN Lingyu. Pansharpening Method Based on Double-side Guided Filtering and Multi-feature Recalibration [J]. Computer Science, 2026, 53(6): 270-280.
[14] GUO Jingchen, YANG Kuiwu, DING Mengdi, WEI Jianghong. Survey of Adversarial Sample Attacks for Vision Transformer [J]. Computer Science, 2026, 53(5): 404-418.
[15] CHEN Yuansheng, CHEN Shunjue, MO Xuan, WU Weigang, LI Jialun. Deep Learning Training Time Prediction Algorithm Integrating Multi-dimensional Operator Features [J]. Computer Science, 2026, 53(5): 129-136.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!