Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241000145-9.doi: 10.11896/jsjkx.241000145

• Artificial Intelligence • Previous Articles     Next Articles

PPIS-MFH:Predicting Protein-Protein Interaction Sites Based on Multi-feature HybridNetwork Integrating ViT

HU Zhaolong, HU Chunling, HU Ruijie, GUO Longju   

  1. School of Artificial Intelligence and Big Data,Hefei University,Hefei 230601,China
  • Online:2025-11-15 Published:2025-11-10
  • Supported by:
    National Natural Science Foundation of China:Research on Local Graph Representation Learning for Dynamic Knowledge Graph(62306100).

Abstract: The deeper principles of molecular life can be revealed through an in-depth study of protein-protein interaction sites(PPIS).However,existing methods for identifying PPIS are complex and time-consuming,and more accurate models are needed for PPIS prediction.Although deep learning techniques based on attention mechanisms and convolutional neural networks(CNNs) have made progress in PPIS prediction,they still face limitations in capturing amino acid features.To effectively capture long-range dependencies in protein sequences and accurately characterize amino acid properties,this paper proposes a multi-feature hybrid network(MFH),PPIS-MFH,for predicting protein-protein interaction sites.Protein-protein interaction sites are predicted by combining both global and local sequence features.For local sequence features,the PPIS-MFH model incorporates a Vision Transformer(ViT) module,which captures long-range dependencies and extracts local features from protein sequences.For global sequence features,the model employs a bidirectional gated recurrent neural network to discern intrinsic connections between amino acids in protein sequences.This is achieved through a feature crossover network that combines a text convolutional neural network(TextCNN) with an attention mechanism,specifically a text recurrent neural network(TextRNN-Attention).In this study,the PPIS-MFH model was evaluated on four datasets and compared with eight similar methods.The experimental results show that,on most metrics,the proposed method outperforms other similar methods.

Key words: Protein-protein interaction site, Attention mechanism, Text convolutional neural network, Bidirectional gated recurrent neural network, Feature crosses network

CLC Number: 

  • TP391
[1]DAS S,CHAKRABARTI S.Classification and prediction of protein-protein interaction interface using machine learning algorithm [J].Scientific Reports,2021,11(1):1761.
[2]BUTLAND G,PEREGRÍN-ALVAREZ J M,LI J,et al.Interaction network containing conserved and essential protein complexes in Escherichia coli [J].Nature,2005,433(7025):531-537.
[3]LI X,LI W,ZENG M,et al.Network-based methods for predicting essential genes or proteins:a survey [J].Briefings in Bioinformatics,2020,21(2):566-583.
[4]DE LAS RIVAS J,FONTANILLO C.Protein-protein interac-tions essentials:key concepts to building and analyzing interactome networks [J].PLoS Computational Biology,2010,6(6):e1000807.
[5]BRETTNER L M,MASEL J.Protein stickiness,rather thannumber of functional protein-protein interactions,predicts expression noise and plasticity in yeast [J].BMC Systems Biology,2012,6:1-10.
[6]TERENTIEV A A,MOLDOGAZIEVA N T,SHAITAN K V.Dynamic proteomics in modeling of the living cell.Protein-protein interactions [J].Biochemistry(Moscow),2009,74:1586-1607.
[7]WODAK S J,VLASBLOM J,TURINSKY A L,et al.Protein-protein interaction networks:the puzzling riches [J].Current Opinion in Structural Biology,2013,23(6):941-953.
[8]LI Y,GOLDING G B,ILIE L.DELPHI:accurate deep ensemble model for protein interaction sites prediction [J].Bioinformatics,2021,37(7):896-904.
[9]HOU Q,DE GEEST P F G,VRANKEN W F,et al.Seeing thetrees through the forest:sequence-based homo-and heteromeric protein-protein interaction sites prediction using random forest [J].Bioinformatics,2017,33(10):1479-1487.
[10]HOU Q,LENSINK M F,HERINGA J,et al.Club-martini:se-lectingfavourable interactions amongst available candidates,a coarse-grained simulation approach to scoring docking decoys [J].PloS One,2016,11(5):e0155251.
[11]ZHOU Y,JIANG Y,YANG Y.AGAT-PPIS:A novel protein-protein interaction site predictor based on augmented graph attention network with initial residual and identity mapping [J].Briefings in Bioinformatics,2023,24(3):bbad122.
[12]PITRE S,DEHNE F,CHAN A,et al.PIPE:a protein-protein interaction prediction engine based on the re-occurring short polypeptide sequences between known interacting protein pairs [J].BMC Bioinformatics,2006,7:1-15.
[13]OFRAN Y,ROST B.Predicted protein-protein interaction sites from local sequence information [J].FEBS Letters,2003,544(1/2/3):236-239.
[14]MURAKAMI Y,MIZUGUCHI K.Applying the Naïve Bayesclassifier with kernel density estimation to the prediction of protein-protein interaction sites [J].Bioinformatics,2010,26(15):1841-1848.
[15]YOUSEF A,CHARKARI N M.A novel methodbased on new adaptive LVQ neural network for predicting protein-protein interactions from protein sequences [J].Journal of Theoretical Biology,2013,336:231-239.
[16]SINGH G,DHOLE K,PAI P P,et al.SPRINGS:prediction of protein-protein interaction sites using artificial neural networks [R].PeerJ PrePrints,2014.
[17]WANG B,CHEN P,WANG P,et al.Radial basis function neural network ensemble for predicting protein-protein interaction sites in heterocomplexes [J].Protein and Peptide Letters,2010,17(9):1111-1116.
[18]KOIKE A,TAKAGI T.Prediction of protein-protein interaction sites using support vector machines [J].Protein Engineering Design and Selection,2004,17(2):165-173.
[19]WANG X,YU B,MA A,et al.Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique [J].Bioinformatics,2019,35(14):2395-2402.
[20]ZENG M,ZHANG F,WU F X,et al.Protein-protein interaction site prediction through combining local and global features with deep neural networks [J].Bioinformatics,2020,36(4):1114-1120.
[21]ZHANG B,LI J,QUAN L,et al.Sequence-based prediction of protein-protein interaction sites by simplified long short-term memory network [J].Neurocomputing,2019,357:86-100.
[22]LU S,LI Y,NAN X,et al.Attention-based convolutional neural networks for protein-protein interaction site prediction [C]//2021 IEEE International Conference on Bioinformatics and Biomedicine(BIBM).IEEE,2021:141-144.
[23]CONG H,LIU H,CAO Y,et al.Protein-protein interaction site prediction by modelensembling with hybrid feature and self-attention [J].BMC Bioinformatics,2023,24(1):456.
[24]WANG X,YU B,MA A,et al.Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique [J].Bioinformatics,2019,35(14):2395-2402.
[25]JOOSTEN R P,TE BEEK T A H,KRIEGER E,et al.A series of PDB related databases for everyday needs [J].Nucleic Acids Research,2010,39(suppl_1):D411-D419.
[26]KABSCH W,SANDER C.Dictionary of protein secondarystructure:pattern recognition of hydrogen-bonded and geometrical features [J].Biopolymers:Original Research on Biomolecules,1983,22(12):2577-2637.
[27]WANG J,YANG B,REVOTE J,et al.POSSUM:a bioinformatics toolkit for generating numerical sequence feature descriptors based onPSSM profiles [J].Bioinformatics,2017,33(17):2756-2758.
[28]WODAK S J,VLASBLOM J,TURINSKY A L,et al.Protein-protein interaction networks:the puzzling riches[J].Current Opinion in Structural Biology,2013,23(6):941-953.
[1] PENG Jiao, HE Yue, SHANG Xiaoran, HU Saier, ZHANG Bo, CHANG Yongjuan, OU Zhonghong, LU Yanyan, JIANG dan, LIU Yaduo. Text-Dynamic Image Cross-modal Retrieval Algorithm Based on Progressive Prototype Matching [J]. Computer Science, 2025, 52(9): 276-281.
[2] GAO Long, LI Yang, WANG Suge. Sentiment Classification Method Based on Stepwise Cooperative Fusion Representation [J]. Computer Science, 2025, 52(9): 313-319.
[3] LIU Jian, YAO Renyuan, GAO Nan, LIANG Ronghua, CHEN Peng. VSRI:Visual Semantic Relational Interactor for Image Caption [J]. Computer Science, 2025, 52(8): 222-231.
[4] LIU Yajun, JI Qingge. Pedestrian Trajectory Prediction Based on Motion Patterns and Time-Frequency Domain Fusion [J]. Computer Science, 2025, 52(7): 92-102.
[5] LIU Chengzhuang, ZHAI Sulan, LIU Haiqing, WANG Kunpeng. Weakly-aligned RGBT Salient Object Detection Based on Multi-modal Feature Alignment [J]. Computer Science, 2025, 52(7): 142-150.
[6] ZHUANG Jianjun, WAN Li. SCF U2-Net:Lightweight U2-Net Improved Method for Breast Ultrasound Lesion SegmentationCombined with Fuzzy Logic [J]. Computer Science, 2025, 52(7): 161-169.
[7] ZHENG Cheng, YANG Nan. Aspect-based Sentiment Analysis Based on Syntax,Semantics and Affective Knowledge [J]. Computer Science, 2025, 52(7): 218-225.
[8] WANG Youkang, CHENG Chunling. Multimodal Sentiment Analysis Model Based on Cross-modal Unidirectional Weighting [J]. Computer Science, 2025, 52(7): 226-232.
[9] KONG Yinling, WANG Zhongqing, WANG Hongling. Study on Opinion Summarization Incorporating Evaluation Object Information [J]. Computer Science, 2025, 52(7): 233-240.
[10] ZENG Fanyun, LIAN Hechun, FENG Shanshan, WANG Qingmei. Material SEM Image Retrieval Method Based on Multi-scale Features and Enhanced HybridAttention Mechanism [J]. Computer Science, 2025, 52(6A): 240800014-7.
[11] HOU Zhexiao, LI Bicheng, CAI Bingyan, XU Yifei. High Quality Image Generation Method Based on Improved Diffusion Model [J]. Computer Science, 2025, 52(6A): 240500094-9.
[12] DING Xuxing, ZHOU Xueding, QIAN Qiang, REN Yueyue, FENG Youhong. High-precision and Real-time Detection Algorithm for Photovoltaic Glass Edge Defects Based onFeature Reuse and Cheap Operation [J]. Computer Science, 2025, 52(6A): 240400146-10.
[13] WANG Rong , ZOU Shuping, HAO Pengfei, GUO Jiawei, SHU Peng. Sand Dust Image Enhancement Method Based on Multi-cascaded Attention Interaction [J]. Computer Science, 2025, 52(6A): 240800048-7.
[14] WANG Baohui, GAO Zhan, XU Lin, TAN Yingjie. Research and Implementation of Mine Gas Concentration Prediction Algorithm Based on Deep Learning [J]. Computer Science, 2025, 52(6A): 240400188-7.
[15] GUAN Xin, YANG Xueyong, YANG Xiaolin, MENG Xiangfu. Tumor Mutation Prediction Model of Lung Adenocarcinoma Based on Pathological [J]. Computer Science, 2025, 52(6A): 240700010-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!