Computer Science ›› 2026, Vol. 53 ›› Issue (6A): 250800073-8.doi: 10.11896/jsjkx.250800073

• Artificial Intelligence • Previous Articles     Next Articles

Dual-stream Heterogeneous Social Graph for Micro-video Popularity Prediction

ZHANG Xinliang1, LIU Lilong2, CHEN Shangheng3, CHEN Ziyang2, QIAN Shengsheng3   

  1. 1 China National Institute of Standardization,Beijing 100191,China
    2 Henan Institute of Advanced Technology,Zhengzhou University,Zhengzhou 450003,China
    3 State Key Laboratory of Multimodal ArtificialIntelligence Systems,Institute of Automation,Chinese Academy of Sciences,Beijing 100190,China
  • Online:2026-06-16 Published:2026-06-12
  • About author:ZHANG Xinliang,born in 1983,master.His main research interests include Live streaming short videos and platform economy.
    QIAN Shengsheng,born in 1991,Ph.D,professor,is a member of CCF(No. 77702M).His main research interests include data mining and multimedia content analysis.
  • Supported by:
    National Key Research and Development Program(2023YFC3310700),National Natural Science Foundation of China(62276257),Central Basic Research Business Fund Project(242024Y-11461) and China National Institute of Standardization Self-Funded Project(552025Z-12972).

Abstract: With the rapid rise of short videos on digital platforms,micro-video popularity prediction(MVPP) has become an important research area.Short videos contain rich multimodal content,including video frames,text,and social network interaction data,all of which significantly influence their popularity.However,existing methods have two main shortcomings:they typically rely only on the multimodal content features of the short video itself and fail to effectively model the complex social network structure information formed by user interactions(such as comments,likes,and shares);when handling large-scale social multimodal graphs,existing graph learning methods often lead to the loss of valuable multimodal signals due to neighbor sampling strategies.To address these shortcomings,this paper proposes a novel approach-DHSGL(Dual-Stream Heterogeneous Social Graph Learning Framework).The core innovation of this method lies in:1)proposing an efficient graph learning pre-calculation strategy,which aggregates the complete graph structure information through a single global propagation and constructs unimodal graphs to preserve the original modality features,thus significantly reducing information loss;2)constructing a social multimodal graph that integrates social interactions and multimodal content to fully leverage the neglected social structure information.Experimental results demonstrate a significant improvement in prediction performance,validating the effectiveness of integrating social network structure with multimodal content to enhance micro-video popularity prediction.

Key words: Micro-video popularity prediction, Multimodal fusion, Social network, Heterogeneousgraph neural networks

CLC Number: 

  • TP391
[1] LAKKARAJU H,AJMERA J.Attention prediction on socialmedia brand pages[C]//Proceedings of the 20th ACM International Conference on Information and Knowledge Management.2011:2157-2160.
[2] CAI D,QIAN S,FANG Q,et al.Heterogeneous graph contrastive learning network for personalized micro-video recommendation[J].IEEE Transactions on Multimedia,2022,25:2761-2773.
[3] OU N,YU L,LI H,et al.MTAF:Shopping guide micro-videos popularity prediction using multimodal and temporal attention fusion approach[C]//2022 IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP 2022).IEEE,2022:4543-4547.
[4] KHOSLA A,DAS SARMA A,HAMID R.What makes animage popular?[C]//Proceedings of the 23rd International Conference on World Wide Web.2014:867-876.
[5] LAI X,ZHANG Y,ZHANG W.HyFea:Winning solution to social media popularity prediction for multimedia grand challenge 2020[C]//Proceedings of the 28th ACM International Confe-rence on Multimedia.2020:4565-4569.
[6] PINTO H,ALMEIDA J M,GONÇALVES M A.Using early view patterns to predict the popularity of youtube videos[C]//Proceedings of the Sixth ACM International Conference on Web Search and Data Mining.2013:365-374.
[7] AHMED M,SPAGNA S,HUICI F,et al.A peek into the future:Predicting the evolution of popularity in user generated content[C]//Proceedings of the Sixth ACM International Conference on Web Search and Data Mining.2013:607-616.
[8] CHEN J,SONG X,NIE L,et al.Micro tells macro:Predicting the popularity of micro-videos via a transductive model[C]//Proceedings of the 24th ACM International Conference on Multimedia.2016:898-907.
[9] XIE J,ZHU Y,CHEN Z.Micro-video popularity prediction via multimodal variational information bottleneck[J].IEEE Transactions on Multimedia,2021,25:24-37.
[10] ZHONG T,LANG J,ZHANG Y,et al.Predicting Micro-video Popularity via Multi-modal Retrieval Augmentation[C]//Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval.2024:2579-2583.
[11] CHENG Z,LI J,LANG J,et al.In-context Prompt-augmented Micro-video Popularity Prediction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2025:11527-11535.
[12] HUANG Y,ZHAO C.Study on Three-level Short Video User Portrait Based on Improved Topic Model Method[J].Computer Science,2024,51(S1):698-704.
[13] MA Z,SUN A,CONG G.On predicting the popularity of newly emerging hashtags in t witter[J].Journal of the American Society for Information Science and Technology,2013,64(7):1399-1410.
[14] ROY S D,MEI T,ZENG W,et al.Towards cross-domain learning for social video popularity prediction[J].IEEE Transactions on Multimedia,2013,15(6):1255-1267.
[15] CAO Q,SHEN H,GAO J,et al.Popularity prediction on social platforms with coupled graph neural networks[C]//Proceedings of the 13th International Conference on Web Search and Data Mining.2020:70-78.
[16] KONG Q,MAO W,CHEN G,et al.Exploring trends and pat-terns of popularity stage evolution in social media[J].IEEE Transactions on Systems,Man,and Cybernetics:Systems,2018,50(10):3817-3827.
[17] ZHANGTAO C,TING Z,SHENGMING Z,et al.Survey of recommender systems based on graph learning[J].Computer Science,2022,49(9):1-13.
[18] CASANOVA P,LIO A R P,BENGIO Y.Graph attention networks[C]//ICLR.2018.
[19] HAMILTON W,YING Z,LESKOVEC J.Inductive representation learning on large graphs[C]//Advances in Neural Information Processing Systems.2017.
[20] SU J,MAO L,WU C.BG-HGNN:Toward Scalable and Efficient Heterogeneous Graph Neural Network[J].arXiv:2403.08207,2024.
[21] ZHU S,ZHOU C,PAN S,et al.Relation structure-aware heterogeneous graph neural network[C]//2019 IEEE International Conference on Data ining(ICDM).IEEE,2019:1534-1539.
[22] HU Z,DONG Y,WANG K,et al.Heterogeneous graph trans-former[C]//Proceedings of the Web Conference 2020.2020:2704-2710.
[23] YANG X,YAN M,PAN S,et al.Simple and efficient heterogeneous graph neural network[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2023:10816-10824.
[24] HU J,HOOI B,HE B.Efficient heterogeneous graph learning via random projection[J].IEEE Transactions on Knowledge and Data Engineering,2024,36(12):8093-8107.
[25] WU B,XU C,DAI X,et al.Visual transformers:Token-basedimage representation and processing for computer vision[J].arXiv:2006.03677,2020.
[26] LI J,LI D,XIONG C,et al.Blip:Bootstrapping language-image pre-training for unified vision-language understanding and ge-neration[C]//International Conference on Machine Learning.PMLR,2022:12888-12900.
[27] NI Y,CHENG Y,LIU X,et al.A content-driven micro-video recommendation dataset at scale[J].arXiv:2309.15379,2023.
[28] GHOSH S,VINYALS O,STROPE B,et al.Contextual lstm(clstm) models for large scale nlp tasks[J].arXiv:1602.06291,2016.
[29] CHEUNG T,LAM K.Crossmodal bipolar attention for multimodal classification on social media[J].Neurocomputing,2022,514:1-12.
[30] SCHLICHTKRULL M,KIPF T N,BLOEM P,et al.Modeling relational data with graph convolutional networks[C]//The Semantic Web:15th International Conference(ESWC 2018).Springer,2018:593-607.
[31] HONG H,GUO H,LIN Y,et al.An attention-based graph neural network for heterogeneous structural learning[C]//Procee-dings of the AAAI Conference on Artificial Intelligence.2020:4132-4139.
[1] WANG Yuhan, MA Fuyuan, MA Shixuan, WANG Ying. SINDy-GSN:Sparse Identification of Network Dynamics for Group Behavior in Social Graphs [J]. Computer Science, 2026, 53(6A): 250700014-10.
[2] WANG Liyan, ZHANG Qian, GUO Yuanyuan, CHEN Haifeng, LI Jian. Multimodal Continuous Emotion Recognition for English Spoken Emotion Evaluation [J]. Computer Science, 2026, 53(5): 99-108.
[3] ZHANG Xueqin, WANG Zhineng, LI Jinsheng, LU Yisong, LUO Fei. Key Node Identification in Temporal Social Networks Based on Deep Learning and Multi-feature Fusion [J]. Computer Science, 2026, 53(4): 143-154.
[4] CHANG Wenxia, ZHANG Chao, LI Wentao, ZHAN Jianming, LI Deyu. Modeling of Behavior-guided Multi-scale Bi-level Group Consensus Under Social Networks [J]. Computer Science, 2026, 53(4): 180-187.
[5] LIU Wei, XU Yong, FANG Juan, LI Cheng, ZHU Yujun, FANG Qun, HE Xin. Multimodal Air-writing Gesture Recognition Based on Radar-Vision Fusion [J]. Computer Science, 2025, 52(9): 259-268.
[6] GAO Long, LI Yang, WANG Suge. Sentiment Classification Method Based on Stepwise Cooperative Fusion Representation [J]. Computer Science, 2025, 52(9): 313-319.
[7] JIN Hong, CHEN Like, YOU Lan, LYU Shunying, ZHOU Kaicheng, XIAO Kui. Point-of-interest Recommendation Based on Geospatial-TemporalCorrelations and Social Influence [J]. Computer Science, 2025, 52(5): 128-138.
[8] CHEN Zhangyuan, CHEN Ling, LIU Wei, LI Bin. Method for Selecting Observers Based on Doubly Resolving Set in Independent Cascade Model [J]. Computer Science, 2025, 52(4): 280-290.
[9] LI Zongmin, RONG Guangcai, BAI Yun, XU Chang , XIAN Shiyang. 3D Object Detection with Dynamic Weight Graph Convolution [J]. Computer Science, 2025, 52(3): 104-111.
[10] LI Shao, JIANG Fangting, YANG Xinyan, LIANG Gang. Rumor Detection on Potential Hot Topics with Bi-directional Graph Attention Network [J]. Computer Science, 2025, 52(3): 277-286.
[11] ZHANG Fan, LI Ang. Multi-modal Fusion Based Object Detection for All-day and Multi-scenario Environments [J]. Computer Science, 2025, 52(11A): 241100137-10.
[12] LIU Wei, WU Fei, GUO Zhen, CHEN Ling. Two Stage Rumor Blocking Method Based on EHEM in Social Networks [J]. Computer Science, 2024, 51(7): 156-166.
[13] KANG Zhiyong, LI Bicheng, LIN Huang. User Interest Recognition Method Incorporating Category Labels and Topic Information [J]. Computer Science, 2024, 51(6A): 230500169-8.
[14] DENG Ziwei, CHEN Ling, LIU Wei. Continuous Influence Maximization Under Independent Cascade Propagation Model [J]. Computer Science, 2024, 51(6): 161-171.
[15] WANG Gengrun. Survey of Research and Application of User Identity Linkage Technology in Cyberspace [J]. Computer Science, 2024, 51(5): 12-20.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!