Computer Science ›› 2024, Vol. 51 ›› Issue (3): 14-19.doi: 10.11896/jsjkx.230800063

• Information Security Protection in New Computing Mode • Previous Articles     Next Articles

Contrastive Graph Learning for Cross-document Misinformation Detection

LIAO Jinzhi1, ZHAO Hewei1, LIAN Xiaotong1, JI Wenliang1, SHI Haiming1, ZHAO Xiang2   

  1. 1 College of Military Management,National Defense University,Beijing 100000,China
    2 College of System Engineering,National University of Defense Technology,Changsha 410072,China
  • Received:2023-08-09 Revised:2023-11-27 Online:2024-03-15 Published:2024-03-13
  • About author:LIAO Jinzhi,born in 1993,Ph.D,lectu-rer.His main research interests include natural language processing and know-ledge management.ZHAO Xiang,born in 1986,Ph.D,professor,Ph.D supervisor,is a member of CCF(No.39960S).His main research interests include knowledge graph and data analysis.
  • Supported by:
    National Key R&D Program of China(2022YFB3102600)and National Natural Science Foundation of China(72301284,62272469).

Abstract: Misinformation proliferates on the Internet,undermining the normal functioning of various industries.Detecting falsehoods accurately has therefore become an urgent challenge.Existing research on this task focuses primarily on three aspects:account traits,textual content,and multimodality.However,most methods overlook the key attribute of misinformation diffusion the novelty of content.They analyze the veracity of target claims in isolation,failing to capture public opinion dynamics.To address this issue,this paper proposes a cross-document misinformation detection framework called contrastive graph learning(CAL).CAL focuses on content novelty and comprises two key components:a contrastive learning module and a heterogeneous graph module.The former expands the representational difference between factual and false claims,and the latter encompasses five entity types:words,events,event sets,sentences,and documents.It injects semantic features of the public discourse into entity embeddings.We evaluate CAL on the IED,TL17,and Crisis datasets at both document and event levels.CGL achieves state-of-the-art performance,which verifies the efficacy of its design.It provides a robust solution for combating misinformation by mode-ling novelty and environmental context.

Key words: Cross-document misinformation detection, Contrastive learning, Heterogeneous graph, Event-level detection

CLC Number: 

  • TP391
[1]SOROUSH V,ROY D,ARAL S,et al.The spread of true andfalse news online [J].Science,2018,359:1146-1151.
[2]JIN Z,CAO J,ZHANG Y,et al.News verification by exploiting conflicting social viewpoints in microblogs [C]//AAAI Confe-rence on Artificial Intelligence.2016:2972-2978.
[3]WU L,RAO Y,JIN H,et al.Different absorption from the same sharing:Sifted multi-task learning for fake news detection [C]//Conference on Empirical Methods in Natural Language Proce-ssing(EMNLP).2019:4643-4652.
[4]TACCHINI E,BALLARIN G,VEDOVA M L,et al.Some like it hoax:Automated fake news detection in social networks [J].arXiv:1704.07506,2017.
[5]YANG W,LIANG G,XIE K.Rumor detection method based on burst topic detection and domain expert discovery [J].Journal of Computer Applications,2017,37(10):2799- 2805.
[6]SHU K,MAHUDESWARAN D,WANG S,et al.Fakenewsnet:A data repository with news content,social context,and spatiotemporal information for studying fake news on social media [J].Big Data,2020,8(3):171-188.
[7]GUPTA M,ZHAO P,HAN J.Evaluating event credibility ontwitter [C]//International Conference on Data Mining(ICDM).2012:153-164.
[8]ZHANG J,DONG B,YU P S.Fakedetector:Effective fake news detection with deep diffusive neural network [C]//International Conference on Data Engineering(ICDE).2020:1826-1829.
[9]XUE H,WANG L,YANG Y,et al.Rumor detection modelbased on user propagation network and message content [J].Journal of Computer Applications,2021,41(12):3540-3545.
[10]CONROY N,RUBIN V L,CHEN Y.Automatic deception detection:Methods for finding fake news [C]//Association for Information Science and Technology.2015:1-4.
[11]GAO M,CHEN F.Credibility evaluating method of Chinese microblog based on information fusion [J].Journal of Computer Applications,2016,36(8):2071-2075,2081.
[12]POTTHAST M,KIESEL J,REINARTZ K,et al.A stylometric inquiry into hyperpartisan and fake news [C]//Annual Meeting of the Association for Computational Linguistics(ACL).2018:231-240.
[13]SITAULA N,MOHAN C K,GRYGIEL J,et al.Credibilitybased fake news detection [J].arXiv:1911.00643,2019.
[14]LEE D,KIM Y,KIM H,et al.Fake News Detection Using Deep Learning [J].The Journal of Information Processing Systems,2019,15:1119-1130.
[15]VAIBHAV V,ANNASAMY R M,HOVY E H.Do sentence interactions matter? leveraging sentence level representations for fake news classification [C]//Workshop on Graph-Based Me-thods for Natural Language Processing.2019:134-139.
[16]PAN J Z,PAVLOVA S,LI C,et al.Content based fake news detection using knowledge graphs [C]//International Semantic Web Conference(ISWC).2018:669-683.
[17]WANG H,GONG L,ZHOU Z,et al.Dis-information from Social Media with Semantic Enhancement[J].Data Analysis and Knowledge Discovery,2023,7(2):48-60.
[18]SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large-Scale Image Recognition [C]//International Conference on Learning Representations(ICLR).2015.
[19]WANG Y,MA F,JIN Z,et al.EANN:Event Adversarial Neural Networks for Multimodal Fake News Detection [C]//ACM SIGKDD International Conference on Knowledge Discovery & Data Mining.2018:849-857.
[20]KHATTAR D,GOUD J S,GUPTA M,et al.MVAE:Multimodal Variational Autoencoder for Fake News Detection [C]//The Web Conference(WWW).2019:2915-2921.
[21]SINGHAL S,SHAH R R,CHAKRABORTY T,et al.Spot-Fake:A Multi-modal Framework for Fake News Detection [C]//International Conference on Multimedia Big Data.2019:39-47.
[22]WANG W Y.liar,liar pants on fire:A new benchmark dataset for fake news detection [C]//Annual Meeting of the Association for Computational Linguistics(ACL).2017:422-426.
[23]ZHANG H,FANG Q,QIAN S,et al.Multimodal Knowledge-aware Event Memory Network for Social Media Rumor Detection [C]//ACM International Conference on Multimedia(MM).2019:1942-1951.
[24]SONG C,NING N,ZHANG Y,et al.A Multimodal Fake News Detection Model Based on Crossmodal Attention Residual and Multichannel Convolutional Neural Networks [J].Information Process and Management(IPM),2021,58(1):102437.
[25]WANG Y,QIAN S,HU J,et al.Fake News Detection viaKnowledge-Driven Multimodal Graph Convolutional Networks [C]//International Conference on Multimedia Retrieval(ICMR).2020:540-547.
[26]LIU J,FENG K,JEFF Z P,et al.MSRD:Multi-Modal Web Rumor Detection Method[J].Journal of Computer Research and Development,2020,57(11):2328-2336.
[27]WU X,HUANG K,FUNG Y R,et al.Cross-document Misinformation Detection based on Event Graph Reasoning [C]//North American Chapter of the Association for Computational Linguistics(NAACL).2022:543-558.
[28]CHEN T,KORNBLITH S,NOROUZI M,et al.A SimpleFramework for Contrastive Learning of Visual Representations [C]//International Conference on Machine Learning(ICML).2020:1597-1607.
[29]LIN Y,JI H,HUANG F,et al.A joint neural model for information extraction with global features [C]//Annual Meeting of the Association for Computational Linguistics(ACL).2020:7999-8009.
[30]PAN X,ZHANG B,MAY J,et al.Cross lingual name tagging and linking for 282 languages [C]//Annual Meeting of the As sociation for Computational Linguistics(ACL).2017:1946-1958.
[31]LEE K,HE L,LEWIS M,et al.End-to-end neural coreference resolution [C]//Conference on Empirical Methods in Natural Language Processing(EMNLP).2017:188-197.
[32]LAI T,JI H,BUI T,et al.A context-dependent gated module for incorporating symbolic semantics into event coreference re-solution[C]//North American Chapter of the Association for Computational Linguistics:Human Language Technologies(NAACL).2021:3491-3499.
[33]WEN H,LIN Y,LAI T,et al.RESIN:A dockerized schemaguided cross-document cross-lingual cross-media in formation extraction and event tracking system [C]//North American Chapter of the Association for Computational Linguistics:Human Language Technologies:Demonstrations(NAACL).2021:133-143.
[34]DEVLIN J,CHANG M,LEE K,et al.BERT:pre-training of deep bidirectional transformers for language understanding [C]//North American Chapter of the Association for Computational Linguistics:Human Language Technologies(NAACL).2019:4171-4186.
[35]HAMILTON W L,YING Z,LESKOVEC J.Inductive representation learning on large graphs [C]//Annual Conference on Neural Information Processing Systems(NeurIPS).2017:1024-1034.
[36]KIPF T N,WELLING M.Semi-supervised classification withgraph convolutional networks [C]//International Conference on Learning Representations(ICLR).2017.
[37]SRIVASTAVA R K,GREFF K,SCHMIDHUBER J.Highway Networks[J].arXiv:1505.00387,2015.
[38]BORDES A,USUNIER N,GARCÌA-DURÁN A,et al.Translating embeddings for modeling multi-relational data [C]//Neural Information Processing Systems(NeurIPS).2013:2787-2795.
[39]LI M,LI S,WANG Z,et al.Future is not one-dimensional:Graph modeling based complex event schema induction for event prediction [J].arXiv:2104.06344,2021.
[40]TRAN G B,ALRIFAI M,NGUYEN D Q.Predicting relevant news events for timeline summaries [C]//International World Wide Web Conference(WWW).2013:91-92.
[41]KARIMI H,TANG J.Learning hierarchical discourse-levelstructure for fake news detection [C]//North American Chapter of the Association for Computational Linguistics:Human Language Technologies(NAACL).2019:3432-3442.
[42]ZELLERS R,HOLTZMAN A,RASHKIN H,et al.Defending against neural fake news [C]//Neural Information Processing Systems(NeurIPS).2019:9051-9062.
[1] CHEN Runhuan, DAI Hua, ZHENG Guineng, LI Hui , YANG Geng. Urban Electricity Load Forecasting Method Based on Discrepancy Compensation and Short-termSampling Contrastive Loss [J]. Computer Science, 2024, 51(4): 158-164.
[2] HUANG Kun, SUN Weiwei. Traffic Speed Forecasting Algorithm Based on Missing Data [J]. Computer Science, 2024, 51(3): 72-80.
[3] YANG Bo, LUO Jiachen, SONG Yantao, WU Hongtao, PENG Furong. Time Series Clustering Method Based on Contrastive Learning [J]. Computer Science, 2024, 51(2): 63-72.
[4] WU Jiawei, FANG Quan, HU Jun, QIAN Shengsheng. Pre-training of Heterogeneous Graph Neural Networks for Multi-label Document Classification [J]. Computer Science, 2024, 51(1): 143-149.
[5] XU Jie, WANG Lisong. Contrastive Clustering with Consistent Structural Relations [J]. Computer Science, 2023, 50(9): 123-129.
[6] HU Shen, QIAN Yuhua, WANG Jieting, LI Feijiang, LYU Wei. Super Multi-class Deep Image Clustering Model Based on Contrastive Learning [J]. Computer Science, 2023, 50(9): 192-201.
[7] LI Xiang, FAN Zhiguang, LIN Nan, CAO Yangjie, LI Xuexiang. Self-supervised Learning for 3D Real-scenes Question Answering [J]. Computer Science, 2023, 50(9): 220-226.
[8] WANG Mingxia, XIONG Yun. Disease Diagnosis Prediction Algorithm Based on Contrastive Learning [J]. Computer Science, 2023, 50(7): 46-52.
[9] SHAN Xiaohuan, SONG Rui, LI Haihai, SONG Baoyan. Event Recommendation Method with Multi-factor Feature Fusion in EBSN [J]. Computer Science, 2023, 50(7): 60-65.
[10] WU Jufeng, ZHAO Xungang, ZHOU Qiang, RAO Ning. Contrastive Learning for Low-light Image Enhancement [J]. Computer Science, 2023, 50(6A): 220600171-6.
[11] ZHANG Hu, ZHANG Guangjun. Document-level Event Extraction Based on Multi-granularity Entity Heterogeneous Graph [J]. Computer Science, 2023, 50(5): 255-261.
[12] HE Chao, CHEN Jinjie, JIN Zhao, LEI Yinjie. Automatic Modulation Recognition Method Based on Multimodal Time-Frequency Feature Fusion [J]. Computer Science, 2023, 50(4): 226-232.
[13] ZHANG Zouquan, ZHANG Hui, WU Tianyue, CHEN Tiancai. Continuous Dense Normalized Flow Model for Anomaly Detection in Industrial Images [J]. Computer Science, 2023, 50(12): 212-220.
[14] LIU Nan, ZHANG Fengli, YIN Jiaqi, CHEN Xueqin, WANG Ruijin. Rumor Detection Model on Social Media Based on Contrastive Learning with Edge-inferenceAugmentation [J]. Computer Science, 2023, 50(11): 49-54.
[15] XIE Zhuo, KANG Le, ZHOU Lijuan, ZHANG Zhihong. Clustering Method Based on Contrastive Learning for Multi-relation Attribute Graph [J]. Computer Science, 2023, 50(11): 62-70.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!