计算机科学 ›› 2024, Vol. 51 ›› Issue (11): 30-38.doi: 10.11896/jsjkx.240700004
于泳欣1,2, 纪科1,2, 高源1,2, 陈贞翔1,2, 马坤1,2, 赵晓凡3,4
YU Yongxin1,2, JI Ke1,2, GAO Yuan1,2, CHEN Zhenxiang1,2, MA Kun1,2, ZHAO Xiaofan3,4
摘要: 社交媒体平台上充斥着大量未经验证的信息,这些信息大多为不同来源的异构数据,其传播范围之广、速度之快,对个人和社会造成了严重危害。因此,有效检测和防范虚假新闻至关重要。针对当前虚假新闻检测模型局限于从单一数据来源获取新闻文本及视觉信息,导致新闻报道主观性较强、数据覆盖不全面的问题,提出了一种多源异构数据渐进式融合的虚假新闻检测模型。首先,进行多源异构数据的收集、筛选和清洗,由此构建了一个多源多模态数据集,其中包含关于每个事件的多个不同角度的报道;接着,通过将文本特征提取器和视觉特征提取器获取的特征输入多源融合模块,实现了不同来源特征之间的渐进式融合;同时,引入文本的情感特征和图像的频域特征,以实现多层次的特征提取;最后,采用软注意力机制进行特征集成。实验结果和分析表明,与已有的流行方法相比,所提模型有较好的检测效果,为大数据时代的虚假新闻检测提供了一种有效的解决方案。
中图分类号:
[1] KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenetclassification with deep convolutional neural networks[J].Communication of the ACM,2017,60(6):84-90. [2] ELMAN J L.Finding structure in time[J].Cognitive Science,1990,14(2):179-211. [3] DEVLIN J,CHANG M W,LEE K,et al.Bert:Pre-training of deep bidirectional transformers for language understanding[J].arXiv:1810.04805,2018. [4] RADFORD A,NARASIMHAN K,SALIMANS T,et al.Improving language understanding by generative pre-training[J/OL].https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf. [5] KIM Y.Convolutional neural networks for sentence classification[J].arXiv:1408.5882,2014. [6] SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[J].arXiv:1409.1556,2014. [7] LIU Z,LIN Y,CAO Y,et al.Swin transformer:Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:10012-10022. [8] HUA J,CUI X,LI X,et al.Multimodal fake news detectionthrough data augmentation-based contrastive learning[J].Applied Soft Computing,2023,136:110125. [9] MAHAJAN S,ROTH S.Diverse image captioning with context-object split latent spaces[J].Advances in Neural Information Processing Systems,2020,33:3613-3624. [10] LI Y,JI K,MA K,et al.Fake News Detection Based on the Correlation Extension of Multimodal Information[C]//Asia-Pacific Web(APWeb) and Web-Age Information Management(WAIM) Joint International Conference on Web and Big Data.Cham:Springer Nature Switzerland,2022:443-450. [11] SU J.Simbert:Integrating retrieval and generation into bert[J/OL].Tech.Rep.https://github.com/Zhuiyi Technology/simbert. [12] MA J,GAO W,MITRA P,et al.Detecting rumors from micro-blogs with recurrent neural networks[C]//Proceedings of IJCAI.2016:3818-3824. [13] YU F,LIU Q,WU S,et al.A Convolutional Approach for Misinformation Identification[C]//IJCAI.2017:3901-3907. [14] MA J,GAO W,WONG K F.Detect rumor and stance jointly by neural multi-task learning[C]//Companion Proceedings of the Web Conference 2018.2018:585-593. [15] VAIBHAV V,ANNASAMMY R M,HOVY E.Do sentence interactions matter? leveraging sentence level representations for fake news classification[J].arXiv:1910.12203,2019. [16] JIAN W,LI J P,AKBAR A,et al.SA-Bi-LSTM:Self Attention With Bi-Directional LSTM based Intelligent Model for Accurate Fake News Detection to ensured information integrity on social media platforms[J].IEEE Access,2024,12:48436-48452. [17] SUN L,WANG H.Topic-Aware Fake News Detection Based on Heterogeneous Graph[J].IEEE Access,2023,11:103743-103752. [18] QI P,CAO J,YANG T,et al.Exploiting multi-domain visual information for fake news detection[C]//2019 IEEE International Conference on Data Mining(ICDM).IEEE,2019:518-527. [19] WANG Y,MA F,JIN Z,et al.Eann:Event adversarial neural networks for multi-modal fake news detection[C]//Proceedings of the 24th ACM Sigkdd International Conference on Knowledge Discovery & Data Mining.2018:849-857. [20] KHATTAR D,GOUD J S,GUPTA M,et al.Mvae:Multimodal variational autoencoder for fake news detection[C]//The World Wide Web Conference.2019:2915-2921. [21] ZHOU X,WU J,ZAFARANI R.Safe:Similarity-aware multi-modal fake news detection.arxiv[J].arXiv:2003.04981,2020. [22] WU Y,ZHAN P,ZHANG Y,et al.Multimodal fusion with co-attention networks for fake news detection[C]//Findings of the Association for Computational Linguistics:ACL-IJCNLP 2021.2021:2560-2569. [23] QIAN S,WANG J,HU J,et al.Hierarchical multi-modal contextual attention network for fake news detection[C]//Procee-dings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval.2021:153-162. [24] GUO Y,GE H,LI J.A two-branch multimodal fake news detection model based on multimodal bilinear pooling and attention mechanism[J].Frontiers in Computer Science,2023,5:1159063. [25] JING J,WU H,SUN J,et al.Multimodal fake news detection via progressive fusion networks[J].Information Processing & Management,2023,60(1):103120. [26] GUO Y.A mutual attention based multimodal fusion for fake news detection on social network[J].Applied Intelligence,2023,53(12):15311-15320. [27] LIU Y,BING W,REN S,et al.BC-FND:An Approach Based on Hierarchical Bilinear Fusion and Multimodal Consistency for Fake News Detection[J].IEEE Access,2024,12:62738-62749. [28] PENG L,JIAN S,KAN Z,et al.Not all fake news is semantically similar:Contextual semantic representation learning for multi-modal fake news detection[J].Information Processing & Ma-nagement,2024,61(1):103564. [29] WANG J,ZHENG J,YAO S,et al.TLFND:A Multimodal Fusion Model Based on Three-Level Feature Matching Distance for Fake News Detection[J].Entropy,2023,25(11):1533. [30] MIHALCEA R,TARAU P.Textrank:Bringing order into text[C]//Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing.2004:404-411. [31] YU Z,YU J,FAN J,et al.Multi-modal factorized bilinear pooling with co-attention learning for visual question answering[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:1821-1830. [32] HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778. [33] MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space[J].arXiv:1301.3781,2013. [34] JIN Z,CAO J,GUO H,et al.Multimodal fusion with recurrent neural networks for rumor detection on microblogs[C]//Proceedings of the 25th ACM International Conference on Multimedia.2017:795-816. |
|