Computer Science ›› 2021, Vol. 48 ›› Issue (11): 170-175.doi: 10.11896/jsjkx.201100004

Special Issue: Big Data & Data Scinece

• Database & Big Data & Data Science • Previous Articles     Next Articles

Study on Multi-source Data Fusion Framework Based on Graph

KUANG Guang-sheng1,2, GUO Yan2, YU Xiao-ming2, LIU Yue2, CHENG Xue-qi2   

  1. 1 University of Chinese Academy of Sciences,Beijing 100049,China
    2 Key Laboratory of Network Data Science & Technology,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
  • Received:2020-11-02 Revised:2021-03-18 Online:2021-11-15 Published:2021-11-10
  • About author:KUANG Guang-sheng,born in 1995,postgraduate.His main research in-terests include natural language proces-sing and data fusion.
    GUO Yan,born in 1974,Ph.D,associate researcher.Her main research interests include network information acquisition and so on.
  • Supported by:
    National Key Research and Development Program of China(2017YFB0803302).

Abstract: When analyzing various data in a given task,most of current researches only analyze single-source data and lack me-thods applied to multi-source data.But now data are becoming more abundant,therefore,this paper proposes a multi-source data fusion framework for fusing data from multiple network platforms.The data of the same platform contains text and various attri-butes,and there are also great differences in content and form among data of different platforms.Most existing network information mining methods only use part of the data in the same platform for analysis,and even ignore the interaction between the data of different platforms.Therefore,this paper proposes a data fusion framework,which can not only use more features of the same platform to improve the performance of a single platform,but also fuse the data features of different platforms to complement each other,thereby improving the performance of multiple platforms.This paper uses the task of event classification,and the abundant features effectively improve the F1 value,which verifies the effectiveness of the proposed multi-source data framework.

Key words: Fusion representation, Graph fusion, Multi-source

CLC Number: 

  • TP391
[1]LE Q,MIKOLOV T.Distributed representations of sentences and documents[C]//Proceedings of International Conference on Machine Learning.China:PMLR,2014:1188-1196.
[2]PEROZZI B,AL-RFOU R,SKIENA S.Deepwalk:Online lear-ning of social representations[C]//Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Disco-very and Data Mining.New York:Association for Computing Machinery,2014:701-710.
[3]ZHANG J,WANG Y,LI K H,et al.Multi-source Sensor Body Area Network Data Fusion Model Based on Manifold Learning[J].Computer Science,2020,47(8):323-328.
[4]CAO S S,LU W,XU Q K.GraRep:Learning Graph Representations with Global Structural Information[C]//Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM '15).New York:Association for Computing Machinery,2015:891-900.
[5]TANG J,QU M,WANG M,et al.LINE:Large-scale information network embedding[C]//International Conference on World Wide Web.2015:1067-1077.
[6]GROVER A,LESKOVEC J.Node2vec:Scalable Feature Lear-ning for Networks[C]//ACM Sigkdd International Conference on Knowledge Discovery & Data Mining.ACM,2016.
[7]RIBEIRO L F R,SAVERESE P H P,FIGUEIREDO D R.Struc2vec:Learning node representations from structural identity[C]//Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2017:385-394.
[8]DONG M Y,CHAWLA N V,SWAMI A.metapath2vec:Scalable Representation Learning for Heterogeneous Networks[C]//the 23rd ACM SIGKDD International Conference.ACM,2017.
[9]DEFFERRARD M,BRESSON X,VANDERGHEYNST P.Convolutional neural networks on graphs with fast localized spectral filtering[J].Advances in Neural Information Processing Systems,2016,29:3844-3852.
[10]KIPF T N,WELLING M.Semi-supervised classification withgraph convolutional networks[C]//5th International Confe-rence on Learning Representations.2017.
[11]SCARSELLI F,GORI M,TSOI A C,et al.The graph neural network model[J].IEEE Transactions on Neural Networks,2008,20(1):61-80.
[12]VELIČKOVIC′ P,CUCURULL G,CASANOVA A,et al.Graph Attention Networks[C]//International Conference on Learning Representations.2018.
[13]LIU Z M.Research on network representation Learning Method based on heterogeneous information fusion [D].Information Engineering University,Strategic Support Forces,2018.
[14]YANG C,LIU Z Y,ZHAO D L,et al.Network representation learning with rich text information[C]//Proceedings of the 24th International Conference on Artificial Intelligence (IJCAI'15).Buenos Aires,Argentina:AAAI Press,2015 S:2111-2117.
[15]LIAO L,HE X,ZHANG H,et al.Attributed social network embedding[J].IEEE Transactions on Knowledge and Data Engineering,2018,30(12):2257-2270.
[16]HSIEH C J,CHIANG K Y,DHILLON I S.Low rank modeling of signed networks[C]//Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Beijing,China,2012:507-515.
[17]SUN B,SAENKO K.Deep CORAL:Correlation Alignment forDeep Domain Adaptation[C]//Lecture Notes in Computer Science(ECCV 2016).Workshops,ECCV.Cham:Springer,2016:443-450.
[18]YU C,WANG J,CHEN Y,et al.Transfer learning with dyna-mic adversarial adaptation network[C]//2019 IEEE International Conference on Data Mining (ICDM).IEEE,2019:778-786.
[19]LIU J,LI T,XIE P,et al.Urban big data fusion based on deep learning:An overview[J].Information Fusion,2020,53:123-133.
[20]ZHOU J,HONG X,JIN P.Information Fusion for Multi-Source Material Data:Progress and Challenges[J].Applied Sciences,2019,9(17):3473.
[21]YANG Z,LI Q,LU Z,et al.Dual structure constrained multimodal feature coding for social event detection from flickr data
[J].ACM Transactions on Internet Technology (TOIT),2017,17(2):1-20.
[22]LIN Z.YANG Z,SITU R,et al.Improving Maximum Classifier Discrepancy by Considering Joint Distribution for Domain Adaptation[C]//WISE 2018.Cham:Springer,2018:253-268.
[1] WU Zi-yi, LI Shao-mei, JIANG Meng-han, ZHANG Jian-peng. Ontology Alignment Method Based on Self-attention [J]. Computer Science, 2022, 49(9): 215-220.
[2] CHEN Jing, WU Ling-ling. Mixed Attribute Feature Detection Method of Internet of Vehicles Big Datain Multi-source Heterogeneous Environment [J]. Computer Science, 2022, 49(8): 108-112.
[3] WU Cheng-feng, CAI Li, LI Jin, LIANG Yu. Frequent Pattern Mining of Residents’ Travel Based on Multi-source Location Data [J]. Computer Science, 2021, 48(7): 155-163.
[4] LIU Zhi-xin, ZHANG Ze-hua, ZHANG Jie. Top-N Recommendation Method for Graph Attention Based on Multi-level and Multi-view [J]. Computer Science, 2021, 48(4): 104-110.
[5] JI Nan-xun, SUN Xiao-yan, LI Zhen-qi. Fusion Vectorized Representation Learning of Multi-source Heterogeneous User-generated Contents [J]. Computer Science, 2021, 48(10): 51-58.
[6] ZHUANG Yi, YANG Jia-hai. Multi-source Tree-based Scheduling Algorithm for Deadline-aware P2MP Inter-datacenter Transfers [J]. Computer Science, 2020, 47(7): 213-219.
[7] ZHANG Liang-cheng, WANG Yun-feng. Dynamic Adaptive Multi-radar Tracks Weighted Fusion Method [J]. Computer Science, 2020, 47(11A): 321-326.
[8] QIN Yi-xiu, WEN Yi-min, HE Qian. Multi-source Online Transfer Learning Algorithm for Classification of Data Streams with Concept Drift [J]. Computer Science, 2019, 46(1): 64-72.
[9] LIU Pan, LI Hua-kang and SUN Guo-zi. Risk Observing Method Based on Short-time Multi-source Regression Algorithm on P2P Platform [J]. Computer Science, 2018, 45(5): 97-101.
[10] JU Chun-hua, ZOU Jiang-bo, FU Xiao-kang. Design and Application of Big Data Credit Reporting Platform Integrating Blockchain Technology [J]. Computer Science, 2018, 45(11A): 522-526.
[11] LU Jia-wei, WANG Chen-hao, XIAO Gang and XU Jun. Research and Application of Cloud Push Platform Based on Multi-source and Heterogeneous Data [J]. Computer Science, 2016, 43(Z6): 533-537.
[12] LV Rui, WANG Shu-ying, SUN Lin-fu and PAN Hua. Research on Multi-source Heterogeneous Information Dynamic Integration Technology for Industrial-chain Coordination SaaS Platform [J]. Computer Science, 2016, 43(2): 19-25.
[13] YANG Yu-bo,CHENG Cheng-qi,HAO Ji-gang and LU Nan. Area-location Association and Comprehensive Expression of Multi-source Spatial Information Based on Global Subdivision Framework [J]. Computer Science, 2013, 40(5): 8-10.
[14] ZHU Zheng-Zhou, WU Zhong-Fu, WANG Qian (College of Computer Science and Technology, Chongqing University, Chongqing 400044). [J]. Computer Science, 2008, 35(2): 129-131.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!