Computer Science ›› 2026, Vol. 53 ›› Issue (5): 119-128.doi: 10.11896/jsjkx.250600019

• Database & Big Data & Data Science • Previous Articles     Next Articles

Data Resource Organization Method Based on Enterprise Dataspace and Data Asset Management

LI Minbo1, WANG Shaohua2, WU Dazhen1   

  1. 1 College of Computer Science and Artificial Intelligence, Fudan University, Shanghai 200433, China
    2 Inspur Yunzhou Industrial Internet Co., Ltd., Jinan 250101, China
  • Received:2025-05-29 Revised:2025-08-11 Published:2026-05-08
  • About author:LI Minbo,born in 1970,Ph.D,associate professor,is a member of CCF(No.53905M).His main research interests include industrial big data and industrial AI.
    WANG Shaohua,born in 1984,senior engineer.His main research interests include industrial Internet,AI and supply chain collaboration and data ser-vices.
  • Supported by:
    National Key R & D Program of China(2023YFC3304400).

Abstract: Aiming at the problem of data resource islands caused by confidentiality hierarchical and block management in military research institute,and the difficulty of intelligent retrieval and knowledge reuse of data resources,a governance solution for multi-source heterogeneous enterprise data resources and data assets is proposed.The graph node association mapping between data resources is realized through the attribute graph model of enterprise data space,and a data resource knowledge graph integrating the BOM tree structure is constructed,covering the hierarchical relationship,attribute information and association relationship of R&D process,production and manufacturing,and quality inspection data.This paper proposes a novel RAG framework-HireRAG,and establishes a community-based hierarchical index of knowledge graph based on C-HNSW.The low-level retains fine-grained knowledge units,and the high-level community provides a global summary to handle retrieval at different levels.A graph-enhanced clustering algorithm is proposed to enable C-HNSW to better capture the semantic information in the know-ledge graph.Experiments demonstrate that HireRAG is more adapt at processing bill of materials(BOM) related data within enterprise data spaces compared to several existing advanced retrieval-augmented generation(RAG) frameworks.Furthermore,it achieves superior performance metrics in both retrieval recall and accuracy.The data asset management system ensures that the data assets are entered into the table in compliance with the whole process.

Key words: Enterprise dataspace, Data resources, Knowledge graph, Data query, Hierarchical Navigable Small World Graphs

CLC Number: 

  • TP391
[1]FRANKLIN M,HALEVY A,MAIER D.From databased todataspaces:A new abstraction for information management[J].SIGMOD Record,2005,34(4):27-33.
[2]FAN S H,HOU M S.Dataspace:A New Data Organization and Management Model[J].Computer Science,2023,50(5):115-127.
[3]DITTRICH J P,SALLES M A V.iDM:A unified and versatile data model for personal dataspace management[C]//Internatio-nal Conference on Very Large Data Bases(VLDB’06).2006:367-378.
[4]WEN B L,JIAO S J,GUO J.Research on Data OrganizationMethod Based on Enterprise DataSpace[J].Computer Technology and Development,2020,30(12):56-60.
[5]LI Y K,MENG X F,ZHANG X Y.Research on data space technology[J].Journal of Software,2008,19(8):2018-2031.
[6]YAN X F,YU P S,HAN J W.Graph Indexing:A frequent structure-based approach[C]//Proceedings of the 24th International Conference on Management of Data(SIGMOD 2004).New York:ACM,2004:335-346.
[7]HE H,SINGH A K.Closure-Tree:An index structure for graph queries[C]//Proceedings of the 22nd International Conference on Data Engineering(ICDE 2006).Dallas:IEEE Computer So-ciety,2006.
[8]HOLDER L,COOK D,DJOKO S.Substructure discovery in the subdue system[C]//Proceedings of the AAAI Workshop of Conference on Knowledge Discovery in Databases.Menlo Park:AAAI,1994:169-180.
[9]JIANG H L,WANG H X,YU P S,et al.GString:A novel approach for efficient search in graph databases[C]//Proceedings of the 23rd International Conference on Data Engineering(ICDE 2007).Dallas:IEEE Computer Society,2007:566-575.
[10]VAN BRUGGEN R,BATON J.Learning Neo4j 3.x-second edition[M]//Birmingham:Packt Publishing,2017:33-36.
[11]ELSAYED I,MUSLIMOVIC A,BREZANY P.Intelligent data spaces for science C WSEAS[C]//International Conference on Computational Intelligence,Man-Machine Systems and Cybernetics.2008:94-100.
[12]YANG D,SHEN D,NIE T,et al.Layered graph data model for data management of dataspace support platform[C]//International Conference on Web-Age Information Management.Berlin:Springer,2011:353-365.
[13]FRANKLIN M,HALEVY A,MAIER D.A first tutorial ondataspaces[C]//Proceedings of the VLDB Endowment.2008:1516-1517.
[14]XUE Q,ZENG X Q,OUYANG Z Y.Discussion on Key Issues of Data Asset Inclusion the Background of Digitalization and Intelligence[J].The Chinese Certified Public Accountant,2024,10:105-113.
[15]EDGE D,TRINH H,CHENG N,et al.From Local to Global:A Graph RAG Approach to Query-Focused Summarization[J].arXiv:2404.16130,2024.
[16]GUO Z R,XIA L H,YU Y H,et al.Lightrag:Simple and fast retrieval-augmented generation[J].arXiv:2410.05779,2024.
[17]GROVER A,LESKOVEC J.node2vec:Scalable feature learning for networks[C]//Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mi-ning.2016:855-864.
[18]XU X,YURUK N,FENG Z,et al.Scan:A structural clustering algorithm for networks[C]//Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2007:824-833.
[19]WANG C,WANG F,ONEGA T.Network optimization ap-proach to delineating health care service areas:Spatially constrained Louvain and Leiden algorithms[J].Transactions in GIS,2021,25(2):1065-1081.
[20]KOJIMA T,GU S S,REID M,et al.Large language models are zero-shot reasoners[J].Advances in Neural Information Processing Systems,2022,35:22199-22213.
[21]ROBERTSON S E,WALKER S.Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval[C]//Proceedings of the Seventeenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval(SIGIR’94).London:Springer,1994:232-241.
[22]SARTHI P,ABDULLAH S,TULI A,et al.Raptor:Recursiveabstractive processing for tree-organized retrieval[C]//The Twelfth International Conference on Learning Representations.2024.
[23]NUSSBAUM Z,MORRIS J X,DUDERSTADT B,et al.Nomic embed:Training a reproducible long context text embedder[J].arXiv:2402.01613,2024.
[1] HAN Linrui, ZHENG Ri, CONG Yingnan. Explainable Sentencing Prediction Method Driven by Sentencing Rule Knowledge Graph [J]. Computer Science, 2026, 53(5): 286-298.
[2] XIN Yichen, LI Shichong, CHEN Bin, CHENG Zhangtao, LI Ye, ZHOU Fan. Enhancing Temporal Knowledge Graph Reasoning Method with Graph Information Bottleneck and Transformer [J]. Computer Science, 2026, 53(4): 393-405.
[3] QIN Jing, LI Guanfeng, CHEN Yuyin, XIAO Yuhang. Embedding Model of Knowledge Graph via Jointly Modeling Ontology and Instances [J]. Computer Science, 2026, 53(3): 331-340.
[4] DU Jiantong, GUAN Zeli, XUE Zhe. Multi-task Learning-based Ophthalmic Video Feature Fusion and Multi-dimensional Profiling [J]. Computer Science, 2026, 53(3): 383-391.
[5] XI Penghui, WU Xiazhen, JIANG Wencong, FANG Liangda, HE Chaobo, GUAN Quanlong. Review of Personalized Educational Resource Recommendations [J]. Computer Science, 2026, 53(2): 1-15.
[6] ZHANG Jing, PAN Jinghao, JIANG Wenchao. Background Structure-aware Few-shot Knowledge Graph Completion [J]. Computer Science, 2026, 53(2): 331-341.
[7] CHEN Yuyin, LI Guanfeng, QIN Jing, XIAO Yuhang. Survey on Complex Logical Query Methods in Knowledge Graphs [J]. Computer Science, 2026, 53(2): 273-288.
[8] CHEN Zhuangzhuang, DENG Yichen, YU Dunhui, XIAO Kui. Cross-language Knowledge Graph Entity Alignment Based on Meta-learning [J]. Computer Science, 2026, 53(1): 271-277.
[9] CAI Qihang, XU Bin, DONG Xiaodi. Knowledge Graph Completion Model Using Semantically Enhanced Prompts and Structural Information [J]. Computer Science, 2025, 52(9): 282-293.
[10] WANG Dongsheng. Multi-defendant Legal Judgment Prediction with Multi-turn LLM and Criminal Knowledge Graph [J]. Computer Science, 2025, 52(8): 308-316.
[11] HONG Xinran, MA Jun, WANG Jing, ZHANG Chuang, YU Jie, LI Xiaoling, ZHANG Xueyan, YANG Yajing. Survey on Research of Compatibility Issues in Operating System for Software Ecology Evolution [J]. Computer Science, 2025, 52(7): 1-12.
[12] LUO Xuyang, TAN Zhiyi. Knowledge-aware Graph Refinement Network for Recommendation [J]. Computer Science, 2025, 52(7): 103-109.
[13] ZHENG Xinxin, CHEN Fan, SUN Baodan, GONG Jianguang, JIANG Junhui. Question Answering System for Soybean Planting Management Based on Knowledge Graph [J]. Computer Science, 2025, 52(6A): 240500025-8.
[14] HU Xin, DUAN Jiangli, HUANG Denan. Concept Cognition for Knowledge Graphs by Mining Double Granularity Concept Characteristics [J]. Computer Science, 2025, 52(6A): 240800047-6.
[15] LI Pengyan, WANG Baohui, YE Zihao. Study on Improvements of RippleNet Model Based on Representation Enhancement [J]. Computer Science, 2025, 52(6A): 240800142-9.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!