计算机科学 ›› 2019, Vol. 46 ›› Issue (2): 11-17.doi: 10.11896/j.issn.1002-137X.2019.02.002

• 大数据与数据科学 • 上一篇    下一篇

质量嵌入的大数据产品生产系统超图模型及其生产线决策研究

王旸1, 蔡淑琴1, 邹新文1, 陈梓桐2   

  1. 华中科技大学管理学院 武汉4300741
    华中科技大学软件学院 武汉4300742
  • 收稿日期:2018-03-27 出版日期:2019-02-25 发布日期:2019-02-25
  • 通讯作者: 蔡淑琴(1955-),女,博士,教授,博士生导师,主要研究方向为信息系统、商务智能,E-mail:caishuqin@hust.edu.cn
  • 作者简介:王 旸(1986-),男,博士生,主要研究方向为信息质量、大数据产品,E-mail:wangyanghim@hust.edu.cn;邹新文(1995-),男,硕士生,主要研究方向为信息管理;陈梓桐(1995-),男,硕士生,主要研究方向为自然语言处理。
  • 基金资助:
    本文受国家自然科学基金项目(71071066,71371081)资助。

Quality-embedded Hypergraph Model for Big Data Product Manufacturing System and Decision for Production Lines

WANG Yang1, CAI Shu-qin1, ZOU Xin-wen1, CHEN Zi-tong2   

  1. School of Management,Huazhong University of Science and Technology,Wuhan 430074,China1
    School of Software Engineering,Huazhong University of Science and Technology,Wuhan 430074,China2
  • Received:2018-03-27 Online:2019-02-25 Published:2019-02-25

摘要: 大数据产品(Big Data Product,BDP)在原材料、用户需求、加工工艺等方面具有不同于实体产品的特征,而现有BDP生产系统的研究仍停留在概念模型阶段。为了解决该问题,提出BDP生产线的概念,基于生产线特征研究了生产线决策要素,强调了质量作为关键决策要素在BDP生产中的作用机理;采用超图理论建立了嵌入质量、质量传递函数和质量聚集函数的BDP生产系统模型,设计了BDP生产线决策流程;提出了供给侧稳定和需求侧稳定的BDP生产线决策模式。实例验证结果表明,所提出的模型和决策方法能够满足用户对BDP质量的要求。

关键词: 超图模型, 大数据产品, 生产线, 数据质量

Abstract: Due to the different characteristics of the Big Data Product (BDP) in terms of raw materials,user requirements,and processing techniques,research on BDP production system still stays at the conceptual model stage.To solve this problem,this paper purposed the concept of BDP production lines,discussed the decision factors for production line selection based on the characteristics of production line,and emphasized the action mechanism of quality as a key decision element in BDP production.This paper established the BDP production system model embedded with quality,quality transfer function and quality aggregation function by using the hypergraph theory,designed the decision processes for BDP production line,and proposed the decision mode of the BDP production line with supply-side-stable and demand-side-stable.The results show that the proposed model and decision method can meet users’ requirements for BDP quality.

Key words: Big data product, Data quality, Hypergraph model, Production line

中图分类号: 

  • TP391
[1]CHEN H,CHIANG R H L,STOREY V C.Business intelli- gence and analytics:From big data to big impact [J].MIS Quarterly,2012,36(4):1165-1188.
[2]JIN X,WAH B W,CHENG X,et al.Significance and challenges of big data research [J].Big Data Research,2015,2(2):59-64.
[3]WANG R Y,LEE Y W,PIPINO L L,et al.Manage your information as a product [J].Sloan Management Review,1998,39(4):95-105.
[4]DAVENPORT T H,KUDYBA S.Designing and developing ana- lytics-based data products [J].MIT Sloan Management Review,2016,58(1):83-89.
[5]CHAO L M,XING C X,ZHANG Y.Data science studies:State-of-the-art and trends [J].Computer Science,2018,45(1):1-13.(in Chinese)
朝乐门,邢春晓,张勇.数据科学研究的现状与趋势 [J].计算机科学,2018,45(1):1-13.
[6]BALLOU D,WANG R,PAZER H,et al.Modeling information manufacturing systems to determine information product quality [J].Management Science,1998,44(4):462-484.
[7]LEE Y,CHASE S,FISHER J,et al.CEIPMaps:Context-em- bedded information product maps[C]∥Proceedings of the 13th Americas Conference on Information Systems (AMCIS 2007).Association for Information Systems,2007.
[8]MEYER M H,ZACK M H.The design and development of information products [J].Sloan Management Review,1996,37(3):43-59.
[9]CAI S Q,MA Y T,XIAO Q,et al.Research on mapping for designing product family of user-generated content based on hypergraph design model [J].Journal of the China Society for Scientific and Technical Information,2011,30(4):387-394.(in Chinese)
蔡淑琴,马玉涛,肖泉,等.基于超图设计模型的用户创造内容产品族设计映射研究 [J].情报学报,2011,30(4):387-394.
[10]YAQOOB I,HASHEM I A,GANI A,et al.Big data:From beginning to future [J].International Journal of Information Mana-gement,2016,36(6):1231-1247.
[11]JURAN J M.Juran on quality by design:The new steps for planning quality into goods and services [M].New York:The Free Press,1992.
[12]LIU J,ZHAO S Z.Research on dynamic development model information product quality assesment [J].Computer Science,2015,42(1):244-248.(in Chinses) 刘婧,赵嵩正.信息产品质量测量动态衍变模型研究 [J].计算机科学,2015,42(1):244-248.
[13]HUANG D M,ZHAO D F,WEI L F,et al.Managing marine data as big data:uprising challenges and tentative solutions [J].Computer Science,2016,43(6):17-23.(in Chinese)
黄冬梅,赵丹枫,魏立斐,等.大数据背景下海洋数据管理的挑战与对策 [J].计算机科学,2016,43(6):17-23.
[14]SAHA B,SRIVASTAVA D.Data quality:The other face of big data[C]∥Proceedings of the IEEE 30th International Confe-rence on Data Engineering.IEEE,2014:1294-1297.
[15]PARSSIAN A,SARKAR S,JACOB V S.Assessing data quality for information products:Impact of selection,projection,and cartesian product [J].Management Science,2004,50(7):967-982.
[16]SHANKARANARAYANAN G,BLAKE R.From content to context:The evolution and growth of data quality research [J].Journal of Data and Information Quality,2017,8(2).
[17]KLEINDIENST D.The data quality improvement plan:Deciding on choice and sequence of data quality improvements [J].Electronic Markets,2017,27(4):387-398.
[18]WATSON H J.Tutorial:Big data analytics:Concepts,technologies,and applications [J].Communications of the Association for Information Systems,2014,34(1):1247-1268.
[19]WOODALL P.The data repurposing challenge:New pressures from data analytics [J].ACM Journal of Data and Information Quality,2017,8(3-4):11.
[20]RICCI F,LIOR R,BRACHA S.Introduction to recommender systems handbook [M].US:Springer,2011:1-35.
[21]PANG B,LILLIAN L.Opinion mining and sentiment analysis [J].Foundations and Trends in Information Retrieval,2008,2(1-2):1-135.
[22]LEE Y W,PIPINO L L,FUNK J D,et al.Journey to data quality [M].Cambridge,USA:The MIT Press,2006.
[23]BATINI C,CAPPIELLO C,FRANCALANCI C,et al.Metho- dologies for data quality assessment and improvement [J].ACM Computing Surveys,2009,41(3):1-52.
[24]TODORAN I G,LECORNU L,KHENCHAF A,et al.A metho- dology to evaluate important dimensions of information quality in systems [J].Journal of Data and Information Quality,2015,6(2-3):11:1-11:23.
[25]RAJAGOPALAN R,BATRA J.Design of cellular production systems a graph-theoretic approach [J].The International Journal of Production Research,1975,13(6):567-579.
[26]LIU P,LI Y,ZHANG K F.Layered orientation graph modeling method for assembly of complex products [J].Journal of Machine Design,2007,24(4):30-32.(in Chinese)
刘平,李原,张开富.复杂产品装配的分层有向图建模方法 [J].机械设计,2007,24(4):30-32.
[27]PURKAIT P,CHIN T J,SADRI A,et al.Clustering with hypergraphs:The case for large hyperedges [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(9):1697-711.
[28]KONSTAS I,LAPATA M.Unsupervised concept-to-text gene- ration with hypergraphs[C]∥Proceedings of the 2012 Confe-rence of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.Association for Computational Linguistics,2012:752-761.
[29]WANG Y,WU Y,CAI S.A hypergraph model for market opportunity discovery[C]∥Proceedings of the Fourth Internatio-nal Joint Conference on Computational Sciences and Optimization.IEEE,2011:322-326.
[30]CHENG W,WANG B,LI Z H,et al.Study on Optimization Matching Method Oriented to E-platform for Investment and Financing[J].Journal of Chongqing University of Technology(Natural Science),2011,25(5):75-81.(in Chinese)
成卫,王奔,李战华,等.一种面向电子商务投融资平台的优化匹配方法[J].重庆理工大学学报(自然科学),2011,25(5):75-81.
[1] 章菊, 李学鋆.
基于莱维萤火虫算法的智能生产线调度问题研究
Research on Intelligent Production Line Scheduling Problem Based on LGSO Algorithm
计算机科学, 2021, 48(6A): 668-672. https://doi.org/10.11896/jsjkx.210300118
[2] 郑小萌, 高猛, 滕俊元.
航天器软件缺陷预测数据集构建方法研究
Research on Construction Method of Defect Prediction Dataset for Spacecraft Software
计算机科学, 2021, 48(6A): 575-580. https://doi.org/10.11896/jsjkx.200900133
[3] 李卓, 徐哲, 陈昕, 李淑琴.
面向移动群智感知的位置相关在线多任务分配算法
Location-related Online Multi-task Assignment Algorithm for Mobile Crowd Sensing
计算机科学, 2019, 46(6): 102-106. https://doi.org/10.11896/j.issn.1002-137X.2019.06.014
[4] 蔡莉,梁宇,朱扬勇,何婧.
数据质量的历史沿革和发展趋势
History and Development Tendency of Data Quality
计算机科学, 2018, 45(4): 1-10. https://doi.org/10.11896/j.issn.1002-137X.2018.04.001
[5] 尚玉玲, 曹建军, 李红梅, 郑奇斌.
基于合作作者与隶属机构信息的同名排歧方法
Co-author and Affiliate Based Name Disambiguation Approach
计算机科学, 2018, 45(11): 220-225. https://doi.org/10.11896/j.issn.1002-137X.2018.11.034
[6] 黄冬梅,赵丹枫,魏立斐,杜艳玲,王振华.
大数据背景下海洋数据管理的挑战与对策
Managing Marine Data as Big Data:Uprising Challenges and Tentative Solutions
计算机科学, 2016, 43(6): 17-23. https://doi.org/10.11896/j.issn.1002-137X.2016.06.003
[7] 韩京宇,陈可佳.
基于事实抽取的Web文档内容数据质量评估
Ranking Data Quality of Web Article Content by Extracting Facts
计算机科学, 2014, 41(11): 247-251. https://doi.org/10.11896/j.issn.1002-137X.2014.11.047
[8] 曹建军,刁兴春,陈 爽,邵衍振.
数据清洗及其一般性系统框架
Data Cleaning and its General System Framework
计算机科学, 2012, 39(Z11): 207-211.
[9] 林印华,张春海,刘 洁.
基于清洗规则和主数据的数据修复算法实现
Realization of Data Cleaning Based on Editing Rules and Master Data
计算机科学, 2012, 39(Z11): 174-176.
[10] 徐俊刚,裴莹.
数据ETL研究综述
Overview of Data Extraction, Transformation and Loading
计算机科学, 2011, 38(4): 15-20.
[11] 丁剑洁,郝克刚,侯红,郭小群.
基于度量的软件生产线管理研究
Metric-based Research of Software Product Line Management
计算机科学, 2011, 38(1): 156-157.
[12] 曹建军,刁兴春,汪挺,王芳潇.
领域无关数据清洗研究综述
Research on Domain-independent Data Cleaning: A Survey
计算机科学, 2010, 37(5): 26-29.
[13] 陈卫东,张维明.
属性粒度数据质量模型及其评价指标研究
Data Quality Model and Metrics Research at Attribute Granularity
计算机科学, 2010, 37(5): 139-142.
[14] 王晓斌,郭长国,王怀民.
一种基于Internet的分布式软件生产线框架
Internet-based Distributed Software Production Line Framework
计算机科学, 2010, 37(4): 125-.
[15] 胡艳丽,张维明.
条件依赖理论及其应用展望
Theory of Conditional Functional Dependencies and its Application for Improving Data Quality
计算机科学, 2009, 36(12): 115-118.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!