Computer Science, 2019, Vol. 46, Issue (2): 152-158. doi: 10.11896/j.issn.1002-137X.2019.02.024

• Software & Database Technology •

Analysis on Behavior Characteristics of Developers in Github

LI Cun-yan, HONG Mei

  1. College of Computer, Sichuan University, Chengdu 610065, China
  • Received: 2017-12-23 Online: 2019-02-25 Published: 2019-02-25
  • Corresponding author: HONG Mei (born 1963), female, professor, master's supervisor, CCF senior member. Main research interests: software engineering, software quality assurance and testing. E-mail: hongmei@scu.edu.cn
  • About the author: LI Cun-yan (born 1992), female, master's student. Main research interests: software quality assurance and testing. E-mail: 472287674@qq.com

Abstract: Analyzing the behavior characteristics of developers in open source environments is an important problem for promoting collaborative development in open source communities. Taking data from the GitHub open source community as the research object and using visual analysis techniques, this paper analyzes the factors that influence developers' contribution degree on GitHub, explores the collaborative relationships among developers, and further examines the relationship between the regions developers belong to and their collaboration. The results reveal several phenomena and conclusions of theoretical and practical value, and show some behavioral characteristics of developers from a new perspective.

Key words: Behavioral characteristic, Collaboration, Contribution degree, Visualization

CLC Number: TP311