计算机科学 ›› 2020, Vol. 47 ›› Issue (3): 54-60.doi: 10.11896/jsjkx.191100085

所属专题: 智能软件工程

• 智能软件工程 • 上一篇    下一篇

基于强化学习的Web服务众测任务分派方法

唐文君,张佳丽,陈荣,郭世凯   

  1. (大连海事大学信息科学技术学院 辽宁 大连116026)
  • 收稿日期:2019-09-10 出版日期:2020-03-15 发布日期:2020-03-30
  • 通讯作者: 陈荣(rchen@dlmu.edu.cn)
  • 基金资助:
    国家自然科学基金(61672122,61902050,61602077);中央高校基本科研业务费专项基金(3132019355);赛尔创新项目(NGII20190627)

Web Service Crowdtesting Task Assignment Approach Based onReinforcement Learning

TANG Wen-jun,ZHANG Jia-li,CHEN Rong,GUO Shi-kai   

  1. (College of Information Science and Technology, Dalian Maritime University, Dalian, Liaoning 116026, China)
  • Received:2019-09-10 Online:2020-03-15 Published:2020-03-30
  • About author:TANG Wen-jun,born in 1994,Ph.D.Her research interests include crowdsourcing workflows,crowdsourcing task assignment and web service testing. CHEN Rong,born in 1969,Ph.D,professor,is a member of the IEEE and a member of the ACM.His research interests inculde software diagnosis,collective intelligence,activity recognition,Internet and mobile computing.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61672122, 61902050, 61602077), Fundamental Research Funds for the Central Universities of Ministry of Education of China (3132019355) and CERNET Innovation Project (NGII20190627).

摘要: 如何将众包测试任务分派给合适的众测工人,以较低的成本获得更好的测试结果,是一个重要问题。文中将CWS众测任务分派问题建模为一个基于马尔可夫决策过程的问题,且使用Deep Q Network进行学习和实时在线测试任务分派。该基于强化学习的方法被命名为WTA-C。此外,文中根据众测工人执行任务的历史时间,通过统计条件概率计算测试工人在任务期限内完成任务的概率,将其作为工人信誉值来反映工人质量,并在每次分派完成后对工人信誉值进行更新。实验结果显示,WTA-C在控制测试任务的“质量-成本”权衡和保证工人可靠度方面优于其他基于启发式策略的实时分派方法,并在分派效果上高于各启发式策略18%以上,从而证明了其可以更好地适应CWS的结构和众测环境的特点。

关键词: Web服务测试, 强化学习, 众包测试, 组合Web服务测试

Abstract: How to assign tasks to appropriate workers to get better testing results at a lower cost is an important problem.This paper modeled the CWS testing task assignment as a Markov decision process-based problem,and used Deep Q Network to learn and perform real-time online testing task assignment.The proposed approach based on reinforcement learning is named WTA-C.In addition,this paper calculated the probability of the testing worker completing the task within the duration through statistical conditional probability in accordance with the time of the worker’s historical execution of tasks,and used it as the workers’ reputation value to reflect their quality.The worker’s reputation is updated after each assignment.The experimental results show that WTA-C is superior to other real-time assignment methods based on heuristic strategies in controlling the “quality-cost” trade-off of testing tasks and ensuring worker quality,and its assignment effect is more than 18% higher than that of each heuristic strategy,which demonstrates that WTA-C can better adapt to the structure of the CWS and the characteristics of Crowdsourcing environment.

Key words: Composite Web service testing, Crowdtesting, Reinforcement learning, Web service testing

中图分类号: 

  • TP311.5
[1]HUSSAIN S,WANG Z,TOURE I,et al.Web Service Testing Tools:A Comparative Study [J],CoRR,2013,10(1):641-647.
[2]LIU X,HSIEH Y,CHEN R,et al.Distributed testing system for web service based on crowdsourcing [J].Complexity,2018:2170585:1-2170585:15.
[3]LIU D,BIAS R,LEASE M,et al.Crowdsourcing for Usability Testing[C]∥Association for Information Science and Technology.2012:1-10.
[4]RAHMAN H,ROY S,THIRUMURUGANATHAN S,et al. Task Assignment Optimization in Collaborative Crowdsourcing[C]∥IEEE International Conference on Data Mining.2015:949-954.
[5]KOMAROV S,REINECKE K,GAJOS K.Crowdsourcing per- formance evaluations of user interfaces[C]∥Computer Human Interaction.2013:207-216.
[6]IPEIROTIS P.Analyzing the Amazon Mechanical Turk Marketplace[J].CM Crossroads Student Magazine,2010,17(2):16-21.
[7]TIAN X,LI H,FENG L.Web Service Reliability Test Method Based on Log Analysis[C]∥Software Quality.Reliability and Security,2017:195-199.
[8]DU Z,MIAO H.Research review on web service composition testing[C]∥The International Workshop on Structured Object-oriented Formal Language and Method.2018:39-51.
[9]GUO S,LIU Y,CHEN R,et al.Improved SMOTE Algorithm to Deal with Imbalanced Activity Classes in Smart Homes[J].Neural Processing Letters,2019,50(2):1503-1526.
[10]GARDLO B.Quality of Experience Evaluation Methodology via Crowdsourcing[D].Žilina:University of Žilina,2012.
[11]GARDLO B,EGGER S,SEUFERT M,et al.Crowdsourcing 2.0:Enhancing execution speed and reliability of web-based QoE testing[C]∥IEEE International Conference on Communications.2014:1070-1075.
[12]BLANCO R,HALPIN H,HERZIG D,et al.Repeatable and reliable search system evaluation using crowdsourcing [C]∥International Conference on Research and Development in Information Retrieval.2011:923-932.
[13]SHERIEF N,JIANG N,HOSSEINI M,et al.Crowdsourcing software evaluation[C]∥Evaluation & Assessment in Software Engineering.2014:19:1-19:4.
[14]CHEN F,KIM S.Crowd debugging,in Joint Meeting on Foundations of Software Engineering [C]∥ESEC/SIGSOFT FSE.2015:320-332.
[15]PETRILLO F,SOH Z,KHOMH F,et al.Towards Understan- ding Interactive Debugging [C]∥IEEE International Conference on Software Quality.2016:152-163.
[16]PETRILLO F,LACERDA G,et al.Visualizing interactive and shared debugging sessions ∥IEEE Working Conference on Software Visualization.2015:140-144.
[17]GUAIANI F,MUCCINI H.Crowd and Laboratory Testing,Can They Co-exist? An Exploratory Study[C]∥International Conference on Software Engineering.2015:32-37.
[18]LIU Y,ZHANG T,LI K,et al.Evaluation model of mobile application crowdsourcing testers [J].Journal of Computer Applications,2017,37(12):3569-3573.
[19]ZHANG Z Q,PANG J S,XIE X Q,et al.Research on Crowdsourcing Quality Control Strategies and Evaluation Algorithm [J].Chinese Journal of Computers,2013,36(8):1636-1649.
[20]CHENG J,GE L Q,ZHANG T,et al.Research on factors affecting quality of mobile application crowdsourced testing [J].Journal of Computer Applications,2018,38(9):2626-2630.
[21]FENG J H,LI G L,GENG J H.A Survey on Crowdsourcing [J].Chinese Journal of Computers,2015,38(9):1713-1726.
[22]TUNG Y,TSENGE S.A novel approach to collaborative testing in a crowdsourcing environment [J].Journal of Systems and Software,2013,86(8):2143-2153.
[23]BOUTSIS I,KALOGERAKI V.On Task Assignment for Real-Time Reliable Crowdsourcing [C]∥IEEE International Conference on Distributed Computing Systems.2014:1-10.
[24]BOUTSIS I,KALOGERAKI V.Crowdsourcing under Real- Time Constraints [C]∥International Parallel (and Distributed) Processing Symposium.2013:753-764.
[25]HAN Y,SHEN Z,FAUVEL S,et al.Efficient scheduling in crowdsourcing based on workers’ mood[C]∥IEEE Internatio-nal Conference on Agents.2017:121-126.
[26]ROY S,LYKOURENTZOU I,THIRUMURUGANATHAN S,et al.Task assignment optimization in knowledge-intensive crowdsourcing [J].The VLDB Journal,2014,24(4):467-491.
[27]LI Y J,GUO J F,GOU X M.Software Task Allocation Method in Crowdsourcing [J].Computer Systems & Applications,2015,24(2):1-6.
[28]YAN M,SUN H,LIU X.iTest:testing software with mobile crowdsourcing[C]∥CrowdSoft@SIGSOFT FSE.2014:19-24.
[29]YAN M,SUN H,LIU X.Efficient Testing of Web Services with Mobile Crowdsourcing[C]∥Internetware.2015:157-165.
[30]CLAUSET A,SHALIZI C,NEWMAN M.Power-law distributions in empirical data[J].SIAM Review,2009,51(4):661-703.
[31]RUMMERY G,NIRANJAN M.On-line Q-learning using connectionist systems [D].University of Cambridge,1994.
[32]TIAN Y L,HUI L.Accelerated modify approach for initial state error iterative learning control in sense of Lebesgue-p norm[J].Control and Decision,2016,31(3):429-434.
[33]BARABÁSI A,ALBERT R.Emergence of scaling in random networks[J].Science,1999,286(5439):509-512.
[34]ZHENG Z,LYU M.Collaborative reliability prediction of service-oriented systems[C]∥ACM/IEEE International Conference on Software Engineering.2010:35-44.
[1] 刘兴光, 周力, 刘琰, 张晓瀛, 谭翔, 魏急波.
基于边缘智能的频谱地图构建与分发方法
Construction and Distribution Method of REM Based on Edge Intelligence
计算机科学, 2022, 49(9): 236-241. https://doi.org/10.11896/jsjkx.220400148
[2] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[3] 史殿习, 赵琛然, 张耀文, 杨绍武, 张拥军.
基于多智能体强化学习的端到端合作的自适应奖励方法
Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning
计算机科学, 2022, 49(8): 247-256. https://doi.org/10.11896/jsjkx.210700100
[4] 袁唯淋, 罗俊仁, 陆丽娜, 陈佳星, 张万鹏, 陈璟.
智能博弈对抗方法:博弈论与强化学习综合视角对比分析
Methods in Adversarial Intelligent Game:A Holistic Comparative Analysis from Perspective of Game Theory and Reinforcement Learning
计算机科学, 2022, 49(8): 191-204. https://doi.org/10.11896/jsjkx.220200174
[5] 于滨, 李学华, 潘春雨, 李娜.
基于深度强化学习的边云协同资源分配算法
Edge-Cloud Collaborative Resource Allocation Algorithm Based on Deep Reinforcement Learning
计算机科学, 2022, 49(7): 248-253. https://doi.org/10.11896/jsjkx.210400219
[6] 李梦菲, 毛莺池, 屠子健, 王瑄, 徐淑芳.
基于深度确定性策略梯度的服务器可靠性任务卸载策略
Server-reliability Task Offloading Strategy Based on Deep Deterministic Policy Gradient
计算机科学, 2022, 49(7): 271-279. https://doi.org/10.11896/jsjkx.210600040
[7] 谢万城, 李斌, 代玥玥.
空中智能反射面辅助边缘计算中基于PPO的任务卸载方案
PPO Based Task Offloading Scheme in Aerial Reconfigurable Intelligent Surface-assisted Edge Computing
计算机科学, 2022, 49(6): 3-11. https://doi.org/10.11896/jsjkx.220100249
[8] 洪志理, 赖俊, 曹雷, 陈希亮, 徐志雄.
基于遗憾探索的竞争网络强化学习智能推荐方法研究
Study on Intelligent Recommendation Method of Dueling Network Reinforcement Learning Based on Regret Exploration
计算机科学, 2022, 49(6): 149-157. https://doi.org/10.11896/jsjkx.210600226
[9] 郭雨欣, 陈秀宏.
融合BERT词嵌入表示和主题信息增强的自动摘要模型
Automatic Summarization Model Combining BERT Word Embedding Representation and Topic Information Enhancement
计算机科学, 2022, 49(6): 313-318. https://doi.org/10.11896/jsjkx.210400101
[10] 范静宇, 刘全.
基于随机加权三重Q学习的异策略最大熵强化学习算法
Off-policy Maximum Entropy Deep Reinforcement Learning Algorithm Based on RandomlyWeighted Triple Q -Learning
计算机科学, 2022, 49(6): 335-341. https://doi.org/10.11896/jsjkx.210300081
[11] 张佳能, 李辉, 吴昊霖, 王壮.
一种平衡探索和利用的优先经验回放方法
Exploration and Exploitation Balanced Experience Replay
计算机科学, 2022, 49(5): 179-185. https://doi.org/10.11896/jsjkx.210300084
[12] 阳真, 黄松, 郑长友.
基于区块链与改进CP-ABE的众测知识产权保护技术研究
Study on Crowdsourced Testing Intellectual Property Protection Technology Based on Blockchain and Improved CP-ABE
计算机科学, 2022, 49(5): 325-332. https://doi.org/10.11896/jsjkx.210900075
[13] 李鹏, 易修文, 齐德康, 段哲文, 李天瑞.
一种基于深度学习的供热策略优化方法
Heating Strategy Optimization Method Based on Deep Learning
计算机科学, 2022, 49(4): 263-268. https://doi.org/10.11896/jsjkx.210300155
[14] 欧阳卓, 周思源, 吕勇, 谭国平, 张悦, 项亮亮.
基于深度强化学习的无信号灯交叉路口车辆控制
DRL-based Vehicle Control Strategy for Signal-free Intersections
计算机科学, 2022, 49(3): 46-51. https://doi.org/10.11896/jsjkx.210700010
[15] 周琴, 罗飞, 丁炜超, 顾春华, 郑帅.
基于逐次超松弛技术的Double Speedy Q-Learning算法
Double Speedy Q-Learning Based on Successive Over Relaxation
计算机科学, 2022, 49(3): 239-245. https://doi.org/10.11896/jsjkx.201200173
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!