计算机科学 ›› 2023, Vol. 50 ›› Issue (2): 237-243.doi: 10.11896/jsjkx.220600203

• 人工智能 • 上一篇    下一篇

基于浏览器扩展的RPA系统跨域DOM拾取与自动化方案

易仁可, 蔡宇辉, 杨圣洪, 吴帆, 李肯立   

  1. 湖南大学计算机科学与工程学院 长沙 410082
  • 收稿日期:2022-06-22 修回日期:2022-11-14 出版日期:2023-02-15 发布日期:2023-02-22
  • 通讯作者: 蔡宇辉(rj_cyh@hnu.edu.cn)
  • 作者简介:(yrk@hnu.edu.cn)
  • 基金资助:
    高性能计算应用软件协同开发工具与环境研究(2017YFB0202201)

Cross-domain DOM Pickup and Automation Scheme of RPA System Based on Browser Extension

YI Renke, CAI Yuhui, YANG Shenghong, WU Fan, LI Kenli   

  1. School of Computer Science and Engineering,Hunan University,Changsha 410082,China
  • Received:2022-06-22 Revised:2022-11-14 Online:2023-02-15 Published:2023-02-22
  • Supported by:
    High-performance Computing Application Software Co-develop Tools and Environmental Research(2017YFB0202201)

摘要: 机器人流程自动化(Robotic Process Automation,RPA)是当今的研究热点之一,对网页元素的拾取和自动化是RPA的重要功能之一。RPA通过使用浏览器扩展向网页中注入脚本程序以处理网页,使用网页元素定位路径定位到目标节点进行自动化操作,在源网页中存在跨域frame时,由于同源策略的限制,注入源网页的脚本无法获取到目标节点的DOM对象,导致无法生成网页元素定位路径,从而无法自动化。文中提出了一种基于浏览器扩展的RPA系统设计方案,该方案能够对跨域DOM进行拾取和自动化。该方案在处理包含第三方跨域frame的网页时,将其看作与源网页frame地位相等的frame进行处理,网页元素定位路径设计为包含frame的url和网页元素Xpath的形式,以实现跨域网页元素拾取和自动化的功能。实验结果表明,该方案能够有效地对包含跨域资源的网页进行拾取和自动化操作,且同时支持chrome,firefox等支持浏览器扩展的浏览器。

关键词: 浏览器扩展, 跨域, RPA

Abstract: Robotic process automation(RPA) is one of today's research hotspots.The pickup and automation of web page elements is one of the important functions of RPA.RPA injects scripts into the web page to process web pages by using browser extensions,using web page element positioning path to locate to the target node for automated operations.When there is a cross-domain frame in the source web page,due to the limitations of the same-origin strategy,the script injected into the source web page can not obtain the DOM object of the target node,resulting in the inability to generate a web page element positioning path,so that it can not be automated.When processing a web page containing a third-party cross-domain frame,the scheme treats it as a frame process equal to the status of the source web page frame,and the web page element positioning path is designed to contain the url of the frame and the form of the web page element Xpath to achieve cross-domain web page element pickup and automation.Experiments show that the scheme can effectively pick up and automate the elements of cross-domain web pages,and support chrome,firefox,and other browsers that support browser extensions.

Key words: Browser extensions, Cross-domain, RPA

中图分类号: 

  • TP312
[1]VAJGEL B,CORRÊA P L P,DE SOUSA T T,et al.Development of intelligent robotic process automation:A utility case study in Brazil[J].IEEE Access,2021,9:71222-71235.
[2]HUANG S Q,LIU Y B,HUANG X S.Research on Automatic Testing Technology of Model Driven Development Tools[J].Computer Science,2021,48(6A):568-571.
[3]KEDZIORA D,SMOLANDER K.Responding to HealthcareEmergency Outbreak of COVID-19 Pandemic with Robotic Process Automation(RPA)[C]//HICSS.2022:1-10.
[4]TØMMERVÅG A S,BACH T,JÆGER B.Leveraging the competition:Robotic Process Automation(RPA) enabling competitive Small and Medium sized Auditing Firms[C]//2022 IEEE/SICE International Symposium on System Integration(SII).IEEE,2022:833-837.
[5]VILLAR A S,KHAN N.Robotic process automation in banking industry:a case study on Deutsche Bank[J].Journal of Banking and Financial Technology,2021,5(1):71-86.
[6]KOCH O,BUCHKREMER R,KNEISEL E.Graph Databasesand Robotic Process Automation:Achieving Improvement in Project Knowledge Management[C]//33rd Blede Conference Enabling Technology for a Sustainable Society.2020:171-184.
[7]BAIDYA A.Document Analysis and Classification:A Robotic Process Automation(RPA) and Machine Learning Approach[C]//2021 4th International Conference on Information and Computer Technologies(ICICT).IEEE,2021:33-37.
[8]LI X D,GU Y Q.DOM-based Information Extraction for theWeb Sources[J].Chinese Journal of Computers,2002(5):526-533.
[9]ZHANG J,LIU X F.Intelligent Discernment and AutomaticManipulation of Web-Page Controls[J].Computer Systems & Applications,2009,18(4):163-166.
[10]WANG H.Design and implementation of a breadth-first themed crawler[D].Shanghai:Fudan University,2011.
[11]SHARMA A,GULERIA K.A Framework for Hotel InventoryControl System for Online Travel Agency using Robotic Process Automation[C]//2021 International Conference on Advance Computing and Innovative Technologies in Engineering(ICACITE).IEEE,2021:764-768.
[12]YATSKIV S,VOYTYUK I,YATSKIV N,et al.Improvedmethod of software automation testing based on the robotic process automation technology[C]//2019 9th International Conference on Advanced Computer Information Technologies(ACIT).IEEE,2019:293-296.
[13]SHIDAGANTI G,SALIL S,ANAND P,et al.Robotic ProcessAutomation with AI and OCR to Improve Business Process[C]//2021 Second International Conference on Electronics and Sustainable Communication Systems(ICESC).IEEE,2021:1612-1618.
[14]MA Y W,LIN D P,CHEN S J,et al.System design and deve-lopment for robotic process automation[C]//2019 IEEE International Conference on Smart Cloud(SmartCloud).IEEE,2019:187-189.
[15]DO ROSÁRIO CABRITA M,PARGANA F,COSTA J.Robotic Process Automation implementation framework in a financial institution[C]//2021 16th Iberian Conference on Information Systems and Technologies(CISTI).IEEE,2021:1-9.
[16]DONG R,HUANG Z,LAM I I,et al.WebRobot:web robotic process automation using interactive programming-by-demonstration[C]//Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation.2022:152-167.
[17]WEWERKA J,MICUS C,REICHERT M.Seven Guidelines for Designing the User Interface in Robotic Process Automation[C]//2021 IEEE 25th International Enterprise Distributed Object Computing Workshop(EDOCW).Gold Coast,Australia,2021:157-165.
[18]MARAGATHASUNDARI S,MUTHANANTHAM M,VANALAKSHMI R,et al.Queuing Analysis in Robotic Process Automation[C]//2022 8th International Conference on Advanced Computing and Communication Systems(ICACCS).IEEE,2022:174-181.
[19]MARTINS P,SÁF,MORGADO F,et al.Using machine lear-ning for cognitive Robotic Process Automation(RPA) [C]//2020 15th Iberian Conference on Information Systems and Technologies (CISTI).2020:1-6.
[20]MARTINS P,SÁ F,MORGADO F,et al.Using machine learning for cognitive Robotic Process Automation(RPA) [C]//2020 15th Iberian Conference on Information Systems and Technologies(CISTI).2020:1-6.
[21]WANG Y S,QIN Y C,CAI Y H,et al.Design and Implementation of RPA System Based on UIA Interface [J].Computer Science,2022,49(8):225-229.
[22]LUO W,SHEN Q N,WU Z H,et al.State-of-the-art survey of research on browser's same origin policy security[J].Ruan Jian Xue Bao/Journal of Software,2021,32(8):2469-2504.
[1] 王岩松, 秦云川, 蔡宇辉, 李肯立.
一种基于UIA接口的RPA系统设计方法
Design and Implementation of RPA System Based on UIA Interface
计算机科学, 2022, 49(8): 225-229. https://doi.org/10.11896/jsjkx.211100046
[2] 陈彦冰, 钟超然, 周超然, 薛凌妍, 黄海平.
基于医疗联盟链的跨域认证方案设计
Design of Cross-domain Authentication Scheme Based on Medical Consortium Chain
计算机科学, 2022, 49(6A): 537-543. https://doi.org/10.11896/jsjkx.220200139
[3] 马冀, 林尚静, 李月颖, 庄琲, 贾睿, 田锦.
多源跨域数据融合的无线通信网络流量预测
Traffic Prediction for Wireless Communication Networks with Multi-source and Cross-domain Data Fusion
计算机科学, 2022, 49(11A): 210800165-7. https://doi.org/10.11896/jsjkx.210800165
[4] 江泽涛, 徐娟娟.
云环境下基于代理盲签名的高效异构跨域认证方案
Efficient Heterogeneous Cross-domain Authentication Scheme Based on Proxy Blind Signature in Cloud Environment
计算机科学, 2020, 47(11): 60-67. https://doi.org/10.11896/jsjkx.191100068
[5] 唐成华,张鑫,王璐,王宇,强保华.
一种跨域网络资源的安全互操作模型
Security Interoperation Model of Cross-domain Network Resources
计算机科学, 2016, 43(6): 141-145. https://doi.org/10.11896/j.issn.1002-137X.2016.06.029
[6] 周福才 周恩光 闫虹 苏晓曦.
基于不同口令认证的跨域组密钥协议

计算机科学, 2009, 36(3): 74-77.
[7] 刘天成,陶军,董永强,夏勤.
基于分层P2PSIP的移动终端跨域切换方案
Scheme of Cross-domain Hand-off Based on Hierarchical P2PSIP
计算机科学, 2009, 36(11): 47-51.
[8] .
基于移动Agent的网格跨域安全审计体系结构及实现

计算机科学, 2006, 33(6): 97-100.
[9] 吕志军 金毅 赖海光 黄皓 谢立.
DAPRA测试分析和IDS测试方法研究

计算机科学, 2004, 31(11): 73-76.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!