计算机科学 ›› 2012, Vol. 39 ›› Issue (9): 15-19.

• 计算机网络与信息安全 • 上一篇    下一篇

基于DoLFA的高效正则表达式匹配算法

杜文超,陈庶樵,胡宇翔   

  1. (国家数字交换系统工程技术研究中心 郑州450002)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Efficient Regular Expression Matching Algorithm Based on DoLFA

  • Online:2018-11-16 Published:2018-11-16

摘要: 随着规则数量的急剧增长,表示正则表达式的DFA(Dctcrministic Finitc Automata,确定型有限自动机)容易 引起状态空间爆炸,难以满足高速网络的实时处理需求。提出一种高效的正则表达式匹配算法,该算法通过将正则表 达式分割为精确串、字符集合以及重复字符3个子集,分别对其进行分区优化及检测,然后再利用结点信息对匹配信 号进行连接,即构建一种特殊的状态机DoLFA(DividcoptimizcI_ink Finitc Automata)。理论分析和仿真结果表明, 该算法可以大大节省存储空间,并获得较高的吞吐量,且具有较强的扩展性。

关键词: 深度包检测,正则表达式,有限自动机,编码,计数器

Abstract: With the rapid increase of the number of rules, the DFA used to present regular expression often results in states explosion, so it is very hard to satisfy the requirement of high speed network online processing. This paper pro- posed an efficient regular expression matching algorithm, which first divides an expression into three subsets: exact string, character class and character repetition, and then optimizes and detects the corresponding blocks, at last links them together with auxiliary node data structure, namely constructing a special state machine DoLFA. Theoretical anal- ysis and simulation shows that this algorithm not only can save more memory space, but also provide high throughput performance and scalability.

Key words: Deep packet inspection, Regular expression, Finite automata, Coding, Counter

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!