An improved algorithm to accelerate regular expression evaluation

  • Michela Becchi
  • , Patrick Crowley

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

191 Scopus citations

Abstract

Modern network intrusion detection systems need to perform regular expression matching at line rate in order to detect the occurrence of critical patterns in packet payloads. While deterministic finite automata (DFAs) allow this operation to be performed in linear time, they may exhibit prohibitive memory requirements. In [9], Kumar et al. propose Delayed Input DFAs (D 2FAs), which provide a trade-off between the memory requirements of the compressed DFA and the number of states visited for each character processed, which corresponds directly to the memory bandwidth required to evaluate regular expressions. In this paper we introduce a general compression technique that results in at most 2N state traversals when processing a string of length N. In comparison to the D2FA approach, our technique achieves comparable levels of compression, with lower provable bounds on memory bandwidth (or greater compression for a given bandwidth bound). Moreover, our proposed algorithm has lower complexity, is suitable for scenarios where a compressed DFA needs to be dynamically built or updated, and fosters locality in the traversal process. Finally, we also describe a novel alphabet reduction scheme for DFA-based structures that can yield further dramatic reductions in data structure size.

Original languageEnglish
Title of host publicationANCS'07 - Proceedings of the 2007 ACM Symposium on Architecture for Networking and Communications
Pages145-154
Number of pages10
DOIs
StatePublished - 2007
Event3rd ACM/IEEE Symposium on Architectures for Networking and Communications Systems, ANCS 2007 - Orlando, FL, United States
Duration: Dec 3 2007Dec 4 2007

Publication series

NameANCS'07 - Proceedings of the 2007 ACM Symposium on Architecture for Networking and Communications

Conference

Conference3rd ACM/IEEE Symposium on Architectures for Networking and Communications Systems, ANCS 2007
Country/TerritoryUnited States
CityOrlando, FL
Period12/3/0712/4/07

Keywords

  • deep packet inspection
  • DFA
  • regular expressions

Fingerprint

Dive into the research topics of 'An improved algorithm to accelerate regular expression evaluation'. Together they form a unique fingerprint.

Cite this