In this paper, we describe the design, architecture, and the lessons learned from the implementation of a fast regular expression indexing engine FREE. FREE uses a pre-built index to identify the text data units which may contain a matching string and only examines these further. In this way, FREE shows orders of magnitude performance improvement in certain cases over standard regular expression matching systems, such as lex, awk and grep.
Index Terms:
regular expression, multigram index, index
Citation:
Junghoo Cho, Sridhar Rajagopalan, "A Fast Regular Expression Indexing Engine," icde, pp.0419, 18th International Conference on Data Engineering (ICDE'02), 2002