We propose a novel index structure, termed XTrie, that supports the efficient filtering of XML documents based on XPath expressions. Our XTrie index structure offers several features that make it especially attractive for large-scale publish/subscribe systems. First, XTrie is designed to support effective filtering based on complex XPath expressions (as opposed to simple, single-path specifications). Second, our XTrie structure and algorithms are designed to support both ordered and unordered matching of XML data. Third, by indexing on sequences of element names organized in a trie structure and using a sophisticated matching algorithm, XTrie is able to both reduce the number of unnecessary index probes and avoid redundant matchings, thereby providing extremely efficient filtering. Our experimental results over a wide range of XML document and XPath expression workloads demonstrate that our XTrie index structure outperforms earlier approaches by wide margins.
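To make the idea of indexing sequences of element names in a trie concrete, the following is a minimal sketch and not the XTrie algorithm itself. It assumes only absolute, child-axis expressions such as `/catalog/book` (no wildcards, descendant steps, predicates, or ordered matching), and the names `TrieNode`, `insert`, and `match` are hypothetical, introduced here for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class TrieNode:
    # Maps an element name to the child trie node for that name.
    children: dict = field(default_factory=dict)
    # Ids of the expressions whose last element name ends at this node.
    subscriptions: set = field(default_factory=set)


def insert(root, sub_id, xpath):
    """Index a simple child-axis expression such as '/a/b/c' (assumed form)."""
    node = root
    for name in xpath.strip("/").split("/"):
        node = node.children.setdefault(name, TrieNode())
    node.subscriptions.add(sub_id)


def match(root, doc_path):
    """Return ids of expressions matched along one root-to-node path of element names."""
    matched, node = set(), root
    for name in doc_path:
        node = node.children.get(name)
        if node is None:
            break  # no indexed expression continues with this element name
        matched |= node.subscriptions
    return matched


# Illustrative subscriptions; shared prefixes are stored only once in the trie.
root = TrieNode()
insert(root, "s1", "/catalog/book")
insert(root, "s2", "/catalog/book/title")
insert(root, "s3", "/catalog/cd")

print(match(root, ["catalog", "book", "title"]))
```

Because expressions with a common prefix share trie nodes, one walk down a document path probes all of them together, which hints at how a trie can cut down on index probes. The complex expressions and ordered matching described in the abstract would need considerably more machinery than this sketch shows.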
Index Terms:
index structure, XPath, XML, publish-subscribe
Citation:
Chee-Yong Chan, Pascal Felber, Minos Garofalakis, Rajeev Rastogi, "Efficient Filtering of XML Documents with XPath Expressions," 18th International Conference on Data Engineering (ICDE'02), p. 235, 2002