Abstract
In this paper, we study the problem of filtering and querying massive XML data against a large set of XPath patterns in Univariate XPath. Building on XSIGMA, an efficient matching engine for linear XPath patterns with Boolean expressions over keywords, and on a twig evaluator over event streams, we propose Chimera, an XPath filtering/querying engine. Chimera runs fast and stably for any XPath pattern without the heavy preprocessing of the queried data that existing native XML databases (XMLDBs) and relational databases (RDBs) often rely on. Chimera also runs much faster than those engines against thousands of XPath patterns. We implemented Chimera and demonstrated its effectiveness in several experiments on artificial and real datasets.
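To make the notion of filtering an event stream against a set of linear XPath patterns concrete, the following is a minimal sketch. It is not XSIGMA or Chimera: it is an illustration that assumes patterns restricted to absolute child-axis paths (such as /site/item/name), with no descendant axis, keywords, or twig predicates. The function name filter_linear_paths and the sample document are hypothetical.

```python
import io
import xml.etree.ElementTree as ET


def filter_linear_paths(xml_text, patterns):
    """Return the subset of patterns matched by at least one element.

    Assumption: each pattern is an absolute child-axis path such as
    "/site/item/name". This is a simplification for illustration only.
    """
    wanted = set(patterns)
    matched = set()
    stack = []  # tags of the currently open elements, root first

    source = io.BytesIO(xml_text.encode("utf-8"))
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            stack.append(elem.tag)
            path = "/" + "/".join(stack)
            # One set lookup checks the current path against all patterns.
            if path in wanted:
                matched.add(path)
        else:
            stack.pop()
            elem.clear()  # drop the finished subtree to keep memory low
    return matched


doc = "<site><item><name>x</name></item><item><price>3</price></item></site>"
queries = ["/site/item/name", "/site/item/price", "/site/name"]
result = filter_linear_paths(doc, queries)
```

The document is read once as a sequence of start and end events, and no index over the data is built beforehand. Because every pattern is looked up in a single set, the per-event cost in this sketch does not grow with the number of patterns, which is the property that matters when thousands of patterns are registered.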
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Asai, T., Tago, Si., Inakoshi, H., Okamoto, S., Takeda, M. (2010). Chimera: Stream-Oriented XML Filtering/Querying Engine. In: Kitagawa, H., Ishikawa, Y., Li, Q., Watanabe, C. (eds) Database Systems for Advanced Applications. DASFAA 2010. Lecture Notes in Computer Science, vol 5982. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12098-5_32
DOI: https://doi.org/10.1007/978-3-642-12098-5_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12097-8
Online ISBN: 978-3-642-12098-5
eBook Packages: Computer Science (R0)