URI | http://purl.tuc.gr/dl/dias/43865BF3-40E0-431C-A147-9F25C55B00E0 | - |
Identifier | http://www.vldb.org/conf/2002/S23P02.pdf | - |
Language | en | - |
Extent | 12 pages | en |
Title | Tree pattern aggregation for scalable XML data dissemination | en |
Creator | Chan Chee-Yong | en |
Creator | Wenfei Fan | en |
Creator | Felber Pascal | en |
Creator | Garofalakis Minos | en |
Creator | Γαροφαλακης Μινως | el |
Creator | Rastogi Rajeev | en |
Content Summary | With the rapid growth of XML-document traffic on the
Internet, scalable content-based dissemination of XML
documents to a large, dynamic group of consumers has
become an important research challenge. To indicate
the type of content that they are interested in, data
consumers typically specify their subscriptions using
some XML pattern specification language (e.g., XPath).
Given the large volume of subscribers, system scalability
and efficiency mandate the ability to aggregate the
set of consumer subscriptions to a smaller set of content
specifications, so as to both reduce their storagespace
requirements as well as speed up the documentsubscription
matching process. In this paper, we provide
the first systematic study of subscription aggregation
where subscriptions are specified with tree patterns
(an important subclass of XPath expressions). The
main challenge is to aggregate an input set of tree patterns
into a smaller set of generalized tree patterns such
that: (1) a given space constraint on the total size of the
subscriptions is met, and (2) the loss in precision (due
to aggregation) during document filtering is minimized.
We propose an efficient tree-pattern aggregation algorithm
that makes effective use of document-distribution
statistics in order to compute a precise set of aggregate
tree patterns within the allotted space budget. As part
of our solution, we also develop several novel algorithms
for tree-pattern containment and minimization,
as well as “least-upper-bound” computation for a set of
tree patterns. These results are of interest in their own
right, and can prove useful in other domains, such as
XML query optimization. Extensive results from a prototype
implementation validate our approach. | en |
Type of Item | Δημοσίευση σε Συνέδριο | el |
Type of Item | Conference Publication | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date of Item | 2015-12-01 | - |
Date of Publication | 2002 | - |
Subject | Databases | en |
Bibliographic Citation | C-Y Chan, W. Fan, P. Felber, M. Garofalakis and R. Rastogi, "Tree pattern aggregation for scalable XML data dissemination", in 28th International Conference on Very Large Data Bases, August 2002, pp. 826-837. | en |