URI | http://purl.tuc.gr/dl/dias/55D94D1B-3273-412A-A1CF-EFCDEFAB35AB | - |
Identifier | http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.83.9573&rep=rep1&type=pdf | - |
Identifier | https://doi.org/10.1109/ICDE.2006.175 | - |
Language | en | - |
Extent | 12 pages | en |
Title | XCluster synopses for structured XML content | en |
Creator | Polyzotis, Neoklis | en |
Creator | Garofalakis, Minos | en |
Creator | Γαροφαλακης, Μινως | el |
Publisher | Institute of Electrical and Electronics Engineers | en |
Content Summary | We tackle the difficult problem of summarizing the path/branching structure and value content of an XML database that comprises both numeric and textual values. We introduce a novel XML-summarization model, termed XCLUSTERs, that enables accurate selectivity estimates for the class of twig queries with numeric-range, substring, and textual IR predicates over the content of XML elements. In a nutshell, an XCLUSTER synopsis represents an effective clustering of XML elements based on both their structural and value-based characteristics. By leveraging techniques for summarizing XML-document structure as well as numeric and textual data distributions, our XCLUSTER model provides the first known unified framework for handling path/branching structure and different types of element values. We detail the XCLUSTER model, and develop a systematic framework for the construction of effective XCLUSTER summaries within a specified storage budget. Experimental results on synthetic and real-life data verify the effectiveness of our XCLUSTER synopses, clearly demonstrating their ability to accurately summarize XML databases with mixed-value content. To the best of our knowledge, ours is the first work to address the summarization problem for structured XML content in its full generality. | en |
Type of Item | Πλήρης Δημοσίευση σε Συνέδριο | el |
Type of Item | Conference Full Paper | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date of Item | 2015-12-01 | - |
Date of Publication | 2006 | - |
Subject | Data engineering | en |
Bibliographic Citation | N. Polyzotis and M. Garofalakis, "XCluster synopses for structured XML content", in 22nd International Conference on Data Engineering, 2006, doi: 10.1109/ICDE.2006.175 | en |