| URI | http://purl.tuc.gr/dl/dias/55D94D1B-3273-412A-A1CF-EFCDEFAB35AB | - |
| Identifier | http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.83.9573&rep=rep1&type=pdf | - |
| Identifier | https://doi.org/10.1109/ICDE.2006.175 | - |
| Language | en | - |
| Extent | 12 pages | en |
| Title | XCluster synopses for structured XML content | en |
| Creator | Polyzotis, Neoklis | en |
| Creator | Garofalakis, Minos | en |
| Creator | Γαροφαλακης, Μινως | el |
| Publisher | Institute of Electrical and Electronics Engineers | en |
| Content Summary | We tackle the difficult problem of summarizing the path/branching structure and value content of an XML database that comprises both numeric and textual values. We introduce a novel XML-summarization model, termed XCLUSTERs, that enables accurate selectivity estimates for the class of twig queries with numeric-range, substring, and textual IR predicates over the content of XML elements. In a nutshell, an XCLUSTER synopsis represents an effective clustering of XML elements based on both their structural and value-based characteristics. By leveraging techniques for summarizing XML-document structure as well as numeric and textual data distributions, our XCLUSTER model provides the first known unified framework for handling path/branching structure and different types of element values. We detail the XCLUSTER model, and develop a systematic framework for the construction of effective XCLUSTER summaries within a specified storage budget. Experimental results on synthetic and real-life data verify the effectiveness of our XCLUSTER synopses, clearly demonstrating their ability to accurately summarize XML databases with mixed-value content. To the best of our knowledge, ours is the first work to address the summarization problem for structured XML content in its full generality. | en |
| Type of Item | Πλήρης Δημοσίευση σε Συνέδριο | el |
| Type of Item | Conference Full Paper | en |
| License | http://creativecommons.org/licenses/by/4.0/ | en |
| Date of Item | 2015-12-01 | - |
| Date of Publication | 2006 | - |
| Subject | Data engineering | en |
| Bibliographic Citation | N. Polyzotis and M. Garofalakis, "XCluster synopses for structured XML content", in 22nd International Conference on Data Engineering, 2006, doi: 10.1109/ICDE.2006.175 | en |