URI | http://purl.tuc.gr/dl/dias/55D94D1B-3273-412A-A1CF-EFCDEFAB35AB | - |
Identifier | http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.83.9573&rep=rep1&type=pdf | - |
Identifier | https://doi.org/10.1109/ICDE.2006.175 | - |
Language | en | - |
Size | 12 pages | en |
Title | XCluster synopses for structured XML content | en |
Creator | Polyzotis, Neoklis | en |
Creator | Garofalakis, Minos | en |
Creator | Γαροφαλακης Μινως | el |
Publisher | Institute of Electrical and Electronics Engineers | en |
Abstract | We tackle the difficult problem of summarizing the path/branching structure and value content of an XML database that comprises both numeric and textual values. We introduce a novel XML-summarization model, termed XCLUSTERs, that enables accurate selectivity estimates for the class of twig queries with numeric-range, substring, and textual IR predicates over the content of XML elements. In a nutshell, an XCLUSTER synopsis represents an effective clustering of XML elements based on both their structural and value-based characteristics. By leveraging techniques for summarizing XML-document structure as well as numeric and textual data distributions, our XCLUSTER model provides the first known unified framework for handling path/branching structure and different types of element values. We detail the XCLUSTER model, and develop a systematic framework for the construction of effective XCLUSTER summaries within a specified storage budget. Experimental results on synthetic and real-life data verify the effectiveness of our XCLUSTER synopses, clearly demonstrating their ability to accurately summarize XML databases with mixed-value content. To the best of our knowledge, ours is the first work to address the summarization problem for structured XML content in its full generality. | en |
Type | Πλήρης Δημοσίευση σε Συνέδριο (Conference Full Paper) | el |
Type | Conference Full Paper | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date | 2015-12-01 | - |
Publication Date | 2006 | - |
Subject Category | Data engineering | en |
Bibliographic Citation | N. Polyzotis and M. Garofalakis, "XCluster synopses for structured XML content", in 22nd International Conference on Data Engineering, 2006, doi: 10.1109/ICDE.2006.175 | en |