URI | http://purl.tuc.gr/dl/dias/3FB1CA9C-4EF7-4928-A2A0-DCDF08682CD9 | - |
Αναγνωριστικό | https://doi.org/10.1145/882082.882084 | - |
Αναγνωριστικό | http://dl.acm.org/citation.cfm?id=882084 | - |
Γλώσσα | en | - |
Τίτλος | Analyzing massive data streams: past, present, and future | en |
Δημιουργός | Garofalakis Minos | en |
Δημιουργός | Γαροφαλακης Μινως | el |
Εκδότης | Association for Computing Machinery | en |
Περίληψη | Continuous data streams arise naturally, for example, in the installations of large telecom and Internet service providers where detailed usage information (Call-Detail-Records, SNMP-/RMON packet-flow data, etc.) from different parts of the underlying network needs to be continuously collected and analyzed for interesting trends. Such environments raise a critical need for effective stream-processing algorithms that can provide (typically, approximate) answers to data-analysis queries while utilizing only small space (to maintain concise stream synopses) and small processing time per stream item. In this talk, I will discuss the basic pseudo-random sketching mechanism for building stream synopses and our ongoing work that exploits sketch synopses to build an approximate SQL (multi) query processor. I will also describe our recent results on extending sketching to handle more complex forms of queries and streaming data (e.g., similarity joins over streams of XML trees), and try to identify some challenging open problems in the data-streaming area. | en |
Τύπος | Δημοσίευση σε Συνέδριο | el |
Τύπος | Conference Publication | en |
Άδεια Χρήσης | http://creativecommons.org/licenses/by/4.0/ | en |
Ημερομηνία | 2015-12-01 | - |
Ημερομηνία Δημοσίευσης | 2003 | - |
Θεματική Κατηγορία | Data mining | en |
Θεματική Κατηγορία | Knowledge discovery | en |
Βιβλιογραφική Αναφορά | M. N. Garofalakis, "Analyzing massive data streams: past, present, and future", in 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 13 June 2003. doi: 10.1145/882082.882084 | en |