Geometric monitoring of heterogeneous streamsGeometric monitoring of heterogeneous streams Peer-Reviewed Journal Publication Δημοσίευση σε Περιοδικό με Κριτές 2015-11-012014enInterest in stream monitoring is shifting toward the distributed case. In many applications the data is high volume, dynamic, and distributed, making it infeasible to collect the distinct streams to a central node for processing. Often, the monitoring problem consists of determining whether the value of a global function, defined on the union of all streams, crossed a certain threshold. We wish to reduce communication by transforming the global monitoring to the testing of local constraints, checked independently at the nodes. Geometric monitoring (GM) proved useful for constructing such local constraints for general functions. Alas, in GM the constraints at all nodes share an identical structure and are thus unsuitable for handling heterogeneous streams. Therefore, we propose a general approach for monitoring heterogeneous streams (HGM), which defines constraints tailored to fit the data distributions at the nodes. While we prove that optimally selecting the constraints is NP-hard, we provide a practical solution, which reduces the running time by hierarchically clustering nodes with similar data distributions and then solving simpler optimization problems. We also present a method for efficiently recovering from local violations at the nodes. Experiments yield an improvement of over an order of magnitude in communication relative to GM.http://creativecommons.org/licenses/by/4.0/IEEE Transactions on Knowledge and Data Engineering2681890-1903 Keren Daniel Sagy Guy Abboud Amir Ben-David David Schuster Assaf Sharfman Izchak Deligiannakis Antonios Δεληγιαννακης Αντωνιος Institute of Electrical and Electronics Engineers Monitoring Vectors Distributed databases Optimization Correlation Nickel Data models