<efrbr:recordSet xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:efrbr="http://vfrbr.info/efrbr/1.1" xmlns:efrbr-work="http://vfrbr.info/efrbr/1.1/work" xmlns:efrbr-expression="http://vfrbr.info/efrbr/1.1/expression" xmlns:efrbr-manifestation="http://vfrbr.info/efrbr/1.1/manifestation" xmlns:efrbr-person="http://vfrbr.info/efrbr/1.1/person" xmlns:efrbr-corporateBody="http://vfrbr.info/efrbr/1.1/corporateBody" xmlns:efrbr-concept="http://vfrbr.info/efrbr/1.1/concept" xmlns:efrbr-structure="http://vfrbr.info/efrbr/1.1/structure" xmlns:efrbr-responsible="http://vfrbr.info/efrbr/1.1/responsible" xmlns:efrbr-subject="http://vfrbr.info/efrbr/1.1/subject" xmlns:efrbr-other="http://vfrbr.info/efrbr/1.1/other" xsi:schemaLocation="http://vfrbr.info/efrbr/1.1 http://vfrbr.info/schemas/1.1/efrbr.xsd"><efrbr:entities><efrbr-work:work identifier="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489"><efrbr-work:titleOfTheWork>Συντονισμός κάλυψης σε δίκτυα αισθητήρων μέσω ενισχυτικής μάθησης</efrbr-work:titleOfTheWork></efrbr-work:work><efrbr-expression:expression identifier="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489"><efrbr-expression:titleOfTheExpression>Συντονισμός κάλυψης σε δίκτυα αισθητήρων μέσω ενισχυτικής μάθησης</efrbr-expression:titleOfTheExpression><efrbr-expression:titleOfTheExpression>Coordinated coverage in sensor networks via reinforcement learning</efrbr-expression:titleOfTheExpression><efrbr-expression:formOfExpression vocabulary="DIAS:TYPES">
            Διπλωματική Εργασία
            Diploma Work
         </efrbr-expression:formOfExpression><efrbr-expression:dateOfExpression type="issued">2018-09-14</efrbr-expression:dateOfExpression><efrbr-expression:dateOfExpression type="published">2018</efrbr-expression:dateOfExpression><efrbr-expression:languageOfExpression vocabulary="iso639-1">en</efrbr-expression:languageOfExpression><efrbr-expression:summarizationOfContent>Machine Learning is a fast developing and ever growing field in computer science. In addition to that, Sensor Networks are also a very promising field that has significant impact on a variety of applications. Given these facts, a multi-agent system (MAS) approach on wireless sensor networks (WSNs) comprising sensor-actuator nodes is very promising, as it has the potential to tackle the resource constraints inherent in these networks by efficiently coordinating the activities among the nodes. Furthermore, a very common issue in the field of sensor networks is the sensing coverage problem, which is the task of properly and sufficiently covering an area. In this thesis, we consider the coordinated sensing coverage problem and study the behavior and performance of the fully distributed Q-Learning algorithm for reinforcement learning using linear value function approximation. We use the Tossim platform to simulate our TinyOS application, which consists of different topologies of sensor networks with parametric sizes. Subsequently, we present the results of our simulation and display a number of graphs to visualize performance and learning outcomes on three specific topologies. We consider issues, such as successful convergence to optimal policies and maximization of local and global rewards. The implementation results are quite promising, since our algorithms exhibit high percentage of successful convergence to optimal policies.</efrbr-expression:summarizationOfContent><efrbr-expression:summarizationOfContent>Η μηχανική μάθηση είναι ένα ταχύτατα και διαρκώς αναπτυσσόμενο πεδίο στην επιστήμη των υπολογιστών. Εκτός από αυτό, τα δίκτυα αισθητήρων είναι επίσης ένα πολλά υποσχόμενο πεδίο που έχει σημαντική επίδραση σε μία ποικιλία από εφαρμογές. Βάσει των παραπάνω, μία προσέγγιση πολυπρακτορικού συστήματος (MAS) σε ασύρματα δίκτυα αισθητήρων (WSNs) που περιλαμβάνει αισθητήρες-ενεργοποιητές κόμβους είναι πολλά υποσχόμενη, καθώς μπορεί δυνητικά να αντιμετωπίσει τους περιορισμούς σε πόρους που είναι έμφυτοι σε αυτά τα δίκτυα με το να συντονίζει αποδοτικά τις δραστηριότητες μεταξύ των κόμβων. Επιπλέον, ένα κοινό θέμα στο πεδίο των δικτύων αισθητήρων είναι το πρόβλημα της συντονισμένης κάλυψης, στο οποίο καλείται κάποιος να καλύψει κατάλληλα και επαρκώς μία περιοχή με αισθητήρες. Σε αυτή τη διπλωματική εργασία, εξετάζουμε το πρόβλημα της συντονισμένης κάλυψης των αισθητήρων και μελετάμε τη συμπεριφορά και την απόδοση του τελείως κατανεμημένου Q-Learning αλγορίθμου ενισχυτικής μάθησης χρησιμοποιώντας γραμμική προσέγγιση της συνάρτησης χρησιμότητας. Χρησιμοποιούμε την πλατφόρμα Tossim για να προσομοιώσουμε την TinyOS εφαρμογή μας, η οποία αποτελείται από διαφορετικές τοπολογίες δικτύου αισθητήρων με παραμετροποιημένο μέγεθος. Στη συνέχεια, παρουσιάζουμε τα αποτελέσματα της υλοποίησης μας και δείχνουμε έναν αριθμό από γραφήματα για να οπτικοποιήσουμε τις εκβάσεις της απόδοσης και της μάθησης σε τρεις συγκεκριμένες τοπολογίες. Λαμβάνουμε υπ’ όψιν θέματα, όπως επιτυχή σύγκλιση σε βέλτιστες πολιτικές και μεγιστοποίηση των τοπικών και καθολικών ανταμοιβών. Τα αποτελέσματα της υλοποίησης είναι αρκετά ενθαρρυντικά από την άποψη των υψηλών ποσοστών επιτυχών συγκλίσεων του αλγορίθμου μας σε βέλτιστες πολιτικές.</efrbr-expression:summarizationOfContent><efrbr-expression:useRestrictionsOnTheExpression type="creative-commons">http://creativecommons.org/licenses/by-sa/4.0/</efrbr-expression:useRestrictionsOnTheExpression><efrbr-expression:note type="academic unit">Πολυτεχνείο Κρήτης::Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών</efrbr-expression:note></efrbr-expression:expression><efrbr-manifestation:manifestation identifier="http://purl.tuc.gr/dl/dias/3EED559A-0840-4C4F-B352-E0E2347FDD54"><efrbr-manifestation:titleOfTheManifestation>Kotzabasakis_Georgios_Dip_2018.pdf</efrbr-manifestation:titleOfTheManifestation><efrbr-manifestation:publicationDistribution><efrbr-manifestation:placeOfPublicationDistribution type="distribution">Chania [Greece]</efrbr-manifestation:placeOfPublicationDistribution><efrbr-manifestation:publisherDistributor type="distributor">Library of TUC</efrbr-manifestation:publisherDistributor><efrbr-manifestation:dateOfPublicationDistribution>2018-09-14</efrbr-manifestation:dateOfPublicationDistribution></efrbr-manifestation:publicationDistribution><efrbr-manifestation:formOfCarrier>application/pdf</efrbr-manifestation:formOfCarrier><efrbr-manifestation:extentOfTheCarrier>1.5 MB</efrbr-manifestation:extentOfTheCarrier><efrbr-manifestation:accessRestrictionsOnTheManifestation>free</efrbr-manifestation:accessRestrictionsOnTheManifestation></efrbr-manifestation:manifestation><efrbr-person:person identifier="http://users.isc.tuc.gr/~gekotzabasakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Kotzabasakis Georgios
            Κοτζαμπασακης Γεωργιος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~lagoudakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Lagoudakis Michail
            Λαγουδακης Μιχαηλ
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~gchalkiadakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Chalkiadakis Georgios
            Χαλκιαδακης Γεωργιος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~adeligiannakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Deligiannakis Antonios
            Δεληγιαννακης Αντωνιος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-corporateBody:corporateBody identifier="88DACD06-39DC-4712-9BFF-27BE3220D387"><efrbr-corporateBody:nameOfTheCorporateBody vocabulary="">
            Πολυτεχνείο Κρήτης
            Technical University of Crete
         </efrbr-corporateBody:nameOfTheCorporateBody></efrbr-corporateBody:corporateBody><efrbr-concept:concept identifier="5109FFA2-BD2B-4DD9-83AB-E4659219A9F0"><efrbr-concept:termForTheConcept>
            Sensor networks
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="003DFEA7-1BE2-431F-8D0A-F5EF43A69511"><efrbr-concept:termForTheConcept>
            Reinforcement learning
         </efrbr-concept:termForTheConcept></efrbr-concept:concept></efrbr:entities><efrbr:relationships><efrbr-structure:structureRelations><efrbr-structure:realizedThrough sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="expression" targetURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489"/><efrbr-structure:embodiedIn sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="manifestation" targetURI="http://purl.tuc.gr/dl/dias/3EED559A-0840-4C4F-B352-E0E2347FDD54"/></efrbr-structure:structureRelations><efrbr-responsible:responsibleRelations><efrbr-responsible:createdBy sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="person" targetURI="http://users.isc.tuc.gr/~gekotzabasakis"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="person" targetURI="http://users.isc.tuc.gr/~gekotzabasakis" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="person" targetURI="http://users.isc.tuc.gr/~lagoudakis" role="http://purl.tuc.gr/dl/dias/vocabs/contributor-roles/1"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="person" targetURI="http://users.isc.tuc.gr/~gchalkiadakis" role="http://purl.tuc.gr/dl/dias/vocabs/contributor-roles/2"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="person" targetURI="http://users.isc.tuc.gr/~adeligiannakis" role="http://purl.tuc.gr/dl/dias/vocabs/contributor-roles/2"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="person" targetURI="88DACD06-39DC-4712-9BFF-27BE3220D387" role="publisher"/></efrbr-responsible:responsibleRelations><efrbr-subject:subjectRelations><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="concept" targetURI="5109FFA2-BD2B-4DD9-83AB-E4659219A9F0"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/3B99A3C5-D256-4D61-89EA-3B0A0852F489" targetEntity="concept" targetURI="003DFEA7-1BE2-431F-8D0A-F5EF43A69511"/></efrbr-subject:subjectRelations><efrbr-other:otherRelations/></efrbr:relationships></efrbr:recordSet>