Institutional Repository
Technical University of Crete
EN  |  EL

Search

Browse

My Space

Monte Carlo tree search in the "Settlers of Catan" strategy game

Karamalegos Emmanouil

Simple record


URIhttp://purl.tuc.gr/dl/dias/1CCC4417-B1AA-4C2E-B33B-0F6BEFDED36E-
Identifierhttps://drive.google.com/drive/folders/0B9pKS-Z7bpHxWEotSUxFbGhFUWM-
Identifierhttps://doi.org/10.26233/heallink.tuc.66891-
Languageen-
Extent1,17 megabytesel
TitleMonte Carlo tree search in the "Settlers of Catan" strategy gameen
TitleΔενδρική αναζήτηση Monte Carlo στο παιχνίδι στρατηγικής "Άποικοι του Κατάν"el
CreatorKaramalegos Emmanouilen
CreatorΚαραμαλεγκος Εμμανουηλel
Contributor [Thesis Supervisor]Chalkiadakis Georgiosen
Contributor [Thesis Supervisor]Χαλκιαδακης Γεωργιοςel
Contributor [Committee Member]Lagoudakis Michaelen
Contributor [Committee Member]Λαγουδακης Μιχαηλel
Contributor [Committee Member]Deligiannakis Antoniosen
Contributor [Committee Member]Δεληγιαννακης Αντωνιοςel
PublisherΠολυτεχνείο Κρήτηςel
PublisherTechnical University of Creteen
Academic UnitTechnical University of Crete::School of Electrical and Computer Engineeringen
Academic UnitΠολυτεχνείο Κρήτης::Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστώνel
Content SummaryClassic approaches to game AI require either a high quality of domain knowledge, or a long time to generate effective AI behavior. Monte Carlo Tree Search (MCTS) is a search method that combines the precision of tree search with the generality of random sampling. The family of MCTS algorithms has achieved promising results with perfect-information games such as Go. In our work, we apply Monte-Carlo Tree Search to the non-deterministic game "Settlers of Catan", a multi-player board-turnedweb- based game that necessitates strategic planning and negotiation skills. We implemented an agent which takes into consideration all the aspects of the game for the first time, using no domain knowledge. In our work, we are experimenting using a reinforcement learning method Value of Perfect Information (VPI) and two bandit methods, namely, the Upper Coefficient Bound and Bayesian Upper Coefficient Bound methods. Such methods attempt to strike a balance between exploitation and exploration when creating of the search tree. For first time, we implemented an agent that takes into consideration the complete rules-set of the game and makes it possible to negotiate trading between all players. Furthermore, we included in our agent an alternative initial placement found in the literature, which is based on the analysis of human behavior in Settlers of Catan games. In our experiments we compare the performance of our methods against each other and against appropriate benchmarks (e.g., JSettlers agents), and examine the effect that the number of simulations and the simulation depth has on the algorithms’ performance. Our results suggests that VPI scores significantly better than bandit based methods, even if these employ a much higher number of simulations. In addition to this, the simulation depth of the algorithm needs to be calculated so the method will neither get lost in deep simulations of improbable scenarios neither end up shortly without given a proper estimation of the upcoming moves. Finally, our results indicate that our agent performance is improved when the alternative, human behavior-based, initial placement method.en
Type of ItemΔιπλωματική Εργασίαel
Type of ItemDiploma Worken
Licensehttp://creativecommons.org/licenses/by/4.0/en
Date of Item2016-11-03-
Date of Publication2016-
SubjectArtificial intelligenceen
Bibliographic CitationEmmanouil Karamalegos, "Monte Carlo tree search in the "Settlers of Catan" strategy game", Diploma Work, School of Electrical and Computer Engineering, Technical University of Crete, Chania, Greece, 2016en
Bibliographic CitationΕμμανουήλ Καραμαλέγκος, "Δενδρική αναζήτηση Monte Carlo στο παιχνίδι στρατηγικής "Άποικοι του Κατάν"", Διπλωματική Εργασία, Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ελλάς, 2016el

Available Files

Services

Statistics