URI | http://purl.tuc.gr/dl/dias/1CCC4417-B1AA-4C2E-B33B-0F6BEFDED36E | - |
Identifier | https://drive.google.com/drive/folders/0B9pKS-Z7bpHxWEotSUxFbGhFUWM | - |
Identifier | https://doi.org/10.26233/heallink.tuc.66891 | - |
Language | en | - |
Extent | 1,17 megabytes | el |
Title | Monte Carlo tree search in the "Settlers of Catan" strategy game | en |
Title | Δενδρική αναζήτηση Monte Carlo στο παιχνίδι στρατηγικής "Άποικοι του Κατάν" | el |
Creator | Karamalegos Emmanouil | en |
Creator | Καραμαλεγκος Εμμανουηλ | el |
Contributor [Thesis Supervisor] | Chalkiadakis Georgios | en |
Contributor [Thesis Supervisor] | Χαλκιαδακης Γεωργιος | el |
Contributor [Committee Member] | Lagoudakis Michael | en |
Contributor [Committee Member] | Λαγουδακης Μιχαηλ | el |
Contributor [Committee Member] | Deligiannakis Antonios | en |
Contributor [Committee Member] | Δεληγιαννακης Αντωνιος | el |
Publisher | Πολυτεχνείο Κρήτης | el |
Publisher | Technical University of Crete | en |
Academic Unit | Technical University of Crete::School of Electrical and Computer Engineering | en |
Academic Unit | Πολυτεχνείο Κρήτης::Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών | el |
Content Summary | Classic approaches to game AI require either a high quality of domain knowledge, or
a long time to generate effective AI behavior. Monte Carlo Tree Search (MCTS) is a
search method that combines the precision of tree search with the generality of
random sampling. The family of MCTS algorithms has achieved promising results
with perfect-information games such as Go. In our work, we apply Monte-Carlo Tree
Search to the non-deterministic game "Settlers of Catan", a multi-player board-turnedweb-
based game that necessitates strategic planning and negotiation skills. We
implemented an agent which takes into consideration all the aspects of the game for
the first time, using no domain knowledge.
In our work, we are experimenting using a reinforcement learning method Value of
Perfect Information (VPI) and two bandit methods, namely, the Upper Coefficient
Bound and Bayesian Upper Coefficient Bound methods. Such methods attempt to
strike a balance between exploitation and exploration when creating of the search
tree.
For first time, we implemented an agent that takes into consideration the complete
rules-set of the game and makes it possible to negotiate trading between all players.
Furthermore, we included in our agent an alternative initial placement found in the
literature, which is based on the analysis of human behavior in Settlers of Catan
games.
In our experiments we compare the performance of our methods against each other
and against appropriate benchmarks (e.g., JSettlers agents), and examine the effect
that the number of simulations and the simulation depth has on the algorithms’
performance. Our results suggests that VPI scores significantly better than bandit
based methods, even if these employ a much higher number of simulations. In
addition to this, the simulation depth of the algorithm needs to be calculated so the
method will neither get lost in deep simulations of improbable scenarios neither end
up shortly without given a proper estimation of the upcoming moves. Finally, our
results indicate that our agent performance is improved when the alternative, human
behavior-based, initial placement method. | en |
Type of Item | Διπλωματική Εργασία | el |
Type of Item | Diploma Work | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date of Item | 2016-11-03 | - |
Date of Publication | 2016 | - |
Subject | Artificial intelligence | en |
Bibliographic Citation | Emmanouil Karamalegos, "Monte Carlo tree search in the "Settlers of Catan" strategy game", Diploma Work, School of Electrical and Computer Engineering, Technical University of Crete, Chania, Greece, 2016 | en |
Bibliographic Citation | Εμμανουήλ Καραμαλέγκος, "Δενδρική αναζήτηση Monte Carlo στο παιχνίδι στρατηγικής "Άποικοι του Κατάν"", Διπλωματική Εργασία, Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ελλάς, 2016 | el |