URI | http://purl.tuc.gr/dl/dias/E504E59F-683F-49F8-9E3F-41D26D6FC513 | - |
Identifier | http://eprints.ulster.ac.uk/21032/1/AAMAS2012_0089_7016861a9.pdf | - |
Language | en | - |
Extent | 8 pages | en |
Title | Decentralized Bayesian reinforcement learning for online agent collaboration | en |
Creator | Parr G. | en |
Creator | Farinelli A. | en |
Creator | Rogers A. | en |
Creator | Chalkiadakis Georgios | en |
Creator | Χαλκιαδακης Γεωργιος | el |
Creator | Jennings N. R. | en |
Creator | McClean S. | en |
Creator | Teacy W. T. L. | en |
Publisher | International Foundation for Autonomous Agents and Multiagent Systems | en |
Publisher | IFAAMAS | en |
Abstract | Solving complex but structured problems in a decentralized manner via multiagent collaboration has received much attention in recent years. This is natural, as on the one hand, multiagent systems usually possess a structure that determines the allowable interactions among the agents; and on the other hand, the single most pressing need in a cooperative multiagent system is to coordinate the local policies of autonomous agents with restricted capabilities to serve a system-wide goal. The presence of uncertainty makes this even more challenging, as the agents face the additional need to learn the unknown environment parameters while forming (and following) local policies in an online fashion. In this paper, we provide the first Bayesian reinforcement learning (BRL) approach for distributed coordination and learning in a cooperative multiagent system by devising two solutions to this type of problem. More specifically, we show how the Value of Perfect Information (VPI) can be used to perform efficient decentralised exploration in both model-based and model-free BRL, and in the latter case, provide a closed form solution for VPI, correcting a decade-old result by Dearden, Friedman and Russell. To evaluate these solutions, we present experimental results comparing their relative merits, and demonstrate empirically that both solutions outperform an existing multiagent learning method, representative of the state-of-the-art. | en |
Type | Πλήρης Δημοσίευση σε Συνέδριο | el |
Type | Conference Full Paper | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date | 2015-09-30 | - |
Publication Date | 2012 | - |
Subject | Multiagent learning | en |
Subject | Bayesian techniques | en |
Subject | Uncertainty | en |
Bibliographic Citation | W. T. L. Teacy, G. Chalkiadakis, A. Farinelli, A. Rogers, N. R. Jennings, S. McClean and G. Parr, "Decentralized Bayesian reinforcement learning for online agent collaboration," presented at 11th International Conference on Autonomous Agents and Multiagent Systems, Valencia, Spain, 2012. | en |