URI | http://purl.tuc.gr/dl/dias/E504E59F-683F-49F8-9E3F-41D26D6FC513 | - |
Identifier | http://eprints.ulster.ac.uk/21032/1/AAMAS2012_0089_7016861a9.pdf | - |
Language | en | - |
Extent | 8 pages | en |
Title | Decentralized bayesian reinforcement learning for online agent collaboration | en |
Creator | Parr G. | en |
Creator | Farinelli A. | en |
Creator | Rogers A. | en |
Creator | Chalkiadakis Georgios | en |
Creator | Χαλκιαδακης Γεωργιος | el |
Creator | Jennings N. R. | en |
Creator | McClean S. | en |
Creator | Teacy W. T. L. | en |
Publisher | International Foundation for Autonomous Agents and Multiagent Systems | en |
Publisher | IFAAMS | en |
Content Summary | Solving complex but structured problems in a decentralized manner via multiagent collaboration has received much attention in recent years. This is natural, as on one hand, multiagent systems usu- ally possess a structure that determines the allowable interactions among the agents; and on the other hand, the single most pressing need in a cooperative multiagent system is to coordinate the local policies of autonomous agents with restricted capabilities to serve a system-wide goal. The presence of uncertainty makes this even more challenging, as the agents face the additional need to learn the unknown environment parameters while forming (and follow- ing) local policies in an online fashion. In this paper, we provide the first Bayesian reinforcement learning (BRL) approach for dis- tributed coordination and learning in a cooperative multiagent sys- tem by devising two solutions to this type of problem. More specif- ically, we show how the Value of Perfect Information (VPI) can be used to perform efficient decentralised exploration in both model- based and model-free BRL, and in the latter case, provide a closed form solution for VPI, correcting a decade old result by Dearden, Friedman and Russell. To evaluate these solutions, we present ex- perimental results comparing their relative merits, and demonstrate empirically that both solutions outperform an existing multiagent learning method, representative of the state-of-the-art. | en |
Type of Item | Πλήρης Δημοσίευση σε Συνέδριο | el |
Type of Item | Conference Full Paper | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date of Item | 2015-09-30 | - |
Date of Publication | 2012 | - |
Subject | Multiagent learning | en |
Subject | Bayesian techniques | en |
Subject | Uncertainty | en |
Bibliographic Citation | W. T. L. Leacy, G. Chalkiadakis, A. Farinelli, A. Rogers, N. R. Jennings, S. McClean and G. Parr, "Decentralized bayesian reinforcement learning for online agent collaboration," presented at 11th International Conference on Autonomous Agents and Multiagent Systems, Valencia, Spain, 2012. | en |