Ιδρυματικό Αποθετήριο
Πολυτεχνείο Κρήτης

EN | EL

Search

Browse

My Space

Login

On the locality of action domination in sequential decision making

Rachelson, Emmanuel, Lagoudakis Michael

Απλή Εγγραφή

URI	http://purl.tuc.gr/dl/dias/E0292307-A486-42F6-A1D4-8BF6498753E2	-
Αναγνωριστικό	http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf	-
Γλώσσα	en	-
Μέγεθος	8 pages	en
Τίτλος	On the locality of action domination in sequential decision making	en
Δημιουργός	Rachelson, Emmanuel	en
Δημιουργός	Lagoudakis Michael	en
Δημιουργός	Λαγουδακης Μιχαηλ	el
Περίληψη	In the field of sequential decision making and reinforcement learning, it has been observed that good policies for most problems exhibit a significant amount of structure. In practice, this implies that when a learning agent discovers an action is better than any other in a given state, this action actually happens to also dominate in a certain neighbourhood around that state. This paper presents new results proving that this notion of locality in action domination can be linked to the smoothness of the environment’s underlying stochastic model. Namely, we link the Lipschitz continuity of a Markov Decision Process to the Lispchitz continuity of its policies’ value functions and introduce the key concept of influence radius to describe the neighbourhood of states where the dominating action is guaranteed to be constant. These ideas are directly exploited into the proposed Localized Policy Iteration (LPI) algorithm, which is an active learning version of Rollout-based Policy Iteration. Preliminary results on the Inverted Pendulum domain demonstrate the viability and the potential of the proposed approach.	en
Τύπος	Πλήρης Δημοσίευση σε Συνέδριο	el
Τύπος	Conference Full Paper	en
Άδεια Χρήσης	http://creativecommons.org/licenses/by/4.0/	en
Ημερομηνία	2015-11-13	-
Ημερομηνία Δημοσίευσης	2010	-
Θεματική Κατηγορία	Artificial Intelligence	en
Βιβλιογραφική Αναφορά	E. Rachelson and Michail G. Lagoudakis. (2010, Jan.). On the locality of action domination in sequential decision making. Presented at 11th International Symposium on Artificial Intelligence and Mathematics (ISAIM). [Online]. Available: http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf	en

Υπηρεσίες

Στατιστικά

Copyright © DIAS 2013