Institutional Repository
Technical University of Crete
EN  |  EL

Search

Browse

My Space

Factored MDPs for optimal prosumer decision-making in the smart grid

Angelidakis Angelos

Full record


URI: http://purl.tuc.gr/dl/dias/5AE5689F-69AA-4E38-8EDD-0F5F8EC9D826
Year 2015
Type of Item Master Thesis
License
Details
Bibliographic Citation Άγγελος Αγγελιδάκης, "Παραγοντοποιημένες μαρκοβιανές στοχαστικές διαδικασίες για βέλτιστη λήψη αποφάσεων ενός παραγωγού - καταναλωτή στο έξυπνο δίκτυο ηλεκτροδότησης", Μεταπτυχιακή Διατριβή, Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ε https://doi.org/10.26233/heallink.tuc.62539
Appears in Collections

Summary

Tackling the decision-making problem faced by a prosumer (i.e., a producer thatis simultaneously a consumer) when selling and buying energy in the emergingsmart electricity grid, is of utmost importance for the economic profitability ofsuch a business entity. In this thesis, we model, for the first time, this problemas a factored Markov Decision process (MDP). Our model successfully capturesthe main aspects of the business decisions of a prosumer corresponding to a com-munity microgrid of any size. Moreover, it includes appropriate sub-models forprosumer production and consumption prediction.Employing this model, we are able to represent the problem compactly, andto provide an exact optimal solution via dynamic programming—notwithstandingits large size. In addition, we show how to use approximate MDP solution meth-ods for taking decisions in this domain, without the need of discretizing the statespace. Specifically, we employ fitted value iteration, a sampling-based approxi-mation method that is known to be well behaved. By so doing, we generalize ourfactored MDP solution method to continuous state spaces.Our experimental simulations verify the effectiveness of our approach. Theyshow that our exact value iteration solution matches that of a state-of-the-artmethod for stochastic planning in very large environments, while outperforming itin terms of computation time. Furthermore, we evaluate our approximate solutionmethod via using a variety of basis functions over different state sample sizes,and comparing its performance to that of our exact value iteration algorithm. Ourapproximation method is shown to exhibit stable performance in terms of accu-mulated reward, which for certain basis functions reaches 90% of that gathered bythe exact algorithm.

Available Files

Services

Statistics