Factored MDPs for optimal prosumer decision-making in the smart grid

Angelidakis Angelos

Full record

URI:

http://purl.tuc.gr/dl/dias/5AE5689F-69AA-4E38-8EDD-0F5F8EC9D826

Year

2015

Type of Item

Master Thesis

License

Details

Bibliographic Citation

Άγγελος Αγγελιδάκης, "Παραγοντοποιημένες μαρκοβιανές στοχαστικές διαδικασίες για βέλτιστη λήψη αποφάσεων ενός παραγωγού - καταναλωτή στο έξυπνο δίκτυο ηλεκτροδότησης", Μεταπτυχιακή Διατριβή, Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ε https://doi.org/10.26233/heallink.tuc.62539

Appears in Collections

Master Theses in Community School of Electrical and Computer Engineering

Summary

Tackling the decision-making problem faced by a prosumer (i.e., a producer thatis simultaneously a consumer) when selling and buying energy in the emergingsmart electricity grid, is of utmost importance for the economic profitability ofsuch a business entity. In this thesis, we model, for the first time, this problemas a factored Markov Decision process (MDP). Our model successfully capturesthe main aspects of the business decisions of a prosumer corresponding to a com-munity microgrid of any size. Moreover, it includes appropriate sub-models forprosumer production and consumption prediction.Employing this model, we are able to represent the problem compactly, andto provide an exact optimal solution via dynamic programming—notwithstandingits large size. In addition, we show how to use approximate MDP solution meth-ods for taking decisions in this domain, without the need of discretizing the statespace. Specifically, we employ fitted value iteration, a sampling-based approxi-mation method that is known to be well behaved. By so doing, we generalize ourfactored MDP solution method to continuous state spaces.Our experimental simulations verify the effectiveness of our approach. Theyshow that our exact value iteration solution matches that of a state-of-the-artmethod for stochastic planning in very large environments, while outperforming itin terms of computation time. Furthermore, we evaluate our approximate solutionmethod via using a variety of basis functions over different state sample sizes,and comparing its performance to that of our exact value iteration algorithm. Ourapproximation method is shown to exhibit stable performance in terms of accu-mulated reward, which for certain basis functions reaches 90% of that gathered bythe exact algorithm.

Search

Browse

My Space

Factored MDPs for optimal prosumer decision-making in the smart grid

Angelidakis Angelos

Summary

Available Files

Services

Export

Share

Statistics

Metadata & Content in a METS Package:

Metadata in Format: