Efficient Policy Representation for Markov Decision Processes
Storing the optimal policy is an important challenge for autonomous agents and cyber-physical systems whose behavior is modeled by Markov decision processes. In this paper, we propose a symbolic approach to storing an optimal policy compactly. We use decision trees to classify optimal actions, producing a compressed representation of an optimal policy. To reduce memory consumption, we propose several approaches that either preserve the precision of the computed values or approximate them within a given error threshold. The first approach limits the depth of the trees. The second detects and skips unimportant states, yielding smaller state sets. The third prioritizes states by their impact on the precision of the results. We use the PRISM case studies to investigate the effectiveness of our approach. For these standard case studies, decision trees combined with the proposed approaches reduce the memory needed to store optimal policies by several orders of magnitude, which is promising for embedded systems.
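The core idea in the abstract can be illustrated with a minimal sketch (not the paper's implementation): a tabular optimal policy for a hypothetical toy grid MDP is compressed into a hand-rolled decision tree over state features, so far fewer nodes are stored than table entries. All names and the toy MDP here are assumptions for illustration only.

```python
# Minimal sketch, assuming a toy grid MDP whose optimal action is a simple
# function of the state; the tabular policy is therefore highly compressible.
from collections import Counter

GOAL = (7, 7)

def optimal_action(state):
    """Hypothetical optimal policy for the toy MDP."""
    x, y = state
    if x < GOAL[0]:
        return "right"
    if y < GOAL[1]:
        return "up"
    return "stay"

# Tabular policy: one entry per state (64 entries on an 8x8 grid).
policy_table = {(x, y): optimal_action((x, y))
                for x in range(8) for y in range(8)}

def build_tree(samples):
    """Greedy axis-aligned splits; samples is a list of ((x, y), action).
    Returns a leaf (action string) or (feature, threshold, left, right)."""
    actions = [a for _, a in samples]
    if len(set(actions)) == 1:
        return actions[0]  # pure leaf: a single action covers all states
    best = None
    for feat in (0, 1):
        for thr in sorted({s[feat] for s, _ in samples}):
            left = [(s, a) for s, a in samples if s[feat] < thr]
            right = [(s, a) for s, a in samples if s[feat] >= thr]
            if not left or not right:
                continue
            # Misclassifications if each side predicts its majority action.
            err = sum(len(side) - max(Counter(a for _, a in side).values())
                      for side in (left, right))
            if best is None or err < best[0]:
                best = (err, feat, thr, left, right)
    _, feat, thr, left, right = best
    return (feat, thr, build_tree(left), build_tree(right))

def tree_lookup(tree, state):
    """Follow splits until a leaf action is reached."""
    while isinstance(tree, tuple):
        feat, thr, left, right = tree
        tree = left if state[feat] < thr else right
    return tree

def tree_size(tree):
    if not isinstance(tree, tuple):
        return 1
    return 1 + tree_size(tree[2]) + tree_size(tree[3])

tree = build_tree(list(policy_table.items()))

# The tree reproduces the full table exactly, with far fewer stored nodes.
assert all(tree_lookup(tree, s) == a for s, a in policy_table.items())
print(f"table entries: {len(policy_table)}, tree nodes: {tree_size(tree)}")
```

The paper's depth-limiting and state-pruning approaches would then trade exactness for size, e.g. by stopping `build_tree` at a maximum depth and letting leaves predict majority actions within an error threshold.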
Lecture Notes in Networks and Systems
Arsenyeva, Olga (editor) / Romanova, Tatiana (editor) / Sukhonos, Maria (editor) / Tsegelnyk, Yevgen (editor) / Khademi, Anahita (author) / Khademian, Sepehr (author)
International Conference on Smart Technologies in Urban Engineering ; 2022 ; Kharkiv, Ukraine
29.11.2022
12 pages
Article/Chapter (Book)
Electronic resource
English
Efficient Policy Representation for Markov Decision Processes | TIBKAT | 2023
Markov decision processes with fuzzy rewards | British Library Online Contents | 2003
Incorporating Bayesian Networks in Markov Decision Processes | Online Contents | 2013
Reliability-Based Structural Design with Markov Decision Processes | Online Contents | 1995