Finding optimal memoryless policies of POMDPs under the expected average reward criterion
Publication: 418072
DOI: 10.1016/j.ejor.2010.12.014
zbMath: 1237.90250
OpenAlex: W2055418958
MaRDI QID: Q418072
Baoqun Yin, Hong-Sheng Xi, Yan-Jie Li
Publication date: 14 May 2012
Published in: European Journal of Operational Research
Full work available at URL: https://doi.org/10.1016/j.ejor.2010.12.014
Cites Work
- Basic ideas for event-based optimization of Markov systems
- Optimization of a special case of continuous-time Markov decision processes with compact action set
- A survey of algorithmic methods for partially observed Markov decision processes
- Perturbation realization, potentials, and sensitivity analysis of Markov processes
- The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
- Convergence of Simulation-Based Policy Iteration
- Simulation-based optimization of Markov reward processes
- The $n$th-Order Bias Optimality for Multichain Markov Decision Processes
- Event-Based Optimization of Markov Systems
- Potential-Based Online Policy Iteration Algorithms for Markov Decision Processes
- Performance optimization algorithms based on potentials for semi-Markov control processes