Nearly optimal stationary policies in negative dynamic programming
From MaRDI portal
Publication:1304419
DOI10.1007/S001860050060zbMath0937.90114OpenAlexW2021270645MaRDI QIDQ1304419
Rolando Cavazos-Cadena, Raúl Montes-De-oca
Publication date: 22 September 1999
Published in: Mathematical Methods of Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s001860050060
Markov decision processesexpected total-reward criterionnegative rewardsuniformly \(\varepsilon\)-optimal stationary policies
This page was built for publication: Nearly optimal stationary policies in negative dynamic programming