A class of procedures to compute the optimal value f unction in a Markovian decision problem
From MaRDI portal
Publication:3725896
DOI10.1080/02331938608843148zbMath0594.90091OpenAlexW1989482537MaRDI QIDQ3725896
Publication date: 1986
Published in: Optimization (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1080/02331938608843148
successive approximation methodfinite state Markov decision processeseffective computation of the value functionfinite expected discounted or non- discounted rewardsimproving the convergence rate
Cites Work
This page was built for publication: A class of procedures to compute the optimal value f unction in a Markovian decision problem