scientific article
From MaRDI portal
Publication:3745652
zbMath0606.90130MaRDI QIDQ3745652
Roberto S. Acosta Abreu, Onésimo Hernández-Lerma
Publication date: 1985
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
successive approximationcountable state spacenaive feedback controlleraverage reward adaptive Markov decision processescompact feasible action setsnonstationary value- iterationstrong scrambling condition
Markov processes: estimation; hidden Markov models (62M05) Markov and semi-Markov decision processes (90C40)
Related Items (6)
A unified approach to adaptive control of average reward Markov decision processes ⋮ Density estimation and adaptive control of Markov processes: Average and discounted criteria ⋮ Unnamed Item ⋮ Recursive adaptive control of Markov decision processes with the average reward criterion ⋮ Unnamed Item ⋮ Adaptive control of Markov processes with incomplete state information and unknown parameters
This page was built for publication: