scientific article
From MaRDI portal
Publication:3312038
zbMath: 0529.90092; MaRDI QID: Q3312038
Publication date: 1984
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
average reward criterion; discounted reward criterion; compact state and action spaces; asymptotically optimal policy; consistent asymptotic estimator; incompletely known law of motion; sequential Markov decision models
Inference from stochastic processes (62M99); Statistical decision theory (62C99); Markov and semi-Markov decision processes (90C40)
Related Items (2)
Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes ⋮ Adaptive discounted control for piecewise deterministic Markov processes