Adaptive control of discounted Markov decision chains

From MaRDI portal
Publication:796461

DOI10.1007/BF00938426zbMath0543.90093OpenAlexW2075970677MaRDI QIDQ796461

Steven I. Marcus, Onésimo Hernández-Lerma

Publication date: 1985

Published in: Journal of Optimization Theory and Applications (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/bf00938426




Related Items (21)

Finite-state approximations for denumerable multidimensional state discounted Markov decision processesFinite-state approximations for denumerable state discounted Markov decision processesA unified approach to adaptive control of average reward Markov decision processesAdaptive policies for discrete-time stochastic control systems with unknown disturbance distributionDensity estimation and adaptive control of Markov processes: Average and discounted criteriaNonparametric adaptive control of discrete-time partially observable stochastic systemsDiscretization procedures for adaptive Markov control processesAdaptive discounted control for piecewise deterministic Markov processesNonparametric adaptive control of discounted stochastic systems with compact state spaceStatistical inference for a finite optimal stopping problem with unknown transition probabilitiesAdaptive control of constrained Markov chains: Criteria and policiesNonparametric estimation and adaptive control in a class of finite Markov decision chainsAdaptive control of stochastic systems with unknown disturbance distribution: discounted criteriaRecursive adaptive control of Markov decision processes with the average reward criterionAdaptive control of diffusion processes with a discounted reward criterionUnnamed ItemNonstationary value-iteration and adaptive control of discounted semi- Markov processesStability estimation of some Markov controlled processesOptimal cost and policy for a Markovian replacement problemAdaptive control of Markov processes with incomplete state information and unknown parametersIdentification and control in the partially known Merton portfolio selection model



Cites Work


This page was built for publication: Adaptive control of discounted Markov decision chains