Steering policies for controlled Markov chains under a recurrence condition
From MaRDI portal
Publication:4506874
DOI10.1109/9.780427zbMath0955.93061OpenAlexW2106327723MaRDI QIDQ4506874
Armand M. Makowski, Dye-Jyun Ma
Publication date: 17 October 2000
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1109/9.780427
adaptive controlMarkov decision processessample path argumentscontrolled Markov chainsrecurrence conditionsample average costs
Optimal stochastic control (93E20) Stochastic learning and adaptive control (93E35) Markov and semi-Markov decision processes (90C40)
This page was built for publication: Steering policies for controlled Markov chains under a recurrence condition