${Q}$-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control
From MaRDI portal
Publication:4564552
DOI10.1109/TSP.2007.893228zbMath1390.90558WikidataQ114982612 ScholiaQ114982612MaRDI QIDQ4564552
Dejan V. Djonin, Vikram Krishnamurthy
Publication date: 12 June 2018
Published in: IEEE Transactions on Signal Processing (Search for Journal in Brave)
Learning and adaptive systems in artificial intelligence (68T05) Stochastic learning and adaptive control (93E35) Markov and semi-Markov decision processes (90C40) Randomized algorithms (68W20)
Related Items (5)
Event-based optimization approach for solving stochastic decision problems with probabilistic constraint ⋮ Constrained Markov decision processes with uncertain costs ⋮ Unnamed Item ⋮ Sleeping experts and bandits approach to constrained Markov decision processes ⋮ A reinforcement learning approach to call admission and call dropping control in links with variable capacity
This page was built for publication: ${Q}$-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control