Performance gradient estimation for the very large finite Markov chains
From MaRDI portal
Publication:3986172
DOI10.1109/9.100931zbMath0736.60069OpenAlexW2138113829MaRDI QIDQ3986172
Publication date: 27 June 1992
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1109/9.100931
Queueing theory (aspects of probability theory) (60K25) Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Applications of Markov renewal processes (reliability, queueing networks, etc.) (60K20)
Related Items (5)
A time aggregation approach to Markov decision processes ⋮ Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm ⋮ The control of a two-level Markov decision process by time aggregation ⋮ Optimization via simulation: A review ⋮ A unified approach to time-aggregated Markov decision processes
This page was built for publication: Performance gradient estimation for the very large finite Markov chains