Pages that link to "Item:Q5955823"
From MaRDI portal
The following pages link to Algorithms for optimization and stabilization of controlled Markov chains. (Q5955823):
- Nonzero-sum risk-sensitive average stochastic games: The case of unbounded costs (Q2068913)
- Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis (Q5018896)
- A note on the existence of optimal stationary policies for average Markov decision processes with countable states (Q6163982)