Pages that link to "Item:Q2386346"
From MaRDI portal
The following pages link to Conditions for the uniqueness of optimal policies of discounted Markov decision processes (Q2386346):
Displaying 20 items.
- Existence of optimal policy for time non-homogeneous discounted Markovian decision programming (Q811418) (← links)
- A note on deterministic approximation of discounted Markov decision processes (Q1033081) (← links)
- Stackelberg equilibrium in a dynamic stimulation model with complete information (Q1796245) (← links)
- Approximate stochastic annealing for online control of infinite horizon Markov decision processes (Q1937498) (← links)
- A version of the Euler equation in discounted Markov decision processes (Q1952742) (← links)
- Detection-averse optimal and receding-horizon control for Markov decision processes (Q2208599) (← links)
- Semi-Markov decision processes with variance minimization criterion (Q2342919) (← links)
- Nonuniqueness versus uniqueness of optimal policies in convex discounted Markov decision processes (Q2375462) (← links)
- An envelope theorem and some applications to discounted Markov decision processes (Q2483011) (← links)
- Uniqueness of optimal policies as a generic property of discounted Markov decision processes: Ekeland's variational principle approach. (Q2798089) (← links)
- Monotone optimal policies in discounted Markov decision processes with transition probabilities independent of the current state: existence and approximation (Q2868780) (← links)
- A consumption-investment problem modelled as a discounted Markov decision process (Q2892535) (← links)
- An unbounded Berge's minimum theorem with applications to discounted Markov decision processes (Q2907896) (← links)
- (Q3552451) (← links)
- (Q3770312) (← links)
- A Moreau-Yosida regularization for Markov decision processes (Q5027703) (← links)
- Uniqueness and Stability of Optimal Policies of Finite State Markov Decision Processes (Q5388022) (← links)
- (Q5389901) (← links)
- (Q5446613) (← links)
- Markov decision processes approximation with coupled dynamics via Markov deterministic control systems (Q6611476) (← links)