Pages that link to "Item:Q3984139"
From MaRDI portal
The following pages link to Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes (Q3984139):
Displaying 16 items.
- Policy iteration for robust nonstationary Markov decision processes (Q518127) (← links)
- Adaptive control of discounted Markov decision chains (Q796461) (← links)
- Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming (Q851872) (← links)
- Generalized polynomial approximations in Markovian decision processes (Q1066821) (← links)
- Nonstationary value-iteration and adaptive control of discounted semi-Markov processes (Q1068732) (← links)
- Monotone value iteration for discounted finite Markov decision processes (Q1076618) (← links)
- Computationally efficient algorithms for on-line optimization of Markov decision processes (Q1190506) (← links)
- A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes (Q1266643) (← links)
- Policy iteration type algorithms for recurrent state Markov decision processes (Q1886500) (← links)
- A note on policy algorithms for discounted Markov decision problems (Q1969768) (← links)
- Policy set iteration for Markov decision processes (Q2350853) (← links)
- Q-learning and enhanced policy iteration in discounted dynamic programming (Q2884305) (← links)
- A novel use of value iteration for deriving bounds for threshold and switching curve optimal policies (Q3120095) (← links)
- Suboptimal Policies, with Bounds, for Parameter Adaptive Decision Processes (Q4202459) (← links)
- Adaptive discounted control for piecewise deterministic Markov processes (Q6136351) (← links)
- Adaptive average control for piecewise deterministic Markov processes (Q6636454) (← links)