The following pages link to (Q2953657):
Displaying 4 items.
- A Class of Decision Processes Showing Policy-Improvement/Newton–Raphson Equivalence (Q3415941) (← links)
- A new policy iteration scheme for Markov decision processes using Schweitzer's formula (Q4296391) (← links)
- (Q4969098) (← links)
- A Stochastic Trust-Region Framework for Policy Optimization (Q5096136) (← links)