The following pages link to (Q5053310):
Displaying 4 items.
- Policy iteration for robust nonstationary Markov decision processes (Q518127) (← links)
- Toward theoretical understandings of robust Markov decision processes: sample complexity and asymptotics (Q2112808) (← links)
- Technical Note—On the Convexity of Policy Regions in Partially Observed Systems (Q3775355) (← links)
- A family of \(s\)-rectangular robust MDPs: relative conservativeness, asymptotic analyses, and finite-sample properties (Q6495779) (← links)