Pages that link to "Item:Q467477"
From MaRDI portal
The following pages link to Temporal difference-based policy iteration for optimal control of stochastic systems (Q467477):
Displaying 7 items.
- Potential-based least-squares policy iteration for a parameterized feedback control system (Q289143) (← links)
- Stochastic control via direct comparison (Q633815) (← links)
- Suboptimal control for nonlinear systems with disturbance via integral sliding mode control and policy iteration (Q2178900) (← links)
- Policy iteration based feedback control (Q2440692) (← links)
- Stochastic linear quadratic optimal control for continuous-time systems based on policy iteration (Q2992467) (← links)
- A least squares temporal difference actor–critic algorithm with applications to warehouse management (Q3120552) (← links)
- Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning (Q5219302) (← links)