Pages that link to "Item:Q5157372"
From MaRDI portal
The following pages link to Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon (Q5157372):
Displaying 11 items.
- Regularized policy iteration with nonparametric function spaces (Q2834459) (← links)
- Robust Policy Iteration for Continuous-Time Linear Quadratic Regulation (Q5034045) (← links)
- Entropy Regularization for Mean Field Games with Learning (Q5870374) (← links)
- A Small Gain Analysis of Single Timescale Actor Critic (Q6042800) (← links)
- Linear Convergence of a Policy Gradient Method for Some Finite Horizon Continuous Time Control Problems (Q6140987) (← links)
- Recent advances in reinforcement learning in finance (Q6146668) (← links)
- Continuous‐time stochastic gradient descent for optimizing over the stationary distribution of stochastic differential equations (Q6196292) (← links)
- Reinforcement learning with dynamic convex risk measures (Q6196296) (← links)
- Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems (Q6490237) (← links)
- Policy gradient methods for discrete time linear quadratic regulator with random parameters (Q6491779) (← links)
- Recent developments in machine learning methods for stochastic control and games (Q6615618) (← links)