Pages that link to "Item:Q5093265"
From MaRDI portal
The following pages link to A Q-Learning Algorithm for Discrete-Time Linear-Quadratic Control with Random Parameters of Unknown Distribution: Convergence and Stabilization (Q5093265):
Displaying 14 items.
- The DQR algorithm, basic theory, convergence, and conditional stability (Q1360724) (← links)
- Model-free linear quadratic regulator (Q2094032) (← links)
- Convergence results for an averaged LQR problem with applications to reinforcement learning (Q2236442) (← links)
- On the effect of probing noise in optimal control LQR via Q-learning using adaptive filtering algorithms (Q2673621) (← links)
- A learning algorithm for discrete-time stochastic control (Q2711577) (← links)
- Finite-time stochastic linear quadratic optimal control based on \(Q\)-learning (Q3386228) (← links)
- Optimal control for unknown mean-field discrete-time system based on Q-Learning (Q5029154) (← links)
- An iterative Q‐learning scheme for the global stabilization of discrete‐time linear systems subject to actuator saturation (Q5223457) (← links)
- Deterministic optimal control for discrete-time systems with multiplicative noises and random coefficients (Q6100435) (← links)
- Approximate Q Learning for Controlled Diffusion Processes and Its Near Optimality (Q6136230) (← links)
- A distributed stochastic approximation algorithm for stochastic LQ control with unknown uncertainty (Q6164027) (← links)
- Policy gradient methods for discrete time linear quadratic regulator with random parameters (Q6491779) (← links)
- Solving optimal predictor-feedback control using approximate dynamic programming (Q6632499) (← links)
- Weighted stochastic Riccati equations for generalization of linear optimal control (Q6659181) (← links)