Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system
Publication:2198700
DOI: 10.1016/j.jfranklin.2020.05.038; zbMath: 1447.93205; OpenAlex: W3033023361; MaRDI QID: Q2198700
Huaguang Zhang, He Ren, Kun Zhang, Yinlei Wen
Publication date: 15 September 2020
Published in: Journal of the Franklin Institute
Full work available at URL: https://doi.org/10.1016/j.jfranklin.2020.05.038
2-person games (91A05); Applications of game theory (91A80); Discrete-time control/observation systems (93C55); Linear systems in control theory (93C05)
Related Items (2)
- Off‐policy integral reinforcement learning‐based optimal tracking control for a class of nonzero‐sum game systems with unknown dynamics
- Event‐triggered neural experience replay learning for nonzero‐sum tracking games of unknown continuous‐time nonlinear systems
Cites Work
- Unnamed Item
- Unnamed Item
- Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
- \(\mathrm{H}_\infty\) control of linear discrete-time systems: off-policy reinforcement learning
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- General value iteration based single network approach for constrained optimal controller design of partially-unknown continuous-time nonlinear systems
- Reinforcement learning solution for HJB equation arising in constrained optimal control problem
- A mixed 0-1 linear programming approach to the computation of all pure-strategy Nash equilibria of a finite \(n\)-person game in normal form
- Haar wavelet-based approach for optimal control of second-order linear systems in time domain
- Actuator and sensor faults estimation based on proportional integral observer for TS fuzzy model
- Observer based adaptive dynamic programming for fault tolerant control of a class of nonlinear systems
- Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- Computationally efficient simultaneous policy update algorithm for nonlinear \(H_\infty\) state feedback control with Galerkin's method
- A computational method for solving optimal control and parameter estimation of linear systems using Haar wavelets
- 10.1162/1532443041827880