Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system

DOI10.1016/j.jfranklin.2020.05.038zbMath1447.93205OpenAlexW3033023361MaRDI QIDQ2198700

Huaguang Zhang, He Ren, Kun Zhang, Yinlei Wen

Publication date: 15 September 2020

Published in: Journal of the Franklin Institute (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.jfranklin.2020.05.038

Mathematics Subject Classification ID

2-person games (91A05) Applications of game theory (91A80) Discrete-time control/observation systems (93C55) Linear systems in control theory (93C05)

Related Items (2)

Off‐policy integral reinforcement learning‐based optimal tracking control for a class of nonzero‐sum game systems with unknown dynamics ⋮ Event‐triggered neural experience replay learning for nonzero‐sum tracking games of unknown continuous‐time nonlinear systems

Cites Work

Unnamed Item
Unnamed Item
Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
\(\mathrm{H}_\infty\) control of linear discrete-time systems: off-policy reinforcement learning
An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations
Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
General value iteration based single network approach for constrained optimal controller design of partially-unknown continuous-time nonlinear systems
Reinforcement learning solution for HJB equation arising in constrained optimal control problem
A mixed 0-1 linear programming approach to the computation of all pure-strategy Nash equilibria of a finite \(n\)-person game in normal form
Haar wavelet-based approach for optimal control of second-order linear systems in time domain
Actuator and sensor faults estimation based on proportional integral observer for TS fuzzy model
Observer based adaptive dynamic programming for fault tolerant control of a class of nonlinear systems
Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
Computationally efficient simultaneous policy update algorithm for nonlinearH_∞state feedback control with Galerkin's method
A computational method for solving optimal control and parameter estimation of linear systems using Haar wavelets
10.1162/1532443041827880

This page was built for publication: Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system