Pages that link to "Item:Q2198700"
From MaRDI portal
The following pages link to Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system (Q2198700):
Displaying 8 items.
- Off-policy Q-learning: solving Nash equilibrium of multi-player games with network-induced delay and unmeasured state (Q2063842)
- A policy iteration algorithm for nonzero-sum games with unknown models (Q3195737)
- Optimal tracking control for non‐zero‐sum games of linear discrete‐time systems via off‐policy reinforcement learning (Q5003645)
- Data-driven adaptive dynamic programming for partially observable nonzero-sum games via Q-learning method (Q5025895)
- Off‐policy integral reinforcement learning‐based optimal tracking control for a class of nonzero‐sum game systems with unknown dynamics (Q6054473)
- Model-free finite-horizon optimal control of discrete-time two-player zero-sum games (Q6099271)
- Model-free policy iteration approach to NCE-based strategy design for linear quadratic Gaussian games (Q6165338)
- Event‐triggered neural experience replay learning for nonzero‐sum tracking games of unknown continuous‐time nonlinear systems (Q6194544)