A boundedness result for the direct heuristic dynamic programming
From MaRDI portal
Publication:1929722
DOI10.1016/j.neunet.2012.02.005zbMath1254.90286DBLPjournals/nn/LiuSSGM12OpenAlexW2005675298WikidataQ51407734 ScholiaQ51407734MaRDI QIDQ1929722
Publication date: 9 January 2013
Published in: Neural Networks (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.neunet.2012.02.005
Lyapunov stabilityapproximate dynamic programming (ADP)direct heuristic dynamic programming (direct HDP)uniformly ultimately boundedness (UUB)
Approximation methods and heuristics in mathematical programming (90C59) Dynamic programming (90C39)
Related Items (13)
Three bounded proofs for nonlinear multi‐input multi‐output approximate dynamic programming based on the <scp>L</scp>yapunov stability theory ⋮ Toward reliable designs of data-driven reinforcement learning tracking control for Euler-Lagrange systems ⋮ Optimal consensus control for double‐integrator multiagent systems with unknown dynamics using adaptive dynamic programming ⋮ Complete stability analysis of a heuristic approximate dynamic programming control design ⋮ A novel actor-critic-identifier architecture for nonlinear multiagent systems with gradient descent method ⋮ Optimal control of unknown nonlinear system under event‐triggered mechanism and identifier‐critic‐actor architecture ⋮ Model‐free optimal tracking over finite horizon using adaptive dynamic programming ⋮ Convergence analysis of the deep neural networks based globalized dual heuristic programming ⋮ Prioritized experience replay based reinforcement learning for adaptive tracking control of autonomous underwater vehicle ⋮ Data-driven approximate Q-learning stabilization with optimality error bound analysis ⋮ Data-driven approximate value iteration with optimality error bound analysis ⋮ Adaptive cruise control via adaptive dynamic programming with experience replay ⋮ Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning
Cites Work
This page was built for publication: A boundedness result for the direct heuristic dynamic programming