Bellman's principle of optimality and deep reinforcement learning for time-varying tasks
From MaRDI portal
Publication:5043501
DOI10.1080/00207179.2021.1913516zbMath1500.93144OpenAlexW3146773041MaRDI QIDQ5043501
Alessandro Giuseppi, Antonio Pietrabissa
Publication date: 6 October 2022
Published in: International Journal of Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1080/00207179.2021.1913516
Related Items (1)
Uses Software
Cites Work
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- Adaptive dynamic programming for discrete-time linear quadratic regulation based on multirate generalised policy iteration
- Allocating resources via price management systems: a dynamic programming-based approach
- Modelling and solving resource allocation problems via a dynamic programming approach
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
This page was built for publication: Bellman's principle of optimality and deep reinforcement learning for time-varying tasks