What you should know about approximate dynamic programming

From MaRDI portal

Publication:3621932

Jump to:navigation, search

DOI10.1002/nav.20347zbMath1158.90418OpenAlexW2062457326MaRDI QIDQ3621932

Warren B. Powell

Publication date: 22 April 2009

Published in: Naval Research Logistics (NRL) (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1002/nav.20347

zbMATH Keywords

stochastic optimization Monte Carlo simulation reinforcement learning approximate dynamic programming neuro-dynamic programming

Mathematics Subject Classification ID

Dynamic programming (90C39)

Related Items

Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming ⋮ Project planning with alternative technologies in uncertain environments ⋮ Approximate dynamic programming for the dispatch of military medical evacuation assets ⋮ A stochastic model for the patient-bed assignment problem with random arrivals and departures ⋮ Dynamic appointment scheduling with wait-dependent abandonment ⋮ Stochastic optimization for real time service capacity allocation under random service demand ⋮ Three ways to solve partial differential equations with neural networks — A review ⋮ Nonmyopic and pseudo-nonmyopic approaches to optimal sequential design in the presence of covariates ⋮ Optimized ensemble value function approximation for dynamic programming ⋮ Dynamic surgery management under uncertainty ⋮ The dynamic bowser routing problem ⋮ Literature review on multi-appointment scheduling problems in hospitals ⋮ Approximate dynamic programming for missile defense interceptor fire control ⋮ Price optimization with reference price effects: a generalized Benders' decomposition method and a myopic heuristic approach ⋮ A Bayesian learning model for estimating unknown demand parameter in revenue management ⋮ Controlled approximation of the value function in stochastic dynamic programming for multi-reservoir systems ⋮ Deep hedging of long-term financial derivatives ⋮ An approximate dynamic programming approach for comparing firing policies in a networked air defense environment ⋮ A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3621932&oldid=17058489"