An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes
From MaRDI portal
Publication:2892321
DOI10.1287/ijoc.1050.0155zbMath1241.90173OpenAlexW2106198477WikidataQ114967841 ScholiaQ114967841MaRDI QIDQ2892321
Jiaqiao Hu, Steven I. Marcus, Vahid Reza Ramezani, Michael C. Fu
Publication date: 18 June 2012
Published in: INFORMS Journal on Computing (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/ijoc.1050.0155
Approximation methods and heuristics in mathematical programming (90C59) Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40)
Related Items (2)
A variable neighborhood search based algorithm for finite-horizon Markov decision processes ⋮ Optimal Online Learning for Nonlinear Belief Models Using Discrete Priors
Uses Software
Cites Work
This page was built for publication: An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes