Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm
From MaRDI portal
Publication:2901012
DOI10.1287/ijoc.1070.0240zbMath1243.90235OpenAlexW2102195169MaRDI QIDQ2901012
Huseyin Topaloglu, Sumit Kunnumkal
Publication date: 28 July 2012
Published in: INFORMS Journal on Computing (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1287/ijoc.1070.0240
Related Items (2)
A Machine Learning–Enabled Partially Observable Markov Decision Process Framework for Early Sepsis Prediction ⋮ Shape constraints in economics and operations research
This page was built for publication: Exploiting the Structural Properties of the Underlying Markov Decision Problem in the Q-Learning Algorithm