scientific article; zbMATH DE number 970511
Publication:5688680
zbMATH Open: 0873.90107; MaRDI QID: Q5688680
Publication date: 23 January 1997
Title of this publication is not available
value iteration; average reward; unknown parameters; adaptive policies; optimal control of Markovian systems
Related Items (6)
Discrete-time Markov decision processes with first passage models ⋮ Markov decision processes ⋮ Decision Problems for Interval Markov Chains ⋮ Markov decision processes ⋮ Analysis for some properties of discrete time Markov decision processes ⋮ Multitime scale Markov decision processes