Averaging vs. discounting in dynamic programming: a counterexample
DOI10.1214/aos/1176342678zbMath0276.49019OpenAlexW2064154559WikidataQ124864459 ScholiaQ124864459MaRDI QIDQ1393717
Publication date: 1974
Published in: The Annals of Statistics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1214/aos/1176342678
Sequential statistical methods (62L99) Discrete-time control/observation systems (93C55) Markov chains (discrete-time Markov processes on discrete state spaces) (60J10) Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Markov and semi-Markov decision processes (90C40) Hamilton-Jacobi theories (49L99)
Related Items (8)
This page was built for publication: Averaging vs. discounting in dynamic programming: a counterexample