Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Create a new EntitySchema
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

Computational comparison of policy iteration algorithms for discounted Markov decision processes

From MaRDI portal
Publication:1088914
Jump to:navigation, search

DOI10.1016/0305-0548(86)90028-6zbMath0617.90086OpenAlexW2087482191WikidataQ115104694 ScholiaQ115104694MaRDI QIDQ1088914

A. C. Lavercombe, Lyn C. Thomas, Roger T. Hartley

Publication date: 1986

Published in: Computers \& Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/0305-0548(86)90028-6


zbMATH Keywords

computational comparisonpolicy iteration algorithmsdiscounted Markov decision processes


Mathematics Subject Classification ID

Numerical mathematical programming methods (65K05) Markov and semi-Markov decision processes (90C40)




Cites Work

  • Computational comparison of value iteration algorithms for discounted Markov decision processes
  • Computing the discounted return in markov and semi-markov chains
  • Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
  • Bounds and Transformations for Discounted Finite Markov Decision Chains
  • Technical Note—Accelerated Computation of the Expected Discounted Return in a Markov Chain
  • Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
  • Unnamed Item
  • Unnamed Item
Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1088914&oldid=13114635"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 31 January 2024, at 02:03.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki