
Truncated policy iteration methods

From MaRDI portal
Publication:1060136

DOI: 10.1016/0167-6377(84)90054-3
zbMath: 0567.90097
OpenAlex: W1999380740
MaRDI QID: Q1060136

Ron S. Dembo, Moshe Haviv

Publication date: 1984

Published in: Operations Research Letters

Full work available at URL: https://doi.org/10.1016/0167-6377(84)90054-3


zbMATH Keywords

Markov chains; modified policy iteration methods; preassigned rate-of-convergence


Mathematics Subject Classification ID

Numerical mathematical programming methods (65K05)
Stochastic programming (90C15)
Dynamic programming (90C39)
Markov and semi-Markov decision processes (90C40)


Related Items

Hierarchic Markov processes and their applications in replacement models ⋮ (Approximate) iterated successive approximations algorithm for sequential decision processes ⋮ A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes



Cites Work

  • Unnamed Item
  • Unnamed Item
  • Unnamed Item
  • Inexact Newton Methods
  • Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
  • Contraction Mappings in the Theory Underlying Dynamic Programming
This page was last edited on 31 January 2024, at 00:49.