
Foolproof convergence in multichain policy iteration

From MaRDI portal
Publication:1245162

DOI: 10.1016/0022-247X(78)90044-6
zbMath: 0373.90081
MaRDI QID: Q1245162

Paul J. Schweitzer, Awi Federgruen

Publication date: 1978

Published in: Journal of Mathematical Analysis and Applications



Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)


Related Items (3)

  • A value-iteration scheme for undiscounted multichain Markov renewal programs
  • A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
  • A further anticycling rule in multichain policy iteration for undiscounted Markov renewal programs



Cites Work

  • Unnamed Item
  • The Functional Equations of Undiscounted Markov Renewal Programming
  • Markov-Renewal Programming. I: Formulation, Finite Return Models
  • Discrete Dynamic Programming
  • Perturbation Theory and Undiscounted Markov Renewal Programming
  • Perturbation theory and finite Markov chains
  • Multichain Markov Renewal Programs
  • Potentials for denumerable Markov chains


