Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Create a new EntitySchema
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

An empirical study of policy convergence in Markov decision process value iteration

From MaRDI portal
Publication:1886733
Jump to:navigation, search

DOI10.1016/S0305-0548(03)00207-7zbMath1076.90066OpenAlexW2094964720MaRDI QIDQ1886733

William T. Scherer, Christopher W. Zobel

Publication date: 19 November 2004

Published in: Computers \& Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/s0305-0548(03)00207-7


zbMATH Keywords

Markov decision processesDynamic programmingConvergence results


Mathematics Subject Classification ID

Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40)


Related Items

Approximate dynamic programming via direct search in the space of value function approximations



Cites Work

  • Unnamed Item
  • Unnamed Item
  • Dynamic programming and stochastic control
  • Geometric bounds for eigenvalues of Markov chains
  • The convergence of value iteration in discounted Markov decision processes
  • Time will tell: behavioural scoring and the dynamics of consumer credit assessment
  • Finding Optimal Survey Policies via Adaptive Markov Decision Processes
  • A New Value Iteration method for the Average Cost Dynamic Programming Problem
Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1886733&oldid=14289160"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 1 February 2024, at 13:07.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki