A class of procedures to compute the optimal value function in a Markovian decision problem

From MaRDI portal
Publication:3725896

DOI: 10.1080/02331938608843148
zbMath: 0594.90091
OpenAlex: W1989482537
MaRDI QID: Q3725896

Jörg-Uwe Löbus

Publication date: 1986

Published in: Optimization

Full work available at URL: https://doi.org/10.1080/02331938608843148


zbMATH Keywords

successive approximation method; finite state Markov decision processes; effective computation of the value function; finite expected discounted or non-discounted rewards; improving the convergence rate


Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)





