Mathematical Research Data Initiative
Main page
Recent changes
Random page
SPARQL
MaRDI@GitHub
Special pages
In other projects
MaRDI portal item
Discussion
View source
View history
Purge
English
Log in

scientific article; zbMATH DE number 2159039

From MaRDI portal
Publication:4668597
Jump to:navigation, search

zbMATH Open1140.90421MaRDI QIDQ4668597

Xi-Ren Cao

Publication date: 19 April 2005



Title of this publication is not available (Why is that?)


zbMATH Keywords

Poisson equationspotentialsgradient-based policy iteration\(Q\)-learning, \(\text{TD}(\lambda)\)


Mathematics Subject Classification ID

Management decision making, including multiple objectives (90B50) Markov and semi-Markov decision processes (90C40)



Related Items (4)

Sensitivity analysis of a sequential decision problem with learning ⋮ Sensitivity of constrained Markov decision processes ⋮ Two classes Markov decision processes with perturbations ⋮ From perturbation analysis to Markov decision processes and reinforcement learning






This page was built for publication:

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4668597)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4668597&oldid=18882086"
Tools
What links here
Related changes
Printable version
Permanent link
Page information
This page was last edited on 7 February 2024, at 18:45.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki