An empirical study of policy convergence in Markov decision process value iteration (Q1886733)
scientific article; zbMATH DE number 2116802
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | An empirical study of policy convergence in Markov decision process value iteration | scientific article; zbMATH DE number 2116802 | |
Statements
An empirical study of policy convergence in Markov decision process value iteration (English)
19 November 2004
Markov decision processes
Dynamic programming
Convergence results