A methodology for computation reduction for specially structured large scale Markov decision problems (Q1092822)
From MaRDI portal
scientific article; zbMATH DE number 4020872
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A methodology for computation reduction for specially structured large scale Markov decision problems | scientific article; zbMATH DE number 4020872 | |
Statements
A methodology for computation reduction for specially structured large scale Markov decision problems (English)
1988
Markov decision processes (MDPs) deal with sequential decision making in stochastic systems. Existing solution techniques provide powerful tools for determining the optimal policy set in such systems. However, many problems have extremely large state and action spaces, making them computationally intractable. Typically, the state variable is n-dimensional, and the number of states grows exponentially with n. For such large problems, the need for large amounts of random access memory and computation time restricts the ability to obtain solutions. The purpose of this paper is both to present a methodology that takes advantage of the structure of many large scale problems (i.e., problems with a high percentage of transient states under optimal control) and to provide computational results indicating the value of the approach.
sequential decision making
extremely large state and action spaces
large scale problems