Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation (Q5896459)

From MaRDI portal
scientific article; zbMATH DE number 3874993
Language Label Description Also known as
English
Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation
scientific article; zbMATH DE number 3874993

    Statements

    Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation (English)
    0 references
    0 references
    0 references
    1985
    0 references
    We present an implementation of the procedure for determining a suboptimal policy for a large-scale Markov decision process (MDP) presented in Part 1 [see the preceding review]. An operation count analysis illuminates the significant computational benefits of this procedure for determining an optimal policy relative to a procedure for determining a suboptimal policy based on state and action space aggregation. Results of a preliminary numerical study indicate that the quality of the suboptimal policy produced by the 3MDP approach shows promise.
    0 references
    suboptimal policy
    0 references
    large-scale Markov decision process
    0 references

    Identifiers