Q-learning and enhanced policy iteration in discounted dynamic programming (Q2884305)

From MaRDI portal





scientific article; zbMATH DE number 6038619
Language Label Description Also known as
English
Q-learning and enhanced policy iteration in discounted dynamic programming
scientific article; zbMATH DE number 6038619

    Statements

    0 references
    0 references
    24 May 2012
    0 references
    Markov decision processes
    0 references
    Q-learning
    0 references
    policy iteration
    0 references
    value iteration stochastic approximation
    0 references
    reinforcement learning
    0 references
    Q-learning and enhanced policy iteration in discounted dynamic programming (English)
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references