Adaptive Policies in Markov Decision Processes with Uncertain Transition Matrices
From MaRDI portal
Publication:3664853
DOI10.1080/02522667.1983.10698747zbMath0516.90077OpenAlexW2321325624MaRDI QIDQ3664853
Publication date: 1983
Published in: Journal of Information and Optimization Sciences (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1080/02522667.1983.10698747
Bayesian analysisadaptive policyaverage caseoptimal adaptive policyepsilon-optimal policydiscounted caseuncertain transition matriceslearning policynon-Bayesian analysis
Related Items (3)
A unified approach to adaptive control of average reward Markov decision processes ⋮ Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes ⋮ Unnamed Item
Cites Work
This page was built for publication: Adaptive Policies in Markov Decision Processes with Uncertain Transition Matrices