scientific article
zbMATH: 1182.68237; arXiv: 1109.2156; MaRDI QID: Q3623997
Sungwook Yoon, Alan Fern, Robert L. Givan
Publication date: 28 April 2009
Full work available at URL: https://arxiv.org/abs/1109.2156
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items
The factored policy-gradient planner
Practical solution techniques for first-order MDPs
A Comprehensive Framework for Learning Declarative Action Models
Reducing reinforcement learning to KWIK online regression
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm
Qualitative Numeric Planning: Reductions and Complexity
A new representation and associated algorithms for generalized planning
APPSSAT: Approximate probabilistic planning using stochastic satisfiability
Rollout sampling approximate policy iteration
Structured machine learning: the next ten years
Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies