scientific article
Publication:3624032
zbMath: 1182.68252, arXiv: 1110.0028, MaRDI QID: Q3624032
Carlos Guestrin, Miloš Hauskrecht, Branislav Kveton
Publication date: 28 April 2009
Full work available at URL: https://arxiv.org/abs/1110.0028
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Linear programming (90C05) ⋮ Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20)
Related Items (6)
Practical solution techniques for first-order MDPs ⋮ An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method ⋮ Restricted gradient-descent algorithm for value-function approximation in reinforcement learning ⋮ A framework and a mean-field algorithm for the local control of spatial processes ⋮ Embedding a state space model into a Markov decision process ⋮ Influence of modeling structure in probabilistic sequential decision problems