scientific article; zbMATH DE number 2243398
From MaRDI portal
Publication:5715714
zbMath1080.68674arXiv1109.2145MaRDI QIDQ5715714
Matthijs T. J. Spaan, Nikos Vlassis
Publication date: 4 January 2006
Full work available at URL: https://arxiv.org/abs/1109.2145
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (25)
Gradient-descent for randomized controllers under partial observability ⋮ Cooperative decision-making to minimize biased perceived value effect on business process decisions using partially observable Markov decision processes ⋮ Admission Control Policies in a Finite Capacity Geo/Geo/1 Queue Under Partial State Observations ⋮ Planning for multiple measurement channels in a continuous-state POMDP ⋮ Planning in partially-observable switching-mode continuous domains ⋮ Optimal management of stochastic invasion in a metapopulation with Allee effects ⋮ TEAMSTER: model-based reinforcement learning for ad hoc teamwork ⋮ A Machine Learning–Enabled Partially Observable Markov Decision Process Framework for Early Sepsis Prediction ⋮ Solving zero-sum one-sided partially observable stochastic games ⋮ Parameter-Independent Strategies for pMDPs via POMDPs ⋮ Multi-goal motion planning using traveling salesman problem in belief space ⋮ Exploiting symmetries for single- and multi-agent partially observable stochastic domains ⋮ Control: a perspective ⋮ Unnamed Item ⋮ Unnamed Item ⋮ A fast approximation method for partially observable Markov decision processes ⋮ Bottom-up learning of hierarchical models in a class of deterministic pomdp environments ⋮ An Uncertainty-Based Belief Selection Method for POMDP Value Iteration ⋮ Partially observable Markov decision processes with imprecise parameters ⋮ Learning and planning in partially observable environments without prior domain knowledge ⋮ A tutorial on partially observable Markov decision processes ⋮ Optimizing active surveillance for prostate cancer using partially observable Markov decision processes ⋮ Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms ⋮ Task-Aware Verifiable RNN-Based Policies for Partially Observable Markov Decision Processes ⋮ A novel factored POMDP model for affective dialogue management
Uses Software
This page was built for publication: