
Policy iteration for bounded-parameter POMDPs

From MaRDI portal
Publication:1955470

DOI: 10.1007/s00500-012-0932-3; zbMath: 1264.90174; OpenAlex: W2069186469; MaRDI QID: Q1955470

Yaodong Ni, Zhi-Qiang Liu

Publication date: 11 June 2013

Published in: Soft Computing

Full work available at URL: https://doi.org/10.1007/s00500-012-0932-3


zbMATH Keywords

policy iteration; decision making under uncertainty; \(\epsilon\)-optimal policy; bounded-parameter POMDP; finite-state controller; optimistic optimality


Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)


Related Items

BOUNDED-PARAMETER PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES: FRAMEWORK AND ALGORITHM



Cites Work

  • Unnamed Item
  • Unnamed Item
  • Planning and acting in partially observable stochastic domains
  • Partially observable Markov decision processes with imprecise parameters
  • Bounded-parameter Markov decision processes
  • Markovian Decision Processes with Uncertain Transition Probabilities
  • Bayesian Sequential Detection With Phase-Distributed Change Time and Nonlinear Penalty—A POMDP Lattice Programming Approach
  • Bounded Parameter Markov Decision Processes with Average Reward Criterion
This page was last edited on 1 February 2024, at 17:27.