Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs - MaRDI portal

The Optimal Control of Partially Observable Markov Processes over the Infinite Horizon: Discounted Costs

From MaRDI portal
Publication:4158268

DOI10.1287/opre.26.2.282zbMath0379.60067OpenAlexW2044375425WikidataQ29030996 ScholiaQ29030996MaRDI QIDQ4158268

Edward J. Sondik

Publication date: 1978

Published in: Operations Research (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/opre.26.2.282




Related Items

Ambiguous partially observable Markov decision processes: structural results and applicationsPARTIALLY OBSERVABLE MARKOV DECISION PROCESSES AND PERIODIC POLICIES WITH APPLICATIONSAn optimal inspection and replacement policy under incomplete state informationA model of project evaluation with limited attentionOn arrival driven queueing models: Admission control, traffic policing, abandonments, and correlated arrivalsA nonhomogeneous hidden Markov model of response dynamics and mailing optimization in direct marketingOn the construction of \(\epsilon\)-optimal strategies in partially observed MDPsA survey of algorithmic methods for partially observed Markov decision processesOn the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processesOn the computation of the optimal cost function for discrete time Markov models with partial observationsCross-entropic learning of a machine for the decision in a partially observable universeOn-line parameter estimation for a failure-prone system subject to condition monitoringAvailability maximization under partial observationsAdmission Control Policies in a Finite Capacity Geo/Geo/1 Queue Under Partial State ObservationsLQG dynamic games with a control-sharing information patternAn efficient heuristic for a partially observable Markov decision process of machine replacementCops and invisible robbers: the cost of drunkennessThe skyline algorithm for POMDP value function pruningMarkov Decision Processes with Incomplete Information and Semiuniform Feller Transition ProbabilitiesA nonlinear programming model for partially observable Markov decision processes: Finite horizon caseBOUNDED-PARAMETER PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES: FRAMEWORK AND ALGORITHMRationality and intelligencePlanning in partially-observable switching-mode continuous domainsA systematic approach to determining mean-variance tradeoffs when managing randomly varying populationsFormalization of methods for the development of autonomous artificial intelligence systemsApproximability and efficient algorithms for constrained fixed-horizon POMDPs with durative actionsPartially observable Markov decision process-based optimal maintenance planning with time-dependent observationsSolving zero-sum one-sided partially observable stochastic gamesInventory control with modulated demand and a partially observed modulation processFuture memories are not needed for large classes of POMDPsMonotone control laws for noisy, countable-state Markov chainsINDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITSOff-policy evaluation in partially observed Markov decision processes under sequential ignorabilityPartially observable Markov decision model for the treatment of early prostate cancerComputation of approximate optimal policies in a partially observed inventory model with rain checksExploiting symmetries for single- and multi-agent partially observable stochastic domainsA unified framework for stochastic optimizationStrong planning under partial observabilityConvergence of probability measures and Markov decision models with incomplete informationManaging mobile production-inventory systems influenced by a modulation processMarkov-Entscheidungs-Prozesse mit abhängigen Aktionen für optimale Reparaturmaßnahmen bei unvollständiger Information. (Markov decision processes with dependent actions for optimal repair policies under incomplete information)Optimal control of infinite horizon partially observable decision processes modelled as generators of probabilistic regular languagesReinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPsPlanning and acting in partially observable stochastic domainsConformant plans and beyond: principles and complexityFilters and parameter estimation for a partially observable system subject to random failure with continuous-range observationsHeuristic anytime approaches to stochastic decision processesProbabilistic Acceptors for Languages over Infinite WordsFinite-state, discrete-time optimization with randomly varying observation qualityMinimum principles in motor control.Optimal condition based maintenance with imperfect information and the proportional hazards modelUndiscounted Markov decision chains with partial information; an algorithm for computing a locally optimal periodic policyPartially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition ProbabilitiesLocally-Connected Interrelated Network: A Forward Propagation PrimitiveAn Uncertainty-Based Belief Selection Method for POMDP Value IterationOn replacement policies for additive systems with several working levelsPartially observable Markov decision processes with imprecise parametersA Fenchel-Moreau-Rockafellar type theorem on the Kantorovich-Wasserstein space with applications in partially observable Markov decision processesMonitoring machine operations using on-line sensorsA dynamic epistemic framework for reasoning about conformant probabilistic plansA tutorial on partially observable Markov decision processesStochastic dynamic programming with factored representationsA New Computational Approach to Cost Variance Iuvestigation ProblemsConstrained Multiagent Markov Decision Processes: a Taxonomy of Problems and AlgorithmsTransformation of partially observable Markov decision processes into piecewise linear onesA survey of solution techniques for the partially observed Markov decision processOptimal cost and policy for a Markovian replacement problemA leader-follower partially observed, multiobjective Markov gameValue of information for a leader-follower partially observed Markov gameOn the undecidability of probabilistic planning and related stochastic optimization problemsOptimal sensor scheduling for hidden Markov model state estimation