Finite state Markovian decision processes - MaRDI portal

From MaRDI portal
Publication:2561156

zbMath: 0262.90001, MaRDI QID: Q2561156

C. Derman

Publication date: 1970

Published in: Mathematics in Science and Engineering




Related Items

Markov decision programming with constraints
Asymptotic expansions for dynamic programming recursions with general nonnegative matrices
Deciding probabilistic automata weak bisimulation: theory and practice
Discounted Markov decision processes with fuzzy costs
Matching, search, and bargaining
Utility, probabilistic constraints, mean and variance of discounted rewards in Markov decision processes
A variance minimization problem for a Markov decision process
Quadratic programming and the single-controller stochastic game
Performance analysis of probabilistic timed automata using digital clocks
Continue, quit, restart probability model
Perspectives of approximate dynamic programming
Impact of supply risks on procurement decisions
Optimal control in light traffic Markov decision processes
On efficiency of linear programming applied to discounted Markovian decision problems
Trading performance for stability in Markov decision processes
Communicating MDPs: Equivalence and LP properties
Strategy improvement for concurrent reachability and turn-based stochastic safety games
An efficient heuristic for a partially observable Markov decision process of machine replacement
Finite-horizon variance penalised Markov decision processes
A semimartingale characterization of average optimal stationary policies for Markov decision processes
Constrained Markov decision processes with first passage criteria
On confidence intervals from simulation of finite Markov chains
On the life and work of Cyrus Derman
Derman's book as inspiration: some results on LP for MDPs
Q-learning and policy iteration algorithms for stochastic shortest path problems
Asymptotically optimal Bayesian sequential change detection and identification rules
Discounting axioms imply risk neutrality
Variance minimization for constrained discounted continuous-time MDPs with exponentially distributed stopping times
Training and repair policies for stand-by systems
Inventory replenishment control under supply uncertainty
Exact decomposition approaches for Markov decision processes: a survey
A uniform framework for modeling nondeterministic, probabilistic, stochastic, or mixed processes and their behavioral equivalences
First passage problems for nonstationary discrete-time stochastic control systems
A game-theoretic analysis of bargaining with reputations
The blast furnaces problem
Mathematical modeling of distributed catastrophic and terrorist risks
On strategy improvement algorithms for simple stochastic games
Separable value functions for infinite horizon average reward Markov decision processes
Optimal control of Markov chains admitting strong and weak interactions
A decomposition algorithm for limiting average Markov decision problems.
Explicit solution of the average-cost optimality equation for a pest-control problem
Optimal maintenance of systems with Markovian mission and deterioration
A model of far-sighted electoral competition
Dynamic programming and the secretary problem
Optimal threshold probability in undiscounted Markov decision processes with a target set.
Adaptive control of constrained Markov chains: Criteria and policies
Nonlinear programming and stationary equilibria in stochastic games
Sensitivity of constrained Markov decision processes
Polynomial time decision algorithms for probabilistic automata
Learning about variable demand in the long run
Separable Markovian decision problems. The linear programming method in the multichain case
An exponential lower bound for Cunningham's rule
Saddle-point calculation for constrained finite Markov chains
Finite approximation of the first passage models for discrete-time Markov decision processes with varying discount factors
Estimating the value of a discounted reward process
Structural properties of optimal tool replacement policy in a machining center
On polynomial cases of the unichain classification problem for Markov decision processes
An optimal maintenance policy of a discrete time Markovian deterioration system
Policy iteration and Newton-Raphson methods for Markov decision processes under average cost criterion
Some remarks on the new optimality criterion of Mine and Tabata
On the convergence of the average expected return in dynamic programming
Maximizing the probability of attaining a target prior to extinction
Plastic-elastic torsion, optimal stopping and free boundaries
Stochastic control of paging in a two-level computer memory
Optimal software testing in the setting of controlled Markov chains
Optimal isolation policies for deterministic and stochastic epidemics
On stopped decision processes with discrete time parameter
Optimal threshold probability and expectation in semi-Markov decision processes
Optimal maintenance of two stochastically deteriorating machines with an intermediate buffer
Stochastic evolution and control of an economic activity
Stochastische dynamische Optimierung als Spezialfall linearer Optimierung in halbgeordneten Vektorräumen
Optimal immunization rules for an epidemic with recovery
Optimal systems for equipment maintenance and replacement under Markovian deterioration
Markov decision models for the optimal maintenance of a production unit with an upstream buffer
Contraction mappings underlying undiscounted Markov decision problems
Probabilistic mobile ambients
Fast convergence to state-action frequency polytopes for MDPs
A note on the hypercube model
Optimal maintenance of a production-inventory system with idle periods
Constrained denumerable state non-stationary MDPs with expected total reward criterion
Optimal strategies for an inventory system with cost functions of general form
On the optimal control of M/G/1 systems under the cycle criterion
On asymptotic optimization of a class of nonlinear stochastic hybrid systems
Percentiles and Markovian decision processes
Optimal control of a dam under seasonal electricity prices
On maximizing the average time at a goal
On the block upper-triangularity of undiscounted multi-chain Markov decision problems
Paging against a distribution and IP networking
Stochastic control theory and operational research
Sensitivity analysis in discounted Markovian decision problems
Mean-variance criteria in an undiscounted Markov decision process
Derivation of optimal stocking policies for grazing in arid regions. I. Methodology
Optimal maintenance strategies for systems with partial repair options and without assuming bounded costs
Finite state Markov decision models with average reward criteria
Generalized polynomial approximations in Markovian decision processes
Optimal policies for controlled Markov chains with a constraint
Some comments on a theorem of Hardy and Littlewood
Revised simplex algorithm for finite Markov decision processes
Optimal stopping problems for multiarmed bandit processes with arms' independence
Combinatorial structure and randomized subexponential algorithms for infinite games
The power of two choices for random walks
Model Checking of Biological Systems
Regular Policies in Abstract Dynamic Programming
Algebraic optimization of sequential decision problems
Geometry and convergence of natural policy gradient methods
Machine maintenance with workload considerations
Controlled Markov Fields with Finite State Space on Graphs
Markov decision processes
Adaptive policy for two finite Markov chains zero-sum stochastic game with unknown transition matrices and average payoffs
An analysis of transient Markov decision processes
Process algebra for performance evaluation
On the chance to visit a goal set infinitely often
The linear time-branching time spectrum of equivalences for stochastic systems with non-determinism
A Convex Analytic Approach to Risk-Aware Markov Decision Processes
Markov Decision Processes with Asymptotic Average Failure Rate Constraint
Control of a hybrid stochastic system
Temporal logics for the specification of performance and reliability
Symbolic model checking for probabilistic timed automata
Performance Model Checking Scenario-Aware Dataflow
Managing stochastic inventory systems with free shipping option
A control policy of an inventory system with compound poisson demand
Algorithms for stochastic games - A survey
Certified Impossibility Results and Analyses in Coq of Some Randomised Distributed Algorithms
Block-scaling of value-iteration for discounted Markov renewal programming
Replacement process decomposition for discounted Markov renewal programming
On some algorithms for limiting average Markov decision processes
Optimal Preventive Maintenance of a Production-Inventory System When the Action of “Idling” Is Permissible
On stochastic optimality of policies in first passage problems
Biased random walks
Controlled Markov processes on the infinite planning horizon: Weighted and overtaking cost criteria
Markov: A methodology for the solution of infinite time horizon markov decision processes
Constrained Semi-Markov decision processes with average rewards
Multi-objective discounted Markov decision processes with expectation and variance criteria
Generalized Markovian decision processes
Survey of linear programming for standard and nonstandard Markovian control problems. Part I: Theory
Linear programming formulation of MDPs in countable state space: The multichain case
Error bounds for stochastic shortest path problems
Unnamed Item
ULTraS at work: compositionality metaresults for bisimulation and trace semantics
First Passage Optimality and Variance Minimisation of Markov Decision Processes with Varying Discount Factors
Optimal repair allocation in a series system expected discounted operation time criterion
Risk-Sensitive Reinforcement Learning via Policy Gradient Search
A Simple P-Matrix Linear Complementarity Problem for Discounted Games
Stochastic shortest path problems with associative accumulative criteria
The Linear Program approach in multi-chain Markov Decision Processes revisited
Partially observable Markov decision model for the treatment of early prostate cancer
First passage models for denumerable semi-Markov decision processes with nonnegative discounted costs
Deterministic discrete dynamic programming with discount factor greater than one: Some further results and algorithms
On nonzero-sum game considered on solutions of a hybrid system with frequent random jumps
A value iteration method for undiscounted multichain Markov decision processes
On optimal replacement policy
STRONG AVERAGE OPTIMALITY FOR CONTROLLED NONHOMOGENEOUS MARKOV CHAINS*
Towards solving 2-TBSG efficiently
The LP approach in average reward MDPs with multiple cost constraints: The countable state case
Verification of the randomized consensus algorithm of Aspnes and Herlihy: a case study
Scheduling service in tandem queues attended by a single server
Fuzzy optimality relation for perceptive MDPs-the average case
Constructive logical characterizations of bisimilarity for reactive probabilistic systems
Towards general axiomatizations for bisimilarity and trace semantics
Concurrent reachability games
A Subexponential Lower Bound for Zadeh’s Pivoting Rule for Solving Linear Programs and Games
On Markov games
Task-structured probabilistic I/O automata
Code aware resource management
Customizing exponential semi-Markov decision processes under the discounted cost criterion
A decision exclusion algorithm for a class of Markovian Decision Processes
A Generalisation of Stationary Distributions, and Probabilistic Program Algebra
Relating strong behavioral equivalences for processes with nondeterminism and probabilities
Dynamic multi-appointment patient scheduling for radiation therapy
Solution procedures for multi-objective markov decision processes
Illustrated review of convergence conditions of the value iteration algorithm and the rolling horizon procedure for average-cost MDPs
Revisiting bisimilarity and its modal logic for nondeterministic and probabilistic processes
A survey of maintenance models: The control and surveillance of deteriorating systems
Transient policies in discrete dynamic programming: Linear programming including suboptimality tests and additional constraints
Optimization of moves and measurements in networks with stochastic costs
MARKOVIAN DETERIORATION WITH UNCERTAIN INFORMATION — A MORE GENERAL MODEL
Remarks on the hypercube model
On the Fixed Points of the Optimal Reward Operator in Stochastic Dynamic Programming with Discount Factor Greater than One
First passage game
Solving stochastic dynamic programming problems by linear programming — An annotated bibliography
Optimal dynamic rules for assigning customers to servers in a heterogeneous queuing system
Some approaches to solving inventory control problems
On Solving Finite State Multi-Armed Bandit Problem by Linear Programming
Solution of continuous-time markovian decision models using infinite linear programming
Computing Optimal Policies for Markovian Decision Processes Using Simulation
A Markovian Decision Process with hidden states and hidden costs
Optimal state-dependent pricing policies for a class of stochastic multiunit service systems
Suboptimal inspection policies for imperfectly observed realistic systems
Solving a general discounted dynamic program by linear programming
Optimal maintenance models for systems subject to failure–A Review
Average Reward Markov Decision Processes with Multiple Cost Constraints
Adaptive optimization and the harvest of biological populations
Allocation of distinguishable servers
A parametric characterization and an \(\epsilon\)-approximation scheme for the minimization of a quasiconcave program
Mean, variance and probabilistic criteria in finite Markov decision processes: A review
Group-by-Group Probabilistic Bisimilarities and Their Logical Characterizations
Average optimality for Markov decision processes in borel spaces: a new condition and approach
On the solvability of Bellman's functional equation for a Markovian decision process
A finite algorithm for the switching control stochastic game
Dynamic Repair Allocation for a k−Out−of−n System Maintained by Distinguishable Repairmen
Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality
On the Optimality of Trunk Reservation in Overflow Processes
Optimal preventive maintenance of a production system with an intermediate buffer
Unnamed Item
Robust shortest path planning and semicontractive dynamic programming
Repeated bargaining with opportunities for learning
Optimal policy for minimizing risk models in Markov decision processes
Preventive replacement for multi-parts systems
Maintenance of a device with age-dependent exponential failures
Remarks on Testing Probabilistic Processes
Quantitative program logic and expected time bounds in probabilistic distributed algorithms.
Unnamed Item
First passage Markov decision processes with constraints and varying discount factors
Joint optimization of \(\overline{X}\) control chart and preventive maintenance policies: a discrete-time Markov chain approach



