Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Robust Control of Markov Decision Processes with Uncertain Transition Matrices - MaRDI portal

Robust Control of Markov Decision Processes with Uncertain Transition Matrices

From MaRDI portal
Publication:5322137

DOI10.1287/opre.1050.0216zbMath1165.90674OpenAlexW1965878388MaRDI QIDQ5322137

No author found.

Publication date: 18 July 2009

Published in: Operations Research (Search for Journal in Brave)

Full work available at URL: https://semanticscholar.org/paper/6db16608fccddef51202af84112b34cfebfbe20a




Related Items

Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial timeData-driven remanufacturing planning with parameter uncertaintyParameter synthesis in Markov models: a gentle surveyA few lessons learned in reinforcement learning for quadcopter attitude controlStochastic control for organ donations: a reviewJoint chance-constrained Markov decision processesDistributionally Robust Strategy Synthesis for Switched Stochastic SystemsInterval Markov Decision Processes with Continuous Action-SpacesRobust Control for Dynamical Systems with Non-Gaussian Noise via Formal AbstractionsA survey of nonlinear robust optimizationRobust optimization in countably infinite linear programsVariable demand and multi-commodity flow in Markovian network equilibriumScenario-Based Verification of Uncertain MDPsAmbiguous partially observable Markov decision processes: structural results and applicationsProbabilistic Timed Automata with One Clock and Initialised Clock-Dependent ProbabilitiesRobust decomposable Markov decision processes motivated by allocating school budgetsTight Approximations of Dynamic Risk MeasuresOracle-Based Robust Optimization via Online LearningPartially observable Markov decision processes incorporatingQuantitative verification and strategy synthesis for stochastic gamesDynamic programming for deterministic discrete-time systems with uncertain gainOptimal Information Blending with Measurements in the L2 SphereReachability analysis of uncertain systems using bounded-parameter Markov decision processesDeterministic policies based on maximum regrets in MDPs with imprecise rewardsLipschitzness is all you need to tame off-policy generative adversarial imitation learningData-Driven Pricing for a New ProductRobust Markov Decision Processes with Data-Driven, Distance-Based Ambiguity SetsRobust stability, ℋ2 analysis and stabilisation of discrete-time Markov jump linear systems with uncertain probability matrixPoisoning finite-horizon Markov decision processes at design timeUncertainty quantification for Markov chain modelsMinimax and risk averse multistage stochastic programmingConstrained Markov decision processes with uncertain costsEstimating permanent price impact via machine learningQuantile Markov Decision ProcessesRobust Markov control processesZ-relation-based multistage decision makingVariance-constrained actor-critic algorithms for discounted and average reward MDPsRobust solutions to Stackelberg games: addressing bounded rationality and limited observations in human cognitionRisk-Averse Stochastic Programming: Time Consistency and Optimal StoppingMinimax Q-learning control for linear systems using the Wasserstein metricON ROBUST MULTI-PERIOD PRE-COMMITMENT AND TIME-CONSISTENT MEAN-VARIANCE PORTFOLIO OPTIMIZATIONRisk-aware controller for autonomous vehicles using model-based collision prediction and reinforcement learningCoefficients of ergodicity for Markov chains with uncertain parametersUnnamed ItemRobustness to Incorrect System Models in Stochastic ControlA graph-theoretic-based method for analyzing conduction problemsEfficient solutions to factored MDPs with imprecise transition probabilitiesUsing mathematical programming to solve factored Markov decision processes with imprecise probabilitiesOnline First-Order Framework for Robust Convex OptimizationLearning parametric policies and transition probability models of Markov decision processes from dataUnnamed ItemRobust topological policy iteration for infinite horizon bounded Markov decision processesComputation of weighted sums of rewards for concurrent MDPsTechnical Note—Time Inconsistency of Optimal Policies of Distributionally Robust Inventory ModelsDistributionally Robust Partially Observable Markov Decision Process with Moment-Based AmbiguityOn the Complexity of Reachability in Parametric Markov Decision ProcessesA dynamic inventory rationing problem with uncertain demand and production ratesRobust bounds and optimization at the large deviations scale for queueing models via Rényi divergenceA survey of decision making and optimization under uncertaintyUnnamed ItemUnnamed ItemPolicy iteration for robust nonstationary Markov decision processesAn analysis of model-based interval estimation for Markov decision processesDiscrete time Markov chains with interval probabilitiesLight robustness in the optimization of Markov decision processes with uncertain parametersReinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPsRectangular Sets of Probability MeasuresRobust Actuarial Risk AnalysisA dynamic programming approach to adjustable robust optimizationPolicy-based branch-and-bound for infinite-horizon multi-model Markov decision processesTutorial on risk neutral, distributionally robust and risk averse multistage stochastic programmingTime (in)consistency of multistage distributionally robust inventory models with moment constraintsAsymptotic optimality of the generalized \(c\mu\) rule under model uncertaintyLikelihood robust optimization for data-driven problemsSelected topics in robust convex optimizationRobust response-guided dosingRobust analysis of discounted Markov decision processes with uncertain transition probabilitiesRobust Adversarial Risk Analysis: A Level-k ApproachExtended Laplace principle for empirical measures of a Markov chainContinuous-Time Robust Dynamic ProgrammingRobust Adaptive Routing Under UncertaintyDiscrete Approximation and Quantification in Distributionally Robust OptimizationQuantifying Distributional Model Risk via Optimal TransportData Uncertainty in Markov Chains: Application to Cost-Effectiveness Analyses of Medical InnovationsRobust Analysis in Stochastic Simulation: Computation and Performance GuaranteesDistributionally robust optimal control and MDP modelingIMPRECISE MARKOV CHAINS AND THEIR LIMIT BEHAVIORPartially observable Markov decision processes with imprecise parametersRobust Control of Partially Observable Failing SystemsROBUST ASSET ALLOCATION WITH BENCHMARKED OBJECTIVESRobust Sensitivity Analysis for Stochastic SystemsReinforcement Learning in Robust Markov Decision ProcessesRobust MDPs with k-Rectangular UncertaintyROBUST DYNAMIC PRICING OVER INFINITE HORIZON IN THE PRESENCE OF MODEL UNCERTAINTYDistributionally robust optimization for sequential decision-makingSensitivity Analysis in Markov Decision Processes with Uncertain Reward ParametersGittins' theorem under uncertaintyLearning and planning in partially observable environments without prior domain knowledgeRisk-averse policy optimization via risk-neutral policy optimizationDistributionally robust modeling of optimal controlRobust control of the multi-armed bandit problemConcurrent MDPs with Finite Markovian PoliciesDistributionally Robust Markov Decision Processes and Their Connection to Risk MeasuresDistributionally Robust Inventory Control When Demand Is a MartingaleFormulation and properties of a divergence used to compare probability measures without absolute continuityToward theoretical understandings of robust Markov decision processes: sample complexity and asymptoticsAlgorithmic aspects of mean-variance optimization in Markov decision processes