Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
scientific article - MaRDI portal

scientific article

From MaRDI portal

Publication:3266141

Jump to:navigation, search

zbMath0091.16001MaRDI QIDQ3266141

Ronald A. Howard

Publication date: 1960

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

applications of probability theory and statistics

Related Items (only showing first 100 items - show all)

Provisioning of public health can be designed to anticipate public policy responses ⋮ A complexity analysis of policy iteration through combinatorial matrices arising from unique sink orientations ⋮ Serial and parallel value iteration algorithms for discounted Markov decision processes ⋮ An information-theoretic analysis of return maximization in reinforcement learning ⋮ Cyclic flow-shop scheduling with no-wait constraints and missing operations ⋮ The policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér-Lundberg risk model ⋮ Multi-asset portfolio selection problem with transaction costs ⋮ Markovian real-time adaptive control of signal systems ⋮ Dynamic programming, Markov chains, and the method of successive approximations ⋮ Stable sequential control rules and Markov chains ⋮ Numerical approximation of a system of Hamilton-Jacobi-Bellman equations arising in innovation dynamics ⋮ Semi-Lipschitz functions and machine learning for discrete dynamical systems on graphs ⋮ Sporadic overtaking optimality in Markov decision problems ⋮ Genetic algorithms and call admission to telecommunications networks ⋮ A learning large neighborhood search for the staff rerostering problem ⋮ Convergence of deep fictitious play for stochastic differential games ⋮ Efficient incremental planning and learning with multi-valued decision diagrams ⋮ Symblicit algorithms for mean-payoff and shortest path in monotonic Markov decision processes ⋮ Generic uniqueness of the bias vector of finite zero-sum stochastic games with perfect information ⋮ Computing semi-stationary optimal policies for multichain semi-Markov decision processes ⋮ A general decomposition approach for multi-criteria decision trees ⋮ A mean first passage time genome rearrangement distance ⋮ The stochastic shortest path problem: a polyhedral combinatorics perspective ⋮ Some remarks on cops and drunk robbers ⋮ When to parasitize? A dynamic optimization model of reproductive strategies in a cooperative breeder ⋮ Reachability and safety objectives in Markov decision processes on long but finite horizons ⋮ Dynamic and game theory of infectious disease stigmas ⋮ An epistemic approach to stochastic games ⋮ State partitioning based linear program for stochastic dynamic programs: an invariance property ⋮ Optimal timing of disease transmission in an age-structured population ⋮ Markov decision processes in service facilities holding perishable inventory ⋮ Finding the \(K\) best policies in a finite-horizon Markov decision process ⋮ Detection-averse optimal and receding-horizon control for Markov decision processes ⋮ A conservative index heuristic for routing problems with multiple heterogeneous service facilities ⋮ A superpolynomial lower bound for strategy iteration based on snare memorization ⋮ SLAP: specification logic of actions with probability ⋮ Strong polynomiality of policy iterations for average-cost MDPs modeling replacement and maintenance problems ⋮ Lexicographic refinements in stationary possibilistic Markov decision processes ⋮ Adaptive \(C^0\) interior penalty methods for Hamilton-Jacobi-Bellman equations with cordes coefficients ⋮ On optimal replacement thresholds with technological expectations ⋮ The average cost of Markov chains subject to total variation distance uncertainty ⋮ Fuzzy optimality relation for perceptive MDPs-the average case ⋮ A multi-period TSP with stochastic regular and urgent demands ⋮ Experiences with an interactive museum tour-guide robot ⋮ Markov decision processes with sequential sensor measurements ⋮ Stochastic dynamic programming with non-linear discounting ⋮ A modified MSA for stochastic control problems ⋮ Control of \(M|M|1|N\) queue parameters under constraints ⋮ Algorithms and conditional lower bounds for planning problems ⋮ Feedback control of parametrized PDEs via model order reduction and dynamic programming principle ⋮ Markov solution processes: modeling human problem solving with procedural knowledge space theory ⋮ A review on deep reinforcement learning for fluid mechanics ⋮ Dynamic repositioning strategy in a bike-sharing system; how to prioritize and how to rebalance a bike station ⋮ A mean field games model for finite mixtures of Bernoulli and categorical distributions ⋮ Symbolic algorithms for qualitative analysis of Markov decision processes with Büchi objectives ⋮ Dynamic programming with state-dependent discounting ⋮ Game-theoretic control of the object's random jump structure in the class of pure strategies ⋮ Piracy on the internet: accommodate it or fight it? A dynamic approach ⋮ Domain decomposition based parallel Howard's algorithm ⋮ Approximating the minimum cycle mean ⋮ Unifying temporal and organizational scales in multiscale decision-making ⋮ The complexity of solving reachability games using value and strategy iteration ⋮ Determining the optimal strategies for discrete control problems on stochastic networks with discounted costs ⋮ Average case analysis of the classical algorithm for Markov decision processes with Büchi objectives ⋮ Semi-Markov decision processes with limiting ratio average rewards ⋮ On the existence of relative values for undiscounted multichain Markov decision processes ⋮ Continuous-time stochastic games ⋮ Valuing programs with deterministic and stochastic cycles ⋮ IPL: an integration property language for multi-model cyber-physical systems ⋮ Dimensioning a queue with state-dependent arrival rates ⋮ Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming ⋮ Monotone mixed finite difference scheme for Monge-Ampère equation ⋮ A performance-centred approach to optimising maintenance of complex systems ⋮ Reliability models and analysis of systems with protection ⋮ Rollout sampling approximate policy iteration ⋮ Artificial viscosity joint spacetime multigrid method for Hamilton-Jacobi-Bellman and Kolmogorov-Fokker-Planck system arising from mean field games ⋮ Calculating expected incomes in open Markov networks with requests of different classes and different peculiarities ⋮ Long-term values in Markov decision processes, (co)algebraically ⋮ Gainfree Leontief substitution flow problems ⋮ Learning chordal extensions ⋮ A time-replacement policy for multistate systems with aging components under maintenance, from a component perspective ⋮ Estimating the target survival probability in the attackers-target-defenders problem ⋮ Analysis of adaptive cost functions for dynamic update policies for QoS routing in hierarchical networks ⋮ Deciding probabilistic bisimilarity distance one for probabilistic automata ⋮ Sequential identification and adaptive control in stochastic systems ⋮ Adaptive optimization and the harvest of biological populations ⋮ A parametric characterization and an \(\epsilon\)-approximation scheme for the minimization of a quasiconcave program ⋮ Infinite horizon Markov decision processes with unknown or variable discount factors ⋮ Controlled semi-Markov models under long-run average rewards ⋮ Dispatching to parallel servers. Solutions of Poisson's equation for first-policy improvement ⋮ Multigrid methods for image registration model based on optimal mass transport ⋮ Computational comparison of value iteration algorithms for discounted Markov decision processes ⋮ A network learning model with GERT analysis ⋮ Generosity, selfishness and exploitation as optimal greedy strategies for resource sharing ⋮ The stochastic opportunistic replacement problem. II: A two-stage solution approach ⋮ A class of dual fuzzy dynamic programs ⋮ Controlled Markov set-chains under average criteria ⋮ Learning-based vs model-free adaptive control of a MAV under wind gust ⋮ POMDPs under probabilistic semantics ⋮ Tool path optimization of selective laser sintering processes using deep learning

This page was built for publication:

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3266141&oldid=16463227"