Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
scientific article - MaRDI portal

scientific article

From MaRDI portal
Publication:3266141

zbMath0091.16001MaRDI QIDQ3266141

Ronald A. Howard

Publication date: 1960


Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (only showing first 100 items - show all)

Provisioning of public health can be designed to anticipate public policy responsesA complexity analysis of policy iteration through combinatorial matrices arising from unique sink orientationsSerial and parallel value iteration algorithms for discounted Markov decision processesAn information-theoretic analysis of return maximization in reinforcement learningCyclic flow-shop scheduling with no-wait constraints and missing operationsThe policy iteration algorithm for a compound Poisson process applied to optimal dividend strategies under a Cramér-Lundberg risk modelMulti-asset portfolio selection problem with transaction costsMarkovian real-time adaptive control of signal systemsDynamic programming, Markov chains, and the method of successive approximationsStable sequential control rules and Markov chainsNumerical approximation of a system of Hamilton-Jacobi-Bellman equations arising in innovation dynamicsSemi-Lipschitz functions and machine learning for discrete dynamical systems on graphsSporadic overtaking optimality in Markov decision problemsGenetic algorithms and call admission to telecommunications networksA learning large neighborhood search for the staff rerostering problemConvergence of deep fictitious play for stochastic differential gamesEfficient incremental planning and learning with multi-valued decision diagramsSymblicit algorithms for mean-payoff and shortest path in monotonic Markov decision processesGeneric uniqueness of the bias vector of finite zero-sum stochastic games with perfect informationComputing semi-stationary optimal policies for multichain semi-Markov decision processesA general decomposition approach for multi-criteria decision treesA mean first passage time genome rearrangement distanceThe stochastic shortest path problem: a polyhedral combinatorics perspectiveSome remarks on cops and drunk robbersWhen to parasitize? A dynamic optimization model of reproductive strategies in a cooperative breederReachability and safety objectives in Markov decision processes on long but finite horizonsDynamic and game theory of infectious disease stigmasAn epistemic approach to stochastic gamesState partitioning based linear program for stochastic dynamic programs: an invariance propertyOptimal timing of disease transmission in an age-structured populationMarkov decision processes in service facilities holding perishable inventoryFinding the \(K\) best policies in a finite-horizon Markov decision processDetection-averse optimal and receding-horizon control for Markov decision processesA conservative index heuristic for routing problems with multiple heterogeneous service facilitiesA superpolynomial lower bound for strategy iteration based on snare memorizationSLAP: specification logic of actions with probabilityStrong polynomiality of policy iterations for average-cost MDPs modeling replacement and maintenance problemsLexicographic refinements in stationary possibilistic Markov decision processesAdaptive \(C^0\) interior penalty methods for Hamilton-Jacobi-Bellman equations with cordes coefficientsOn optimal replacement thresholds with technological expectationsThe average cost of Markov chains subject to total variation distance uncertaintyFuzzy optimality relation for perceptive MDPs-the average caseA multi-period TSP with stochastic regular and urgent demandsExperiences with an interactive museum tour-guide robotMarkov decision processes with sequential sensor measurementsStochastic dynamic programming with non-linear discountingA modified MSA for stochastic control problemsControl of \(M|M|1|N\) queue parameters under constraintsAlgorithms and conditional lower bounds for planning problemsFeedback control of parametrized PDEs via model order reduction and dynamic programming principleMarkov solution processes: modeling human problem solving with procedural knowledge space theoryA review on deep reinforcement learning for fluid mechanicsDynamic repositioning strategy in a bike-sharing system; how to prioritize and how to rebalance a bike stationA mean field games model for finite mixtures of Bernoulli and categorical distributionsSymbolic algorithms for qualitative analysis of Markov decision processes with Büchi objectivesDynamic programming with state-dependent discountingGame-theoretic control of the object's random jump structure in the class of pure strategiesPiracy on the internet: accommodate it or fight it? A dynamic approachDomain decomposition based parallel Howard's algorithmApproximating the minimum cycle meanUnifying temporal and organizational scales in multiscale decision-makingThe complexity of solving reachability games using value and strategy iterationDetermining the optimal strategies for discrete control problems on stochastic networks with discounted costsAverage case analysis of the classical algorithm for Markov decision processes with Büchi objectivesSemi-Markov decision processes with limiting ratio average rewardsOn the existence of relative values for undiscounted multichain Markov decision processesContinuous-time stochastic gamesValuing programs with deterministic and stochastic cyclesIPL: an integration property language for multi-model cyber-physical systemsDimensioning a queue with state-dependent arrival ratesModified policy iteration algorithms are not strongly polynomial for discounted dynamic programmingMonotone mixed finite difference scheme for Monge-Ampère equationA performance-centred approach to optimising maintenance of complex systemsReliability models and analysis of systems with protectionRollout sampling approximate policy iterationArtificial viscosity joint spacetime multigrid method for Hamilton-Jacobi-Bellman and Kolmogorov-Fokker-Planck system arising from mean field gamesCalculating expected incomes in open Markov networks with requests of different classes and different peculiaritiesLong-term values in Markov decision processes, (co)algebraicallyGainfree Leontief substitution flow problemsLearning chordal extensionsA time-replacement policy for multistate systems with aging components under maintenance, from a component perspectiveEstimating the target survival probability in the attackers-target-defenders problemAnalysis of adaptive cost functions for dynamic update policies for QoS routing in hierarchical networksDeciding probabilistic bisimilarity distance one for probabilistic automataSequential identification and adaptive control in stochastic systemsAdaptive optimization and the harvest of biological populationsA parametric characterization and an \(\epsilon\)-approximation scheme for the minimization of a quasiconcave programInfinite horizon Markov decision processes with unknown or variable discount factorsControlled semi-Markov models under long-run average rewardsDispatching to parallel servers. Solutions of Poisson's equation for first-policy improvementMultigrid methods for image registration model based on optimal mass transportComputational comparison of value iteration algorithms for discounted Markov decision processesA network learning model with GERT analysisGenerosity, selfishness and exploitation as optimal greedy strategies for resource sharingThe stochastic opportunistic replacement problem. II: A two-stage solution approachA class of dual fuzzy dynamic programsControlled Markov set-chains under average criteriaLearning-based vs model-free adaptive control of a MAV under wind gustPOMDPs under probabilistic semanticsTool path optimization of selective laser sintering processes using deep learning




This page was built for publication: