scientific article

Publication date: 24 March 2006

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

optimal control infinite horizon dynamic programming finite horizon deterministic discrete-time suboptimal control stochastic continuous-time

Mathematics Subject Classification ID

Dynamic programming in optimal control and differential games (49L20) Dynamic programming (90C39) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40) Introductory exposition (textbooks, tutorial papers, etc.) pertaining to operations research and mathematical programming (90-01)

Related Items

Inventory control and pricing for perishable products under age and price dependent stochastic demand, Second-order necessary optimality conditions for a discrete optimal control problem with mixed constraints, Optimal control with learning on the fly: a toy problem, Sampling-rate-dependent probabilistic Boolean networks, A new learning algorithm for optimal stopping, Computational approaches for mixed integer optimal control problems with indicator constraints, Approximate dynamic programming based control of proppant concentration in hydraulic fracturing, Multiscale Q-learning with linear function approximation, Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design, Stochastic modelling and control of antibiotic subtilin production, Centralized systemic risk control in the interbank system: weak formulation and gamma-convergence, Inventory performance under staggered deliveries and autocorrelated demand, A limited-feedback approximation scheme for optimal switching problems with execution delays, Faster rollout search for the vehicle routing problem with stochastic demands and restocking, Optimal decisions for continuous time Markov decision processes over finite planning horizons, Robust economic model predictive control using stochastic information, Dynamic control of a closed two-stage queueing network for outfitting process in shipbuilding, The constrained shortest path tour problem, Stability-constrained Markov decision processes using MPC, Optimal strategies for pay-as-you-go pension finance: a sustainability framework, Using nonlinear model predictive control for dynamic decision problems in economics, Dynamic pricing and advertising of perishable products with inventory holding costs, Continuous-time stochastic games of fixed duration, Reinforcement learning solution for HJB equation arising in constrained optimal control problem, Policies for inventory models with product returns forecast from past demands and past sales, Deep reinforcement learning for wireless sensor scheduling in cyber-physical systems, Asymptotically optimal Bayesian sequential change detection and identification rules, Time-optimal control of large-scale systems of systems using compositional optimization, Dynamic programming with shape-preserving rational spline Hermite interpolation, Numerical analysis of continuous time Markov decision processes over finite horizons, Cost-to-travel functions: a new perspective on optimal and model predictive control, The stochastic shortest path problem: a polyhedral combinatorics perspective, CGMurphi: automatic synthesis of numerical controllers for nonlinear hybrid systems, MIDAS: a mixed integer dynamic approximation scheme, Continuous learning methods in two-buyer pricing problem, Optimal compression of generalized Prandtl-Ishlinskii hysteresis models, Energy-optimal trajectory planning for robot manipulators with holonomic constraints, The Mordukhovich coderivative and the local metric regularity of the solution map to a parametric discrete optimal control problem, Control of nonlinear vibrations using the adjoint method, Minimax estimation with intermittent observations, Generalized Markov models of infectious disease spread: a novel framework for developing dynamic health policies, Submodular optimization problems and greedy strategies: a survey, Using flexible products to cope with demand uncertainty in revenue management, Mixed spatial and temporal decompositions for large-scale multistage stochastic optimization problems, A rollout algorithm framework for heuristic solutions to finite-horizon stochastic dynamic programs, Symplectic Runge-Kutta discretization of a regularized forward-backward sweep iteration for optimal control problems, An outer-approximation approach for information-maximizing sensor selection, Model-free control of Lorenz chaos using an approximate optimal control strategy, Accuracy and response-time distributions for decision-making: linear perfect integrators versus nonlinear attractor-based neural circuits, Deconvolution estimation of mixture distributions with boundaries, Convergence analysis of the deep neural networks based globalized dual heuristic programming, Optimal low-thrust trajectories to asteroids through an algorithm based on differential dynamic programming, Toolgraph design of optimal and feasible control strategies for time-varying dynamical systems, Performance analysis of a manufacturing line operated under optimal surplus-based production control, Explicit/multi-parametric model predictive control (MPC) of linear discrete-time systems by dynamic and multi-parametric programming, Using negotiable features for prescription problems, Optimal, quality-aware scheduling of data consumption in mobile ad hoc networks, Symmetry reduction for dynamic programming, A new synthetic output tracking scheme for non-minimum phase affine nonlinear systems, Maximizing the set of recurrent states of an MDP subject to convex constraints, Stochastic model predictive control of photovoltaic battery systems using a probabilistic forecast model, A multi-parametric programming approach for constrained dynamic programming problems, Sensitivity-based nested partitions for solving finite-horizon Markov decision processes, Large-scale unit commitment under uncertainty: an updated literature survey, A matrix approach to modeling and optimization for dynamic games with random entrance, Optimal investment policy with fixed adjustment costs and complete irreversibility, Non-constant discounting and differential games with random time horizon, Switched LQG control for linear systems with multiple sensing methods, Markov decision processes with sequential sensor measurements, Optimal control for estimation in partially observed elliptic and hypoelliptic linear stochastic differential equations, Robust MPC via min-max differential inequalities, Modeling uncertain passenger arrivals in the elevator dispatching problem with destination control, Approximating convex functions via non-convex oracles under the relative noise model, A multi-channel transmission schedule for remote state estimation under DoS attacks, A minmax regret price control model for managing perishable products with uncertain parameters, Differential stability of convex discrete optimal control problems, Dynamic expediting of an urgent order with uncertain progress, Stochastic programming for off-line adaptive radiotherapy, Dynamic control mechanisms for revenue management with flexible products, Optimal node visitation in acyclic stochastic digraphs with multi-threaded traversals and internal visitation requirements, Capacity reservation and utilization for a manufacturer with uncertain capacity and demand, Sequential Monte Carlo pricing of American-style options under stochastic volatility models, Robust stabilizing inventory control in supply networks under uncertainty of external demand and supply time-delays, Lagrangian approximations for stochastic reachability of a target tube, Addressing state space multicollinearity in solving an ozone pollution dynamic control problem, Joint pricing and inventory decisions for substitutable and perishable products under demand uncertainty, Sectorization and configuration transition in airspace design, Linear programming relaxations and marginal productivity index policies for the buffer sharing problem, Mordukhovich subgradients of the value function to a parametric discrete optimal control problem, Analysis of a class of dynamic programming models for multi-stage uncertain systems, Asymptotically optimal index policies for an abandonment queue with convex holding cost, Performance bounds for linear stochastic control, Constructive design of open-loop Nash equilibrium strategies that admit a feedback synthesis in LQ games, A deep reinforcement learning framework for continuous intraday market bidding, The Lipschitz properties of the value function and the solution map to a parametric discrete optimal control problem, Dynamic programming strategy based on a type-2 fuzzy wavelet neural network, Fully polynomial time $(\Sigma,\Pi)$-approximation schemes for continuous nonlinear newsvendor and continuous stochastic dynamic programs, Multi-agent reinforcement learning: a selective overview of theories and algorithms, Optimal price-threshold control for battery operation with aging phenomenon: a quasiconvex optimization approach, A decomposition method for large scale MILPs, with performance guarantees and a power system application, The constrained forward shortest path tour problem: Mathematical modeling and GRASP approximate solutions, A simple but powerful simulated certainty equivalent approximation method for dynamic stochastic problems, The policy graph decomposition of multistage stochastic programming problems, Deep reinforcement trading with predictable returns, Generative adversarial networks applied to synthetic financial scenarios generation, Optimal output tracking control of linear discrete-time systems with unknown dynamics by adaptive dynamic programming and output feedback, Solving nonlinear and dynamic programming equations on extended $b$-metric spaces with the fixed-point technique, Dual SDDP for risk-averse multistage stochastic programs, Ergodicity and large deviations in physical systems with stochastic dynamics, Inverse optimal control for averaged cost per stage linear quadratic regulators, On the sample complexity of actor-critic method for reinforcement learning with function approximation, Risk-neutral valuation of GLWB riders in variable annuities, A general optimization framework for dynamic time warping, Discrete‐time decentralized linear quadratic control for linear time‐varying systems, Simulation-based search, The computability of LQR and LQG control, Time Consistency of the Mean-Risk Problem, On the no-gap second-order optimality conditions for a discrete optimal control problem with mixed constraints, Bellman's principle of optimality and deep reinforcement learning for time-varying tasks, Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems, Unnamed Item, Adaptive critic motion control design of autonomous wheeled mobile robot by dual heuristic programming, Algorithms for Optimal Control of Stochastic Switching Systems, Scalable Reinforcement Learning for Multiagent Networked Systems, Optimal routeing in two-queue polling systems, Shape-preserving dynamic programming, Scalable Online Planning for Multi-Agent MDPs, Optimization of vehicle speed for batches to minimize supply chain cost under uncertain demand, On Some Special Network Flow Problems: The Shortest Path Tour Problems, Stochastic Optimal Control as a Theory of Brain-Machine Interface Operation, A multi-objective approach for PH-graphs with applications to stochastic shortest paths, Model predictive control for drift counteraction of stochastic constrained linear systems, LQG control and linear policies for noisy communication links with synchronized side information at the decoder, Stress scenario generation for solvency and risk management, Computing Behavioral Relations for Probabilistic Concurrent Systems, Minkowski-Bellman inequality and equation, The valuation of GMWB variable annuities under alternative fund distributions and policyholder behaviours, Error bounds for stochastic shortest path problems, A queueing model for customer rescheduling and no-shows in service systems, Search Under Accumulated Pressure, Anticipation in leisure—Effects on labor‐leisure choice, Generalized differentiation of a class of normal cone operators and sensitivity of optimal control problems, Suboptimal reduced control of unknown nonlinear singularly perturbed systems via reinforcement learning, Valuation of general GMWB annuities in a low interest rate environment, Exactly optimal Bayesian quickest change detection for hidden Markov models, Continuity of discounted values and the structure of optimal policies for <scp>periodic‐review</scp> inventory systems with setup costs, Optimal control on graphs: existence, uniqueness, and long-term behavior, A CONTINUOUS REVIEW MODEL WITH GENERAL SHELF AGE AND DELAY-DEPENDENT INVENTORY COSTS, Managing Patient Admissions in a Neurology Ward, Route intelligent recommendation model and algorithm under the Pythagorean hesitant fuzzy linguistic environment, A converse sum of squares Lyapunov function for outer approximation of minimal attractor sets of nonlinear systems, SOLUTIONS AND DIAGNOSTICS OF SWITCHING PROBLEMS WITH LINEAR STATE DYNAMICS, Direct Optimal Control and Model Predictive Control, Stochastic switching for partially observable dynamics and optimal asset allocation, A discrete-time optimal filtering approach for non-linear systems as a stable discretization of the Mortensen observer, Dynamic Analysis of Naive Adaptive Brain-Machine Interfaces, Approximate dynamic programming for stochastic $N$-stage optimization with application to optimal consumption under uncertainty, Forward or backward simulation? A comparative study, Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies, Hidden Markov models for stochastic thermodynamics, Provably Near-Optimal Approximation Schemes for Implicit Stochastic and Sample-Based Dynamic Programs, Exact Simulation of Variance Gamma-Related OU Processes: Application to the Pricing of Energy Derivatives, An integrated approach to solving influence diagrams and finite-horizon partially observable decision processes, Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon, Large-scale dynamic system optimization using dual decomposition method with approximate dynamic programming, Modelling and solving resource allocation problems via a dynamic programming approach, Automated Generation of Optimal Controllers through Model Checking Techniques, Fast Pricing of Energy Derivatives with Mean-Reverting Jump-diffusion Processes, Application of branching cells to QoS aware service orchestrations, Architecture and robustness tradeoffs in speed-scaled queues with application to energy management, Time-dependent and independent control rules for coordinated production and pricing under demand uncertainty and finite planning horizons, A max-plus based fundamental solution for a class of discrete time linear regulator problems, MPC‐based approximate dual controller by information matrix maximization, Allocation planning under service-level contracts, Suboptimal Fault Tolerant Control Design with the Use of Discrete Optimization, A Retrograde Approximation Algorithm for Multi-player Can’t Stop, Expected utility and catastrophic risk in a stochastic economy-climate model, On the structure of the set of active sets in constrained linear quadratic regulation, Event-triggered minimax state estimation with a relative entropy constraint, Shortest path tour problem with time windows, Dynamic programming and suboptimal control: a survey from ADP to MPC, Continuous-Time Robust Dynamic Programming, A Benders decomposition approach to product location in carousel storage systems, Comments on: Recent progress on the combinatorial diameter of polytopes and simplicial complexes, Dynamic exploitation of myopic best response, Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems, Subgradients of value functions in parametric dynamic programming, A dual sourcing inventory model for modal split transport: structural properties and optimal solution, An algorithm for solving a class of multiplayer feedback-Nash differential games, A stochastic goal programming model to derive stable cash management policies, Pathwise Dynamic Programming, Recomposable restricted finite state machines: definition and solution approaches, Approximate solution to optimal linear quadratic Gaussian control over non-acknowledgment networks, Suboptimal Policies for Stochastic $$N$$-Stage Optimization: Accuracy Analysis and a Case Study from Optimal Consumption, On the Complexity of Value Iteration, Diffusion methods for classification with pairwise relationships, Predictive control of discrete time stochastic nonlinear state space dynamical systems: a particle nonparametric approach, Optimal Radio-Mode Switching for Wireless Networked Control, Real-time algorithms for the bilevel double-deck elevator dispatching problem, Dynamic Programming Subject to Total Variation Distance Ambiguity, Robust shortest path planning and semicontractive dynamic programming, Subgradients of the Value Function via Multiplier Sets of Parametric Convex Discrete Optimal Control Problems, A Sufficient Statistic for Influence in Structured Multiagent Environments, Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design, MAX-plus fundamental solution semigroups for a class of difference Riccati equations, A decomposition algorithm for Nash equilibria in intersection management, Worst-case relative cost optimal control for dynamic systems with finite admissible disturbance sequence sets, Unnamed Item, On time-optimal trajectories in non-uniform mediums, Unnamed Item, Performance optimization for a class of generalized stochastic Petri nets, Second-order necessary optimality conditions for a discrete optimal control problem, Large-scale unit commitment under uncertainty, Dynamic programming with Hermite approximation, Solving the shortest path tour problem