scientific article; zbMATH DE number 3378668
Publication:5649557
zbMath 0238.90006; MaRDI QID Q5649557
Publication date: 1972
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (15)
Bounds for the regret loss in dynamic programming under adaptive control
A unified approach to adaptive control of average reward Markov decision processes
Unnamed Item
Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
Adaptive discounted control for piecewise deterministic Markov processes
Nonparametric estimation and adaptive control in a class of finite Markov decision chains
Estimation and control in multichain processes
Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria
Semi-Markov control models with partially known holding times distribution: discounted and average criteria
Unnamed Item
Adaptive control of diffusion processes with a discounted reward criterion
Unnamed Item
Optimal adaptive control of priority assignment in queueing systems
Adaptive control of discounted Markov decision chains
Nonstationary value-iteration and adaptive control of discounted semi-Markov processes