Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Estimation and control in multichain processes - MaRDI portal

Estimation and control in multichain processes (Q1176867)

From MaRDI portal

Jump to:navigation, search

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use this page instead for the normal view: Estimation and control in multichain processes

scientific article; zbMATH DE number 12606

Language	Label	Description	Also known as
English	Estimation and control in multichain processes	scientific article; zbMATH DE number 12606

Statements

scholarly article

0 references

Estimation and control in multichain processes (English)

0 references

Hans-Joachim Girlich

0 references

A. A. Sokolichin

0 references

Annals of Operations Research

0 references

publication date

25 June 1992

0 references

The paper considers Markovian decision processes in discrete time with transition probabilities depending on an unknown parameter which may change step by step. In the case of convergence of such a parameter sequence a policy maximizing the average expected reward over an infinite horizon is looked for. Under continuity conditions, the uniform optimality of a policy based on ``estimation and control'' for some multichain models is shown.

0 references

zbMATH Keywords

adaptive controls

0 references

discrete time

0 references

average expected reward

0 references

Ryszarda Rempała

0 references

MaRDI profile type

0 references

Optimal decision procedures for finite Markov chains. Part II: Communicating systems

0 references

Discrete Dynamic Programming

0 references

Discounted Dynamic Programming

0 references

0 references

The Inventory Problem: II. Case of Unknown Distributions of Demand

0 references

0 references

0 references

0 references

0 references

0 references

A unified approach to adaptive control of average reward Markov decision processes

0 references

0 references

0 references

0 references

Estimation and control in Markov chains

0 references

0 references

Optimality equations and sensitive optimality in bounded Markov decision processes<sup>1</sup>

0 references

Bayesian dynamic programming

0 references

0 references

Existenz durelisehnittsoptimaler Strategien in einem Markoffschen Entscheidungsmodell mit unbekaimter Parameterfolge

0 references

0 references

0 references

full work available at URL

https://doi.org/10.1007/bf02204826

0 references

Identifiers

zbMATH Open document ID

0 references

10.1007/BF02204826

0 references

Mathematics Subject Classification ID

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1176867

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q1176867&oldid=42288199"