Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Merge two items
In other projects
MaRDI portal item
Discussion
View source
View history
Purge
English
Log in

A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL

From MaRDI portal
Publication:2711577
Jump to:navigation, search

DOI10.1017/S0269964800142081zbMath1029.93065OpenAlexW1965413238MaRDI QIDQ2711577

Vivek S. Borkar

Publication date: 2 February 2004

Published in: Probability in the Engineering and Informational Sciences (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1017/s0269964800142081


zbMATH Keywords

nonlinear controllearningalmost sure convergencecompact state spacediscrete-time stochastic controlcompact action space\(Q\)-learning algorithmsimulation based algorithm


Mathematics Subject Classification ID

Nonlinear systems in control theory (93C10) Discrete-time control/observation systems (93C55) Stochastic learning and adaptive control (93E35)


Related Items (1)

Stochastic approximation algorithms: overview and recent trends.







This page was built for publication: A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2711577&oldid=15561663"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
This page was last edited on 3 February 2024, at 11:04.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki