Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives (Q4925757)

scientific article; zbMATH DE number 6174824

Language	Label	Description	Also known as
English	Real-Time Reinforcement Learning of Constrained Markov Decision Processes with Weak Derivatives	scientific article; zbMATH DE number 6174824

Statements

0 references

0 references

0 references

12 June 2013

0 references

0 references

stochastic approximation algorithm

0 references

spherical coordinate parametrization

0 references

asymptotic bias

0 references

0 references

math.OC

0 references

0 references

0 references

0 references

0 references