Random search for constrained Markov decision processes with multi-policy improvement (Q895275)
From MaRDI portal
scientific article; zbMATH DE number 6514067
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Random search for constrained Markov decision processes with multi-policy improvement | scientific article; zbMATH DE number 6514067 | |
Statements
Random search for constrained Markov decision processes with multi-policy improvement (English)
26 November 2015
Markov decision processes
random search
policy improvement
constrained optimization
0.92228734
0.91004515
0.9052095
0.90336096
0.90255946
0.89284474
0.89013785