A least squares temporal difference actor–critic algorithm with applications to warehouse management (Q3120552)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: A least squares temporal difference actor–critic algorithm with applications to warehouse management |
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | A least squares temporal difference actor–critic algorithm with applications to warehouse management |
scientific article |
Statements
A least squares temporal difference actor–critic algorithm with applications to warehouse management (English)
0 references
5 March 2019
0 references
Markov decision processes
0 references
partial observability
0 references
approximate dynamic programming
0 references
actor-critic algorithms
0 references
warehouse management
0 references
vehicle routing
0 references
0.84386337
0 references
0.8355564
0 references
0.8340347
0 references
0 references
0.8273796
0 references
0.82500887
0 references