
A least squares temporal difference actor–critic algorithm with applications to warehouse management

From MaRDI portal
Publication:3120552

DOI: 10.1002/nav.21481 ⋮ zbMath: 1407.90334 ⋮ OpenAlex: W1964782533 ⋮ MaRDI QID: Q3120552

Ioannis Ch. Paschalidis, Reza Moazzez Estanjini, Keyong Li

Publication date: 5 March 2019

Published in: Naval Research Logistics (NRL)

Full work available at URL: https://doi.org/10.1002/nav.21481


zbMATH Keywords

Markov decision processes ⋮ vehicle routing ⋮ partial observability ⋮ actor-critic algorithms ⋮ approximate dynamic programming ⋮ warehouse management


Mathematics Subject Classification ID

Dynamic programming (90C39) ⋮ Markov and semi-Markov decision processes (90C40)


Related Items (2)

Neural circuits for learning context-dependent associations of stimuli ⋮ Performance optimization for a class of generalized stochastic Petri nets




Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3120552&oldid=16208994"
This page was last edited on 3 February 2024, at 21:51.