A policy gradient method for semi-Markov decision processes with application to call admission control
DOI: 10.1016/j.ejor.2006.02.023 · zbMath: 1163.90790 · OpenAlex: W2169293926 · MaRDI QID: Q859693
Sumeetpal S. Singh, Arnaud Doucet, Vladislav B. Tadić
Publication date: 16 January 2007
Published in: European Journal of Operational Research
Full work available at URL: https://doi.org/10.1016/j.ejor.2006.02.023
Related Items (9)
- A reinforcement-learning approach for admission control in distributed network service systems
- A minimization problem of the risk probability in first passage semi-Markov decision processes with loss rates
- Finite horizon semi-Markov decision processes with application to maintenance systems
- Semiconductor final test scheduling with Sarsa(λ, k) algorithm
- Approximate dynamic programming for capacity allocation in the service industry
- Performance analysis for controlled semi-Markov systems with application to maintenance
- Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning
- Look-ahead control of conveyor-serviced production station by using potential-based online policy iteration
- Flow shop scheduling with reinforcement learning
Cites Work
- Markov chains and stochastic stability
- Reinforcement learning for long-run average cost
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Gradient convergence in gradient methods with errors
- Semi-Markov decision problems and performance sensitivity analysis
- Integrated voice/data call admission control for wireless DS-CDMA systems
- On the convergence of temporal-difference learning with linear function approximation