Reinforcement learning for long-run average cost.

From MaRDI portal
Publication:1427588

DOI10.1016/S0377-2217(02)00874-3zbMath1102.90374MaRDI QIDQ1427588

Abhijit Gosavi

Publication date: 14 March 2004

Published in: European Journal of Operational Research (Search for Journal in Brave)




Related Items



Cites Work