Achieving zero constraint violation for concave utility constrained reinforcement learning via primal-dual approach
Publication:6535433
DOI: 10.1613/jair.1.15383
MaRDI QID: Q6535433
Vaneet Aggarwal, Mridul Agarwal, Qinbo Bai, Alec Koppel, Amrit Singh Bedi
Publication date: 20 December 2023
Published in: The Journal of Artificial Intelligence Research (JAIR)
Applications of mathematical programming (90C90); Learning and adaptive systems in artificial intelligence (68T05); Markov and semi-Markov decision processes (90C40); Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20)