Infinite Time Horizon Maximum Causal Entropy Inverse Reinforcement Learning
Publication:4682336
DOI: 10.1109/TAC.2017.2775960, zbMath: 1423.93427, OpenAlex: W2070469928, MaRDI QID: Q4682336
Nicholas Bambos, Michael Bloem, Zhengyuan Zhou
Publication date: 18 September 2018
Published in: IEEE Transactions on Automatic Control
Full work available at URL: https://doi.org/10.1109/tac.2017.2775960
Learning and adaptive systems in artificial intelligence (68T05)
Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20)
Stochastic learning and adaptive control (93E35)
Related Items (4)
Model-based inverse reinforcement learning for deterministic systems ⋮ Task-guided IRL in POMDPs that scales ⋮ Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies ⋮ Inverse reinforcement learning for multi-player noncooperative apprentice games