Inverse reinforcement learning in contextual MDPs
Publication:2071371
DOI: 10.1007/S10994-021-05984-X
OpenAlex: W3162904728
MaRDI QID: Q2071371
Stav Belogolovsky, Shie Mannor, Tom Zahavy, Philip Korsunsky, Chen Tessler
Publication date: 28 January 2022
Published in: Machine Learning
Full work available at URL: https://arxiv.org/abs/1905.09710
Cites Work
- Near-optimal reinforcement learning in polynomial time
- Mirror descent and nonlinear projected subgradient methods for convex optimization
- Random gradient-free minimization of convex functions
- A Linearly Convergent Variant of the Conditional Gradient Algorithm under Strong Convexity, with Applications to Online and Stochastic Optimization
- A Stochastic Approximation Method