Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
From MaRDI portal
Publication:5076329
DOI10.1613/jair.1.13326OpenAlexW4220923746WikidataQ113424376 ScholiaQ113424376MaRDI QIDQ5076329
Yuexiang Zhai, Christina Baek, Zhengyuan Zhou, Jiantao Jiao, Yi Ma
Publication date: 16 May 2022
Published in: Journal of Artificial Intelligence Research (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2107.03961
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Stochastic dynamic programming with factored representations
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Quantitative Multi-objective Verification for Probabilistic Systems
- Algorithms for Reinforcement Learning
- Planning Algorithms
- Recent advances in hierarchical reinforcement learning
This page was built for publication: Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning