scientific article; zbMATH DE number 7626804
Publication:5053336
Sebastian Schulze, Katja Hofmann, Shimon Whiteson, Kyriacos Shiarlis, Luisa M. Zintgraf, Yarin Gal, Leo Feng, Maximilian Igl, Cong Lu
Publication date: 6 December 2022
Full work available at URL: https://jmlr.csail.mit.edu/papers/v22/21-0657.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Keywords: reinforcement learning, meta learning, recurrent networks, approximate variational inference, Bayes-adaptive Markov decision processes
Related Items (3)
Embedding active learning in batch-to-batch optimization using reinforcement learning ⋮ Reward Maximization Through Discrete Active Inference ⋮ Unnamed Item
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Planning and acting in partially observable stochastic domains
- Near-optimal reinforcement learning in polynomial time
- Convex Optimization: Algorithms and Complexity
- Scalable and Efficient Bayes-Adaptive Reinforcement Learning Based on Monte-Carlo Tree Search
- 10.1162/153244303765208377
- An Introduction to Variational Autoencoders