scientific article
From MaRDI portal
Publication:2880944
zbMath1235.68196MaRDI QIDQ2880944
No author found.
Publication date: 17 April 2012
Full work available at URL: http://www.jmlr.org/papers/v10/taylor09a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items (33)
Fast rates by transferring from auxiliary hypotheses ⋮ Unnamed Item ⋮ Making friends on the fly: cooperating with new teammates ⋮ Optimal design of priors constrained by external predictors ⋮ Scalable transfer learning in heterogeneous, dynamic environments ⋮ Visual transfer for reinforcement learning via gradient penalty based Wasserstein domain confusion ⋮ A Bayesian approach to (online) transfer learning: theory and algorithms ⋮ Embedding active learning in batch-to-batch optimization using reinforcement learning ⋮ Bayesian policy reuse ⋮ Robust optimal well control using an adaptive multigrid reinforcement learning framework ⋮ A taxonomy for similarity metrics between Markov decision processes ⋮ Overlapping layered learning ⋮ Transferring knowledge as heuristics in reinforcement learning: a case-based approach ⋮ Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning ⋮ Preference-based reinforcement learning: a formal framework and a policy iteration algorithm ⋮ Quantum classifiers for domain adaptation ⋮ Multi-fidelity reinforcement learning framework for shape optimization ⋮ Action-model acquisition for planning via transfer learning ⋮ Progressive learning: a deep learning framework for continual learning ⋮ Reward is enough ⋮ Abstraction in data-sparse task transfer ⋮ A physics-informed deep learning framework for inversion and surrogate modeling in solid mechanics ⋮ A transfer forecasting model for container throughput guided by discrete PSO ⋮ Phase portraits as movement primitives for fast humanoid robot control ⋮ POD-DL-ROM: enhancing deep learning-based reduced order models for nonlinear parametrized PDEs by proper orthogonal decomposition ⋮ Permutation flow shop scheduling with multiple lines and demand plans using reinforcement learning ⋮ Unnamed Item ⋮ Hierarchical clustering optimizes the tradeoff between compositionality and expressivity of task structures for flexible reinforcement learning ⋮ Reinforcement learning: an industrial perspective ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Model-based Reinforcement Learning: A Survey ⋮ A Gentle Introduction to Reinforcement Learning
This page was built for publication: