scientific article; zbMATH DE number 6433482
Publication:5249589
zbMath: 1312.90089; MaRDI QID: Q5249589
Ann Nowé, Kristof van Moffaert
Publication date: 6 May 2015
Full work available at URL: http://jmlr.csail.mit.edu/papers/v15/vanmoffaert14a.html
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Ridge regression; shrinkage estimators (Lasso) (62J07) ⋮ Multi-objective and goal programming (90C29) ⋮ Derivative-free methods and methods using generalized derivatives (90C56)
Related Items (8)
Bellman's principle of optimality and deep reinforcement learning for time-varying tasks ⋮ Digital twin-enabled dynamic scheduling with preventive maintenance using a double-layer Q-learning algorithm ⋮ Automated Reinforcement Learning (AutoRL): A Survey and Open Problems ⋮ Revisiting norm optimization for multi-objective black-box problems: a finite-time analysis ⋮ Placing approach-avoidance conflict within the framework of multi-objective reinforcement learning ⋮ Challenges of real-world reinforcement learning: definitions, benchmarks and analysis ⋮ A Gentle Introduction to Reinforcement Learning ⋮ Multi-objective dynamic programming with limited precision