scientific article; zbMATH DE number 6433482 - MaRDI portal

From MaRDI portal

Publication:5249589

zbMath: 1312.90089; MaRDI QID: Q5249589

Ann Nowé, Kristof van Moffaert

Publication date: 6 May 2015

Full work available at URL: http://jmlr.csail.mit.edu/papers/v15/vanmoffaert14a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

zbMATH Keywords

hypervolume; multiple criteria analysis; reinforcement learning; multi-objective; Pareto sets

Mathematics Subject Classification ID

Ridge regression; shrinkage estimators (Lasso) (62J07) ⋮ Multi-objective and goal programming (90C29) ⋮ Derivative-free methods and methods using generalized derivatives (90C56)

Related Items (8)

Bellman's principle of optimality and deep reinforcement learning for time-varying tasks ⋮ Digital twin-enabled dynamic scheduling with preventive maintenance using a double-layer Q-learning algorithm ⋮ Automated Reinforcement Learning (AutoRL): A Survey and Open Problems ⋮ Revisiting norm optimization for multi-objective black-box problems: a finite-time analysis ⋮ Placing approach-avoidance conflict within the framework of multi-objective reinforcement learning ⋮ Challenges of real-world reinforcement learning: definitions, benchmarks and analysis ⋮ A Gentle Introduction to Reinforcement Learning ⋮ Multi-objective dynamic programming with limited precision
