Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm - MaRDI portal

Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130)

From MaRDI portal

Jump to:navigation, search

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use this page instead for the normal view: Preference-based reinforcement learning: a formal framework and a policy iteration algorithm

scientific article; zbMATH DE number 6149404

Language	Label	Description	Also known as
English	Preference-based reinforcement learning: a formal framework and a policy iteration algorithm	scientific article; zbMATH DE number 6149404

Statements

scholarly article

0 references

Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (English)

0 references

Johannes Fürnkranz

0 references

Eyke Hüllermeier

0 references

0 references

Sang-Hyeun Park

0 references

Machine Learning

0 references

publication date

2 April 2013

0 references

zbMATH Keywords

reinforcement learning

0 references

preference learning

0 references

describes a project that uses

0 references

MaRDI profile type

0 references

full work available at URL

https://doi.org/10.1007/s10994-012-5313-8

0 references

0 references

Finite-time analysis of the multiarmed bandit problem

0 references

Learning to play chess using temporal differences

0 references

Temporal difference learning applied to game playing and the results of application to Shogi

0 references

Natural actor-critic algorithms

0 references

Modeling agents as qualitative decision makers

0 references

Elevator group control using multiple reinforcement learning agents

0 references

0 references

Rollout sampling approximate policy iteration

0 references

Integrating guidance into relational reinforcement learning

0 references

Qualitative decision theory with preference relations and comparative uncertainty: an axiomatic approach

0 references

Relational reinforcement learning

0 references

0 references

Qualitative decision under uncertainty: back to expected utility

0 references

0 references

Preference Learning

0 references

Label ranking by learning pairwise preferences

0 references

A Survey and Empirical Comparison of Object Ranking Methods

0 references

Policy search for motor primitives in robotics

0 references

OnActor-Critic Algorithms

0 references

0 references

Stochastic Orderings for Markov Processes on Partially Ordered Spaces

0 references

Efficient prediction algorithms for binary decomposition techniques

0 references

0 references

Transfer learning for reinforcement learning domains: a survey

0 references

Practical issues in temporal difference learning

0 references

Programming backgammon using self-teaching neural nets

0 references

A generalized path integral control approach to reinforcement learning

0 references

Label Ranking Algorithms: A Survey

0 references

\({\mathcal Q}\)-learning

0 references

Simple statistical gradient-following algorithms for connectionist reinforcement learning

0 references

Identifiers

zbMATH Open document ID

0 references

Mathematics Subject Classification ID

0 references

zbMATH DE Number

0 references

0 references

0 references

10.1007/S10994-012-5313-8

0 references

DBLP publication ID

journals/ml/FurnkranzHCP12

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1945130

Retrieved from "https://mardi.schubotz.org/w/index.php?title=Item:Q1945130&oldid=43436746"