Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (Q1945130)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Preference-based reinforcement learning: a formal framework and a policy iteration algorithm |
scientific article; zbMATH DE number 6149404
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Preference-based reinforcement learning: a formal framework and a policy iteration algorithm |
scientific article; zbMATH DE number 6149404 |
Statements
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm (English)
0 references
2 April 2013
0 references
reinforcement learning
0 references
preference learning
0 references
0 references
0 references