Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Learning One Representation to Optimize All Rewards - MaRDI portal

Learning One Representation to Optimize All Rewards (Q6362810)

From MaRDI portal

Jump to:navigation, search

This is the item page for this Wikibase entity, intended for internal use and editing purposes.

Please use this page instead for the normal view: Learning One Representation to Optimize All Rewards

preprint article from arXiv

Language	Label	Description	Also known as
English	Learning One Representation to Optimize All Rewards	preprint article from arXiv

Statements

scholarly article

0 references

publication date

14 March 2021

0 references

arXiv classification

cs.LG

0 references

cs.AI

0 references

math.OC

0 references

author name string

Ahmed Touati

0 references

Yann Ollivier

0 references

MaRDI profile type

0 references

has companion code repository

https://github.com/ahmed-touati/controllable_agent

1 reference

PapersWithCode reference URL

https://paperswithcode.com/paper/learning-one-representation-to-optimize-all

publication

Identifiers

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6362810

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q6362810&oldid=40689762"