Settling the sample complexity of model-based offline reinforcement learning
From MaRDI portal
Publication:6192326
DOI: 10.1214/23-aos2342
arXiv: 2204.05275
MaRDI QID: Q6192326
Yuting Wei, Yuejie Chi, Laixi Shi, Unnamed Author, Yuxin Chen
Publication date: 11 March 2024
Published in: The Annals of Statistics
Full work available at URL: https://arxiv.org/abs/2204.05275
Keywords: Markov decision process; minimax optimality; sample complexity; distribution shift; offline reinforcement learning
Cites Work
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
- Asymptotically efficient adaptive allocation rules
- \({\mathcal Q}\)-learning
- Error bounds for constant step-size \(Q\)-learning
- High-Dimensional Statistics
- Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction
- Bandit Algorithms
- Instance-Dependent ℓ∞-Bounds for Policy Evaluation in Tabular Reinforcement Learning
- Performance Bounds in $L_p$‐norm for Approximate Value Iteration
- A Stochastic Approximation Method