A novel policy based on action confidence limit to improve exploration efficiency in reinforcement learning

Publication:6121659

DOI: 10.1016/J.INS.2023.119011

MaRDI QID: Q6121659

Xinyang Deng, Yixin He, Wen Jiang, Fanghui Huang

Publication date: 26 March 2024

Published in: Information Sciences

zbMATH Keywords

reinforcement learning; action confidence limit; deep auto-encoder network; exploration policy; uncertainty of action

Mathematics Subject Classification ID

Nonparametric tolerance and confidence regions (62G15); Learning and adaptive systems in artificial intelligence (68T05); Source coding (94A29)
