AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity (Q6310634)
From MaRDI portal
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity | preprint article from arXiv | |
Statements
3 December 2018
math.OC
cs.LG
Yibo Zeng
Fei Feng
Wotao Yin