Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning (Q6384455)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning |
preprint article from arXiv
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning |
preprint article from arXiv |
Statements
30 November 2021
0 references
cs.LG
0 references
math.OC
0 references
Yixuan Liu
0 references
Chrysafis Vogiatzis
0 references
Ruriko Yoshida
0 references
Erich Morman
0 references