Finite-time analysis of natural actor-critic for POMDPs (Q6633040)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: Finite-time analysis of natural actor-critic for POMDPs |
scientific article; zbMATH DE number 7938968
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Finite-time analysis of natural actor-critic for POMDPs |
scientific article; zbMATH DE number 7938968 |
Statements
Finite-time analysis of natural actor-critic for POMDPs (English)
0 references
5 November 2024
0 references
reinforcement learning
0 references
partially observable Markov decision processes
0 references
natural policy gradient
0 references
actor-critic method
0 references
filter stability
0 references