Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial time (Q6106432)

scientific article; zbMATH DE number 7702687

Language	Label	Description	Also known as
English	Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial time	scientific article; zbMATH DE number 7702687

Statements

instance of

scholarly article

0 references

title

Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial time (English)

0 references

0 references

0 references

0 references

27 June 2023

0 references

full work available at URL

https://arxiv.org/abs/2111.07395

0 references

zbMATH Keywords

safe artificial intelligence

0 references

safe exploration

0 references

model-based reinforcement learning

0 references

constrained Markov decision processes

0 references

robust Markov decision processes

0 references

MaRDI profile type

Publication

0 references

cites work

Constrained Markov decision processes with total cost criteria: Lagrangian approach and dual linear program

0 references

Q4264741

0 references

10.1162/153244303765208377

0 references

Q4955315

0 references

Probability Inequalities for Sums of Bounded Random Variables

0 references

Robust Dynamic Programming

0 references

Near-optimal regret bounds for reinforcement learning

0 references

A new polynomial-time algorithm for linear programming

0 references

Near-optimal reinforcement learning in polynomial time

0 references

Q3050157

0 references

Robust Control of Markov Decision Processes with Uncertain Transition Matrices

0 references

Interior-point methods

0 references

\({\mathcal Q}\)-learning

0 references

Robust Markov Decision Processes

0 references

A block coordinate descent method for regularized multiconvex optimization with applications to nonnegative tensor factorization and completion

0 references

Identifiers

arXiv ID

2111.07395

0 references

Mathematics Subject Classification ID

0 references

0 references

10.1007/S10994-022-06201-Z

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6106432