Mathematical Research Data Initiative
Main page
Recent changes
Random page
Help about MediaWiki
Create a new Item
Create a new Property
Create a new EntitySchema
Merge two items
In other projects
Discussion
View source
View history
Purge
English
Log in

Min Max Generalization for Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

From MaRDI portal
Publication:2873838
Jump to:navigation, search

DOI10.1137/120867263zbMath1284.49012OpenAlexW2057088773MaRDI QIDQ2873838

Raphael Fonteneau, Quentin Louveaux, Damien Ernst, Bernard Boigelot

Publication date: 27 January 2014

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Full work available at URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.711.5580


zbMATH Keywords

computational complexitynonconvex optimizationrelaxation schemesbatch mode reinforcement learningmin--max problem


Mathematics Subject Classification ID

Minimax problems in mathematical programming (90C47) Abstract computational complexity for mathematical programming problems (90C60) Nonconvex programming, global optimization (90C26) Learning and adaptive systems in artificial intelligence (68T05) Existence of solutions for minimax problems (49J35) Methods involving semicontinuity and convergence; relaxation (49J45)



Uses Software

  • SeDuMi



This page was built for publication: Min Max Generalization for Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2873838&oldid=15818431"
Tools
What links here
Related changes
Special pages
Printable version
Permanent link
Page information
MaRDI portal item
This page was last edited on 3 February 2024, at 19:28.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki