scientific article; zbMATH DE number 7626783
Publication:5053301
Antonin Raffin, Anssi Kanervisto, Noah Dormann, Maximilian Ernestus, Adam Gleave, Ashley Hill
Publication date: 6 December 2022
Full work available at URL: https://jmlr.csail.mit.edu/papers/v22/20-1364.html
Title: Stable-Baselines3: Reliable Reinforcement Learning Implementations
Related Items (9)
A deep reinforcement learning framework for dynamic optimization of numerical schemes for compressible flow simulations
Robust optimal well control using an adaptive multigrid reinforcement learning framework
Dynamic shielding for reinforcement learning in black-box environments
Robust moving target defense against unknown attacks: a meta-reinforcement learning approach
Proximal policy optimization‐based controller for chaotic systems
Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge
Control of chaos with time-delayed feedback based on deep reinforcement learning
Distributed web hacking by adaptive consensus-based reinforcement learning
Stable Baselines3