Pages that link to "Item:Q5218653"
From MaRDI portal
The following pages link to A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play (Q5218653):
Displaying 23 items.
- A reinforcement learning approach to the stochastic cutting stock problem (Q6114929) (← links)
- Forecasting Hamiltonian dynamics without canonical coordinates (Q6117176) (← links)
- Enhancing differential-neural cryptanalysis (Q6135401) (← links)
- What will drive global economic growth in the digital age? (Q6138252) (← links)
- Pessimistic value iteration for multi-task data sharing in offline reinforcement learning (Q6152665) (← links)
- A Recursively Recurrent Neural Network (R2N2) Architecture for Learning Iterative Algorithms (Q6154930) (← links)
- Synthesizing explainable counterfactual policies for algorithmic recourse with program synthesis (Q6161200) (← links)
- Learning key steps to attack deep reinforcement learning agents (Q6161208) (← links)
- Quantum circuit compilation for nearest-neighbor architecture based on reinforcement learning (Q6171467) (← links)
- Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective (Q6182771) (← links)
- Simulation-based search (Q6198646) (← links)
- Provable Training of a ReLU Gate with an Iterative Non-Gradient Algorithm (Q6340320) (← links)
- Almost surely safe exploration and exploitation for deep reinforcement learning with state safety estimation (Q6495127) (← links)
- Working with machines in mathematics (Q6554710) (← links)
- Improving strategic decisions in sequential games by exploiting positional similarity (Q6555614) (← links)
- Deep learning of first-order nonlinear hyperbolic conservation law solvers (Q6560690) (← links)
- A Comparative Tutorial of Bayesian Sequential Design and Reinforcement Learning (Q6562787) (← links)
- A \(K\)-means supported reinforcement learning framework to multi-dimensional knapsack (Q6568952) (← links)
- Exploring the constraints on artificial general intelligence: a game-theoretic model of human vs machine interaction (Q6575513) (← links)
- Scalable imaginary time evolution with neural network quantum states (Q6598157) (← links)
- DSMC evaluation stages: fostering robust and safe behavior in deep reinforcement learning -- extended version (Q6599368) (← links)
- Routing in reinforcement learning Markov chains (Q6606710) (← links)
- Solving optimal predictor-feedback control using approximate dynamic programming (Q6632499) (← links)