Pages that link to "Item:Q1818283"
From MaRDI portal
The following pages link to Regret in the on-line decision problem (Q1818283):
Displaying 50 items.
- A general internal regret-free strategy (Q291209) (← links)
- Robust mean field games (Q338211) (← links)
- Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning (Q413845) (← links)
- Opinion dynamics and learning in social networks (Q545653) (← links)
- Load balancing without regret in the bulletin board model (Q661046) (← links)
- Approachability in population games (Q828023) (← links)
- Replicator dynamics: old and new (Q828036) (← links)
- Regret minimization in repeated matrix games with variable stage duration (Q926893) (← links)
- Exponential weight algorithm in continuous time (Q959954) (← links)
- Learning by trial and error (Q1007782) (← links)
- If multi-agent learning is the answer, what is the question? (Q1028919) (← links)
- Agendas for multi-agent learning (Q1028923) (← links)
- The possible and the impossible in multi-agent learning (Q1028928) (← links)
- A hierarchy of prescriptive goals for multiagent learning (Q1028932) (← links)
- Predicting a binary sequence almost as well as the optimal biased coin (Q1398365) (← links)
- Learning, hypothesis testing, and Nash equilibrium. (Q1413211) (← links)
- Randomized prediction of individual sequences (Q1733293) (← links)
- Strategic learning in games with symmetric information. (Q1811548) (← links)
- A wide range no-regret theorem (Q1811553) (← links)
- Adaptive game playing using multiplicative weights (Q1818286) (← links)
- Conditional universal consistency. (Q1818287) (← links)
- Minimizing regret: The general case (Q1818295) (← links)
- Price probabilities: a class of Bayesian and non-Bayesian prediction rules (Q2059056) (← links)
- Constrained no-regret learning (Q2178579) (← links)
- Stable games and their dynamics (Q2271376) (← links)
- Computer science and decision theory (Q2271874) (← links)
- When autonomous agents model other agents: an appeal for altered judgment coupled with mouths, ears, and a little more tape (Q2302289) (← links)
- Learning correlated equilibria in games with compact sets of strategies (Q2371157) (← links)
- Online calibrated forecasts: memory efficiency versus universality for learning in games (Q2384142) (← links)
- A general criterion and an algorithmic framework for learning in multi-agent systems (Q2384145) (← links)
- Approachability with bounded memory (Q2389319) (← links)
- Dynamic benchmark targeting (Q2397633) (← links)
- Approachability, regret and calibration: implications and equivalences (Q2438352) (← links)
- Deterministic calibration and Nash equilibrium (Q2462508) (← links)
- Maximin effects in inhomogeneous large-scale data (Q2515497) (← links)
- Minimizing regret in dynamic decision problems (Q2629329) (← links)
- Online discrete optimization in social networks in the presence of Knightian uncertainty (Q2830750) (← links)
- Achieving Unbounded Resolution in<i>Finite</i>Player Goore Games Using Stochastic Automata, and Its Applications (Q2888572) (← links)
- The multiplicative weights update method: a meta-algorithm and applications (Q2913806) (← links)
- Simple regret optimization in online planning for Markov decision processes (Q2921080) (← links)
- Decision Making Approach with Focus Point and Regret (Q3307374) (← links)
- Rationality Authority for Provable Rational Behavior (Q3464466) (← links)
- Calibration and Internal No-Regret with Random Signals (Q3648743) (← links)
- A Robust Saturated Strategy for $n$-Player Prisoner's Dilemma (Q4685372) (← links)
- (Q4969141) (← links)
- (Q4998975) (← links)
- Optimal Exploration–Exploitation in a Multi-armed Bandit Problem with Non-stationary Rewards (Q5113912) (← links)
- Repeated Games with Incomplete Information (Q5149732) (← links)
- Internal regret in on-line portfolio selection (Q5916205) (← links)
- Internal regret in on-line portfolio selection (Q5921688) (← links)