Pages that link to "Item:Q5388036"
From MaRDI portal
The following pages link to Risk-Sensitive and Risk-Neutral Multiarmed Bandits (Q5388036):
Displaying 18 items.
- Four proofs of Gittins' multiarmed bandit theorem (Q333080) (← links)
- The multi-armed bandit, with constraints (Q378726) (← links)
- Optimal halting policies in Markov population decision chains with constant risk posture (Q490217) (← links)
- Stochastic scheduling in an in-forest (Q951118) (← links)
- Risk aversion in expected intertemporal discounted utilities bandit problems (Q1036105) (← links)
- An online algorithm for the risk-aware restless bandit (Q2029383) (← links)
- A revised approach for risk-averse multi-armed bandits under CVaR criterion (Q2060576) (← links)
- Constant risk aversion in stochastic contests with exponential completion times (Q3120604) (← links)
- On the reduction of total‐cost and average‐cost MDPs to discounted MDPs (Q3120606) (← links)
- Game of Thrones: Fully Distributed Learning for Multiplayer Bandits (Q4991671) (← links)
- Minimax Off-Policy Evaluation for Multi-Armed Bandits (Q5096994) (← links)
- Open Bandit Processes with Uncountable States and Time-Backward Effects (Q5299564) (← links)
- ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS (Q5358114) (← links)
- Optimal activation of halting multi‐armed bandit models (Q6057028) (← links)
- Index policy for multiarmed bandit problem with dynamic risk measures (Q6090163) (← links)
- Multi-armed bandits with censored consumption of resources (Q6097147) (← links)
- Approximate solutions to constrained risk-sensitive Markov decision processes (Q6113325) (← links)
- Markov decision processes with risk-sensitive criteria: an overview (Q6540475) (← links)