The following pages link to Shie Mannor (Q239358):
Displaying 50 items.
- Robustness and generalization (Q420915) (← links)
- A primal condition for approachability with partial monitoring (Q482547) (← links)
- Regret minimization in repeated matrix games with variable stage duration (Q926893) (← links)
- Approachability in repeated games: Computational aspects and a Stackelberg variant (Q1021600) (← links)
- Multi-agent learning for engineers (Q1028926) (← links)
- Approximately optimal bidding policies for repeated first-price auctions (Q1761811) (← links)
- Inverse reinforcement learning in contextual MDPs (Q2071371) (← links)
- Algorithmic aspects of mean-variance optimization in Markov decision processes (Q2356186) (← links)
- Online calibrated forecasts: memory efficiency versus universality for learning in games (Q2384142) (← links)
- A tutorial on the cross-entropy method (Q2485925) (← links)
- Basis function adaptation in temporal difference reinforcement learning (Q2485935) (← links)
- A contract-based model for directed network formation (Q2507673) (← links)
- Dynamics in tree formation games (Q2636765) (← links)
- A state action frequency approach to throughput maximization over uncertain wireless channels (Q2809567) (← links)
- Bayesian reinforcement learning: a survey (Q2809805) (← links)
- Learning the variance of the reward-to-go (Q2810778) (← links)
- Statistical optimization in high dimensions (Q2830768) (← links)
- Reinforcement learning in robust Markov decision processes (Q2833106) (← links)
- Robust MDPs with \(k\)-rectangular uncertainty (Q2833114) (← links)
- Regularized policy iteration with nonparametric function spaces (Q2834459) (← links)
- Online learning with sample path constraints (Q2880891) (← links)
- Robustness and regularization of support vector machines (Q2880935) (← links)
- A distributional interpretation of robust optimization (Q2884306) (← links)
- Distributionally robust Markov decision processes (Q2884318) (← links)
- (Q2934107) (← links)
- Approximate Value Iteration with Temporally Extended Actions (Q2941739) (← links)
- Distinguishing Infections on Different Graph Topologies (Q2977411) (← links)
- Outlier-Robust PCA: The High-Dimensional Case (Q2989483) (← links)
- (Q3046711) (← links)
- (Q3046715) (← links)
- (Q3093188) (← links)
- (Q3093197) (← links)
- (Q3093383) (← links)
- Percentile Optimization for Markov Decision Processes with Parameter Uncertainty (Q3100462) (← links)
- Bias and Variance Approximation in Value Function Estimates (Q3116079) (← links)
- (Q3148802) (← links)
- (Q3148820) (← links)
- Strategies for Prediction Under Imperfect Monitoring (Q3168980) (← links)
- Markov Decision Processes with Arbitrary Reward Processes (Q3169064) (← links)
- A Geometric Proof of Calibration (Q3169115) (← links)
- Oracle-Based Robust Optimization via Online Learning (Q3450465) (← links)
- An Inequality for Nearly Log-Concave Distributions With Applications to Learning (Q3548145) (← links)
- (Q4410094) (← links)
- Fully Parallel Stochastic LDPC Decoders (Q4568856) (← links)
- Majority-Based Tracking Forecast Memories for Stochastic LDPC Decoding (Q4570533) (← links)
- Relaxation Dynamics in Stochastic Iterative Decoders (Q4570672) (← links)
- Delayed Stochastic Decoding of LDPC Codes (Q4573332) (← links)
- High-Throughput Energy-Efficient LDPC Decoders Using Differential Binary Message Passing (Q4578975) (← links)
- Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback (Q4596721) (← links)
- Optimization Under Probabilistic Envelope Constraints (Q4648264) (← links)