Picture for Martin J. Wainwright

Martin J. Wainwright

Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning

Add code
Sep 22, 2024
Figure 1 for Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning
Figure 2 for Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning
Figure 3 for Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning
Figure 4 for Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning
Viaarxiv icon

Finite-Sample Guarantees for Best-Response Learning Dynamics in Zero-Sum Matrix Games

Add code
Jul 29, 2024
Viaarxiv icon

Entrywise Inference for Causal Panel Data: A Simple and Instance-Optimal Approach

Add code
Jan 24, 2024
Viaarxiv icon

Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces

Add code
Jan 10, 2024
Viaarxiv icon

Doubly High-Dimensional Contextual Bandits: An Interpretable Model for Joint Assortment-Pricing

Add code
Sep 14, 2023
Viaarxiv icon

Semi-parametric inference based on adaptively collected data

Add code
Mar 05, 2023
Viaarxiv icon

Kernel-based off-policy estimation without overlap: Instance optimality beyond semiparametric efficiency

Add code
Jan 16, 2023
Figure 1 for Kernel-based off-policy estimation without overlap: Instance optimality beyond semiparametric efficiency
Figure 2 for Kernel-based off-policy estimation without overlap: Instance optimality beyond semiparametric efficiency
Figure 3 for Kernel-based off-policy estimation without overlap: Instance optimality beyond semiparametric efficiency
Viaarxiv icon

Policy evaluation from a single path: Multi-step methods, mixing and mis-specification

Add code
Nov 07, 2022
Viaarxiv icon

Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces

Add code
Oct 20, 2022
Figure 1 for Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces
Figure 2 for Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces
Figure 3 for Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces
Figure 4 for Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces
Viaarxiv icon

QuTE: decentralized multiple testing on sensor networks with false discovery rate control

Add code
Oct 09, 2022
Figure 1 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Figure 2 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Figure 3 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Figure 4 for QuTE: decentralized multiple testing on sensor networks with false discovery rate control
Viaarxiv icon