Eduardo Ochoa Rivera

Near Optimal Pure Exploration in Logistic Bandits

Oct 28, 2024

Eduardo Ochoa Rivera, Ambuj Tewari

Abstract: Bandit algorithms have garnered significant attention due to their practical applications in real-world scenarios. However, beyond simple settings such as multi-armed or linear bandits, optimal algorithms remain scarce. Notably, no optimal solution exists for pure exploration problems in the context of generalized linear model (GLM) bandits. In this paper, we narrow this gap and develop the first track-and-stop algorithm for general pure exploration problems in logistic bandits, called logistic track-and-stop (Log-TS). Log-TS is an efficient algorithm that asymptotically matches an approximation of the instance-specific lower bound on the expected sample complexity up to a logarithmic factor.

* 25 pages, 2 figures

Conformalized Late Fusion Multi-View Learning

May 25, 2024

Eduardo Ochoa Rivera, Yash Patel, Ambuj Tewari

Abstract: Uncertainty quantification for multi-view learning is motivated by the increasing use of multi-view data in scientific problems. A common variant of multi-view learning is late fusion: train separate predictors on individual views and combine them after single-view predictions are available. Existing methods for uncertainty quantification for late fusion often rely on undesirable distributional assumptions for validity. Conformal prediction is one approach that avoids such distributional assumptions. However, naively applying conformal prediction to late-stage fusion pipelines often produces overly conservative and uninformative prediction regions, limiting their downstream utility. We propose a novel methodology, Multi-View Conformal Prediction (MVCP), where conformal prediction is instead performed separately on the single-view predictors and only fused subsequently. Our framework extends the standard scalar formulation of a score function to a multivariate score that produces more efficient downstream prediction regions in both classification and regression settings. We then demonstrate that such improvements can be realized in methods built atop conformalized regressors, specifically in robust predict-then-optimize pipelines.
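
For readers unfamiliar with the building block this abstract refers to, the following is a minimal sketch of standard split conformal prediction for a single regressor with a scalar absolute-residual score. It is a generic illustration and not the MVCP method or its multivariate score; the function names `conformal_quantile` and `split_conformal_interval`, and the `predict` callable, are hypothetical and chosen only for this example.

```python
import math
import numpy as np

def conformal_quantile(scores, alpha):
    """Finite-sample-corrected (1 - alpha) quantile of calibration scores."""
    n = len(scores)
    # Rank of the order statistic used by split conformal prediction.
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: the region is unbounded.
        return float("inf")
    return float(np.sort(scores)[k - 1])

def split_conformal_interval(predict, X_cal, y_cal, X_new, alpha=0.1):
    """Return (lower, upper) prediction bounds for each row of X_new.

    `predict` is any already-fitted point predictor (hypothetical here);
    the score is the absolute residual on a held-out calibration set.
    """
    scores = np.abs(y_cal - predict(X_cal))
    q = conformal_quantile(scores, alpha)
    preds = predict(X_new)
    return preds - q, preds + q
```

Under exchangeability of calibration and test points, intervals built this way cover the true response with probability at least 1 - alpha, without distributional assumptions on the data.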

Optimal Thresholding Linear Bandit

Feb 11, 2024

Eduardo Ochoa Rivera, Ambuj Tewari

Abstract: We study a novel pure exploration problem: the $\epsilon$-Thresholding Bandit Problem (TBP) with fixed confidence in stochastic linear bandits. We prove a lower bound on the sample complexity and extend to TBP an algorithm designed for Best Arm Identification in the linear case; the resulting algorithm is asymptotically optimal.

* arXiv admin note: substantial text overlap with arXiv:2006.16073 by other authors
