Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Constrained Online Two-stage Stochastic Optimization: New Algorithms via Adversarial Learning

Feb 02, 2023

Jiashuo Jiang

Figure 1 for Constrained Online Two-stage Stochastic Optimization: New Algorithms via Adversarial Learning

Figure 2 for Constrained Online Two-stage Stochastic Optimization: New Algorithms via Adversarial Learning

Share this with someone who'll enjoy it:

Abstract:We consider an online two-stage stochastic optimization with long-term constraints over a finite horizon of $T$ periods. At each period, we take the first-stage action, observe a model parameter realization and then take the second-stage action from a feasible set that depends both on the first-stage decision and the model parameter. We aim to minimize the cumulative objective value while guaranteeing that the long-term average second-stage decision belongs to a set. We propose a general algorithmic framework that derives online algorithms for the online two-stage problem from adversarial learning algorithms. Also, the regret bound of our algorithm cam be reduced to the regret bound of embedded adversarial learning algorithms. Based on our framework, we obtain new results under various settings. When the model parameter at each period is drawn from identical distributions, we derive state-of-art regret bound that improves previous bounds under special cases. Our algorithm is also robust to adversarial corruptions of model parameter realizations. When the model parameters are drawn from unknown non-stationary distributions and we are given prior estimates of the distributions, we develop a new algorithm from our framework with a regret $O(W_T+\sqrt{T})$, where $W_T$ measures the total inaccuracy of the prior estimates.

View paper on

Share this with someone who'll enjoy it:

Title:Constrained Online Two-stage Stochastic Optimization: New Algorithms via Adversarial Learning

Paper and Code