Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jean Pauphilet

Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization

Jan 06, 2025

Ryan Cory-Wright, Jean Pauphilet

Figure 1 for Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization

Figure 2 for Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization

Figure 3 for Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization

Figure 4 for Improved Approximation Algorithms for Low-Rank Problems Using Semidefinite Optimization

Abstract:Inspired by the impact of the Goemans-Williamson algorithm on combinatorial optimization, we construct an analogous relax-then-sample strategy for low-rank optimization problems. First, for orthogonally constrained quadratic optimization problems, we derive a semidefinite relaxation and a randomized rounding scheme, which obtains provably near-optimal solutions, mimicking the blueprint from Goemans and Williamson for the Max-Cut problem. We then extend our approach to generic low-rank optimization problems by developing new semidefinite relaxations that are both tighter and more broadly applicable than those in prior works. Although our original proposal introduces large semidefinite matrices as decision variables, we show that most of the blocks in these matrices can be safely omitted without altering the optimal value, hence improving the scalability of our approach. Using several examples (including matrix completion, basis pursuit, and reduced-rank regression), we show how to reduce the size of our relaxation even further. Finally, we numerically illustrate the effectiveness and scalability of our relaxation and our sampling scheme on orthogonally constrained quadratic optimization and matrix completion problems.

* 30 pages, 5 figures, plus references and appendices

Via

Access Paper or Ask Questions

Adaptive Optimization for Prediction with Missing Data

Feb 02, 2024

Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

Abstract:When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions. In this paper, we view prediction with missing data as a two-stage adaptive optimization problem and propose a new class of models, adaptive linear regression models, where the regression coefficients adapt to the set of observed features. We show that some adaptive linear regression models are equivalent to learning an imputation rule and a downstream linear regression model simultaneously instead of sequentially. We leverage this joint-impute-then-regress interpretation to generalize our framework to non-linear models. In settings where data is strongly not missing at random, our methods achieve a 2-10% improvement in out-of-sample accuracy.

* arXiv admin note: text overlap with arXiv:2104.03158

Via

Access Paper or Ask Questions

Patient Outcome Predictions Improve Operations at a Large Hospital Network

May 25, 2023

Liangyuan Na, Kimberly Villalobos Carballo, Jean Pauphilet, Ali Haddad-Sisakht, Daniel Kombert, Melissa Boisjoli-Langlois, Andrew Castiglione, Maram Khalifa, Pooja Hebbal, Barry Stein(+1 more)

Figure 1 for Patient Outcome Predictions Improve Operations at a Large Hospital Network

Figure 2 for Patient Outcome Predictions Improve Operations at a Large Hospital Network

Figure 3 for Patient Outcome Predictions Improve Operations at a Large Hospital Network

Figure 4 for Patient Outcome Predictions Improve Operations at a Large Hospital Network

Abstract:Problem definition: Access to accurate predictions of patients' outcomes can enhance medical staff's decision-making, which ultimately benefits all stakeholders in the hospitals. A large hospital network in the US has been collaborating with academics and consultants to predict short-term and long-term outcomes for all inpatients across their seven hospitals. Methodology/results: We develop machine learning models that predict the probabilities of next 24-hr/48-hr discharge and intensive care unit transfers, end-of-stay mortality and discharge dispositions. All models achieve high out-of-sample AUC (75.7%-92.5%) and are well calibrated. In addition, combining 48-hr discharge predictions with doctors' predictions simultaneously enables more patient discharges (10%-28.7%) and fewer 7-day/30-day readmissions ($p$-value $<0.001$). We implement an automated pipeline that extracts data and updates predictions every morning, as well as user-friendly software and a color-coded alert system to communicate these patient-level predictions (alongside explanations) to clinical teams. Managerial implications: Since we have been gradually deploying the tool, and training medical staff, over 200 doctors, nurses, and case managers across seven hospitals use it in their daily patient review process. We observe a significant reduction in the average length of stay (0.67 days per patient) following its adoption and anticipate substantial financial benefits (between \$55 and \$72 million annually) for the healthcare system.

* 41 pages, 13 figures

Via

Access Paper or Ask Questions

Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

May 20, 2023

Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet

Figure 1 for Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

Figure 2 for Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

Figure 3 for Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

Figure 4 for Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

Abstract:Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible, and has numerous applications such as product recommendation. Unfortunately, existing methods for solving low-rank matrix completion are heuristics that, while highly scalable and often identifying high-quality solutions, do not possess any optimality guarantees. We reexamine matrix completion with an optimality-oriented eye, by reformulating low-rank problems as convex problems over the non-convex set of projection matrices and implementing a disjunctive branch-and-bound scheme that solves them to certifiable optimality. Further, we derive a novel and often tight class of convex relaxations by decomposing a low-rank matrix as a sum of rank-one matrices and incentivizing, via a Shor relaxation, that each two-by-two minor in each rank-one matrix has determinant zero. In numerical experiments, our new convex relaxations decrease the optimality gap by two orders of magnitude compared to existing attempts. Moreover, we showcase the performance of our disjunctive branch-and-bound scheme and demonstrate that it solves matrix completion problems over 150x150 matrices to certifiable optimality in hours, constituting an order of magnitude improvement on the state-of-the-art for certifiably optimal methods.

* Also posted to Optimization Online on 15 May; code freely available on GitHub; feedback always appreciated

Via

Access Paper or Ask Questions

Sparse PCA With Multiple Components

Sep 29, 2022

Ryan Cory-Wright, Jean Pauphilet

Figure 1 for Sparse PCA With Multiple Components

Figure 2 for Sparse PCA With Multiple Components

Figure 3 for Sparse PCA With Multiple Components

Figure 4 for Sparse PCA With Multiple Components

Abstract:Sparse Principal Component Analysis is a cardinal technique for obtaining combinations of features, or principal components (PCs), that explain the variance of high-dimensional datasets in an interpretable manner. At its heart, this involves solving a sparsity and orthogonality constrained convex maximization problem, which is extremely computationally challenging. Most existing work address sparse PCA via heuristics such as iteratively computing one sparse PC and deflating the covariance matrix, which does not guarantee the orthogonality, let alone the optimality, of the resulting solution. We challenge this status by reformulating the orthogonality conditions as rank constraints and optimizing over the sparsity and rank constraints simultaneously. We design tight semidefinite relaxations and propose tractable second-order cone versions of these relaxations which supply high-quality upper bounds. We also design valid second-order cone inequalities which hold when each PC's individual sparsity is specified, and demonstrate that these inequalities tighten our relaxations significantly. Moreover, we propose exact methods and rounding mechanisms that exploit these relaxations' tightness to obtain solutions with a bound gap on the order of 1%-5% for real-world datasets with p = 100s or 1000s of features and r \in {2, 3} components. We investigate the performance of our methods in spiked covariance settings and demonstrate that simultaneously considering the orthogonality and sparsity constraints leads to improvements in the Area Under the ROC curve of 2%-8% compared to state-of-the-art deflation methods. All in all, our approach solves sparse PCA problems with multiple components to certifiable (near) optimality in a practically tractable fashion.

* Submitted to Operations Research; comments or suggestions welcome

Via

Access Paper or Ask Questions

The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations

Sep 07, 2022

Julien Grand-Clément, Jean Pauphilet

Figure 1 for The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations

Figure 2 for The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations

Figure 3 for The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations

Figure 4 for The Best Decisions Are Not the Best Advice: Making Adherence-Aware Recommendations

Abstract:Many high-stake decisions follow an expert-in-loop structure in that a human operator receives recommendations from an algorithm but is the ultimate decision maker. Hence, the algorithm's recommendation may differ from the actual decision implemented in practice. However, most algorithmic recommendations are obtained by solving an optimization problem that assumes recommendations will be perfectly implemented. We propose an adherence-aware optimization framework to capture the dichotomy between the recommended and the implemented policy and analyze the impact of partial adherence on the optimal recommendation. We show that overlooking the partial adherence phenomenon, as is currently being done by most recommendation engines, can lead to arbitrarily severe performance deterioration, compared with both the current human baseline performance and what is expected by the recommendation algorithm. Our framework also provides useful tools to analyze the structure and to compute optimal recommendation policies that are naturally immune against such human deviations, and are guaranteed to improve upon the baseline policy.

Via

Access Paper or Ask Questions

Robust and Heterogenous Odds Ratio: Estimating Price Sensitivity for Unbought Items

Jun 21, 2021

Jean Pauphilet

Figure 1 for Robust and Heterogenous Odds Ratio: Estimating Price Sensitivity for Unbought Items

Figure 2 for Robust and Heterogenous Odds Ratio: Estimating Price Sensitivity for Unbought Items

Figure 3 for Robust and Heterogenous Odds Ratio: Estimating Price Sensitivity for Unbought Items

Figure 4 for Robust and Heterogenous Odds Ratio: Estimating Price Sensitivity for Unbought Items

Abstract:Problem definition: Mining for heterogeneous responses to an intervention is a crucial step for data-driven operations, for instance to personalize treatment or pricing. We investigate how to estimate price sensitivity from transaction-level data. In causal inference terms, we estimate heterogeneous treatment effects when (a) the response to treatment (here, whether a customer buys a product) is binary, and (b) treatment assignments are partially observed (here, full information is only available for purchased items). Methodology/Results: We propose a recursive partitioning procedure to estimate heterogeneous odds ratio, a widely used measure of treatment effect in medicine and social sciences. We integrate an adversarial imputation step to allow for robust inference even in presence of partially observed treatment assignments. We validate our methodology on synthetic data and apply it to three case studies from political science, medicine, and revenue management. Managerial Implications: Our robust heterogeneous odds ratio estimation method is a simple and intuitive tool to quantify heterogeneity in patients or customers and personalize interventions, while lifting a central limitation in many revenue management data.

Via

Access Paper or Ask Questions

A new perspective on low-rank optimization

May 12, 2021

Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

Figure 1 for A new perspective on low-rank optimization

Figure 2 for A new perspective on low-rank optimization

Figure 3 for A new perspective on low-rank optimization

Figure 4 for A new perspective on low-rank optimization

Abstract:A key question in many low-rank problems throughout optimization, machine learning, and statistics is to characterize the convex hulls of simple low-rank sets and judiciously apply these convex hulls to obtain strong yet computationally tractable convex relaxations. We invoke the matrix perspective function - the matrix analog of the perspective function-and characterize explicitly the convex hull of epigraphs of convex quadratic, matrix exponential, and matrix power functions under low-rank constraints. Further, we exploit these characterizations to develop strong relaxations for a variety of low-rank problems including reduced rank regression, non-negative matrix factorization, and factor analysis. We establish that these relaxations can be modeled via semidefinite and matrix power cone constraints, and thus optimized over tractably. The proposed approach parallels and generalizes the perspective reformulation technique in mixed-integer optimization, and leads to new relaxations for a broad class of problems.

* Submitted to Mathematical Programming

Via

Access Paper or Ask Questions

Prediction with Missing Data

Apr 07, 2021

Dimitris Bertsimas, Arthur Delarue, Jean Pauphilet

Figure 1 for Prediction with Missing Data

Figure 2 for Prediction with Missing Data

Figure 3 for Prediction with Missing Data

Figure 4 for Prediction with Missing Data

Abstract:Missing information is inevitable in real-world data sets. While imputation is well-suited and theoretically sound for statistical inference, its relevance and practical implementation for out-of-sample prediction remains unsettled. We provide a theoretical analysis of widely used data imputation methods and highlight their key deficiencies in making accurate predictions. Alternatively, we propose adaptive linear regression, a new class of models that can be directly trained and evaluated on partially observed data, adapting to the set of available features. In particular, we show that certain adaptive regression models are equivalent to impute-then-regress methods where the imputation and the regression models are learned simultaneously instead of sequentially. We validate our theoretical findings and adaptive regression approach with numerical results with real-world data sets.

Via

Access Paper or Ask Questions

Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

Sep 22, 2020

Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

Figure 1 for Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

Figure 2 for Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

Figure 3 for Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

Figure 4 for Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

Abstract:We propose a framework for modeling and solving low-rank optimization problems to certifiable optimality. We introduce symmetric projection matrices that satisfy $Y^2=Y$, the matrix analog of binary variables that satisfy $z^2=z$, to model rank constraints. By leveraging regularization and strong duality, we prove that this modeling paradigm yields tractable convex optimization problems over the non-convex set of orthogonal projection matrices. Furthermore, we design outer-approximation algorithms to solve low-rank problems to certifiable optimality, compute lower bounds via their semidefinite relaxations, and provide near optimal solutions through rounding and local search techniques. We implement these numerical ingredients and, for the first time, solve low-rank optimization problems to certifiable optimality. Our algorithms also supply certifiably near-optimal solutions for larger problem sizes and outperform existing heuristics, by deriving an alternative to the popular nuclear norm relaxation which generalizes the perspective relaxation from vectors to matrices. All in all, our framework, which we name Mixed-Projection Conic Optimization, solves low-rank problems to certifiable optimality in a tractable and unified fashion.

* submitted to Operations Research

Via

Access Paper or Ask Questions