Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Corinna Cortes

Cardinality-Aware Set Prediction and Top-$k$ Classification

Jul 09, 2024

Corinna Cortes, Anqi Mao, Christopher Mohri, Mehryar Mohri, Yutao Zhong

Abstract:We present a detailed study of cardinality-aware top-$k$ classification, a novel approach that aims to learn an accurate top-$k$ set predictor while maintaining a low cardinality. We introduce a new target loss function tailored to this setting that accounts for both the classification error and the cardinality of the set predicted. To optimize this loss function, we propose two families of surrogate losses: cost-sensitive comp-sum losses and cost-sensitive constrained losses. Minimizing these loss functions leads to new cardinality-aware algorithms that we describe in detail in the case of both top-$k$ and threshold-based classifiers. We establish $H$-consistency bounds for our cardinality-aware surrogate loss functions, thereby providing a strong theoretical foundation for our algorithms. We report the results of extensive experiments on CIFAR-10, CIFAR-100, ImageNet, and SVHN datasets demonstrating the effectiveness and benefits of our cardinality-aware algorithms.

* arXiv admin note: substantial text overlap with arXiv:2403.19625

Via

Access Paper or Ask Questions

Differentially Private Domain Adaptation with Theoretical Guarantees

Jun 15, 2023

Raef Bassily, Corinna Cortes, Anqi Mao, Mehryar Mohri

Figure 1 for Differentially Private Domain Adaptation with Theoretical Guarantees

Figure 2 for Differentially Private Domain Adaptation with Theoretical Guarantees

Figure 3 for Differentially Private Domain Adaptation with Theoretical Guarantees

Figure 4 for Differentially Private Domain Adaptation with Theoretical Guarantees

Abstract:In many applications, the labeled data at the learner's disposal is subject to privacy constraints and is relatively limited. To derive a more accurate predictor for the target domain, it is often beneficial to leverage publicly available labeled data from an alternative domain, somewhat close to the target domain. This is the modern problem of supervised domain adaptation from a public source to a private target domain. We present two $(\epsilon, \delta)$-differentially private adaptation algorithms for supervised adaptation, for which we make use of a general optimization problem, recently shown to benefit from favorable theoretical learning guarantees. Our first algorithm is designed for regression with linear predictors and shown to solve a convex optimization problem. Our second algorithm is a more general solution for loss functions that may be non-convex but Lipschitz and smooth. While our main objective is a theoretical analysis, we also report the results of several experiments first demonstrating that the non-private versions of our algorithms outperform adaptation baselines and next showing that, for larger values of the target sample size or $\epsilon$, the performance of our private algorithms remains close to that of the non-private formulation.

Via

Access Paper or Ask Questions

Best-Effort Adaptation

May 10, 2023

Pranjal Awasthi, Corinna Cortes, Mehryar Mohri

Abstract:We study a problem of best-effort adaptation motivated by several applications and considerations, which consists of determining an accurate predictor for a target domain, for which a moderate amount of labeled samples are available, while leveraging information from another domain for which substantially more labeled samples are at one's disposal. We present a new and general discrepancy-based theoretical analysis of sample reweighting methods, including bounds holding uniformly over the weights. We show how these bounds can guide the design of learning algorithms that we discuss in detail. We further show that our learning guarantees and algorithms provide improved solutions for standard domain adaptation problems, for which few labeled data or none are available from the target domain. We finally report the results of a series of experiments demonstrating the effectiveness of our best-effort adaptation and domain adaptation algorithms, as well as comparisons with several baselines. We also discuss how our analysis can benefit the design of principled solutions for fine-tuning.

Via

Access Paper or Ask Questions

Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment

Sep 20, 2021

Corinna Cortes, Neil D. Lawrence

Figure 1 for Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment

Figure 2 for Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment

Figure 3 for Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment

Figure 4 for Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment

Abstract:In this paper we revisit the 2014 NeurIPS experiment that examined inconsistency in conference peer review. We determine that 50\% of the variation in reviewer quality scores was subjective in origin. Further, with seven years passing since the experiment we find that for \emph{accepted} papers, there is no correlation between quality scores and impact of the paper as measured as a function of citation count. We trace the fate of rejected papers, recovering where these papers were eventually published. For these papers we find a correlation between quality scores and impact. We conclude that the reviewing process for the 2014 conference was good for identifying poor papers, but poor for identifying good papers. We give some suggestions for improving the reviewing process but also warn against removing the subjective element. Finally, we suggest that the real conclusion of the experiment is that the community should place less onus on the notion of `top-tier conference publications' when assessing the quality of individual researchers. For NeurIPS 2021, the PCs are repeating the experiment, as well as conducting new ones.

* Source code available at https://github.com/lawrennd/neurips2014/

Via

Access Paper or Ask Questions

Multiple-Source Adaptation with Domain Classifiers

Aug 25, 2020

Corinna Cortes, Mehryar Mohri, Ananda Theertha Suresh, Ningshan Zhang

Figure 1 for Multiple-Source Adaptation with Domain Classifiers

Figure 2 for Multiple-Source Adaptation with Domain Classifiers

Figure 3 for Multiple-Source Adaptation with Domain Classifiers

Abstract:We consider the multiple-source adaptation (MSA) problem and improve a previously proposed MSA solution, where accurate density estimation per domain is required to obtain favorable learning guarantees. In this work, we replace the difficult task of density estimation per domain with a much easier task of domain classification, and show that the two solutions are equivalent given the true densities and domain classifier, yet the newer approach benefits from more favorable guarantees when densities and domain classifier are estimated from finite samples. Our experiments with real-world applications demonstrate that the new discriminative MSA solution outperforms the previous solution with density estimation, as well as other domain adaptation baselines.

Via

Access Paper or Ask Questions

Beyond Individual and Group Fairness

Aug 21, 2020

Pranjal Awasthi, Corinna Cortes, Yishay Mansour, Mehryar Mohri

Figure 1 for Beyond Individual and Group Fairness

Figure 2 for Beyond Individual and Group Fairness

Figure 3 for Beyond Individual and Group Fairness

Figure 4 for Beyond Individual and Group Fairness

Abstract:We present a new data-driven model of fairness that, unlike existing static definitions of individual or group fairness is guided by the unfairness complaints received by the system. Our model supports multiple fairness criteria and takes into account their potential incompatibilities. We consider both a stochastic and an adversarial setting of our model. In the stochastic setting, we show that our framework can be naturally cast as a Markov Decision Process with stochastic losses, for which we give efficient vanishing regret algorithmic solutions. In the adversarial setting, we design efficient algorithms with competitive ratio guarantees. We also report the results of experiments with our algorithms and the stochastic framework on artificial datasets, to demonstrate their effectiveness empirically.

Via

Access Paper or Ask Questions

Relative Deviation Margin Bounds

Jun 26, 2020

Corinna Cortes, Mehryar Mohri, Ananda Theertha Suresh

Figure 1 for Relative Deviation Margin Bounds

Abstract:We present a series of new and more favorable margin-based learning guarantees that depend on the empirical margin loss of a predictor. We give two types of learning bounds, both data-dependent ones and bounds valid for general families, in terms of the Rademacher complexity or the empirical $\ell_\infty$ covering number of the hypothesis set used. We also briefly highlight several applications of these bounds and discuss their connection with existing results.

* 29 pages

Via

Access Paper or Ask Questions

Adaptive Region-Based Active Learning

Feb 18, 2020

Corinna Cortes, Giulia DeSalvo, Claudio Gentile, Mehryar Mohri, Ningshan Zhang

Figure 1 for Adaptive Region-Based Active Learning

Figure 2 for Adaptive Region-Based Active Learning

Figure 3 for Adaptive Region-Based Active Learning

Figure 4 for Adaptive Region-Based Active Learning

Abstract:We present a new active learning algorithm that adaptively partitions the input space into a finite number of regions, and subsequently seeks a distinct predictor for each region, both phases actively requesting labels. We prove theoretical guarantees for both the generalization error and the label complexity of our algorithm, and analyze the number of regions defined by the algorithm under some mild assumptions. We also report the results of an extensive suite of experiments on several real-world datasets demonstrating substantial empirical benefits over existing single-region and non-adaptive region-based active learning baselines.

Via

Access Paper or Ask Questions

Learning GANs and Ensembles Using Discrepancy

Nov 06, 2019

Ben Adlam, Corinna Cortes, Mehryar Mohri, Ningshan Zhang

Figure 1 for Learning GANs and Ensembles Using Discrepancy

Figure 2 for Learning GANs and Ensembles Using Discrepancy

Figure 3 for Learning GANs and Ensembles Using Discrepancy

Figure 4 for Learning GANs and Ensembles Using Discrepancy

Abstract:Generative adversarial networks (GANs) generate data based on minimizing a divergence between two distributions. The choice of that divergence is therefore critical. We argue that the divergence must take into account the hypothesis set and the loss function used in a subsequent learning task, where the data generated by a GAN serves for training. Taking that structural information into account is also important to derive generalization guarantees. Thus, we propose to use the discrepancy measure, which was originally introduced for the closely related problem of domain adaptation and which precisely takes into account the hypothesis set and the loss function. We show that discrepancy admits favorable properties for training GANs and prove explicit generalization guarantees. We present efficient algorithms using discrepancy for two tasks: training a GAN directly, namely DGAN, and mixing previously trained generative models, namely EDGAN. Our experiments on toy examples and several benchmark datasets show that DGAN is competitive with other GANs and that EDGAN outperforms existing GAN ensembles, such as AdaGAN.

Via

Access Paper or Ask Questions

AdaNet: A Scalable and Flexible Framework for Automatically Learning Ensembles

Apr 30, 2019

Charles Weill, Javier Gonzalvo, Vitaly Kuznetsov, Scott Yang, Scott Yak, Hanna Mazzawi, Eugen Hotaj, Ghassen Jerfel, Vladimir Macko, Ben Adlam(+2 more)

Figure 1 for AdaNet: A Scalable and Flexible Framework for Automatically Learning Ensembles

Figure 2 for AdaNet: A Scalable and Flexible Framework for Automatically Learning Ensembles

Figure 3 for AdaNet: A Scalable and Flexible Framework for Automatically Learning Ensembles

Abstract:AdaNet is a lightweight TensorFlow-based (Abadi et al., 2015) framework for automatically learning high-quality ensembles with minimal expert intervention. Our framework is inspired by the AdaNet algorithm (Cortes et al., 2017) which learns the structure of a neural network as an ensemble of subnetworks. We designed it to: (1) integrate with the existing TensorFlow ecosystem, (2) offer sensible default search spaces to perform well on novel datasets, (3) present a flexible API to utilize expert information when available, and (4) efficiently accelerate training with distributed CPU, GPU, and TPU hardware. The code is open-source and available at: https://github.com/tensorflow/adanet.

Via

Access Paper or Ask Questions