Abstract: In online bilateral trade, a platform posts prices to incoming pairs of buyers and sellers who have private valuations for a certain good. If the price is lower than the buyer's valuation and higher than the seller's valuation, then a trade takes place. Previous work focused on the platform's perspective, with the goal of setting prices that maximize the gain from trade (the sum of the sellers' and buyers' utilities). Gain from trade is, however, potentially unfair to traders, as they may receive highly uneven shares of the total utility. In this work we enforce fairness by rewarding the platform with the fair gain from trade, defined as the minimum between the sellers' and buyers' utilities. After showing that any no-regret learning algorithm designed to maximize the sum of the utilities may fail badly with fair gain from trade, we present our main contribution: a complete characterization of the regret regimes for fair gain from trade when, after each interaction, the platform only learns whether each trader accepted the current price. Specifically, we prove the following regret bounds: $\Theta(\ln T)$ in the deterministic setting, $\Omega(T)$ in the stochastic setting, and $\tilde{\Theta}(T^{2/3})$ in the stochastic setting when sellers' and buyers' valuations are independent of each other. We conclude by providing tight regret bounds when, after each interaction, the platform is allowed to observe the traders' true valuations.
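To make the two objectives concrete, here is a minimal formalization under the simplifying assumption that a single price $p$ is posted to both traders at round $t$, with seller valuation $s_t$ and buyer valuation $b_t$ (the paper may allow separate prices for the two sides):

$$
\mathrm{GFT}_t(p) = \big((b_t - p) + (p - s_t)\big)\,\mathbb{1}\{s_t \le p \le b_t\} = (b_t - s_t)\,\mathbb{1}\{s_t \le p \le b_t\},
\qquad
\mathrm{FGFT}_t(p) = \min(b_t - p,\; p - s_t)\,\mathbb{1}\{s_t \le p \le b_t\}.
$$

Note that, when a trade occurs, $\mathrm{GFT}_t$ does not depend on where $p$ falls inside $[s_t, b_t]$, whereas $\mathrm{FGFT}_t$ is maximized at the midpoint $p = (s_t + b_t)/2$, where buyer and seller receive equal utility.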
Abstract: Probability density function estimation with weighted samples is the main foundation of all adaptive importance sampling algorithms. Classically, a target distribution is approximated either by a non-parametric model or within a parametric family. However, these models suffer from the curse of dimensionality or from a lack of flexibility. In this contribution, we suggest using, as the approximating model, a distribution parameterised by a variational autoencoder. We extend the existing framework to the case of weighted samples by introducing a new objective function. The flexibility of the obtained family of distributions makes it as expressive as a non-parametric model and, despite the very high number of parameters to estimate, much more efficient in high dimension than the classical Gaussian or Gaussian mixture families. Moreover, to add flexibility to the model and to be able to learn multimodal distributions, we consider a learnable prior distribution for the latent variables of the variational autoencoder. We also introduce a new pre-training procedure for the variational autoencoder to find good starting weights for the neural networks and to prevent, as much as possible, posterior collapse. Finally, we make explicit how the resulting distribution can be combined with importance sampling, and we exploit the proposed procedure in existing adaptive importance sampling algorithms to draw points from a target distribution and to estimate a rare-event probability in high dimension on two multimodal problems.
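As a rough illustration of what an objective for weighted samples might look like (a sketch under our own assumptions, not necessarily the objective introduced in the paper): given samples $x_1,\dots,x_N$ with importance weights $w_1,\dots,w_N$, one can weight the usual evidence lower bound of a variational autoencoder with encoder $q_\phi$, decoder $p_\theta$ and (here, learnable) prior $p_\psi$:

$$
\mathcal{L}(\theta,\phi,\psi) \;=\; \sum_{i=1}^{N} w_i \Big( \mathbb{E}_{q_\phi(z \mid x_i)}\big[\log p_\theta(x_i \mid z)\big] \;-\; \mathrm{KL}\big(q_\phi(z \mid x_i) \,\|\, p_\psi(z)\big) \Big).
$$

Maximizing such a weighted bound fits the decoder-induced density to the weighted empirical distribution rather than to the raw samples.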
Abstract: The distribution regression problem encompasses many important statistics and machine learning tasks and arises in a wide range of applications. Among the various existing approaches to tackle this problem, kernel methods have become a method of choice. Indeed, kernel distribution regression is both computationally favorable and supported by a recent learning theory. This theory also covers the two-stage sampling setting, where only samples from the input distributions are available. In this paper, we improve the learning theory of kernel distribution regression. We address kernels based on Hilbertian embeddings, which encompass most, if not all, of the existing approaches. We introduce a novel near-unbiased condition on the Hilbertian embeddings which, thanks to a new analysis, enables us to provide new error bounds on the effect of the two-stage sampling. We show that this near-unbiased condition holds for three important classes of kernels, based on optimal transport and mean embeddings. As a consequence, we strictly improve the existing convergence rates for these kernels. Our setting and results are illustrated by numerical experiments.
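For context, a schematic form of kernel distribution regression with Hilbertian embeddings (generic notation of our own choosing, not taken from the paper): each input distribution $P_i$ is mapped to an element $\Phi(P_i)$ of a Hilbert space, a kernel $k$ is defined on these embeddings, and the regression function is fitted by kernel ridge regression. In the two-stage sampling setting, each $P_i$ is only accessible through a finite sample, so $\Phi(P_i)$ is replaced by the embedding of the empirical distribution $\hat{P}_i$:

$$
\hat{f} \;=\; \arg\min_{f \in \mathcal{H}_k} \; \frac{1}{n}\sum_{i=1}^{n} \big( y_i - f(\Phi(\hat{P}_i)) \big)^2 \;+\; \lambda \, \|f\|_{\mathcal{H}_k}^2 .
$$

The error bounds discussed above control the extra error incurred by substituting $\hat{P}_i$ for $P_i$ in this second stage.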
Abstract: We present a novel kernel over the space of probability measures, based on the dual formulation of optimal regularized transport. We propose a Hilbertian embedding of the space of probabilities using their Sinkhorn potentials, which are solutions of the dual entropic relaxed optimal transport problem between the probabilities and a reference measure $\mathcal{U}$. We prove that this construction, using the Hilbert norms, yields a valid kernel. We prove that the kernel enjoys theoretical properties such as universality and some invariances, while remaining computationally feasible. Moreover, we provide theoretical guarantees on the behaviour of a Gaussian process based on this kernel. The empirical performance is compared with other traditional choices of kernels for processes indexed by distributions.
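Schematically (with notation of our own choosing), if $f_\mu$ denotes the Sinkhorn potential arising from the entropic optimal transport problem between a probability measure $\mu$ and the reference measure $\mathcal{U}$, the Hilbertian embedding $\mu \mapsto f_\mu$ can be turned into a kernel of Gaussian type, for instance

$$
k(\mu, \nu) \;=\; \exp\!\Big( -\tfrac{1}{2\sigma^2} \, \| f_\mu - f_\nu \|_{L^2(\mathcal{U})}^2 \Big),
$$

so that positive definiteness follows from the Hilbertian structure of the embedding space. The exact norm and kernel family used in the paper may differ from this sketch.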
Abstract: Non-stationary source separation is a well-established branch of blind source separation with many different methods. However, large-sample results are not available for any of these methods. To bridge this gap, we develop large-sample theory for NSS-JD, a popular method of non-stationary source separation based on the joint diagonalization of block-wise covariance matrices. We work under an instantaneous linear mixing model for independent Gaussian non-stationary source signals, together with a very general set of assumptions: besides boundedness conditions, the only assumptions we make are that the sources exhibit finite dependence and that their variance functions differ sufficiently to be asymptotically separable. Under these conditions, we show that the unmixing estimator is consistent and converges to a limiting Gaussian distribution at the standard square-root rate. Simulation experiments are used to verify the theoretical results and to study the impact of the block length on the separation.
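To illustrate the whiten-then-diagonalize idea behind methods of this type, here is a minimal NumPy sketch using only two blocks, in which case joint diagonalization reduces to a single eigendecomposition (the actual NSS-JD estimator jointly diagonalizes many block-wise covariance matrices; the function name and the two-block simplification are ours):

```python
import numpy as np

def two_block_unmixing(X):
    """Simplified two-block sketch of joint-diagonalization-based
    non-stationary source separation.

    X: array of shape (n_samples, p) holding the observed mixtures.
    Returns an estimated unmixing matrix W, so that X @ W.T recovers
    the sources up to sign, scale and permutation.
    """
    n, _ = X.shape
    Xc = X - X.mean(axis=0)

    # Step 1: whiten with the full-sample covariance matrix.
    C0 = np.cov(Xc, rowvar=False)
    vals, vecs = np.linalg.eigh(C0)
    whitener = vecs @ np.diag(vals ** -0.5) @ vecs.T
    Z = Xc @ whitener.T

    # Step 2: covariance of the first block of the whitened data; with only
    # two blocks, jointly diagonalizing both block covariances amounts to
    # diagonalizing this single symmetric matrix.
    C1 = np.cov(Z[: n // 2], rowvar=False)
    _, U = np.linalg.eigh(C1)

    return U.T @ whitener
```

With more than two blocks, Step 2 is replaced by an approximate joint diagonalization of all whitened block-wise covariances.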
Abstract: We analyze the cumulative regret of the Dyadic Search algorithm of Bachoc et al. [2022].
Abstract: This paper studies a natural generalization of the problem of minimizing a univariate convex function $f$ by querying its values sequentially. At each time step $t$, the optimizer can invest a budget $b_t$ in a query point $X_t$ of their choice to obtain a fuzzy evaluation of $f$ at $X_t$, whose accuracy depends on the amount of budget invested in $X_t$ across time. This setting is motivated by the minimization of objectives whose values can only be determined approximately through lengthy or expensive computations. We design an anytime, parameter-free algorithm called Dyadic Search, for which we prove near-optimal optimization error guarantees. As a byproduct of our analysis, we show that the classical dependence on the global Lipschitz constant in the error bounds is an artifact of the granularity of the budget. Finally, we illustrate our theoretical findings with numerical simulations.
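A small Python sketch of the query protocol described above, with an illustrative accuracy model (interval width inversely proportional to the cumulative budget spent at the point; this specific model and the class name are ours, not the paper's):

```python
from collections import defaultdict

class FuzzyOracle:
    """Budgeted query protocol: investing budget b at a point x returns an
    interval guaranteed to contain f(x), whose width shrinks as the total
    budget spent at x grows (illustrative 1/budget width)."""

    def __init__(self, f):
        self.f = f
        self.spent = defaultdict(float)   # cumulative budget per query point

    def query(self, x, b):
        self.spent[x] += b
        eps = 1.0 / self.spent[x]         # accuracy improves with cumulative budget
        fx = self.f(x)
        return fx - eps, fx + eps         # fuzzy evaluation of f at x


# Example interaction: repeated queries at the same point tighten the interval.
oracle = FuzzyOracle(lambda x: (x - 0.3) ** 2)
print(oracle.query(0.5, 1.0))   # wide interval around f(0.5)
print(oracle.query(0.5, 4.0))   # tighter interval at the same point
```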
Abstract: Is a sample rich enough to determine, at least locally, the parameters of a neural network? To answer this question, we introduce a new local parameterization of a given deep ReLU neural network by fixing the values of some of its weights. This allows us to define local lifting operators whose inverses are charts of a smooth manifold of a high-dimensional space. The function implemented by the deep ReLU neural network composes the local lifting with a linear operator which depends on the sample. From this convenient representation we derive a geometrical necessary and sufficient condition of local identifiability. Looking at tangent spaces, the geometrical condition provides: (1) a sharp and testable necessary condition of identifiability and (2) a sharp and testable sufficient condition of local identifiability. The validity of the conditions can be tested numerically using backpropagation and matrix rank computations.
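The last sentence points to a numerical test based on backpropagation and rank computations. As a generic illustration of this kind of check (not the paper's exact criterion, whose construction involves the local lifting operators), one can compute the Jacobian of the network outputs on the sample with respect to all parameters and inspect its rank; a rank deficiency beyond what the known symmetries account for indicates that the sample cannot locally pin down the parameters:

```python
import torch

def parameter_jacobian_rank(model, X, tol=1e-6):
    """Rank of the Jacobian of the outputs of `model` on the sample `X`
    with respect to all trainable parameters, obtained by backpropagating
    each scalar output (illustrative check, not the paper's exact condition)."""
    params = [p for p in model.parameters() if p.requires_grad]
    outputs = model(X).reshape(-1)
    rows = []
    for out in outputs:
        grads = torch.autograd.grad(out, params, retain_graph=True, allow_unused=True)
        rows.append(torch.cat([
            (g if g is not None else torch.zeros_like(p)).reshape(-1)
            for g, p in zip(grads, params)
        ]))
    jacobian = torch.stack(rows)           # shape: (n_outputs, n_parameters)
    return torch.linalg.matrix_rank(jacobian, atol=tol).item()
```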
Abstract: We introduce an additive Gaussian process framework that accounts for monotonicity constraints and is scalable to high dimensions. Our contributions are threefold. First, we show that our framework enables the constraints to be satisfied everywhere in the input space. We also show that more general componentwise linear inequality constraints, such as componentwise convexity, can be handled similarly. Second, we propose the additive MaxMod algorithm for sequential dimension reduction. By sequentially maximizing a squared-norm criterion, MaxMod identifies the active input dimensions and refines the most important ones. This criterion can be computed explicitly at a linear cost. Finally, we provide open-source code for our full framework. We demonstrate the performance and scalability of the methodology in several synthetic examples with hundreds of dimensions under monotonicity constraints, as well as on a real-world flood application.
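For reference, the additive structure at the heart of such a framework can be sketched as a covariance function that is a sum of one-dimensional kernels, one per input dimension (a generic NumPy sketch under our own naming; the paper's constrained finite-dimensional representation and the MaxMod criterion are not reproduced here):

```python
import numpy as np

def additive_gaussian_kernel(X, Y, lengthscales):
    """Additive kernel k(x, y) = sum_d k_d(x_d, y_d) with one-dimensional
    Gaussian components, the covariance structure of an additive GP."""
    K = np.zeros((X.shape[0], Y.shape[0]))
    for d, ell in enumerate(lengthscales):
        diff = X[:, d][:, None] - Y[:, d][None, :]
        K += np.exp(-0.5 * (diff / ell) ** 2)
    return K
```

Each summand involves a single coordinate, which is what allows constraints and dimension-wise criteria to be handled input dimension by input dimension.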
Abstract: The possibility of recovering the parameters (weights and biases) of a neural network from the knowledge of its function on a subset of the input space can be, depending on the situation, a curse or a blessing. On the one hand, recovering the parameters allows for better adversarial attacks and could also disclose sensitive information from the dataset used to construct the network. On the other hand, if the parameters of a network can be recovered, it guarantees the user that the features in the latent spaces can be interpreted. It also provides foundations to obtain formal guarantees on the performance of the network. It is therefore important to characterize the networks whose parameters can be identified and those whose parameters cannot. In this article, we provide a set of conditions on a deep fully-connected feedforward ReLU neural network under which the parameters of the network are uniquely identified, up to permutation and positive rescaling, from the function it implements on a subset of the input space.
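The permutation and positive rescaling invariances mentioned above come from the positive homogeneity of the ReLU activation. For a hidden neuron with incoming weights $w$, bias $b$ and outgoing weights $v$, and any $\lambda > 0$,

$$
v \,\mathrm{ReLU}(w^\top x + b) \;=\; \frac{v}{\lambda}\,\mathrm{ReLU}\big(\lambda w^\top x + \lambda b\big),
$$

so the rescaling $(w, b, v) \mapsto (\lambda w, \lambda b, v/\lambda)$, like permuting neurons within a layer, leaves the implemented function unchanged; identifiability can therefore only hold modulo these transformations.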