Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Clément Chadebec

CRC, Université de Paris

LBM: Latent Bridge Matching for Fast Image-to-Image Translation

Mar 10, 2025

Clément Chadebec, Onur Tasar, Sanjeev Sreetharan, Benjamin Aubin

Abstract:In this paper, we introduce Latent Bridge Matching (LBM), a new, versatile and scalable method that relies on Bridge Matching in a latent space to achieve fast image-to-image translation. We show that the method can reach state-of-the-art results for various image-to-image tasks using only a single inference step. In addition to its efficiency, we also demonstrate the versatility of the method across different image translation tasks such as object removal, normal and depth estimation, and object relighting. We also derive a conditional framework of LBM and demonstrate its effectiveness by tackling the tasks of controllable image relighting and shadow generation. We provide an open-source implementation of the method at https://github.com/gojasper/LBM.

Via

Access Paper or Ask Questions

Controllable Shadow Generation with Single-Step Diffusion Models from Synthetic Data

Dec 16, 2024

Onur Tasar, Clément Chadebec, Benjamin Aubin

Abstract:Realistic shadow generation is a critical component for high-quality image compositing and visual effects, yet existing methods suffer from certain limitations: Physics-based approaches require a 3D scene geometry, which is often unavailable, while learning-based techniques struggle with control and visual artifacts. We introduce a novel method for fast, controllable, and background-free shadow generation for 2D object images. We create a large synthetic dataset using a 3D rendering engine to train a diffusion model for controllable shadow generation, generating shadow maps for diverse light source parameters. Through extensive ablation studies, we find that rectified flow objective achieves high-quality results with just a single sampling step enabling real-time applications. Furthermore, our experiments demonstrate that the model generalizes well to real-world images. To facilitate further research in evaluating quality and controllability in shadow generation, we release a new public benchmark containing a diverse set of object images and shadow maps in various settings. The project page is available at https://gojasper.github.io/controllable-shadow-generation-project/

Via

Access Paper or Ask Questions

Improving Multimodal Joint Variational Autoencoders through Normalizing Flows and Correlation Analysis

May 19, 2023

Agathe Senellart, Clément Chadebec, Stéphanie Allassonnière

Abstract:We propose a new multimodal variational autoencoder that enables to generate from the joint distribution and conditionally to any number of complex modalities. The unimodal posteriors are conditioned on the Deep Canonical Correlation Analysis embeddings which preserve the shared information across modalities leading to more coherent cross-modal generations. Furthermore, we use Normalizing Flows to enrich the unimodal posteriors and achieve more diverse data generation. Finally, we propose to use a Product of Experts for inferring one modality from several others which makes the model scalable to any number of modalities. We demonstrate that our method improves likelihood estimates, diversity of the generations and in particular coherence metrics in the conditional generations on several datasets.

Via

Access Paper or Ask Questions

Variational Inference for Longitudinal Data Using Normalizing Flows

Mar 24, 2023

Clément Chadebec, Stéphanie Allassonnière

Abstract:This paper introduces a new latent variable generative model able to handle high dimensional longitudinal data and relying on variational inference. The time dependency between the observations of an input sequence is modelled using normalizing flows over the associated latent variables. The proposed method can be used to generate either fully synthetic longitudinal sequences or trajectories that are conditioned on several data in a sequence and demonstrates good robustness properties to missing data. We test the model on 6 datasets of different complexity and show that it can achieve better likelihood estimates than some competitors as well as more reliable missing data imputation. A code is made available at \url{https://github.com/clementchadebec/variational_inference_for_longitudinal_data}.

Via

Access Paper or Ask Questions

A Geometric Perspective on Variational Autoencoders

Sep 15, 2022

Clément Chadebec, Stéphanie Allassonnière

Figure 1 for A Geometric Perspective on Variational Autoencoders

Figure 2 for A Geometric Perspective on Variational Autoencoders

Figure 3 for A Geometric Perspective on Variational Autoencoders

Figure 4 for A Geometric Perspective on Variational Autoencoders

Abstract:This paper introduces a new interpretation of the Variational Autoencoder framework by taking a fully geometric point of view. We argue that vanilla VAE models unveil naturally a Riemannian structure in their latent space and that taking into consideration those geometrical aspects can lead to better interpolations and an improved generation procedure. This new proposed sampling method consists in sampling from the uniform distribution deriving intrinsically from the learned Riemannian latent space and we show that using this scheme can make a vanilla VAE competitive and even better than more advanced versions on several benchmark datasets. Since generative models are known to be sensitive to the number of training samples we also stress the method's robustness in the low data regime.

* Accepted to NeurIPS 2022

Via

Access Paper or Ask Questions

Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

Jun 16, 2022

Clément Chadebec, Louis J. Vincent, Stéphanie Allassonnière

Figure 1 for Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

Figure 2 for Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

Figure 3 for Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

Figure 4 for Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case

Abstract:In recent years, deep generative models have attracted increasing interest due to their capacity to model complex distributions. Among those models, variational autoencoders have gained popularity as they have proven both to be computationally efficient and yield impressive results in multiple fields. Following this breakthrough, extensive research has been done in order to improve the original publication, resulting in a variety of different VAE models in response to different tasks. In this paper we present Pythae, a versatile open-source Python library providing both a unified implementation and a dedicated framework allowing straightforward, reproducible and reliable use of generative autoencoder models. We then propose to use this library to perform a case study benchmark where we present and compare 19 generative autoencoder models representative of some of the main improvements on downstream tasks such as image reconstruction, generation, classification, clustering and interpolation. The open-source library can be found at https://github.com/clementchadebec/benchmark_VAE.

Via

Access Paper or Ask Questions

Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder

Apr 30, 2021

Clément Chadebec, Elina Thibeau-Sutre, Ninon Burgos, Stéphanie Allassonnière

Figure 1 for Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder

Figure 2 for Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder

Figure 3 for Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder

Figure 4 for Data Augmentation in High Dimensional Low Sample Size Setting Using a Geometry-Based Variational Autoencoder

Abstract:In this paper, we propose a new method to perform data augmentation in a reliable way in the High Dimensional Low Sample Size (HDLSS) setting using a geometry-based variational autoencoder. Our approach combines a proper latent space modeling of the VAE seen as a Riemannian manifold with a new generation scheme which produces more meaningful samples especially in the context of small data sets. The proposed method is tested through a wide experimental study where its robustness to data sets, classifiers and training samples size is stressed. It is also validated on a medical imaging classification task on the challenging ADNI database where a small number of 3D brain MRIs are considered and augmented using the proposed VAE framework. In each case, the proposed method allows for a significant and reliable gain in the classification metrics. For instance, balanced accuracy jumps from 66.3% to 74.3% for a state-of-the-art CNN classifier trained with 50 MRIs of cognitively normal (CN) and 50 Alzheimer disease (AD) patients and from 77.7% to 86.3% when trained with 243 CN and 210 AD while improving greatly sensitivity and specificity metrics.

Via

Access Paper or Ask Questions

Data Generation in Low Sample Size Setting Using Manifold Sampling and a Geometry-Aware VAE

Mar 25, 2021

Clément Chadebec, Stéphanie Allassonnière

Figure 1 for Data Generation in Low Sample Size Setting Using Manifold Sampling and a Geometry-Aware VAE

Figure 2 for Data Generation in Low Sample Size Setting Using Manifold Sampling and a Geometry-Aware VAE

Figure 3 for Data Generation in Low Sample Size Setting Using Manifold Sampling and a Geometry-Aware VAE

Figure 4 for Data Generation in Low Sample Size Setting Using Manifold Sampling and a Geometry-Aware VAE

Abstract:While much efforts have been focused on improving Variational Autoencoders through richer posterior and prior distributions, little interest was shown in amending the way we generate the data. In this paper, we develop two non \emph{prior-dependent} generation procedures based on the geometry of the latent space seen as a Riemannian manifold. The first one consists in sampling along geodesic paths which is a natural way to explore the latent space while the second one consists in sampling from the inverse of the metric volume element which is easier to use in practice. Both methods are then compared to \emph{prior-based} methods on various data sets and appear well suited for a limited data regime. Finally, the latter method is used to perform data augmentation in a small sample size setting and is validated across various standard and \emph{real-life} data sets. In particular, this scheme allows to greatly improve classification results on the OASIS database where balanced accuracy jumps from 80.7% for a classifier trained with the raw data to 89.1% when trained only with the synthetic data generated by our method. Such results were also observed on 4 standard data sets.

Via

Access Paper or Ask Questions

Geometry-Aware Hamiltonian Variational Auto-Encoder

Oct 22, 2020

Clément Chadebec, Clément Mantoux, Stéphanie Allassonnière

Figure 1 for Geometry-Aware Hamiltonian Variational Auto-Encoder

Figure 2 for Geometry-Aware Hamiltonian Variational Auto-Encoder

Figure 3 for Geometry-Aware Hamiltonian Variational Auto-Encoder

Figure 4 for Geometry-Aware Hamiltonian Variational Auto-Encoder

Abstract:Variational auto-encoders (VAEs) have proven to be a well suited tool for performing dimensionality reduction by extracting latent variables lying in a potentially much smaller dimensional space than the data. Their ability to capture meaningful information from the data can be easily apprehended when considering their capability to generate new realistic samples or perform potentially meaningful interpolations in a much smaller space. However, such generative models may perform poorly when trained on small data sets which are abundant in many real-life fields such as medicine. This may, among others, come from the lack of structure of the latent space, the geometry of which is often under-considered. We thus propose in this paper to see the latent space as a Riemannian manifold endowed with a parametrized metric learned at the same time as the encoder and decoder networks. This metric is then used in what we called the Riemannian Hamiltonian VAE which extends the Hamiltonian VAE introduced by arXiv:1805.11328 to better exploit the underlying geometry of the latent space. We argue that such latent space modelling provides useful information about its underlying structure leading to far more meaningful interpolations, more realistic data-generation and more reliable clustering.

* 44 pages, 23 figures

Via

Access Paper or Ask Questions