Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thibault Vatter

Vine Copulas as Differentiable Computational Graphs

Jun 16, 2025

Tuoyuan Cheng, Thibault Vatter, Thomas Nagler, Kan Chen

Abstract:Vine copulas are sophisticated models for multivariate distributions and are increasingly used in machine learning. To facilitate their integration into modern ML pipelines, we introduce the vine computational graph, a DAG that abstracts the multilevel vine structure and associated computations. On this foundation, we devise new algorithms for conditional sampling, efficient sampling-order scheduling, and constructing vine structures for customized conditioning variables. We implement these ideas in torchvinecopulib, a GPU-accelerated Python library built upon PyTorch, delivering improved scalability for fitting, sampling, and density evaluation. Our experiments illustrate how gradient flowing through the vine can improve Vine Copula Autoencoders and that incorporating vines for uncertainty quantification in deep learning can outperform MC-dropout, deep ensembles, and Bayesian Neural Networks in sharpness, calibration, and runtime. By recasting vine copula models as computational graphs, our work connects classical dependence modeling with modern deep-learning toolchains and facilitates the integration of state-of-the-art copula methods in modern machine learning pipelines.

Via

Access Paper or Ask Questions

Marginalizable Density Models

Jun 08, 2021

Dar Gilboa, Ari Pakman, Thibault Vatter

Figure 1 for Marginalizable Density Models

Figure 2 for Marginalizable Density Models

Figure 3 for Marginalizable Density Models

Figure 4 for Marginalizable Density Models

Abstract:Probability density models based on deep networks have achieved remarkable success in modeling complex high-dimensional datasets. However, unlike kernel density estimators, modern neural models do not yield marginals or conditionals in closed form, as these quantities require the evaluation of seldom tractable integrals. In this work, we present the Marginalizable Density Model Approximator (MDMA), a novel deep network architecture which provides closed form expressions for the probabilities, marginals and conditionals of any subset of the variables. The MDMA learns deep scalar representations for each individual variable and combines them via learned hierarchical tensor decompositions into a tractable yet expressive CDF, from which marginals and conditional densities are easily obtained. We illustrate the advantage of exact marginalizability in several tasks that are out of reach of previous deep network-based density estimation models, such as estimating mutual information between arbitrary subsets of variables, inferring causality by testing for conditional independence, and inference with missing data without the need for data imputation, outperforming state-of-the-art models on these tasks. The model also allows for parallelized sampling with only a logarithmic dependence of the time complexity on the number of variables.

Via

Access Paper or Ask Questions

Copulas as High-Dimensional Generative Models: Vine Copula Autoencoders

Jun 12, 2019

Natasa Tagasovska, Damien Ackerer, Thibault Vatter

Figure 1 for Copulas as High-Dimensional Generative Models: Vine Copula Autoencoders

Figure 2 for Copulas as High-Dimensional Generative Models: Vine Copula Autoencoders

Figure 3 for Copulas as High-Dimensional Generative Models: Vine Copula Autoencoders

Figure 4 for Copulas as High-Dimensional Generative Models: Vine Copula Autoencoders

Abstract:We propose a vine copula autoencoder to construct flexible generative models for high-dimensional distributions in a straightforward three-step procedure. First, an autoencoder compresses the data using a lower dimensional representation. Second, the multivariate distribution of the encoded data is estimated with vine copulas. Third, a generative model is obtained by combining the estimated distribution with the decoder part of the autoencoder. This approach can transform any already trained autoencoder into a flexible generative model at a low computational cost. This is an advantage over existing generative models such as adversarial networks and variational autoencoders which can be difficult to train and can impose strong assumptions on the latent space. Experiments on MNIST, Street View House Numbers and Large-Scale CelebFaces Attributes datasets show that vine copulas autoencoders can achieve competitive results to standard baselines.

Via

Access Paper or Ask Questions

Deep Smoothing of the Implied Volatility Surface

Jun 12, 2019

Damien Ackerer, Natasa Tagasovska, Thibault Vatter

Figure 1 for Deep Smoothing of the Implied Volatility Surface

Figure 2 for Deep Smoothing of the Implied Volatility Surface

Figure 3 for Deep Smoothing of the Implied Volatility Surface

Figure 4 for Deep Smoothing of the Implied Volatility Surface

Abstract:We present an artificial neural network (ANN) approach to value financial derivatives. Atypically to standard ANN applications, practitioners equally use option pricing models to validate market prices and to infer unobserved prices. Importantly, models need to generate realistic arbitrage-free prices, meaning that no option portfolio can lead to risk-free profits. The absence of arbitrage opportunities is guaranteed by penalizing the loss using soft constraints on an extended grid of input values. ANNs can be pre-trained by first calibrating a standard option pricing model, and then training an ANN to a larger synthetic dataset generated from the calibrated model. The parameters transfer as well as the non-arbitrage constraints appear to be particularly useful when only sparse or erroneous data are available. We also explore how deeper ANNs improve over shallower ones, as well as other properties of the network architecture. We benchmark our method against standard option pricing models, such as Heston with and without jumps. We validate our method both on training sets, and testing sets, namely, highlighting both their capacity to reproduce observed prices and predict new ones.

Via

Access Paper or Ask Questions

Generative Models for Simulating Mobility Trajectories

Nov 30, 2018

Vaibhav Kulkarni, Natasa Tagasovska, Thibault Vatter, Benoit Garbinato

Figure 1 for Generative Models for Simulating Mobility Trajectories

Figure 2 for Generative Models for Simulating Mobility Trajectories

Figure 3 for Generative Models for Simulating Mobility Trajectories

Figure 4 for Generative Models for Simulating Mobility Trajectories

Abstract:Mobility datasets are fundamental for evaluating algorithms pertaining to geographic information systems and facilitating experimental reproducibility. But privacy implications restrict sharing such datasets, as even aggregated location-data is vulnerable to membership inference attacks. Current synthetic mobility dataset generators attempt to superficially match a priori modeled mobility characteristics which do not accurately reflect the real-world characteristics. Modeling human mobility to generate synthetic yet semantically and statistically realistic trajectories is therefore crucial for publishing trajectory datasets having satisfactory utility level while preserving user privacy. Specifically, long-range dependencies inherent to human mobility are challenging to capture with both discriminative and generative models. In this paper, we benchmark the performance of recurrent neural architectures (RNNs), generative adversarial networks (GANs) and nonparametric copulas to generate synthetic mobility traces. We evaluate the generated trajectories with respect to their geographic and semantic similarity, circadian rhythms, long-range dependencies, training and generation time. We also include two sample tests to assess statistical similarity between the observed and simulated distributions, and we analyze the privacy tradeoffs with respect to membership inference and location-sequence attacks.

Via

Access Paper or Ask Questions

Nonparametric Quantile-Based Causal Discovery

Oct 07, 2018

Natasa Tagasovska, Thibault Vatter, Valérie Chavez-Demoulin

Figure 1 for Nonparametric Quantile-Based Causal Discovery

Figure 2 for Nonparametric Quantile-Based Causal Discovery

Figure 3 for Nonparametric Quantile-Based Causal Discovery

Figure 4 for Nonparametric Quantile-Based Causal Discovery

Abstract:Distinguishing cause from effect using observational data is a challenging problem, especially in the bivariate case. Contemporary methods often assume an independence between the cause and the generating mechanism of the effect given the cause. From this postulate, they derive asymmetries to uncover causal relationships. Leveraging the same postulate, in this work, we propose a novel approach based on the link between Kolmogorov complexity and quantile scoring. We use a nonparametric conditional quantile estimator based on copulas to implement our procedure, thus avoiding restrictive assumptions about the joint distribution between cause and effect. In an extensive study on real and synthetic data, we show that quantile copula causal discovery (QCCD) compares favorably to state-of-the-art methods.

Via

Access Paper or Ask Questions