Search | arXiv e-print repository

Marginalizable Density Models

Authors: Dar Gilboa, Ari Pakman, Thibault Vatter

Abstract: Probability density models based on deep networks have achieved remarkable success in modeling complex high-dimensional datasets. However, unlike kernel density estimators, modern neural models do not yield marginals or conditionals in closed form, as these quantities require the evaluation of seldom tractable integrals. In this work, we present the Marginalizable Density Model Approximator (MDMA)… ▽ More Probability density models based on deep networks have achieved remarkable success in modeling complex high-dimensional datasets. However, unlike kernel density estimators, modern neural models do not yield marginals or conditionals in closed form, as these quantities require the evaluation of seldom tractable integrals. In this work, we present the Marginalizable Density Model Approximator (MDMA), a novel deep network architecture which provides closed form expressions for the probabilities, marginals and conditionals of any subset of the variables. The MDMA learns deep scalar representations for each individual variable and combines them via learned hierarchical tensor decompositions into a tractable yet expressive CDF, from which marginals and conditional densities are easily obtained. We illustrate the advantage of exact marginalizability in several tasks that are out of reach of previous deep network-based density estimation models, such as estimating mutual information between arbitrary subsets of variables, inferring causality by testing for conditional independence, and inference with missing data without the need for data imputation, outperforming state-of-the-art models on these tasks. The model also allows for parallelized sampling with only a logarithmic dependence of the time complexity on the number of variables. △ Less

Submitted 8 June, 2021; originally announced June 2021.

arXiv:1906.05423 [pdf, other]

Copulas as High-Dimensional Generative Models: Vine Copula Autoencoders

Authors: Natasa Tagasovska, Damien Ackerer, Thibault Vatter

Abstract: We introduce the vine copula autoencoder (VCAE), a flexible generative model for high-dimensional distributions built in a straightforward three-step procedure. First, an autoencoder (AE) compresses the data into a lower dimensional representation. Second, the multivariate distribution of the encoded data is estimated with vine copulas. Third, a generative model is obtained by combining the esti… ▽ More We introduce the vine copula autoencoder (VCAE), a flexible generative model for high-dimensional distributions built in a straightforward three-step procedure. First, an autoencoder (AE) compresses the data into a lower dimensional representation. Second, the multivariate distribution of the encoded data is estimated with vine copulas. Third, a generative model is obtained by combining the estimated distribution with the decoder part of the AE. As such, the proposed approach can transform any already trained AE into a flexible generative model at a low computational cost. This is an advantage over existing generative models such as adversarial networks and variational AEs which can be difficult to train and can impose strong assumptions on the latent space. Experiments on MNIST, Street View House Numbers and Large-Scale CelebFaces Attributes datasets show that VCAEs can achieve competitive results to standard baselines. △ Less

Submitted 27 November, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

Journal ref: Advances in Neural Information Processing Systems 32, pages: 6525--6537, year: 2019

arXiv:1906.05065 [pdf, other]

Deep Smoothing of the Implied Volatility Surface

Authors: Damien Ackerer, Natasa Tagasovska, Thibault Vatter

Abstract: We present a neural network (NN) approach to fit and predict implied volatility surfaces (IVSs). Atypically to standard NN applications, financial industry practitioners use such models equally to replicate market prices and to value other financial instruments. In other words, low training losses are as important as generalization capabilities. Importantly, IVS models need to generate realistic a… ▽ More We present a neural network (NN) approach to fit and predict implied volatility surfaces (IVSs). Atypically to standard NN applications, financial industry practitioners use such models equally to replicate market prices and to value other financial instruments. In other words, low training losses are as important as generalization capabilities. Importantly, IVS models need to generate realistic arbitrage-free option prices, meaning that no portfolio can lead to risk-free profits. We propose an approach guaranteeing the absence of arbitrage opportunities by penalizing the loss using soft constraints. Furthermore, our method can be combined with standard IVS models in quantitative finance, thus providing a NN-based correction when such models fail at replicating observed market prices. This lets practitioners use our approach as a plug-in on top of classical methods. Empirical results show that this approach is particularly useful when only sparse or erroneous data are available. We also quantify the uncertainty of the model predictions in regions with few or no observations. We further explore how deeper NNs improve over shallower ones, as well as other properties of the network architecture. We benchmark our method against standard IVS models. By evaluating our method on both training sets, and testing sets, namely, we highlight both their capacity to reproduce observed prices and predict new ones. △ Less

Submitted 26 October, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

Comments: forthcoming NeurIPS 2020

arXiv:1811.12801 [pdf, other]

Generative Models for Simulating Mobility Trajectories

Authors: Vaibhav Kulkarni, Natasa Tagasovska, Thibault Vatter, Benoit Garbinato

Abstract: Mobility datasets are fundamental for evaluating algorithms pertaining to geographic information systems and facilitating experimental reproducibility. But privacy implications restrict sharing such datasets, as even aggregated location-data is vulnerable to membership inference attacks. Current synthetic mobility dataset generators attempt to superficially match a priori modeled mobility characte… ▽ More Mobility datasets are fundamental for evaluating algorithms pertaining to geographic information systems and facilitating experimental reproducibility. But privacy implications restrict sharing such datasets, as even aggregated location-data is vulnerable to membership inference attacks. Current synthetic mobility dataset generators attempt to superficially match a priori modeled mobility characteristics which do not accurately reflect the real-world characteristics. Modeling human mobility to generate synthetic yet semantically and statistically realistic trajectories is therefore crucial for publishing trajectory datasets having satisfactory utility level while preserving user privacy. Specifically, long-range dependencies inherent to human mobility are challenging to capture with both discriminative and generative models. In this paper, we benchmark the performance of recurrent neural architectures (RNNs), generative adversarial networks (GANs) and nonparametric copulas to generate synthetic mobility traces. We evaluate the generated trajectories with respect to their geographic and semantic similarity, circadian rhythms, long-range dependencies, training and generation time. We also include two sample tests to assess statistical similarity between the observed and simulated distributions, and we analyze the privacy tradeoffs with respect to membership inference and location-sequence attacks. △ Less

Submitted 30 November, 2018; originally announced November 2018.

arXiv:1801.10579 [pdf, other]

Distinguishing Cause from Effect Using Quantiles: Bivariate Quantile Causal Discovery

Authors: Natasa Tagasovska, Valérie Chavez-Demoulin, Thibault Vatter

Abstract: Causal inference using observational data is challenging, especially in the bivariate case. Through the minimum description length principle, we link the postulate of independence between the generating mechanisms of the cause and of the effect given the cause to quantile regression. Based on this theory, we develop Bivariate Quantile Causal Discovery (bQCD), a new method to distinguish cause from… ▽ More Causal inference using observational data is challenging, especially in the bivariate case. Through the minimum description length principle, we link the postulate of independence between the generating mechanisms of the cause and of the effect given the cause to quantile regression. Based on this theory, we develop Bivariate Quantile Causal Discovery (bQCD), a new method to distinguish cause from effect assuming no confounding, selection bias or feedback. Because it uses multiple quantile levels instead of the conditional mean only, bQCD is adaptive not only to additive, but also to multiplicative or even location-scale generating mechanisms. To illustrate the effectiveness of our approach, we perform an extensive empirical comparison on both synthetic and real datasets. This study shows that bQCD is robust across different implementations of the method (i.e., the quantile regression), computationally efficient, and compares favorably to state-of-the-art methods. △ Less

Submitted 14 August, 2020; v1 submitted 31 January, 2018; originally announced January 2018.

Comments: To appear ICML 2020

arXiv:1801.10576 [pdf, other]

Solving estimating equations with copulas

Authors: Thomas Nagler, Thibault Vatter

Abstract: Thanks to their ability to capture complex dependence structures, copulas are frequently used to glue random variables into a joint model with arbitrary marginal distributions. More recently, they have been applied to solve statistical learning problems such as regression or classification. Framing such approaches as solutions of estimating equations, we generalize them in a unified framework. We… ▽ More Thanks to their ability to capture complex dependence structures, copulas are frequently used to glue random variables into a joint model with arbitrary marginal distributions. More recently, they have been applied to solve statistical learning problems such as regression or classification. Framing such approaches as solutions of estimating equations, we generalize them in a unified framework. We can then obtain simultaneous, coherent inferences across multiple regression-like problems. We derive consistency, asymptotic normality, and validity of the bootstrap for corresponding estimators. The conditions allow for both continuous and discrete data as well as parametric, nonparametric, and semiparametric estimators of the copula and marginal distributions. The versatility of this methodology is illustrated by several theoretical examples, a simulation study, and an application to financial portfolio allocation. △ Less

Submitted 19 August, 2022; v1 submitted 31 January, 2018; originally announced January 2018.

arXiv:1610.03050 [pdf, other]

doi 10.1515/demo-2017-0022

Dependent Defaults and Losses with Factor Copula Models

Authors: Damien Ackerer, Thibault Vatter

Abstract: We present a class of flexible and tractable static factor models for the term structure of joint default probabilities, the factor copula models. These high-dimensional models remain parsimonious with pair-copula constructions, and nest many standard models as special cases. The loss distribution of a portfolio of contingent claims can be exactly and efficiently computed when individual losses ar… ▽ More We present a class of flexible and tractable static factor models for the term structure of joint default probabilities, the factor copula models. These high-dimensional models remain parsimonious with pair-copula constructions, and nest many standard models as special cases. The loss distribution of a portfolio of contingent claims can be exactly and efficiently computed when individual losses are discretely supported on a finite grid. Numerical examples study the key features affecting the loss distribution and multi-name credit derivatives prices. An empirical exercise illustrates the flexibility of our approach by fitting credit index tranche prices. △ Less

Submitted 17 January, 2018; v1 submitted 10 October, 2016; originally announced October 2016.

Comments: 29 pages, 11 figures, 3 tables

MSC Class: 60E05; 60E10; 62H05; 62H20; 65T50; 91G20; 91G40; 91G60

Journal ref: Dependence Modeling, Volume 5, Issue 1, Pages 375-399, 2017

arXiv:1608.01593 [pdf, other]

Generalized Additive Models for Pair-Copula Constructions

Authors: Thibault Vatter, Thomas Nagler

Abstract: Pair-copula constructions are flexible dependence models that use bivariate copulas as building blocks. In this paper, we use generalized additive models to extend them by allowing covariates effects. Borrowing ideas from a traditionally univariate context, we let each pair-copula parameter depend directly on the covariates in a parametric, semiparametric or nonparametric way. We propose a sequent… ▽ More Pair-copula constructions are flexible dependence models that use bivariate copulas as building blocks. In this paper, we use generalized additive models to extend them by allowing covariates effects. Borrowing ideas from a traditionally univariate context, we let each pair-copula parameter depend directly on the covariates in a parametric, semiparametric or nonparametric way. We propose a sequential estimation method that we study by simulation, and apply it to investigate the time-varying dependence structure between the intraday returns on four major foreign exchange rates. An R package, a script reproducing the results in this article, and additional simulation results are provided as supplementary material. △ Less

Submitted 15 August, 2017; v1 submitted 4 August, 2016; originally announced August 2016.

Showing 1–8 of 8 results for author: Vatter, T