-
Tail calibration of probabilistic forecasts
Authors:
Sam Allen,
Jonathan Koh,
Johan Segers,
Johanna Ziegel
Abstract:
Probabilistic forecasts comprehensively describe the uncertainty in the unknown future outcome, making them essential for decision making and risk management. While several methods have been introduced to evaluate probabilistic forecasts, existing evaluation techniques are ill-suited to the evaluation of tail properties of such forecasts. However, these tail properties are often of particular inte…
▽ More
Probabilistic forecasts comprehensively describe the uncertainty in the unknown future outcome, making them essential for decision making and risk management. While several methods have been introduced to evaluate probabilistic forecasts, existing evaluation techniques are ill-suited to the evaluation of tail properties of such forecasts. However, these tail properties are often of particular interest to forecast users due to the severe impacts caused by extreme outcomes. In this work, we introduce a general notion of tail calibration for probabilistic forecasts, which allows forecasters to assess the reliability of their predictions for extreme outcomes. We study the relationships between tail calibration and standard notions of forecast calibration, and discuss connections to peaks-over-threshold models in extreme value theory. Diagnostic tools are introduced and applied in a case study on European precipitation forecasts
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Sliced-Wasserstein Estimation with Spherical Harmonics as Control Variates
Authors:
Rémi Leluc,
Aymeric Dieuleveut,
François Portier,
Johan Segers,
Aigerim Zhuman
Abstract:
The Sliced-Wasserstein (SW) distance between probability measures is defined as the average of the Wasserstein distances resulting for the associated one-dimensional projections. As a consequence, the SW distance can be written as an integral with respect to the uniform measure on the sphere and the Monte Carlo framework can be employed for calculating the SW distance. Spherical harmonics are poly…
▽ More
The Sliced-Wasserstein (SW) distance between probability measures is defined as the average of the Wasserstein distances resulting for the associated one-dimensional projections. As a consequence, the SW distance can be written as an integral with respect to the uniform measure on the sphere and the Monte Carlo framework can be employed for calculating the SW distance. Spherical harmonics are polynomials on the sphere that form an orthonormal basis of the set of square-integrable functions on the sphere. Putting these two facts together, a new Monte Carlo method, hereby referred to as Spherical Harmonics Control Variates (SHCV), is proposed for approximating the SW distance using spherical harmonics as control variates. The resulting approach is shown to have good theoretical properties, e.g., a no-error property for Gaussian measures under a certain form of linear dependency between the variables. Moreover, an improved rate of convergence, compared to Monte Carlo, is established for general measures. The convergence analysis relies on the Lipschitz property associated to the SW integrand. Several numerical experiments demonstrate the superior performance of SHCV against state-of-the-art methods for SW distance computation.
△ Less
Submitted 15 May, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
X-Vine Models for Multivariate Extremes
Authors:
Anna Kiriliouk,
Jeong** Lee,
Johan Segers
Abstract:
Regular vine sequences permit the organisation of variables in a random vector along a sequence of trees. Regular vine models have become greatly popular in dependence modelling as a way to combine arbitrary bivariate copulas into higher-dimensional ones, offering flexibility, parsimony, and tractability. In this project, we use regular vine structures to decompose and construct the exponent measu…
▽ More
Regular vine sequences permit the organisation of variables in a random vector along a sequence of trees. Regular vine models have become greatly popular in dependence modelling as a way to combine arbitrary bivariate copulas into higher-dimensional ones, offering flexibility, parsimony, and tractability. In this project, we use regular vine structures to decompose and construct the exponent measure density of a multivariate extreme value distribution, or, equivalently, the tail copula density. Although these densities pose theoretical challenges due to their infinite mass, their homogeneity property offers simplifications. The theory sheds new light on existing parametric families and facilitates the construction of new ones, called X-vines. Computations proceed via recursive formulas in terms of bivariate model components. We develop simulation algorithms for X-vine multivariate Pareto distributions as well as methods for parameter estimation and model selection on the basis of threshold exceedances. The methods are illustrated by Monte Carlo experiments and a case study on US flight delay data.
△ Less
Submitted 27 June, 2024; v1 submitted 23 December, 2023;
originally announced December 2023.
-
Speeding up Monte Carlo Integration: Control Neighbors for Optimal Convergence
Authors:
Rémi Leluc,
François Portier,
Johan Segers,
Aigerim Zhuman
Abstract:
A novel linear integration rule called $\textit{control neighbors}$ is proposed in which nearest neighbor estimates act as control variates to speed up the convergence rate of the Monte Carlo procedure on metric spaces. The main result is the $\mathcal{O}(n^{-1/2} n^{-s/d})$ convergence rate -- where $n$ stands for the number of evaluations of the integrand and $d$ for the dimension of the domain…
▽ More
A novel linear integration rule called $\textit{control neighbors}$ is proposed in which nearest neighbor estimates act as control variates to speed up the convergence rate of the Monte Carlo procedure on metric spaces. The main result is the $\mathcal{O}(n^{-1/2} n^{-s/d})$ convergence rate -- where $n$ stands for the number of evaluations of the integrand and $d$ for the dimension of the domain -- of this estimate for Hölder functions with regularity $s \in (0,1]$, a rate which, in some sense, is optimal. Several numerical experiments validate the complexity bound and highlight the good performance of the proposed estimator.
△ Less
Submitted 4 April, 2024; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Statistical Inference for Hüsler-Reiss Graphical Models Through Matrix Completions
Authors:
Manuel Hentschel,
Sebastian Engelke,
Johan Segers
Abstract:
The severity of multivariate extreme events is driven by the dependence between the largest marginal observations. The Hüsler-Reiss distribution is a versatile model for this extremal dependence, and it is usually parameterized by a variogram matrix. In order to represent conditional independence relations and obtain sparse parameterizations, we introduce the novel Hüsler-Reiss precision matrix. S…
▽ More
The severity of multivariate extreme events is driven by the dependence between the largest marginal observations. The Hüsler-Reiss distribution is a versatile model for this extremal dependence, and it is usually parameterized by a variogram matrix. In order to represent conditional independence relations and obtain sparse parameterizations, we introduce the novel Hüsler-Reiss precision matrix. Similarly to the Gaussian case, this matrix appears naturally in density representations of the Hüsler-Reiss Pareto distribution and encodes the extremal graphical structure through its zero pattern. For a given, arbitrary graph we prove the existence and uniqueness of the completion of a partially specified Hüsler-Reiss variogram matrix so that its precision matrix has zeros on non-edges in the graph. Using suitable estimators for the parameters on the edges, our theory provides the first consistent estimator of graph structured Hüsler-Reiss distributions. If the graph is unknown, our method can be combined with recent structure learning algorithms to jointly infer the graph and the corresponding parameter matrix. Based on our methodology, we propose new tools for statistical inference of sparse Hüsler-Reiss models and illustrate them on large flight delay data in the U.S., as well as Danube river flow data.
△ Less
Submitted 13 October, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
Max-linear graphical models with heavy-tailed factors on trees of transitive tournaments
Authors:
Johan Segers,
Stefka Asenova
Abstract:
Graphical models with heavy-tailed factors can be used to model extremal dependence or causality between extreme events. In a Bayesian network, variables are recursively defined in terms of their parents according to a directed acyclic graph (DAG). We focus on max-linear graphical models with respect to a special type of graphs, which we call a tree of transitive tournaments. The latter are block…
▽ More
Graphical models with heavy-tailed factors can be used to model extremal dependence or causality between extreme events. In a Bayesian network, variables are recursively defined in terms of their parents according to a directed acyclic graph (DAG). We focus on max-linear graphical models with respect to a special type of graphs, which we call a tree of transitive tournaments. The latter are block graphs combining in a tree-like structure a finite number of transitive tournaments, each of which is a DAG in which every two nodes are connected. We study the limit of the joint tails of the max-linear model conditionally on the event that a given variable exceeds a high threshold. Under a suitable condition, the limiting distribution involves the factorization into independent increments along the shortest trail between two variables, thereby imitating the behavior of a Markov random field. We are also interested in the identifiability of the model parameters in case some variables are latent and only a subvector is observed. It turns out that the parameters are identifiable under a criterion on the nodes carrying the latent variables which is easy and quick to check.
△ Less
Submitted 7 June, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Modelling multivariate extreme value distributions via Markov trees
Authors:
Shuang Hu,
Zuoxiang Peng,
Johan Segers
Abstract:
Multivariate extreme value distributions are a common choice for modelling multivariate extremes. In high dimensions, however, the construction of flexible and parsimonious models is challenging. We propose to combine bivariate extreme value distributions into a Markov random field with respect to a tree. Although in general not an extreme value distribution itself, this Markov tree is attracted b…
▽ More
Multivariate extreme value distributions are a common choice for modelling multivariate extremes. In high dimensions, however, the construction of flexible and parsimonious models is challenging. We propose to combine bivariate extreme value distributions into a Markov random field with respect to a tree. Although in general not an extreme value distribution itself, this Markov tree is attracted by a multivariate extreme value distribution. The latter serves as a tree-based approximation to an unknown extreme value distribution with the given bivariate distributions as margins. Given data, we learn an appropriate tree structure by Prim's algorithm with estimated pairwise upper tail dependence coefficients or Kendall's tau values as edge weights. The distributions of pairs of connected variables can be fitted in various ways. The resulting tree-structured extreme value distribution allows for inference on rare event probabilities, as illustrated on river discharge data from the upper Danube basin.
△ Less
Submitted 29 July, 2022;
originally announced August 2022.
-
A Quadrature Rule combining Control Variates and Adaptive Importance Sampling
Authors:
Rémi Leluc,
François Portier,
Johan Segers,
Aigerim Zhuman
Abstract:
Driven by several successful applications such as in stochastic gradient descent or in Bayesian computation, control variates have become a major tool for Monte Carlo integration. However, standard methods do not allow the distribution of the particles to evolve during the algorithm, as is the case in sequential simulation methods. Within the standard adaptive importance sampling framework, a simp…
▽ More
Driven by several successful applications such as in stochastic gradient descent or in Bayesian computation, control variates have become a major tool for Monte Carlo integration. However, standard methods do not allow the distribution of the particles to evolve during the algorithm, as is the case in sequential simulation methods. Within the standard adaptive importance sampling framework, a simple weighted least squares approach is proposed to improve the procedure with control variates. The procedure takes the form of a quadrature rule with adapted quadrature weights to reflect the information brought in by the control variates. The quadrature points and weights do not depend on the integrand, a computational advantage in case of multiple integrands. Moreover, the target density needs to be known only up to a multiplicative constant. Our main result is a non-asymptotic bound on the probabilistic error of the procedure. The bound proves that for improving the estimate's accuracy, the benefits from adaptive importance sampling and control variates can be combined. The good behavior of the method is illustrated empirically on synthetic examples and real-world data for Bayesian linear regression.
△ Less
Submitted 5 October, 2022; v1 submitted 24 May, 2022;
originally announced May 2022.
-
Extremes of Markov random fields on block graphs: max-stable limits and structured Hüsler-Reiss distributions
Authors:
Stefka Asenova,
Johan Segers
Abstract:
We study the joint occurrence of large values of a Markov random field or undirected graphical model associated to a block graph. On such graphs, containing trees as special cases, we aim to generalize recent results for extremes of Markov trees. Every pair of nodes in a block graph is connected by a unique shortest path. These paths are shown to determine the limiting distribution of the properly…
▽ More
We study the joint occurrence of large values of a Markov random field or undirected graphical model associated to a block graph. On such graphs, containing trees as special cases, we aim to generalize recent results for extremes of Markov trees. Every pair of nodes in a block graph is connected by a unique shortest path. These paths are shown to determine the limiting distribution of the properly rescaled random field given that a fixed variable exceeds a high threshold. The latter limit relation implies that the random field is multivariate regularly varying and it determines the max-stable distribution to which component-wise maxima of independent random samples from the field are attracted. When the sub-vectors induced by the blocks have certain limits parametrized by Hüsler-Reiss distributions, the global Markov property of the original field induces a particular structure on the parameter matrix of the limiting max-stable Hüsler-Reiss distribution. The multivariate Pareto version of the latter turns out to be an extremal graphical model according to the original block graph. Thanks to these algebraic relations, the parameters are still identifiable even if some variables are latent.
△ Less
Submitted 8 March, 2023; v1 submitted 9 December, 2021;
originally announced December 2021.
-
Concentration bounds for the empirical angular measure with statistical learning applications
Authors:
Stéphan Clémençon,
Hamid Jalalzai,
Stéphane Lhaut,
Anne Sabourin,
Johan Segers
Abstract:
The angular measure on the unit sphere characterizes the first-order dependence structure of the components of a random vector in extreme regions and is defined in terms of standardized margins. Its statistical recovery is an important step in learning problems involving observations far away from the center. In the common situation that the components of the vector have different distributions, t…
▽ More
The angular measure on the unit sphere characterizes the first-order dependence structure of the components of a random vector in extreme regions and is defined in terms of standardized margins. Its statistical recovery is an important step in learning problems involving observations far away from the center. In the common situation that the components of the vector have different distributions, the rank transformation offers a convenient and robust way of standardizing data in order to build an empirical version of the angular measure based on the most extreme observations. However, the study of the sampling distribution of the resulting empirical angular measure is challenging. It is the purpose of the paper to establish finite-sample bounds for the maximal deviations between the empirical and true angular measures, uniformly over classes of Borel sets of controlled combinatorial complexity. The bounds are valid with high probability and, up to logarithmic factors, scale as the square root of the effective sample size. The bounds are applied to provide performance guarantees for two statistical learning procedures tailored to extreme regions of the input space and built upon the empirical angular measure: binary classification in extreme regions through empirical risk minimization and unsupervised anomaly detection through minimum-volume sets of the sphere.
△ Less
Submitted 17 October, 2022; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Risk bounds when learning infinitely many response functions by ordinary linear regression
Authors:
Vincent Plassier,
François Portier,
Johan Segers
Abstract:
Consider the problem of learning a large number of response functions simultaneously based on the same input variables. The training data consist of a single independent random sample of the input variables drawn from a common distribution together with the associated responses. The input variables are mapped into a high-dimensional linear space, called the feature space, and the response function…
▽ More
Consider the problem of learning a large number of response functions simultaneously based on the same input variables. The training data consist of a single independent random sample of the input variables drawn from a common distribution together with the associated responses. The input variables are mapped into a high-dimensional linear space, called the feature space, and the response functions are modelled as linear functionals of the mapped features, with coefficients calibrated via ordinary least squares. We provide convergence guarantees on the worst-case excess prediction risk by controlling the convergence rate of the excess risk uniformly in the response function. The dimension of the feature map is allowed to tend to infinity with the sample size. The collection of response functions, although potentially infinite, is supposed to have a finite Vapnik-Chervonenkis dimension. The bound derived can be applied when building multiple surrogate models in a reasonable computing time.
△ Less
Submitted 27 November, 2021; v1 submitted 16 June, 2020;
originally announced June 2020.
-
Multivariate goodness-of-Fit tests based on Wasserstein distance
Authors:
Marc Hallin,
Gilles Mordant,
Johan Segers
Abstract:
Goodness-of-fit tests based on the empirical Wasserstein distance are proposed for simple and composite null hypotheses involving general multivariate distributions. For group families, the procedure is to be implemented after preliminary reduction of the data via invariance.This property allows for calculation of exact critical values and p-values at finite sample sizes. Applications include test…
▽ More
Goodness-of-fit tests based on the empirical Wasserstein distance are proposed for simple and composite null hypotheses involving general multivariate distributions. For group families, the procedure is to be implemented after preliminary reduction of the data via invariance.This property allows for calculation of exact critical values and p-values at finite sample sizes. Applications include testing for location--scale families and testing for families arising from affine transformations, such as elliptical distributions with given standard radial density and unspecified location vector and scatter matrix. A novel test for multivariate normality with unspecified mean vector and covariance matrix arises as a special case. For more general parametric families, we propose a parametric bootstrap procedure to calculate critical values. The lack of asymptotic distribution theory for the empirical Wasserstein distance means that the validity of the parametric bootstrap under the null hypothesis remains a conjecture. Nevertheless, we show that the test is consistent against fixed alternatives. To this end, we prove a uniform law of large numbers for the empirical distribution in Wasserstein distance, where the uniformity is over any class of underlying distributions satisfying a uniform integrability condition but no additional moment assumptions. The calculation of test statistics boils down to solving the well-studied semi-discrete optimal transport problem. Extensive numerical experiments demonstrate the practical feasibility and the excellent performance of the proposed tests for the Wasserstein distance of order p = 1 and p = 2 and for dimensions at least up to d = 5. The simulations also lend support to the conjecture of the asymptotic validity of the parametric bootstrap.
△ Less
Submitted 27 January, 2021; v1 submitted 14 March, 2020;
originally announced March 2020.
-
Inference on extremal dependence in the domain of attraction of a structured Hüsler-Reiss distribution motivated by a Markov tree with latent variables
Authors:
Stefka Asenova,
Gildas Mazo,
Johan Segers
Abstract:
A Markov tree is a probabilistic graphical model for a random vector indexed by the nodes of an undirected tree encoding conditional independence relations between variables. One possible limit distribution of partial maxima of samples from such a Markov tree is a max-stable Hüsler-Reiss distribution whose parameter matrix inherits its structure from the tree, each edge contributing one free depen…
▽ More
A Markov tree is a probabilistic graphical model for a random vector indexed by the nodes of an undirected tree encoding conditional independence relations between variables. One possible limit distribution of partial maxima of samples from such a Markov tree is a max-stable Hüsler-Reiss distribution whose parameter matrix inherits its structure from the tree, each edge contributing one free dependence parameter. Our central assumption is that, upon marginal standardization, the data-generating distribution is in the max-domain of attraction of the said Hüsler-Reiss distribution, an assumption much weaker than the one that data are generated according to a graphical model. Even if some of the variables are unobservable (latent), we show that the underlying model parameters are still identifiable if and only if every node corresponding to a latent variable has degree at least three. Three estimation procedures, based on the method of moments, maximum composite likelihood, and pairwise extremal coefficients, are proposed for usage on multivariate peaks over thresholds data when some variables are latent. A typical application is a river network in the form of a tree where, on some locations, no data are available. We illustrate the model and the identifiability criterion on a data set of high water levels on the Seine, France, with two latent variables. The structured Hüsler-Reiss distribution is found to fit the observed extremal dependence patterns well. The parameters being identifiable we are able to quantify tail dependence between locations for which there are no data.
△ Less
Submitted 17 January, 2021; v1 submitted 26 January, 2020;
originally announced January 2020.
-
Identifying groups of variables with the potential of being large simultaneously
Authors:
Maël Chiapino,
Anne Sabourin,
Johan Segers
Abstract:
Identifying groups of variables that may be large simultaneously amounts to finding out which joint tail dependence coefficients of a multivariate distribution are positive. The asymptotic distribution of a vector of nonparametric, rank-based estimators of these coefficients justifies a stop** criterion in an algorithm that searches the collection of all possible groups of variables in a systema…
▽ More
Identifying groups of variables that may be large simultaneously amounts to finding out which joint tail dependence coefficients of a multivariate distribution are positive. The asymptotic distribution of a vector of nonparametric, rank-based estimators of these coefficients justifies a stop** criterion in an algorithm that searches the collection of all possible groups of variables in a systematic way, from smaller groups to larger ones. The issue that the tolerance level in the stop** criterion should depend on the size of the groups is circumvented by the use of a conditional tail dependence coefficient. Alternatively, such stop** criteria can be based on limit distributions of rank-based estimators of the coefficient of tail dependence, quantifying the speed of decay of joint survival functions. Numerical experiments indicate that the algorithm's effectiveness for detecting tail-dependent groups of variables is highest when paired with a criterion based on a Hill-type estimator of the coefficient of tail dependence.
△ Less
Submitted 27 February, 2018;
originally announced February 2018.
-
Bayesian inference for bivariate ranks
Authors:
Simon Guillotte,
François Perron,
Johan Segers
Abstract:
A recommender system based on ranks is proposed, where an expert's ranking of a set of objects and a user's ranking of a subset of those objects are combined to make a prediction of the user's ranking of all objects. The rankings are assumed to be induced by latent continuous variables corresponding to the grades assigned by the expert and the user to the objects. The dependence between the expert…
▽ More
A recommender system based on ranks is proposed, where an expert's ranking of a set of objects and a user's ranking of a subset of those objects are combined to make a prediction of the user's ranking of all objects. The rankings are assumed to be induced by latent continuous variables corresponding to the grades assigned by the expert and the user to the objects. The dependence between the expert and user grades is modelled by a copula in some parametric family. Given a prior distribution on the copula parameter, the user's complete ranking is predicted by the mode of the posterior predictive distribution of the user's complete ranking conditional on the expert's complete and the user's incomplete rankings. Various Markov chain Monte-Carlo algorithms are proposed to approximate the predictive distribution or only its mode. The predictive distribution can be obtained exactly for the Farlie-Gumbel-Morgenstern copula family, providing a benchmark for the approximation accuracy of the algorithms. The method is applied to the MovieLens 100k dataset with a Gaussian copula modelling dependence between the expert's and user's grades.
△ Less
Submitted 9 February, 2018;
originally announced February 2018.
-
Monte Carlo integration with a growing number of control variates
Authors:
François Portier,
Johan Segers
Abstract:
It is well known that Monte Carlo integration with variance reduction by means of control variates can be implemented by the ordinary least squares estimator for the intercept in a multiple linear regression model. A central limit theorem is established for the integration error if the number of control variates tends to infinity. The integration error is scaled by the standard deviation of the er…
▽ More
It is well known that Monte Carlo integration with variance reduction by means of control variates can be implemented by the ordinary least squares estimator for the intercept in a multiple linear regression model. A central limit theorem is established for the integration error if the number of control variates tends to infinity. The integration error is scaled by the standard deviation of the error term in the regression model. If the linear span of the control variates is dense in a function space that contains the integrand, the integration error tends to zero at a rate which is faster than the square root of the number of Monte Carlo replicates. Depending on the situation, increasing the number of control variates may or may not be computationally more efficient than increasing the Monte Carlo sample size.
△ Less
Submitted 9 October, 2019; v1 submitted 5 January, 2018;
originally announced January 2018.
-
An estimator of the stable tail dependence function based on the empirical beta copula
Authors:
Anna Kiriliouk,
Johan Segers,
Laleh Tafakori
Abstract:
The replacement of indicator functions by integrated beta kernels in the definition of the empirical stable tail dependence function is shown to produce a smoothed version of the latter estimator with the same asymptotic distribution but superior finite-sample performance. The link of the new estimator with the empirical beta copula enables a simple but effective resampling scheme.
The replacement of indicator functions by integrated beta kernels in the definition of the empirical stable tail dependence function is shown to produce a smoothed version of the latter estimator with the same asymptotic distribution but superior finite-sample performance. The link of the new estimator with the empirical beta copula enables a simple but effective resampling scheme.
△ Less
Submitted 12 September, 2017;
originally announced September 2017.
-
Bayesian model averaging over tree-based dependence structures for multivariate extremes
Authors:
Sabrina Vettori,
Raphaël Huser,
Johan Segers,
Marc G. Genton
Abstract:
Describing the complex dependence structure of extreme phenomena is particularly challenging. To tackle this issue we develop a novel statistical algorithm that describes extremal dependence taking advantage of the inherent hierarchical dependence structure of the max-stable nested logistic distribution and that identifies possible clusters of extreme variables using reversible jump Markov chain M…
▽ More
Describing the complex dependence structure of extreme phenomena is particularly challenging. To tackle this issue we develop a novel statistical algorithm that describes extremal dependence taking advantage of the inherent hierarchical dependence structure of the max-stable nested logistic distribution and that identifies possible clusters of extreme variables using reversible jump Markov chain Monte Carlo techniques. Parsimonious representations are achieved when clusters of extreme variables are found to be completely independent. Moreover, we significantly decrease the computational complexity of full likelihood inference by deriving a recursive formula for the nested logistic model likelihood. The algorithm performance is verified through extensive simulation experiments which also compare different likelihood procedures. The new methodology is used to investigate the dependence relationships between extreme concentration of multiple pollutants in California and how these pollutants are related to extreme weather conditions. Overall, we show that our approach allows for the representation of complex extremal dependence structures and has valid applications in multivariate data analysis, such as air pollution monitoring, where it can guide policymaking.
△ Less
Submitted 22 July, 2018; v1 submitted 30 May, 2017;
originally announced May 2017.
-
Peaks over thresholds modelling with multivariate generalized Pareto distributions
Authors:
Anna Kiriliouk,
Holger Rootzén,
Johan Segers,
Jennifer L. Wadsworth
Abstract:
When assessing the impact of extreme events, it is often not just a single component, but the combined behaviour of several components which is important. Statistical modelling using multivariate generalized Pareto (GP) distributions constitutes the multivariate analogue of univariate peaks over thresholds modelling, which is widely used in finance and engineering. We develop general methods for c…
▽ More
When assessing the impact of extreme events, it is often not just a single component, but the combined behaviour of several components which is important. Statistical modelling using multivariate generalized Pareto (GP) distributions constitutes the multivariate analogue of univariate peaks over thresholds modelling, which is widely used in finance and engineering. We develop general methods for construction of multivariate GP distributions and use them to create a variety of new statistical models. A censored likelihood procedure is proposed to make inference on these models, together with a threshold selection procedure, goodness-of-fit diagnostics, and a computationally tractable strategy for model selection. The models are fitted to returns of stock prices of four UK-based banks and to rainfall data in the context of landslide risk estimation. Supplementary materials and codes are available online.
△ Less
Submitted 6 February, 2018; v1 submitted 6 December, 2016;
originally announced December 2016.
-
Inference on the tail process with application to financial time series modelling
Authors:
R. A. Davis,
H. Drees,
J. Segers,
M. Warchoł
Abstract:
To draw inference on serial extremal dependence within heavy-tailed Markov chains, Drees, Segers and Warchoł [Extremes (2015) 18, 369--402] proposed nonparametric estimators of the spectral tail process. The methodology can be extended to the more general setting of a stationary, regularly varying time series. The large-sample distribution of the estimators is derived via empirical process theory…
▽ More
To draw inference on serial extremal dependence within heavy-tailed Markov chains, Drees, Segers and Warchoł [Extremes (2015) 18, 369--402] proposed nonparametric estimators of the spectral tail process. The methodology can be extended to the more general setting of a stationary, regularly varying time series. The large-sample distribution of the estimators is derived via empirical process theory for cluster functionals. The finite-sample performance of these estimators is evaluated via Monte Carlo simulations. Moreover, two different bootstrap schemes are employed which yield confidence intervals for the pre-asymptotic spectral tail process: the stationary bootstrap and the multiplier block bootstrap. The estimators are applied to stock price data to study the persistence of positive and negative shocks.
△ Less
Submitted 29 January, 2018; v1 submitted 4 April, 2016;
originally announced April 2016.
-
Multivariate peaks over thresholds models
Authors:
Holger Rootzén,
Johan Segers,
Jennifer L. Wadsworth
Abstract:
Multivariate peaks over thresholds modeling based on generalized Pareto distributions has up to now only been used in few and mostly 2-dimensional situations. This paper contributes theoretical understanding, physically based models, inference tools, and simulation methods to support routine use, with an aim at higher dimensions. We derive a general point process model for extreme episodes in data…
▽ More
Multivariate peaks over thresholds modeling based on generalized Pareto distributions has up to now only been used in few and mostly 2-dimensional situations. This paper contributes theoretical understanding, physically based models, inference tools, and simulation methods to support routine use, with an aim at higher dimensions. We derive a general point process model for extreme episodes in data, and show how conditioning the distribution of extreme episodes on threshold exceedance gives four basic representations of the family of generalized Pareto distributions. The first representation is constructed on the real scale of the observations. The second one starts with a model on a standard exponential scale which then is transformed to the real scale. The third and fourth are reformulations of a spectral representation proposed in A. Ferreira and L. de Haan [Bernoulli 20 (2014) 1717--1737]. Numerically tractable forms of densities and censored densities are found and give tools for flexible parametric likelihood inference. New simulation algorithms, explicit formulas for probabilities and conditional probabilities, and conditions which make the conditional distribution of weighted component sums generalized Pareto are derived.
△ Less
Submitted 3 May, 2017; v1 submitted 21 March, 2016;
originally announced March 2016.
-
A continuous updating weighted least squares estimator of tail dependence in high dimensions
Authors:
John H. J. Einmahl,
Anna Kiriliouk,
Johan Segers
Abstract:
Likelihood-based procedures are a common way to estimate tail dependence parameters. They are not applicable, however, in non-differentiable models such as those arising from recent max-linear structural equation models. Moreover, they can be hard to compute in higher dimensions. An adaptive weighted least-squares procedure matching nonparametric estimates of the stable tail dependence function wi…
▽ More
Likelihood-based procedures are a common way to estimate tail dependence parameters. They are not applicable, however, in non-differentiable models such as those arising from recent max-linear structural equation models. Moreover, they can be hard to compute in higher dimensions. An adaptive weighted least-squares procedure matching nonparametric estimates of the stable tail dependence function with the corresponding values of a parametrically specified proposal yields a novel minimum-distance estimator. The estimator is easy to calculate and applies to a wide range of sampling schemes and tail dependence models. In large samples, it is asymptotically normal with an explicit and estimable covariance matrix. The minimum distance obtained forms the basis of a goodness-of-fit statistic whose asymptotic distribution is chi-square. Extensive Monte Carlo simulations confirm the excellent finite-sample performance of the estimator and demonstrate that it is a strong competitor to currently available methods. The estimator is then applied to disentangle sources of tail dependence in European stock markets.
△ Less
Submitted 19 January, 2016;
originally announced January 2016.
-
Max-factor individual risk models with application to credit portfolios
Authors:
Michel Denuit,
Anna Kiriliouk,
Johan Segers
Abstract:
Individual risk models need to capture possible correlations as failing to do so typically results in an underestimation of extreme quantiles of the aggregate loss. Such dependence modelling is particularly important for managing credit risk, for instance, where joint defaults are a major cause of concern. Often, the dependence between the individual loss occurrence indicators is driven by a small…
▽ More
Individual risk models need to capture possible correlations as failing to do so typically results in an underestimation of extreme quantiles of the aggregate loss. Such dependence modelling is particularly important for managing credit risk, for instance, where joint defaults are a major cause of concern. Often, the dependence between the individual loss occurrence indicators is driven by a small number of unobservable factors. Conditional loss probabilities are then expressed as monotone functions of linear combinations of these hidden factors. However, combining the factors in a linear way allows for some compensation between them. Such diversification effects are not always desirable and this is why the present work proposes a new model replacing linear combinations with maxima. These max-factor models give more insight into which of the factors is dominant.
△ Less
Submitted 10 December, 2014;
originally announced December 2014.
-
Nonparametric estimation of extremal dependence
Authors:
Anna Kiriliouk,
Johan Segers,
Michal Warchol
Abstract:
There is an increasing interest to understand the dependence structure of a random vector not only in the center of its distribution but also in the tails. Extreme-value theory tackles the problem of modelling the joint tail of a multivariate distribution by modelling the marginal distributions and the dependence structure separately. For estimating dependence at high levels, the stable tail depen…
▽ More
There is an increasing interest to understand the dependence structure of a random vector not only in the center of its distribution but also in the tails. Extreme-value theory tackles the problem of modelling the joint tail of a multivariate distribution by modelling the marginal distributions and the dependence structure separately. For estimating dependence at high levels, the stable tail dependence function and the spectral measure are particularly convenient. These objects also lie at the basis of nonparametric techniques for modelling the dependence among extremes in the max-domain of attraction setting. In case of asymptotic independence, this setting is inadequate, and more refined tail dependence coefficients exist, serving, among others, to discriminate between asymptotic dependence and independence. Throughout, the methods are illustrated on financial data.
△ Less
Submitted 3 November, 2014;
originally announced November 2014.
-
On the asymptotic distribution of the mean absolute deviation about the mean
Authors:
Johan Segers
Abstract:
The mean absolute deviation about the mean is an alternative to the standard deviation for measuring dispersion in a sample or in a population. For stationary, ergodic time series with a finite first moment, an asymptotic expansion for the sample mean absolute deviation is proposed. The expansion yields the asymptotic distribution of the sample mean absolute deviation under a wide range of setting…
▽ More
The mean absolute deviation about the mean is an alternative to the standard deviation for measuring dispersion in a sample or in a population. For stationary, ergodic time series with a finite first moment, an asymptotic expansion for the sample mean absolute deviation is proposed. The expansion yields the asymptotic distribution of the sample mean absolute deviation under a wide range of settings, allowing for serial dependence or an infinite second moment.
△ Less
Submitted 16 June, 2014;
originally announced June 2014.
-
Statistics for Tail Processes of Markov Chains
Authors:
Holger Drees,
Johan Segers,
Michał Warchoł
Abstract:
At high levels, the asymptotic distribution of a stationary, regularly varying Markov chain is conveniently given by its tail process. The latter takes the form of a geometric random walk, the increment distribution depending on the sign of the process at the current state and on the flow of time, either forward or backward. Estimation of the tail process provides a nonparametric approach to analy…
▽ More
At high levels, the asymptotic distribution of a stationary, regularly varying Markov chain is conveniently given by its tail process. The latter takes the form of a geometric random walk, the increment distribution depending on the sign of the process at the current state and on the flow of time, either forward or backward. Estimation of the tail process provides a nonparametric approach to analyze extreme values. A duality between the distributions of the forward and backward increments provides additional information that can be exploited in the construction of more efficient estimators. The large-sample distribution of such estimators is derived via empirical process theory for cluster functionals. Their finite-sample performance is evaluated via Monte Carlo simulations involving copula-based Markov models and solutions to stochastic recurrence equations. The estimators are applied to stock price data to study the absence or presence of symmetries in the succession of large gains and losses.
△ Less
Submitted 10 December, 2014; v1 submitted 29 May, 2014;
originally announced May 2014.
-
Multivariate Nonparametric Estimation of the Pickands Dependence Function using Bernstein Polynomials
Authors:
G. Marcon,
S. A. Padoan,
P. Naveau,
P. Muliere,
J. Segers
Abstract:
Many applications in risk analysis, especially in environmental sciences, require the estimation of the dependence among multivariate maxima. A way to do this is by inferring the Pickands dependence function of the underlying extreme-value copula. A nonparametric estimator is constructed as the sample equivalent of a multivariate extension of the madogram. Shape constraints on the family of Pickan…
▽ More
Many applications in risk analysis, especially in environmental sciences, require the estimation of the dependence among multivariate maxima. A way to do this is by inferring the Pickands dependence function of the underlying extreme-value copula. A nonparametric estimator is constructed as the sample equivalent of a multivariate extension of the madogram. Shape constraints on the family of Pickands dependence functions are taken into account by means of a representation in terms of a specific type of Bernstein polynomials. The large-sample theory of the estimator is developed and its finite-sample performance is evaluated with a simulation study. The approach is illustrated by analyzing clusters consisting of seven weather stations that have recorded weekly maxima of hourly rainfall in France from 1993 to 2011.
△ Less
Submitted 15 April, 2016; v1 submitted 20 May, 2014;
originally announced May 2014.
-
Hybrid Copula Estimators
Authors:
Johan Segers
Abstract:
An extension of the empirical copula is considered by combining an estimator of a multivariate cumulative distribution function with estimators of the marginal cumulative distribution functions for marginal estimators that are not necessarily equal to the margins of the joint estimator. Such a hybrid estimator may be reasonable when there is additional information available for some margins in the…
▽ More
An extension of the empirical copula is considered by combining an estimator of a multivariate cumulative distribution function with estimators of the marginal cumulative distribution functions for marginal estimators that are not necessarily equal to the margins of the joint estimator. Such a hybrid estimator may be reasonable when there is additional information available for some margins in the form of additional data or stronger modelling assumptions. A functional central limit theorem is established and some examples are developed.
△ Less
Submitted 28 November, 2014; v1 submitted 8 May, 2014;
originally announced May 2014.
-
An M-estimator of spatial tail dependence
Authors:
John Einmahl,
Anna Kiriliouk,
Andrea Kra**a,
Johan Segers
Abstract:
Tail dependence models for distributions attracted to a max-stable law are fitted using observations above a high threshold. To cope with spatial, high-dimensional data, a rank-based M-estimator is proposed relying on bivariate margins only. A data-driven weight matrix is used to minimize the asymptotic variance. Empirical process arguments show that the estimator is consistent and asymptotically…
▽ More
Tail dependence models for distributions attracted to a max-stable law are fitted using observations above a high threshold. To cope with spatial, high-dimensional data, a rank-based M-estimator is proposed relying on bivariate margins only. A data-driven weight matrix is used to minimize the asymptotic variance. Empirical process arguments show that the estimator is consistent and asymptotically normal. Its finite-sample performance is assessed in simulation experiments involving popular max-stable processes perturbed with additive noise. An analysis of wind speed data from the Netherlands illustrates the method.
△ Less
Submitted 9 January, 2015; v1 submitted 8 March, 2014;
originally announced March 2014.
-
Semiparametric Gaussian copula models: Geometry and efficient rank-based estimation
Authors:
Johan Segers,
Ramon van den Akker,
Bas J. M. Werker
Abstract:
We propose, for multivariate Gaussian copula models with unknown margins and structured correlation matrices, a rank-based, semiparametrically efficient estimator for the Euclidean copula parameter. This estimator is defined as a one-step update of a rank-based pilot estimator in the direction of the efficient influence function, which is calculated explicitly. Moreover, finite-dimensional algebra…
▽ More
We propose, for multivariate Gaussian copula models with unknown margins and structured correlation matrices, a rank-based, semiparametrically efficient estimator for the Euclidean copula parameter. This estimator is defined as a one-step update of a rank-based pilot estimator in the direction of the efficient influence function, which is calculated explicitly. Moreover, finite-dimensional algebraic conditions are given that completely characterize efficiency of the pseudo-likelihood estimator and adaptivity of the model with respect to the unknown marginal distributions. For correlation matrices structured according to a factor model, the pseudo-likelihood estimator turns out to be semiparametrically efficient. On the other hand, for Toeplitz correlation matrices, the asymptotic relative efficiency of the pseudo-likelihood estimator can be as low as 20%. These findings are confirmed by Monte Carlo simulations. We indicate how our results can be extended to joint regression models.
△ Less
Submitted 1 October, 2014; v1 submitted 27 June, 2013;
originally announced June 2013.
-
Nonparametric estimation of the tree structure of a nested Archimedean copula
Authors:
Johan Segers,
Nathan Uyttendaele
Abstract:
One of the features inherent in nested Archimedean copulas, also called hierarchical Archimedean copulas, is their rooted tree structure. A nonparametric, rank-based method to estimate this structure is presented. The idea is to represent the target structure as a set of trivariate structures, each of which can be estimated individually with ease. Indeed, for any three variables there are only fou…
▽ More
One of the features inherent in nested Archimedean copulas, also called hierarchical Archimedean copulas, is their rooted tree structure. A nonparametric, rank-based method to estimate this structure is presented. The idea is to represent the target structure as a set of trivariate structures, each of which can be estimated individually with ease. Indeed, for any three variables there are only four possible rooted tree structures and, based on a sample, a choice can be made by performing comparisons between the three bivariate margins of the empirical distribution of the three variables. The set of estimated trivariate structures can then be used to build an estimate of the target structure. The advantage of this estimation method is that it does not require any parametric assumptions concerning the generator functions at the nodes of the tree.
△ Less
Submitted 17 December, 2013; v1 submitted 4 April, 2013;
originally announced April 2013.
-
Nonparametric Inference for Max-Stable Dependence
Authors:
Johan Segers
Abstract:
Discussion of "Statistical Modeling of Spatial Extremes" by A. C. Davison, S. A. Padoan and M. Ribatet [arXiv:1208.3378].
Discussion of "Statistical Modeling of Spatial Extremes" by A. C. Davison, S. A. Padoan and M. Ribatet [arXiv:1208.3378].
△ Less
Submitted 17 August, 2012;
originally announced August 2012.
-
Detecting changes in cross-sectional dependence in multivariate time series
Authors:
Axel Bücher,
Ivan Kojadinovic,
Tom Rohmer,
Johan Segers
Abstract:
Classical and more recent tests for detecting distributional changes in multivariate time series often lack power against alternatives that involve changes in the cross-sectional dependence structure. To be able to detect such changes better, a test is introduced based on a recently studied variant of the sequential empirical copula process. In contrast to earlier attempts, ranks are computed with…
▽ More
Classical and more recent tests for detecting distributional changes in multivariate time series often lack power against alternatives that involve changes in the cross-sectional dependence structure. To be able to detect such changes better, a test is introduced based on a recently studied variant of the sequential empirical copula process. In contrast to earlier attempts, ranks are computed with respect to relevant subsamples, with beneficial consequences for the sensitivity of the test. For the computation of p-values we propose a multiplier resampling scheme that takes the serial dependence into account. The large-sample theory for the test statistic and the resampling scheme is developed. The finite-sample performance of the procedure is assessed by Monte Carlo simulations. Two case studies involving time series of financial returns are presented as well.
△ Less
Submitted 21 May, 2014; v1 submitted 12 June, 2012;
originally announced June 2012.
-
A Euclidean likelihood estimator for bivariate tail dependence
Authors:
Miguel de Carvalho,
Boris Oumow,
Johan Segers,
Michał Warchoł
Abstract:
The spectral measure plays a key role in the statistical modeling of multivariate extremes. Estimation of the spectral measure is a complex issue, given the need to obey a certain moment condition. We propose a Euclidean likelihood-based estimator for the spectral measure which is simple and explicitly defined, with its expression being free of Lagrange multipliers. Our estimator is shown to have…
▽ More
The spectral measure plays a key role in the statistical modeling of multivariate extremes. Estimation of the spectral measure is a complex issue, given the need to obey a certain moment condition. We propose a Euclidean likelihood-based estimator for the spectral measure which is simple and explicitly defined, with its expression being free of Lagrange multipliers. Our estimator is shown to have the same limit distribution as the maximum empirical likelihood estimator of J. H. J. Einmahl and J. Segers, Annals of Statistics 37(5B), 2953--2989 (2009). Numerical experiments suggest an overall good performance and identical behavior to the maximum empirical likelihood estimator. We illustrate the method in an extreme temperature data analysis.
△ Less
Submitted 16 April, 2012;
originally announced April 2012.
-
Max-stable models for multivariate extremes
Authors:
Johan Segers
Abstract:
Multivariate extreme-value analysis is concerned with the extremes in a multivariate random sample, that is, points of which at least some components have exceptionally large values. Mathematical theory suggests the use of max-stable models for univariate and multivariate extremes. A comprehensive account is given of the various ways in which max-stable models are described. Furthermore, a constru…
▽ More
Multivariate extreme-value analysis is concerned with the extremes in a multivariate random sample, that is, points of which at least some components have exceptionally large values. Mathematical theory suggests the use of max-stable models for univariate and multivariate extremes. A comprehensive account is given of the various ways in which max-stable models are described. Furthermore, a construction device is proposed for generating parametric families of max-stable distributions. Although the device is not new, its role as a model generator seems not yet to have been fully exploited.
△ Less
Submitted 2 April, 2012;
originally announced April 2012.
-
Nonparametric estimation of pair-copula constructions with the empirical pair-copula
Authors:
Ingrid Hobaek Haff,
Johan Segers
Abstract:
A pair-copula construction is a decomposition of a multivariate copula into a structured system, called regular vine, of bivariate copulae or pair-copulae. The standard practice is to model these pair-copulae parametrically, which comes at the cost of a large model risk, with errors propagating throughout the vine structure. The empirical pair-copula proposed in the paper provides a nonparametric…
▽ More
A pair-copula construction is a decomposition of a multivariate copula into a structured system, called regular vine, of bivariate copulae or pair-copulae. The standard practice is to model these pair-copulae parametrically, which comes at the cost of a large model risk, with errors propagating throughout the vine structure. The empirical pair-copula proposed in the paper provides a nonparametric alternative still achieving the parametric convergence rate. It can be used as a basis for inference on dependence measures, for selecting and pruning the vine structure, and for hypothesis tests concerning the form of the pair-copulae.
△ Less
Submitted 24 January, 2012;
originally announced January 2012.
-
Measuring Association between Random Vectors
Authors:
Oliver Grothe,
Friedrich Schmid,
Julius Schnieders,
Johan Segers
Abstract:
This paper suggests five measures of association between two random vectors X = (X_1, ..., X_p) and Y = (Y_1, ..., Y_q). They are copula based and therefore invariant with respect to the marginal distributions of the components X_i and Y_j. The measures capture positive as well as negative association of X and Y. In case p = q = 1 they reduce to Spearman's rho. Various properties of these new meas…
▽ More
This paper suggests five measures of association between two random vectors X = (X_1, ..., X_p) and Y = (Y_1, ..., Y_q). They are copula based and therefore invariant with respect to the marginal distributions of the components X_i and Y_j. The measures capture positive as well as negative association of X and Y. In case p = q = 1 they reduce to Spearman's rho. Various properties of these new measures are investigated. Nonparametric estimators, based on ranks, for the measures are derived and their small sample behaviour is investigated by simulation. The measures are applied to characterise strength and direction of association of bond and stock indices of five countries over time.
△ Less
Submitted 21 July, 2011;
originally announced July 2011.
-
Nonparametric estimation of multivariate extreme-value copulas
Authors:
Gordon Gudendorf,
Johan Segers
Abstract:
Extreme-value copulas arise in the asymptotic theory for componentwise maxima of independent random samples. An extreme-value copula is determined by its Pickands dependence function, which is a function on the unit simplex subject to certain shape constraints that arise from an integral transform of an underlying measure called spectral measure. Multivariate extensions are provided of certain ran…
▽ More
Extreme-value copulas arise in the asymptotic theory for componentwise maxima of independent random samples. An extreme-value copula is determined by its Pickands dependence function, which is a function on the unit simplex subject to certain shape constraints that arise from an integral transform of an underlying measure called spectral measure. Multivariate extensions are provided of certain rank-based nonparametric estimators of the Pickands dependence function. The shape constraint that the estimator should itself be a Pickands dependence function is enforced by replacing an initial estimator by its best least-squares approximation in the set of Pickands dependence functions having a discrete spectral measure supported on a sufficiently fine grid. Weak convergence of the standardized estimators is demonstrated and the finite-sample performance of the estimators is investigated by means of a simulation experiment.
△ Less
Submitted 29 November, 2011; v1 submitted 12 July, 2011;
originally announced July 2011.
-
Large-sample tests of extreme-value dependence for multivariate copulas
Authors:
Ivan Kojadinovic,
Johan Segers,
Jun Yan
Abstract:
Starting from the characterization of extreme-value copulas based on max-stability, large-sample tests of extreme-value dependence for multivariate copulas are studied. The two key ingredients of the proposed tests are the empirical copula of the data and a multiplier technique for obtaining approximate p-values for the derived statistics. The asymptotic validity of the multiplier approach is esta…
▽ More
Starting from the characterization of extreme-value copulas based on max-stability, large-sample tests of extreme-value dependence for multivariate copulas are studied. The two key ingredients of the proposed tests are the empirical copula of the data and a multiplier technique for obtaining approximate p-values for the derived statistics. The asymptotic validity of the multiplier approach is established, and the finite-sample performance of a large number of candidate test statistics is studied through extensive Monte Carlo experiments for data sets of dimension two to five. In the bivariate case, the rejection rates of the best versions of the tests are compared with those of the test of Ghoudi, Khoudraji and Rivest (1998) recently revisited by Ben Ghorbal, Genest and Neslehova (2009). The proposed procedures are illustrated on bivariate financial data and trivariate geological data.
△ Less
Submitted 11 May, 2011;
originally announced May 2011.