Search | arXiv e-print repository

arXiv:2303.13237 [pdf, other]

Improving estimation for asymptotically independent bivariate extremes via global estimators for the angular dependence function

Authors: C. J. R. Murphy-Barltrop, J. L. Wadsworth, E. F. Eastoe

Abstract: Modelling the extremal dependence of bivariate variables is important in a wide variety of practical applications, including environmental planning, catastrophe modelling and hydrology. The majority of these approaches are based on the framework of bivariate regular variation, and a wide range of literature is available for estimating the dependence structure in this setting. However, such procedures are only applicable to variables exhibiting asymptotic dependence, even though asymptotic independence is often observed in practice. In this paper, we consider the so-called `angular dependence function'; this quantity summarises the extremal dependence structure for asymptotically independent variables. Until recently, only pointwise estimators of the angular dependence function have been available. We introduce a range of global estimators and compare them to another recently introduced technique for global estimation through a systematic simulation study, and a case study on river flow data from the north of England, UK.

Submitted 26 June, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

arXiv:2209.05795 [pdf, other]

doi 10.1016/j.csda.2023.107841

Joint modelling of the body and tail of bivariate data

Authors: Lídia M. André, Jennifer L. Wadsworth, Adrian O'Hagan

Abstract: In situations where both extreme and non-extreme data are of interest, modelling the whole data set accurately is important. In a univariate framework, modelling the bulk and tail of a distribution has been extensively studied before. However, when more than one variable is of concern, models that aim specifically at capturing both regions correctly are scarce in the literature. A dependence model that blends two copulas with different characteristics over the whole range of the data support is proposed. One copula is tailored to the bulk and the other to the tail, with a dynamic weighting function employed to transition smoothly between them. Tail dependence properties are investigated numerically and simulation is used to confirm that the blended model is sufficiently flexible to capture a wide variety of structures. The model is applied to study the dependence between temperature and ozone concentration at two sites in the UK and compared with a single copula fit. The proposed model provides a better, more flexible, fit to the data, and is also capable of capturing complex dependence structures.

Submitted 10 October, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

Comments: 36 pages, 12 figures

arXiv:2203.05860 [pdf, other]

Modelling non-stationarity in asymptotically independent extremes

Authors: C. J. R. Murphy-Barltrop, J. L. Wadsworth

Abstract: In many practical applications, evaluating the joint impact of combinations of environmental variables is important for risk management and structural design analysis. When such variables are considered simultaneously, non-stationarity can exist within both the marginal distributions and dependence structure, resulting in complex data structures. In the context of extremes, few methods have been proposed for modelling trends in extremal dependence, even though capturing this feature is important for quantifying joint impact. Moreover, most proposed techniques are only applicable to data structures exhibiting asymptotic dependence. Motivated by observed dependence trends of data from the UK Climate Projections, we propose a novel semi-parametric modelling framework for bivariate extremal dependence structures. This framework allows us to capture a wide variety of dependence trends for data exhibiting asymptotic independence. When applied to the climate projection dataset, our model detects significant dependence trends in observations and, in combination with models for marginal non-stationarity, can be used to produce estimates of bivariate risk measures at future time points.

Submitted 22 April, 2024; v1 submitted 11 March, 2022; originally announced March 2022.

arXiv:2107.01942 [pdf, other]

New estimation methods for extremal bivariate return curves

Authors: C. J. R. Murphy-Barltrop, J. L. Wadsworth, E. F. Eastoe

Abstract: In the multivariate setting, estimates of extremal risk measures are important in many contexts, such as environmental planning and structural engineering. In this paper, we propose new estimation methods for extremal bivariate return curves, a risk measure that is the natural bivariate extension to a return level. Unlike several existing techniques, our estimates are based on bivariate extreme value models that can capture both key forms of extremal dependence. We devise tools for validating return curve estimates, as well as representing their uncertainty, and compare a selection of curve estimation techniques through simulation studies. We apply the methodology to two metocean data sets, with diagnostics indicating generally good performance.

Submitted 10 October, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

Comments: 40 pages (without supplementary), 12 figures, 2 tables

arXiv:2105.05314 [pdf, ps, other]

Modeling spatial extremes using normal mean-variance mixtures

Authors: Zhongwei Zhang, Raphaël Huser, Thomas Opitz, Jennifer L. Wadsworth

Abstract: Classical models for multivariate or spatial extremes are mainly based upon the asymptotically justified max-stable or generalized Pareto processes. These models are suitable when asymptotic dependence is present, i.e., the joint tail decays at the same rate as the marginal tail. However, recent environmental data applications suggest that asymptotic independence is equally important and, unfortunately, existing spatial models in this setting that are both flexible and can be fitted efficiently are scarce. Here, we propose a new spatial copula model based on the generalized hyperbolic distribution, which is a specific normal mean-variance mixture and is very popular in financial modeling. The tail properties of this distribution have been studied in the literature, but with contradictory results. It turns out that the proofs from the literature contain mistakes. We here give a corrected theoretical description of its tail dependence structure and then exploit the model to analyze a simulated dataset from the inverted Brown-Resnick process, hindcast significant wave height data in the North Sea, and wind gust data in the state of Oklahoma, USA. We demonstrate that our proposed model is flexible enough to capture the dependence structure not only in the tail but also in the bulk.

Submitted 11 May, 2021; originally announced May 2021.

Comments: 24 pages, 6 figures

arXiv:2101.07167 [pdf, ps, other]

doi 10.1002/env.2671

Spatial deformation for non-stationary extremal dependence

Authors: Jordan Richards, Jennifer L. Wadsworth

Abstract: Modelling the extremal dependence structure of spatial data is considerably easier if that structure is stationary. However, for data observed over large or complicated domains, non-stationarity will often prevail. Current methods for modelling non-stationarity in extremal dependence rely on models that are either computationally difficult to fit or require prior knowledge of covariates. Sampson and Guttorp (1992) proposed a simple technique for handling non-stationarity in spatial dependence by smoothly mapping the sampling locations of the process from the original geographical space to a latent space where stationarity can be reasonably assumed. We present an extension of this method to a spatial extremes framework by considering least squares minimisation of pairwise theoretical and empirical extremal dependence measures. Along with some practical advice on applying these deformations, we provide a detailed simulation study in which we propose three spatial processes with varying degrees of non-stationarity in their extremal and central dependence structures. The methodology is applied to Australian summer temperature extremes and UK precipitation to illustrate its efficacy compared to a naive modelling approach.

Submitted 18 January, 2021; originally announced January 2021.

Comments: 41 pages, 10 figures

Journal ref: Environmetrics, e2671 (2021)

arXiv:2012.09623 [pdf, other]

doi 10.1016/j.jmva.2021.104736

A geometric investigation into the tail dependence of vine copulas

Authors: Emma S. Simpson, Jennifer L. Wadsworth, Jonathan A. Tawn

Abstract: Vine copulas are a type of multivariate dependence model, composed of a collection of bivariate copulas that are combined according to a specific underlying graphical structure. Their flexibility and practicality in moderate and high dimensions have contributed to the popularity of vine copulas, but relatively little attention has been paid to their extremal properties. To address this issue, we present results on the tail dependence properties of some of the most widely studied vine copula classes. We focus our study on the coefficient of tail dependence and the asymptotic shape of the sample cloud, which we calculate using the geometric approach of Nolde (2014). We offer new insights by presenting results for trivariate vine copulas constructed from asymptotically dependent and asymptotically independent bivariate copulas, focusing on bivariate extreme value and inverted extreme value copulas, with additional detail provided for logistic and inverted logistic examples. We also present new theory for a class of higher dimensional vine copulas, constructed from bivariate inverted extreme value copulas.

Submitted 17 December, 2020; originally announced December 2020.

Journal ref: Journal of Multivariate Analysis 2021, Volume 184, 104736

arXiv:2012.00990 [pdf, other]

Linking representations for multivariate extremes via a limit set

Authors: Natalia Nolde, Jennifer L. Wadsworth

Abstract: The study of multivariate extremes is dominated by multivariate regular variation, although it is well known that this approach does not provide adequate distinction between random vectors whose components are not always simultaneously large. Various alternative dependence measures and representations have been proposed, with the most well-known being hidden regular variation and the conditional extreme value model. These varying depictions of extremal dependence arise through consideration of different parts of the multivariate domain, and particularly exploring what happens when extremes of one variable may grow at different rates to other variables. Thus far, these alternative representations have come from distinct sources and links between them are limited. In this work we elucidate many of the relevant connections through a geometrical approach. In particular, the shape of the limit set of scaled sample clouds in light-tailed margins is shown to provide a description of several different extremal dependence representations.

Submitted 14 August, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

Comments: Former title: "Connections between representations for multivariate extremes"

arXiv:2011.04486 [pdf, other]

doi 10.1007/s10687-023-00468-8

High-dimensional modeling of spatial and spatio-temporal conditional extremes using INLA and Gaussian Markov random fields

Authors: Emma S. Simpson, Thomas Opitz, Jennifer L. Wadsworth

Abstract: The conditional extremes framework allows for event-based stochastic modeling of dependent extremes, and has recently been extended to spatial and spatio-temporal settings. After standardizing the marginal distributions and applying an appropriate linear normalization, certain non-stationary Gaussian processes can be used as asymptotically-motivated models for the process conditioned on threshold exceedances at a fixed reference location and time. In this work, we adapt existing conditional extremes models to allow for the handling of large spatial datasets. This involves specifying the model for spatial observations at $d$ locations in terms of a latent $m\ll d$ dimensional Gaussian model, whose structure is specified by a Gaussian Markov random field. We perform Bayesian inference for such models for datasets containing thousands of observation locations using the integrated nested Laplace approximation, or INLA. We explain how constraints on the spatial and spatio-temporal Gaussian processes, arising from the conditioning mechanism, can be implemented through the latent variable approach without losing the computationally convenient Markov property. We discuss tools for the comparison of models via their posterior distributions, and illustrate the flexibility of the approach with gridded Red Sea surface temperature data at over $6,000$ observed locations. Posterior sampling is exploited to study the probability distribution of cluster functionals of spatial and spatio-temporal extreme episodes.

Submitted 13 May, 2022; v1 submitted 9 November, 2020; originally announced November 2020.

Journal ref: Extremes 2023, Volume 26, Pages 669-713

arXiv:2007.00774 [pdf, other]

Advances in Statistical Modeling of Spatial Extremes

Authors: Raphaël Huser, Jennifer L. Wadsworth

Abstract: The classical modeling of spatial extremes relies on asymptotic models (i.e., max-stable processes or $r$-Pareto processes) for block maxima or peaks over high thresholds, respectively. However, at finite levels, empirical evidence often suggests that such asymptotic models are too rigidly constrained, and that they do not adequately capture the frequent situation where more severe events tend to be spatially more localized. In other words, these asymptotic models have a strong tail dependence that persists at increasingly high levels, while data usually suggest that it should weaken instead. Another well-known limitation of classical spatial extremes models is that they are either computationally prohibitive to fit in high dimensions, or they need to be fitted using less efficient techniques. In this review paper, we describe recent progress in the modeling and inference for spatial extremes, focusing on new models that have more flexible tail structures that can bridge asymptotic dependence classes, and that are more easily amenable to likelihood-based inference for large datasets. In particular, we discuss various types of random scale constructions, as well as the conditional spatial extremes model, which have recently been getting increasing attention within the statistics of extremes community. We illustrate some of these new spatial models on two different environmental applications.

Submitted 13 September, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

arXiv:2002.04362 [pdf, other]

doi 10.1016/j.spasta.2020.100482

Conditional Modelling of Spatio-Temporal Extremes for Red Sea Surface Temperatures

Authors: Emma S. Simpson, Jennifer L. Wadsworth

Abstract: Recent extreme value theory literature has seen significant emphasis on the modelling of spatial extremes, with comparatively little consideration of spatio-temporal extensions. This neglects an important feature of extreme events: their evolution over time. Many existing models for the spatial case are limited by the number of locations they can handle; this impedes extension to space-time settings, where models for higher dimensions are required. Moreover, the spatio-temporal models that do exist are restrictive in terms of the range of extremal dependence types they can capture. Recently, conditional approaches for studying multivariate and spatial extremes have been proposed, which enjoy benefits in terms of computational efficiency and an ability to capture both asymptotic dependence and asymptotic independence. We extend this class of models to a spatio-temporal setting, conditioning on the occurrence of an extreme value at a single space-time location. We adopt a composite likelihood approach for inference, which combines information from full likelihoods across multiple space-time conditioning locations. We apply our model to Red Sea surface temperatures, show that it fits well using a range of diagnostic plots, and demonstrate how it can be used to assess the risk of coral bleaching attributed to high water temperatures over consecutive days.

Submitted 24 June, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

Journal ref: Spatial Statistics 2021, Volume 41, 100482

arXiv:1912.06560 [pdf, other]

Higher-dimensional spatial extremes via single-site conditioning

Authors: Jennifer L. Wadsworth, Jonathan Tawn

Abstract: Currently available models for spatial extremes suffer either from inflexibility in the dependence structures that they can capture, lack of scalability to high dimensions, or in most cases, both of these. We present an approach to spatial extreme value theory based on the conditional multivariate extreme value model, whereby the limit theory is formed through conditioning upon the value at a particular site being extreme. The ensuing methodology allows for a flexible class of dependence structures, as well as models that can be fitted in high dimensions. To overcome issues of conditioning on a single site, we suggest a joint inference scheme based on all observation locations, and implement an importance sampling algorithm to provide spatial realizations and estimates of quantities conditioning upon the process being extreme at any one of an arbitrary set of locations. The modelling approach is applied to Australian summer temperature extremes, permitting assessment of the spatial extent of high temperature events over the continent.

Submitted 16 June, 2022; v1 submitted 13 December, 2019; originally announced December 2019.

arXiv:1907.09617 [pdf, other]

doi 10.1080/01621459.2020.1858838

Hierarchical Transformed Scale Mixtures for Flexible Modeling of Spatial Extremes on Datasets with Many Locations

Authors: Likun Zhang, Benjamin A. Shaby, Jennifer L. Wadsworth

Abstract: Flexible spatial models that allow transitions between tail dependence classes have recently appeared in the literature. However, inference for these models is computationally prohibitive, even in moderate dimensions, due to the necessity of repeatedly evaluating the multivariate Gaussian distribution function. In this work, we attempt to achieve truly high-dimensional inference for extremes of spatial processes, while retaining the desirable flexibility in the tail dependence structure, by modifying an established class of models based on scale mixtures of Gaussian processes. We show that the desired extremal dependence properties from the original models are preserved under the modification, and demonstrate that the corresponding Bayesian hierarchical model does not involve the expensive computation of the multivariate Gaussian distribution function. We fit our model to exceedances of a high threshold, and perform coverage analyses and cross-model checks to validate its ability to capture different types of tail characteristics. We use a standard adaptive Metropolis algorithm for model fitting, and further accelerate the computation via parallelization and Rcpp. Lastly, we apply the model to a dataset of a fire threat index on the Great Plains region of the US, which is vulnerable to massively destructive wildfires. We find that the joint tail of the fire threat index exhibits a decaying dependence structure that cannot be captured by limiting extreme value models.

Submitted 9 December, 2019; v1 submitted 22 July, 2019; originally announced July 2019.

arXiv:1809.01606 [pdf, ps, other]

doi 10.1093/biomet/asaa018

Determining the Dependence Structure of Multivariate Extremes

Authors: Emma S. Simpson, Jennifer L. Wadsworth, Jonathan A. Tawn

Abstract: In multivariate extreme value analysis, the nature of the extremal dependence between variables should be considered when selecting appropriate statistical models. Interest often lies with determining which subsets of variables can take their largest values simultaneously, while the others are of smaller order. Our approach to this problem exploits hidden regular variation properties on a collection of non-standard cones and provides a new set of indices that reveal aspects of the extremal dependence structure not available through existing measures of dependence. We derive theoretical properties of these indices, demonstrate their value through a series of examples, and develop methods of inference that also estimate the proportion of extremal mass associated with each cone. We apply the methods to UK river flows, estimating the probabilities of different subsets of sites being large simultaneously.

Submitted 11 October, 2019; v1 submitted 5 September, 2018; originally announced September 2018.

Journal ref: Biometrika 2020, Volume 107, Issue 3, Pages 513-532

arXiv:1705.07987 [pdf, ps, other]

Multivariate generalized Pareto distributions: parametrizations, representations, and properties

Authors: Holger Rootzén, Johan Segers, Jennifer L. Wadsworth

Abstract: Multivariate generalized Pareto distributions arise as the limit distributions of exceedances over multivariate thresholds of random vectors in the domain of attraction of a max-stable distribution. These distributions can be parametrized and represented in a number of different ways. Moreover, generalized Pareto distributions enjoy a number of interesting stability properties. An overview of the main features of such distributions is given, expressed compactly in several parametrizations, giving the potential user of these distributions a convenient catalogue of ways to handle and work with generalized Pareto distributions.

Submitted 22 May, 2017; originally announced May 2017.

Comments: 20 pages

MSC Class: 62G32

arXiv:1703.06031 [pdf, other]

Modeling spatial processes with unknown extremal dependence class

Authors: Raphaël G. Huser, Jennifer L. Wadsworth

Abstract: Many environmental processes exhibit weakening spatial dependence as events become more extreme. Well-known limiting models, such as max-stable or generalized Pareto processes, cannot capture this, which can lead to a preference for models that exhibit a property known as asymptotic independence. However, weakening dependence does not automatically imply asymptotic independence, and whether the process is truly asymptotically (in)dependent is usually far from clear. The distinction is key as it can have a large impact upon extrapolation, i.e., the estimated probabilities of events more extreme than those observed. In this work, we present a single spatial model that is able to capture both dependence classes in a parsimonious manner, and with a smooth transition between the two cases. The model covers a wide range of possibilities from asymptotic independence through to complete dependence, and permits weakening dependence of extremes even under asymptotic dependence. Censored likelihood-based inference for the implied copula is feasible in moderate dimensions due to closed-form margins. The model is applied to oceanographic datasets with ambiguous true limiting dependence structure.

Submitted 5 September, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

MSC Class: 62G32; 62M30

arXiv:1612.01773 [pdf, other]

Peaks over thresholds modelling with multivariate generalized Pareto distributions

Authors: Anna Kiriliouk, Holger Rootzén, Johan Segers, Jennifer L. Wadsworth

Abstract: When assessing the impact of extreme events, it is often not just a single component, but the combined behaviour of several components which is important. Statistical modelling using multivariate generalized Pareto (GP) distributions constitutes the multivariate analogue of univariate peaks over thresholds modelling, which is widely used in finance and engineering. We develop general methods for construction of multivariate GP distributions and use them to create a variety of new statistical models. A censored likelihood procedure is proposed to make inference on these models, together with a threshold selection procedure, goodness-of-fit diagnostics, and a computationally tractable strategy for model selection. The models are fitted to returns of stock prices of four UK-based banks and to rainfall data in the context of landslide risk estimation. Supplementary materials and codes are available online.

Submitted 6 February, 2018; v1 submitted 6 December, 2016; originally announced December 2016.

MSC Class: 62G32; 62P05; 62P12

arXiv:1603.06619 [pdf, other]

doi 10.1007/s10687-017-0294-4

Multivariate peaks over thresholds models

Authors: Holger Rootzén, Johan Segers, Jennifer L. Wadsworth

Abstract: Multivariate peaks over thresholds modeling based on generalized Pareto distributions has up to now only been used in few and mostly 2-dimensional situations. This paper contributes theoretical understanding, physically based models, inference tools, and simulation methods to support routine use, with an aim at higher dimensions. We derive a general point process model for extreme episodes in data, and show how conditioning the distribution of extreme episodes on threshold exceedance gives four basic representations of the family of generalized Pareto distributions. The first representation is constructed on the real scale of the observations. The second one starts with a model on a standard exponential scale which then is transformed to the real scale. The third and fourth are reformulations of a spectral representation proposed in A. Ferreira and L. de Haan [Bernoulli 20 (2014) 1717--1737]. Numerically tractable forms of densities and censored densities are found and give tools for flexible parametric likelihood inference. New simulation algorithms, explicit formulas for probabilities and conditional probabilities, and conditions which make the conditional distribution of weighted component sums generalized Pareto are derived.

Submitted 3 May, 2017; v1 submitted 21 March, 2016; originally announced March 2016.

Comments: 25 pages, 3 figures

MSC Class: 60G55; 60G70

arXiv:1410.6733 [pdf, ps, other]

On the occurrence times of componentwise maxima and bias in likelihood inference for multivariate max-stable distributions

Authors: J. L. Wadsworth

Abstract: Full likelihood-based inference for high-dimensional multivariate extreme value distributions, or max-stable processes, is feasible when incorporating occurrence times of the maxima; without this information, $d$-dimensional likelihood inference is usually precluded due to the large number of terms in the likelihood. However, some studies have noted bias when performing high-dimensional inference that incorporates such event information, particularly when dependence is weak. We elucidate this phenomenon, showing that for unbiased inference in moderate dimensions, dimension $d$ should be of a magnitude smaller than the square root of the number of vectors over which one takes the componentwise maximum. A bias reduction technique is suggested and illustrated on the extreme value logistic model.

Submitted 31 March, 2015; v1 submitted 24 October, 2014; originally announced October 2014.

Comments: 7 pages

MSC Class: 62G32; 62H12

arXiv:1312.5442 [pdf, ps, other]

doi 10.3150/12-BEJ471

A new representation for multivariate tail probabilities

Authors: J. L. Wadsworth, J. A. Tawn

Abstract: Existing theory for multivariate extreme values focuses upon characterizations of the distributional tails when all components of a random vector, standardized to identical margins, grow at the same rate. In this paper, we consider the effect of allowing the components to grow at different rates, and characterize the link between these marginal growth rates and the multivariate tail probability decay rate. Our approach leads to a whole class of univariate regular variation conditions, in place of the single but multivariate regular variation conditions that underpin the current theories. These conditions are indexed by a homogeneous function and an angular dependence function, which, for asymptotically independent random vectors, mirror the role played by the exponent measure and Pickands' dependence function in classical multivariate extremes. We additionally offer an inferential approach to joint survivor probability estimation. The key feature of our methodology is that extreme set probabilities can be estimated by extrapolating upon rays emanating from the origin when the margins of the variables are exponential. This offers an appreciable improvement over existing techniques where extrapolation in exponential margins is upon lines parallel to the diagonal.

Submitted 19 December, 2013; originally announced December 2013.

Comments: Published at http://dx.doi.org/10.3150/12-BEJ471 in Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

Report number: IMS-BEJ-BEJ471

Journal ref: Bernoulli 2013, Vol. 19, No. 5B, 2689-2714

arXiv:1011.3612 [pdf, ps, other]

doi 10.1214/10-AOAS333

Accounting for choice of measurement scale in extreme value modeling

Authors: J. L. Wadsworth, J. A. Tawn, P. Jonathan

Abstract: We investigate the effect that the choice of measurement scale has upon inference and extrapolation in extreme value analysis. Separate analyses of variables from a single process on scales which are linked by a nonlinear transformation may lead to discrepant conclusions concerning the tail behavior of the process. We propose the use of a Box--Cox power transformation incorporated as part of the inference procedure to account parametrically for the uncertainty surrounding the scale of extrapolation. This has the additional feature of increasing the rate of convergence of the distribution tails to an extreme value form in certain cases and thus reducing bias in the model estimation. Inference without reparameterization is practicably infeasible, so we explore a reparameterization which exploits the asymptotic theory of normalizing constants required for nondegenerate limit distributions. Inference is carried out in a Bayesian setting, an advantage of this being the availability of posterior predictive return levels. The methodology is illustrated on both simulated data and significant wave height data from the North Sea.

Submitted 16 November, 2010; originally announced November 2010.

Comments: Published at http://dx.doi.org/10.1214/10-AOAS333 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS333

Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 3, 1558-1578

Showing 1–21 of 21 results for author: Wadsworth, J L