-
Heterogeneous extremes in the presence of random covariates and censoring
Authors:
Martin Bladt,
Christoffer Øhlenschlæger
Abstract:
The task of analyzing extreme events with censoring effects is considered under a framework allowing for random covariate information. A wide class of estimators that can be cast as product-limit integrals is considered, for when the conditional distributions belong to the Frechet max-domain of attraction. The main mathematical contribution is establishing uniform conditions on the families of the…
▽ More
The task of analyzing extreme events with censoring effects is considered under a framework allowing for random covariate information. A wide class of estimators that can be cast as product-limit integrals is considered, for when the conditional distributions belong to the Frechet max-domain of attraction. The main mathematical contribution is establishing uniform conditions on the families of the regularly varying tails for which the asymptotic behaviour of the resulting estimators is tractable. In particular, a decomposition of the integral estimators in terms of exchangeable sums is provided, which leads to a law of large numbers and several central limit theorems. Subsequently, the finite-sample behaviour of the estimators is explored through a simulation study, and through the analysis of two real-life datasets. In particular, the inclusion of covariates makes the model significantly versatile and, as a consequence, practically relevant.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Extremile scalar-on-function regression with application to climate scenarios
Authors:
Maria Laura Battagliola,
Martin Bladt
Abstract:
Extremiles provide a generalization of quantiles which are not only robust, but also have an intrinsic link with extreme value theory. This paper introduces an extremile regression model tailored for functional covariate spaces. The estimation procedure turns out to be a weighted version of local linear scalar-on-function regression, where now a double kernel approach plays a crucial role. Asympto…
▽ More
Extremiles provide a generalization of quantiles which are not only robust, but also have an intrinsic link with extreme value theory. This paper introduces an extremile regression model tailored for functional covariate spaces. The estimation procedure turns out to be a weighted version of local linear scalar-on-function regression, where now a double kernel approach plays a crucial role. Asymptotic expressions for the bias and variance are established, applicable to both decreasing bandwidth sequences and automatically selected bandwidths. The methodology is then investigated in detail through a simulation study. Furthermore, we highlight the applicability of the model through the analysis of data sourced from the CH2018 Swiss climate scenarios project, offering insights into its ability to serve as a modern tool to quantify climate behaviour.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Censored extreme value estimation
Authors:
Martin Bladt,
Igor Rodionov
Abstract:
A novel and comprehensive methodology designed to tackle the challenges posed by extreme values in the context of random censorship is introduced. The main focus is on the analysis of integrals based on the product-limit estimator of normalized upper order statistics, called extreme Kaplan--Meier integrals. These integrals allow for the transparent derivation of various important asymptotic distri…
▽ More
A novel and comprehensive methodology designed to tackle the challenges posed by extreme values in the context of random censorship is introduced. The main focus is on the analysis of integrals based on the product-limit estimator of normalized upper order statistics, called extreme Kaplan--Meier integrals. These integrals allow for the transparent derivation of various important asymptotic distributional properties, offering an alternative approach to conventional plug-in estimation methods. Notably, this methodology demonstrates robustness and wide applicability within the scope of max-domains of attraction. A noteworthy by-product is the extension of generalized Hill-type estimators of extremes to encompass all max-domains of attraction, which is of independent interest. The theoretical framework is applied to construct novel estimators for positive and real-valued extreme value indices for right-censored data. Simulation studies supporting the theory are provided.
△ Less
Submitted 27 June, 2024; v1 submitted 16 December, 2023;
originally announced December 2023.
-
Pathwise and distributional approximations of semi-Markov processes
Authors:
Martin Bladt,
Andreea Minca,
Oscar Peralta
Abstract:
Continuous-time semi-Markov finite state-space jump processes are considered, inspired by a duration-dependent life insurance model. New approximations using grid-conditional homogeneous Markov jump-processes are developed, based on a recent adaptation of the uniformization principle which results in a strong pathwise convergent sequence of jump processes. Unlike traditional methods that use class…
▽ More
Continuous-time semi-Markov finite state-space jump processes are considered, inspired by a duration-dependent life insurance model. New approximations using grid-conditional homogeneous Markov jump-processes are developed, based on a recent adaptation of the uniformization principle which results in a strong pathwise convergent sequence of jump processes. Unlike traditional methods that use classical approximations to integro-differential equation solutions to compute their value functions, the proposed grid-conditional homogeneous Markov jump-processes allows for a direct and tractable approximation. In particular, these approximations simplify to easily implementable expressions, making them useful in areas where evaluating pathwise distributional functionals is difficult. Our homogeneous approximation, initially of a grid-conditional kind, is evolved into an unconditional version that holds well under fair regularity assumptions. The practicality of this approach is demonstrated on a disability life insurance model, with realistic underlying semi-Markov process parameters, showcasing its broader applicability in operations research and related fields.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Individual claims reserving using the Aalen--Johansen estimator
Authors:
Martin Bladt,
Gabriele Pittarello
Abstract:
We propose an individual claims reserving model based on the conditional Aalen-Johansen estimator, as developed in Bladt and Furrer (2023b). In our approach, we formulate a multi-state problem, where the underlying variable is the individual claim size, rather than time. The states in this model represent development periods, and we estimate the cumulative density function of individual claim size…
▽ More
We propose an individual claims reserving model based on the conditional Aalen-Johansen estimator, as developed in Bladt and Furrer (2023b). In our approach, we formulate a multi-state problem, where the underlying variable is the individual claim size, rather than time. The states in this model represent development periods, and we estimate the cumulative density function of individual claim sizes using the conditional Aalen-Johansen method as transition probabilities to an absorbing state. Our methodology reinterprets the concept of multi-state models and offers a strategy for modeling the complete curve of individual claim sizes. To illustrate our approach, we apply our model to both simulated and real datasets. Having access to the entire dataset enables us to support the use of our approach by comparing the predicted total final cost with the actual amount, as well as evaluating it in terms of the continuously ranked probability score.
△ Less
Submitted 3 June, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Conditional Aalen--Johansen estimation
Authors:
Martin Bladt,
Christian Furrer
Abstract:
The conditional Aalen--Johansen estimator, a general-purpose non-parametric estimator of conditional state occupation probabilities, is introduced. The estimator is applicable for any finite-state jump process and supports conditioning on external as well as internal covariate information. The conditioning feature permits for a much more detailed analysis of the distributional characteristics of t…
▽ More
The conditional Aalen--Johansen estimator, a general-purpose non-parametric estimator of conditional state occupation probabilities, is introduced. The estimator is applicable for any finite-state jump process and supports conditioning on external as well as internal covariate information. The conditioning feature permits for a much more detailed analysis of the distributional characteristics of the process. The estimator reduces to the conditional Kaplan--Meier estimator in the special case of a survival model and also englobes other, more recent, landmark estimators when covariates are discrete. Strong uniform consistency and asymptotic normality are established under lax moment conditions on the multivariate counting process, allowing in particular for an unbounded number of transitions.
△ Less
Submitted 4 June, 2024; v1 submitted 3 March, 2023;
originally announced March 2023.
-
Aggregate Markov models in life insurance: estimation via the EM algorithm
Authors:
Jamaal Ahmad,
Mogens Bladt
Abstract:
In this paper, we consider statistical estimation of time-inhomogeneous aggregate Markov models. Unaggregated models, which corresponds to Markov chains, are commonly used in multi-state life insurance to model the biometric states of an insured. By aggregating microstates to each biometric state, we are able to model dependencies between transitions of the biometric states as well as the distribu…
▽ More
In this paper, we consider statistical estimation of time-inhomogeneous aggregate Markov models. Unaggregated models, which corresponds to Markov chains, are commonly used in multi-state life insurance to model the biometric states of an insured. By aggregating microstates to each biometric state, we are able to model dependencies between transitions of the biometric states as well as the distribution of occupancy in these. This allows for non--Markovian modelling in general. Since only paths of the macrostates are observed, we develop an expectation-maximization (EM) algorithm to obtain maximum likelihood estimates of transition intensities on the micro level. Special attention is given to a semi-Markovian case, known as the reset property, which leads to simplified estimation procedures where EM algorithms for inhomogeneous phase-type distributions can be used as building blocks. We provide a numerical example of the latter in combination with piecewise constant transition rates in a three-state disability model with data simulated from a time-inhomogeneous semi-Markov model. Comparisons of our fits with more classic GLM-based fits as well as true and empirical distributions are provided to relate our model with existing models and their tools.
△ Less
Submitted 10 August, 2023; v1 submitted 20 December, 2022;
originally announced December 2022.
-
Aggregate Markov models in life insurance: properties and valuation
Authors:
Jamaal Ahmad,
Mogens Bladt,
Christian Furrer
Abstract:
In multi-state life insurance, an adequate balance between analytic tractability, computational efficiency, and statistical flexibility is of great importance. This might explain the popularity of Markov chain modelling, where matrix analytic methods allow for a comprehensive treatment. Unfortunately, Markov chain modelling is unable to capture duration effects, so this paper presents aggregate Ma…
▽ More
In multi-state life insurance, an adequate balance between analytic tractability, computational efficiency, and statistical flexibility is of great importance. This might explain the popularity of Markov chain modelling, where matrix analytic methods allow for a comprehensive treatment. Unfortunately, Markov chain modelling is unable to capture duration effects, so this paper presents aggregate Markov models as an alternative. Aggregate Markov models retain most of the analytical tractability of Markov chains, yet are non-Markovian and thus more flexible. Based on an explicit characterization of the fundamental martingales, matrix representations of the expected accumulated cash flows and corresponding prospective reserves are derived for duration-dependent payments with and without incidental policyholder behaviour. Throughout, special attention is given to a semi-Markovian case. Finally, the methods and results are illustrated in a numerical example.
△ Less
Submitted 23 April, 2024; v1 submitted 7 December, 2022;
originally announced December 2022.
-
Estimating absorption time distributions of general Markov jump processes
Authors:
Jamaal Ahmad,
Martin Bladt,
Mogens Bladt
Abstract:
The estimation of absorption time distributions of Markov jump processes is an important task in various branches of statistics and applied probability. While the time-homogeneous case is classic, the time-inhomogeneous case has recently received increased attention due to its added flexibility and advances in computational power. However, commuting sub-intensity matrices are assumed, which in var…
▽ More
The estimation of absorption time distributions of Markov jump processes is an important task in various branches of statistics and applied probability. While the time-homogeneous case is classic, the time-inhomogeneous case has recently received increased attention due to its added flexibility and advances in computational power. However, commuting sub-intensity matrices are assumed, which in various cases limits the parsimonious properties of the resulting representation. This paper develops the theory required to solve the general case through maximum likelihood estimation, and in particular, using the expectation-maximization algorithm. A reduction to a piecewise constant intensity matrix function is proposed in order to provide succinct representations, where a parametric linear model binds the intensities together. Practical aspects are discussed and illustrated through the estimation of notoriously demanding theoretical distributions and real data, from the perspective of matrix analytic methods.
△ Less
Submitted 22 July, 2022;
originally announced July 2022.
-
Phase-type representations of stochastic interest rates with applications to life insurance
Authors:
Jamaal Ahmad,
Mogens Bladt
Abstract:
The purpose of the present paper is to incorporate stochastic interest rates into a matrix-approach to multi-state life insurance, where formulas for reserves, moments of future payments and equivalence premiums can be obtained as explicit formulas in terms of product integrals or matrix exponentials. To this end we consider the Markovian interest model, where the rates are piecewise deterministic…
▽ More
The purpose of the present paper is to incorporate stochastic interest rates into a matrix-approach to multi-state life insurance, where formulas for reserves, moments of future payments and equivalence premiums can be obtained as explicit formulas in terms of product integrals or matrix exponentials. To this end we consider the Markovian interest model, where the rates are piecewise deterministic (or even constant) in the different states of a Markov jump process, and which is shown to integrate naturally into the matrix framework. The discounting factor then becomes the price of a zero-coupon bond which may or may not be correlated with the biometric insurance process. Another nice feature about the Markovian interest model is that the price of the bond coincides with the survival function of a phase-type distributed random variable. This, in particular, allows for calibrating the Markovian interest rate models using a maximum likelihood approach to observed data (prices) or to theoretical models like e.g. a Vasicek model. Due to the denseness of phase-type distributions, we can approximate the price behaviour of any zero-coupon bond with interest rates bounded from below by choosing the number of possible interest rate values sufficiently large. For observed data models with few data points, lower dimensions will usually suffice, while for theoretical models the dimensionality is only a computational issue.
△ Less
Submitted 17 November, 2022; v1 submitted 22 July, 2022;
originally announced July 2022.
-
Joint discrete and continuous matrix distribution modelling
Authors:
Martin Bladt,
Clara Brimnes Gardner
Abstract:
In this paper we introduce a bivariate distribution on $\mathbb{R}_{+} \times \mathbb{N}$ arising from a single underlying Markov jump process. The marginal distributions are phase-type and discrete phase-type distributed, respectively, which allow for flexible behavior for modeling purposes. We show that the distribution is dense in the class of distributions on…
▽ More
In this paper we introduce a bivariate distribution on $\mathbb{R}_{+} \times \mathbb{N}$ arising from a single underlying Markov jump process. The marginal distributions are phase-type and discrete phase-type distributed, respectively, which allow for flexible behavior for modeling purposes. We show that the distribution is dense in the class of distributions on $\mathbb{R}_{+} \times \mathbb{N}$ and derive some of its main properties, all explicit in terms of matrix calculus. Furthermore, we develop an effective EM algorithm for the statistical estimation of the distribution parameters. In the last part of the paper, we apply our methodology to an insurance dataset, where we model the number of claims and the mean claim sizes of policyholders, which is seen to perform favorably. An additional consequence of the latter analysis is that the total loss size in the entire portfolio is captured substantially better than with independent phase-type models.
△ Less
Submitted 4 July, 2022;
originally announced July 2022.
-
Expert Kaplan--Meier estimation
Authors:
Martin Bladt,
Christian Furrer
Abstract:
The setting of a right-censored random sample subject to contamination is considered. In various fields, expert information is often available and used to overcome the contamination. This paper integrates expert knowledge into the product-limit estimator in two different ways with distinct interpretations. Strong uniform consistency is proved for both cases under certain assumptions on the kind of…
▽ More
The setting of a right-censored random sample subject to contamination is considered. In various fields, expert information is often available and used to overcome the contamination. This paper integrates expert knowledge into the product-limit estimator in two different ways with distinct interpretations. Strong uniform consistency is proved for both cases under certain assumptions on the kind of contamination and the quality of expert information, which sheds light on the techniques and decisions that practitioners may take. The nuances of the techniques are discussed -- also with a view towards semi-parametric estimation -- and they are illustrated using simulated and real-world insurance data.
△ Less
Submitted 27 March, 2023; v1 submitted 27 June, 2022;
originally announced June 2022.
-
Informed censoring: the parametric combination of data and expert information
Authors:
Hansjörg Albrecher,
Martin Bladt
Abstract:
The statistical censoring setup is extended to the situation when random measures can be assigned to the realization of datapoints, leading to a new way of incorporating expert information into the usual parametric estimation procedures. The asymptotic theory is provided for the resulting estimators, and some special cases of practical relevance are studied in more detail. Although the proposed fr…
▽ More
The statistical censoring setup is extended to the situation when random measures can be assigned to the realization of datapoints, leading to a new way of incorporating expert information into the usual parametric estimation procedures. The asymptotic theory is provided for the resulting estimators, and some special cases of practical relevance are studied in more detail. Although the proposed framework mathematically generalizes censoring and coarsening at random, and borrows techniques from M-estimation theory, it provides a novel and transparent methodology which enjoys significant practical applicability in situations where expert information is present. The potential of the approach is illustrated by a concrete actuarial application of tail parameter estimation for a heavy-tailed MTPL dataset with limited available expert information.
△ Less
Submitted 3 December, 2023; v1 submitted 27 June, 2022;
originally announced June 2022.
-
Strongly convergent homogeneous approximations to inhomogeneous Markov jump processes and applications
Authors:
Martin Bladt,
Oscar Peralta
Abstract:
The study of time-inhomogeneous Markov jump processes is a traditional topic within probability theory that has recently attracted substantial attention in various applications. However, their flexibility also incurs a substantial mathematical burden which is usually circumvented by using well-known generic distributional approximations or simulations. This article provides a novel approximation m…
▽ More
The study of time-inhomogeneous Markov jump processes is a traditional topic within probability theory that has recently attracted substantial attention in various applications. However, their flexibility also incurs a substantial mathematical burden which is usually circumvented by using well-known generic distributional approximations or simulations. This article provides a novel approximation method that tailors the dynamics of a time-homogeneous Markov jump process to meet those of its time-inhomogeneous counterpart on an increasingly fine Poisson grid. Strong convergence of the processes in terms of the Skorokhod $J_1$ metric is established, and convergence rates are provided. Under traditional regularity assumptions, distributional convergence is established for unconditional proxies, to the same limit. Special attention is devoted to the case where the target process has one absorbing state and the remaining ones transient, for which the absorption times also converge. Some applications are outlined, such as univariate hazard-rate density estimation, ruin probabilities, and multivariate phase-type density evaluation.
△ Less
Submitted 1 November, 2023; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Phase-type mixture-of-experts regression for loss severities
Authors:
Martin Bladt,
Jorge Yslas
Abstract:
The task of modeling claim severities is addressed when data is not consistent with the classical regression assumptions. This framework is common in several lines of business within insurance and reinsurance, where catastrophic losses or heterogeneous sub-populations result in data difficult to model. Their correct analysis is required for pricing insurance products, and some of the most prevalen…
▽ More
The task of modeling claim severities is addressed when data is not consistent with the classical regression assumptions. This framework is common in several lines of business within insurance and reinsurance, where catastrophic losses or heterogeneous sub-populations result in data difficult to model. Their correct analysis is required for pricing insurance products, and some of the most prevalent recent specifications in this direction are mixture-of-experts models. This paper proposes a regression model that generalizes the latter approach to the phase-type distribution setting. More specifically, the concept of mixing is extended to the case where an entire Markov jump process is unobserved and where states can communicate with each other. The covariates then act on the initial probabilities of such underlying chain, which play the role of expert weights. The basic properties of such a model are computed in terms of matrix functionals, and denseness properties are derived, demonstrating their flexibility. An effective estimation procedure is proposed, based on the EM algorithm and multinomial logistic regression, and subsequently illustrated using simulated and real-world datasets. The increased flexibility of the proposed models does not come at a high computational cost, and the motivation and interpretation are equally transparent to simpler MoE models.
△ Less
Submitted 31 March, 2022; v1 submitted 31 October, 2021;
originally announced November 2021.
-
Phase-type distributions for claim severity regression modeling
Authors:
Martin Bladt
Abstract:
This paper addresses the task of modeling severity losses using segmentation when the data distribution does not fall into the usual regression frameworks. This situation is not uncommon in lines of business such as third-party liability insurance, where heavy-tails and multimodality often hamper a direct statistical analysis. We propose to use regression models based on phase-type distributions,…
▽ More
This paper addresses the task of modeling severity losses using segmentation when the data distribution does not fall into the usual regression frameworks. This situation is not uncommon in lines of business such as third-party liability insurance, where heavy-tails and multimodality often hamper a direct statistical analysis. We propose to use regression models based on phase-type distributions, regressing on their underlying inhomogeneous Markov intensity and using an extension of the EM algorithm. These models are interpretable and tractable in terms of multi-state processes and generalize the proportional hazards specification when the dimension of the state space is larger than one. We show that the combination of matrix parameters, inhomogeneity transforms, and covariate information provides flexible regression models that effectively capture the entire distribution of loss severities.
△ Less
Submitted 26 November, 2021; v1 submitted 11 October, 2021;
originally announced October 2021.
-
Fractional Inhomogeneous Multi-state Models in Life Insurance
Authors:
Martin Bladt
Abstract:
In this paper, we demonstrate through the use of matrix calculus a transparent analysis of fractional inhomogeneous Markov models for life insurance where transition matrices commute. The resulting formulae are intuitive matrix generalizations of their single-state counterparts, and the absorption times are matrix versions of well-known scalar distributions. A further advantage of this approach is…
▽ More
In this paper, we demonstrate through the use of matrix calculus a transparent analysis of fractional inhomogeneous Markov models for life insurance where transition matrices commute. The resulting formulae are intuitive matrix generalizations of their single-state counterparts, and the absorption times are matrix versions of well-known scalar distributions. A further advantage of this approach is that it allows extending the analysis to the non-Markovian case where sojourns are Mittag-Leffler distributed, and where the absorption times are fractional phase-type distributed. Considering deterministic time transforms gives rise to fractional inhomogeneous phase-type distributions as absorption times. The latter underlying processes are an example of a regime where not only the present but also the history of a policyholder influences its future evolution. The sub-exponential nature of stable distributions translates into the multi-state insurance model as a random longevity risk at any given state of the chain.
△ Less
Submitted 21 October, 2021; v1 submitted 11 October, 2021;
originally announced October 2021.
-
A tractable class of multivariate phase-type distributions for loss modeling
Authors:
Martin Bladt
Abstract:
Phase-type (PH) distributions are a popular tool for the analysis of univariate risks in numerous actuarial applications. Their multivariate counterparts (MPH$^\ast$), however, have not seen such a proliferation, due to lack of explicit formulas and complicated estimation procedures. A simple construction of multivariate phase-type distributions -- mPH -- is proposed for the parametric description…
▽ More
Phase-type (PH) distributions are a popular tool for the analysis of univariate risks in numerous actuarial applications. Their multivariate counterparts (MPH$^\ast$), however, have not seen such a proliferation, due to lack of explicit formulas and complicated estimation procedures. A simple construction of multivariate phase-type distributions -- mPH -- is proposed for the parametric description of multivariate risks, leading to models of considerable probabilistic flexibility and statistical tractability. The main idea is to start different Markov processes at the same state, and allow them to evolve independently thereafter, leading to dependent absorption times. By dimension augmentation arguments, this construction can be cast into the umbrella of MPH$^\ast$ class, but enjoys explicit formulas which the general specification lacks, including common measures of dependence. Moreover, it is shown that the class is still rich enough to be dense on the set of multivariate risks supported on the positive orthant, and it is the smallest known sub-class to have this property. In particular, the latter result provides a new short proof of the denseness of the MPH$^\ast$ class. In practice this means that the mPH class allows for modeling of bivariate risks with any given correlation or copula. We derive an EM algorithm for its statistical estimation, and illustrate it on bivariate insurance data. Extensions to more general settings are outlined.
△ Less
Submitted 21 December, 2022; v1 submitted 11 October, 2021;
originally announced October 2021.
-
Heavy-tailed phase-type distributions: A unified approach
Authors:
Martin Bladt,
Jorge Yslas
Abstract:
A phase-type distribution is the distribution of the time until absorption in a finite state-space time-homogeneous Markov jump process, with one absorbing state and the rest being transient. These distributions are mathematically tractable and conceptually attractive to model physical phenomena due to their interpretation in terms of a hidden Markov structure. Three recent extensions of regular p…
▽ More
A phase-type distribution is the distribution of the time until absorption in a finite state-space time-homogeneous Markov jump process, with one absorbing state and the rest being transient. These distributions are mathematically tractable and conceptually attractive to model physical phenomena due to their interpretation in terms of a hidden Markov structure. Three recent extensions of regular phase-type distributions give rise to models which allow for heavy tails: discrete- or continuous-scaling; fractional-time semi-Markov extensions; and inhomogeneous time-change of the underlying Markov process. In this paper, we present a unifying theory for heavy-tailed phase-type distributions for which all three approaches are particular cases. Our main objective is to provide useful models for heavy-tailed phase-type distributions, but any other tail behavior is also captured by our specification. We provide relevant new examples and also show how existing approaches are naturally embedded. Subsequently, two multivariate extensions are presented, inspired by the univariate construction which can be considered as a matrix version of a frailty model. We provide fully explicit EM-algorithms for all models and illustrate them using synthetic and real-life data.
△ Less
Submitted 6 December, 2021; v1 submitted 19 July, 2021;
originally announced July 2021.
-
Time series models with infinite-order partial copula dependence
Authors:
Martin Bladt,
Alexander J. McNeil
Abstract:
Stationary and ergodic time series can be constructed using an s-vine decomposition based on sets of bivariate copula functions. The extension of such processes to infinite copula sequences is considered and shown to yield a rich class of models that generalizes Gaussian ARMA and ARFIMA processes to allow both non-Gaussian marginal behaviour and a non-Gaussian description of the serial partial dep…
▽ More
Stationary and ergodic time series can be constructed using an s-vine decomposition based on sets of bivariate copula functions. The extension of such processes to infinite copula sequences is considered and shown to yield a rich class of models that generalizes Gaussian ARMA and ARFIMA processes to allow both non-Gaussian marginal behaviour and a non-Gaussian description of the serial partial dependence structure. Extensions of classical causal and invertible representations of linear processes to general s-vine processes are proposed and investigated. A practical and parsimonious method for parameterizing s-vine processes using the Kendall partial autocorrelation function is developed. The potential of the resulting models to give improved statistical fits in many applications is indicated with an example using macroeconomic data.
△ Less
Submitted 2 July, 2021;
originally announced July 2021.
-
Trimmed extreme value estimators for censored heavy-tailed data
Authors:
Martin Bladt,
Hansjoerg Albrecher,
Jan Beirlant
Abstract:
We consider estimation of the extreme value index and extreme quantiles for heavy-tailed data that are right-censored. We study a general procedure of removing low importance observations in tail estimators. This trimming procedure is applied to the state-of-the-art estimators for randomly right-censored tail estimators. Through an averaging procedure over the amount of trimming we derive new kern…
▽ More
We consider estimation of the extreme value index and extreme quantiles for heavy-tailed data that are right-censored. We study a general procedure of removing low importance observations in tail estimators. This trimming procedure is applied to the state-of-the-art estimators for randomly right-censored tail estimators. Through an averaging procedure over the amount of trimming we derive new kernel type estimators. Extensive simulation suggests that one of the new considered kernels leads to a highly competitive estimator against virtually any other available alternative in this framework. Moreover, we propose an adaptive selection method for the amount of top data used in estimation based on the trimming procedure minimizing the asymptotic mean squared error. We also provide an illustration of this approach to simulated as well as to real-world MTPL insurance data.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
-
Tail Measures and Regular Variation
Authors:
Martin Bladt,
Enkelejd Hashorva,
Georgiy Shevchenko
Abstract:
A general framework for the study of regular variation (RV) is that of Polish star-shaped metric spaces, while recent developments in [1] have discussed RV with respect to some properly localised boundedness $\mathcal{B}$ imposing weak assumptions on the structure of Polish space.
Along the lines of the latter approach, we discuss the RV of Borel measures and random processes on general Polish m…
▽ More
A general framework for the study of regular variation (RV) is that of Polish star-shaped metric spaces, while recent developments in [1] have discussed RV with respect to some properly localised boundedness $\mathcal{B}$ imposing weak assumptions on the structure of Polish space.
Along the lines of the latter approach, we discuss the RV of Borel measures and random processes on general Polish metric spaces. Tail measures introduced in [2] appear naturally as limiting measures of regularly varying time series.
We define tail measures on a measurable space indexed by $\mathcal{H}(D)$, a countable family of homogeneous coordinate maps, and show some tractable instances for the investigation of RV when $\mathcal{B}$ is determined by $\mathcal{H}(D)$. This allows us to study the regular variation of cadlag processes on $D(R^l, R^d)$ retrieving in particular results obtained in [1] for RV of stationary cadlag processes on the real line removing $l=1$ therein. Further, we discuss potential applications and open questions.
△ Less
Submitted 30 June, 2022; v1 submitted 7 March, 2021;
originally announced March 2021.
-
Continuous scaled phase-type distributions
Authors:
Hansjoerg Albrecher,
Martin Bladt,
Mogens Bladt,
Jorge Yslas
Abstract:
Products between phase-type distributed random variables and any independent, positive and continuous random variable are studied. Their asymptotic properties are established, and an expectation-maximization algorithm for their effective statistical inference is derived and implemented using real-world datasets. In contrast to discrete scaling studied in earlier literature, in the present continuo…
▽ More
Products between phase-type distributed random variables and any independent, positive and continuous random variable are studied. Their asymptotic properties are established, and an expectation-maximization algorithm for their effective statistical inference is derived and implemented using real-world datasets. In contrast to discrete scaling studied in earlier literature, in the present continuous case closed-form formulas for various functionals of the resulting distributions are obtained, which facilitates both their analysis and implementation. The resulting mixture distributions are very often heavy-tailed and yet retain various properties of phase-type distributions, such as being dense (in weak convergence) on the set of distributions with positive support.
△ Less
Submitted 24 November, 2021; v1 submitted 3 March, 2021;
originally announced March 2021.
-
Fluctuation theory for one-sided Lévy processes with a matrix-exponential time horizon
Authors:
Mogens Bladt,
Jevgenijs Ivanovs
Abstract:
There is an abundance of useful fluctuation identities for one-sided Lévy processes observed up to an independent exponentially distributed time horizon. We show that all the fundamental formulas generalize to time horizons having matrix exponential distributions, and the structure is preserved. Essentially, the positive killing rate is replaced by a matrix with eigenvalues in the right half of th…
▽ More
There is an abundance of useful fluctuation identities for one-sided Lévy processes observed up to an independent exponentially distributed time horizon. We show that all the fundamental formulas generalize to time horizons having matrix exponential distributions, and the structure is preserved. Essentially, the positive killing rate is replaced by a matrix with eigenvalues in the right half of the complex plane which, in particular, applies to the positive root of the Laplace exponent and the scale function. Various fundamental properties of thus obtained matrices and functions are established, resulting in an easy to use toolkit. An important application concerns deterministic time horizons which can be well approximated by concentrated matrix exponential distributions. Numerical illustrations are also provided.
△ Less
Submitted 20 January, 2021;
originally announced January 2021.
-
matrixdist: An R Package for Statistical Analysis of Matrix Distributions
Authors:
Martin Bladt,
Alaric Mueller,
Jorge Yslas
Abstract:
The matrixdist R package provides a comprehensive suite of tools for the statistical analysis of matrix distributions, including phase-type, inhomogeneous phase-type, discrete phase-type, and related multivariate distributions. This paper introduces the package and its key features, including the estimation of these distributions and their extensions through expectation-maximisation algorithms, as…
▽ More
The matrixdist R package provides a comprehensive suite of tools for the statistical analysis of matrix distributions, including phase-type, inhomogeneous phase-type, discrete phase-type, and related multivariate distributions. This paper introduces the package and its key features, including the estimation of these distributions and their extensions through expectation-maximisation algorithms, as well as the implementation of regression through the proportional intensities and mixture-of-experts models. Additionally, the paper provides an overview of the theoretical background, discusses the algorithms and methods implemented in the package, and offers practical examples to illustrate the application of matrixdist in real-world scenarios. The matrixdist R package aims to provide researchers and practitioners a wide set of tools for analysing and modelling complex data using matrix distributions.
△ Less
Submitted 15 August, 2023; v1 submitted 20 January, 2021;
originally announced January 2021.
-
Multivariate phase-type theory for the site frequency spectrum
Authors:
Asger Hobolth,
Mogens Bladt,
Lars Nørvang Andersen
Abstract:
Linear functions of the site frequency spectrum (SFS) play a major role for understanding and investigating genetic diversity. Estimators of the mutation rate (e.g. based on the total number of segregating sites or average of the pairwise differences) and tests for neutrality (e.g. Tajima's D) are perhaps the most well-known examples. The distribution of linear functions of the SFS is important fo…
▽ More
Linear functions of the site frequency spectrum (SFS) play a major role for understanding and investigating genetic diversity. Estimators of the mutation rate (e.g. based on the total number of segregating sites or average of the pairwise differences) and tests for neutrality (e.g. Tajima's D) are perhaps the most well-known examples. The distribution of linear functions of the SFS is important for constructing confidence intervals for the estimators, and to determine significance thresholds for neutrality tests. These distributions are often approximated using simulation procedures. In this paper we use multivariate phase-type theory to specify, characterize and calculate the distribution of linear functions of the site frequency spectrum. In particular, we show that many of the classical estimators of the mutation rate are distributed according to a discrete phase-type distribution. Neutrality tests, however, are generally not discrete phase-type distributed. For neutrality tests we derive the probability generating function using continuous multivariate phase-type theory, and numerically invert the function to obtain the distribution. A main result is an analytically tractable formula for the probability generating function of the SFS. Software implementation of the phase-type methodology is available in the R package phasty, and R code for the reproduction of our results is available as an accompanying vignette.
△ Less
Submitted 13 January, 2021;
originally announced January 2021.
-
Mortality modeling and regression with matrix distributions
Authors:
Hansjoerg Albrecher,
Martin Bladt,
Mogens Bladt,
Jorge Yslas
Abstract:
In this paper we investigate the flexibility of matrix distributions for the modeling of mortality. Starting from a simple Gompertz law, we show how the introduction of matrix-valued parameters via inhomogeneous phase-type distributions can lead to reasonably accurate and relatively parsimonious models for mortality curves across the entire lifespan. A particular feature of the proposed model fram…
▽ More
In this paper we investigate the flexibility of matrix distributions for the modeling of mortality. Starting from a simple Gompertz law, we show how the introduction of matrix-valued parameters via inhomogeneous phase-type distributions can lead to reasonably accurate and relatively parsimonious models for mortality curves across the entire lifespan. A particular feature of the proposed model framework is that it allows for a more direct interpretation of the implied underlying aging process than some previous approaches. Subsequently, towards applications of the approach for multi-population mortality modeling, we introduce regression via the concept of proportional intensities, which are more flexible than proportional hazard models, and we show that the two classes are asymptotically equivalent. We illustrate how the model parameters can be estimated from data by providing an adapted EM algorithm for which the likelihood increases at each iteration. The practical feasibility and competitiveness of the proposed approach, including the right-censored case, are illustrated by several sets of mortality and survival data.
△ Less
Submitted 1 August, 2022; v1 submitted 6 November, 2020;
originally announced November 2020.
-
Fitting inhomogeneous phase-type distributions to data: the univariate and the multivariate case
Authors:
Hansjoerg Albrecher,
Mogens Bladt,
Jorge Yslas
Abstract:
The class of inhomogeneous phase-type distributions (IPH) was recently introduced in Albrecher and Bladt (2019) as an extension of the classical phase-type (PH) distributions. Like PH distributions, the class of IPH is dense in the class of distributions on the positive halfline, but leads to more parsimonious models in the presence of heavy tails. In this paper we propose a fitting procedure for…
▽ More
The class of inhomogeneous phase-type distributions (IPH) was recently introduced in Albrecher and Bladt (2019) as an extension of the classical phase-type (PH) distributions. Like PH distributions, the class of IPH is dense in the class of distributions on the positive halfline, but leads to more parsimonious models in the presence of heavy tails. In this paper we propose a fitting procedure for this class to given data. We furthermore consider an analogous extension of Kulkarni's multivariate phase-type class (Kulkarni, 1989) to the inhomogeneous framework and study parameter estimation for the resulting new and flexible class of multivariate distributions. As a by-product, we amend a previously suggested fitting procedure for the homogeneous multivariate phase-type case and provide appropriate adaptations for censored data. The performance of the algorithms is illustrated in several numerical examples, both for simulated and real-life insurance data.
△ Less
Submitted 14 November, 2020; v1 submitted 23 June, 2020;
originally announced June 2020.
-
Time series copula models using d-vines and v-transforms
Authors:
Martin Bladt,
Alexander J. McNeil
Abstract:
An approach to modelling volatile financial return series using stationary d-vine copula processes combined with Lebesgue-measure-preserving transformations known as v-transforms is proposed. By develo** a method of stochastically inverting v-transforms, models are constructed that can describe both stochastic volatility in the magnitude of price movements and serial correlation in their directi…
▽ More
An approach to modelling volatile financial return series using stationary d-vine copula processes combined with Lebesgue-measure-preserving transformations known as v-transforms is proposed. By develo** a method of stochastically inverting v-transforms, models are constructed that can describe both stochastic volatility in the magnitude of price movements and serial correlation in their directions. In combination with parametric marginal distributions it is shown that these models can rival and sometimes outperform well-known models in the extended GARCH family.
△ Less
Submitted 13 July, 2021; v1 submitted 19 June, 2020;
originally announced June 2020.
-
Efficient simulation of ruin probabilities when claims are mixtures of heavy and light tails
Authors:
Hansjörg Albrecher,
Martin Bladt,
Eleni Vatamidou
Abstract:
We consider the classical Cramér-Lundberg risk model with claim sizes that are mixtures of phase-type and subexponential variables. Exploiting a specific geometric compound representation, we propose control variate techniques to efficiently simulate the ruin probability in this situation. The resulting estimators perform well for both small and large initial capital. We quantify the variance redu…
▽ More
We consider the classical Cramér-Lundberg risk model with claim sizes that are mixtures of phase-type and subexponential variables. Exploiting a specific geometric compound representation, we propose control variate techniques to efficiently simulate the ruin probability in this situation. The resulting estimators perform well for both small and large initial capital. We quantify the variance reduction as well as the efficiency gain of our method over another fast standard technique based on the classical Pollaczek-Khinchine formula. We provide a numerical example to illustrate the performance, and show that for more time-consuming conditional Monte Carlo techniques, the new series representation also does not compare unfavorably to the one based on the Pollaczek- Khinchine formula.
△ Less
Submitted 12 June, 2020;
originally announced June 2020.
-
Multivariate fractional phase--type distributions
Authors:
Hansjoerg Albrecher,
Martin Bladt,
Mogens Bladt
Abstract:
We extend the Kulkarni class of multivariate phase--type distributions in a natural time--fractional way to construct a new class of multivariate distributions with heavy-tailed Mittag-Leffler(ML)-distributed marginals. The approach relies on assigning rewards to a non--Mar\-ko\-vi\-an jump process with ML sojourn times. This new class complements an earlier multivariate ML construction \cite{mult…
▽ More
We extend the Kulkarni class of multivariate phase--type distributions in a natural time--fractional way to construct a new class of multivariate distributions with heavy-tailed Mittag-Leffler(ML)-distributed marginals. The approach relies on assigning rewards to a non--Mar\-ko\-vi\-an jump process with ML sojourn times. This new class complements an earlier multivariate ML construction \cite{multiml} and in contrast to the former also allows for tail dependence. We derive properties and characterizations of this class, and work out some special cases that lead to explicit density representations.
△ Less
Submitted 24 March, 2020;
originally announced March 2020.
-
Multivariate Matrix Mittag--Leffler distributions
Authors:
Hansjoerg Albrecher,
Martin Bladt,
Mogens Bladt
Abstract:
We extend the construction principle of multivariate phase-type distributions to establish an analytically tractable class of heavy-tailed multivariate random variables whose marginal distributions are of Mittag-Leffler type with arbitrary index of regular variation. The construction can essentially be seen as allowing a scalar parameter to become matrix-valued. The class of distributions is shown…
▽ More
We extend the construction principle of multivariate phase-type distributions to establish an analytically tractable class of heavy-tailed multivariate random variables whose marginal distributions are of Mittag-Leffler type with arbitrary index of regular variation. The construction can essentially be seen as allowing a scalar parameter to become matrix-valued. The class of distributions is shown to be dense among all multivariate positive random variables and hence provides a versatile candidate for the modelling of heavy-tailed, but tail-independent, risks in various fields of application.
△ Less
Submitted 23 March, 2020;
originally announced March 2020.
-
Combined Tail Estimation Using Censored Data and Expert Information
Authors:
Martin Bladt,
Hansjoerg Albrecher,
Jan Beirlant
Abstract:
We study tail estimation in Pareto-like settings for datasets with a high percentage of randomly right-censored data, and where some expert information on the tail index is available for the censored observations. This setting arises for instance naturally for liability insurance claims, where actuarial experts build reserves based on the specificity of each open claim, which can be used to improv…
▽ More
We study tail estimation in Pareto-like settings for datasets with a high percentage of randomly right-censored data, and where some expert information on the tail index is available for the censored observations. This setting arises for instance naturally for liability insurance claims, where actuarial experts build reserves based on the specificity of each open claim, which can be used to improve the estimation based on the already available data points from closed claims. Through an entropy-perturbed likelihood we derive an explicit estimator and establish a close analogy with Bayesian methods. Embedded in an extreme value approach, asymptotic normality of the estimator is shown, and when the expert is clair-voyant, a simple combination formula can be deduced, bridging the classical statistical approach with the expert information. Following the aforementioned combination formula, a combination of quantile estimators can be naturally defined. In a simulation study, the estimator is shown to often outperform the Hill estimator for censored observations and recent Bayesian solutions, some of which require more information than usually available. Finally we perform a case study on a motor third-party liability insurance claim dataset, where Hill-type and quantile plots incorporate ultimate values into the estimation procedure in an intuitive manner.
△ Less
Submitted 12 November, 2019; v1 submitted 9 August, 2019;
originally announced August 2019.
-
Matrix Mittag--Leffler distributions and modeling heavy-tailed risks
Authors:
Hansjoerg Albrecher,
Martin Bladt,
Mogens Bladt
Abstract:
In this paper we define the class of matrix Mittag-Leffler distributions and study some of its properties. We show that it can be interpreted as a particular case of an inhomogeneous phase-type distribution with random scaling factor, and alternatively also as the absorption time of a semi-Markov process with Mittag-Leffler distributed interarrival times. We then identify this class and its power…
▽ More
In this paper we define the class of matrix Mittag-Leffler distributions and study some of its properties. We show that it can be interpreted as a particular case of an inhomogeneous phase-type distribution with random scaling factor, and alternatively also as the absorption time of a semi-Markov process with Mittag-Leffler distributed interarrival times. We then identify this class and its power transforms as a remarkably parsimonious and versatile family for the modelling of heavy-tailed risks, which overcomes some disadvantages of other approaches like the problem of threshold selection in extreme value theory. We illustrate this point both on simulated data as well as on a set of real-life MTPL insurance data that were modeled differently in the past.
△ Less
Submitted 27 April, 2020; v1 submitted 12 June, 2019;
originally announced June 2019.
-
Matrix calculations for inhomogeneous Markov reward processes, with applications to life insurance and point processes
Authors:
Mogens Bladt,
Søren Asmussen,
Mogens Steffensen
Abstract:
A multi--state life insurance model is naturally described in terms of the intensity matrix of an underlying (time--inhomogeneous) Markov process which describes the dynamics for the states of an insured person. Between and at transitions, benefits and premiums are paid, defining a payment process, and the technical reserve is defined as the present value of all future payments of the contract. Cl…
▽ More
A multi--state life insurance model is naturally described in terms of the intensity matrix of an underlying (time--inhomogeneous) Markov process which describes the dynamics for the states of an insured person. Between and at transitions, benefits and premiums are paid, defining a payment process, and the technical reserve is defined as the present value of all future payments of the contract. Classical methods for finding the reserve and higher order moments involve the solution of certain differential equations (Thiele and Hattendorf, respectively). In this paper we present an alternative matrix--oriented approach based on general reward considerations for Markov jump processes. The matrix approach provides a general framework for effortlessly setting up general and even complex multi--state models, where moments of all orders are then expressed explicitly in terms of so--called product integrals (matrix--exponentials) of certain matrices. As Thiele and Hattendorf type of theorems can be retrieved immediately from the matrix formulae, this methods also provides a quick and transparent approach to proving these classical results. Methods for obtaining distributions and related properties of interest (e.g. quantiles or survival functions) of the future payments are presented from both a theoretical and practical point of view (via Laplace transforms and methods involving orthogonal polynomials).
△ Less
Submitted 11 May, 2019;
originally announced May 2019.
-
Threshold selection and trimming in extremes
Authors:
Martin Bladt,
Hansjoerg Albrecher,
Jan Beirlant
Abstract:
We consider removing lower order statistics from the classical Hill estimator in extreme value statistics, and compensating for it by rescaling the remaining terms. Trajectories of these trimmed statistics as a function of the extent of trimming turn out to be quite flat near the optimal threshold value. For the regularly varying case, the classical threshold selection problem in tail estimation i…
▽ More
We consider removing lower order statistics from the classical Hill estimator in extreme value statistics, and compensating for it by rescaling the remaining terms. Trajectories of these trimmed statistics as a function of the extent of trimming turn out to be quite flat near the optimal threshold value. For the regularly varying case, the classical threshold selection problem in tail estimation is then revisited, both visually via trimmed Hill plots and, for the Hall class, also mathematically via minimizing the expected empirical variance. This leads to a simple threshold selection procedure for the classical Hill estimator which circumvents the estimation of some of the tail characteristics, a problem which is usually the bottleneck in threshold selection. As a by-product, we derive an alternative estimator of the tail index, which assigns more weight to large observations, and works particularly well for relatively lighter tails. A simple ratio statistic routine is suggested to evaluate the goodness of the implied selection of the threshold. We illustrate the favourable performance and the potential of the proposed method with simulation studies and real insurance data.
△ Less
Submitted 28 June, 2020; v1 submitted 19 March, 2019;
originally announced March 2019.
-
Characterisation of exchangeable sequences through empirical distributions
Authors:
Martin Bladt,
Dimitry Shaiderman
Abstract:
It is a well-known fact that an exchangeable sequence has empirical distributions that form a reverse-martingale. This paper is devoted to proof of the converse statement. As a byproduct of the proof for the binary case, we introduce and discuss the notion of two-coloring exchangeability.
It is a well-known fact that an exchangeable sequence has empirical distributions that form a reverse-martingale. This paper is devoted to proof of the converse statement. As a byproduct of the proof for the binary case, we introduce and discuss the notion of two-coloring exchangeability.
△ Less
Submitted 24 September, 2023; v1 submitted 19 March, 2019;
originally announced March 2019.
-
Inhomogeneous phase--type distributions and heavy tails
Authors:
Hansjörg Albrecher,
Mogens Bladt
Abstract:
We extend the construction principle of phase-type (PH) distributions to allow for inhomogeneous transition rates and show that this naturally leads to direct probabilistic descriptions of certain transformations of PH distributions. In particular, the resulting matrix distributions enable to carry over fitting properties of PH distributions to distributions with heavy tails, providing a general m…
▽ More
We extend the construction principle of phase-type (PH) distributions to allow for inhomogeneous transition rates and show that this naturally leads to direct probabilistic descriptions of certain transformations of PH distributions. In particular, the resulting matrix distributions enable to carry over fitting properties of PH distributions to distributions with heavy tails, providing a general modelling framework for heavy-tail phenomena. We also illustrate the versatility and parsimony of the proposed approach for the modelling of a real-world heavy-tailed fire insurance dataset.
△ Less
Submitted 28 June, 2019; v1 submitted 10 December, 2018;
originally announced December 2018.
-
Phase-type distributions in population genetics
Authors:
Asger Hobolth,
Arno Siri-Jégousse,
Mogens Bladt
Abstract:
Probability modelling for DNA sequence evolution is well established and provides a rich framework for understanding genetic variation between samples of individuals from one or more populations. We show that both classical and more recent models for coalescence (with or without recombination) can be described in terms of the so-called phase-type theory, where complicated and tedious calculations…
▽ More
Probability modelling for DNA sequence evolution is well established and provides a rich framework for understanding genetic variation between samples of individuals from one or more populations. We show that both classical and more recent models for coalescence (with or without recombination) can be described in terms of the so-called phase-type theory, where complicated and tedious calculations are circumvented by the use of matrices. The application of phase-type theory consists of describing the stochastic model as a Markov model by appropriately setting up a state space and calculating the corresponding intensity and reward matrices. Formulae of interest are then expressed in terms of these aforementioned matrices. We illustrate this by a few examples calculating the mean, variance and even higher order moments of the site frequency spectrum in the multiple merger coalescent models, and by analysing the mean and variance for the number of segregating sites for multiple samples in the two-locus ancestral recombination graph. We believe that phase-type theory has great potential as a tool for analysing probability models in population genetics. The compact matrix notation is useful for clarification of current models, in particular their formal manipulation (calculation), but also for further development or extensions.
△ Less
Submitted 4 June, 2018;
originally announced June 2018.
-
Fitting phase--type scale mixtures to heavy--tailed data and distributions
Authors:
Mogens Bladt,
Leonardo Rojas-Nandayapa
Abstract:
We consider the fitting of heavy tailed data and distribution with a special attention to distributions with a non--standard shape in the "body" of the distribution. To this end we consider a dense class of heavy tailed distributions introduced recently, employing an EM algorithm for the the maximum likelihood estimates of its parameters. We present methods for fitting to observed data, histograms…
▽ More
We consider the fitting of heavy tailed data and distribution with a special attention to distributions with a non--standard shape in the "body" of the distribution. To this end we consider a dense class of heavy tailed distributions introduced recently, employing an EM algorithm for the the maximum likelihood estimates of its parameters. We present methods for fitting to observed data, histograms, censored data, as well as to theoretical distributions. Numerical examples are provided with simulated data and a benchmark reinsurance dataset. We empirically demonstrate that our model can provide excellent fits to heavy--tailed data/distributions with minimal assumptions
△ Less
Submitted 11 May, 2017;
originally announced May 2017.
-
Simulation of multivariate diffusion bridge
Authors:
Mogens Bladt,
Samuel Finch,
Michael Sørensen
Abstract:
We propose simple methods for multivariate diffusion bridge simulation, which plays a fundamental role in simulation-based likelihood and Bayesian inference for stochastic differential equations. By a novel application of classical coupling methods, the new approach generalizes a previously proposed simulation method for one-dimensional bridges to the multi-variate setting. First a method of simul…
▽ More
We propose simple methods for multivariate diffusion bridge simulation, which plays a fundamental role in simulation-based likelihood and Bayesian inference for stochastic differential equations. By a novel application of classical coupling methods, the new approach generalizes a previously proposed simulation method for one-dimensional bridges to the multi-variate setting. First a method of simulating approximate, but often very accurate, diffusion bridges is proposed. These approximate bridges are used as proposal for easily implementable MCMC algorithms that produce exact diffusion bridges. The new method is much more generally applicable than previous methods. Another advantage is that the new method works well for diffusion bridges in long intervals because the computational complexity of the method is linear in the length of the interval. In a simulation study the new method performs well, and its usefulness is illustrated by an application to Bayesian estimation for the multivariate hyperbolic diffusion model.
△ Less
Submitted 29 May, 2014;
originally announced May 2014.
-
Simple simulation of diffusion bridges with application to likelihood inference for diffusions
Authors:
Mogens Bladt,
Michael Sørensen
Abstract:
With a view to statistical inference for discretely observed diffusion models, we propose simple methods of simulating diffusion bridges, approximately and exactly. Diffusion bridge simulation plays a fundamental role in likelihood and Bayesian inference for diffusion processes. First a simple method of simulating approximate diffusion bridges is proposed and studied. Then these approximate bridge…
▽ More
With a view to statistical inference for discretely observed diffusion models, we propose simple methods of simulating diffusion bridges, approximately and exactly. Diffusion bridge simulation plays a fundamental role in likelihood and Bayesian inference for diffusion processes. First a simple method of simulating approximate diffusion bridges is proposed and studied. Then these approximate bridges are used as proposal for an easily implemented Metropolis-Hastings algorithm that produces exact diffusion bridges. The new method utilizes time-reversibility properties of one-dimensional diffusions and is applicable to all one-dimensional diffusion processes with finite speed-measure. One advantage of the new approach is that simple simulation methods like the Milstein scheme can be applied to bridge simulation. Another advantage over previous bridge simulation methods is that the proposed method works well for diffusion bridges in long intervals because the computational complexity of the method is linear in the length of the interval. For $ρ$-mixing diffusions the approximate method is shown to be particularly accurate for long time intervals. In a simulation study, we investigate the accuracy and efficiency of the approximate method and compare it to exact simulation methods. In the study, our method provides a very good approximation to the distribution of a diffusion bridge for bridges that are likely to occur in applications to statistical inference. To illustrate the usefulness of the new method, we present an EM-algorithm for a discretely observed diffusion process.
△ Less
Submitted 7 March, 2014;
originally announced March 2014.