Search | arXiv e-print repository

Heterogeneous extremes in the presence of random covariates and censoring

Authors: Martin Bladt, Christoffer Øhlenschlæger

Abstract: The task of analyzing extreme events with censoring effects is considered under a framework allowing for random covariate information. A wide class of estimators that can be cast as product-limit integrals is considered, for when the conditional distributions belong to the Frechet max-domain of attraction. The main mathematical contribution is establishing uniform conditions on the families of the… ▽ More The task of analyzing extreme events with censoring effects is considered under a framework allowing for random covariate information. A wide class of estimators that can be cast as product-limit integrals is considered, for when the conditional distributions belong to the Frechet max-domain of attraction. The main mathematical contribution is establishing uniform conditions on the families of the regularly varying tails for which the asymptotic behaviour of the resulting estimators is tractable. In particular, a decomposition of the integral estimators in terms of exchangeable sums is provided, which leads to a law of large numbers and several central limit theorems. Subsequently, the finite-sample behaviour of the estimators is explored through a simulation study, and through the analysis of two real-life datasets. In particular, the inclusion of covariates makes the model significantly versatile and, as a consequence, practically relevant. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 53 pages, 15 figures

MSC Class: 62G32

arXiv:2405.20817 [pdf, other]

Extremile scalar-on-function regression with application to climate scenarios

Authors: Maria Laura Battagliola, Martin Bladt

Abstract: Extremiles provide a generalization of quantiles which are not only robust, but also have an intrinsic link with extreme value theory. This paper introduces an extremile regression model tailored for functional covariate spaces. The estimation procedure turns out to be a weighted version of local linear scalar-on-function regression, where now a double kernel approach plays a crucial role. Asympto… ▽ More Extremiles provide a generalization of quantiles which are not only robust, but also have an intrinsic link with extreme value theory. This paper introduces an extremile regression model tailored for functional covariate spaces. The estimation procedure turns out to be a weighted version of local linear scalar-on-function regression, where now a double kernel approach plays a crucial role. Asymptotic expressions for the bias and variance are established, applicable to both decreasing bandwidth sequences and automatically selected bandwidths. The methodology is then investigated in detail through a simulation study. Furthermore, we highlight the applicability of the model through the analysis of data sourced from the CH2018 Swiss climate scenarios project, offering insights into its ability to serve as a modern tool to quantify climate behaviour. △ Less

Submitted 31 May, 2024; originally announced May 2024.

arXiv:2312.10499 [pdf, other]

Censored extreme value estimation

Authors: Martin Bladt, Igor Rodionov

Abstract: A novel and comprehensive methodology designed to tackle the challenges posed by extreme values in the context of random censorship is introduced. The main focus is on the analysis of integrals based on the product-limit estimator of normalized upper order statistics, called extreme Kaplan--Meier integrals. These integrals allow for the transparent derivation of various important asymptotic distri… ▽ More A novel and comprehensive methodology designed to tackle the challenges posed by extreme values in the context of random censorship is introduced. The main focus is on the analysis of integrals based on the product-limit estimator of normalized upper order statistics, called extreme Kaplan--Meier integrals. These integrals allow for the transparent derivation of various important asymptotic distributional properties, offering an alternative approach to conventional plug-in estimation methods. Notably, this methodology demonstrates robustness and wide applicability within the scope of max-domains of attraction. A noteworthy by-product is the extension of generalized Hill-type estimators of extremes to encompass all max-domains of attraction, which is of independent interest. The theoretical framework is applied to construct novel estimators for positive and real-valued extreme value indices for right-censored data. Simulation studies supporting the theory are provided. △ Less

Submitted 27 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

arXiv:2312.06784 [pdf, other]

Pathwise and distributional approximations of semi-Markov processes

Authors: Martin Bladt, Andreea Minca, Oscar Peralta

Abstract: Continuous-time semi-Markov finite state-space jump processes are considered, inspired by a duration-dependent life insurance model. New approximations using grid-conditional homogeneous Markov jump-processes are developed, based on a recent adaptation of the uniformization principle which results in a strong pathwise convergent sequence of jump processes. Unlike traditional methods that use class… ▽ More Continuous-time semi-Markov finite state-space jump processes are considered, inspired by a duration-dependent life insurance model. New approximations using grid-conditional homogeneous Markov jump-processes are developed, based on a recent adaptation of the uniformization principle which results in a strong pathwise convergent sequence of jump processes. Unlike traditional methods that use classical approximations to integro-differential equation solutions to compute their value functions, the proposed grid-conditional homogeneous Markov jump-processes allows for a direct and tractable approximation. In particular, these approximations simplify to easily implementable expressions, making them useful in areas where evaluating pathwise distributional functionals is difficult. Our homogeneous approximation, initially of a grid-conditional kind, is evolved into an unconditional version that holds well under fair regularity assumptions. The practicality of this approach is demonstrated on a disability life insurance model, with realistic underlying semi-Markov process parameters, showcasing its broader applicability in operations research and related fields. △ Less

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2311.07384 [pdf, ps, other]

Individual claims reserving using the Aalen--Johansen estimator

Authors: Martin Bladt, Gabriele Pittarello

Abstract: We propose an individual claims reserving model based on the conditional Aalen-Johansen estimator, as developed in Bladt and Furrer (2023b). In our approach, we formulate a multi-state problem, where the underlying variable is the individual claim size, rather than time. The states in this model represent development periods, and we estimate the cumulative density function of individual claim size… ▽ More We propose an individual claims reserving model based on the conditional Aalen-Johansen estimator, as developed in Bladt and Furrer (2023b). In our approach, we formulate a multi-state problem, where the underlying variable is the individual claim size, rather than time. The states in this model represent development periods, and we estimate the cumulative density function of individual claim sizes using the conditional Aalen-Johansen method as transition probabilities to an absorbing state. Our methodology reinterprets the concept of multi-state models and offers a strategy for modeling the complete curve of individual claim sizes. To illustrate our approach, we apply our model to both simulated and real datasets. Having access to the entire dataset enables us to support the use of our approach by comparing the predicted total final cost with the actual amount, as well as evaluating it in terms of the continuously ranked probability score. △ Less

Submitted 3 June, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

arXiv:2303.02119 [pdf, other]

Conditional Aalen--Johansen estimation

Authors: Martin Bladt, Christian Furrer

Abstract: The conditional Aalen--Johansen estimator, a general-purpose non-parametric estimator of conditional state occupation probabilities, is introduced. The estimator is applicable for any finite-state jump process and supports conditioning on external as well as internal covariate information. The conditioning feature permits for a much more detailed analysis of the distributional characteristics of t… ▽ More The conditional Aalen--Johansen estimator, a general-purpose non-parametric estimator of conditional state occupation probabilities, is introduced. The estimator is applicable for any finite-state jump process and supports conditioning on external as well as internal covariate information. The conditioning feature permits for a much more detailed analysis of the distributional characteristics of the process. The estimator reduces to the conditional Kaplan--Meier estimator in the special case of a survival model and also englobes other, more recent, landmark estimators when covariates are discrete. Strong uniform consistency and asymptotic normality are established under lax moment conditions on the multivariate counting process, allowing in particular for an unbounded number of transitions. △ Less

Submitted 4 June, 2024; v1 submitted 3 March, 2023; originally announced March 2023.

MSC Class: 62N02; 62G20; 60J74

arXiv:2212.10661 [pdf, other]

Aggregate Markov models in life insurance: estimation via the EM algorithm

Authors: Jamaal Ahmad, Mogens Bladt

Abstract: In this paper, we consider statistical estimation of time-inhomogeneous aggregate Markov models. Unaggregated models, which corresponds to Markov chains, are commonly used in multi-state life insurance to model the biometric states of an insured. By aggregating microstates to each biometric state, we are able to model dependencies between transitions of the biometric states as well as the distribu… ▽ More In this paper, we consider statistical estimation of time-inhomogeneous aggregate Markov models. Unaggregated models, which corresponds to Markov chains, are commonly used in multi-state life insurance to model the biometric states of an insured. By aggregating microstates to each biometric state, we are able to model dependencies between transitions of the biometric states as well as the distribution of occupancy in these. This allows for non--Markovian modelling in general. Since only paths of the macrostates are observed, we develop an expectation-maximization (EM) algorithm to obtain maximum likelihood estimates of transition intensities on the micro level. Special attention is given to a semi-Markovian case, known as the reset property, which leads to simplified estimation procedures where EM algorithms for inhomogeneous phase-type distributions can be used as building blocks. We provide a numerical example of the latter in combination with piecewise constant transition rates in a three-state disability model with data simulated from a time-inhomogeneous semi-Markov model. Comparisons of our fits with more classic GLM-based fits as well as true and empirical distributions are provided to relate our model with existing models and their tools. △ Less

Submitted 10 August, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

arXiv:2212.03705 [pdf, other]

Aggregate Markov models in life insurance: properties and valuation

Authors: Jamaal Ahmad, Mogens Bladt, Christian Furrer

Abstract: In multi-state life insurance, an adequate balance between analytic tractability, computational efficiency, and statistical flexibility is of great importance. This might explain the popularity of Markov chain modelling, where matrix analytic methods allow for a comprehensive treatment. Unfortunately, Markov chain modelling is unable to capture duration effects, so this paper presents aggregate Ma… ▽ More In multi-state life insurance, an adequate balance between analytic tractability, computational efficiency, and statistical flexibility is of great importance. This might explain the popularity of Markov chain modelling, where matrix analytic methods allow for a comprehensive treatment. Unfortunately, Markov chain modelling is unable to capture duration effects, so this paper presents aggregate Markov models as an alternative. Aggregate Markov models retain most of the analytical tractability of Markov chains, yet are non-Markovian and thus more flexible. Based on an explicit characterization of the fundamental martingales, matrix representations of the expected accumulated cash flows and corresponding prospective reserves are derived for duration-dependent payments with and without incidental policyholder behaviour. Throughout, special attention is given to a semi-Markovian case. Finally, the methods and results are illustrated in a numerical example. △ Less

Submitted 23 April, 2024; v1 submitted 7 December, 2022; originally announced December 2022.

arXiv:2207.11303 [pdf, other]

Estimating absorption time distributions of general Markov jump processes

Authors: Jamaal Ahmad, Martin Bladt, Mogens Bladt

Abstract: The estimation of absorption time distributions of Markov jump processes is an important task in various branches of statistics and applied probability. While the time-homogeneous case is classic, the time-inhomogeneous case has recently received increased attention due to its added flexibility and advances in computational power. However, commuting sub-intensity matrices are assumed, which in var… ▽ More The estimation of absorption time distributions of Markov jump processes is an important task in various branches of statistics and applied probability. While the time-homogeneous case is classic, the time-inhomogeneous case has recently received increased attention due to its added flexibility and advances in computational power. However, commuting sub-intensity matrices are assumed, which in various cases limits the parsimonious properties of the resulting representation. This paper develops the theory required to solve the general case through maximum likelihood estimation, and in particular, using the expectation-maximization algorithm. A reduction to a piecewise constant intensity matrix function is proposed in order to provide succinct representations, where a parametric linear model binds the intensities together. Practical aspects are discussed and illustrated through the estimation of notoriously demanding theoretical distributions and real data, from the perspective of matrix analytic methods. △ Less

Submitted 22 July, 2022; originally announced July 2022.

arXiv:2207.11292 [pdf, other]

Phase-type representations of stochastic interest rates with applications to life insurance

Authors: Jamaal Ahmad, Mogens Bladt

Abstract: The purpose of the present paper is to incorporate stochastic interest rates into a matrix-approach to multi-state life insurance, where formulas for reserves, moments of future payments and equivalence premiums can be obtained as explicit formulas in terms of product integrals or matrix exponentials. To this end we consider the Markovian interest model, where the rates are piecewise deterministic… ▽ More The purpose of the present paper is to incorporate stochastic interest rates into a matrix-approach to multi-state life insurance, where formulas for reserves, moments of future payments and equivalence premiums can be obtained as explicit formulas in terms of product integrals or matrix exponentials. To this end we consider the Markovian interest model, where the rates are piecewise deterministic (or even constant) in the different states of a Markov jump process, and which is shown to integrate naturally into the matrix framework. The discounting factor then becomes the price of a zero-coupon bond which may or may not be correlated with the biometric insurance process. Another nice feature about the Markovian interest model is that the price of the bond coincides with the survival function of a phase-type distributed random variable. This, in particular, allows for calibrating the Markovian interest rate models using a maximum likelihood approach to observed data (prices) or to theoretical models like e.g. a Vasicek model. Due to the denseness of phase-type distributions, we can approximate the price behaviour of any zero-coupon bond with interest rates bounded from below by choosing the number of possible interest rate values sufficiently large. For observed data models with few data points, lower dimensions will usually suffice, while for theoretical models the dimensionality is only a computational issue. △ Less

Submitted 17 November, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

arXiv:2207.01364 [pdf, other]

Joint discrete and continuous matrix distribution modelling

Authors: Martin Bladt, Clara Brimnes Gardner

Abstract: In this paper we introduce a bivariate distribution on $\mathbb{R}_{+} \times \mathbb{N}$ arising from a single underlying Markov jump process. The marginal distributions are phase-type and discrete phase-type distributed, respectively, which allow for flexible behavior for modeling purposes. We show that the distribution is dense in the class of distributions on… ▽ More In this paper we introduce a bivariate distribution on $\mathbb{R}_{+} \times \mathbb{N}$ arising from a single underlying Markov jump process. The marginal distributions are phase-type and discrete phase-type distributed, respectively, which allow for flexible behavior for modeling purposes. We show that the distribution is dense in the class of distributions on $\mathbb{R}_{+} \times \mathbb{N}$ and derive some of its main properties, all explicit in terms of matrix calculus. Furthermore, we develop an effective EM algorithm for the statistical estimation of the distribution parameters. In the last part of the paper, we apply our methodology to an insurance dataset, where we model the number of claims and the mean claim sizes of policyholders, which is seen to perform favorably. An additional consequence of the latter analysis is that the total loss size in the entire portfolio is captured substantially better than with independent phase-type models. △ Less

Submitted 4 July, 2022; originally announced July 2022.

arXiv:2206.13120 [pdf, other]

Expert Kaplan--Meier estimation

Authors: Martin Bladt, Christian Furrer

Abstract: The setting of a right-censored random sample subject to contamination is considered. In various fields, expert information is often available and used to overcome the contamination. This paper integrates expert knowledge into the product-limit estimator in two different ways with distinct interpretations. Strong uniform consistency is proved for both cases under certain assumptions on the kind of… ▽ More The setting of a right-censored random sample subject to contamination is considered. In various fields, expert information is often available and used to overcome the contamination. This paper integrates expert knowledge into the product-limit estimator in two different ways with distinct interpretations. Strong uniform consistency is proved for both cases under certain assumptions on the kind of contamination and the quality of expert information, which sheds light on the techniques and decisions that practitioners may take. The nuances of the techniques are discussed -- also with a view towards semi-parametric estimation -- and they are illustrated using simulated and real-world insurance data. △ Less

Submitted 27 March, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

arXiv:2206.13091 [pdf, other]

Informed censoring: the parametric combination of data and expert information

Authors: Hansjörg Albrecher, Martin Bladt

Abstract: The statistical censoring setup is extended to the situation when random measures can be assigned to the realization of datapoints, leading to a new way of incorporating expert information into the usual parametric estimation procedures. The asymptotic theory is provided for the resulting estimators, and some special cases of practical relevance are studied in more detail. Although the proposed fr… ▽ More The statistical censoring setup is extended to the situation when random measures can be assigned to the realization of datapoints, leading to a new way of incorporating expert information into the usual parametric estimation procedures. The asymptotic theory is provided for the resulting estimators, and some special cases of practical relevance are studied in more detail. Although the proposed framework mathematically generalizes censoring and coarsening at random, and borrows techniques from M-estimation theory, it provides a novel and transparent methodology which enjoys significant practical applicability in situations where expert information is present. The potential of the approach is illustrated by a concrete actuarial application of tail parameter estimation for a heavy-tailed MTPL dataset with limited available expert information. △ Less

Submitted 3 December, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

arXiv:2204.02954 [pdf, other]

Strongly convergent homogeneous approximations to inhomogeneous Markov jump processes and applications

Authors: Martin Bladt, Oscar Peralta

Abstract: The study of time-inhomogeneous Markov jump processes is a traditional topic within probability theory that has recently attracted substantial attention in various applications. However, their flexibility also incurs a substantial mathematical burden which is usually circumvented by using well-known generic distributional approximations or simulations. This article provides a novel approximation m… ▽ More The study of time-inhomogeneous Markov jump processes is a traditional topic within probability theory that has recently attracted substantial attention in various applications. However, their flexibility also incurs a substantial mathematical burden which is usually circumvented by using well-known generic distributional approximations or simulations. This article provides a novel approximation method that tailors the dynamics of a time-homogeneous Markov jump process to meet those of its time-inhomogeneous counterpart on an increasingly fine Poisson grid. Strong convergence of the processes in terms of the Skorokhod $J_1$ metric is established, and convergence rates are provided. Under traditional regularity assumptions, distributional convergence is established for unconditional proxies, to the same limit. Special attention is devoted to the case where the target process has one absorbing state and the remaining ones transient, for which the absorption times also converge. Some applications are outlined, such as univariate hazard-rate density estimation, ruin probabilities, and multivariate phase-type density evaluation. △ Less

Submitted 1 November, 2023; v1 submitted 6 April, 2022; originally announced April 2022.

arXiv:2111.00581 [pdf, other]

Phase-type mixture-of-experts regression for loss severities

Authors: Martin Bladt, Jorge Yslas

Abstract: The task of modeling claim severities is addressed when data is not consistent with the classical regression assumptions. This framework is common in several lines of business within insurance and reinsurance, where catastrophic losses or heterogeneous sub-populations result in data difficult to model. Their correct analysis is required for pricing insurance products, and some of the most prevalen… ▽ More The task of modeling claim severities is addressed when data is not consistent with the classical regression assumptions. This framework is common in several lines of business within insurance and reinsurance, where catastrophic losses or heterogeneous sub-populations result in data difficult to model. Their correct analysis is required for pricing insurance products, and some of the most prevalent recent specifications in this direction are mixture-of-experts models. This paper proposes a regression model that generalizes the latter approach to the phase-type distribution setting. More specifically, the concept of mixing is extended to the case where an entire Markov jump process is unobserved and where states can communicate with each other. The covariates then act on the initial probabilities of such underlying chain, which play the role of expert weights. The basic properties of such a model are computed in terms of matrix functionals, and denseness properties are derived, demonstrating their flexibility. An effective estimation procedure is proposed, based on the EM algorithm and multinomial logistic regression, and subsequently illustrated using simulated and real-world datasets. The increased flexibility of the proposed models does not come at a high computational cost, and the motivation and interpretation are equally transparent to simpler MoE models. △ Less

Submitted 31 March, 2022; v1 submitted 31 October, 2021; originally announced November 2021.

arXiv:2110.05207 [pdf, other]

Phase-type distributions for claim severity regression modeling

Authors: Martin Bladt

Abstract: This paper addresses the task of modeling severity losses using segmentation when the data distribution does not fall into the usual regression frameworks. This situation is not uncommon in lines of business such as third-party liability insurance, where heavy-tails and multimodality often hamper a direct statistical analysis. We propose to use regression models based on phase-type distributions,… ▽ More This paper addresses the task of modeling severity losses using segmentation when the data distribution does not fall into the usual regression frameworks. This situation is not uncommon in lines of business such as third-party liability insurance, where heavy-tails and multimodality often hamper a direct statistical analysis. We propose to use regression models based on phase-type distributions, regressing on their underlying inhomogeneous Markov intensity and using an extension of the EM algorithm. These models are interpretable and tractable in terms of multi-state processes and generalize the proportional hazards specification when the dimension of the state space is larger than one. We show that the combination of matrix parameters, inhomogeneity transforms, and covariate information provides flexible regression models that effectively capture the entire distribution of loss severities. △ Less

Submitted 26 November, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

arXiv:2110.05199 [pdf, other]

Fractional Inhomogeneous Multi-state Models in Life Insurance

Authors: Martin Bladt

Abstract: In this paper, we demonstrate through the use of matrix calculus a transparent analysis of fractional inhomogeneous Markov models for life insurance where transition matrices commute. The resulting formulae are intuitive matrix generalizations of their single-state counterparts, and the absorption times are matrix versions of well-known scalar distributions. A further advantage of this approach is… ▽ More In this paper, we demonstrate through the use of matrix calculus a transparent analysis of fractional inhomogeneous Markov models for life insurance where transition matrices commute. The resulting formulae are intuitive matrix generalizations of their single-state counterparts, and the absorption times are matrix versions of well-known scalar distributions. A further advantage of this approach is that it allows extending the analysis to the non-Markovian case where sojourns are Mittag-Leffler distributed, and where the absorption times are fractional phase-type distributed. Considering deterministic time transforms gives rise to fractional inhomogeneous phase-type distributions as absorption times. The latter underlying processes are an example of a regime where not only the present but also the history of a policyholder influences its future evolution. The sub-exponential nature of stable distributions translates into the multi-state insurance model as a random longevity risk at any given state of the chain. △ Less

Submitted 21 October, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

arXiv:2110.05179 [pdf, other]

A tractable class of multivariate phase-type distributions for loss modeling

Authors: Martin Bladt

Abstract: Phase-type (PH) distributions are a popular tool for the analysis of univariate risks in numerous actuarial applications. Their multivariate counterparts (MPH$^\ast$), however, have not seen such a proliferation, due to lack of explicit formulas and complicated estimation procedures. A simple construction of multivariate phase-type distributions -- mPH -- is proposed for the parametric description… ▽ More Phase-type (PH) distributions are a popular tool for the analysis of univariate risks in numerous actuarial applications. Their multivariate counterparts (MPH$^\ast$), however, have not seen such a proliferation, due to lack of explicit formulas and complicated estimation procedures. A simple construction of multivariate phase-type distributions -- mPH -- is proposed for the parametric description of multivariate risks, leading to models of considerable probabilistic flexibility and statistical tractability. The main idea is to start different Markov processes at the same state, and allow them to evolve independently thereafter, leading to dependent absorption times. By dimension augmentation arguments, this construction can be cast into the umbrella of MPH$^\ast$ class, but enjoys explicit formulas which the general specification lacks, including common measures of dependence. Moreover, it is shown that the class is still rich enough to be dense on the set of multivariate risks supported on the positive orthant, and it is the smallest known sub-class to have this property. In particular, the latter result provides a new short proof of the denseness of the MPH$^\ast$ class. In practice this means that the mPH class allows for modeling of bivariate risks with any given correlation or copula. We derive an EM algorithm for its statistical estimation, and illustrate it on bivariate insurance data. Extensions to more general settings are outlined. △ Less

Submitted 21 December, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

arXiv:2107.09023 [pdf, other]

Heavy-tailed phase-type distributions: A unified approach

Authors: Martin Bladt, Jorge Yslas

Abstract: A phase-type distribution is the distribution of the time until absorption in a finite state-space time-homogeneous Markov jump process, with one absorbing state and the rest being transient. These distributions are mathematically tractable and conceptually attractive to model physical phenomena due to their interpretation in terms of a hidden Markov structure. Three recent extensions of regular p… ▽ More A phase-type distribution is the distribution of the time until absorption in a finite state-space time-homogeneous Markov jump process, with one absorbing state and the rest being transient. These distributions are mathematically tractable and conceptually attractive to model physical phenomena due to their interpretation in terms of a hidden Markov structure. Three recent extensions of regular phase-type distributions give rise to models which allow for heavy tails: discrete- or continuous-scaling; fractional-time semi-Markov extensions; and inhomogeneous time-change of the underlying Markov process. In this paper, we present a unifying theory for heavy-tailed phase-type distributions for which all three approaches are particular cases. Our main objective is to provide useful models for heavy-tailed phase-type distributions, but any other tail behavior is also captured by our specification. We provide relevant new examples and also show how existing approaches are naturally embedded. Subsequently, two multivariate extensions are presented, inspired by the univariate construction which can be considered as a matrix version of a frailty model. We provide fully explicit EM-algorithms for all models and illustrate them using synthetic and real-life data. △ Less

Submitted 6 December, 2021; v1 submitted 19 July, 2021; originally announced July 2021.

arXiv:2107.00960 [pdf, other]

Time series models with infinite-order partial copula dependence

Authors: Martin Bladt, Alexander J. McNeil

Abstract: Stationary and ergodic time series can be constructed using an s-vine decomposition based on sets of bivariate copula functions. The extension of such processes to infinite copula sequences is considered and shown to yield a rich class of models that generalizes Gaussian ARMA and ARFIMA processes to allow both non-Gaussian marginal behaviour and a non-Gaussian description of the serial partial dep… ▽ More Stationary and ergodic time series can be constructed using an s-vine decomposition based on sets of bivariate copula functions. The extension of such processes to infinite copula sequences is considered and shown to yield a rich class of models that generalizes Gaussian ARMA and ARFIMA processes to allow both non-Gaussian marginal behaviour and a non-Gaussian description of the serial partial dependence structure. Extensions of classical causal and invertible representations of linear processes to general s-vine processes are proposed and investigated. A practical and parsimonious method for parameterizing s-vine processes using the Kendall partial autocorrelation function is developed. The potential of the resulting models to give improved statistical fits in many applications is indicated with an example using macroeconomic data. △ Less

Submitted 2 July, 2021; originally announced July 2021.

Comments: 30 pages, 4 figures

arXiv:2105.05523 [pdf, other]

Trimmed extreme value estimators for censored heavy-tailed data

Authors: Martin Bladt, Hansjoerg Albrecher, Jan Beirlant

Abstract: We consider estimation of the extreme value index and extreme quantiles for heavy-tailed data that are right-censored. We study a general procedure of removing low importance observations in tail estimators. This trimming procedure is applied to the state-of-the-art estimators for randomly right-censored tail estimators. Through an averaging procedure over the amount of trimming we derive new kern… ▽ More We consider estimation of the extreme value index and extreme quantiles for heavy-tailed data that are right-censored. We study a general procedure of removing low importance observations in tail estimators. This trimming procedure is applied to the state-of-the-art estimators for randomly right-censored tail estimators. Through an averaging procedure over the amount of trimming we derive new kernel type estimators. Extensive simulation suggests that one of the new considered kernels leads to a highly competitive estimator against virtually any other available alternative in this framework. Moreover, we propose an adaptive selection method for the amount of top data used in estimation based on the trimming procedure minimizing the asymptotic mean squared error. We also provide an illustration of this approach to simulated as well as to real-world MTPL insurance data. △ Less

Submitted 12 May, 2021; originally announced May 2021.

arXiv:2103.04396 [pdf, ps, other]

doi 10.1214/22-ejp788

Tail Measures and Regular Variation

Authors: Martin Bladt, Enkelejd Hashorva, Georgiy Shevchenko

Abstract: A general framework for the study of regular variation (RV) is that of Polish star-shaped metric spaces, while recent developments in [1] have discussed RV with respect to some properly localised boundedness $\mathcal{B}$ imposing weak assumptions on the structure of Polish space. Along the lines of the latter approach, we discuss the RV of Borel measures and random processes on general Polish m… ▽ More A general framework for the study of regular variation (RV) is that of Polish star-shaped metric spaces, while recent developments in [1] have discussed RV with respect to some properly localised boundedness $\mathcal{B}$ imposing weak assumptions on the structure of Polish space. Along the lines of the latter approach, we discuss the RV of Borel measures and random processes on general Polish metric spaces. Tail measures introduced in [2] appear naturally as limiting measures of regularly varying time series. We define tail measures on a measurable space indexed by $\mathcal{H}(D)$, a countable family of homogeneous coordinate maps, and show some tractable instances for the investigation of RV when $\mathcal{B}$ is determined by $\mathcal{H}(D)$. This allows us to study the regular variation of cadlag processes on $D(R^l, R^d)$ retrieving in particular results obtained in [1] for RV of stationary cadlag processes on the real line removing $l=1$ therein. Further, we discuss potential applications and open questions. △ Less

Submitted 30 June, 2022; v1 submitted 7 March, 2021; originally announced March 2021.

Comments: 37 pages, published

arXiv:2103.02457 [pdf, other]

Continuous scaled phase-type distributions

Authors: Hansjoerg Albrecher, Martin Bladt, Mogens Bladt, Jorge Yslas

Abstract: Products between phase-type distributed random variables and any independent, positive and continuous random variable are studied. Their asymptotic properties are established, and an expectation-maximization algorithm for their effective statistical inference is derived and implemented using real-world datasets. In contrast to discrete scaling studied in earlier literature, in the present continuo… ▽ More Products between phase-type distributed random variables and any independent, positive and continuous random variable are studied. Their asymptotic properties are established, and an expectation-maximization algorithm for their effective statistical inference is derived and implemented using real-world datasets. In contrast to discrete scaling studied in earlier literature, in the present continuous case closed-form formulas for various functionals of the resulting distributions are obtained, which facilitates both their analysis and implementation. The resulting mixture distributions are very often heavy-tailed and yet retain various properties of phase-type distributions, such as being dense (in weak convergence) on the set of distributions with positive support. △ Less

Submitted 24 November, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

arXiv:2101.08076 [pdf, other]

Fluctuation theory for one-sided Lévy processes with a matrix-exponential time horizon

Authors: Mogens Bladt, Jevgenijs Ivanovs

Abstract: There is an abundance of useful fluctuation identities for one-sided Lévy processes observed up to an independent exponentially distributed time horizon. We show that all the fundamental formulas generalize to time horizons having matrix exponential distributions, and the structure is preserved. Essentially, the positive killing rate is replaced by a matrix with eigenvalues in the right half of th… ▽ More There is an abundance of useful fluctuation identities for one-sided Lévy processes observed up to an independent exponentially distributed time horizon. We show that all the fundamental formulas generalize to time horizons having matrix exponential distributions, and the structure is preserved. Essentially, the positive killing rate is replaced by a matrix with eigenvalues in the right half of the complex plane which, in particular, applies to the positive root of the Laplace exponent and the scale function. Various fundamental properties of thus obtained matrices and functions are established, resulting in an easy to use toolkit. An important application concerns deterministic time horizons which can be well approximated by concentrated matrix exponential distributions. Numerical illustrations are also provided. △ Less

Submitted 20 January, 2021; originally announced January 2021.

arXiv:2101.07987 [pdf, other]

matrixdist: An R Package for Statistical Analysis of Matrix Distributions

Authors: Martin Bladt, Alaric Mueller, Jorge Yslas

Abstract: The matrixdist R package provides a comprehensive suite of tools for the statistical analysis of matrix distributions, including phase-type, inhomogeneous phase-type, discrete phase-type, and related multivariate distributions. This paper introduces the package and its key features, including the estimation of these distributions and their extensions through expectation-maximisation algorithms, as… ▽ More The matrixdist R package provides a comprehensive suite of tools for the statistical analysis of matrix distributions, including phase-type, inhomogeneous phase-type, discrete phase-type, and related multivariate distributions. This paper introduces the package and its key features, including the estimation of these distributions and their extensions through expectation-maximisation algorithms, as well as the implementation of regression through the proportional intensities and mixture-of-experts models. Additionally, the paper provides an overview of the theoretical background, discusses the algorithms and methods implemented in the package, and offers practical examples to illustrate the application of matrixdist in real-world scenarios. The matrixdist R package aims to provide researchers and practitioners a wide set of tools for analysing and modelling complex data using matrix distributions. △ Less

Submitted 15 August, 2023; v1 submitted 20 January, 2021; originally announced January 2021.

arXiv:2101.04941 [pdf, other]

Multivariate phase-type theory for the site frequency spectrum

Authors: Asger Hobolth, Mogens Bladt, Lars Nørvang Andersen

Abstract: Linear functions of the site frequency spectrum (SFS) play a major role for understanding and investigating genetic diversity. Estimators of the mutation rate (e.g. based on the total number of segregating sites or average of the pairwise differences) and tests for neutrality (e.g. Tajima's D) are perhaps the most well-known examples. The distribution of linear functions of the SFS is important fo… ▽ More Linear functions of the site frequency spectrum (SFS) play a major role for understanding and investigating genetic diversity. Estimators of the mutation rate (e.g. based on the total number of segregating sites or average of the pairwise differences) and tests for neutrality (e.g. Tajima's D) are perhaps the most well-known examples. The distribution of linear functions of the SFS is important for constructing confidence intervals for the estimators, and to determine significance thresholds for neutrality tests. These distributions are often approximated using simulation procedures. In this paper we use multivariate phase-type theory to specify, characterize and calculate the distribution of linear functions of the site frequency spectrum. In particular, we show that many of the classical estimators of the mutation rate are distributed according to a discrete phase-type distribution. Neutrality tests, however, are generally not discrete phase-type distributed. For neutrality tests we derive the probability generating function using continuous multivariate phase-type theory, and numerically invert the function to obtain the distribution. A main result is an analytically tractable formula for the probability generating function of the SFS. Software implementation of the phase-type methodology is available in the R package phasty, and R code for the reproduction of our results is available as an accompanying vignette. △ Less

Submitted 13 January, 2021; originally announced January 2021.

MSC Class: 60J90 (Primary) 60J27; 60J28; 60J95; 92D15 (Secondary)

arXiv:2011.03219 [pdf, other]

Mortality modeling and regression with matrix distributions

Authors: Hansjoerg Albrecher, Martin Bladt, Mogens Bladt, Jorge Yslas

Abstract: In this paper we investigate the flexibility of matrix distributions for the modeling of mortality. Starting from a simple Gompertz law, we show how the introduction of matrix-valued parameters via inhomogeneous phase-type distributions can lead to reasonably accurate and relatively parsimonious models for mortality curves across the entire lifespan. A particular feature of the proposed model fram… ▽ More In this paper we investigate the flexibility of matrix distributions for the modeling of mortality. Starting from a simple Gompertz law, we show how the introduction of matrix-valued parameters via inhomogeneous phase-type distributions can lead to reasonably accurate and relatively parsimonious models for mortality curves across the entire lifespan. A particular feature of the proposed model framework is that it allows for a more direct interpretation of the implied underlying aging process than some previous approaches. Subsequently, towards applications of the approach for multi-population mortality modeling, we introduce regression via the concept of proportional intensities, which are more flexible than proportional hazard models, and we show that the two classes are asymptotically equivalent. We illustrate how the model parameters can be estimated from data by providing an adapted EM algorithm for which the likelihood increases at each iteration. The practical feasibility and competitiveness of the proposed approach, including the right-censored case, are illustrated by several sets of mortality and survival data. △ Less

Submitted 1 August, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

arXiv:2006.13003 [pdf, other]

Fitting inhomogeneous phase-type distributions to data: the univariate and the multivariate case

Authors: Hansjoerg Albrecher, Mogens Bladt, Jorge Yslas

Abstract: The class of inhomogeneous phase-type distributions (IPH) was recently introduced in Albrecher and Bladt (2019) as an extension of the classical phase-type (PH) distributions. Like PH distributions, the class of IPH is dense in the class of distributions on the positive halfline, but leads to more parsimonious models in the presence of heavy tails. In this paper we propose a fitting procedure for… ▽ More The class of inhomogeneous phase-type distributions (IPH) was recently introduced in Albrecher and Bladt (2019) as an extension of the classical phase-type (PH) distributions. Like PH distributions, the class of IPH is dense in the class of distributions on the positive halfline, but leads to more parsimonious models in the presence of heavy tails. In this paper we propose a fitting procedure for this class to given data. We furthermore consider an analogous extension of Kulkarni's multivariate phase-type class (Kulkarni, 1989) to the inhomogeneous framework and study parameter estimation for the resulting new and flexible class of multivariate distributions. As a by-product, we amend a previously suggested fitting procedure for the homogeneous multivariate phase-type case and provide appropriate adaptations for censored data. The performance of the algorithms is illustrated in several numerical examples, both for simulated and real-life insurance data. △ Less

Submitted 14 November, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

MSC Class: Primary: 60E05 Secondary: 60J22; 62F10; 62N01; 62P05

arXiv:2006.11088 [pdf, other]

Time series copula models using d-vines and v-transforms

Authors: Martin Bladt, Alexander J. McNeil

Abstract: An approach to modelling volatile financial return series using stationary d-vine copula processes combined with Lebesgue-measure-preserving transformations known as v-transforms is proposed. By develo** a method of stochastically inverting v-transforms, models are constructed that can describe both stochastic volatility in the magnitude of price movements and serial correlation in their directi… ▽ More An approach to modelling volatile financial return series using stationary d-vine copula processes combined with Lebesgue-measure-preserving transformations known as v-transforms is proposed. By develo** a method of stochastically inverting v-transforms, models are constructed that can describe both stochastic volatility in the magnitude of price movements and serial correlation in their directions. In combination with parametric marginal distributions it is shown that these models can rival and sometimes outperform well-known models in the extended GARCH family. △ Less

Submitted 13 July, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

arXiv:2006.07447 [pdf, other]

Efficient simulation of ruin probabilities when claims are mixtures of heavy and light tails

Authors: Hansjörg Albrecher, Martin Bladt, Eleni Vatamidou

Abstract: We consider the classical Cramér-Lundberg risk model with claim sizes that are mixtures of phase-type and subexponential variables. Exploiting a specific geometric compound representation, we propose control variate techniques to efficiently simulate the ruin probability in this situation. The resulting estimators perform well for both small and large initial capital. We quantify the variance redu… ▽ More We consider the classical Cramér-Lundberg risk model with claim sizes that are mixtures of phase-type and subexponential variables. Exploiting a specific geometric compound representation, we propose control variate techniques to efficiently simulate the ruin probability in this situation. The resulting estimators perform well for both small and large initial capital. We quantify the variance reduction as well as the efficiency gain of our method over another fast standard technique based on the classical Pollaczek-Khinchine formula. We provide a numerical example to illustrate the performance, and show that for more time-consuming conditional Monte Carlo techniques, the new series representation also does not compare unfavorably to the one based on the Pollaczek- Khinchine formula. △ Less

Submitted 12 June, 2020; originally announced June 2020.

Comments: 18 pages, 8 figures

arXiv:2003.11122 [pdf, other]

Multivariate fractional phase--type distributions

Authors: Hansjoerg Albrecher, Martin Bladt, Mogens Bladt

Abstract: We extend the Kulkarni class of multivariate phase--type distributions in a natural time--fractional way to construct a new class of multivariate distributions with heavy-tailed Mittag-Leffler(ML)-distributed marginals. The approach relies on assigning rewards to a non--Mar\-ko\-vi\-an jump process with ML sojourn times. This new class complements an earlier multivariate ML construction \cite{mult… ▽ More We extend the Kulkarni class of multivariate phase--type distributions in a natural time--fractional way to construct a new class of multivariate distributions with heavy-tailed Mittag-Leffler(ML)-distributed marginals. The approach relies on assigning rewards to a non--Mar\-ko\-vi\-an jump process with ML sojourn times. This new class complements an earlier multivariate ML construction \cite{multiml} and in contrast to the former also allows for tail dependence. We derive properties and characterizations of this class, and work out some special cases that lead to explicit density representations. △ Less

Submitted 24 March, 2020; originally announced March 2020.

arXiv:2003.10517 [pdf, other]

Multivariate Matrix Mittag--Leffler distributions

Authors: Hansjoerg Albrecher, Martin Bladt, Mogens Bladt

Abstract: We extend the construction principle of multivariate phase-type distributions to establish an analytically tractable class of heavy-tailed multivariate random variables whose marginal distributions are of Mittag-Leffler type with arbitrary index of regular variation. The construction can essentially be seen as allowing a scalar parameter to become matrix-valued. The class of distributions is shown… ▽ More We extend the construction principle of multivariate phase-type distributions to establish an analytically tractable class of heavy-tailed multivariate random variables whose marginal distributions are of Mittag-Leffler type with arbitrary index of regular variation. The construction can essentially be seen as allowing a scalar parameter to become matrix-valued. The class of distributions is shown to be dense among all multivariate positive random variables and hence provides a versatile candidate for the modelling of heavy-tailed, but tail-independent, risks in various fields of application. △ Less

Submitted 23 March, 2020; originally announced March 2020.

arXiv:1908.03390 [pdf, other]

Combined Tail Estimation Using Censored Data and Expert Information

Authors: Martin Bladt, Hansjoerg Albrecher, Jan Beirlant

Abstract: We study tail estimation in Pareto-like settings for datasets with a high percentage of randomly right-censored data, and where some expert information on the tail index is available for the censored observations. This setting arises for instance naturally for liability insurance claims, where actuarial experts build reserves based on the specificity of each open claim, which can be used to improv… ▽ More We study tail estimation in Pareto-like settings for datasets with a high percentage of randomly right-censored data, and where some expert information on the tail index is available for the censored observations. This setting arises for instance naturally for liability insurance claims, where actuarial experts build reserves based on the specificity of each open claim, which can be used to improve the estimation based on the already available data points from closed claims. Through an entropy-perturbed likelihood we derive an explicit estimator and establish a close analogy with Bayesian methods. Embedded in an extreme value approach, asymptotic normality of the estimator is shown, and when the expert is clair-voyant, a simple combination formula can be deduced, bridging the classical statistical approach with the expert information. Following the aforementioned combination formula, a combination of quantile estimators can be naturally defined. In a simulation study, the estimator is shown to often outperform the Hill estimator for censored observations and recent Bayesian solutions, some of which require more information than usually available. Finally we perform a case study on a motor third-party liability insurance claim dataset, where Hill-type and quantile plots incorporate ultimate values into the estimation procedure in an intuitive manner. △ Less

Submitted 12 November, 2019; v1 submitted 9 August, 2019; originally announced August 2019.

Comments: 21 pages, 9 figures

arXiv:1906.05316 [pdf, other]

Matrix Mittag--Leffler distributions and modeling heavy-tailed risks

Authors: Hansjoerg Albrecher, Martin Bladt, Mogens Bladt

Abstract: In this paper we define the class of matrix Mittag-Leffler distributions and study some of its properties. We show that it can be interpreted as a particular case of an inhomogeneous phase-type distribution with random scaling factor, and alternatively also as the absorption time of a semi-Markov process with Mittag-Leffler distributed interarrival times. We then identify this class and its power… ▽ More In this paper we define the class of matrix Mittag-Leffler distributions and study some of its properties. We show that it can be interpreted as a particular case of an inhomogeneous phase-type distribution with random scaling factor, and alternatively also as the absorption time of a semi-Markov process with Mittag-Leffler distributed interarrival times. We then identify this class and its power transforms as a remarkably parsimonious and versatile family for the modelling of heavy-tailed risks, which overcomes some disadvantages of other approaches like the problem of threshold selection in extreme value theory. We illustrate this point both on simulated data as well as on a set of real-life MTPL insurance data that were modeled differently in the past. △ Less

Submitted 27 April, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

arXiv:1905.04605 [pdf, other]

Matrix calculations for inhomogeneous Markov reward processes, with applications to life insurance and point processes

Authors: Mogens Bladt, Søren Asmussen, Mogens Steffensen

Abstract: A multi--state life insurance model is naturally described in terms of the intensity matrix of an underlying (time--inhomogeneous) Markov process which describes the dynamics for the states of an insured person. Between and at transitions, benefits and premiums are paid, defining a payment process, and the technical reserve is defined as the present value of all future payments of the contract. Cl… ▽ More A multi--state life insurance model is naturally described in terms of the intensity matrix of an underlying (time--inhomogeneous) Markov process which describes the dynamics for the states of an insured person. Between and at transitions, benefits and premiums are paid, defining a payment process, and the technical reserve is defined as the present value of all future payments of the contract. Classical methods for finding the reserve and higher order moments involve the solution of certain differential equations (Thiele and Hattendorf, respectively). In this paper we present an alternative matrix--oriented approach based on general reward considerations for Markov jump processes. The matrix approach provides a general framework for effortlessly setting up general and even complex multi--state models, where moments of all orders are then expressed explicitly in terms of so--called product integrals (matrix--exponentials) of certain matrices. As Thiele and Hattendorf type of theorems can be retrieved immediately from the matrix formulae, this methods also provides a quick and transparent approach to proving these classical results. Methods for obtaining distributions and related properties of interest (e.g. quantiles or survival functions) of the future payments are presented from both a theoretical and practical point of view (via Laplace transforms and methods involving orthogonal polynomials). △ Less

Submitted 11 May, 2019; originally announced May 2019.

Comments: 32 pages, 3 figures

arXiv:1903.07942 [pdf, other]

Threshold selection and trimming in extremes

Authors: Martin Bladt, Hansjoerg Albrecher, Jan Beirlant

Abstract: We consider removing lower order statistics from the classical Hill estimator in extreme value statistics, and compensating for it by rescaling the remaining terms. Trajectories of these trimmed statistics as a function of the extent of trimming turn out to be quite flat near the optimal threshold value. For the regularly varying case, the classical threshold selection problem in tail estimation i… ▽ More We consider removing lower order statistics from the classical Hill estimator in extreme value statistics, and compensating for it by rescaling the remaining terms. Trajectories of these trimmed statistics as a function of the extent of trimming turn out to be quite flat near the optimal threshold value. For the regularly varying case, the classical threshold selection problem in tail estimation is then revisited, both visually via trimmed Hill plots and, for the Hall class, also mathematically via minimizing the expected empirical variance. This leads to a simple threshold selection procedure for the classical Hill estimator which circumvents the estimation of some of the tail characteristics, a problem which is usually the bottleneck in threshold selection. As a by-product, we derive an alternative estimator of the tail index, which assigns more weight to large observations, and works particularly well for relatively lighter tails. A simple ratio statistic routine is suggested to evaluate the goodness of the implied selection of the threshold. We illustrate the favourable performance and the potential of the proposed method with simulation studies and real insurance data. △ Less

Submitted 28 June, 2020; v1 submitted 19 March, 2019; originally announced March 2019.

arXiv:1903.07861 [pdf, ps, other]

Characterisation of exchangeable sequences through empirical distributions

Authors: Martin Bladt, Dimitry Shaiderman

Abstract: It is a well-known fact that an exchangeable sequence has empirical distributions that form a reverse-martingale. This paper is devoted to proof of the converse statement. As a byproduct of the proof for the binary case, we introduce and discuss the notion of two-coloring exchangeability. It is a well-known fact that an exchangeable sequence has empirical distributions that form a reverse-martingale. This paper is devoted to proof of the converse statement. As a byproduct of the proof for the binary case, we introduce and discuss the notion of two-coloring exchangeability. △ Less

Submitted 24 September, 2023; v1 submitted 19 March, 2019; originally announced March 2019.

Comments: 8 pages, 0 figures

arXiv:1812.04139 [pdf, other]

Inhomogeneous phase--type distributions and heavy tails

Authors: Hansjörg Albrecher, Mogens Bladt

Abstract: We extend the construction principle of phase-type (PH) distributions to allow for inhomogeneous transition rates and show that this naturally leads to direct probabilistic descriptions of certain transformations of PH distributions. In particular, the resulting matrix distributions enable to carry over fitting properties of PH distributions to distributions with heavy tails, providing a general m… ▽ More We extend the construction principle of phase-type (PH) distributions to allow for inhomogeneous transition rates and show that this naturally leads to direct probabilistic descriptions of certain transformations of PH distributions. In particular, the resulting matrix distributions enable to carry over fitting properties of PH distributions to distributions with heavy tails, providing a general modelling framework for heavy-tail phenomena. We also illustrate the versatility and parsimony of the proposed approach for the modelling of a real-world heavy-tailed fire insurance dataset. △ Less

Submitted 28 June, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

arXiv:1806.01416 [pdf, other]

Phase-type distributions in population genetics

Authors: Asger Hobolth, Arno Siri-Jégousse, Mogens Bladt

Abstract: Probability modelling for DNA sequence evolution is well established and provides a rich framework for understanding genetic variation between samples of individuals from one or more populations. We show that both classical and more recent models for coalescence (with or without recombination) can be described in terms of the so-called phase-type theory, where complicated and tedious calculations… ▽ More Probability modelling for DNA sequence evolution is well established and provides a rich framework for understanding genetic variation between samples of individuals from one or more populations. We show that both classical and more recent models for coalescence (with or without recombination) can be described in terms of the so-called phase-type theory, where complicated and tedious calculations are circumvented by the use of matrices. The application of phase-type theory consists of describing the stochastic model as a Markov model by appropriately setting up a state space and calculating the corresponding intensity and reward matrices. Formulae of interest are then expressed in terms of these aforementioned matrices. We illustrate this by a few examples calculating the mean, variance and even higher order moments of the site frequency spectrum in the multiple merger coalescent models, and by analysing the mean and variance for the number of segregating sites for multiple samples in the two-locus ancestral recombination graph. We believe that phase-type theory has great potential as a tool for analysing probability models in population genetics. The compact matrix notation is useful for clarification of current models, in particular their formal manipulation (calculation), but also for further development or extensions. △ Less

Submitted 4 June, 2018; originally announced June 2018.

arXiv:1705.04357 [pdf, other]

Fitting phase--type scale mixtures to heavy--tailed data and distributions

Authors: Mogens Bladt, Leonardo Rojas-Nandayapa

Abstract: We consider the fitting of heavy tailed data and distribution with a special attention to distributions with a non--standard shape in the "body" of the distribution. To this end we consider a dense class of heavy tailed distributions introduced recently, employing an EM algorithm for the the maximum likelihood estimates of its parameters. We present methods for fitting to observed data, histograms… ▽ More We consider the fitting of heavy tailed data and distribution with a special attention to distributions with a non--standard shape in the "body" of the distribution. To this end we consider a dense class of heavy tailed distributions introduced recently, employing an EM algorithm for the the maximum likelihood estimates of its parameters. We present methods for fitting to observed data, histograms, censored data, as well as to theoretical distributions. Numerical examples are provided with simulated data and a benchmark reinsurance dataset. We empirically demonstrate that our model can provide excellent fits to heavy--tailed data/distributions with minimal assumptions △ Less

Submitted 11 May, 2017; originally announced May 2017.

arXiv:1405.7728 [pdf, other]

Simulation of multivariate diffusion bridge

Authors: Mogens Bladt, Samuel Finch, Michael Sørensen

Abstract: We propose simple methods for multivariate diffusion bridge simulation, which plays a fundamental role in simulation-based likelihood and Bayesian inference for stochastic differential equations. By a novel application of classical coupling methods, the new approach generalizes a previously proposed simulation method for one-dimensional bridges to the multi-variate setting. First a method of simul… ▽ More We propose simple methods for multivariate diffusion bridge simulation, which plays a fundamental role in simulation-based likelihood and Bayesian inference for stochastic differential equations. By a novel application of classical coupling methods, the new approach generalizes a previously proposed simulation method for one-dimensional bridges to the multi-variate setting. First a method of simulating approximate, but often very accurate, diffusion bridges is proposed. These approximate bridges are used as proposal for easily implementable MCMC algorithms that produce exact diffusion bridges. The new method is much more generally applicable than previous methods. Another advantage is that the new method works well for diffusion bridges in long intervals because the computational complexity of the method is linear in the length of the interval. In a simulation study the new method performs well, and its usefulness is illustrated by an application to Bayesian estimation for the multivariate hyperbolic diffusion model. △ Less

Submitted 29 May, 2014; originally announced May 2014.

Comments: arXiv admin note: text overlap with arXiv:1403.1762

arXiv:1403.1762 [pdf, ps, other]

doi 10.3150/12-BEJ501

Simple simulation of diffusion bridges with application to likelihood inference for diffusions

Authors: Mogens Bladt, Michael Sørensen

Abstract: With a view to statistical inference for discretely observed diffusion models, we propose simple methods of simulating diffusion bridges, approximately and exactly. Diffusion bridge simulation plays a fundamental role in likelihood and Bayesian inference for diffusion processes. First a simple method of simulating approximate diffusion bridges is proposed and studied. Then these approximate bridge… ▽ More With a view to statistical inference for discretely observed diffusion models, we propose simple methods of simulating diffusion bridges, approximately and exactly. Diffusion bridge simulation plays a fundamental role in likelihood and Bayesian inference for diffusion processes. First a simple method of simulating approximate diffusion bridges is proposed and studied. Then these approximate bridges are used as proposal for an easily implemented Metropolis-Hastings algorithm that produces exact diffusion bridges. The new method utilizes time-reversibility properties of one-dimensional diffusions and is applicable to all one-dimensional diffusion processes with finite speed-measure. One advantage of the new approach is that simple simulation methods like the Milstein scheme can be applied to bridge simulation. Another advantage over previous bridge simulation methods is that the proposed method works well for diffusion bridges in long intervals because the computational complexity of the method is linear in the length of the interval. For $ρ$-mixing diffusions the approximate method is shown to be particularly accurate for long time intervals. In a simulation study, we investigate the accuracy and efficiency of the approximate method and compare it to exact simulation methods. In the study, our method provides a very good approximation to the distribution of a diffusion bridge for bridges that are likely to occur in applications to statistical inference. To illustrate the usefulness of the new method, we present an EM-algorithm for a discretely observed diffusion process. △ Less

Submitted 7 March, 2014; originally announced March 2014.

Comments: Published in at http://dx.doi.org/10.3150/12-BEJ501 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

Report number: IMS-BEJ-BEJ501

Journal ref: Bernoulli 2014, Vol. 20, No. 2, 645-675

Showing 1–42 of 42 results for author: Bladt, M