-
Autocalibration and Tweedie-dominance for Insurance Pricing with Machine Learning
Authors:
Michel Denuit,
Arthur Charpentier,
Julien Trufin
Abstract:
Boosting techniques and neural networks are particularly effective machine learning methods for insurance pricing. Often in practice, there are nevertheless endless debates about the choice of the right loss function to be used to train the machine learning model, as well as about the appropriate metric to assess the performances of competing models. Also, the sum of fitted values can depart from…
▽ More
Boosting techniques and neural networks are particularly effective machine learning methods for insurance pricing. Often in practice, there are nevertheless endless debates about the choice of the right loss function to be used to train the machine learning model, as well as about the appropriate metric to assess the performances of competing models. Also, the sum of fitted values can depart from the observed totals to a large extent and this often confuses actuarial analysts. The lack of balance inherent to training models by minimizing deviance outside the familiar GLM with canonical link setting has been empirically documented in Wüthrich (2019, 2020) who attributes it to the early stop** rule in gradient descent methods for model fitting. The present paper aims to further study this phenomenon when learning proceeds by minimizing Tweedie deviance. It is shown that minimizing deviance involves a trade-off between the integral of weighted differences of lower partial moments and the bias measured on a specific scale. Autocalibration is then proposed as a remedy. This new method to correct for bias adds an extra local GLM step to the analysis. Theoretically, it is shown that it implements the autocalibration concept in pure premium calculation and ensures that balance also holds on a local scale, not only at portfolio level as with existing bias-correction techniques. The convex order appears to be the natural tool to compare competing models, putting a new light on the diagnostic graphs and associated metrics proposed by Denuit et al. (2019).
△ Less
Submitted 9 July, 2021; v1 submitted 5 March, 2021;
originally announced March 2021.
-
From Pareto to Weibull -- a constructive review of distributions on $\mathbb{R}^+$
Authors:
Corinne Sinner,
Yves Dominicy,
Julien Trufin,
Wout Waterschoot,
Patrick Weber,
Christophe Ley
Abstract:
Power laws and power laws with exponential cut-off are two distinct families of distributions on the positive real half-line. In the present paper, we propose a unified treatment of both families by building a family of distributions that interpolates between them, which we call Interpolating Family (IF) of distributions. Our original construction, which relies on techniques from statistical physi…
▽ More
Power laws and power laws with exponential cut-off are two distinct families of distributions on the positive real half-line. In the present paper, we propose a unified treatment of both families by building a family of distributions that interpolates between them, which we call Interpolating Family (IF) of distributions. Our original construction, which relies on techniques from statistical physics, provides a connection for hitherto unrelated distributions like the Pareto and Weibull distributions, and sheds new light on them. The IF also contains several distributions that are neither of power law nor of power law with exponential cut-off type. We calculate quantile-based properties, moments and modes for the IF. This allows us to review known properties of famous distributions on $\mathbb{R}^+$ and to provide in a single sweep these characteristics for various less known (and new) special cases of our Interpolating Family.
△ Less
Submitted 17 February, 2022; v1 submitted 20 December, 2020;
originally announced December 2020.
-
An Interpolating Family of Size Distributions
Authors:
Corinne Sinner,
Yves Dominicy,
Christophe Ley,
Julien Trufin,
Patrick Weber
Abstract:
We introduce a new five-parameter family of size distributions on the semi-finite interval $[x_0, \infty), x_0 \geqslant 0$, with two attractive features. First, it interpolates between power laws, such as the Pareto distribution, and power laws with exponential cut-off, such as the Weibull distribution. The proposed family is thus very flexible and spans over a broad range of well-known size dist…
▽ More
We introduce a new five-parameter family of size distributions on the semi-finite interval $[x_0, \infty), x_0 \geqslant 0$, with two attractive features. First, it interpolates between power laws, such as the Pareto distribution, and power laws with exponential cut-off, such as the Weibull distribution. The proposed family is thus very flexible and spans over a broad range of well-known size distributions which are special cases of our family. Second, it has important tractability advantages over the popular five-parameter Generalized Beta distribution. We derive the hazard function, survival function, modes and quantiles, propose a random number generation procedure and discuss maximum likelihood estimation issues. Finally, we illustrate the wide applicability and fitting capacities of our new model on basis of three real data sets from very diverse domains, namely actuarial science, environmental science and survival analysis.
△ Less
Submitted 7 July, 2016; v1 submitted 14 June, 2016;
originally announced June 2016.