Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff

Holvoet, Freek; Antonio, Katrien; Henckaerts, Roel

Computer Science > Machine Learning

arXiv:2310.12671 (cs)

[Submitted on 19 Oct 2023 (v1), last revised 30 Oct 2023 (this version, v2)]

Title:Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff

Authors:Freek Holvoet, Katrien Antonio, Roel Henckaerts

View PDF

Abstract:Insurers usually turn to generalized linear models for modelling claim frequency and severity data. Due to their success in other fields, machine learning techniques are gaining popularity within the actuarial toolbox. Our paper contributes to the literature on frequency-severity insurance pricing with machine learning via deep learning structures. We present a benchmark study on four insurance data sets with frequency and severity targets in the presence of multiple types of input features. We compare in detail the performance of: a generalized linear model on binned input data, a gradient-boosted tree model, a feed-forward neural network (FFNN), and the combined actuarial neural network (CANN). Our CANNs combine a baseline prediction established with a GLM and GBM, respectively, with a neural network correction. We explain the data preprocessing steps with specific focus on the multiple types of input features typically present in tabular insurance data sets, such as postal codes, numeric and categorical covariates. Autoencoders are used to embed the categorical variables into the neural network and we explore their potential advantages in a frequency-severity setting. Finally, we construct global surrogate models for the neural nets' frequency and severity models. These surrogates enable the translation of the essential insights captured by the FFNNs or CANNs to GLMs. As such, a technical tariff table results that can easily be deployed in practice.

Subjects:	Machine Learning (cs.LG); Risk Management (q-fin.RM)
Cite as:	arXiv:2310.12671 [cs.LG]
	(or arXiv:2310.12671v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.12671

Submission history

From: Freek Holvoet [view email]
[v1] Thu, 19 Oct 2023 12:00:33 UTC (16,273 KB)
[v2] Mon, 30 Oct 2023 10:03:07 UTC (16,274 KB)

Computer Science > Machine Learning

Title:Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators