-
European Football Player Valuation: Integrating Financial Models and Network Theory
Authors:
Albert Cohen,
Jimmy Risk
Abstract:
This paper presents a new framework for player valuation in European football by fusing principles from financial mathematics and network theory. The valuation model leverages a "passing matrix" to encapsulate player interactions on the field, utilizing centrality measures to quantify individual influence. Unlike traditional approaches, this model is both metric-driven and cohort-free, providing a…
▽ More
This paper presents a new framework for player valuation in European football by fusing principles from financial mathematics and network theory. The valuation model leverages a "passing matrix" to encapsulate player interactions on the field, utilizing centrality measures to quantify individual influence. Unlike traditional approaches, this model is both metric-driven and cohort-free, providing a dynamic and individualized framework for ascertaining a player's fair market value. The methodology is empirically validated through a case study in European football, employing real-world match and financial data. The paper advances the disciplines of sports analytics and financial mathematics by offering a cross-disciplinary mechanism for player valuation, and also links together two well-known econometric methods in marginal revenue product and expected present valuation.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Congressional Districting: "Rocks-Pebbles-Sand"
Authors:
Jimmy Risk,
Jennifer Switkes,
Ann Zhang
Abstract:
As a case study into an algorithmic approach to congressional districting, North Carolina provides a lot to explore. Statistical modeling has called into question whether recent North Carolina district plans are unbiased. In particular, the literature suggests that the district plan used in the 2016 U.S. House of Representatives election yields outlier results that are statistically unlikely to be…
▽ More
As a case study into an algorithmic approach to congressional districting, North Carolina provides a lot to explore. Statistical modeling has called into question whether recent North Carolina district plans are unbiased. In particular, the literature suggests that the district plan used in the 2016 U.S. House of Representatives election yields outlier results that are statistically unlikely to be obtained without the application of bias. Therefore, methods for creating strong and fair district plans are needed. Informed by previous districting models and algorithms, we build a model and algorithm to produce an ensemble of viable Congressional district plans. Our work contributes a ``Rocks-Pebbles-Sand'' concept and procedure facilitating reasonable population equity with a small overall number of county splits among districts. Additionally, our methodology minimizes the initial need for granular, precinct-level data, thereby reducing the risk of covert gerrymandering. This case study indicates plausibility of an approach built upon an easy-to-understand intuition.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Expressive Mortality Models through Gaussian Process Kernels
Authors:
Mike Ludkovski,
Jimmy Risk
Abstract:
We develop a flexible Gaussian Process (GP) framework for learning the covariance structure of Age- and Year-specific mortality surfaces. Utilizing the additive and multiplicative structure of GP kernels, we design a genetic programming algorithm to search for the most expressive kernel for a given population. Our compositional search builds off the Age-Period-Cohort (APC) paradigm to construct a…
▽ More
We develop a flexible Gaussian Process (GP) framework for learning the covariance structure of Age- and Year-specific mortality surfaces. Utilizing the additive and multiplicative structure of GP kernels, we design a genetic programming algorithm to search for the most expressive kernel for a given population. Our compositional search builds off the Age-Period-Cohort (APC) paradigm to construct a covariance prior best matching the spatio-temporal dynamics of a mortality dataset. We apply the resulting genetic algorithm (GA) on synthetic case studies to validate the ability of the GA to recover APC structure, and on real-life national-level datasets from the Human Mortality Database. Our machine-learning based analysis provides novel insight into the presence/absence of Cohort effects in different populations, and into the relative smoothness of mortality surfaces along the Age and Year dimensions. Our modelling work is done with the PyTorch libraries in Python and provides an in-depth investigation of employing GA to aid in compositional kernel search for GP surrogates.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.
-
Sequential Design and Spatial Modeling for Portfolio Tail Risk Measurement
Authors:
Michael Ludkovski,
James Risk
Abstract:
We consider calculation of capital requirements when the underlying economic scenarios are determined by simulatable risk factors. In the respective nested simulation framework, the goal is to estimate portfolio tail risk, quantified via VaR or TVaR of a given collection of future economic scenarios representing factor levels at the risk horizon. Traditionally, evaluating portfolio losses of an ou…
▽ More
We consider calculation of capital requirements when the underlying economic scenarios are determined by simulatable risk factors. In the respective nested simulation framework, the goal is to estimate portfolio tail risk, quantified via VaR or TVaR of a given collection of future economic scenarios representing factor levels at the risk horizon. Traditionally, evaluating portfolio losses of an outer scenario is done by computing a conditional expectation via inner-level Monte Carlo and is computationally expensive. We introduce several inter-related machine learning techniques to speed up this computation, in particular by properly accounting for the simulation noise. Our main workhorse is an advanced Gaussian Process (GP) regression approach which uses nonparametric spatial modeling to efficiently learn the relationship between the stochastic factors defining scenarios and corresponding portfolio value. Leveraging this emulator, we develop sequential algorithms that adaptively allocate inner simulation budgets to target the quantile region. The GP framework also yields better uncertainty quantification for the resulting VaR/TVaR estimators that reduces bias and variance compared to existing methods. We illustrate the proposed strategies with two case-studies in two and six dimensions.
△ Less
Submitted 17 May, 2018; v1 submitted 14 October, 2017;
originally announced October 2017.
-
Gaussian Process Models for Mortality Rates and Improvement Factors
Authors:
Mike Ludkovski,
Jimmy Risk,
Howard Zail
Abstract:
We develop a Gaussian process ("GP") framework for modeling mortality rates and mortality improvement factors. GP regression is a nonparametric, data-driven approach for determining the spatial dependence in mortality rates and jointly smoothing raw rates across dimensions, such as calendar year and age. The GP model quantifies uncertainty associated with smoothed historical experience and generat…
▽ More
We develop a Gaussian process ("GP") framework for modeling mortality rates and mortality improvement factors. GP regression is a nonparametric, data-driven approach for determining the spatial dependence in mortality rates and jointly smoothing raw rates across dimensions, such as calendar year and age. The GP model quantifies uncertainty associated with smoothed historical experience and generates full stochastic trajectories for out-of-sample forecasts. Our framework is well suited for updating projections when newly available data arrives, and for dealing with "edge" issues where credibility is lower. We present a detailed analysis of Gaussian process model performance for US mortality experience based on the CDC datasets. We investigate the interaction between mean and residual modeling, Bayesian and non-Bayesian GP methodologies, accuracy of in-sample and out-of-sample forecasting, and stability of model parameters. We also document the general decline, along with strong age-dependency, in mortality improvement factors over the past few years, contrasting our findings with the Society of Actuaries ("SOA") MP-2014 and -2015 models that do not fully reflect these recent trends.
△ Less
Submitted 11 April, 2018; v1 submitted 29 August, 2016;
originally announced August 2016.
-
Statistical Emulators for Pricing and Hedging Longevity Risk Products
Authors:
James Risk,
Michael Ludkovski
Abstract:
We propose the use of statistical emulators for the purpose of valuing mortality-linked contracts in stochastic mortality models. Such models typically require (nested) evaluation of expected values of nonlinear functionals of multi-dimensional stochastic processes. Except in the simplest cases, no closed-form expressions are available, necessitating numerical approximation. Rather than building a…
▽ More
We propose the use of statistical emulators for the purpose of valuing mortality-linked contracts in stochastic mortality models. Such models typically require (nested) evaluation of expected values of nonlinear functionals of multi-dimensional stochastic processes. Except in the simplest cases, no closed-form expressions are available, necessitating numerical approximation. Rather than building ad hoc analytic approximations, we advocate the use of modern statistical tools from machine learning to generate a flexible, non-parametric surrogate for the true map**s. This method allows performance guarantees regarding approximation accuracy and removes the need for nested simulation. We illustrate our approach with case studies involving (i) a Lee-Carter model with mortality shocks, (ii) index-based static hedging with longevity basis risk; (iii) a Cairns-Blake-Dowd stochastic survival probability model.
△ Less
Submitted 14 September, 2015; v1 submitted 3 August, 2015;
originally announced August 2015.
-
Correlations between Google search data and Mortality Rates
Authors:
James Risk
Abstract:
Inspired by correlations recently discovered between Google search data and financial markets, we show correlations between Google search data mortality rates. Words with negative connotations may provide for increased mortality rates, while words with positive connotations may provide for decreased mortality rates, and so statistical methods were employed to determine to investigate further.
Inspired by correlations recently discovered between Google search data and financial markets, we show correlations between Google search data mortality rates. Words with negative connotations may provide for increased mortality rates, while words with positive connotations may provide for decreased mortality rates, and so statistical methods were employed to determine to investigate further.
△ Less
Submitted 6 October, 2012; v1 submitted 11 September, 2012;
originally announced September 2012.