-
Estimation of spatio-temporal extremes via generative neural networks
Authors:
Christopher Bülte,
Lisa Leimenstoll,
Melanie Schienle
Abstract:
Recent methods in modeling spatial extreme events have focused on utilizing parametric max-stable processes and their underlying dependence structure. In this work, we provide a unified approach for analyzing spatial extremes with little available data by estimating the distribution of model parameters or the spatial dependence directly. By employing recent developments in generative neural networ…
▽ More
Recent methods in modeling spatial extreme events have focused on utilizing parametric max-stable processes and their underlying dependence structure. In this work, we provide a unified approach for analyzing spatial extremes with little available data by estimating the distribution of model parameters or the spatial dependence directly. By employing recent developments in generative neural networks we predict a full sample-based distribution, allowing for direct assessment of uncertainty regarding model parameters or other parameter dependent functionals. We validate our method by fitting several simulated max-stable processes, showing a high accuracy of the approach, regarding parameter estimation, as well as uncertainty quantification. Additional robustness checks highlight the generalization and extrapolation capabilities of the model, while an application to precipitation extremes across Western Germany demonstrates the usability of our approach in real-world scenarios.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Direction Augmentation in the Evaluation of Armed Conflict Predictions
Authors:
Johannes Bracher,
Lotta Rüter,
Fabian Krüger,
Sebastian Lerch,
Melanie Schienle
Abstract:
In many forecasting settings, there is a specific interest in predicting the sign of an outcome variable correctly in addition to its magnitude. For instance, when forecasting armed conflicts, positive and negative log-changes in monthly fatalities represent escalation and de-escalation, respectively, and have very different implications. In the ViEWS forecasting challenge, a prediction competitio…
▽ More
In many forecasting settings, there is a specific interest in predicting the sign of an outcome variable correctly in addition to its magnitude. For instance, when forecasting armed conflicts, positive and negative log-changes in monthly fatalities represent escalation and de-escalation, respectively, and have very different implications. In the ViEWS forecasting challenge, a prediction competition on state-based violence, a novel evaluation score called targeted absolute deviation with direction augmentation (TADDA) has therefore been suggested, which accounts for both for the sign and magnitude of log-changes. While it has a straightforward intuitive motivation, the empirical results of the challenge show that a no-change model always predicting a log-change of zero outperforms all submitted forecasting models under the TADDA score. We provide a statistical explanation for this phenomenon. Analyzing the properties of TADDA, we find that in order to achieve good scores, forecasters often have an incentive to predict no or only modest log-changes. In particular, there is often an incentive to report conservative point predictions considerably closer to zero than the forecaster's actual predictive median or mean. In an empirical application, we demonstrate that a no-change model can be improved upon by tailoring predictions to the particularities of the TADDA score. We conclude by outlining some alternative scoring concepts.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
Robust Knockoffs for Controlling False Discoveries With an Application to Bond Recovery Rates
Authors:
Konstantin Görgen,
Abdolreza Nazemi,
Melanie Schienle
Abstract:
We address challenges in variable selection with highly correlated data that are frequently present in finance, economics, but also in complex natural systems as e.g. weather. We develop a robustified version of the knockoff framework, which addresses challenges with high dependence among possibly many influencing factors and strong time correlation. In particular, the repeated subsampling strateg…
▽ More
We address challenges in variable selection with highly correlated data that are frequently present in finance, economics, but also in complex natural systems as e.g. weather. We develop a robustified version of the knockoff framework, which addresses challenges with high dependence among possibly many influencing factors and strong time correlation. In particular, the repeated subsampling strategy tackles the variability of the knockoffs and the dependency of factors. Simultaneously, we also control the proportion of false discoveries over a grid of all possible values, which mitigates variability of selected factors from ad-hoc choices of a specific false discovery level. In the application for corporate bond recovery rates, we identify new important groups of relevant factors on top of the known standard drivers. But we also show that out-of-sample, the resulting sparse model has similar predictive power to state-of-the-art machine learning models that use the entire set of predictors.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Predicting Value at Risk for Cryptocurrencies With Generalized Random Forests
Authors:
Konstantin Görgen,
Jonas Meirer,
Melanie Schienle
Abstract:
We study the prediction of Value at Risk (VaR) for cryptocurrencies. In contrast to classic assets, returns of cryptocurrencies are often highly volatile and characterized by large fluctuations around single events. Analyzing a comprehensive set of 105 major cryptocurrencies, we show that Generalized Random Forests (GRF) (Athey et al., 2019) adapted to quantile prediction have superior performance…
▽ More
We study the prediction of Value at Risk (VaR) for cryptocurrencies. In contrast to classic assets, returns of cryptocurrencies are often highly volatile and characterized by large fluctuations around single events. Analyzing a comprehensive set of 105 major cryptocurrencies, we show that Generalized Random Forests (GRF) (Athey et al., 2019) adapted to quantile prediction have superior performance over other established methods such as quantile regression, GARCH-type and CAViaR models. This advantage is especially pronounced in unstable times and for classes of highly-volatile cryptocurrencies. Furthermore, we identify important predictors during such times and show their influence on forecasting over time. Moreover, a comprehensive simulation study also indicates that the GRF methodology is at least on par with existing methods in VaR predictions for standard types of financial returns and clearly superior in the cryptocurrency setup.
△ Less
Submitted 24 June, 2022; v1 submitted 24 February, 2022;
originally announced March 2022.
-
How have German University Tuition Fees Affected Enrollment Rates: Robust Model Selection and Design-based Inference in High-Dimensions
Authors:
Konstantin Görgen,
Melanie Schienle
Abstract:
We use official data for all 16 federal German states to study the causal effect of a flat 1000 Euro state-dependent university tuition fee on the enrollment behavior of students during the years 2006-2014. In particular, we show how the variation in the introduction scheme across states and times can be exploited to identify the federal average causal effect of tuition fees by controlling for a l…
▽ More
We use official data for all 16 federal German states to study the causal effect of a flat 1000 Euro state-dependent university tuition fee on the enrollment behavior of students during the years 2006-2014. In particular, we show how the variation in the introduction scheme across states and times can be exploited to identify the federal average causal effect of tuition fees by controlling for a large amount of potentially influencing attributes for state heterogeneity. We suggest a stability post-double selection methodology to robustly determine the causal effect across types in the transparently modeled unknown response components. The proposed stability resampling scheme in the two LASSO selection steps efficiently mitigates the risk of model underspecification and thus biased effects when the tuition fee policy decision also depends on relevant variables for the state enrollment rates. Correct inference for the full cross-section state population in the sample requires adequate design -- rather than sampling-based standard errors. With the data-driven model selection and explicit control for spatial cross-effects we detect that tuition fees induce substantial migration effects where the mobility occurs both from fee but also from non-fee states suggesting also a general movement for quality. Overall, we find a significant negative impact of up to 4.5 percentage points of fees on student enrollment. This is in contrast to plain one-step LASSO or previous empirical studies with full fixed effects linear panel regressions which generally underestimate the size and get an only insignificant effect.
△ Less
Submitted 4 January, 2021; v1 submitted 18 September, 2019;
originally announced September 2019.
-
Nonparametric regression with nonparametrically generated covariates
Authors:
Enno Mammen,
Christoph Rothe,
Melanie Schienle
Abstract:
We analyze the statistical properties of nonparametric regression estimators using covariates which are not directly observable, but have be estimated from data in a preliminary step. These so-called generated covariates appear in numerous applications, including two-stage nonparametric regression, estimation of simultaneous equation models or censored regression models. Yet so far there seems to…
▽ More
We analyze the statistical properties of nonparametric regression estimators using covariates which are not directly observable, but have be estimated from data in a preliminary step. These so-called generated covariates appear in numerous applications, including two-stage nonparametric regression, estimation of simultaneous equation models or censored regression models. Yet so far there seems to be no general theory for their impact on the final estimator's statistical properties. Our paper provides such results. We derive a stochastic expansion that characterizes the influence of the generation step on the final estimator, and use it to derive rates of consistency and asymptotic distributions accounting for the presence of generated covariates.
△ Less
Submitted 24 July, 2012;
originally announced July 2012.
-
CMOS-Based Biosensor Arrays
Authors:
R. Thewes,
C. Paulus,
M. Schienle,
F. Hofmann,
A. Frey,
R. Brederlow,
M. Augustyniak,
M. Jenkner,
B. Eversmann,
P. Schindler-Bauer,
M. Atzesberger,
B. Holzapfl,
G. Beer,
T. Haneder,
H. -C. Hanke
Abstract:
CMOS-based sensor array chips provide new and attractive features as compared to today's standard tools for medical, diagnostic, and biotechnical applications. Examples for molecule- and cell-based approaches and related circuit design issues are discussed.
CMOS-based sensor array chips provide new and attractive features as compared to today's standard tools for medical, diagnostic, and biotechnical applications. Examples for molecule- and cell-based approaches and related circuit design issues are discussed.
△ Less
Submitted 25 October, 2007;
originally announced October 2007.