-
Emoji Driven Crypto Assets Market Reactions
Authors:
Xiaorui Zuo,
Yao-Tsung Chen,
Wolfgang Karl Härdle
Abstract:
In the burgeoning realm of cryptocurrency, social media platforms like Twitter have become pivotal in influencing market trends and investor sentiments. In our study, we leverage GPT-4 and a fine-tuned transformer-based BERT model for a multimodal sentiment analysis, focusing on the impact of emoji sentiment on cryptocurrency markets. By translating emojis into quantifiable sentiment data, we correlate these insights with key market indicators like BTC Price and the VCRIX index. Our architecture's analysis of emoji sentiment demonstrated a distinct advantage in predictive power over FinBERT's pure text sentiment analysis. This approach may feed into the development of trading strategies aimed at utilizing social media elements to identify and forecast market trends. Crucially, our findings suggest that strategies based on emoji sentiment can facilitate the avoidance of significant market downturns and contribute to the stabilization of returns. This research underscores the practical benefits of integrating advanced AI-driven analyses into financial strategies, offering a nuanced perspective on the interplay between digital communication and market dynamics in an academic context.
Submitted 4 May, 2024; v1 submitted 16 February, 2024;
originally announced February 2024.
-
Forecasting Cryptocurrency Prices Using Deep Learning: Integrating Financial, Blockchain, and Text Data
Authors:
Vincent Gurgul,
Stefan Lessmann,
Wolfgang Karl Härdle
Abstract:
This paper explores the application of Machine Learning (ML) and Natural Language Processing (NLP) techniques in cryptocurrency price forecasting, specifically Bitcoin (BTC) and Ethereum (ETH). Focusing on news and social media data, primarily from Twitter and Reddit, we analyse the influence of public sentiment on cryptocurrency valuations using advanced deep learning NLP methods. Alongside conventional price regression, we treat cryptocurrency price forecasting as a classification problem. This includes both the prediction of price movements (up or down) and the identification of local extrema. We compare the performance of various ML models, both with and without NLP data integration. Our findings reveal that incorporating NLP data significantly enhances the forecasting performance of our models. We discover that pre-trained models, such as Twitter-RoBERTa and BART MNLI, are highly effective in capturing market sentiment, and that fine-tuning Large Language Models (LLMs) also yields substantial forecasting improvements. Notably, the BART MNLI zero-shot classification model shows considerable proficiency in extracting bullish and bearish signals from textual data. All of our models consistently generate profit across different validation scenarios, with no observed decline in profits or reduction in the impact of NLP data over time. The study highlights the potential of text analysis in improving financial forecasts and demonstrates the effectiveness of various NLP techniques in capturing nuanced market sentiment.
Submitted 23 November, 2023;
originally announced November 2023.
-
A novel statistical framework for the analysis of the degree of technology adoption
Authors:
Vahidin Jeleskovic,
David Alexander Behrens,
Wolfgang Karl Härdle
Abstract:
Technology adoption research aims to determine why and how individuals, corporations, and industries start using new technology. Furthermore, technology adoption itself is decomposed into underlying sub-processes, which are characterized by a finite number of sequential states in order to capture its evolutionary nature. Building upon that, this paper constructs a technology adoption index that allows for statistical testing. This new framework is flexible with respect to the number of underlying models, and accounts for nonlinearities within the evolution of technology adoption. It can be considered novel because it enables a quantitative analysis of technology adoption that has not existed before. Subsequently, this framework is applied to an integrated model of technology adoption.
Submitted 18 March, 2023;
originally announced March 2023.
-
Robustifying Markowitz
Authors:
Wolfgang Karl Härdle,
Yegor Klochkov,
Alla Petukhina,
Nikita Zhivotovskiy
Abstract:
Markowitz mean-variance portfolios with sample mean and covariance as input parameters feature numerous issues in practice. They perform poorly out of sample due to estimation error, and they exhibit extreme weights together with high sensitivity to changes in input parameters. The heavy-tail characteristics of financial time series are in fact the cause of these erratic fluctuations of weights, which consequently create substantial transaction costs. In robustifying the weights, we present a toolbox for stabilizing costs and weights for global minimum Markowitz portfolios. Utilizing a projected gradient descent (PGD) technique, we avoid the estimation and inversion of the covariance operator as a whole and concentrate on robust estimation of the gradient descent increment. Using modern tools of robust statistics, we construct a computationally efficient estimator with almost Gaussian properties based on median-of-means uniformly over weights. This robustified Markowitz approach is confirmed by empirical studies on equity markets. We demonstrate that robustified portfolios reach the lowest turnover compared to shrinkage-based and constrained portfolios while preserving or slightly improving out-of-sample performance.
Submitted 28 December, 2022;
originally announced December 2022.
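The median-of-means idea named in the abstract above can be illustrated with a minimal scalar sketch. This is not the authors' estimator, which targets the gradient descent increment uniformly over portfolio weights; it only shows the basic block-mean-then-median construction. The function name `median_of_means`, the block count, and the random shuffling are assumptions made for illustration.

```python
import numpy as np

def median_of_means(x, n_blocks=5, seed=0):
    """Scalar median-of-means: shuffle, split into blocks, take the median of block means.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    x = np.asarray(x, dtype=float)
    rng = np.random.default_rng(seed)
    shuffled = x[rng.permutation(len(x))]         # random assignment of points to blocks
    blocks = np.array_split(shuffled, n_blocks)   # near-equal block sizes
    block_means = [block.mean() for block in blocks]
    return float(np.median(block_means))

# One extreme observation contaminates at most one block mean,
# so the median of the block means is unaffected here.
sample = np.array([1.0] * 99 + [1e6])
robust_mean = median_of_means(sample, n_blocks=5)
plain_mean = float(sample.mean())
```

In this toy sample the plain mean is pulled far away from 1 by the single outlier, while the median-of-means estimate stays at 1, which is the kind of heavy-tail robustness the abstract refers to.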
-
Shapley Curves: A Smoothing Perspective
Authors:
Ratmir Miftachov,
Georg Keilbar,
Wolfgang Karl Härdle
Abstract:
This paper addresses the limited statistical understanding of Shapley values as a variable importance measure from a nonparametric (or smoothing) perspective. We introduce population-level \textit{Shapley curves} to measure the true variable importance, determined by the conditional expectation function and the distribution of covariates. Having defined the estimand, we derive minimax convergence rates and asymptotic normality under general conditions for the two leading estimation strategies. For finite sample inference, we propose a novel version of the wild bootstrap procedure tailored for capturing lower-order terms in the estimation of Shapley curves. Numerical studies confirm our theoretical findings, and an empirical application analyzes the determining factors of vehicle prices.
Submitted 3 April, 2024; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Quantinar: a blockchain p2p ecosystem for honest scientific research
Authors:
Raul Bag,
Bruno Spilak,
Julian Winkel,
Wolfgang Karl Härdle
Abstract:
In the Information Age, the power of data and correct statistical analysis has never been more prevalent. Academics and practitioners nowadays require an accurate application of quantitative methods. Yet many branches are subject to a crisis of integrity, which shows in the improper use of statistical models, $p$-hacking, HARKing, or failure to replicate results. We propose the use of a Peer-to-Peer (P2P) ecosystem based on a blockchain network, Quantinar (quantinar.com), to support quantitative analytics knowledge paired with code in the form of Quantlets (quantlet.com) or software snippets. The integration of blockchain technology makes Quantinar a decentralized autonomous organization (DAO) that ensures fully transparent and reproducible scientific research.
Submitted 31 March, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
A Data-driven Case-based Reasoning in Bankruptcy Prediction
Authors:
Wei Li,
Wolfgang Karl Härdle,
Stefan Lessmann
Abstract:
There has been intensive research regarding machine learning models for predicting bankruptcy in recent years. However, the lack of interpretability limits their growth and practical implementation. This study proposes a data-driven explainable case-based reasoning (CBR) system for bankruptcy prediction. Empirical results from a comparative study show that the proposed approach outperforms existing, alternative CBR systems and is competitive with state-of-the-art machine learning models. We also demonstrate that the asymmetrical feature similarity comparison mechanism in the proposed CBR system can effectively capture the asymmetrically distributed nature of financial attributes, such as a few companies controlling more cash than the majority, hence improving both the accuracy and explainability of predictions. In addition, we carefully examine the explainability of the CBR system in the decision-making process of bankruptcy prediction. While much research suggests a trade-off between improving prediction accuracy and explainability, our findings show a prospective research avenue in which an explainable model that thoroughly incorporates data attributes by design can reconcile the dilemma.
Submitted 2 November, 2022;
originally announced November 2022.
-
Risk budget portfolios with convex Non-negative Matrix Factorization
Authors:
Bruno Spilak,
Wolfgang Karl Härdle
Abstract:
We propose a portfolio allocation method based on risk factor budgeting using convex Nonnegative Matrix Factorization (NMF). Unlike classical factor analysis, PCA, or ICA, NMF ensures positive factor loadings to obtain interpretable long-only portfolios. As the NMF factors represent separate sources of risk, they have a quasi-diagonal correlation matrix, promoting diversified portfolio allocations. We evaluate our method in the context of volatility targeting on two long-only global portfolios of cryptocurrencies and traditional assets. Our method outperforms classical portfolio allocations regarding diversification and presents a better risk profile than hierarchical risk parity (HRP). We assess the robustness of our findings using Monte Carlo simulation.
Submitted 12 June, 2023; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Hedging Cryptocurrency Options
Authors:
Jovanka Lili Matic,
Natalie Packham,
Wolfgang Karl Härdle
Abstract:
The cryptocurrency market is volatile, non-stationary and non-continuous. Together with liquid derivatives markets, this poses a unique opportunity to study risk management, especially the hedging of options, in a turbulent market. We study the hedge behaviour and effectiveness for the class of affine jump diffusion models and infinite activity Lévy processes. First, market data is calibrated to stochastic volatility inspired (SVI)-implied volatility surfaces to price options. To cover a wide range of market dynamics, we generate Monte Carlo price paths using an SVCJ model (stochastic volatility with correlated jumps), a close-to-actual-market GARCH-filtered kernel density estimation as well as a historical backtest. In all three settings, options are dynamically hedged with Delta, Delta-Gamma, Delta-Vega and Minimum Variance strategies. Including a wide range of market models allows us to understand the trade-off in hedge performance between complete, but overly parsimonious models, and more complex, but incomplete models. The calibration results reveal a strong indication for stochastic volatility, low jump frequency and evidence of infinite activity. Short-dated options are less sensitive to volatility or Gamma hedges. For longer-dated options, tail risk is consistently reduced by multiple-instrument hedges, in particular by employing complete market models with stochastic volatility.
Submitted 2 December, 2022; v1 submitted 23 November, 2021;
originally announced December 2021.
-
Understanding jumps in high frequency digital asset markets
Authors:
Danial Saef,
Odett Nagy,
Sergej Sizov,
Wolfgang Karl Härdle
Abstract:
While attention is a predictor for digital asset prices, and jumps in Bitcoin prices are well-known, we know little about its alternatives. Studying high frequency crypto data gives us the unique possibility to confirm that cross market digital asset returns are driven by high frequency jumps clustered around black swan events, resembling volatility and trading volume seasonalities. Regressions show that intra-day jumps significantly influence end of day returns in size and direction. This provides fundamental research for crypto option pricing models. However, we need better econometric methods for capturing the specific market microstructure of cryptos. All calculations are reproducible via the quantlet.com technology.
Submitted 18 October, 2021;
originally announced October 2021.
-
A Time-Varying Network for Cryptocurrencies
Authors:
Li Guo,
Wolfgang Karl Härdle,
Yubo Tao
Abstract:
Cryptocurrency return cross-predictability and technological similarity yield information on risk propagation and market segmentation. To investigate these effects, we build a time-varying network for cryptocurrencies, based on the evolution of return cross-predictability and technological similarities. We develop a dynamic covariate-assisted spectral clustering method to consistently estimate the latent community structure of the cryptocurrency network that accounts for both sets of information. We demonstrate that investors can achieve better risk diversification by investing in cryptocurrencies from different communities. A cross-sectional portfolio that implements an inter-crypto momentum trading strategy earns a 1.08% daily return. By dissecting the portfolio returns on behavioral factors, we confirm that our results are not driven by behavioral mechanisms.
Submitted 26 August, 2021;
originally announced August 2021.
-
Cooling Measures and Housing Wealth: Evidence from Singapore
Authors:
Wolfgang Karl Härdle,
Rainer Schulz,
Taojun Sie
Abstract:
Excessive house price growth was at the heart of the financial crisis in 2007/08. Since then, many countries have added cooling measures to their regulatory frameworks. It has been found that these measures can indeed control price growth, but no one has examined whether this has adverse consequences for the housing wealth distribution. We examine this for Singapore, which started in 2009 to target price growth over ten rounds in total. We find that welfare from housing wealth in the last round might not be higher than before 2009. This depends on the deflator used to convert nominal into real prices. Irrespective of the deflator, we can reject that welfare increased monotonically over the different rounds.
Submitted 26 August, 2021;
originally announced August 2021.
-
Networks of News and Cross-Sectional Returns
Authors:
Junjie Hu,
Wolfgang Karl Härdle
Abstract:
We uncover networks from news articles to study cross-sectional stock returns. By analyzing a huge dataset of more than 1 million news articles collected from the internet, we construct time-varying directed networks of the S&P500 stocks. The well-defined directed news networks are formed based on a modest assumption about firm-specific news structure, and we propose an algorithm to tackle type-I errors in identifying the stock tickers. We find strong evidence for a comovement effect between the returns of news-linked stocks and a reversal effect from the lead stock return on the 1-day ahead follower stock return, after controlling for many known effects. Furthermore, a series of portfolio tests reveals that the news network attention proxy, network degree, provides robust and significant cross-sectional predictability of monthly stock returns. Among different types of news linkages, the linkages of within-sector stocks, large size lead firms, and lead firms with lower stock liquidity are crucial for cross-sectional predictability.
Submitted 17 October, 2021; v1 submitted 12 August, 2021;
originally announced August 2021.
-
Cryptocurrency Dynamics: Rodeo or Ascot?
Authors:
Konstantin Häusler,
Wolfgang Karl Härdle
Abstract:
We model the dynamics of the cryptocurrency (CC) asset class via a stochastic volatility with correlated jumps (SVCJ) model with rolling-window parameter estimates. By analyzing the time series of parameters, we observe stylized patterns that are robust to changes of the window size and supported by cluster analysis. During bullish periods, volatility stabilizes at low levels and the size and volatility of jumps in mean decrease. In bearish periods, though, volatility increases and takes longer to return to its long-run trend. Furthermore, jumps in mean and jumps in volatility are independent. With the rise of the CC market in 2017, a level shift of the volatility of volatility occurred. All codes are available on Quantlet.com.
Submitted 6 January, 2022; v1 submitted 23 March, 2021;
originally announced March 2021.
-
K-expectiles clustering
Authors:
Bingling Wang,
Yinxing Li,
Wolfgang Karl Härdle
Abstract:
$K$-means clustering is one of the most widely used partitioning algorithms in cluster analysis due to its simplicity and computational efficiency. However, $K$-means does not provide an appropriate clustering result when applied to data with non-spherically shaped clusters. We propose a novel partitioning clustering algorithm based on expectiles. The cluster centers are defined as multivariate expectiles and clusters are searched via a greedy algorithm by minimizing the within-cluster '$τ$-variance'. We suggest two schemes: fixed $τ$ clustering, and adaptive $τ$ clustering. Validated by simulation results, this method beats both $K$-means and spectral clustering on data with asymmetrically shaped clusters, or clusters with a complicated structure, including asymmetric normal, beta, skewed $t$ and $F$ distributed clusters. Applications of adaptive $τ$ clustering to crypto-currency (CC) market data are provided. One finds that the expectile clusters of CC markets show the phenomena of a market dominated by institutional investors. The second application is image segmentation. Compared to other center-based clustering methods, the adaptive $τ$ cluster centers of pixel data can better capture and describe the features of an image. The fixed $τ$ clustering brings more flexibility to segmentation with decent accuracy.
Submitted 16 March, 2021;
originally announced March 2021.
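To make the expectile notion in the abstract above concrete, here is a minimal univariate sketch. It uses the standard definition of the sample expectile as the minimizer of an asymmetrically weighted squared loss, computed by iteratively reweighted averaging. The function names `expectile` and `tau_variance`, and the normalization of the weighted loss, are assumptions for illustration; the paper's multivariate centers and greedy cluster search are not reproduced here.

```python
import numpy as np

def expectile(x, tau=0.5, tol=1e-10, max_iter=100):
    """Sample tau-expectile via iteratively reweighted means.

    Points above the current estimate get weight tau, the rest 1 - tau.
    Hypothetical helper for illustration only.
    """
    x = np.asarray(x, dtype=float)
    e = x.mean()  # tau = 0.5 reproduces the ordinary mean
    for _ in range(max_iter):
        w = np.where(x > e, tau, 1.0 - tau)
        e_new = np.sum(w * x) / np.sum(w)
        if abs(e_new - e) < tol:
            return float(e_new)
        e = e_new
    return float(e)

def tau_variance(x, tau=0.5):
    """Asymmetrically weighted mean squared deviation around the tau-expectile.

    The normalization (a plain average) is an assumed convention.
    """
    x = np.asarray(x, dtype=float)
    e = expectile(x, tau)
    w = np.where(x > e, tau, 1.0 - tau)
    return float(np.mean(w * (x - e) ** 2))

data = [1.0, 2.0, 3.0, 10.0]
center_mean = expectile(data, tau=0.5)   # equals the sample mean
center_high = expectile(data, tau=0.9)   # pulled toward the right tail
```

With $τ = 0.5$ the center coincides with the $K$-means style centroid, while larger $τ$ moves the center toward the upper tail, which is what lets expectile-based centers adapt to asymmetric clusters.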
-
Understanding Smart Contracts: Hype or Hope?
Authors:
Elizaveta Zinovyeva,
Raphael C. G. Reule,
Wolfgang Karl Härdle
Abstract:
Smart Contracts are commonly considered to be an important component or even a key to many business solutions in an immense variety of sectors and promise to securely increase their individual efficiency in an ever more digitized environment. Introduced in the early 1990s, the technology has gained a lot of attention with its application to blockchain technology, to an extent that can be considered a veritable hype. Reflecting the growing institutional interest, this intertwined exploratory study between statistics, information technology, and law contrasts these idealistic stories with the data reality and provides a mandatory step of understanding the matter, before any further relevant applications are discussed as being "factually" able to replace traditional constructions. Besides fundamental flaws and application difficulties of currently employed Smart Contracts, the technological drive and enthusiasm backing them may, however, serve as a jump-off board for future developments thrusting well into the presently unshakeable traditional structures.
Submitted 15 March, 2021;
originally announced March 2021.
-
FRM Financial Risk Meter for Emerging Markets
Authors:
Souhir Ben Amor,
Michael Althof,
Wolfgang Karl Härdle
Abstract:
The fast-growing Emerging Market (EM) economies and their improved transparency and liquidity have attracted international investors. However, external price shocks can result in a higher level of volatility as well as domestic policy instability. Therefore, an efficient risk measure and hedging strategies are needed to help investors protect their investments against this risk. In this paper, a daily systemic risk measure, called FRM (Financial Risk Meter), is proposed. The FRM-EM is applied to capture systemic risk behavior embedded in the returns of the 25 largest EM FIs, covering the BRIMST (Brazil, Russia, India, Mexico, South Africa, and Turkey), and thereby reflects the financial linkages between these economies. Concerning the Macro factors, in addition to the Adrian and Brunnermeier (2016) Macro, we include the EM sovereign yield spread over respective US Treasuries and the above-mentioned countries' currencies. The results indicate that the FRM of EM FIs reached its maximum during the US financial crisis, followed by the COVID-19 crisis, and that the Macro factors explain the BRIMST FIs with various degrees of sensitivity. We then study the relationship between those factors and the tail event network behavior to build our policy recommendations, helping investors choose the suitable market for investment and tail-event optimized portfolios. For that purpose, an overlapping region between portfolio optimization strategies and FRM network centrality is developed. We propose a robust and well-diversified tail-event and cluster risk-sensitive portfolio allocation model and compare it to more classical approaches.
Submitted 10 February, 2021;
originally announced February 2021.
-
SONIC: SOcial Network with Influencers and Communities
Authors:
Cathy Yi-Hsuan Chen,
Wolfgang Karl Härdle,
Yegor Klochkov
Abstract:
The integration of social media characteristics into an econometric framework requires modeling a high-dimensional dynamic network with a parameter dimension typically much larger than the number of observations. To cope with this problem, we introduce SONIC, a new high-dimensional network model that assumes that (1) only a few influencers drive the network dynamics; (2) the community structure of the network is characterized by homogeneity of response to specific influencers, implying their underlying similarity. An estimation procedure is proposed based on a greedy algorithm and LASSO regularization. Through theoretical study and simulations, we show that the matrix parameter can be estimated even when the sample size is smaller than the size of the network. Using a novel dataset retrieved from one of the leading social media platforms, StockTwits, and quantifying users' opinions via natural language processing, we model the opinion network dynamics among a select group of users and further detect the latent communities. With a sparsity regularization, we can identify important nodes in the network.
Submitted 8 February, 2021;
originally announced February 2021.
-
Surrogate Models for Optimization of Dynamical Systems
Authors:
Kainat Khowaja,
Mykhaylo Shcherbatyy,
Wolfgang Karl Härdle
Abstract:
Driven by the increased complexity of dynamical systems, the solution of systems of differential equations through numerical simulation in optimization problems has become computationally expensive. This paper provides a smart data-driven mechanism to construct low-dimensional surrogate models. These surrogate models reduce the computational time for the solution of complex optimization problems by using training instances derived from evaluations of the true objective functions. The surrogate models are constructed using a combination of proper orthogonal decomposition and radial basis functions and provide system responses by simple matrix multiplication. Using relative maximum absolute error as the measure of approximation accuracy, it is shown that surrogate models with Latin hypercube sampling and spline radial basis functions dominate variable order methods in computational time of optimization, while preserving accuracy. These surrogate models also show robustness in the presence of model non-linearities. Therefore, these computationally efficient predictive surrogate models are applicable in various fields, specifically to solve inverse problems and optimal control problems, some examples of which are demonstrated in this paper.
Submitted 24 August, 2021; v1 submitted 22 January, 2021;
originally announced January 2021.
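The proper orthogonal decomposition step mentioned in the abstract above can be sketched with a truncated SVD of a snapshot matrix. This is only the dimension-reduction half of such a surrogate; the radial basis function interpolation of the reduced coefficients is left out. The function names, the snapshot layout (one column per simulation), and the toy data are assumptions for illustration, not the authors' code.

```python
import numpy as np

def pod_basis(snapshots, rank):
    """Leading POD modes of a snapshot matrix of shape (n_state, n_samples).

    Hypothetical helper: the modes are the left singular vectors.
    """
    U, singular_values, _ = np.linalg.svd(snapshots, full_matrices=False)
    return U[:, :rank], singular_values

def project_and_reconstruct(basis, snapshots):
    """Project onto the reduced basis, then map back by matrix multiplication."""
    coeffs = basis.T @ snapshots      # reduced coordinates, shape (rank, n_samples)
    return basis @ coeffs             # approximate full-order states

# Toy snapshots that lie exactly in a 2-dimensional subspace.
rng = np.random.default_rng(0)
snapshots = rng.normal(size=(50, 2)) @ rng.normal(size=(2, 10))
basis, singular_values = pod_basis(snapshots, rank=2)
reconstruction = project_and_reconstruct(basis, snapshots)
max_error = float(np.max(np.abs(reconstruction - snapshots)))
```

Because the toy data has exact rank two, two modes reproduce it up to floating-point error; for real simulation output the decay of the singular values would guide the choice of rank.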
-
Data Analytics Driven Controlling: bridging statistical modeling and managerial intuition
Authors:
Kainat Khowaja,
Danial Saef,
Sergej Sizov,
Wolfgang Karl Härdle
Abstract:
Strategic planning in a corporate environment is often based on experience and intuition, although internal data is usually available and can be a valuable source of information. Predicting merger & acquisition (M&A) events is at the heart of strategic management, yet not sufficiently motivated by data analytics driven controlling. One of the main obstacles in using e.g. count data time series for M&A seems to be the fact that the intensity of M&A is time varying at least in certain business sectors, e.g. communications. We propose a new automatic procedure to bridge this obstacle using novel statistical methods. The proposed approach allows for a selection of adaptive windows in count data sets by detecting significant changes in the intensity of events. We test the efficacy of the proposed method on a simulated count data set and put it into action on various M&A data sets. It is robust to aberrant behaviour and generates accurate forecasts for the evaluated business sectors. It also provides guidance for an a-priori selection of fixed windows for forecasting. Furthermore, it can be generalized to other business lines, e.g. for managing supply chains, sales forecasts, or call center arrivals, thus giving managers new ways for incorporating statistical modeling in strategic planning decisions.
Submitted 25 April, 2022; v1 submitted 10 December, 2020;
originally announced December 2020.
-
Blockchain mechanism and distributional characteristics of cryptos
Authors:
Min-Bin Lin,
Kainat Khowaja,
Cathy Yi-Hsuan Chen,
Wolfgang Karl Härdle
Abstract:
We investigate the relationship between the underlying blockchain mechanism of cryptocurrencies and their distributional characteristics. In addition to price, we emphasise using actual block size and block time as the operational features of cryptos. We use distributional characteristics such as Fourier power spectrum, moments, quantiles, global we optimums, as well as the measures for long term dependencies, risk and noise to summarise the information from crypto time series. With the hypothesis that the blockchain structure explains the distributional characteristics of cryptos, we use characteristic based spectral clustering to cluster the selected cryptos into five groups. We scrutinise these clusters and find that, indeed, the clusters of cryptos share similar mechanisms such as origin of fork, difficulty adjustment frequency, and the nature of block size. This paper provides crypto creators and users with a better understanding of the connection between the blockchain protocol design and the distributional characteristics of cryptos.
Submitted 24 August, 2021; v1 submitted 26 November, 2020;
originally announced November 2020.
-
A data-driven P-spline smoother and the P-Spline-GARCH-models
Authors:
Yuanhua Feng,
Wolfgang Karl Härdle
Abstract:
Penalized spline smoothing of time series and its asymptotic properties are studied. A data-driven algorithm for selecting the smoothing parameter is developed. The proposal is applied to define a semiparametric extension of the well-known Spline-GARCH, called a P-Spline-GARCH, based on the log-data transformation of the squared returns. It is shown that now the errors process is exponentially strong mixing with finite moments of all orders. Asymptotic normality of the P-spline smoother in this context is proved. Practical relevance of the proposal is illustrated by data examples and simulation. The proposal is further applied to value at risk and expected shortfall.
Submitted 25 August, 2021; v1 submitted 19 October, 2020;
originally announced October 2020.
-
Tail-risk protection: Machine Learning meets modern Econometrics
Authors:
Bruno Spilak,
Wolfgang Karl Härdle
Abstract:
Tail risk protection is in the focus of the financial industry and requires solid mathematical and statistical tools, especially when a trading strategy is derived. Recent hype driven by machine learning (ML) mechanisms has raised the necessity to display and understand the functionality of ML tools. In this paper, we present a dynamic tail risk protection strategy that targets a maximum predefined level of risk measured by Value-At-Risk while controlling for participation in bull market regimes. We propose different weak classifiers, parametric and non-parametric, that estimate the exceedance probability of the risk level, from which we derive trading signals in order to hedge tail events. We then compare the different approaches in terms of both statistical and trading strategy performance. Finally, we propose an ensemble classifier that produces a meta tail risk protection strategy improving both generalization and trading performance.
Submitted 24 August, 2021; v1 submitted 7 October, 2020;
originally announced October 2020.
-
An AI approach to measuring financial risk
Authors:
Lining Yu,
Wolfgang Karl Härdle,
Lukas Borke,
Thijs Benschop
Abstract:
Artificial intelligence (AI) brings about new quantitative techniques to assess the state of an economy. Here we describe a new measure for systemic risk: the Financial Risk Meter (FRM). This measure is based on the penalization parameter (lambda) of a linear quantile lasso regression. The FRM is calculated by taking the average of the penalization parameters over the 100 largest US publicly traded financial institutions. We demonstrate the suitability of this AI-based risk measure by comparing the proposed FRM to other measures for systemic risk, such as VIX, SRISK and Google Trends. We find that mutual Granger causality exists between the FRM and these measures, which indicates the validity of the FRM as a systemic risk measure. The implementation of this project is carried out using parallel computing; the codes are published on www.quantlet.de with keyword FRM. The R package RiskAnalytics is another tool with the purpose of integrating and facilitating the research, calculation and analysis methods around the FRM project. The visualization and the up-to-date FRM can be found on hu.berlin/frm.
Submitted 28 September, 2020;
originally announced September 2020.
-
lCARE -- localizing Conditional AutoRegressive Expectiles
Authors:
Xiu Xu,
Andrija Mihoci,
Wolfgang Karl Härdle
Abstract:
We account for time-varying parameters in the conditional expectile-based value at risk (EVaR) model. The EVaR downside risk is more sensitive to the magnitude of portfolio losses compared to the quantile-based value at risk (QVaR). Rather than fitting the expectile models over ad-hoc fixed data windows, this study focuses on parameter instability of tail risk dynamics by utilising a local parametric approach. Our framework yields a data-driven optimal interval length at each time point by a sequential test. Empirical evidence at three stock markets from 2005-2016 shows that the selected lengths account for approximately 3-6 months of daily observations. This method performs favorably compared to the models with one-year fixed intervals, as well as quantile-based candidates, while employing a time invariant portfolio protection (TIPP) strategy for the DAX, FTSE 100 and S&P 500 portfolios. The tail risk measure implied by our model finally provides valuable insights for asset allocation and portfolio insurance.
Submitted 28 September, 2020;
originally announced September 2020.
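The lCARE abstract above contrasts expectile-based with quantile-based value at risk. As a rough illustration only (this is not the localised conditional autoregressive procedure of the paper), the sketch below computes an unconditional sample expectile by iteratively reweighted averaging. The function name `sample_expectile` and the toy return series are assumptions made for this example.

```python
def sample_expectile(xs, tau, tol=1e-12, max_iter=1000):
    """Return the tau-expectile e of the sample xs.

    e solves tau * sum((x - e)+) = (1 - tau) * sum((e - x)+),
    which is a weighted mean with weight tau above e and 1 - tau below.
    """
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must lie strictly between 0 and 1")
    e = sum(xs) / len(xs)  # start from the mean (the 0.5-expectile)
    for _ in range(max_iter):
        weights = [tau if x > e else 1.0 - tau for x in xs]
        new_e = sum(w * x for w, x in zip(weights, xs)) / sum(weights)
        if abs(new_e - e) < tol:
            return new_e
        e = new_e
    return e


# Hypothetical daily returns, used only to exercise the function.
returns = [-3.0, -1.0, 0.0, 1.0, 2.0, 5.0]
low_tail = sample_expectile(returns, 0.05)
```

Because every observation enters through its distance to the expectile, a single very large loss shifts a low-level expectile more than it shifts the corresponding quantile, which is the sensitivity to loss magnitude that the abstract refers to.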
-
A first econometric analysis of the CRIX family
Authors:
Shi Chen,
Cathy Yi-Hsuan Chen,
Wolfgang Karl Härdle
Abstract:
In order to price contingent claims one needs to first understand the dynamics of these indices. Here we provide a first econometric analysis of the CRIX family within a time-series framework. The key steps of our analysis include model selection, estimation and testing. Linear dependence is removed by an ARIMA model; the diagnostic checking resulted in an ARIMA(2,0,2) model for the available sample period from Aug 1st, 2014 to April 6th, 2016. The model residuals showed the well known phenomenon of volatility clustering. Therefore a further refinement led us to an ARIMA(2,0,2)-t-GARCH(1,1) process. This specification conveniently takes care of fat-tail properties that are typical for financial markets. The multivariate GARCH models are implemented on the CRIX index family to explore the interaction.
Submitted 25 September, 2020;
originally announced September 2020.
-
A Machine Learning Based Regulatory Risk Index for Cryptocurrencies
Authors:
Xinwen Ni,
Wolfgang Karl Härdle,
Taojun Xie
Abstract:
Cryptocurrencies' values often respond aggressively to major policy changes, but none of the existing indices informs on the market risks associated with regulatory changes. In this paper, we quantify the risks originating from new regulations on FinTech and cryptocurrencies (CCs), and analyse their impact on market dynamics. Specifically, a Cryptocurrency Regulatory Risk IndeX (CRRIX) is constructed based on policy-related news coverage frequency. The unlabeled news data are collected from the top online CC news platforms and further classified using a Latent Dirichlet Allocation model and Hellinger distance. Our results show that the machine-learning-based CRRIX successfully captures major policy-changing moments. The movements for both the VCRIX, a market volatility index, and the CRRIX are synchronous, meaning that the CRRIX could be helpful for all participants in the cryptocurrency market. The algorithms and Python code are available for research purposes on www.quantlet.de.
Submitted 24 August, 2021; v1 submitted 25 September, 2020;
originally announced September 2020.
-
Towards the interpretation of time-varying regularization parameters in streaming penalized regression models
Authors:
Lenka Zboňáková,
Ricardo Pio Monti,
Wolfgang Karl Härdle
Abstract:
High-dimensional, streaming datasets are ubiquitous in modern applications. Examples range from finance and e-commerce to the study of biomedical and neuroimaging data. As a result, many novel algorithms have been proposed to address challenges posed by such datasets. In this work, we focus on the use of $\ell_1$ regularized linear models in the context of (possibly non-stationary) streaming data. Recently, it has been noted that the choice of the regularization parameter is fundamental in such models and several methods have been proposed which iteratively tune such a parameter in a time-varying manner, thereby allowing the underlying sparsity of estimated models to vary. Moreover, in many applications, inference on the regularization parameter may itself be of interest, as such a parameter is related to the underlying sparsity of the model. However, in this work, we highlight and provide extensive empirical evidence regarding how various (often unrelated) statistical properties in the data can lead to changes in the regularization parameter. In particular, through various synthetic experiments, we demonstrate that changes in the regularization parameter may be driven by changes in the true underlying sparsity, signal-to-noise ratio or even model misspecification. The purpose of this letter is, therefore, to highlight and catalog various statistical properties which induce changes in the associated regularization parameter. We conclude by presenting two applications: one relating to financial data and another to neuroimaging data, where the aforementioned discussion is relevant.
Submitted 25 September, 2020;
originally announced September 2020.
-
Copula-Based Factor Model for Credit Risk Analysis
Authors:
Meng-Jou Lu,
Cathy Yi-Hsuan Chen,
Wolfgang Karl Härdle
Abstract:
A standard quantitative method to assess credit risk employs a factor model based on joint multivariate normal distribution properties. By extending a one-factor Gaussian copula model to make a more accurate default forecast, this paper proposes to incorporate a state-dependent recovery rate into the conditional factor loading, and model them by sharing a unique common factor. The common factor governs the default rate and recovery rate simultaneously and creates their association implicitly. In accordance with Basel III, this paper shows that the tendency of default is governed more by systematic risk than by idiosyncratic risk during a hectic period. Among the models considered, the one with random factor loading and a state-dependent recovery rate turns out to be superior in default prediction.
Submitted 6 October, 2020; v1 submitted 25 September, 2020;
originally announced September 2020.
-
A note on the impact of news on US household inflation expectations
Authors:
Ben Zhe Wang,
Jeffrey Sheen,
Stefan Trück,
Shih-Kang Chao,
Wolfgang Karl Härdle
Abstract:
Monthly disaggregated US data from 1978 to 2016 reveals that exposure to news on inflation and monetary policy helps to explain inflation expectations. This remains true when controlling for household personal characteristics, perceptions of government policy effectiveness, future interest rates and unemployment expectations, and sentiment. We find an asymmetric impact of news on inflation and monetary policy after 1983, with news on rising inflation and easier monetary policy having a stronger effect in comparison to news on lowering inflation and tightening monetary policy. Our results indicate the impact on inflation expectations of monetary policy news manifested through consumer sentiment during the lower bound period.
Submitted 24 September, 2020;
originally announced September 2020.
-
Pricing Cryptocurrency Options
Authors:
Ai Jun Hou,
Weining Wang,
Cathy Y. H. Chen,
Wolfgang Karl Härdle
Abstract:
Cryptocurrencies, especially Bitcoin (BTC), which comprise a new digital asset class, have drawn extraordinary worldwide attention. The characteristics of the cryptocurrency/BTC include a high level of speculation, extreme volatility and price discontinuity. We propose a pricing mechanism based on a stochastic volatility with a correlated jump (SVCJ) model and compare it to a flexible co-jump model by Bandi and Renò (2016). The estimation results of both models confirm the impact of jumps and co-jumps on options obtained via simulation and an analysis of the implied volatility curve. We show that a sizeable proportion of price jumps are significantly and contemporaneously anti-correlated with jumps in volatility. Our study comprises pioneering research on pricing BTC options. We show how the proposed pricing mechanism underlines the importance of jumps in cryptocurrency markets.
Submitted 23 September, 2020;
originally announced September 2020.
-
Distillation of News Flow into Analysis of Stock Reactions
Authors:
Junni L. Zhang,
Wolfgang Karl Härdle,
Cathy Y. Chen,
Elisabeth Bommes
Abstract:
The gargantuan plethora of opinions, facts and tweets on financial business offers the opportunity to test and analyze the influence of such text sources on future directions of stocks. It also creates, though, the necessity to distill via statistical technology the informative elements of this prodigious and indeed colossal data source. Using mixed text sources from professional platforms, blog fora and stock message boards we distill via different lexica sentiment variables. These are employed for an analysis of stock reactions: volatility, volume and returns. An increased sentiment, especially for those with negative prospection, will influence volatility as well as volume. This influence is contingent on the lexical projection and different across Global Industry Classification Standard (GICS) sectors. Based on review articles on 100 S&P 500 constituents for the period of October 20, 2009, to October 13, 2014, we project into BL, MPQA, LM lexica and use the distilled sentiment variables to forecast individual stock indicators in a panel context. Exploiting different lexical projections to test different stock reaction indicators we aim at answering the following research questions: (i) Are the lexica consistent in their analytic ability? (ii) To which degree is there an asymmetric response given the sentiment scales (positive vs. negative)? (iii) Does the news of high-attention firms diffuse faster and result in more timely and efficient stock reactions? (iv) Is there a sector-specific reaction from the distilled sentiment measures? We find there is significant incremental information in the distilled news flow and the sentiment effect is characterized as an asymmetric, attention-specific and sector-specific response of stock reactions.
Submitted 22 September, 2020;
originally announced September 2020.
-
CRIX an index for cryptocurrencies
Authors:
Simon Trimborn,
Wolfgang Karl Härdle
Abstract:
The cryptocurrency market is unique on many levels: Very volatile, frequently changing market structure, emerging and vanishing of cryptocurrencies on a daily level. Following its development became a difficult task with the success of cryptocurrencies (CCs) other than Bitcoin. For fiat currency markets, the IMF offers the index SDR and, prior to the EUR, the ECU existed, which was an index representing the development of European currencies. Index providers decide on a fixed number of index constituents which will represent the market segment. It is a challenge to fix a number and develop rules for the constituents in view of the market changes. In the frequently changing CC market, this challenge is even more severe. A method relying on the AIC is proposed to quickly react to market changes and therefore enable us to create an index, referred to as CRIX, for the cryptocurrency market. CRIX is chosen by model selection such that it represents the market well to enable each interested party studying economic questions in this market and to invest into the market. The diversified nature of the CC market makes the inclusion of altcoins in the index product critical to improve tracking performance. We have shown that assigning optimal weights to altcoins helps to reduce the tracking errors of a CC portfolio, despite the fact that their market cap is much smaller relative to Bitcoin. The codes used here are available via www.quantlet.de.
Submitted 21 September, 2020;
originally announced September 2020.
-
Implied Basket Correlation Dynamics
Authors:
Wolfgang Karl Härdle,
Elena Silyakova
Abstract:
Equity basket correlation can be estimated both using the physical measure from stock prices, and also using the risk neutral measure from option prices. The difference between the two estimates motivates a so-called "dispersion strategy". We study the performance of this strategy on the German market and propose several profitability improvement schemes based on implied correlation (IC) forecasts. Modelling IC conceals several challenges. Firstly, the number of correlation coefficients would grow with the size of the basket. Secondly, IC is not constant over maturities and strikes. Finally, IC changes over time. We reduce the dimensionality of the problem by assuming equicorrelation. The IC surface (ICS) is then approximated from the implied volatilities of stocks and the implied volatility of the basket. To analyze the dynamics of the ICS we employ a dynamic semiparametric factor model.
Submitted 21 September, 2020;
originally announced September 2020.
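The equicorrelation step mentioned in the abstract above can be illustrated with the standard basket variance identity: if every pair of constituents shares one correlation rho, then basket variance equals the sum of w_i^2 sigma_i^2 plus rho times the cross terms, so rho can be backed out from the basket and constituent implied volatilities. The sketch below is a minimal illustration under that assumption, not the authors' implementation; the function names, weights and volatilities are hypothetical.

```python
import math


def implied_equicorrelation(weights, vols, basket_vol):
    """Back out the single correlation rho implied by basket and constituent vols."""
    own = sum((w * s) ** 2 for w, s in zip(weights, vols))
    # Sum over i != j of w_i w_j s_i s_j equals (sum w_i s_i)^2 minus the own terms.
    cross = sum(w * s for w, s in zip(weights, vols)) ** 2 - own
    if cross <= 0.0:
        raise ValueError("need at least two constituents with positive weight and volatility")
    return (basket_vol ** 2 - own) / cross


def basket_vol_from_rho(weights, vols, rho):
    """Inverse map: basket volatility implied by a common correlation rho."""
    own = sum((w * s) ** 2 for w, s in zip(weights, vols))
    cross = sum(w * s for w, s in zip(weights, vols)) ** 2 - own
    return math.sqrt(own + rho * cross)


# Hypothetical three-stock basket at one maturity and moneyness point.
w = [0.5, 0.3, 0.2]
sigma = [0.20, 0.25, 0.30]
sigma_basket = basket_vol_from_rho(w, sigma, 0.4)
rho = implied_equicorrelation(w, sigma, sigma_basket)
```

Repeating this calculation across maturities and moneyness levels would give one point of an implied correlation surface per grid node, which is the object whose dynamics the abstract describes.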
-
Regularization Approach for Network Modeling of German Power Derivative Market
Authors:
Shi Chen,
Wolfgang Karl Härdle,
Brenda López Cabrera
Abstract:
In this paper we propose a regularization approach for network modeling of the German power derivative market. To deal with the large portfolio, we combine high-dimensional variable selection techniques with dynamic network analysis. The estimated sparse interconnectedness of the full German power derivative market clearly identifies the significant channels of relevant potential risk spillovers. Our empirical findings show the importance of interdependence between different contract types, and identify the main risk contributors. We further observe strong pairwise interconnections between neighboring contracts, especially for the spot contracts trading in the peak hours; the implications for regulators and investors are also discussed. The network analysis of the full German power derivative market helps us to complete the picture of system risk and to better understand the functioning and environment of the German power market.
Submitted 21 September, 2020;
originally announced September 2020.
-
Model-driven statistical arbitrage on LETF option markets
Authors:
Sergey Nasekin,
Wolfgang Karl Härdle
Abstract:
In this paper, we study the statistical properties of the moneyness scaling transformation by Leung and Sircar (2015). This transformation adjusts the moneyness coordinate of the implied volatility smile in an attempt to remove the discrepancy between the IV smiles for levered and unlevered ETF options. We construct bootstrap uniform confidence bands which indicate that the implied volatility smiles are statistically different after moneyness scaling has been performed. An empirical application shows that there are trading opportunities possible on the LETF market. A statistical arbitrage type strategy based on a dynamic semiparametric factor model is presented. This strategy presents a statistical decision algorithm which generates trade recommendations based on comparison of model and observed LETF implied volatility surface. It is shown to generate positive returns with a high probability. Extensive econometric analysis of LETF implied volatility process is performed including out-of-sample forecasting based on a semiparametric factor model and uniform confidence bands' study. It provides new insights into the latent dynamics of the implied volatility surface. We also incorporate Heston stochastic volatility into the moneyness scaling method for better tractability of the model.
Submitted 21 September, 2020;
originally announced September 2020.
-
How to Measure the Performance of a Collaborative Research Center
Authors:
Alona Zharova,
Janine Tellinger-Rice,
Wolfgang Karl Härdle
Abstract:
New Public Management helps universities and research institutions to perform in a highly competitive research environment. Evaluating publicly financed research improves transparency, helps in reflection and self-assessment, and provides information for strategic decision making. In this paper we provide empirical evidence using data from a Collaborative Research Center (CRC) on financial inputs and research output from 2005 to 2016. After selecting performance indicators suitable for a CRC, we describe main properties of the data using visualization techniques. To study the relationship between the dimensions of research performance, we use a time fixed effects panel data model and fixed effects Poisson model. With the help of year dummy variables, we show how the pattern of research productivity changes over time after controlling for staff and travel costs. The joint depiction of the time fixed effects and the research project's life cycle allows a better understanding of the development of the number of discussion papers over time.
Submitted 16 September, 2020;
originally announced September 2020.
-
Data driven value-at-risk forecasting using a SVR-GARCH-KDE hybrid
Authors:
Marius Lux,
Wolfgang Karl Härdle,
Stefan Lessmann
Abstract:
Appropriate risk management is crucial to ensure the competitiveness of financial institutions and the stability of the economy. One widely used financial risk measure is Value-at-Risk (VaR). VaR estimates based on linear and parametric models can lead to biased results or even underestimation of risk due to time varying volatility, skewness and leptokurtosis of financial return series. The paper proposes a nonlinear and nonparametric framework to forecast VaR that is motivated by overcoming the disadvantages of parametric models with a purely data driven approach. Mean and volatility are modeled via support vector regression (SVR) where the volatility model is motivated by the standard generalized autoregressive conditional heteroscedasticity (GARCH) formulation. Based on this, VaR is derived by applying kernel density estimation (KDE). This approach allows for flexible tail shapes of the profit and loss distribution, adapts for a wide class of tail events and is able to capture complex structures regarding mean and volatility.
The SVR-GARCH-KDE hybrid is compared to standard, exponential and threshold GARCH models coupled with different error distributions. To examine the performance in different markets, one-day-ahead and ten-days-ahead forecasts are produced for different financial indices. Model evaluation using a likelihood ratio based test framework for interval forecasts and a test for superior predictive ability indicates that the SVR-GARCH-KDE hybrid performs competitively with benchmark models and significantly reduces potential losses, especially for ten-days-ahead forecasts. In particular, models that are coupled with a normal distribution are systematically outperformed.
Submitted 15 September, 2020;
originally announced September 2020.
-
Portfolio Decisions and Brain Reactions via the CEAD method
Authors:
Piotr Majer,
Peter N. C. Mohr,
Hauke R. Heekeren,
Wolfgang K. Härdle
Abstract:
Decision making can be a complex process requiring the integration of several attributes of choice options. Understanding the neural processes underlying (uncertain) investment decisions is an important topic in neuroeconomics. We analyzed functional magnetic resonance imaging (fMRI) data from an investment decision study for stimulus-related effects. We propose a new technique for identifying activated brain regions: the Cluster, Estimation, Activation and Decision (CEAD) method. Our analysis is focused on clusters of voxels rather than voxel units. Thus, we achieve a higher signal-to-noise ratio within the unit tested and a smaller number of hypothesis tests compared with the often used General Linear Model (GLM). We propose to first conduct the brain parcellation by applying spatially constrained spectral clustering. The information within each cluster can then be extracted by the flexible Dynamic Semiparametric Factor Model (DSFM) dimension reduction technique and finally be tested for differences in activation between conditions. This sequence of Cluster, Estimation, Activation and Decision admits a model-free analysis of the local fMRI signal. Applying a GLM on the DSFM-based time series resulted in a significant correlation between the risk of choice options and changes in fMRI signal in the anterior insula and dorsomedial prefrontal cortex. Additionally, individual differences in decision-related reactions within the DSFM time series predicted individual differences in risk attitudes as modeled with the framework of the mean-variance model.
Submitted 10 September, 2020;
originally announced September 2020.
-
Statistical Inference for Generalized Additive Partially Linear Model
Authors:
Rong Liu,
Wolfgang Karl Härdle
Abstract:
The Generalized Additive Model (GAM) is a powerful tool and has been well studied. This model class helps to identify additive regression structure. Via available test procedures one may identify the regression structure even more sharply if some component functions have parametric form. The Generalized Additive Partially Linear Models (GAPLM) enjoy the simplicity of the GLM and the flexibility of the GAM because they combine both parametric and nonparametric components. We use the hybrid spline-backfitted kernel estimation method, which combines the best features of both spline and kernel methods for making fast, efficient and reliable estimation under an alpha-mixing condition. In addition, simultaneous confidence corridors (SCCs) for testing overall trends and empirical likelihood confidence regions for parameters are provided under an independence condition. The asymptotic properties are obtained and simulation results support the theoretical properties. For the application, we use the GAPLM to improve the accuracy ratio of the default predictions for $19610$ German companies. The quantlets for this paper are available on https://github.com.
Submitted 10 September, 2020;
originally announced September 2020.
-
Investing with Cryptocurrencies -- evaluating their potential for portfolio allocation strategies
Authors:
Alla Petukhina,
Simon Trimborn,
Wolfgang Karl Härdle,
Hermann Elendner
Abstract:
Cryptocurrencies (CCs) have risen rapidly in market capitalization over the last years. Despite striking price volatility, their high average returns have drawn attention to CCs as alternative investment assets for portfolio and risk management. We investigate the utility gains for different types of investors when they consider cryptocurrencies as an addition to their portfolio of traditional assets. We consider risk-averse, return-seeking as well as diversification-preferring investors who trade along different allocation frequencies, namely daily, weekly or monthly. Out-of-sample performance and diversification benefits are studied for the most popular portfolio-construction rules, including mean-variance optimization, risk-parity, and maximum-diversification strategies, as well as combined strategies. To account for low liquidity in CC markets, we incorporate liquidity constraints via the LIBRO method. Our results show that CCs can improve the risk-return profile of portfolios. In particular, a maximum-diversification strategy (maximizing the Portfolio Diversification Index, PDI) draws appreciably on CCs, and spanning tests clearly indicate that CC returns are non-redundant additions to the investment universe. However, our analysis also shows that the illiquidity of CCs can potentially reverse these results.
Submitted 16 September, 2020; v1 submitted 9 September, 2020;
originally announced September 2020.
-
A Mortality Model for Multi-populations: A Semi-Parametric Approach
Authors:
Lei Fang,
Wolfgang K. Härdle,
Juhyun Park
Abstract:
Mortality is different across countries, states and regions. Several empirical research works, however, reveal that mortality trends exhibit a common pattern and show similar structures across populations. The key element in analyzing mortality rates is a time-varying indicator curve. Our main interest lies in validating the existence of the common trends among these curves, the similar gender differences and their variability in location among the curves at the national level. Motivated by the empirical findings, we study the estimation and forecasting of mortality rates based on a semi-parametric approach, which is applied to multiple curves with shape-related nonlinear variation. This approach allows us to capture the common features contained in the curve functions while providing the possibility to characterize the nonlinear variation via a few deviation parameters. These parameters carry an instructive summary of the time-varying curve functions and can be further used to make a suggestive forecast analysis for countries with barren data sets. In this research the model is illustrated with mortality rates of Japan and China, and extended to incorporate more countries.
Submitted 9 September, 2020;
originally announced September 2020.
-
Analysis of Deviance for Hypothesis Testing in Generalized Partially Linear Models
Authors:
Wolfgang Karl Härdle,
Li-Shan Huang
Abstract:
In this study, we develop nonparametric analysis of deviance tools for generalized partially linear models based on local polynomial fitting. Assuming a canonical link, we propose expressions for both local and global analysis of deviance, which admit an additivity property that reduces to analysis of variance decompositions in the Gaussian case. Chi-square tests based on integrated likelihood functions are proposed to formally test whether the nonparametric term is significant. Simulation results are shown to illustrate the proposed chi-square tests and to compare them with an existing procedure based on penalized splines. The methodology is applied to German Bundesbank Federal Reserve data.
Submitted 9 September, 2020;
originally announced September 2020.
-
Rise of the Machines? Intraday High-Frequency Trading Patterns of Cryptocurrencies
Authors:
Alla A. Petukhina,
Raphael C. G. Reule,
Wolfgang Karl Härdle
Abstract:
This research analyses high-frequency data of the cryptocurrency market with regard to intraday trading patterns related to algorithmic trading and its impact on the European cryptocurrency market. We study trading quantitatives such as returns, traded volumes, volatility periodicity, and provide summary statistics of return correlations to CRIX (CRyptocurrency IndeX), as well as respective overall high-frequency based market statistics with respect to temporal aspects. Our results provide mandatory insight into a market where the large-scale employment of automated trading algorithms and the extremely rapid execution of trades might seem to be a standard based on media reports. Our findings on intraday momentum of trading patterns lead to a new quantitative view on approaching the predictability of economic value in this new digital market.
Submitted 9 September, 2020;
originally announced September 2020.
-
Improving Crime Count Forecasts Using Twitter and Taxi Data
Authors:
Lara Vomfell,
Wolfgang Karl Härdle,
Stefan Lessmann
Abstract:
Crime prediction is crucial to criminal justice decision makers and efforts to prevent crime. The paper evaluates the explanatory and predictive value of human activity patterns derived from taxi trip, Twitter and Foursquare data. Analysis of a six-month period of crime data for New York City shows that these data sources improve predictive accuracy for property crime by 19% compared to using only demographic data. This effect is strongest when the novel features are used together, yielding new insights into crime prediction. Notably and in line with social disorganization theory, the novel features cannot improve predictions for violent crimes.
Submitted 8 September, 2020;
originally announced September 2020.
-
Editorial: Understanding Cryptocurrencies
Authors:
Wolfgang Karl Härdle,
Campbell R. Harvey,
Raphael C. G. Reule
Abstract:
Cryptocurrency refers to a type of digital asset that uses distributed ledger, or blockchain, technology to enable a secure transaction. Although the technology is widely misunderstood, many central banks are considering launching their own national cryptocurrency. In contrast to most data in financial economics, detailed data on the history of every transaction in the cryptocurrency complex are freely available. Furthermore, empirically-oriented research is only now beginning, presenting an extraordinary research opportunity for academia. We provide some insights into the mechanics of cryptocurrencies, describing summary statistics and focusing on potential future research avenues in financial economics.
Submitted 29 July, 2020;
originally announced July 2020.
-
Risk of Bitcoin Market: Volatility, Jumps, and Forecasts
Authors:
Junjie Hu,
Wolfgang Karl Härdle,
Weiyu Kuo
Abstract:
Cryptocurrency, the most controversial and simultaneously the most interesting asset, has attracted many investors and speculators in recent years. The visibly significant market capitalization of cryptos also motivates modern financial instruments such as futures and options. Those will depend on the dynamics, volatility, or even the jumps of cryptos. We provide a comprehensive investigation of the risk dynamics of the Bitcoin market from a realized volatility perspective. The Bitcoin market is extremely risky in the sense of volatility, entangled jumps, and extensive consecutive jumps, which reflect the major incidents worldwide. The empirical study shows that the lagged realized variance increases the future realized variance, while the jumps, especially positive ones, significantly reduce future realized variance. The out-of-sample forecasting model reveals that, in terms of forecasting accuracy and utility gain, investors interested in the long-term realized variance benefit from explicitly modelling the jumps and signed estimators, which is unnecessary for the short-term realized variance forecast.
Submitted 9 December, 2021; v1 submitted 11 December, 2019;
originally announced December 2019.
-
LASSO-Driven Inference in Time and Space
Authors:
Victor Chernozhukov,
Wolfgang K. Härdle,
Chen Huang,
Weining Wang
Abstract:
We consider estimation and inference in a system of high-dimensional regression equations allowing for temporal and cross-sectional dependency in covariates and error processes, covering rather general forms of weak temporal dependence. A sequence of regressions with many regressors using LASSO (Least Absolute Shrinkage and Selection Operator) is applied for variable selection purposes, and an overall penalty level is carefully chosen by a block multiplier bootstrap procedure to account for multiplicity of the equations and dependencies in the data. Correspondingly, oracle properties with a jointly selected tuning parameter are derived. We further provide high-quality de-biased simultaneous inference on the many target parameters of the system. We provide bootstrap consistency results of the test procedure, which are based on a general Bahadur representation for the $Z$-estimators with dependent data. Simulations demonstrate good performance of the proposed inference procedure. Finally, we apply the method to quantify spillover effects of textual sentiment indices in a financial market and to test the connectedness among sectors.
Submitted 15 May, 2020; v1 submitted 13 June, 2018;
originally announced June 2018.
-
A Time-Varying Network for Cryptocurrencies
Authors:
Li Guo,
Wolfgang Karl Härdle,
Yubo Tao
Abstract:
Cryptocurrencies' return cross-predictability and technological similarity yield information on risk propagation and market segmentation. To investigate these effects, we build a time-varying network for cryptocurrencies, based on the evolution of return cross-predictability and technological similarities. We develop a dynamic covariate-assisted spectral clustering method to consistently estimate the latent community structure of the cryptocurrency network that accounts for both sets of information. We demonstrate that investors can achieve better risk diversification by investing in cryptocurrencies from different communities. A cross-sectional portfolio that implements an inter-crypto momentum trading strategy earns a 1.08% daily return. By dissecting the portfolio returns on behavioral factors, we confirm that our results are not driven by behavioral mechanisms.
Submitted 17 November, 2022; v1 submitted 11 February, 2018;
originally announced February 2018.
-
Factorisable Multitask Quantile Regression
Authors:
Shih-Kang Chao,
Wolfgang Karl Härdle,
Ming Yuan
Abstract:
A multivariate quantile regression model with a factor structure is proposed to study data with many responses of interest. The factor structure is allowed to vary with the quantile levels, which makes our framework more flexible than the classical factor models. The model is estimated with nuclear norm regularization in order to accommodate the high dimensionality of data, but the incurred optimization problem can only be efficiently solved in an approximate manner by off-the-shelf optimization methods. Such a scenario is often seen when the empirical risk is non-smooth or the numerical procedure involves expensive subroutines such as singular value decomposition. To ensure that the approximate estimator accurately estimates the model, non-asymptotic bounds on the error of the approximate estimator are established. For implementation, a numerical procedure that provably marginalizes the approximate error is proposed. The merits of our model and the proposed numerical procedures are demonstrated through Monte Carlo experiments and an application to finance involving a large pool of asset returns.
Submitted 18 January, 2020; v1 submitted 14 July, 2015;
originally announced July 2015.