Search | arXiv e-print repository

Forecasting Cryptocurrency Staking Rewards

Authors: Sauren Gupta, Apoorva Hathi Katharaki, Yifan Xu, Bhaskar Krishnamachari, Rajarshi Gupta

Abstract: This research explores a relatively unexplored area of predicting cryptocurrency staking rewards, offering potential insights to researchers and investors. We investigate two predictive methodologies: a) a straightforward sliding-window average, and b) linear regression models predicated on historical data. The findings reveal that ETH staking rewards can be forecasted with an RMSE within 0.7% and… ▽ More This research explores a relatively unexplored area of predicting cryptocurrency staking rewards, offering potential insights to researchers and investors. We investigate two predictive methodologies: a) a straightforward sliding-window average, and b) linear regression models predicated on historical data. The findings reveal that ETH staking rewards can be forecasted with an RMSE within 0.7% and 1.1% of the mean value for 1-day and 7-day look-aheads respectively, using a 7-day sliding-window average approach. Additionally, we discern diverse prediction accuracies across various cryptocurrencies, including SOL, XTZ, ATOM, and MATIC. Linear regression is identified as superior to the moving-window average for perdicting in the short term for XTZ and ATOM. The results underscore the generally stable and predictable nature of staking rewards for most assets, with MATIC presenting a noteworthy exception. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: 9 pages, 18 figures

arXiv:2401.06139 [pdf, other]

Stockformer: A Price-Volume Factor Stock Selection Model Based on Wavelet Transform and Multi-Task Self-Attention Networks

Authors: Bohan Ma, Yushan Xue, Yuan Lu, **g Chen

Abstract: As the Chinese stock market continues to evolve and its market structure grows increasingly complex, traditional quantitative trading methods are facing escalating challenges. Particularly, due to policy uncertainty and the frequent market fluctuations triggered by sudden economic events, existing models often struggle to accurately predict market dynamics. To address these challenges, this paper… ▽ More As the Chinese stock market continues to evolve and its market structure grows increasingly complex, traditional quantitative trading methods are facing escalating challenges. Particularly, due to policy uncertainty and the frequent market fluctuations triggered by sudden economic events, existing models often struggle to accurately predict market dynamics. To address these challenges, this paper introduces Stockformer, a price-volume factor stock selection model that integrates wavelet transformation and a multitask self-attention network, aimed at enhancing responsiveness and predictive accuracy regarding market instabilities. Through discrete wavelet transform, Stockformer decomposes stock returns into high and low frequencies, meticulously capturing long-term market trends and short-term fluctuations, including abrupt events. Moreover, the model incorporates a Dual-Frequency Spatiotemporal Encoder and graph embedding techniques to effectively capture complex temporal and spatial relationships among stocks. Employing a multitask learning strategy, it simultaneously predicts stock returns and directional trends. Experimental results show that Stockformer outperforms existing advanced methods on multiple real stock market datasets. In strategy backtesting, Stockformer consistently demonstrates exceptional stability and reliability across market conditions-whether rising, falling, or fluctuating-particularly maintaining high performance during downturns or volatile periods, indicating a high adaptability to market fluctuations. To foster innovation and collaboration in the financial analysis sector, the Stockformer model's code has been open-sourced and is available on the GitHub repository: https://github.com/Eric991005/Multitask-Stockformer. △ Less

Submitted 17 June, 2024; v1 submitted 22 November, 2023; originally announced January 2024.

Comments: Currently under consideration for publication in the Expert Systems With Applications

arXiv:2211.05581 [pdf, other]

Graph-Regularized Tensor Regression: A Domain-Aware Framework for Interpretable Multi-Way Financial Modelling

Authors: Yao Lei Xu, Kriton Konstantinidis, Danilo P. Mandic

Abstract: Analytics of financial data is inherently a Big Data paradigm, as such data are collected over many assets, asset classes, countries, and time periods. This represents a challenge for modern machine learning models, as the number of model parameters needed to process such data grows exponentially with the data dimensions; an effect known as the Curse-of-Dimensionality. Recently, Tensor Decompositi… ▽ More Analytics of financial data is inherently a Big Data paradigm, as such data are collected over many assets, asset classes, countries, and time periods. This represents a challenge for modern machine learning models, as the number of model parameters needed to process such data grows exponentially with the data dimensions; an effect known as the Curse-of-Dimensionality. Recently, Tensor Decomposition (TD) techniques have shown promising results in reducing the computational costs associated with large-dimensional financial models while achieving comparable performance. However, tensor models are often unable to incorporate the underlying economic domain knowledge. To this end, we develop a novel Graph-Regularized Tensor Regression (GRTR) framework, whereby knowledge about cross-asset relations is incorporated into the model in the form of a graph Laplacian matrix. This is then used as a regularization tool to promote an economically meaningful structure within the model parameters. By virtue of tensor algebra, the proposed framework is shown to be fully interpretable, both coefficient-wise and dimension-wise. The GRTR model is validated in a multi-way financial forecasting setting and compared against competing models, and is shown to achieve improved performance at reduced computational costs. Detailed visualizations are provided to help the reader gain an intuitive understanding of the employed tensor operations. △ Less

Submitted 26 October, 2022; originally announced November 2022.

arXiv:2112.02365 [pdf, other]

TransBoost: A Boosting-Tree Kernel Transfer Learning Algorithm for Improving Financial Inclusion

Authors: Yiheng Sun, Tian Lu, Cong Wang, Yuan Li, Huaiyu Fu, **gran Dong, Yunjie Xu

Abstract: The prosperity of mobile and financial technologies has bred and expanded various kinds of financial products to a broader scope of people, which contributes to advocating financial inclusion. It has non-trivial social benefits of diminishing financial inequality. However, the technical challenges in individual financial risk evaluation caused by the distinct characteristic distribution and limite… ▽ More The prosperity of mobile and financial technologies has bred and expanded various kinds of financial products to a broader scope of people, which contributes to advocating financial inclusion. It has non-trivial social benefits of diminishing financial inequality. However, the technical challenges in individual financial risk evaluation caused by the distinct characteristic distribution and limited credit history of new users, as well as the inexperience of newly-entered companies in handling complex data and obtaining accurate labels, impede further promoting financial inclusion. To tackle these challenges, this paper develops a novel transfer learning algorithm (i.e., TransBoost) that combines the merits of tree-based models and kernel methods. The TransBoost is designed with a parallel tree structure and efficient weights updating mechanism with theoretical guarantee, which enables it to excel in tackling real-world data with high dimensional features and sparsity in $O(n)$ time complexity. We conduct extensive experiments on two public datasets and a unique large-scale dataset from Tencent Mobile Payment. The results show that the TransBoost outperforms other state-of-the-art benchmark transfer learning algorithms in terms of prediction accuracy with superior efficiency, shows stronger robustness to data sparsity, and provides meaningful model interpretation. Besides, given a financial risk level, the TransBoost enables financial service providers to serve the largest number of users including those who would otherwise be excluded by other algorithms. That is, the TransBoost improves financial inclusion. △ Less

Submitted 15 December, 2021; v1 submitted 4 December, 2021; originally announced December 2021.

Comments: Accepted at AAAI-22

arXiv:2107.03926 [pdf, other]

doi 10.1007/978-3-030-86957-1_5

Measuring Financial Time Series Similarity With a View to Identifying Profitable Stock Market Opportunities

Authors: Rian Dolphin, Barry Smyth, Yang Xu, Ruihai Dong

Abstract: Forecasting stock returns is a challenging problem due to the highly stochastic nature of the market and the vast array of factors and events that can influence trading volume and prices. Nevertheless it has proven to be an attractive target for machine learning research because of the potential for even modest levels of prediction accuracy to deliver significant benefits. In this paper, we descri… ▽ More Forecasting stock returns is a challenging problem due to the highly stochastic nature of the market and the vast array of factors and events that can influence trading volume and prices. Nevertheless it has proven to be an attractive target for machine learning research because of the potential for even modest levels of prediction accuracy to deliver significant benefits. In this paper, we describe a case-based reasoning approach to predicting stock market returns using only historical pricing data. We argue that one of the impediments for case-based stock prediction has been the lack of a suitable similarity metric when it comes to identifying similar pricing histories as the basis for a future prediction -- traditional Euclidean and correlation based approaches are not effective for a variety of reasons -- and in this regard, a key contribution of this work is the development of a novel similarity metric for comparing historical pricing data. We demonstrate the benefits of this metric and the case-based approach in a real-world application in comparison to a variety of conventional benchmarks. △ Less

Submitted 7 July, 2021; originally announced July 2021.

Comments: 15 pages. Accepted for presentation at the International Conference on Case-Based Reasoning 2021 (ICCBR)

arXiv:2010.01996 [pdf]

Evaluation of company investment value based on machine learning

Authors: Junfeng Hu, Xiaosa Li, Yuru Xu, Shaowu Wu, Bin Zheng

Abstract: In this paper, company investment value evaluation models are established based on comprehensive company information. After data mining and extracting a set of 436 feature parameters, an optimal subset of features is obtained by dimension reduction through tree-based feature selection, followed by the 5-fold cross-validation using XGBoost and LightGBM models. The results show that the Root-Mean-Sq… ▽ More In this paper, company investment value evaluation models are established based on comprehensive company information. After data mining and extracting a set of 436 feature parameters, an optimal subset of features is obtained by dimension reduction through tree-based feature selection, followed by the 5-fold cross-validation using XGBoost and LightGBM models. The results show that the Root-Mean-Square Error (RMSE) reached 3.098 and 3.059, respectively. In order to further improve the stability and generalization capability, Bayesian Ridge Regression has been used to train a stacking model based on the XGBoost and LightGBM models. The corresponding RMSE is up to 3.047. Finally, the importance of different features to the LightGBM model is analysed. △ Less

Submitted 30 September, 2020; originally announced October 2020.

arXiv:2007.03573 [pdf, other]

regvis.net -- A Visual Bibliography of Regulatory Visualization

Authors: Zhibin Niu, Runlin Li, Junqi Wu, Yaqi Xue, Jiawan Zhang

Abstract: Information visualization and visual analytics technology has attracted significant attention from the financial regulation community. In this research, we present regvis.net, a visual survey of regulatory visualization that allows researchers from both the computing and financial communities to review their literature of interest. We have collected and manually tagged more than 80 regulation visu… ▽ More Information visualization and visual analytics technology has attracted significant attention from the financial regulation community. In this research, we present regvis.net, a visual survey of regulatory visualization that allows researchers from both the computing and financial communities to review their literature of interest. We have collected and manually tagged more than 80 regulation visualization related publications. To the best of our knowledge, this is the first publication set tailored for regulatory visualization. We have provided a webpage (http://regvis.net) for interactive searches and filtering. Each publication is represented by a thumbnail of the representative system interface or key visualization chart, and users can conduct multi-condition screening explorations and fixed text searches. △ Less

Submitted 7 July, 2020; originally announced July 2020.

Comments: 2 pages. Refer to http://regvis.net

arXiv:2003.02515 [pdf, other]

Time-varying neural network for stock return prediction

Authors: Steven Y. K. Wong, Jennifer Chan, Lamiae Azizi, Richard Y. D. Xu

Abstract: We consider the problem of neural network training in a time-varying context. Machine learning algorithms have excelled in problems that do not change over time. However, problems encountered in financial markets are often time-varying. We propose the online early stop** algorithm and show that a neural network trained using this algorithm can track a function changing with unknown dynamics. We… ▽ More We consider the problem of neural network training in a time-varying context. Machine learning algorithms have excelled in problems that do not change over time. However, problems encountered in financial markets are often time-varying. We propose the online early stop** algorithm and show that a neural network trained using this algorithm can track a function changing with unknown dynamics. We compare the proposed algorithm to current approaches on predicting monthly U.S. stock returns and show its superiority. We also show that prominent factors (such as the size and momentum effects) and industry indicators, exhibit time varying stock return predictiveness. We find that during market distress, industry indicators experience an increase in importance at the expense of firm level features. This indicates that industries play a role in explaining stock returns during periods of heightened risk. △ Less

Submitted 22 January, 2021; v1 submitted 5 March, 2020; originally announced March 2020.

Comments: 35 pages, 9 figures

arXiv:1906.09024 [pdf, ps, other]

BERT-based Financial Sentiment Index and LSTM-based Stock Return Predictability

Authors: Joshua Zoen Git Hiew, Xin Huang, Hao Mou, Duan Li, Qi Wu, Yabo Xu

Abstract: Traditional sentiment construction in finance relies heavily on the dictionary-based approach, with a few exceptions using simple machine learning techniques such as Naive Bayes classifier. While the current literature has not yet invoked the rapid advancement in the natural language processing, we construct in this research a textual-based sentiment index using a well-known pre-trained model BERT… ▽ More Traditional sentiment construction in finance relies heavily on the dictionary-based approach, with a few exceptions using simple machine learning techniques such as Naive Bayes classifier. While the current literature has not yet invoked the rapid advancement in the natural language processing, we construct in this research a textual-based sentiment index using a well-known pre-trained model BERT developed by Google, especially for three actively trading individual stocks in Hong Kong market with at the same time the hot discussion on Weibo.com. On the one hand, we demonstrate a significant enhancement of applying BERT in financial sentiment analysis when compared with the existing models. On the other hand, by combining with the other two commonly-used methods when it comes to building the sentiment index in the financial literature, i.e., the option-implied and the market-implied approaches, we propose a more general and comprehensive framework for the financial sentiment analysis, and further provide convincing outcomes for the predictability of individual stock return by combining LSTM (with a feature of a nonlinear map**). It is significantly distinct with the dominating econometric methods in sentiment influence analysis which are all of a nature of linear regression. △ Less

Submitted 7 July, 2022; v1 submitted 21 June, 2019; originally announced June 2019.

Comments: Manuscript

arXiv:1906.03309 [pdf, ps, other]

doi 10.1214/21-AAP1678

An optimal transport problem with backward martingale constraints motivated by insider trading

Authors: Dmitry Kramkov, Yan Xu

Abstract: We study a single-period optimal transport problem on $\mathbb{R}^2$ with a covariance-type cost function $c(x,y) = (x_1-y_1)(x_2-y_2)$ and a backward martingale constraint. We show that a transport plan $γ$ is optimal if and only if there is a maximal monotone set $G$ that supports the $x$-marginal of $γ$ and such that $c(x,y) = \min_{z\in G}c(z,y)$ for every $(x,y)$ in the support of $γ$. We obt… ▽ More We study a single-period optimal transport problem on $\mathbb{R}^2$ with a covariance-type cost function $c(x,y) = (x_1-y_1)(x_2-y_2)$ and a backward martingale constraint. We show that a transport plan $γ$ is optimal if and only if there is a maximal monotone set $G$ that supports the $x$-marginal of $γ$ and such that $c(x,y) = \min_{z\in G}c(z,y)$ for every $(x,y)$ in the support of $γ$. We obtain sharp regularity conditions for the uniqueness of an optimal plan and for its representation in terms of a map. Our study is motivated by a variant of the classical Kyle model of insider trading from Rochet and Vila (1994). △ Less

Submitted 7 June, 2019; originally announced June 2019.

Comments: 46 pages

Journal ref: Annals of Applied Probability 2022, Vol. 32, No. 1, 294-326

arXiv:1710.02127 [pdf, other]

Intervention On Default Contagion Under Partial Information

Authors: Yang Xu

Abstract: We model the default contagion process in a large heterogeneous financial network under the interventions of a regulator (a central bank) with only partial information which is a more realistic setting than most current literature. We provide the analytical results for the asymptotic optimal intervention policies and the asymptotic magnitude of default contagion in terms of the network characteris… ▽ More We model the default contagion process in a large heterogeneous financial network under the interventions of a regulator (a central bank) with only partial information which is a more realistic setting than most current literature. We provide the analytical results for the asymptotic optimal intervention policies and the asymptotic magnitude of default contagion in terms of the network characteristics. We extend the results of Amini et al. (2013) to incorporate interventions and the model of Amini et al. (2015); Amini et al. (2017) to heterogeneous networks with a given degree sequence and arbitrary initial equity levels. The insights from the results are that the optimal intervention policy is "monotonic" in terms of the intervention cost, the closeness to invulnerability and connectivity. Moreover, we should keep intervening on a bank once we have intervened on it. Our simulation results show a good agreement with the theoretical results. △ Less

Submitted 5 October, 2017; originally announced October 2017.

Comments: 63 pages, 9 figures

arXiv:1602.04975 [pdf, other]

Dynamic portfolio selection without risk-free assets

Authors: Chi Kin Lam, Yuhong Xu, Guosheng Yin

Abstract: We consider the mean--variance portfolio optimization problem under the game theoretic framework and without risk-free assets. The problem is solved semi-explicitly by applying the extended Hamilton--Jacobi--Bellman equation. Although the coefficient of risk aversion in our model is a constant, the optimal amounts of money invested in each stock still depend on the current wealth in general. The o… ▽ More We consider the mean--variance portfolio optimization problem under the game theoretic framework and without risk-free assets. The problem is solved semi-explicitly by applying the extended Hamilton--Jacobi--Bellman equation. Although the coefficient of risk aversion in our model is a constant, the optimal amounts of money invested in each stock still depend on the current wealth in general. The optimal solution is obtained by solving a system of ordinary differential equations whose existence and uniqueness are proved and a numerical algorithm as well as its convergence speed are provided. Different from portfolio selection with risk-free assets, our value function is quadratic in the current wealth, and the equilibrium allocation is linearly sensitive to the initial wealth. Numerical results show that this model performs better than both the classical one and the variance model in a bull market. △ Less

Submitted 16 February, 2016; originally announced February 2016.

Comments: 41 pages,8 figures

arXiv:1407.8024 [pdf, other]

Robust valuation and risk measurement under model uncertainty

Authors: Yuhong Xu

Abstract: Model uncertainty is a type of inevitable financial risk. Mistakes on the choice of pricing model may cause great financial losses. In this paper we investigate financial markets with mean-volatility uncertainty. Models for stock markets and option markets with uncertain prior distribution are established by Peng's G-stochastic calculus. The process of stock price is described by generalized geome… ▽ More Model uncertainty is a type of inevitable financial risk. Mistakes on the choice of pricing model may cause great financial losses. In this paper we investigate financial markets with mean-volatility uncertainty. Models for stock markets and option markets with uncertain prior distribution are established by Peng's G-stochastic calculus. The process of stock price is described by generalized geometric G-Brownian motion in which the mean uncertainty may move together with or regardless of the volatility uncertainty. On the hedging market, the upper price of an (exotic) option is derived following the Black-Scholes-Barenblatt equation. It is interesting that the corresponding Barenblatt equation does not depend on the risk preference of investors and the mean-uncertainty of underlying stocks. Hence under some appropriate sublinear expectation, neither the risk preference of investors nor the mean-uncertainty of underlying stocks pose effects on our super and subhedging strategies. Appropriate definitions of arbitrage for super and sub-hedging strategies are presented such that the super and sub-hedging prices are reasonable. Especially the condition of arbitrage for sub-hedging strategy fills the gap of the theory of arbitrage under model uncertainty. Finally we show that the term $K$ of finite-variance arising in the super-hedging strategy is interpreted as the max Profit\&Loss of being short a delta-hedged option. The ask-bid spread is in fact the accumulation of summation of the superhedging $P\&L$ and the subhedging $P\&L $. △ Less

Submitted 30 July, 2014; originally announced July 2014.

Comments: 29 pages

MSC Class: 91G20; 91B24; 91B26; 91B28; 91G80; 60H05; 60H10; 60H30

arXiv:1011.3685 [pdf, other]

Multidimensional dynamic risk measure via conditional g-expectation

Authors: Yuhong Xu

Abstract: This paper deals with multidimensional dynamic risk measures induced by conditional $g$-expectations. A notion of multidimensional $g$-expectation is proposed to provide a multidimensional version of nonlinear expectations. By a technical result on explicit expressions for the comparison theorem, uniqueness theorem and viability on a rectangle of solutions to multidimensional backward stochastic d… ▽ More This paper deals with multidimensional dynamic risk measures induced by conditional $g$-expectations. A notion of multidimensional $g$-expectation is proposed to provide a multidimensional version of nonlinear expectations. By a technical result on explicit expressions for the comparison theorem, uniqueness theorem and viability on a rectangle of solutions to multidimensional backward stochastic differential equations, some necessary and sufficient conditions are given for the constancy, monotonicity, positivity, homogeneity and translatability properties of multidimensional conditional $g$-expectations and multidimensional dynamic risk measures; we prove that a multidimensional dynamic $g$-risk measure is nonincreasingly convex if and only if the generator $g$ satisfies a quasi-monotone increasingly convex condition. A general dual representation is given for the multidimensional dynamic convex $g$-risk measure in which the penalty term is expressed more precisely. It is shown that model uncertainty leads to the convexity of risk measures. As to applications, we show how this multidimensional approach can be applied to measure the insolvency risk of a firm with interacted subsidiaries; optimal risk sharing for $\protectγ$-tolerant $g$-risk measures is investigated. Insurance $g$-risk measure and other ways to induce $g$-risk measures are also studied at the end of the paper. △ Less

Submitted 8 March, 2012; v1 submitted 16 November, 2010; originally announced November 2010.

Comments: 37 pages

MSC Class: 60G42; 60G44; 60H10

Showing 1–14 of 14 results for author: Xu, Y