Search | arXiv e-print repository

AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors

Authors: Hao Shi, Cuicui Luo, Weili Song, Xinting Zhang, Xiang Ao

Abstract: The variability and low signal-to-noise ratio in financial data, combined with the necessity for interpretability, make the alpha factor mining workflow a crucial component of quantitative investment. Transitioning from early manual extraction to genetic programming, the most advanced approach in this domain currently employs reinforcement learning to mine a set of combination factors with fixed w… ▽ More The variability and low signal-to-noise ratio in financial data, combined with the necessity for interpretability, make the alpha factor mining workflow a crucial component of quantitative investment. Transitioning from early manual extraction to genetic programming, the most advanced approach in this domain currently employs reinforcement learning to mine a set of combination factors with fixed weights. However, the performance of resultant alpha factors exhibits inconsistency, and the inflexibility of fixed factor weights proves insufficient in adapting to the dynamic nature of financial markets. To address this issue, this paper proposes a two-stage formulaic alpha generating framework AlphaForge, for alpha factor mining and factor combination. This framework employs a generative-predictive neural network to generate factors, leveraging the robust spatial exploration capabilities inherent in deep learning while concurrently preserving diversity. The combination model within the framework incorporates the temporal performance of factors for selection and dynamically adjusts the weights assigned to each component alpha factor. Experiments conducted on real-world datasets demonstrate that our proposed model outperforms contemporary benchmarks in formulaic alpha factor mining. Furthermore, our model exhibits a notable enhancement in portfolio returns within the realm of quantitative investment. △ Less

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.16505 [pdf, other]

$\text{Alpha}^2$: Discovering Logical Formulaic Alphas using Deep Reinforcement Learning

Authors: Feng Xu, Yan Yin, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Zongzhang Zhang

Abstract: Alphas are pivotal in providing signals for quantitative trading. The industry highly values the discovery of formulaic alphas for their interpretability and ease of analysis, compared with the expressive yet overfitting-prone black-box alphas. In this work, we focus on discovering formulaic alphas. Prior studies on automatically generating a collection of formulaic alphas were mostly based on gen… ▽ More Alphas are pivotal in providing signals for quantitative trading. The industry highly values the discovery of formulaic alphas for their interpretability and ease of analysis, compared with the expressive yet overfitting-prone black-box alphas. In this work, we focus on discovering formulaic alphas. Prior studies on automatically generating a collection of formulaic alphas were mostly based on genetic programming (GP), which is known to suffer from the problems of being sensitive to the initial population, converting to local optima, and slow computation speed. Recent efforts employing deep reinforcement learning (DRL) for alpha discovery have not fully addressed key practical considerations such as alpha correlations and validity, which are crucial for their effectiveness. In this work, we propose a novel framework for alpha discovery using DRL by formulating the alpha discovery process as program construction. Our agent, $\text{Alpha}^2$, assembles an alpha program optimized for an evaluation metric. A search algorithm guided by DRL navigates through the search space based on value estimates for potential alpha outcomes. The evaluation metric encourages both the performance and the diversity of alphas for a better final trading strategy. Our formulation of searching alphas also brings the advantage of pre-calculation dimensional analysis, ensuring the logical soundness of alphas, and pruning the vast search space to a large extent. Empirical experiments on real-world stock markets demonstrates $\text{Alpha}^2$'s capability to identify a diverse set of logical and effective alphas, which significantly improves the performance of the final trading strategy. The code of our method is available at https://github.com/x35f/alpha2. △ Less

Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

arXiv:2405.14767 [pdf, other]

FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

Authors: Hongyang Yang, Boyu Zhang, Neng Wang, Cheng Guo, Xiaoli Zhang, Likun Lin, Junlin Wang, Tianyu Zhou, Mao Guan, Runjia Zhang, Christina Dan Wang

Abstract: As financial institutions and professionals increasingly incorporate Large Language Models (LLMs) into their workflows, substantial barriers, including proprietary data and specialized knowledge, persist between the finance sector and the AI community. These challenges impede the AI community's ability to enhance financial tasks effectively. Acknowledging financial analysis's critical role, we aim… ▽ More As financial institutions and professionals increasingly incorporate Large Language Models (LLMs) into their workflows, substantial barriers, including proprietary data and specialized knowledge, persist between the finance sector and the AI community. These challenges impede the AI community's ability to enhance financial tasks effectively. Acknowledging financial analysis's critical role, we aim to devise financial-specialized LLM-based toolchains and democratize access to them through open-source initiatives, promoting wider AI adoption in financial decision-making. In this paper, we introduce FinRobot, a novel open-source AI agent platform supporting multiple financially specialized AI agents, each powered by LLM. Specifically, the platform consists of four major layers: 1) the Financial AI Agents layer that formulates Financial Chain-of-Thought (CoT) by breaking sophisticated financial problems down into logical sequences; 2) the Financial LLM Algorithms layer dynamically configures appropriate model application strategies for specific tasks; 3) the LLMOps and DataOps layer produces accurate models by applying training/fine-tuning techniques and using task-relevant data; 4) the Multi-source LLM Foundation Models layer that integrates various LLMs and enables the above layers to access them directly. Finally, FinRobot provides hands-on for both professional-grade analysts and laypersons to utilize powerful AI techniques for advanced financial analysis. We open-source FinRobot at \url{https://github.com/AI4Finance-Foundation/FinRobot}. △ Less

Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: FinRobot Whitepaper V1.0

arXiv:2405.11431 [pdf, other]

Review of deep learning models for crypto price prediction: implementation and evaluation

Authors: **gyang Wu, Xinyi Zhang, Fangyixuan Huang, Haochen Zhou, Rohtiash Chandra

Abstract: There has been much interest in accurate cryptocurrency price forecast models by investors and researchers. Deep Learning models are prominent machine learning techniques that have transformed various fields and have shown potential for finance and economics. Although various deep learning models have been explored for cryptocurrency price forecasting, it is not clear which models are suitable due… ▽ More There has been much interest in accurate cryptocurrency price forecast models by investors and researchers. Deep Learning models are prominent machine learning techniques that have transformed various fields and have shown potential for finance and economics. Although various deep learning models have been explored for cryptocurrency price forecasting, it is not clear which models are suitable due to high market volatility. In this study, we review the literature about deep learning for cryptocurrency price forecasting and evaluate novel deep learning models for cryptocurrency stock price prediction. Our deep learning models include variants of long short-term memory (LSTM) recurrent neural networks, variants of convolutional neural networks (CNNs), and the Transformer model. We evaluate univariate and multivariate approaches for multi-step ahead predicting of cryptocurrencies close-price. We also carry out volatility analysis on the four cryptocurrencies which reveals significant fluctuations in their prices throughout the COVID-19 pandemic. Additionally, we investigate the prediction accuracy of two scenarios identified by different training sets for the models. First, we use the pre-COVID-19 datasets to model cryptocurrency close-price forecasting during the early period of COVID-19. Secondly, we utilise data from the COVID-19 period to predict prices for 2023 to 2024. Our results show that the convolutional LSTM with a multivariate approach provides the best prediction accuracy in two major experimental settings. Our results also indicate that the multivariate deep learning models exhibit better performance in forecasting four different cryptocurrencies when compared to the univariate models. △ Less

Submitted 2 June, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

arXiv:2309.16196 [pdf, other]

Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data

Authors: Wenting Liu, Zhaozhong Gui, Guilin Jiang, Lihua Tang, Lichun Zhou, Wan Leng, Xulong Zhang, Yujiang Liu

Abstract: With the increasing volume of high-frequency data in the information age, both challenges and opportunities arise in the prediction of stock volatility. On one hand, the outcome of prediction using tradition method combining stock technical and macroeconomic indicators still leaves room for improvement; on the other hand, macroeconomic indicators and peoples' search record on those search engines… ▽ More With the increasing volume of high-frequency data in the information age, both challenges and opportunities arise in the prediction of stock volatility. On one hand, the outcome of prediction using tradition method combining stock technical and macroeconomic indicators still leaves room for improvement; on the other hand, macroeconomic indicators and peoples' search record on those search engines affecting their interested topics will intuitively have an impact on the stock volatility. For the convenience of assessment of the influence of these indicators, macroeconomic indicators and stock technical indicators are then grouped into objective factors, while Baidu search indices implying people's interested topics are defined as subjective factors. To align different frequency data, we introduce GARCH-MIDAS model. After mixing all the above data, we then feed them into Transformer model as part of the training data. Our experiments show that this model outperforms the baselines in terms of mean square error. The adaption of both types of data under Transformer model significantly reduces the mean square error from 1.00 to 0.86. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: Accepted by the 7th APWeb-WAIM International Joint Conference on Web and Big Data. (APWeb 2023)

arXiv:2303.11959 [pdf, other]

Optimizing Trading Strategies in Quantitative Markets using Multi-Agent Reinforcement Learning

Authors: Hengxi Zhang, Zhendong Shi, Yuanquan Hu, Wenbo Ding, Ercan E. Kuruoglu, Xiao-** Zhang

Abstract: Quantitative markets are characterized by swift dynamics and abundant uncertainties, making the pursuit of profit-driven stock trading actions inherently challenging. Within this context, reinforcement learning (RL), which operates on a reward-centric mechanism for optimal control, has surfaced as a potentially effective solution to the intricate financial decision-making conundrums presented. Thi… ▽ More Quantitative markets are characterized by swift dynamics and abundant uncertainties, making the pursuit of profit-driven stock trading actions inherently challenging. Within this context, reinforcement learning (RL), which operates on a reward-centric mechanism for optimal control, has surfaced as a potentially effective solution to the intricate financial decision-making conundrums presented. This paper delves into the fusion of two established financial trading strategies, namely the constant proportion portfolio insurance (CPPI) and the time-invariant portfolio protection (TIPP), with the multi-agent deep deterministic policy gradient (MADDPG) framework. As a result, we introduce two novel multi-agent RL (MARL) methods, CPPI-MADDPG and TIPP-MADDPG, tailored for probing strategic trading within quantitative markets. To validate these innovations, we implemented them on a diverse selection of 100 real-market shares. Our empirical findings reveal that the CPPI-MADDPG and TIPP-MADDPG strategies consistently outpace their traditional counterparts, affirming their efficacy in the realm of quantitative trading. △ Less

Submitted 21 December, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

arXiv:2206.10736 [pdf]

Imitate then Transcend: Multi-Agent Optimal Execution with Dual-Window Denoise PPO

Authors: ** Fang, Jiacheng Weng, Yi Xiang, Xinwen Zhang

Abstract: A novel framework for solving the optimal execution and placement problems using reinforcement learning (RL) with imitation was proposed. The RL agents trained from the proposed framework consistently outperformed the industry benchmark time-weighted average price (TWAP) strategy in execution cost and showed great generalization across out-of-sample trading dates and tickers. The impressive perfor… ▽ More A novel framework for solving the optimal execution and placement problems using reinforcement learning (RL) with imitation was proposed. The RL agents trained from the proposed framework consistently outperformed the industry benchmark time-weighted average price (TWAP) strategy in execution cost and showed great generalization across out-of-sample trading dates and tickers. The impressive performance was achieved from three aspects. First, our RL network architecture called Dual-window Denoise PPO enabled efficient learning in a noisy market environment. Second, a reward scheme with imitation learning was designed, and a comprehensive set of market features was studied. Third, our flexible action formulation allowed the RL agent to tackle optimal execution and placement collectively resulting in better performance than solving individual problems separately. The RL agent's performance was evaluated in our multi-agent realistic historical limit order book simulator in which price impact was accurately assessed. In addition, ablation studies were also performed, confirming the superiority of our framework. △ Less

Submitted 21 June, 2022; originally announced June 2022.

arXiv:2204.11849 [pdf, other]

Heterogeneous Information Network based Default Analysis on Banking Micro and Small Enterprise Users

Authors: Zheng Zhang, Yingsheng Ji, Jiachen Shen, Xi Zhang, Guangwen Yang

Abstract: Risk assessment is a substantial problem for financial institutions that has been extensively studied both for its methodological richness and its various practical applications. With the expansion of inclusive finance, recent attentions are paid to micro and small-sized enterprises (MSEs). Compared with large companies, MSEs present a higher exposure rate to default owing to their insecure financ… ▽ More Risk assessment is a substantial problem for financial institutions that has been extensively studied both for its methodological richness and its various practical applications. With the expansion of inclusive finance, recent attentions are paid to micro and small-sized enterprises (MSEs). Compared with large companies, MSEs present a higher exposure rate to default owing to their insecure financial stability. Conventional efforts learn classifiers from historical data with elaborate feature engineering. However, the main obstacle for MSEs involves severe deficiency in credit-related information, which may degrade the performance of prediction. Besides, financial activities have diverse explicit and implicit relations, which have not been fully exploited for risk judgement in commercial banks. In particular, the observations on real data show that various relationships between company users have additional power in financial risk analysis. In this paper, we consider a graph of banking data, and propose a novel HIDAM model for the purpose. Specifically, we attempt to incorporate heterogeneous information network with rich attributes on multi-typed nodes and links for modeling the scenario of business banking service. To enhance feature representation of MSEs, we extract interactive information through meta-paths and fully exploit path information. Furthermore, we devise a hierarchical attention mechanism respectively to learn the importance of contents inside each meta-path and the importance of different metapahs. Experimental results verify that HIDAM outperforms state-of-the-art competitors on real-world banking data. △ Less

Submitted 2 May, 2022; v1 submitted 24 April, 2022; originally announced April 2022.

Comments: Corrected typos

arXiv:2203.13999 [pdf]

Distributional Robust Portfolio Construction based on Investor Aversion

Authors: Xin Zhang

Abstract: In behavioral finance, aversion affects investors' judgment of future uncertainty when profit and loss occur. Considering investors' aversion to loss and risk, and the ambiguous uncertainty characterizing asset returns, we construct a distributional robust portfolio model (DRP) under the condition that the distribution of risky asset returns is unknown. Specifically, our objective is to find an op… ▽ More In behavioral finance, aversion affects investors' judgment of future uncertainty when profit and loss occur. Considering investors' aversion to loss and risk, and the ambiguous uncertainty characterizing asset returns, we construct a distributional robust portfolio model (DRP) under the condition that the distribution of risky asset returns is unknown. Specifically, our objective is to find an optimal portfolio of assets that maximizes the worst-case utility level on the Wasserstein ball, which is centered on the empirical distribution of sample returns and the radius of the ball quantifies the investor's ambiguity level. The model is also formulated as a mixed-integer quadratic programming problem with cardinality constraints. In addition, we propose a hybrid algorithm to improve the efficiency of the solution and make it more suitable for large-scale problems. The distributional robust portfolio model considering aversion is empirically tested for superior performance in asset allocation, and we also compare common asset allocation strategies to further enhance the credibility of the portfolio. △ Less

Submitted 5 May, 2022; v1 submitted 26 March, 2022; originally announced March 2022.

arXiv:2202.00871 [pdf, other]

Bayesian Imputation with Optimal Look-Ahead-Bias and Variance Tradeoff

Authors: Jose Blanchet, Fernando Hernandez, Viet Anh Nguyen, Markus Pelger, Xuhui Zhang

Abstract: Missing time-series data is a prevalent problem in many prescriptive analytics models in operations management, healthcare and finance. Imputation methods for time-series data are usually applied to the full panel data with the purpose of training a prescriptive model for a downstream out-of-sample task. For example, the imputation of missing asset returns may be applied before estimating an optim… ▽ More Missing time-series data is a prevalent problem in many prescriptive analytics models in operations management, healthcare and finance. Imputation methods for time-series data are usually applied to the full panel data with the purpose of training a prescriptive model for a downstream out-of-sample task. For example, the imputation of missing asset returns may be applied before estimating an optimal portfolio allocation. However, this practice can result in a look-ahead-bias in the future performance of the downstream task, and there is an inherent trade-off between the look-ahead-bias of using the entire data set for imputation and the larger variance of using only the training portion of the data set for imputation. By connecting layers of information revealed in time, we propose a Bayesian consensus posterior that fuses an arbitrary number of posteriors to optimize the variance and look-ahead-bias trade-off in the imputation. We derive tractable two-step optimization procedures for finding the optimal consensus posterior, with Kullback-Leibler divergence and Wasserstein distance as the dissimilarity measure between posterior distributions. We demonstrate in simulations and in an empirical study the benefit of our imputation mechanism for portfolio allocation with missing returns. △ Less

Submitted 11 April, 2023; v1 submitted 1 February, 2022; originally announced February 2022.

Comments: This work merges and supersedes arXiv:2102.12736

arXiv:2201.02958 [pdf, other]

Smooth Nested Simulation: Bridging Cubic and Square Root Convergence Rates in High Dimensions

Authors: Wenjia Wang, Yanyuan Wang, Xiaowei Zhang

Abstract: Nested simulation concerns estimating functionals of a conditional expectation via simulation. In this paper, we propose a new method based on kernel ridge regression to exploit the smoothness of the conditional expectation as a function of the multidimensional conditioning variable. Asymptotic analysis shows that the proposed method can effectively alleviate the curse of dimensionality on the con… ▽ More Nested simulation concerns estimating functionals of a conditional expectation via simulation. In this paper, we propose a new method based on kernel ridge regression to exploit the smoothness of the conditional expectation as a function of the multidimensional conditioning variable. Asymptotic analysis shows that the proposed method can effectively alleviate the curse of dimensionality on the convergence rate as the simulation budget increases, provided that the conditional expectation is sufficiently smooth. The smoothness bridges the gap between the cubic root convergence rate (that is, the optimal rate for the standard nested simulation) and the square root convergence rate (that is, the canonical rate for the standard Monte Carlo simulation). We demonstrate the performance of the proposed method via numerical examples from portfolio risk management and input uncertainty quantification. △ Less

Submitted 11 October, 2023; v1 submitted 9 January, 2022; originally announced January 2022.

Comments: Main body: 46 pages, 5 figures, 5 tables; Supplemental material: 28 pages

arXiv:2201.01874 [pdf, other]

Combining Reinforcement Learning and Inverse Reinforcement Learning for Asset Allocation Recommendations

Authors: Igor Halperin, Jiayu Liu, Xiao Zhang

Abstract: We suggest a simple practical method to combine the human and artificial intelligence to both learn best investment practices of fund managers, and provide recommendations to improve them. Our approach is based on a combination of Inverse Reinforcement Learning (IRL) and RL. First, the IRL component learns the intent of fund managers as suggested by their trading history, and recovers their implie… ▽ More We suggest a simple practical method to combine the human and artificial intelligence to both learn best investment practices of fund managers, and provide recommendations to improve them. Our approach is based on a combination of Inverse Reinforcement Learning (IRL) and RL. First, the IRL component learns the intent of fund managers as suggested by their trading history, and recovers their implied reward function. At the second step, this reward function is used by a direct RL algorithm to optimize asset allocation decisions. We show that our method is able to improve over the performance of individual fund managers. △ Less

Submitted 5 January, 2022; originally announced January 2022.

Comments: 9 pages, 12 figures

arXiv:2201.01026 [pdf, other]

How Does Risk Hedging Impact Operations? Insights from a Price-Setting Newsvendor Model

Authors: Liao Wang, ** Yao, Xiaowei Zhang

Abstract: If a financial asset's price movement impacts a firm's product demand, the firm can respond to the impact by adjusting its operational decisions. For example, in the automotive industry, car makers decrease the selling prices of fuel-inefficient cars when the oil price rises. Meanwhile, the firm can implement a risk-hedging strategy using the financial asset jointly with its operational decisions.… ▽ More If a financial asset's price movement impacts a firm's product demand, the firm can respond to the impact by adjusting its operational decisions. For example, in the automotive industry, car makers decrease the selling prices of fuel-inefficient cars when the oil price rises. Meanwhile, the firm can implement a risk-hedging strategy using the financial asset jointly with its operational decisions. Motivated by this, we develop and solve a general risk-management model integrating risk hedging into a price-setting newsvendor. The optimal hedging strategy is calculated analytically, which leads to an explicit objective function for optimizing price and ``virtual production quantity'' (VPQ). (The latter determines the service level, i.e., the demand fulfillment probability.) We find that hedging generally reduces the optimal price {when the firm sets the target mean return as its production-only maximum expected profit. With the same condition on the target mean return}, hedging also reduces the optimal VPQ when the asset price trend positively impacts product demand; meanwhile, it may increase the VPQ by a small margin when the impact is negative. We construct the return-risk efficient frontier that characterizes the optimal return-risk trade-off. Our numerical study using data from a prominent automotive manufacturer shows that the markdowns in price and reduction in VPQ are small under our model and that the hedging strategy substantially reduces risk without materially reducing operational profit. △ Less

Submitted 20 June, 2023; v1 submitted 4 January, 2022; originally announced January 2022.

Comments: main body: 36 pages, 3 figures, 2 tables; supplmental material: 68 pages

arXiv:2108.04941 [pdf, other]

Arbitrage-Free Implied Volatility Surface Generation with Variational Autoencoders

Authors: Brian Ning, Sebastian Jaimungal, Xiaorong Zhang, Maxime Bergeron

Abstract: We propose a hybrid method for generating arbitrage-free implied volatility (IV) surfaces consistent with historical data by combining model-free Variational Autoencoders (VAEs) with continuous time stochastic differential equation (SDE) driven models. We focus on two classes of SDE models: regime switching models and Lévy additive processes. By projecting historical surfaces onto the space of SDE… ▽ More We propose a hybrid method for generating arbitrage-free implied volatility (IV) surfaces consistent with historical data by combining model-free Variational Autoencoders (VAEs) with continuous time stochastic differential equation (SDE) driven models. We focus on two classes of SDE models: regime switching models and Lévy additive processes. By projecting historical surfaces onto the space of SDE model parameters, we obtain a distribution on the parameter subspace faithful to the data on which we then train a VAE. Arbitrage-free IV surfaces are then generated by sampling from the posterior distribution on the latent space, decoding to obtain SDE model parameters, and finally map** those parameters to IV surfaces. We further refine the VAE model by including conditional features and demonstrate its superior generative out-of-sample performance. △ Less

Submitted 27 January, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

Comments: 20 pages, 7 figures

arXiv:2104.12484 [pdf, other]

Constructing long-short stock portfolio with a new listwise learn-to-rank algorithm

Authors: Xin Zhang, Lan Wu, Zhixue Chen

Abstract: Factor strategies have gained growing popularity in industry with the fast development of machine learning. Usually, multi-factors are fed to an algorithm for some cross-sectional return predictions, which are further used to construct a long-short portfolio. Instead of predicting the value of the stock return, emerging studies predict a ranked stock list using the mature learn-to-rank technology.… ▽ More Factor strategies have gained growing popularity in industry with the fast development of machine learning. Usually, multi-factors are fed to an algorithm for some cross-sectional return predictions, which are further used to construct a long-short portfolio. Instead of predicting the value of the stock return, emerging studies predict a ranked stock list using the mature learn-to-rank technology. In this study, we propose a new listwise learn-to-rank loss function which aims to emphasize both the top and the bottom of a rank list. Our loss function, motivated by the long-short strategy, is endogenously shift-invariant and can be viewed as a direct generalization of ListMLE. Under different transformation functions, our loss can lead to consistency with binary classification loss or permutation level 0-1 loss. A probabilistic explanation for our model is also given as a generalized Plackett-Luce model. Based on a dataset of 68 factors in China A-share market from 2006 to 2019, our empirical study has demonstrated the strength of our method which achieves an out-of-sample annual return of 38% with the Sharpe ratio being 2. △ Less

Submitted 26 April, 2021; originally announced April 2021.

arXiv:2103.11557 [pdf]

Optimal exit decision of venture capital under time-inconsistent preferences

Authors: Yanzhao Li, Ju'e Guo, Yongwu Li, Xu Zhang

Abstract: This paper proposes two kinds of time-inconsistent preferences (i.e. time flow inconsistency and critical time point inconsistency) to further advance the research on the exit decision of venture capital. Time-inconsistent preference, different from time-consistent preference, assumes that decision makers prefer recent returns rather than future returns. Based on venture capitalists' understanding… ▽ More This paper proposes two kinds of time-inconsistent preferences (i.e. time flow inconsistency and critical time point inconsistency) to further advance the research on the exit decision of venture capital. Time-inconsistent preference, different from time-consistent preference, assumes that decision makers prefer recent returns rather than future returns. Based on venture capitalists' understanding of future preferences, we consider four types of venture capitalists, namely time-consistent venture capitalists, venture capitalists who only realize critical time point inconsistency, naive venture capitalists and sophisticated venture capitalists, of which the latter three are time-inconsistent. All types of time-inconsistent venture capitalists are aware of critical time point inconsistency. Naive venture capitalists misunderstand time flow inconsistency while sophisticated ones understand it correctly. We propose an optimal exit timing of venture capital model. Then we derive and compare the above four types of venture capitalists' exit thresholds. The main results are as follows: (1) all types of time-inconsistent venture capitalists tend to exit earlier than time-consistent venture capitalists. (2) The longer the expire date are, the more likely venture capitalists are to delay the exit, but the delay degree decreases successively (venture capitalists who only realize critical time point inconsistency > naive venture capitalists > sophisticated venture capitalists). △ Less

Submitted 21 March, 2021; originally announced March 2021.

arXiv:2012.07368 [pdf, ps, other]

Effective Algorithms for Optimal Portfolio Deleveraging Problem with Cross Impact

Authors: Hezhi Luo, Yuanyuan Chen, Xianye Zhang, Duan Li, Huixian Wu

Abstract: We investigate the optimal portfolio deleveraging (OPD) problem with permanent and temporary price impacts, where the objective is to maximize equity while meeting a prescribed debt/equity requirement. We take the real situation with cross impact among different assets into consideration. The resulting problem is, however, a non-convex quadratic program with a quadratic constraint and a box constr… ▽ More We investigate the optimal portfolio deleveraging (OPD) problem with permanent and temporary price impacts, where the objective is to maximize equity while meeting a prescribed debt/equity requirement. We take the real situation with cross impact among different assets into consideration. The resulting problem is, however, a non-convex quadratic program with a quadratic constraint and a box constraint, which is known to be NP-hard. In this paper, we first develop a successive convex optimization (SCO) approach for solving the OPD problem and show that the SCO algorithm converges to a KKT point of its transformed problem. Second, we propose an effective global algorithm for the OPD problem, which integrates the SCO method, simple convex relaxation and a branch-and-bound framework, to identify a global optimal solution to the OPD problem within a pre-specified $ε$-tolerance. We establish the global convergence of our algorithm and estimate its complexity. We also conduct numerical experiments to demonstrate the effectiveness of our proposed algorithms with both the real data and the randomly generated medium- and large-scale OPD problem instances. △ Less

Submitted 15 January, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

arXiv:1908.06207 [pdf, ps, other]

On non-uniqueness in mean field games

Authors: Erhan Bayraktar, Xin Zhang

Abstract: We analyze an $N+1$-player game and the corresponding mean field game with state space $\{0,1\}$. The transition rate of $j$-th player is the sum of his control $α^j$ plus a minimum jum** rate $η$. Instead of working under monotonicity conditions, here we consider an anti-monotone running cost. We show that the mean field game equation may have multiple solutions if $η< \frac{1}{2}$. We also pro… ▽ More We analyze an $N+1$-player game and the corresponding mean field game with state space $\{0,1\}$. The transition rate of $j$-th player is the sum of his control $α^j$ plus a minimum jum** rate $η$. Instead of working under monotonicity conditions, here we consider an anti-monotone running cost. We show that the mean field game equation may have multiple solutions if $η< \frac{1}{2}$. We also prove that that although multiple solutions exist, only the one coming from the entropy solution is charged (when $η=0$), and therefore resolve a conjecture of ArXiv: 1903.05788. △ Less

Submitted 16 March, 2020; v1 submitted 16 August, 2019; originally announced August 2019.

Comments: To appear in the Proceedings of the AMS. Keywords: Mean field game, Entropy solution, master equation, Nash equilibrium, Non-uniqueness

MSC Class: 60F99; 60J27; 60K36; 93E20

arXiv:1811.00122 [pdf, ps, other]

Affine Jump-Diffusions: Stochastic Stability and Limit Theorems

Authors: Xiaowei Zhang, Peter W. Glynn

Abstract: Affine jump-diffusions constitute a large class of continuous-time stochastic models that are particularly popular in finance and economics due to their analytical tractability. Methods for parameter estimation for such processes require ergodicity in order establish consistency and asymptotic normality of the associated estimators. In this paper, we develop stochastic stability conditions for aff… ▽ More Affine jump-diffusions constitute a large class of continuous-time stochastic models that are particularly popular in finance and economics due to their analytical tractability. Methods for parameter estimation for such processes require ergodicity in order establish consistency and asymptotic normality of the associated estimators. In this paper, we develop stochastic stability conditions for affine jump-diffusions, thereby providing the needed large-sample theoretical support for estimating such processes. We establish ergodicity for such models by imposing a `strong mean reversion' condition and a mild condition on the distribution of the jumps, i.e. the finiteness of a logarithmic moment. Exponential ergodicity holds if the jumps have a finite moment of a positive order. In addition, we prove strong laws of large numbers and functional central limit theorems for additive functionals for this class of models. △ Less

Submitted 31 October, 2018; originally announced November 2018.

arXiv:1809.00306 [pdf, other]

Enhancing Stock Market Prediction with Extended Coupled Hidden Markov Model over Multi-Sourced Data

Authors: Xi Zhang, Yixuan Li, Senzhang Wang, Binxing Fang, Philip S. Yu

Abstract: Traditional stock market prediction methods commonly only utilize the historical trading data, ignoring the fact that stock market fluctuations can be impacted by various other information sources such as stock related events. Although some recent works propose event-driven prediction approaches by considering the event data, how to leverage the joint impacts of multiple data sources still remains… ▽ More Traditional stock market prediction methods commonly only utilize the historical trading data, ignoring the fact that stock market fluctuations can be impacted by various other information sources such as stock related events. Although some recent works propose event-driven prediction approaches by considering the event data, how to leverage the joint impacts of multiple data sources still remains an open research problem. In this work, we study how to explore multiple data sources to improve the performance of the stock prediction. We introduce an Extended Coupled Hidden Markov Model incorporating the news events with the historical trading data. To address the data sparsity issue of news events for each single stock, we further study the fluctuation correlations between the stocks and incorporate the correlations into the model to facilitate the prediction task. Evaluations on China A-share market data in 2016 show the superior performance of our model against previous methods. △ Less

Submitted 2 September, 2018; originally announced September 2018.

Comments: 19 pages

arXiv:1807.08081 [pdf, ps, other]

Optimal Dividend of Compound Poisson Process under a Stochastic Interest Rate

Authors: Linlin Tian, Xiaoyi Zhang

Abstract: In this paper we assume the insurance wealth process is driven by the compound Poisson process. The discounting factor is modelled as a geometric Brownian motion at first and then as an exponential function of an integrated Ornstein-Uhlenbeck process. The objective is to maximize the cumulated value of expected discounted dividends up to the time of ruin. We give an explicit expression of the valu… ▽ More In this paper we assume the insurance wealth process is driven by the compound Poisson process. The discounting factor is modelled as a geometric Brownian motion at first and then as an exponential function of an integrated Ornstein-Uhlenbeck process. The objective is to maximize the cumulated value of expected discounted dividends up to the time of ruin. We give an explicit expression of the value function and the optimal strategy in the case of interest rate following a geometric Brownian motion. For the case of the Vasicek model, we explore some properties of the value function. Since we can not find an explicit expression for the value function in the second case, we prove that the value function is the viscosity solution of the corresponding HJB equation. △ Less

Submitted 20 July, 2018; originally announced July 2018.

Comments: 16 pages, no figures

MSC Class: 93E20; 49Lxx

arXiv:1804.04283 [pdf, ps, other]

Transport plans with domain constraints

Authors: Erhan Bayraktar, Xin Zhang, Zhou Zhou

Abstract: This paper focuses on martingale optimal transport problems when the martingales are assumed to have bounded quadratic variation. First, we give a result that characterizes the existence of a probability measure satisfying some convex transport constraints in addition to having given initial and terminal marginals. Several applications are provided: martingale measures with volatility uncertainty,… ▽ More This paper focuses on martingale optimal transport problems when the martingales are assumed to have bounded quadratic variation. First, we give a result that characterizes the existence of a probability measure satisfying some convex transport constraints in addition to having given initial and terminal marginals. Several applications are provided: martingale measures with volatility uncertainty, optimal transport with capacity constraints, and Skorokhod embedding with bounded times. Next, we extend this result to multi-marginal constraints. Finally, we consider an optimal transport problem with constraints and obtain its Kantorovich duality. A corollary of this result is a monotonicity principle which gives a geometric way of identifying the optimizer. △ Less

Submitted 16 March, 2020; v1 submitted 11 April, 2018; originally announced April 2018.

Comments: To appear in Applied Mathematics and Optimization. Keywords:Strassen's Theorem, Kellerer's Theorem, Martingale optimal transport, domain constraints, bounded volatility/quadratic variation, $G$-expectations, Kantorovich duality, monotonicity principle

arXiv:1801.00597 [pdf, other]

doi 10.1016/j.jocs.2017.10.013

Exploiting Investors Social Network for Stock Prediction in China's Market

Authors: Xi Zhang, Jiawei Shi, Di Wang, Binxing Fang

Abstract: Recent works have shown that social media platforms are able to influence the trends of stock price movements. However, existing works have majorly focused on the U.S. stock market and lacked attention to certain emerging countries such as China, where retail investors dominate the market. In this regard, as retail investors are prone to be influenced by news or other social media, psychological a… ▽ More Recent works have shown that social media platforms are able to influence the trends of stock price movements. However, existing works have majorly focused on the U.S. stock market and lacked attention to certain emerging countries such as China, where retail investors dominate the market. In this regard, as retail investors are prone to be influenced by news or other social media, psychological and behavioral features extracted from social media platforms are thought to well predict stock price movements in the China's market. Recent advances in the investor social network in China enables the extraction of such features from web-scale data. In this paper, on the basis of tweets from Xueqiu, a popular Chinese Twitter-like social platform specialized for investors, we analyze features with regard to collective sentiment and perception on stock relatedness and predict stock price movements by employing nonlinear models. The features of interest prove to be effective in our experiments. △ Less

Submitted 2 January, 2018; originally announced January 2018.

Comments: accepted by Journal of Computational Science

arXiv:1801.00588 [pdf, other]

doi 10.1016/j.knosys.2017.12.025

Improving Stock Market Prediction via Heterogeneous Information Fusion

Authors: Xi Zhang, Yunjia Zhang, Senzhang Wang, Yuntao Yao, Binxing Fang, Philip S. Yu

Abstract: Traditional stock market prediction approaches commonly utilize the historical price-related data of the stocks to forecast their future trends. As the Web information grows, recently some works try to explore financial news to improve the prediction. Effective indicators, e.g., the events related to the stocks and the people's sentiments towards the market and stocks, have been proved to play imp… ▽ More Traditional stock market prediction approaches commonly utilize the historical price-related data of the stocks to forecast their future trends. As the Web information grows, recently some works try to explore financial news to improve the prediction. Effective indicators, e.g., the events related to the stocks and the people's sentiments towards the market and stocks, have been proved to play important roles in the stocks' volatility, and are extracted to feed into the prediction models for improving the prediction accuracy. However, a major limitation of previous methods is that the indicators are obtained from only a single source whose reliability might be low, or from several data sources but their interactions and correlations among the multi-sourced data are largely ignored. In this work, we extract the events from Web news and the users' sentiments from social media, and investigate their joint impacts on the stock price movements via a coupled matrix and tensor factorization framework. Specifically, a tensor is firstly constructed to fuse heterogeneous data and capture the intrinsic relations among the events and the investors' sentiments. Due to the sparsity of the tensor, two auxiliary matrices, the stock quantitative feature matrix and the stock correlation matrix, are constructed and incorporated to assist the tensor decomposition. The intuition behind is that stocks that are highly correlated with each other tend to be affected by the same event. Thus, instead of conducting each stock prediction task separately and independently, we predict multiple correlated stocks simultaneously through their commonalities, which are enabled via sharing the collaboratively factorized low rank matrices between matrices and the tensor. Evaluations on the China A-share stock data and the HK stock data in the year 2015 demonstrate the effectiveness of the proposed model. △ Less

Submitted 2 January, 2018; originally announced January 2018.

Comments: Accepted by Knowledge-Based Systems

arXiv:1505.05256 [pdf, other]

Small-time asymptotics for Gaussian self-similar stochastic volatility models

Authors: Archil Gulisashvili, Frederi Viens, Xin Zhang

Abstract: We consider the class of self-similar Gaussian stochastic volatility models, and compute the small-time (near-maturity) asymptotics for the corresponding asset price density, the call and put pricing functions, and the implied volatilities. Unlike the well-known model-free behavior for extreme-strike asymptotics, small-time behaviors of the above depend heavily on the model, and require a control… ▽ More We consider the class of self-similar Gaussian stochastic volatility models, and compute the small-time (near-maturity) asymptotics for the corresponding asset price density, the call and put pricing functions, and the implied volatilities. Unlike the well-known model-free behavior for extreme-strike asymptotics, small-time behaviors of the above depend heavily on the model, and require a control of the asset price density which is uniform with respect to the asset price variable, in order to translate into results for call prices and implied volatilities. Away from the money, we express the asymptotics explicitly using the volatility process' self-similarity parameter $H$, its first Karhunen-Loeve eigenvalue at time 1, and the latter's multiplicity. Several model-free estimators for $H$ result. At the money, a separate study is required: the asymptotics for small time depend instead on the integrated variance's moments of orders 1/2 and 3/2, and the estimator for $H$ sees an affine adjustment, while remaining model-free. △ Less

Submitted 14 March, 2016; v1 submitted 20 May, 2015; originally announced May 2015.

Comments: 40 pages, 6 included pdf images

MSC Class: 60G15; 91G20; 40E05

arXiv:1502.05442 [pdf, other]

Extreme-Strike Asymptotics for General Gaussian Stochastic Volatility Models

Authors: Archil Gulisashvili, Frederi Viens, Xin Zhang

Abstract: We consider a stochastic volatility asset price model in which the volatility is the absolute value of a continuous Gaussian process with arbitrary prescribed mean and covariance. By exhibiting a Karhunen-Loève expansion for the integrated variance, and using sharp estimates of the density of a general second-chaos variable, we derive asymptotics for the asset price density for large or small valu… ▽ More We consider a stochastic volatility asset price model in which the volatility is the absolute value of a continuous Gaussian process with arbitrary prescribed mean and covariance. By exhibiting a Karhunen-Loève expansion for the integrated variance, and using sharp estimates of the density of a general second-chaos variable, we derive asymptotics for the asset price density for large or small values of the variable, and study the wing behavior of the implied volatility in these models. Our main result provides explicit expressions for the first five terms in the expansion of the implied volatility. The expressions for the leading three terms are simple, and based on three basic spectral-type statistics of the Gaussian process: the top eigenvalue of its covariance operator, the multiplicity of this eigenvalue, and the $L^{2}$ norm of the projection of the mean function on the top eigenspace. The fourth term requires knowledge of all eigen-elements. We present detailed numerics based on realistic liquidity assumptions in which classical and long-memory volatility models are calibrated based on our expansion. △ Less

Submitted 6 February, 2017; v1 submitted 18 February, 2015; originally announced February 2015.

Comments: 38 pages, 12 figures

MSC Class: 60G15; 91G20; 40E05

arXiv:1406.7606 [pdf, ps, other]

Optimal Hybrid Dividend Strategy Under The Markovian Regime-Switching Economy

Authors: Xiaoxiao Zheng, Xin Zhang

Abstract: In this paper, we consider the optimal dividend problem for a company. We describe the surplus process of the company by a diffusion model with regime switching. The aim of the company is to choose a dividend policy to maximize the expected total discounted payments until ruin. In this article, we consider a hybrid dividend strategy, that is, the company is allowed to conduct continuous dividend s… ▽ More In this paper, we consider the optimal dividend problem for a company. We describe the surplus process of the company by a diffusion model with regime switching. The aim of the company is to choose a dividend policy to maximize the expected total discounted payments until ruin. In this article, we consider a hybrid dividend strategy, that is, the company is allowed to conduct continuous dividend strategy as well as impulsive dividend strategy. In addition, we consider the change of economy, which is characterized by a markovian regime-switching, and under the setting of two regimes, we solve the problem and obtain the analytical solution for the value function. △ Less

Submitted 30 June, 2014; originally announced June 2014.

arXiv:1406.7604 [pdf, other]

Optimal investment-reinsurance policy under a long-term perspective

Authors: Xiaoxiao Zheng, Xin Zhang

Abstract: In this paper, we assume an insure is allowed to purchase proportional reinsurance and can invest his or her wealth into the financial market where a savings account, stocks and bonds are available. Different from classical optimal investment and reinsurance problem, this paper studies the insurer's long-term investment decision. Under this setting, our model consider the interest risk and the inf… ▽ More In this paper, we assume an insure is allowed to purchase proportional reinsurance and can invest his or her wealth into the financial market where a savings account, stocks and bonds are available. Different from classical optimal investment and reinsurance problem, this paper studies the insurer's long-term investment decision. Under this setting, our model consider the interest risk and the inflation risk. Specifically, we suppose the interest rate follows a stochastic process, while price index is described by a classical model. By solving Hamilton-Jacobi-Bellman equation, the closed-form expression of the optimal policy is obtained. Further, we prove the corresponding verification theorem without the usual Lipschitz condition. In the end, numerical examples are made to illustrate the difference of the optimal polices under Ho-lee model and Vasicek model. △ Less

Submitted 30 June, 2014; originally announced June 2014.

Showing 1–28 of 28 results for author: Zhang, X