-
Predicting Failure of P2P Lending Platforms through Machine Learning: The Case in China
Authors:
Jen-Yin Yeh,
Hsin-Yu Chiu,
Jhih-Huei Huang
Abstract:
This study employs machine learning models to predict the failure of Peer-to-Peer (P2P) lending platforms, specifically in China. By employing the filter method and wrapper method with forward selection and backward elimination, we establish a rigorous and practical procedure that ensures the robustness and importance of variables in predicting platform failures. The research identifies a set of r…
▽ More
This study employs machine learning models to predict the failure of Peer-to-Peer (P2P) lending platforms, specifically in China. By employing the filter method and wrapper method with forward selection and backward elimination, we establish a rigorous and practical procedure that ensures the robustness and importance of variables in predicting platform failures. The research identifies a set of robust variables that consistently appear in the feature subsets across different selection methods and models, suggesting their reliability and relevance in predicting platform failures. The study highlights that reducing the number of variables in the feature subset leads to an increase in the false acceptance rate while the performance metrics remain stable, with an AUC value of approximately 0.96 and an F1 score of around 0.88. The findings of this research provide significant practical implications for regulatory authorities and investors operating in the Chinese P2P lending industry.
△ Less
Submitted 24 November, 2023;
originally announced November 2023.
-
The Wall Street Neophyte: A Zero-Shot Analysis of ChatGPT Over MultiModal Stock Movement Prediction Challenges
Authors:
Qianqian Xie,
Weiguang Han,
Yanzhao Lai,
Min Peng,
Jimin Huang
Abstract:
Recently, large language models (LLMs) like ChatGPT have demonstrated remarkable performance across a variety of natural language processing tasks. However, their effectiveness in the financial domain, specifically in predicting stock market movements, remains to be explored. In this paper, we conduct an extensive zero-shot analysis of ChatGPT's capabilities in multimodal stock movement prediction…
▽ More
Recently, large language models (LLMs) like ChatGPT have demonstrated remarkable performance across a variety of natural language processing tasks. However, their effectiveness in the financial domain, specifically in predicting stock market movements, remains to be explored. In this paper, we conduct an extensive zero-shot analysis of ChatGPT's capabilities in multimodal stock movement prediction, on three tweets and historical stock price datasets. Our findings indicate that ChatGPT is a "Wall Street Neophyte" with limited success in predicting stock movements, as it underperforms not only state-of-the-art methods but also traditional methods like linear regression using price features. Despite the potential of Chain-of-Thought prompting strategies and the inclusion of tweets, ChatGPT's performance remains subpar. Furthermore, we observe limitations in its explainability and stability, suggesting the need for more specialized training or fine-tuning. This research provides insights into ChatGPT's capabilities and serves as a foundation for future work aimed at improving financial market analysis and prediction by leveraging social media sentiment and historical stock data.
△ Less
Submitted 28 April, 2023; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Mastering Pair Trading with Risk-Aware Recurrent Reinforcement Learning
Authors:
Weiguang Han,
Jimin Huang,
Qianqian Xie,
Boyi Zhang,
Yanzhao Lai,
Min Peng
Abstract:
Although pair trading is the simplest hedging strategy for an investor to eliminate market risk, it is still a great challenge for reinforcement learning (RL) methods to perform pair trading as human expertise. It requires RL methods to make thousands of correct actions that nevertheless have no obvious relations to the overall trading profit, and to reason over infinite states of the time-varying…
▽ More
Although pair trading is the simplest hedging strategy for an investor to eliminate market risk, it is still a great challenge for reinforcement learning (RL) methods to perform pair trading as human expertise. It requires RL methods to make thousands of correct actions that nevertheless have no obvious relations to the overall trading profit, and to reason over infinite states of the time-varying market most of which have never appeared in history. However, existing RL methods ignore the temporal connections between asset price movements and the risk of the performed trading. These lead to frequent tradings with high transaction costs and potential losses, which barely reach the human expertise level of trading. Therefore, we introduce CREDIT, a risk-aware agent capable of learning to exploit long-term trading opportunities in pair trading similar to a human expert. CREDIT is the first to apply bidirectional GRU along with the temporal attention mechanism to fully consider the temporal correlations embedded in the states, which allows CREDIT to capture long-term patterns of the price movements of two assets to earn higher profit. We also design the risk-aware reward inspired by the economic theory, that models both the profit and risk of the tradings during the trading period. It helps our agent to master pair trading with a robust trading preference that avoids risky trading with possible high returns and losses. Experiments show that it outperforms existing reinforcement learning methods in pair trading and achieves a significant profit over five years of U.S. stock data.
△ Less
Submitted 1 April, 2023;
originally announced April 2023.
-
Company Competition Graph
Authors:
Yanci Zhang,
Yutong Lu,
Haitao Mao,
Jiawei Huang,
Cien Zhang,
Xinyi Li,
Rui Dai
Abstract:
Financial market participants frequently rely on numerous business relationships to make investment decisions. Investors can learn about potential risks and opportunities associated with other connected entities through these corporate connections. Nonetheless, human annotation of a large corpus to extract such relationships is highly time-consuming, not to mention that it requires a considerable…
▽ More
Financial market participants frequently rely on numerous business relationships to make investment decisions. Investors can learn about potential risks and opportunities associated with other connected entities through these corporate connections. Nonetheless, human annotation of a large corpus to extract such relationships is highly time-consuming, not to mention that it requires a considerable amount of industry expertise and professional training. Meanwhile, we have yet to observe means to generate reliable knowledge graphs of corporate relationships due to the lack of impartial and granular data sources. This study proposes a system to process financial reports and construct the public competitor graph to fill the void. Our method can retrieve more than 83\% competition relationship of the S\&P 500 index companies. Based on the output from our system, we construct a knowledge graph with more than 700 nodes and 1200 edges. A demo interactive graph interface is available.
△ Less
Submitted 1 April, 2023;
originally announced April 2023.
-
Select and Trade: Towards Unified Pair Trading with Hierarchical Reinforcement Learning
Authors:
Weiguang Han,
Boyi Zhang,
Qianqian Xie,
Min Peng,
Yanzhao Lai,
Jimin Huang
Abstract:
Pair trading is one of the most effective statistical arbitrage strategies which seeks a neutral profit by hedging a pair of selected assets. Existing methods generally decompose the task into two separate steps: pair selection and trading. However, the decoupling of two closely related subtasks can block information propagation and lead to limited overall performance. For pair selection, ignoring…
▽ More
Pair trading is one of the most effective statistical arbitrage strategies which seeks a neutral profit by hedging a pair of selected assets. Existing methods generally decompose the task into two separate steps: pair selection and trading. However, the decoupling of two closely related subtasks can block information propagation and lead to limited overall performance. For pair selection, ignoring the trading performance results in the wrong assets being selected with irrelevant price movements, while the agent trained for trading can overfit to the selected assets without any historical information of other assets. To address it, in this paper, we propose a paradigm for automatic pair trading as a unified task rather than a two-step pipeline. We design a hierarchical reinforcement learning framework to jointly learn and optimize two subtasks. A high-level policy would select two assets from all possible combinations and a low-level policy would then perform a series of trading actions. Experimental results on real-world stock data demonstrate the effectiveness of our method on pair trading compared with both existing pair selection and trading methods.
△ Less
Submitted 5 February, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Double-jump stochastic volatility model for VIX: evidence from VVIX
Authors:
Xin Zang,
Jun Ni,
**g-Zhi Huang,
Lan Wu
Abstract:
The paper studies the continuous-time dynamics of VIX with stochastic volatility and jumps in VIX and volatility. Built on the general parametric affine model with stochastic volatility and jump in logarithm of VIX, we derive a linear relation between the stochastic volatility factor and VVIX index. We detect the existence of co-jump of VIX and VVIX and put forward a double-jump stochastic volatil…
▽ More
The paper studies the continuous-time dynamics of VIX with stochastic volatility and jumps in VIX and volatility. Built on the general parametric affine model with stochastic volatility and jump in logarithm of VIX, we derive a linear relation between the stochastic volatility factor and VVIX index. We detect the existence of co-jump of VIX and VVIX and put forward a double-jump stochastic volatility model for VIX through its joint property with VVIX. With VVIX index as a proxy for the stochastic volatility, we use MCMC method to estimate the dynamics of VIX. Comparing nested models on VIX, we show the jump in VIX and the volatility factor is statistically significant. The jump intensity is also statedependent. We analyze the impact of jump factor on the VIX dynamics.
△ Less
Submitted 1 July, 2015; v1 submitted 24 June, 2015;
originally announced June 2015.
-
Optimal dual martingales, their analysis and application to new algorithms for Bermudan products
Authors:
John Schoenmakers,
Junbo Huang,
Jianing Zhang
Abstract:
In this paper we introduce and study the concept of optimal and surely optimal dual martingales in the context of dual valuation of Bermudan options, and outline the development of new algorithms in this context. We provide a characterization theorem, a theorem which gives conditions for a martingale to be surely optimal, and a stability theorem concerning martingales which are near to be surely o…
▽ More
In this paper we introduce and study the concept of optimal and surely optimal dual martingales in the context of dual valuation of Bermudan options, and outline the development of new algorithms in this context. We provide a characterization theorem, a theorem which gives conditions for a martingale to be surely optimal, and a stability theorem concerning martingales which are near to be surely optimal in a sense. Guided by these results we develop a framework of backward algorithms for constructing such a martingale. In turn this martingale may then be utilized for computing an upper bound of the Bermudan product. The methodology is pure dual in the sense that it doesn't require certain input approximations to the Snell envelope. In an Itô-Lévy environment we outline a particular regression based backward algorithm which allows for computing dual upper bounds without nested Monte Carlo simulation. Moreover, as a by-product this algorithm also provides approximations to the continuation values of the product, which in turn determine a stop** policy. Hence, we may obtain lower bounds at the same time. In a first numerical study we demonstrate the backward dual regression algorithm in a Wiener environment at well known benchmark examples. It turns out that the method is at least comparable to the one in Belomestny et. al. (2009) regarding accuracy, but regarding computational robustness there are even several advantages.
△ Less
Submitted 13 February, 2012; v1 submitted 25 November, 2011;
originally announced November 2011.
-
Optimal dividend and investing control of a insurance company with higher solvency constraints
Authors:
Zongxia Liang,
Jian** Huang
Abstract:
This paper considers optimal control problem of a large insurance company under a fixed insolvency probability. The company controls proportional reinsurance rate, dividend pay-outs and investing process to maximize the expected present value of the dividend pay-outs until the time of bankruptcy. This paper aims at describing the optimal return function as well as the optimal policy. As a by-produ…
▽ More
This paper considers optimal control problem of a large insurance company under a fixed insolvency probability. The company controls proportional reinsurance rate, dividend pay-outs and investing process to maximize the expected present value of the dividend pay-outs until the time of bankruptcy. This paper aims at describing the optimal return function as well as the optimal policy. As a by-product, the paper theoretically sets a risk-based capital standard to ensure the capital requirement of can cover the total risk.
△ Less
Submitted 31 May, 2010; v1 submitted 8 May, 2010;
originally announced May 2010.