-
A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges
Authors:
Yuqi Nie,
Yaxuan Kong,
Xiaowen Dong,
John M. Mulvey,
H. Vincent Poor,
Qingsong Wen,
Stefan Zohren
Abstract:
Recent advances in large language models (LLMs) have unlocked novel opportunities for machine learning applications in the financial domain. These models have demonstrated remarkable capabilities in understanding context, processing vast amounts of data, and generating human-preferred contents. In this survey, we explore the application of LLMs on various financial tasks, focusing on their potenti…
▽ More
Recent advances in large language models (LLMs) have unlocked novel opportunities for machine learning applications in the financial domain. These models have demonstrated remarkable capabilities in understanding context, processing vast amounts of data, and generating human-preferred contents. In this survey, we explore the application of LLMs on various financial tasks, focusing on their potential to transform traditional practices and drive innovation. We provide a discussion of the progress and advantages of LLMs in financial contexts, analyzing their advanced technologies as well as prospective capabilities in contextual understanding, transfer learning flexibility, complex emotion detection, etc. We then highlight this survey for categorizing the existing literature into key application areas, including linguistic tasks, sentiment analysis, financial time series, financial reasoning, agent-based modeling, and other applications. For each application area, we delve into specific methodologies, such as textual analysis, knowledge-based analysis, forecasting, data augmentation, planning, decision support, and simulations. Furthermore, a comprehensive collection of datasets, model assets, and useful codes associated with mainstream applications are presented as resources for the researchers and practitioners. Finally, we outline the challenges and opportunities for future research, particularly emphasizing a number of distinctive aspects in this field. We hope our work can help facilitate the adoption and further development of LLMs in the financial sector.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
Dynamic Asset Allocation with Asset-Specific Regime Forecasts
Authors:
Yizhan Shu,
Chenyu Yu,
John M. Mulvey
Abstract:
This article introduces a novel hybrid regime identification-forecasting framework designed to enhance multi-asset portfolio construction by integrating asset-specific regime forecasts. Unlike traditional approaches that focus on broad economic regimes affecting the entire asset universe, our framework leverages both unsunpervised and supervised learning to generate tailored regime forecasts for i…
▽ More
This article introduces a novel hybrid regime identification-forecasting framework designed to enhance multi-asset portfolio construction by integrating asset-specific regime forecasts. Unlike traditional approaches that focus on broad economic regimes affecting the entire asset universe, our framework leverages both unsunpervised and supervised learning to generate tailored regime forecasts for individual assets. Initially, we use the statistical jump model, a robust unsupervised regime identification model, to derive regime labels for historical periods, classifying them into bullish or bearish states based on features extracted from an asset return series. Following this, a supervised gradient-boosted decision tree classifier is trained to predict these regimes using a combination of asset-specific return features and cross-asset macro-features. We apply this framework individually to each asset in our universe. Subsequently, return and risk forecasts which incorporate these regime predictions are input into Markowitz mean-variance optimization to determine optimal asset allocation weights. We demonstrate the efficacy of our approach through an empirical study on a multi-asset portfolio comprising twelve risky assets, including global equity, bond, real estate, and commodity indexes spanning from 1991 to 2023. The results consistently show outperformance across various portfolio models, including minimum-variance, mean-variance, and naive-diversified portfolios, highlighting the advantages of integrating asset-specific regime forecasts into dynamic asset allocation.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Downside Risk Reduction Using Regime-Switching Signals: A Statistical Jump Model Approach
Authors:
Yizhan Shu,
Chenyu Yu,
John M. Mulvey
Abstract:
This article investigates a regime-switching investment strategy aimed at mitigating downside risk by reducing market exposure during anticipated unfavorable market regimes. We highlight the statistical jump model (JM) for market regime identification, a recently developed robust model that distinguishes itself from traditional Markov-switching models by enhancing regime persistence through a jump…
▽ More
This article investigates a regime-switching investment strategy aimed at mitigating downside risk by reducing market exposure during anticipated unfavorable market regimes. We highlight the statistical jump model (JM) for market regime identification, a recently developed robust model that distinguishes itself from traditional Markov-switching models by enhancing regime persistence through a jump penalty applied at each state transition. Our JM utilizes a feature set comprising risk and return measures derived solely from the return series, with the optimal jump penalty selected through a time-series cross-validation method that directly optimizes strategy performance. Our empirical analysis evaluates the realistic out-of-sample performance of various strategies on major equity indices from the US, Germany, and Japan from 1990 to 2023, in the presence of transaction costs and trading delays. The results demonstrate the consistent outperformance of the JM-guided strategy in reducing risk metrics such as volatility and maximum drawdown, and enhancing risk-adjusted returns like the Sharpe ratio, when compared to both hidden Markov model-guided strategy and the buy-and-hold strategy. These findings underline the enhanced persistence, practicality, and versatility of strategies utilizing JMs for regime-switching signals.
△ Less
Submitted 10 July, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
Optimal Portfolio Execution in a Regime-switching Market with Non-linear Impact Costs: Combining Dynamic Program and Neural Network
Authors:
Xiaoyue Li,
John M. Mulvey
Abstract:
Optimal execution of a portfolio have been a challenging problem for institutional investors. Traders face the trade-off between average trading price and uncertainty, and traditional methods suffer from the curse of dimensionality. Here, we propose a four-step numerical framework for the optimal portfolio execution problem where multiple market regimes exist, with the underlying regime switching…
▽ More
Optimal execution of a portfolio have been a challenging problem for institutional investors. Traders face the trade-off between average trading price and uncertainty, and traditional methods suffer from the curse of dimensionality. Here, we propose a four-step numerical framework for the optimal portfolio execution problem where multiple market regimes exist, with the underlying regime switching based on a Markov process. The market impact costs are modelled with a temporary part and a permanent part, where the former affects only the current trade while the latter persists. Our approach accepts impact cost functions in generic forms. First, we calculate the approximated orthogonal portfolios based on estimated impact cost functions; second, we employ dynamic program to learn the optimal selling schedule of each approximated orthogonal portfolio; third, weights of a neural network are pre-trained with the strategy suggested by previous step; last, we train the neural network to optimize on the original trading model. In our experiment of a 10-asset liquidation example with quadratic impact costs, the proposed combined method provides promising selling strategy for both CRRA (constant relative risk aversion) and mean-variance objectives. The running time is linear in the number of risky assets in the portfolio as well as in the number of trading periods. Possible improvements in running time are discussed for potential large-scale usages.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
Solving Multi-Period Financial Planning Models: Combining Monte Carlo Tree Search and Neural Networks
Authors:
Afşar Onat Aydınhan,
Xiaoyue Li,
John M. Mulvey
Abstract:
This paper introduces the MCTS algorithm to the financial world and focuses on solving significant multi-period financial planning models by combining a Monte Carlo Tree Search algorithm with a deep neural network. The MCTS provides an advanced start for the neural network so that the combined method outperforms either approach alone, yielding competitive results. Several innovations improve the c…
▽ More
This paper introduces the MCTS algorithm to the financial world and focuses on solving significant multi-period financial planning models by combining a Monte Carlo Tree Search algorithm with a deep neural network. The MCTS provides an advanced start for the neural network so that the combined method outperforms either approach alone, yielding competitive results. Several innovations improve the computations, including a variant of the upper confidence bound applied to trees (UTC) and a special lookup search. We compare the two-step algorithm with employing dynamic programs/neural networks. Both approaches solve regime switching models with 50-time steps and transaction costs with twelve asset categories. Heretofore, these problems have been outside the range of solvable optimization models via traditional algorithms.
△ Less
Submitted 17 May, 2022; v1 submitted 15 February, 2022;
originally announced February 2022.
-
End-to-End Risk Budgeting Portfolio Optimization with Neural Networks
Authors:
Ayse Sinem Uysal,
Xiaoyue Li,
John M. Mulvey
Abstract:
Portfolio optimization has been a central problem in finance, often approached with two steps: calibrating the parameters and then solving an optimization problem. Yet, the two-step procedure sometimes encounter the "error maximization" problem where inaccuracy in parameter estimation translates to unwise allocation decisions. In this paper, we combine the prediction and optimization tasks in a si…
▽ More
Portfolio optimization has been a central problem in finance, often approached with two steps: calibrating the parameters and then solving an optimization problem. Yet, the two-step procedure sometimes encounter the "error maximization" problem where inaccuracy in parameter estimation translates to unwise allocation decisions. In this paper, we combine the prediction and optimization tasks in a single feed-forward neural network and implement an end-to-end approach, where we learn the portfolio allocation directly from the input features. Two end-to-end portfolio constructions are included: a model-free network and a model-based network. The model-free approach is seen as a black-box, whereas in the model-based approach, we learn the optimal risk contribution on the assets and solve the allocation with an implicit optimization layer embedded in the neural network. The model-based end-to-end framework provides robust performance in the out-of-sample (2017-2021) tests when maximizing Sharpe ratio is used as the training objective function, achieving a Sharpe ratio of 1.16 when nominal risk parity yields 0.79 and equal-weight fix-mix yields 0.83. Noticing that risk-based portfolios can be sensitive to the underlying asset universe, we develop an asset selection mechanism embedded in the neural network with stochastic gates, in order to prevent the portfolio being hurt by the low-volatility assets with low returns. The gated end-to-end with filter outperforms the nominal risk-parity benchmarks with naive filtering mechanism, boosting the Sharpe ratio of the out-of-sample period (2017-2021) to 1.24 in the market data.
△ Less
Submitted 9 July, 2021;
originally announced July 2021.
-
PoBRL: Optimizing Multi-Document Summarization by Blending Reinforcement Learning Policies
Authors:
Andy Su,
Difei Su,
John M. Mulvey,
H. Vincent Poor
Abstract:
We propose a novel reinforcement learning based framework PoBRL for solving multi-document summarization. PoBRL jointly optimizes over the following three objectives necessary for a high-quality summary: importance, relevance, and length. Our strategy decouples this multi-objective optimization into different subproblems that can be solved individually by reinforcement learning. Utilizing PoBRL, w…
▽ More
We propose a novel reinforcement learning based framework PoBRL for solving multi-document summarization. PoBRL jointly optimizes over the following three objectives necessary for a high-quality summary: importance, relevance, and length. Our strategy decouples this multi-objective optimization into different subproblems that can be solved individually by reinforcement learning. Utilizing PoBRL, we then blend each learned policies together to produce a summary that is a concise and complete representation of the original input. Our empirical analysis shows state-of-the-art performance on several multi-document datasets. Human evaluation also shows that our method produces high-quality output.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
Multi-Period Portfolio Optimization using Model Predictive Control with Mean-Variance and Risk Parity Frameworks
Authors:
Xiaoyue Li,
A. Sinem Uysal,
John M. Mulvey
Abstract:
We employ model predictive control for a multi-period portfolio optimization problem. In addition to the mean-variance objective, we construct a portfolio whose allocation is given by model predictive control with a risk-parity objective, and provide a successive convex program algorithm that provides 30 times faster and robust solutions in the experiments. Computational results on the multi-asset…
▽ More
We employ model predictive control for a multi-period portfolio optimization problem. In addition to the mean-variance objective, we construct a portfolio whose allocation is given by model predictive control with a risk-parity objective, and provide a successive convex program algorithm that provides 30 times faster and robust solutions in the experiments. Computational results on the multi-asset universe show that multi-period models perform better than their single period counterparts in out-of-sample period, 2006-2020. The out-of-sample risk-adjusted performance of both mean-variance and risk-parity formulations beat the fix-mix benchmark, and achieve Sharpe ratio of 0.64 and 0.97, respectively.
△ Less
Submitted 19 March, 2021;
originally announced March 2021.
-
MUSBO: Model-based Uncertainty Regularized and Sample Efficient Batch Optimization for Deployment Constrained Reinforcement Learning
Authors:
DiJia Su,
Jason D. Lee,
John M. Mulvey,
H. Vincent Poor
Abstract:
In many contemporary applications such as healthcare, finance, robotics, and recommendation systems, continuous deployment of new policies for data collection and online learning is either cost ineffective or impractical. We consider a setting that lies between pure offline reinforcement learning (RL) and pure online RL called deployment constrained RL in which the number of policy deployments for…
▽ More
In many contemporary applications such as healthcare, finance, robotics, and recommendation systems, continuous deployment of new policies for data collection and online learning is either cost ineffective or impractical. We consider a setting that lies between pure offline reinforcement learning (RL) and pure online RL called deployment constrained RL in which the number of policy deployments for data sampling is limited. To solve this challenging task, we propose a new algorithmic learning framework called Model-based Uncertainty regularized and Sample Efficient Batch Optimization (MUSBO). Our framework discovers novel and high quality samples for each deployment to enable efficient data collection. During each offline training session, we bootstrap the policy update by quantifying the amount of uncertainty within our collected data. In the high support region (low uncertainty), we encourage our policy by taking an aggressive update. In the low support region (high uncertainty) when the policy bootstraps into the out-of-distribution region, we downweight it by our estimated uncertainty quantification. Experimental results show that MUSBO achieves state-of-the-art performance in the deployment constrained RL setting.
△ Less
Submitted 3 June, 2021; v1 submitted 22 February, 2021;
originally announced February 2021.