-
A Survey of Large Language Models in Finance (FinLLMs)
Authors:
Jean Lee,
Nicholas Stevens,
Soyeon Caren Han,
Minseok Song
Abstract:
Large Language Models (LLMs) have shown remarkable capabilities across a wide variety of Natural Language Processing (NLP) tasks and have attracted attention from multiple domains, including financial services. Despite the extensive research into general-domain LLMs, and their immense potential in finance, Financial LLM (FinLLM) research remains limited. This survey provides a comprehensive overview of FinLLMs, including their history, techniques, performance, and opportunities and challenges. Firstly, we present a chronological overview of general-domain Pre-trained Language Models (PLMs) through to current FinLLMs, including the GPT-series, selected open-source LLMs, and financial LMs. Secondly, we compare five techniques used across financial PLMs and FinLLMs, including training methods, training data, and fine-tuning methods. Thirdly, we summarize the performance evaluations of six benchmark tasks and datasets. In addition, we provide eight advanced financial NLP tasks and datasets for developing more sophisticated FinLLMs. Finally, we discuss the opportunities and the challenges facing FinLLMs, such as hallucination, privacy, and efficiency. To support AI research in finance, we compile a collection of accessible datasets and evaluation benchmarks on GitHub.
Submitted 3 February, 2024;
originally announced February 2024.
-
EU COST Action on future generation optical wireless communication technologies -- NEWFOCUS CA19111, a white paper
Authors:
M A Khalighi,
Z Ghassemlooy,
S Zvanovec,
N Stevens,
L N Alves,
A Shrestha,
M Uysal,
A M Vegni,
P D Diamantoulakis,
V K Papanikolaou,
G K Karagiannidis,
B Ortega,
V Almenar,
O Bouchet,
L Ladid
Abstract:
The EU COST Action NEWFOCUS investigates radical solutions with the potential to impact the design of future wireless networks. It aims to address some of the challenges in optical wireless communication (OWC) and establish it as an efficient technology that can satisfy the demanding requirements of backhaul and access network levels in 5G networks. This also includes the use of hybrid links that associate OWC with radiofrequency or wired/fiber-based technologies. The focus of this White Paper is on the use of OWC as an enabling technology in a range of areas outlined in HE's Pillar II, including Health, Manufacturing, Intelligent Transportation Systems (ITS), Unmanned Aerial Vehicles, and Network and Protocol.
Submitted 18 July, 2022;
originally announced October 2022.
-
FedNLP: An interpretable NLP System to Decode Federal Reserve Communications
Authors:
Jean Lee,
Hoyoul Luis Youn,
Nicholas Stevens,
Josiah Poon,
Soyeon Caren Han
Abstract:
The Federal Reserve System (the Fed) plays a significant role in affecting monetary policy and financial conditions worldwide. Although it is important to analyse the Fed's communications to extract useful information, they are generally long-form and complex due to the ambiguous and esoteric nature of their content. In this paper, we present FedNLP, an interpretable multi-component Natural Language Processing system to decode Federal Reserve communications. This system is designed for end-users to explore how NLP techniques can assist their holistic understanding of the Fed's communications with no coding. Behind the scenes, FedNLP uses multiple NLP models, from traditional machine learning algorithms to deep neural network architectures, in each downstream task. The demonstration shows multiple results at once, including sentiment analysis, a summary of the document, a prediction of the Federal Funds Rate movement, and a visualization for interpreting the prediction model's result.
Submitted 11 June, 2021;
originally announced June 2021.
-
Monitoring dynamic networks: a simulation-based strategy for comparing monitoring methods and a comparative study
Authors:
Lisha Yu,
Inez M. Zwetsloot,
Nathaniel T. Stevens,
James D. Wilson,
Kwok Leung Tsui
Abstract:
Recently there has been a lot of interest in monitoring and identifying changes in dynamic networks, which has led to the development of a variety of monitoring methods. Unfortunately, these methods have not been systematically compared; moreover, new methods are often designed for a specialized use case. In light of this, we propose the use of simulation to compare the performance of network monitoring methods over a variety of dynamic network changes. Using our family of simulated dynamic networks, we compare the performance of several state-of-the-art social network monitoring methods in the literature. We compare their performance over a variety of types of change: increases in communication levels, changes in node propensity, and changes in community structure. We show that there does not exist one method that is uniformly superior to the others; the best method depends on the context and the type of change one wishes to detect. As such, we conclude that a variety of methods is needed for network monitoring and that it is important to understand in which scenarios a given method is appropriate.
Submitted 24 May, 2019;
originally announced May 2019.
-
Modeling and detecting change in temporal networks via a dynamic degree corrected stochastic block model
Authors:
James D. Wilson,
Nathaniel T. Stevens,
William H. Woodall
Abstract:
In many applications it is of interest to identify anomalous behavior within a dynamic interacting system. Such anomalous interactions are reflected by structural changes in the network representation of the system. We propose and investigate the use of a dynamic version of the degree corrected stochastic block model (DCSBM) to model and monitor dynamic networks that undergo a significant structural change. We apply statistical process monitoring techniques to the estimated parameters of the DCSBM to identify significant structural changes in the network. Application of our surveillance strategy to the dynamic U.S. Senate co-voting network reveals that we are able to detect significant changes in the network that reflect both times of cohesion and times of polarization among Republican and Democratic party members. These findings provide valuable insight about the evolution of the bipartisan political system in the United States. Our analysis demonstrates that the dynamic DCSBM monitoring procedure effectively detects local and global structural changes in dynamic networks. The DCSBM approach is an example of a more general framework that combines parametric random graph models and statistical process monitoring techniques for network surveillance.
Submitted 30 November, 2016; v1 submitted 13 May, 2016;
originally announced May 2016.