FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets

Wang, Neng; Yang, Hongyang; Wang, Christina Dan

Computer Science > Computation and Language

arXiv:2310.04793 (cs)

[Submitted on 7 Oct 2023 (v1), last revised 11 Nov 2023 (this version, v2)]

Title:FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets

Authors:Neng Wang, Hongyang Yang, Christina Dan Wang

View PDF

Abstract:In the swiftly expanding domain of Natural Language Processing (NLP), the potential of GPT-based models for the financial sector is increasingly evident. However, the integration of these models with financial datasets presents challenges, notably in determining their adeptness and relevance. This paper introduces a distinctive approach anchored in the Instruction Tuning paradigm for open-source large language models, specifically adapted for financial contexts. Through this methodology, we capitalize on the interoperability of open-source models, ensuring a seamless and transparent integration. We begin by explaining the Instruction Tuning paradigm, highlighting its effectiveness for immediate integration. The paper presents a benchmarking scheme designed for end-to-end training and testing, employing a cost-effective progression. Firstly, we assess basic competencies and fundamental tasks, such as Named Entity Recognition (NER) and sentiment analysis to enhance specialization. Next, we delve into a comprehensive model, executing multi-task operations by amalgamating all instructional tunings to examine versatility. Finally, we explore the zero-shot capabilities by earmarking unseen tasks and incorporating novel datasets to understand adaptability in uncharted terrains. Such a paradigm fortifies the principles of openness and reproducibility, laying a robust foundation for future investigations in open-source financial large language models (FinLLMs).

Comments:	Workshop on Instruction Tuning and Instruction Following at NeurIPS 2023
Subjects:	Computation and Language (cs.CL); Trading and Market Microstructure (q-fin.TR)
Cite as:	arXiv:2310.04793 [cs.CL]
	(or arXiv:2310.04793v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.04793

Submission history

From: Hongyang Yang [view email]
[v1] Sat, 7 Oct 2023 12:52:58 UTC (82 KB)
[v2] Sat, 11 Nov 2023 06:51:24 UTC (122 KB)

Computer Science > Computation and Language

Title:FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators