Parameter-Efficient Fine-Tuning of Pre-trained Large Language Models for Financial Text Analysis
Date
2024
Authors
Langa, Kelly Kiba
Publisher
University of the Witwatersrand, Johannesburg
Abstract
Recent advancements in natural language processing (NLP) have been largely fueled by the emergence of large language models (LLMs), which excel at capturing the complex semantic and syntactic structures of natural language. These models have revolutionized NLP tasks through transfer learning, in which pre-trained LLMs are fine-tuned on domain-specific datasets. Financial sentiment analysis poses unique challenges due to the intricate nature of financial language, often requiring more sophisticated approaches than traditional sentiment analysis methods offer. Fine-tuning LLMs holds potential for improving modeling performance in the financial domain, but the computational expense of standard full fine-tuning poses a challenge. This study investigates the efficacy of Parameter-Efficient Fine-Tuning (PEFT) methods for adapting LLMs to specific tasks, with a focus on sentiment analysis in the financial domain. Extensive analysis of PEFT methods, including Low-Rank Adaptation (LoRA), prompt tuning, prefix tuning, and adapters, yielded several critical insights. The results demonstrate that PEFT methods can match or surpass the performance of full fine-tuning. In particular, adapting the Open Pre-trained Transformers (OPT) model with LoRA achieved the highest modeling performance, with an accuracy of 89%, while training only 0.19% of the model's total parameters. This highlights the high modularity of PEFT methods, which require minimal storage for their trainable parameters, ranging from 0.1 MB to 7 MB for the OPT model. Despite slower convergence than full fine-tuning, PEFT methods substantially reduced Graphics Processing Unit (GPU) memory consumption, with savings of up to 80%. Small-scale fine-tuned LLMs outperformed large-scale general-purpose LLMs such as ChatGPT, emphasizing the importance of domain-specific fine-tuning. Fine-tuning only the model head fell short of PEFT methods, suggesting additional benefit from training more layers. Compared to a state-of-the-art non-LLM deep learning baseline, the Long Short-Term Memory (LSTM) network, LLMs demonstrated superior performance with a 17% increase in accuracy, thereby justifying their higher implementation costs.
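To make the LoRA adaptation concrete: LoRA freezes the pre-trained weights W0 and learns a low-rank update BA, so only the small matrices B and A are trained. The sketch below shows how such a setup is commonly assembled with the Hugging Face `transformers` and `peft` libraries for an OPT sentiment classifier. The checkpoint name, rank, scaling factor, and target modules are illustrative assumptions, not the dissertation's exact configuration.

```python
# Minimal LoRA fine-tuning setup for financial sentiment analysis with OPT.
# Hyperparameters below are assumed for illustration, not taken from the study.
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from peft import LoraConfig, TaskType, get_peft_model

base = "facebook/opt-350m"  # assumed checkpoint; the study's OPT size may differ
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForSequenceClassification.from_pretrained(
    base, num_labels=3  # e.g. negative / neutral / positive financial sentiment
)

# LoRA freezes the pre-trained weights and injects trainable low-rank matrices
# into the attention projections, so only a small fraction of parameters train.
lora_config = LoraConfig(
    task_type=TaskType.SEQ_CLS,
    r=8,                                  # low-rank dimension (assumed)
    lora_alpha=16,                        # scaling factor (assumed)
    lora_dropout=0.1,
    target_modules=["q_proj", "v_proj"],  # OPT attention projections
)
model = get_peft_model(model, lora_config)

# Reports trainable vs. total parameter counts, the kind of ratio behind the
# "0.19% of total parameters" figure quoted in the abstract.
model.print_trainable_parameters()
```

Because only the LoRA matrices (and the classification head) receive gradients, the resulting adapter can be saved separately from the frozen base model, which is what makes the small per-task storage footprint reported above possible.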
Description
A research report submitted in partial fulfillment of the requirements for the degree of Master of Science in the field of e-Science, to the Faculty of Science, School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2024
Citation
Langa, Kelly Kiba. (2024). Parameter-Efficient Fine-Tuning of Pre-trained Large Language Models for Financial Text Analysis. [Master's dissertation, University of the Witwatersrand, Johannesburg]. WIReDSpace.
https://hdl.handle.net/10539/47000