Fine-tuning Smaller Language Models for Question Answering over Financial Documents

Phogat, Karmvir Singh; Puranam, Sai Akhil; Dasaratha, Sridhar; Harsha, Chetan; Ramakrishna, Shashishekar

arXiv:2408.12337 (cs)

[Submitted on 22 Aug 2024]

Abstract: Recent research has shown that smaller language models can acquire substantial reasoning abilities when fine-tuned with reasoning exemplars crafted by a significantly larger teacher model. We explore this paradigm for the financial domain, focusing on the challenge of answering questions that require multi-hop numerical reasoning over financial texts. We assess the performance of several smaller models that have been fine-tuned to generate programs that encode the required financial reasoning and calculations. Our findings demonstrate that these fine-tuned smaller models approach the performance of the teacher model.
To provide a granular analysis of model performance, we propose an approach to investigate the specific student model capabilities that are enhanced by fine-tuning. Our empirical analysis indicates that fine-tuning refines the student models' ability to express and apply the required financial concepts, along with adapting the entity extraction to the specific data format. In addition, we hypothesize and demonstrate that comparable financial reasoning capability can be induced using relatively smaller datasets.
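
For illustration only, the kind of program a fine-tuned student model might generate can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' prompt or output format: the question, the figures, and the names `net_revenue_2019`, `net_revenue_2020`, and `percentage_change` are all hypothetical. It shows the two capabilities the abstract names: extracting entities from the document, then expressing and applying a financial concept over them.

```python
# Hypothetical question (not from the paper):
# "What was the percentage change in net revenue from 2019 to 2020?"

# Step 1: entity extraction. In practice these values would be read from the
# financial text or table; the numbers here are invented for illustration.
net_revenue_2019 = 1200.0  # in millions (made-up value)
net_revenue_2020 = 1350.0  # in millions (made-up value)


def percentage_change(old: float, new: float) -> float:
    """Apply the financial concept: relative change expressed in percent."""
    return (new - old) / old * 100.0


# Step 2: multi-hop calculation (difference first, then ratio to the base year).
ans = percentage_change(net_revenue_2019, net_revenue_2020)
print(ans)  # 12.5
```

Executing such a program, rather than asking the model for the final number directly, leaves the arithmetic to the interpreter, so the model only has to get the extraction and the formula right.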

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2408.12337 [cs.CL]
	(or arXiv:2408.12337v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.12337

Submission history

From: Karmvir Singh Phogat
[v1] Thu, 22 Aug 2024 12:23:29 UTC (78 KB)
