Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow

Guo, Tian; Hauptmann, Emmanuel

arXiv:2407.18103 (q-fin)

[Submitted on 25 Jul 2024 (v1), last revised 5 Aug 2024 (this version, v2)]

Abstract: Large language models (LLMs) and their fine-tuning techniques have demonstrated superior performance in various language understanding and generation tasks. This paper explores fine-tuning LLMs for stock return forecasting with financial newsflow. In quantitative investing, return forecasting is fundamental for subsequent tasks such as stock picking and portfolio optimization. We formulate the model to include text representation and forecasting modules. We propose to compare encoder-only and decoder-only LLMs, considering that they generate text representations in distinct ways. The impact of these different representations on forecasting performance remains an open question. Meanwhile, we compare two simple methods of integrating LLMs' token-level representations into the forecasting module. The experiments on real news and investment universes reveal that: (1) aggregated representations from LLMs' token-level embeddings generally produce return predictions that enhance the performance of long-only and long-short portfolios; (2) in the relatively large investment universe, the decoder-LLM-based prediction model leads to stronger portfolios, whereas in the small universes, there are no consistent winners. Among the three LLMs studied (DeBERTa, Mistral, Llama), Mistral performs more robustly across different universes; (3) return predictions derived from LLMs' text representations are a strong signal for portfolio construction, outperforming conventional sentiment scores.
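As a rough illustration of the pipeline the abstract outlines (token-level embeddings aggregated into a text representation, a forecasting module on top, and a long-short portfolio built from the predictions), the following is a minimal sketch and not the authors' implementation. Mean pooling, the linear head, the equal-weight top-k/bottom-k construction, the toy embeddings, and all function names are assumptions made for illustration; the abstract does not specify any of them.

```python
def mean_pool(token_embeddings):
    """Average token-level vectors into one fixed-size text representation.

    Assumption: mean pooling stands in for the "aggregated representation";
    the abstract does not say which aggregation is used.
    """
    n = len(token_embeddings)
    dim = len(token_embeddings[0])
    return [sum(tok[d] for tok in token_embeddings) / n for d in range(dim)]


def predict_return(representation, head_weights, bias=0.0):
    """Hypothetical linear forecasting head applied to the pooled vector."""
    return sum(w * x for w, x in zip(head_weights, representation)) + bias


def long_short_weights(predictions, k):
    """Equal-weight long the top-k and short the bottom-k predicted returns.

    Assumes 2 * k <= number of stocks so the long and short legs do not overlap.
    """
    ranked = sorted(predictions, key=predictions.get, reverse=True)
    weights = {ticker: 0.0 for ticker in predictions}
    for ticker in ranked[:k]:
        weights[ticker] += 1.0 / k
    for ticker in ranked[-k:]:
        weights[ticker] -= 1.0 / k
    return weights


# Toy token embeddings per stock (made-up numbers, two dimensions each).
news_embeddings = {
    "AAA": [[1.0, 0.0], [3.0, 2.0]],
    "BBB": [[2.0, 0.0], [2.0, 0.0]],
    "CCC": [[0.0, 1.0], [0.0, 3.0]],
    "DDD": [[1.0, 1.0]],
}
head = [0.5, -1.0]  # hypothetical head weights; in practice these are learned

predictions = {
    ticker: predict_return(mean_pool(tokens), head)
    for ticker, tokens in news_embeddings.items()
}
portfolio = long_short_weights(predictions, k=1)
print(predictions)
print(portfolio)
```

In this toy setup the stock with the highest predicted return receives the full long weight and the one with the lowest receives the full short weight, so the portfolio is dollar-neutral by construction. A long-only variant would simply keep the positive leg.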

Subjects:	Computational Finance (q-fin.CP); Machine Learning (cs.LG); Portfolio Management (q-fin.PM)
Cite as:	arXiv:2407.18103 [q-fin.CP]
	(or arXiv:2407.18103v2 [q-fin.CP] for this version)
	https://doi.org/10.48550/arXiv.2407.18103

Submission history

From: Tian Guo
[v1] Thu, 25 Jul 2024 15:07:35 UTC (570 KB)
[v2] Mon, 5 Aug 2024 11:13:57 UTC (573 KB)
