FinAudio: A Benchmark for Audio Large Language Models in Financial Applications

Cao, Yupeng; Li, Haohang; Yu, Yangyang; Javaji, Shashidhar Reddy; He, Yueru; Huang, Jimin; Zhu, Zining; Xie, Qianqian; Liu, Xiao-yang; Subbalakshmi, Koduvayur; Qiu, Meikang; Ananiadou, Sophia; Nie, Jian-Yun

Computer Science > Computational Engineering, Finance, and Science

arXiv:2503.20990 (cs)

[Submitted on 26 Mar 2025]

Title:FinAudio: A Benchmark for Audio Large Language Models in Financial Applications

Authors:Yupeng Cao, Haohang Li, Yangyang Yu, Shashidhar Reddy Javaji, Yueru He, Jimin Huang, Zining Zhu, Qianqian Xie, Xiao-yang Liu, Koduvayur Subbalakshmi, Meikang Qiu, Sophia Ananiadou, Jian-Yun Nie

View PDF HTML (experimental)

Abstract:Audio Large Language Models (AudioLLMs) have received widespread attention and have significantly improved performance on audio tasks such as conversation, audio understanding, and automatic speech recognition (ASR). Despite these advancements, there is an absence of a benchmark for assessing AudioLLMs in financial scenarios, where audio data, such as earnings conference calls and CEO speeches, are crucial resources for financial analysis and investment decisions. In this paper, we introduce \textsc{FinAudio}, the first benchmark designed to evaluate the capacity of AudioLLMs in the financial domain. We first define three tasks based on the unique characteristics of the financial domain: 1) ASR for short financial audio, 2) ASR for long financial audio, and 3) summarization of long financial audio. Then, we curate two short and two long audio datasets, respectively, and develop a novel dataset for financial audio summarization, comprising the \textsc{FinAudio} benchmark. Then, we evaluate seven prevalent AudioLLMs on \textsc{FinAudio}. Our evaluation reveals the limitations of existing AudioLLMs in the financial domain and offers insights for improving AudioLLMs. All datasets and codes will be released.

Subjects:	Computational Engineering, Finance, and Science (cs.CE); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
Cite as:	arXiv:2503.20990 [cs.CE]
	(or arXiv:2503.20990v1 [cs.CE] for this version)
	https://doi.org/10.48550/arXiv.2503.20990

Submission history

From: Yupeng Cao [view email]
[v1] Wed, 26 Mar 2025 21:07:51 UTC (2,145 KB)

Computer Science > Computational Engineering, Finance, and Science

Title:FinAudio: A Benchmark for Audio Large Language Models in Financial Applications

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computational Engineering, Finance, and Science

Title:FinAudio: A Benchmark for Audio Large Language Models in Financial Applications

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators