"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

Yu, Jiawei; Geng, Xiang; Li, Yuang; Ren, Mengxin; Tang, Wei; Li, Jiahuan; Lan, Zhibin; Zhang, Min; Yang, Hao; Huang, Shujian; Su, Jinsong

Computer Science > Computation and Language

arXiv:2412.19102 (cs)

[Submitted on 26 Dec 2024]

Title:"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

Authors:Jiawei Yu, Xiang Geng, Yuang Li, Mengxin Ren, Wei Tang, Jiahuan Li, Zhibin Lan, Min Zhang, Hao Yang, Shujian Huang, Jinsong Su

View PDF HTML (experimental)

Abstract:Spoken named entity recognition (NER) aims to identify named entities from speech, playing an important role in speech processing. New named entities appear every day, however, annotating their Spoken NER data is costly. In this paper, we demonstrate that existing Spoken NER systems perform poorly when dealing with previously unseen named entities. To tackle this challenge, we propose a method for generating Spoken NER data based on a named entity dictionary (NED) to reduce costs. Specifically, we first use a large language model (LLM) to generate sentences from the sampled named entities and then use a text-to-speech (TTS) system to generate the speech. Furthermore, we introduce a noise metric to filter out noisy data. To evaluate our approach, we release a novel Spoken NER benchmark along with a corresponding NED containing 8,853 entities. Experiment results show that our method achieves state-of-the-art (SOTA) performance in the in-domain, zero-shot domain adaptation, and fully zero-shot settings. Our data will be available at this https URL.

Comments:	Accepted by ICASSP 2025
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.19102 [cs.CL]
	(or arXiv:2412.19102v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.19102

Submission history

From: Jiawei Yu [view email]
[v1] Thu, 26 Dec 2024 07:43:18 UTC (1,106 KB)

Computer Science > Computation and Language

Title:"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators