Named Entity Detection and Injection for Direct Speech Translation

Gaido, Marco; Tang, Yun; Kulikov, Ilia; Huang, Rongqing; Gong, Hongyu; Inaguma, Hirofumi

arXiv:2210.11981 (cs)

[Submitted on 21 Oct 2022 (v1), last revised 11 Mar 2023 (this version, v2)]

Abstract: In a sentence, certain words are critical to its meaning. Among them, named entities (NEs) are notoriously challenging for neural models. Despite their importance, their accurate handling has been neglected in speech-to-text (S2T) translation research, and recent work has shown that S2T models perform poorly on locations and, notably, person names, whose spelling is challenging unless known in advance. In this work, we explore how to leverage dictionaries of NEs known to likely appear in a given context to improve S2T model outputs. Our experiments show that we can reliably detect NEs likely present in an utterance starting from S2T encoder outputs. Indeed, we demonstrate that the current detection quality is sufficient to improve NE accuracy in the translation, with a 31% reduction in person name errors.

Comments:	© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.11981 [cs.CL]
	(or arXiv:2210.11981v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.11981

Submission history

From: Marco Gaido
[v1] Fri, 21 Oct 2022 14:16:51 UTC (1,994 KB)
[v2] Sat, 11 Mar 2023 11:31:12 UTC (1,994 KB)