Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing

Zhang, Hao; Stahlberg, Felix; Kumar, Shankar

Computer Science > Computation and Language

arXiv:2501.13831 (cs)

[Submitted on 23 Jan 2025]

Title:Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing

Authors:Hao Zhang, Felix Stahlberg, Shankar Kumar

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) excel at rewriting tasks such as text style transfer and grammatical error correction. While there is considerable overlap between the inputs and outputs in these tasks, the decoding cost still increases with output length, regardless of the amount of overlap. By leveraging the overlap between the input and the output, Kaneko and Okazaki (2023) proposed model-agnostic edit span representations to compress the rewrites to save computation. They reported an output length reduction rate of nearly 80% with minimal accuracy impact in four rewriting tasks. In this paper, we propose alternative edit phrase representations inspired by phrase-based statistical machine translation. We systematically compare our phrasal representations with their span representations. We apply the LLM rewriting model to the task of Automatic Speech Recognition (ASR) post editing and show that our target-phrase-only edit representation has the best efficiency-accuracy trade-off. On the LibriSpeech test set, our method closes 50-60% of the WER gap between the edit span model and the full rewrite model while losing only 10-20% of the length reduction rate of the edit span model.

Comments:	accepted by ICASSP 2025
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2501.13831 [cs.CL]
	(or arXiv:2501.13831v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.13831

Submission history

From: Hao Zhang [view email]
[v1] Thu, 23 Jan 2025 16:54:27 UTC (323 KB)

Computer Science > Computation and Language

Title:Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators