Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think

Raissi, Tina; Lüscher, Christoph; Gunz, Moritz; Schlüter, Ralf; Ney, Hermann

Computer Science > Sound

arXiv:2306.09517 (cs)

[Submitted on 15 Jun 2023]

Title:Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think

Authors:Tina Raissi, Christoph Lüscher, Moritz Gunz, Ralf Schlüter, Hermann Ney

View PDF

Abstract:Building competitive hybrid hidden Markov model~(HMM) systems for automatic speech recognition~(ASR) requires a complex multi-stage pipeline consisting of several training criteria. The recent sequence-to-sequence models offer the advantage of having simpler pipelines that can start from-scratch. We propose a purely neural based single-stage from-scratch pipeline for a context-dependent hybrid HMM that offers similar simplicity. We use an alignment from a full-sum trained zero-order posterior HMM with a BLSTM encoder. We show that with this alignment we can build a Conformer factored hybrid that performs even better than both a state-of-the-art classic hybrid and a factored hybrid trained with alignments taken from more complex Gaussian mixture based systems. Our finding is confirmed on Switchboard 300h and LibriSpeech 960h tasks with comparable results to other approaches in the literature, and by additionally relying on a responsible choice of available computational resources.

Comments:	Accepted for presentation at InterSpeech 2023
Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2306.09517 [cs.SD]
	(or arXiv:2306.09517v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2306.09517

Submission history

From: Tina Raissi [view email]
[v1] Thu, 15 Jun 2023 21:34:31 UTC (33 KB)

Computer Science > Sound

Title:Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators