Supervector Compression Strategies to Speed up I-Vector System Development

Vestman, Ville; Kinnunen, Tomi

arXiv:1805.01156 (eess)

[Submitted on 3 May 2018]

Abstract: Front-end factor analysis (FEFA), an extension of probabilistic principal component analysis (PPCA) tailored to be used with Gaussian mixture models (GMMs), is currently the prevalent approach for extracting compact utterance-level features (i-vectors) for automatic speaker verification (ASV) systems. Little research has been conducted comparing FEFA to conventional PPCA applied to maximum a posteriori (MAP) adapted GMM supervectors. We study several alternative methods for compressing MAP-adapted GMM supervectors, including PPCA, factor analysis (FA), and two supervised approaches: supervised PPCA (SPPCA) and the recently proposed probabilistic partial least squares (PPLS). The resulting i-vectors are used in ASV tasks with a probabilistic linear discriminant analysis (PLDA) back-end. We experiment on two datasets: the telephone condition of NIST SRE 2010 and the recent VoxCeleb corpus, collected from YouTube videos containing celebrity interviews recorded in various acoustical and technical conditions. The results suggest that, in terms of ASV accuracy, the supervector compression approaches are on a par with FEFA. The supervised approaches did not improve performance. Compared with FEFA, we obtained more than a hundred-fold (100x) speedup in total variability model (TVM) training using the PPCA and FA supervector compression approaches.
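
To make the idea of compressing supervectors with PPCA concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation: the abstract does not say how the models were trained, so this sketch assumes the standard closed-form maximum-likelihood PPCA solution (Tipping and Bishop). The function names `fit_ppca` and `extract_ivectors`, the toy dimensions, and the random data standing in for MAP-adapted GMM supervectors are all hypothetical.

```python
import numpy as np


def fit_ppca(X, q):
    """Fit PPCA in closed form (assumed here; not necessarily the paper's training method).

    X: (n, d) matrix with one supervector per row.
    q: latent (i-vector) dimension, must satisfy q < d.
    """
    n, d = X.shape
    mu = X.mean(axis=0)
    Xc = X - mu
    # Thin SVD avoids forming the d x d covariance matrix, which is
    # impractical when d is a full supervector dimension.
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    eigvals = s ** 2 / n  # nonzero eigenvalues of the sample covariance
    # Noise variance: average of the d - q discarded eigenvalues
    # (eigenvalues beyond the rank of Xc are zero and add nothing to the sum).
    sigma2 = eigvals[q:].sum() / (d - q)
    # Loading matrix: top-q principal directions scaled by sqrt(lambda - sigma2).
    W = Vt[:q].T * np.sqrt(np.maximum(eigvals[:q] - sigma2, 0.0))
    return mu, W, sigma2


def extract_ivectors(X, mu, W, sigma2):
    """Posterior mean of the latent variable for each row of X."""
    q = W.shape[1]
    M = W.T @ W + sigma2 * np.eye(q)
    return np.linalg.solve(M, W.T @ (X - mu).T).T


# Toy stand-in for MAP-adapted supervectors: 50 utterances, dimension 200.
rng = np.random.default_rng(0)
supervectors = rng.standard_normal((50, 200))
mu, W, sigma2 = fit_ppca(supervectors, q=10)
ivectors = extract_ivectors(supervectors, mu, W, sigma2)
```

In this sketch, training is a single SVD of the centered supervector matrix, with no iteration over frame-level statistics. The resulting `ivectors` array would then be passed to a back-end such as PLDA, which is outside the scope of this example.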

Comments:	To appear in Speaker Odyssey 2018: The Speaker and Language Recognition Workshop
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
Cite as:	arXiv:1805.01156 [eess.AS]
	(or arXiv:1805.01156v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.1805.01156

Submission history

From: Ville Vestman
[v1] Thu, 3 May 2018 08:12:39 UTC (63 KB)
