Hear Your Face: Face-based voice conversion with F0 estimation

Lee, Jaejun; Oh, Yoori; Hwang, Injune; Lee, Kyogu

Computer Science > Sound

arXiv:2408.09802 (cs)

[Submitted on 19 Aug 2024]

Title:Hear Your Face: Face-based voice conversion with F0 estimation

Authors:Jaejun Lee, Yoori Oh, Injune Hwang, Kyogu Lee

View PDF HTML (experimental)

Abstract:This paper delves into the emerging field of face-based voice conversion, leveraging the unique relationship between an individual's facial features and their vocal characteristics. We present a novel face-based voice conversion framework that particularly utilizes the average fundamental frequency of the target speaker, derived solely from their facial images. Through extensive analysis, our framework demonstrates superior speech generation quality and the ability to align facial features with voice characteristics, including tracking of the target speaker's fundamental frequency.

Comments:	Interspeech 2024
Subjects:	Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2408.09802 [cs.SD]
	(or arXiv:2408.09802v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2408.09802

Submission history

From: Jaejun Lee [view email]
[v1] Mon, 19 Aug 2024 08:47:03 UTC (374 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SD

< prev | next >

new | recent | 2024-08

Change to browse by:

cs
cs.CV
eess
eess.AS

References & Citations

export BibTeX citation

Computer Science > Sound

Title:Hear Your Face: Face-based voice conversion with F0 estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Hear Your Face: Face-based voice conversion with F0 estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators