PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation

Koshal, Devyani; Phukan, Orchid Chetia; Jain, Sarthak; Buduru, Arun Balaji; Sharma, Rajesh

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2406.06781 (eess)

[Submitted on 10 Jun 2024]

Title:PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation

Authors:Devyani Koshal, Orchid Chetia Phukan, Sarthak Jain, Arun Balaji Buduru, Rajesh Sharma

View PDF HTML (experimental)

Abstract:Emotion Recognition (ER), Gender Recognition (GR), and Age Estimation (AE) constitute paralinguistic tasks that rely not on the spoken content but primarily on speech characteristics such as pitch and tone. While previous research has made significant strides in developing models for each task individually, there has been comparatively less emphasis on concurrently learning these tasks, despite their inherent interconnectedness. As such in this demonstration, we present PERSONA, an application for predicting ER, GR, and AE with a single model in the backend. One notable point is we show that representations from speaker recognition pre-trained model (PTM) is better suited for such a multi-task learning format than the state-of-the-art (SOTA) self-supervised (SSL) PTM by carrying out a comparative study. Our methodology obviates the need for deploying separate models for each task and can potentially conserve resources and time during the training and deployment phases.

Comments:	Accepted to INTERSPEECH 2024 Show & Tell Demonstrations
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2406.06781 [eess.AS]
	(or arXiv:2406.06781v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2406.06781

Submission history

From: Orchid Chetia Phukan [view email]
[v1] Mon, 10 Jun 2024 20:38:48 UTC (6,583 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators