Automatic Speech Recognition for Sanskrit with Transfer Learning

Sadhukhan, Bidit; Punyeshwarananda, Swami

doi:10.1109/C3IT60531.2024.10829416

Computer Science > Computation and Language

arXiv:2501.10024 (cs)

[Submitted on 17 Jan 2025]

Title:Automatic Speech Recognition for Sanskrit with Transfer Learning

Authors:Bidit Sadhukhan, Swami Punyeshwarananda

View PDF HTML (experimental)

Abstract:Sanskrit, one of humanity's most ancient languages, has a vast collection of books and manuscripts on diverse topics that have been accumulated over millennia. However, its digital content (audio and text), which is vital for the training of AI systems, is profoundly limited. Furthermore, its intricate linguistics make it hard to develop robust NLP tools for wider accessibility. Given these constraints, we have developed an automatic speech recognition model for Sanskrit by employing transfer learning mechanism on OpenAI's Whisper model. After carefully optimising the hyper-parameters, we obtained promising results with our transfer-learned model achieving a word error rate of 15.42% on Vaksancayah dataset. An online demo of our model is made available for the use of public and to evaluate its performance firsthand thereby paving the way for improved accessibility and technological support for Sanskrit learning in the modern era.

Comments:	Paper has been accepted at the 4th International Conference on Computer, Communication, Control & Information Technology (C3IT), Hooghly, India, 2024, pp. 1-5
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.10024 [cs.CL]
	(or arXiv:2501.10024v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.10024
Journal reference:	4th International Conference on Computer, Communication, Control & Information Technology (C3IT), Hooghly, India, 2024, pp. 1-5
Related DOI:	https://doi.org/10.1109/C3IT60531.2024.10829416

Submission history

From: Punyeshwarananda Swami [view email]
[v1] Fri, 17 Jan 2025 08:20:32 UTC (256 KB)

Computer Science > Computation and Language

Title:Automatic Speech Recognition for Sanskrit with Transfer Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Automatic Speech Recognition for Sanskrit with Transfer Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators