Exploration of VLMs for Driver Monitoring Systems Applications

Cañas, Paola Natalia; Nieto, Marcos; Otaegui, Oihana; Rodríguez, Igor

arXiv:2503.12281 (cs)

[Submitted on 15 Mar 2025]

Abstract: In recent years, we have witnessed significant progress in emerging deep learning models, particularly Large Language Models (LLMs) and Vision-Language Models (VLMs). These models have demonstrated promising results, indicating a new era of Artificial Intelligence (AI) that surpasses previous methodologies. Their extensive knowledge and zero-shot capabilities suggest a paradigm shift in developing deep learning solutions, moving from data capturing and algorithm training to just writing appropriate prompts. While the application of these technologies has been explored across various industries, including automotive, there is a notable gap in the scientific literature regarding their use in Driver Monitoring Systems (DMS). This paper presents our initial approach to implementing VLMs in this domain, utilising the Driver Monitoring Dataset to evaluate their performance and discussing their advantages and challenges when implemented in real-world scenarios.

Comments:	Accepted in 16th ITS European Congress, Seville, Spain, 19-21 May 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2503.12281 [cs.CV]
	(or arXiv:2503.12281v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.12281

Submission history

From: Paola Natalia Cañas Rodriguez
[v1] Sat, 15 Mar 2025 22:37:36 UTC (13,136 KB)
