Video Summarisation with Incident and Context Information using Generative AI

De Silva, Ulindu; Fernando, Leon; Bandara, Kalinga; Nawaratne, Rashmika

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.04764 (cs)

[Submitted on 8 Jan 2025]

Title:Video Summarisation with Incident and Context Information using Generative AI

Authors:Ulindu De Silva, Leon Fernando, Kalinga Bandara, Rashmika Nawaratne

View PDF HTML (experimental)

Abstract:The proliferation of video content production has led to vast amounts of data, posing substantial challenges in terms of analysis efficiency and resource utilization. Addressing this issue calls for the development of robust video analysis tools. This paper proposes a novel approach leveraging Generative Artificial Intelligence (GenAI) to facilitate streamlined video analysis. Our tool aims to deliver tailored textual summaries of user-defined queries, offering a focused insight amidst extensive video datasets. Unlike conventional frameworks that offer generic summaries or limited action recognition, our method harnesses the power of GenAI to distil relevant information, enhancing analysis precision and efficiency. Employing YOLO-V8 for object detection and Gemini for comprehensive video and text analysis, our solution achieves heightened contextual accuracy. By combining YOLO with Gemini, our approach furnishes textual summaries extracted from extensive CCTV footage, enabling users to swiftly navigate and verify pertinent events without the need for exhaustive manual review. The quantitative evaluation revealed a similarity of 72.8%, while the qualitative assessment rated an accuracy of 85%, demonstrating the capability of the proposed method.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.04764 [cs.CV]
	(or arXiv:2501.04764v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.04764

Submission history

From: Ulindu De Silva [view email]
[v1] Wed, 8 Jan 2025 18:35:48 UTC (2,412 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Video Summarisation with Incident and Context Information using Generative AI

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Video Summarisation with Incident and Context Information using Generative AI

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators