Families In Wild Multimedia (FIW-MM): A Multi-Modal Database for Recognizing Kinship

Robinson, Joseph P.; Khan, Zaid; Yin, Yu; Shao, Ming; Fu, Yun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2007.14509v1 (cs)

[Submitted on 28 Jul 2020 (this version), latest version 1 Oct 2021 (v6)]

Title:Families In Wild Multimedia (FIW-MM): A Multi-Modal Database for Recognizing Kinship

Authors:Joseph P. Robinson, Zaid Khan, Yu Yin, Ming Shao, Yun Fu

View PDF

Abstract:Recognizing kinship - a soft biometric with vast applications - in photos has piqued the interest of many machine vision researchers. The large-scale Families In the Wild (FIW) database promoted the problem by supporting annual kinship-based vision challenges that saw consistent performance improvements. We have now begun to approach performance levels for image-based systems acceptable for practical use - something unforeseeable a decade ago. However, biometric systems can benefit from multi-modal perspectives, as information contained in multimedia can add to and complement that of still images. Thus, we aim to narrow the gap from research-to-reality by extending FIW with multimedia data (i.e., video, audio, and contextual transcripts). Specifically, we introduce the first large-scale dataset for recognizing kinship in multimedia, the FIW in Multimedia (FIW-MM) database. We utilize automated machinery to collect, annotate, and prepare the data with minimal human input and no financial cost. This large-scale, multimedia corpus allows problem formulations to follow more realistic template-based protocols. We show significant improvements in benchmarks for multiple kin-based tasks when additional media-types are added. Experiments provide insights by highlighting edge cases to inspire future research and areas of improvement. Emphasis is put on short and long-term research directions, with the overarching intent to increase the potential of systems built to automatically detect kinship in multimedia. Furthermore, we expect a broader range of researchers with recognition tasks, generative modeling, speech understanding, and nature-based narratives.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.14509 [cs.CV]
	(or arXiv:2007.14509v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.14509

Submission history

From: Joseph Robinson [view email]
[v1] Tue, 28 Jul 2020 22:36:57 UTC (7,149 KB)
[v2] Thu, 1 Oct 2020 20:58:56 UTC (39,144 KB)
[v3] Wed, 24 Feb 2021 04:10:53 UTC (19,579 KB)
[v4] Fri, 16 Jul 2021 06:31:05 UTC (19,581 KB)
[v5] Tue, 3 Aug 2021 14:59:53 UTC (19,588 KB)
[v6] Fri, 1 Oct 2021 20:16:01 UTC (12,410 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Families In Wild Multimedia (FIW-MM): A Multi-Modal Database for Recognizing Kinship

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Families In Wild Multimedia (FIW-MM): A Multi-Modal Database for Recognizing Kinship

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators