Logically Consistent Loss for Visual Question Answering

Le-Ngo, Anh-Cat; Tran, Truyen; Rana, Santu; Gupta, Sunil; Venkatesh, Svetha

Computer Science > Computer Vision and Pattern Recognition

arXiv:2011.10094 (cs)

[Submitted on 19 Nov 2020]

Title:Logically Consistent Loss for Visual Question Answering

Authors:Anh-Cat Le-Ngo, Truyen Tran, Santu Rana, Sunil Gupta, Svetha Venkatesh

View PDF

Abstract:Given an image, a back-ground knowledge, and a set of questions about an object, human learners answer the questions very consistently regardless of question forms and semantic tasks. The current advancement in neural-network based Visual Question Answering (VQA), despite their impressive performance, cannot ensure such consistency due to identically distribution (i.i.d.) assumption. We propose a new model-agnostic logic constraint to tackle this issue by formulating a logically consistent loss in the multi-task learning framework as well as a data organisation called family-batch and hybrid-batch. To demonstrate usefulness of this proposal, we train and evaluate MAC-net based VQA machines with and without the proposed logically consistent loss and the proposed data organization. The experiments confirm that the proposed loss formulae and introduction of hybrid-batch leads to more consistency as well as better performance. Though the proposed approach is tested with MAC-net, it can be utilised in any other QA methods whenever the logical consistency between answers exist.

Comments:	10 pages, 6 figure, 9 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2011.10094 [cs.CV]
	(or arXiv:2011.10094v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.10094

Submission history

From: Anh Cat Le Ngo [view email]
[v1] Thu, 19 Nov 2020 20:31:05 UTC (479 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2020-11

Change to browse by:

cs
cs.CV

References & Citations

DBLP - CS Bibliography

listing | bibtex

Truyen Tran
Santu Rana
Sunil Gupta
Svetha Venkatesh

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Logically Consistent Loss for Visual Question Answering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Logically Consistent Loss for Visual Question Answering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators