Cross-task Attention Mechanism for Dense Multi-task Learning

Lopes, Ivan; Vu, Tuan-Hung; de Charette, Raoul

Computer Science > Computer Vision and Pattern Recognition

arXiv:2206.08927v1 (cs)

[Submitted on 17 Jun 2022 (this version), latest version 8 Oct 2024 (v2)]

Title:Cross-task Attention Mechanism for Dense Multi-task Learning

Authors:Ivan Lopes, Tuan-Hung Vu, Raoul de Charette

View PDF

Abstract:Multi-task learning has recently become a promising solution for a comprehensive understanding of complex scenes. Not only being memory-efficient, multi-task models with an appropriate design can favor exchange of complementary signals across tasks. In this work, we jointly address 2D semantic segmentation, and two geometry-related tasks, namely dense depth, surface normal estimation as well as edge estimation showing their benefit on indoor and outdoor datasets. We propose a novel multi-task learning architecture that exploits pair-wise cross-task exchange through correlation-guided attention and self-attention to enhance the average representation learning for all tasks. We conduct extensive experiments considering three multi-task setups, showing the benefit of our proposal in comparison to competitive baselines in both synthetic and real benchmarks. We also extend our method to the novel multi-task unsupervised domain adaptation setting. Our code is available at this https URL.

Comments:	10 figures, 6 tables, 23 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2206.08927 [cs.CV]
	(or arXiv:2206.08927v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.08927

Submission history

From: Ivan Lopes [view email]
[v1] Fri, 17 Jun 2022 17:59:45 UTC (41,575 KB)
[v2] Tue, 8 Oct 2024 09:55:21 UTC (37,861 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-task Attention Mechanism for Dense Multi-task Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Cross-task Attention Mechanism for Dense Multi-task Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators