InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning

Bejnordi, Babak Ehteshami; Kumar, Gaurav; Royer, Amelie; Louizos, Christos; Blankevoort, Tijmen; Ghafoorian, Mohsen

Computer Science > Machine Learning

arXiv:2402.16848 (cs)

[Submitted on 26 Feb 2024]

Title:InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning

Authors:Babak Ehteshami Bejnordi, Gaurav Kumar, Amelie Royer, Christos Louizos, Tijmen Blankevoort, Mohsen Ghafoorian

View PDF HTML (experimental)

Abstract:Jointly learning multiple tasks with a unified model can improve accuracy and data efficiency, but it faces the challenge of task interference, where optimizing one task objective may inadvertently compromise the performance of another. A solution to mitigate this issue is to allocate task-specific parameters, free from interference, on top of shared features. However, manually designing such architectures is cumbersome, as practitioners need to balance between the overall performance across all tasks and the higher computational cost induced by the newly added parameters. In this work, we propose \textit{InterroGate}, a novel multi-task learning (MTL) architecture designed to mitigate task interference while optimizing inference computational efficiency. We employ a learnable gating mechanism to automatically balance the shared and task-specific representations while preserving the performance of all tasks. Crucially, the patterns of parameter sharing and specialization dynamically learned during training, become fixed at inference, resulting in a static, optimized MTL architecture. Through extensive empirical evaluations, we demonstrate SoTA results on three MTL benchmarks using convolutional as well as transformer-based backbones on CelebA, NYUD-v2, and PASCAL-Context.

Comments:	Under review
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2402.16848 [cs.LG]
	(or arXiv:2402.16848v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.16848

Submission history

From: Babak Ehteshami Bejnordi [view email]
[v1] Mon, 26 Feb 2024 18:59:52 UTC (1,731 KB)

Computer Science > Machine Learning

Title:InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators