Boosting Multitask Learning on Graphs through Higher-Order Task Affinities

Li, Dongyue; Ju, Haotian; Sharma, Aneesh; Zhang, Hongyang R.

doi:10.1145/3580305.3599265

Computer Science > Machine Learning

arXiv:2306.14009v3 (cs)

[Submitted on 24 Jun 2023 (v1), revised 12 Feb 2024 (this version, v3), latest version 14 Mar 2024 (v4)]

Title:Boosting Multitask Learning on Graphs through Higher-Order Task Affinities

Authors:Dongyue Li, Haotian Ju, Aneesh Sharma, Hongyang R. Zhang

View PDF HTML (experimental)

Abstract:Predicting node labels on a given graph is a widely studied problem with many applications, including community detection and molecular graph prediction. This paper considers predicting multiple node labeling functions on graphs simultaneously and revisits this problem from a multitask learning perspective. For a concrete example, consider overlapping community detection: each community membership is a binary node classification task. Due to complex overlapping patterns, we find that negative transfer is prevalent when we apply naive multitask learning to multiple community detection, as task relationships are highly nonlinear across different node labeling. To address the challenge, we develop an algorithm to cluster tasks into groups based on a higher-order task affinity measure. We then fit a multitask model on each task group, resulting in a boosting procedure on top of the baseline model. We estimate the higher-order task affinity measure between two tasks as the prediction loss of one task in the presence of another task and a random subset of other tasks. Then, we use spectral clustering on the affinity score matrix to identify task grouping. We design several speedup techniques to compute the higher-order affinity scores efficiently and show that they can predict negative transfers more accurately than pairwise task affinities. We validate our procedure using various community detection and molecular graph prediction data sets, showing favorable results compared with existing methods. Lastly, we provide a theoretical analysis to show that under a planted block model of tasks on graphs, our affinity scores can provably separate tasks into groups.

Comments:	15 pages. Appeared in KDD 2023
Subjects:	Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Cite as:	arXiv:2306.14009 [cs.LG]
	(or arXiv:2306.14009v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.14009
Related DOI:	https://doi.org/10.1145/3580305.3599265

Submission history

From: Dongyue Li [view email]
[v1] Sat, 24 Jun 2023 15:53:38 UTC (1,638 KB)
[v2] Sat, 26 Aug 2023 16:33:19 UTC (1,638 KB)
[v3] Mon, 12 Feb 2024 16:51:41 UTC (1,638 KB)
[v4] Thu, 14 Mar 2024 22:54:18 UTC (1,638 KB)

Computer Science > Machine Learning

Title:Boosting Multitask Learning on Graphs through Higher-Order Task Affinities

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Boosting Multitask Learning on Graphs through Higher-Order Task Affinities

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators