Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

Zhang, Hao

Computer Science > Computation and Language

arXiv:2210.10289 (cs)

[Submitted on 19 Oct 2022 (v1), last revised 21 Oct 2022 (this version, v2)]

Title:Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

Authors:Hao Zhang

View PDF

Abstract:Pre-trained language models (LMs), such as BERT (Devlin et al., 2018) and its variants, have led to significant improvements on various NLP tasks in past years. However, a theoretical framework for studying their relationships is still missing. In this paper, we fill this gap by investigating the linear dependency between pre-trained LMs. The linear dependency of LMs is defined analogously to the linear dependency of vectors. We propose Language Model Decomposition (LMD) to represent a LM using a linear combination of other LMs as basis, and derive the closed-form solution. A goodness-of-fit metric for LMD similar to the coefficient of determination is defined and used to measure the linear dependency of a set of LMs. In experiments, we find that BERT and eleven (11) BERT-like LMs are 91% linearly dependent. This observation suggests that current state-of-the-art (SOTA) LMs are highly "correlated". To further advance SOTA we need more diverse and novel LMs that are less dependent on existing LMs.

Comments:	accepted by EMNLP 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
MSC classes:	68T50 (Primary) 68T30, 68T07 (Secondary)
ACM classes:	I.2.7
Cite as:	arXiv:2210.10289 [cs.CL]
	(or arXiv:2210.10289v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.10289

Submission history

From: Hao Zhang PhD [view email]
[v1] Wed, 19 Oct 2022 04:28:19 UTC (737 KB)
[v2] Fri, 21 Oct 2022 03:15:24 UTC (553 KB)

Computer Science > Computation and Language

Title:Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators