Comparison and Combination of Sentence Embeddings Derived from Different Supervision Signals

Tsukagoshi, Hayato; Sasano, Ryohei; Takeda, Koichi

Computer Science > Computation and Language

arXiv:2202.02990 (cs)

[Submitted on 7 Feb 2022 (v1), last revised 10 Jun 2022 (this version, v2)]

Title:Comparison and Combination of Sentence Embeddings Derived from Different Supervision Signals

Authors:Hayato Tsukagoshi, Ryohei Sasano, Koichi Takeda

View PDF

Abstract:There have been many successful applications of sentence embedding methods. However, it has not been well understood what properties are captured in the resulting sentence embeddings depending on the supervision signals. In this paper, we focus on two types of sentence embedding methods with similar architectures and tasks: one fine-tunes pre-trained language models on the natural language inference task, and the other fine-tunes pre-trained language models on word prediction task from its definition sentence, and investigate their properties. Specifically, we compare their performances on semantic textual similarity (STS) tasks using STS datasets partitioned from two perspectives: 1) sentence source and 2) superficial similarity of the sentence pairs, and compare their performances on the downstream and probing tasks. Furthermore, we attempt to combine the two methods and demonstrate that combining the two methods yields substantially better performance than the respective methods on unsupervised STS tasks and downstream tasks.

Comments:	Accepted at *SEM 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2202.02990 [cs.CL]
	(or arXiv:2202.02990v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2202.02990

Submission history

From: Hayato Tsukagoshi [view email]
[v1] Mon, 7 Feb 2022 08:15:48 UTC (713 KB)
[v2] Fri, 10 Jun 2022 08:03:48 UTC (717 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-02

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ryohei Sasano
Koichi Takeda

export BibTeX citation

Computer Science > Computation and Language

Title:Comparison and Combination of Sentence Embeddings Derived from Different Supervision Signals

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Comparison and Combination of Sentence Embeddings Derived from Different Supervision Signals

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators