Temporal Attention for Language Models

Rosin, Guy D.; Radinsky, Kira

arXiv:2202.02093 (cs)

[Submitted on 4 Feb 2022 (v1), last revised 3 May 2022 (this version, v2)]

Abstract: Pretrained language models based on the transformer architecture have shown great success in NLP. Textual training data often comes from the web and is thus tagged with time-specific information, but most language models ignore this information. They are trained on the textual data alone, which limits their ability to generalize temporally. In this work, we extend the key component of the transformer architecture, i.e., the self-attention mechanism, and propose temporal attention, a time-aware self-attention mechanism. Temporal attention can be applied to any transformer model and requires the input texts to be accompanied by their relevant time points. It allows the transformer to capture this temporal information and create time-specific contextualized word representations. We leverage these representations for the task of semantic change detection: we apply our proposed mechanism to BERT and experiment on three datasets in different languages (English, German, and Latin) that also vary in time, size, and genre. Our proposed model achieves state-of-the-art results on all three datasets.
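
To make the idea of a time-aware self-attention mechanism concrete, here is a minimal sketch in plain Python. The abstract does not give the formula, so everything below is an assumption for illustration: the function name `temporal_attention`, the per-token time-embedding matrix `t`, and the choice to insert a normalized time matrix (T^T T / ||T||) between the queries and keys before the softmax. It should be read as one possible way to condition attention scores on time, not as the authors' confirmed method.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_attention(q, k, v, t):
    """Hypothetical time-aware attention over n tokens of dimension d.

    q, k, v: n x d query, key, and value matrices.
    t: n x d matrix of time embeddings, one row per token (assumed input).
    Returns (output, attention_weights).
    """
    d_k = len(q[0])
    # Assumption: a d x d time matrix, T^T T scaled by the Frobenius norm of T.
    norm = math.sqrt(sum(x * x for row in t for x in row))
    time_matrix = [[x / norm for x in row] for row in matmul(transpose(t), t)]
    # Scores are Q * time_matrix * K^T, scaled by sqrt(d_k) as in standard attention.
    scores = matmul(matmul(q, time_matrix), transpose(k))
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights


# Toy inputs: two tokens, two dimensions; the same text at two different time points.
q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
t_a = [[1.0, 0.0], [1.0, 0.0]]  # made-up embedding for time point A
t_b = [[0.0, 1.0], [0.0, 1.0]]  # made-up embedding for time point B

out_a, w_a = temporal_attention(q, k, v, t_a)
out_b, w_b = temporal_attention(q, k, v, t_b)
print(out_a)
print(out_b)
```

In this toy setup the queries, keys, and values are identical across the two calls, so any difference between `out_a` and `out_b` comes only from the time embeddings. That is the property the abstract calls time-specific contextualized representations.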

Comments:	Findings of NAACL 2022. 9 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2202.02093 [cs.CL]
	(or arXiv:2202.02093v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2202.02093

Submission history

From: Guy Rosin
[v1] Fri, 4 Feb 2022 11:55:34 UTC (107 KB)
[v2] Tue, 3 May 2022 23:21:05 UTC (112 KB)
