CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations

Mai, Gengchen; Lao, Ni; He, Yutong; Song, Jiaming; Ermon, Stefano

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.01118 (cs)

[Submitted on 1 May 2023 (v1), last revised 9 May 2023 (this version, v2)]

Title:CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations

Authors:Gengchen Mai, Ni Lao, Yutong He, Jiaming Song, Stefano Ermon

View PDF

Abstract:Geo-tagged images are publicly available in large quantities, whereas labels such as object classes are rather scarce and expensive to collect. Meanwhile, contrastive learning has achieved tremendous success in various natural image and language tasks with limited labeled data. However, existing methods fail to fully leverage geospatial information, which can be paramount to distinguishing objects that are visually similar. To directly leverage the abundant geospatial information associated with images in pre-training, fine-tuning, and inference stages, we present Contrastive Spatial Pre-Training (CSP), a self-supervised learning framework for geo-tagged images. We use a dual-encoder to separately encode the images and their corresponding geo-locations, and use contrastive objectives to learn effective location representations from images, which can be transferred to downstream supervised tasks such as image classification. Experiments show that CSP can improve model performance on both iNat2018 and fMoW datasets. Especially, on iNat2018, CSP significantly boosts the model performance with 10-34% relative improvement with various labeled training data sampling ratios.

Comments:	In: ICML 2023, Jul 23 - 29, 2023, Honolulu, Hawaii, USA
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
MSC classes:	68T07, 68T45
ACM classes:	I.2.10; I.5.4; I.5.1; J.2
Cite as:	arXiv:2305.01118 [cs.CV]
	(or arXiv:2305.01118v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.01118

Submission history

From: Gengchen Mai [view email]
[v1] Mon, 1 May 2023 23:11:18 UTC (3,229 KB)
[v2] Tue, 9 May 2023 01:29:35 UTC (3,509 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators