Effective Fine-Tuning of Vision-Language Models for Accurate Galaxy Morphology Analysis

Wang, Ruoqi; Wang, Haitao; Luo, Qiong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.19475 (cs)

[Submitted on 29 Nov 2024]

Abstract: Galaxy morphology analysis involves classifying galaxies by their shapes and structures. For this task, directly training domain-specific models on large, annotated astronomical datasets is effective but costly. In contrast, fine-tuning vision foundation models on a smaller set of astronomical images is more resource-efficient but generally results in lower accuracy. To harness the benefits of both approaches and address their shortcomings, we propose GalaxAlign, a novel method that fine-tunes pre-trained foundation models to achieve high accuracy on astronomical tasks. Specifically, our method extends a contrastive learning architecture to align three types of data in fine-tuning: (1) a set of schematic symbols representing galaxy shapes and structures, (2) textual labels of these symbols, and (3) galaxy images. This way, GalaxAlign not only eliminates the need for expensive pretraining but also enhances the effectiveness of fine-tuning. Extensive experiments on galaxy classification and similarity search demonstrate that our method effectively fine-tunes general pre-trained models for astronomical tasks by incorporating domain-specific multi-modal knowledge.
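
As a rough illustration of the three-way alignment described in the abstract, the sketch below computes a symmetric InfoNCE-style contrastive loss for each pair of modalities (galaxy image, schematic symbol, text label) and averages the three. This is an assumption made for illustration only: the abstract does not specify GalaxAlign's actual loss, encoders, or hyperparameters, and the function names (`pairwise_contrastive_loss`, `tri_modal_loss`) and the temperature value 0.07 are hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=1, keepdims=True), eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def pairwise_contrastive_loss(a, b, temperature=0.07):
    """Symmetric InfoNCE loss between two batches of embeddings.

    Row i of `a` and row i of `b` are treated as a matching pair;
    every other row in the batch acts as a negative.
    """
    a, b = l2_normalize(a), l2_normalize(b)
    logits = a @ b.T / temperature           # (batch, batch) similarity matrix
    idx = np.arange(len(a))                  # positives sit on the diagonal
    loss_ab = -log_softmax(logits, axis=1)[idx, idx].mean()  # a -> b direction
    loss_ba = -log_softmax(logits, axis=0)[idx, idx].mean()  # b -> a direction
    return 0.5 * (loss_ab + loss_ba)


def tri_modal_loss(image_emb, symbol_emb, text_emb, temperature=0.07):
    """Average the pairwise loss over the three modality pairs (hypothetical design)."""
    return (
        pairwise_contrastive_loss(image_emb, symbol_emb, temperature)
        + pairwise_contrastive_loss(image_emb, text_emb, temperature)
        + pairwise_contrastive_loss(symbol_emb, text_emb, temperature)
    ) / 3.0
```

In practice the three embedding batches would come from trainable encoders; here they are plain arrays so that the alignment objective itself can be inspected in isolation.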

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Astrophysics of Galaxies (astro-ph.GA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2411.19475 [cs.CV]
	(or arXiv:2411.19475v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.19475

Submission history

From: Ruoqi Wang
[v1] Fri, 29 Nov 2024 05:10:47 UTC (9,445 KB)
