Astronomaly at scale: searching for anomalies amongst 4 million galaxies

Etsebeth, Verlon; Lochner, Michelle; Walmsley, Mike; Grespan, Margherita

doi:10.1093/mnras/stae496

Astrophysics > Instrumentation and Methods for Astrophysics

arXiv:2309.08660 (astro-ph)

[Submitted on 15 Sep 2023 (v1), last revised 29 Mar 2024 (this version, v2)]

Title:Astronomaly at scale: searching for anomalies amongst 4 million galaxies

Authors:Verlon Etsebeth, Michelle Lochner, Mike Walmsley, Margherita Grespan

View PDF HTML (experimental)

Abstract:Modern astronomical surveys are producing datasets of unprecedented size and richness, increasing the potential for high-impact scientific discovery. This possibility, coupled with the challenge of exploring a large number of sources, has led to the development of novel machine-learning-based anomaly detection approaches, such as Astronomaly. For the first time, we test the scalability of Astronomaly by applying it to almost 4 million images of galaxies from the Dark Energy Camera Legacy Survey. We use a trained deep learning algorithm to learn useful representations of the images and pass these to the anomaly detection algorithm isolation forest, coupled with Astronomaly's active learning method, to discover interesting sources. We find that data selection criteria have a significant impact on the trade-off between finding rare sources such as strong lenses and introducing artefacts into the dataset. We demonstrate that active learning is required to identify the most interesting sources and reduce artefacts, while anomaly detection methods alone are insufficient. Using Astronomaly, we find 1635 anomalies among the top 2000 sources in the dataset after applying active learning, including eight strong gravitational lens candidates, 1609 galaxy merger candidates, and 18 previously unidentified sources exhibiting highly unusual morphology. Our results show that by leveraging the human-machine interface, Astronomaly is able to rapidly identify sources of scientific interest even in large datasets.

Comments:	15 pages, 9 figures. Comments welcome, especially suggestions about the anomalous sources
Subjects:	Instrumentation and Methods for Astrophysics (astro-ph.IM); Astrophysics of Galaxies (astro-ph.GA)
Cite as:	arXiv:2309.08660 [astro-ph.IM]
	(or arXiv:2309.08660v2 [astro-ph.IM] for this version)
	https://doi.org/10.48550/arXiv.2309.08660
Journal reference:	MNRAS Volume 529, Issue 1, March 2024, Pages 732--747
Related DOI:	https://doi.org/10.1093/mnras/stae496

Submission history

From: Verlon Etsebeth [view email]
[v1] Fri, 15 Sep 2023 18:00:01 UTC (14,400 KB)
[v2] Fri, 29 Mar 2024 12:24:42 UTC (10,070 KB)

Astrophysics > Instrumentation and Methods for Astrophysics

Title:Astronomaly at scale: searching for anomalies amongst 4 million galaxies

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Astrophysics > Instrumentation and Methods for Astrophysics

Title:Astronomaly at scale: searching for anomalies amongst 4 million galaxies

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators