Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models

Donhauser, Konstantin; Ulicna, Kristina; Moran, Gemma Elyse; Ravuri, Aditya; Kenyon-Dean, Kian; Eastwood, Cian; Hartford, Jason

Computer Science > Machine Learning

arXiv:2412.16247 (cs)

[Submitted on 20 Dec 2024]

Title:Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models

Authors:Konstantin Donhauser, Kristina Ulicna, Gemma Elyse Moran, Aditya Ravuri, Kian Kenyon-Dean, Cian Eastwood, Jason Hartford

View PDF HTML (experimental)

Abstract:Dictionary learning (DL) has emerged as a powerful interpretability tool for large language models. By extracting known concepts (e.g., Golden-Gate Bridge) from human-interpretable data (e.g., text), sparse DL can elucidate a model's inner workings. In this work, we ask if DL can also be used to discover unknown concepts from less human-interpretable scientific data (e.g., cell images), ultimately enabling modern approaches to scientific discovery. As a first step, we use DL algorithms to study microscopy foundation models trained on multi-cell image data, where little prior knowledge exists regarding which high-level concepts should arise. We show that sparse dictionaries indeed extract biologically-meaningful concepts such as cell type and genetic perturbation type. We also propose a new DL algorithm, Iterative Codebook Feature Learning~(ICFL), and combine it with a pre-processing step that uses PCA whitening from a control dataset. In our experiments, we demonstrate that both ICFL and PCA improve the selectivity of extracted features compared to TopK sparse autoencoders.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2412.16247 [cs.LG]
	(or arXiv:2412.16247v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.16247

Submission history

From: Konstantin Donhauser [view email]
[v1] Fri, 20 Dec 2024 00:01:16 UTC (21,387 KB)

Computer Science > Machine Learning

Title:Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators