Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation

Gupta, Ravi Kant; Das, Shounak; Sekhar, Ardhendu; Sethi, Amit

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.12085 (cs)

[Submitted on 21 Jan 2025]

Title:Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation

Authors:Ravi Kant Gupta, Shounak Das, Ardhendu Sekhar, Amit Sethi

View PDF HTML (experimental)

Abstract:Whole slide images (WSIs) are high-resolution, gigapixel sized images that pose significant computational challenges for traditional machine learning models due to their size and this http URL this paper, we present a scalable and efficient methodology for WSI classification by leveraging patch-based feature extraction, clustering, and Fisher vector encoding. Initially, WSIs are divided into fixed size patches, and deep feature embeddings are extracted from each patch using a pre-trained convolutional neural network (CNN). These patch-level embeddings are subsequently clustered using K-means clustering, where each cluster aggregates semantically similar regions of the WSI. To effectively summarize each cluster, Fisher vector representations are computed by modeling the distribution of patch embeddings in each cluster as a parametric Gaussian mixture model (GMM). The Fisher vectors from each cluster are concatenated into a high-dimensional feature vector, creating a compact and informative representation of the entire WSI. This feature vector is then used by a classifier to predict the WSI's diagnostic label. Our method captures local and global tissue structures and yields robust performance for large-scale WSI classification, demonstrating superior accuracy and scalability compared to other approaches.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2501.12085 [cs.CV]
	(or arXiv:2501.12085v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.12085

Submission history

From: Ravi Kant Gupta [view email]
[v1] Tue, 21 Jan 2025 12:22:15 UTC (8,224 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Scalable Whole Slide Image Representation Using K-Mean Clustering and Fisher Vector Aggregation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators