Quantitative Biology > Quantitative Methods
[Submitted on 1 Sep 2017]
Title:Using Deep Convolutional Neural Networks to Circumvent Morphological Feature Specification when Classifying Subvisible Protein Aggregates from Micro-Flow Images
View PDFAbstract:Flow-Imaging Microscopy (FIM) is commonly used in both academia and industry to characterize subvisible particles (those $\le 25 \mu m$ in size) in protein therapeutics. Pharmaceutical companies are required to record vast volumes of FIM data on protein therapeutic products, but are only mandated under US FDA regulations (i.e., USP $\big \langle 788 \big \rangle$) to control the number of particles exceeding $10$ and $25 \mu m$ in delivered products. Hence, a vast amount of digital images are available to analyze. Current state-of-the-art methods rely on a relatively low-dimensional list of "morphological features" to characterize particles, but these methods ignore an enormous amount of information encoded in the existing large digital image repositories. Deep Convolutional Neural Networks (CNNs or "ConvNets") have demonstrated the ability to extract predictive information from raw macroscopic image data without requiring the selection or specification of "morphological features" in a variety of tasks. However, the heterogeneity, polydispersity of protein therapeutics, and optical phenomena associated with subvisible FIM particle measurements introduce new challenges regarding the application of CNNs to FIM image analysis. In this article, we demonstrate a supervised learning technique leveraging CNNs to extract information from raw images in order to predict the process conditions or stress states (freeze-thaw, mechanical shaking, etc.) that produced a variety of different protein images. We demonstrate that our new classifier (in combination with a sample "image pooling" strategy) can obtain nearly perfect predictions using as few as 20 FIM images from a given protein formulation in a variety of scenarios of relevance to protein therapeutics quality control and process monitoring.
Submission history
From: Christopher Calderon [view email][v1] Fri, 1 Sep 2017 04:36:45 UTC (2,832 KB)
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.