Machine Learning Driven Biomarker Selection for Medical Diagnosis

Bavikadi, Divyagna; Agarwal, Ayushi; Ganta, Shashank; Chung, Yunro; Song, Lusheng; Qiu, Ji; Shakarian, Paulo

arXiv:2405.10345 (q-bio)

[Submitted on 16 May 2024]

Abstract: Recent advances in experimental methods have enabled researchers to collect data on thousands of analytes simultaneously. This has led to correlational studies that associate molecular measurements with diseases such as Alzheimer's disease, liver cancer, and gastric cancer. However, using thousands of biomarkers selected from the analytes is not practical for real-world medical diagnosis and is likely undesirable because some of the resulting correlations may be spurious. In this study, we evaluate 4 methods for biomarker selection and 4 machine learning (ML) classifiers for identifying correlations, giving 16 approaches in all. We found that contemporary methods outperform previously reported logistic regression when 3 or 10 biomarkers are permitted. With specificity fixed at 0.9, ML approaches produced a sensitivity of 0.240 (3 biomarkers) and 0.520 (10 biomarkers), while standard logistic regression provided a sensitivity of 0.000 (3 biomarkers) and 0.040 (10 biomarkers). We also noted that causal-based methods for biomarker selection performed best when fewer biomarkers were permitted, while univariate feature selection performed best when a greater number of biomarkers were permitted.
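The abstract reports sensitivity at a fixed specificity of 0.9. As an illustration of that metric only, the sketch below shows one way it could be computed from classifier scores. It is not the authors' code: the function name `sensitivity_at_specificity`, the rule "predict positive when the score is strictly above the threshold", and the toy scores are all assumptions made for this example.

```python
def sensitivity_at_specificity(scores, labels, target_specificity=0.9):
    """Best sensitivity among thresholds whose specificity meets the target.

    Assumed convention (not taken from the paper): a sample is predicted
    positive when its score is strictly greater than the threshold.
    """
    neg = [s for s, y in zip(scores, labels) if y == 0]
    pos = [s for s, y in zip(scores, labels) if y == 1]
    if not neg or not pos:
        raise ValueError("need at least one positive and one negative sample")

    best = 0.0
    # Every observed score is a candidate threshold. The largest score always
    # qualifies (specificity 1.0, sensitivity 0.0), so a result always exists.
    for t in sorted(set(scores)):
        specificity = sum(s <= t for s in neg) / len(neg)
        if specificity >= target_specificity:
            sensitivity = sum(s > t for s in pos) / len(pos)
            best = max(best, sensitivity)
    return best


# Toy data (hypothetical): 10 controls (label 0) and 5 cases (label 1).
toy_scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.95,
              0.85, 0.92, 0.3, 0.97, 0.5]
toy_labels = [0] * 10 + [1] * 5
result = sensitivity_at_specificity(toy_scores, toy_labels, 0.9)
```

Fixing specificity first and then reading off sensitivity makes selection-method and classifier combinations comparable at the same false-positive rate, which is presumably why the abstract reports its results this way.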

Subjects:	Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2405.10345 [q-bio.QM]
	(or arXiv:2405.10345v1 [q-bio.QM] for this version)
	https://doi.org/10.48550/arXiv.2405.10345

Submission history

From: Divyagna Bavikadi
[v1] Thu, 16 May 2024 01:30:47 UTC (631 KB)
