Cargando…
Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning
Alzheimer’s disease (AD) is the most common form of neurodegenerative dementia and its timely diagnosis remains a major challenge in biomarker discovery. In the present study, we analyzed publicly available high-throughput low-sample -omics datasets from studies in AD blood, by the AutoML technology...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7563988/ https://www.ncbi.nlm.nih.gov/pubmed/32962113 http://dx.doi.org/10.3390/jcm9093016 |
_version_ | 1783595612641427456 |
---|---|
author | Karaglani, Makrina Gourlia, Krystallia Tsamardinos, Ioannis Chatzaki, Ekaterini |
author_facet | Karaglani, Makrina Gourlia, Krystallia Tsamardinos, Ioannis Chatzaki, Ekaterini |
author_sort | Karaglani, Makrina |
collection | PubMed |
description | Alzheimer’s disease (AD) is the most common form of neurodegenerative dementia and its timely diagnosis remains a major challenge in biomarker discovery. In the present study, we analyzed publicly available high-throughput low-sample -omics datasets from studies in AD blood, by the AutoML technology Just Add Data Bio (JADBIO), to construct accurate predictive models for use as diagnostic biosignatures. Considering data from AD patients and age–sex matched cognitively healthy individuals, we produced three best performing diagnostic biosignatures specific for the presence of AD: A. A 506-feature transcriptomic dataset from 48 AD and 22 controls led to a miRNA-based biosignature via Support Vector Machines with three miRNA predictors (AUC 0.975 (0.906, 1.000)), B. A 38,327-feature transcriptomic dataset from 134 AD and 100 controls led to six mRNA-based statistically equivalent signatures via Classification Random Forests with 25 mRNA predictors (AUC 0.846 (0.778, 0.905)) and C. A 9483-feature proteomic dataset from 25 AD and 37 controls led to a protein-based biosignature via Ridge Logistic Regression with seven protein predictors (AUC 0.921 (0.849, 0.972)). These performance metrics were also validated through the JADBIO pipeline confirming stability. In conclusion, using the automated machine learning tool JADBIO, we produced accurate predictive biosignatures extrapolating available low sample -omics data. These results offer options for minimally invasive blood-based diagnostic tests for AD, awaiting clinical validation based on respective laboratory assays. They also highlight the value of AutoML in biomarker discovery. |
format | Online Article Text |
id | pubmed-7563988 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75639882020-10-27 Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning Karaglani, Makrina Gourlia, Krystallia Tsamardinos, Ioannis Chatzaki, Ekaterini J Clin Med Article Alzheimer’s disease (AD) is the most common form of neurodegenerative dementia and its timely diagnosis remains a major challenge in biomarker discovery. In the present study, we analyzed publicly available high-throughput low-sample -omics datasets from studies in AD blood, by the AutoML technology Just Add Data Bio (JADBIO), to construct accurate predictive models for use as diagnostic biosignatures. Considering data from AD patients and age–sex matched cognitively healthy individuals, we produced three best performing diagnostic biosignatures specific for the presence of AD: A. A 506-feature transcriptomic dataset from 48 AD and 22 controls led to a miRNA-based biosignature via Support Vector Machines with three miRNA predictors (AUC 0.975 (0.906, 1.000)), B. A 38,327-feature transcriptomic dataset from 134 AD and 100 controls led to six mRNA-based statistically equivalent signatures via Classification Random Forests with 25 mRNA predictors (AUC 0.846 (0.778, 0.905)) and C. A 9483-feature proteomic dataset from 25 AD and 37 controls led to a protein-based biosignature via Ridge Logistic Regression with seven protein predictors (AUC 0.921 (0.849, 0.972)). These performance metrics were also validated through the JADBIO pipeline confirming stability. In conclusion, using the automated machine learning tool JADBIO, we produced accurate predictive biosignatures extrapolating available low sample -omics data. These results offer options for minimally invasive blood-based diagnostic tests for AD, awaiting clinical validation based on respective laboratory assays. They also highlight the value of AutoML in biomarker discovery. MDPI 2020-09-18 /pmc/articles/PMC7563988/ /pubmed/32962113 http://dx.doi.org/10.3390/jcm9093016 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Karaglani, Makrina Gourlia, Krystallia Tsamardinos, Ioannis Chatzaki, Ekaterini Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning |
title | Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning |
title_full | Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning |
title_fullStr | Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning |
title_full_unstemmed | Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning |
title_short | Accurate Blood-Based Diagnostic Biosignatures for Alzheimer’s Disease via Automated Machine Learning |
title_sort | accurate blood-based diagnostic biosignatures for alzheimer’s disease via automated machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7563988/ https://www.ncbi.nlm.nih.gov/pubmed/32962113 http://dx.doi.org/10.3390/jcm9093016 |
work_keys_str_mv | AT karaglanimakrina accuratebloodbaseddiagnosticbiosignaturesforalzheimersdiseaseviaautomatedmachinelearning AT gourliakrystallia accuratebloodbaseddiagnosticbiosignaturesforalzheimersdiseaseviaautomatedmachinelearning AT tsamardinosioannis accuratebloodbaseddiagnosticbiosignaturesforalzheimersdiseaseviaautomatedmachinelearning AT chatzakiekaterini accuratebloodbaseddiagnosticbiosignaturesforalzheimersdiseaseviaautomatedmachinelearning |