Cargando…

Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling

Late-onset Alzheimer’s Disease (LOAD) is the most common form of dementia in the elderly. Genome-wide association studies (GWAS) for LOAD have open new avenues to identify genetic causes and to provide diagnostic tools for early detection. Although several predictive models have been proposed using...

Descripción completa

Detalles Bibliográficos
Autores principales: Romero-Rosales, Brissa-Lizbeth, Tamez-Pena, Jose-Gerardo, Nicolini, Humberto, Moreno-Treviño, Maria-Guadalupe, Trevino, Victor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7179850/
https://www.ncbi.nlm.nih.gov/pubmed/32324812
http://dx.doi.org/10.1371/journal.pone.0232103
_version_ 1783525712564584448
author Romero-Rosales, Brissa-Lizbeth
Tamez-Pena, Jose-Gerardo
Nicolini, Humberto
Moreno-Treviño, Maria-Guadalupe
Trevino, Victor
author_facet Romero-Rosales, Brissa-Lizbeth
Tamez-Pena, Jose-Gerardo
Nicolini, Humberto
Moreno-Treviño, Maria-Guadalupe
Trevino, Victor
author_sort Romero-Rosales, Brissa-Lizbeth
collection PubMed
description Late-onset Alzheimer’s Disease (LOAD) is the most common form of dementia in the elderly. Genome-wide association studies (GWAS) for LOAD have open new avenues to identify genetic causes and to provide diagnostic tools for early detection. Although several predictive models have been proposed using the few detected GWAS markers, there is still a need for improvement and identification of potential markers. Commonly, polygenic risk scores are being used for prediction. Nevertheless, other methods to generate predictive models have been suggested. In this research, we compared three machine learning methods that have been proved to construct powerful predictive models (genetic algorithms, LASSO, and step-wise) and propose the inclusion of markers from misclassified samples to improve overall prediction accuracy. Our results show that the addition of markers from an initial model plus the markers of the model fitted to misclassified samples improves the area under the receiving operative curve by around 5%, reaching ~0.84, which is highly competitive using only genetic information. The computational strategy used here can help to devise better methods to improve classification models for AD. Our results could have a positive impact on the early diagnosis of Alzheimer’s disease.
format Online
Article
Text
id pubmed-7179850
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71798502020-05-05 Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling Romero-Rosales, Brissa-Lizbeth Tamez-Pena, Jose-Gerardo Nicolini, Humberto Moreno-Treviño, Maria-Guadalupe Trevino, Victor PLoS One Research Article Late-onset Alzheimer’s Disease (LOAD) is the most common form of dementia in the elderly. Genome-wide association studies (GWAS) for LOAD have open new avenues to identify genetic causes and to provide diagnostic tools for early detection. Although several predictive models have been proposed using the few detected GWAS markers, there is still a need for improvement and identification of potential markers. Commonly, polygenic risk scores are being used for prediction. Nevertheless, other methods to generate predictive models have been suggested. In this research, we compared three machine learning methods that have been proved to construct powerful predictive models (genetic algorithms, LASSO, and step-wise) and propose the inclusion of markers from misclassified samples to improve overall prediction accuracy. Our results show that the addition of markers from an initial model plus the markers of the model fitted to misclassified samples improves the area under the receiving operative curve by around 5%, reaching ~0.84, which is highly competitive using only genetic information. The computational strategy used here can help to devise better methods to improve classification models for AD. Our results could have a positive impact on the early diagnosis of Alzheimer’s disease. Public Library of Science 2020-04-23 /pmc/articles/PMC7179850/ /pubmed/32324812 http://dx.doi.org/10.1371/journal.pone.0232103 Text en © 2020 Romero-Rosales et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Romero-Rosales, Brissa-Lizbeth
Tamez-Pena, Jose-Gerardo
Nicolini, Humberto
Moreno-Treviño, Maria-Guadalupe
Trevino, Victor
Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling
title Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling
title_full Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling
title_fullStr Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling
title_full_unstemmed Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling
title_short Improving predictive models for Alzheimer’s disease using GWAS data by incorporating misclassified samples modeling
title_sort improving predictive models for alzheimer’s disease using gwas data by incorporating misclassified samples modeling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7179850/
https://www.ncbi.nlm.nih.gov/pubmed/32324812
http://dx.doi.org/10.1371/journal.pone.0232103
work_keys_str_mv AT romerorosalesbrissalizbeth improvingpredictivemodelsforalzheimersdiseaseusinggwasdatabyincorporatingmisclassifiedsamplesmodeling
AT tamezpenajosegerardo improvingpredictivemodelsforalzheimersdiseaseusinggwasdatabyincorporatingmisclassifiedsamplesmodeling
AT nicolinihumberto improvingpredictivemodelsforalzheimersdiseaseusinggwasdatabyincorporatingmisclassifiedsamplesmodeling
AT morenotrevinomariaguadalupe improvingpredictivemodelsforalzheimersdiseaseusinggwasdatabyincorporatingmisclassifiedsamplesmodeling
AT trevinovictor improvingpredictivemodelsforalzheimersdiseaseusinggwasdatabyincorporatingmisclassifiedsamplesmodeling