Genome-Wide Association Studies-Based Machine Learning for Prediction of Age-Related Macular Degeneration Risk

PURPOSE: Because age-related macular degeneration (AMD) is a progressive disorder and advanced AMD is currently hard to cure, an accurate and informative prediction of a person's AMD risk using genetic information is desirable for early diagnosis and potential individualized clinical management...

Bibliographic Details
Main authors: Yan, Qi; Jiang, Yale; Huang, Heng; Swaroop, Anand; Chew, Emily Y.; Weeks, Daniel E.; Chen, Wei; Ding, Ying
Format: Online Article Text
Language: English
Published: The Association for Research in Vision and Ophthalmology 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900884/
https://www.ncbi.nlm.nih.gov/pubmed/34003914
http://dx.doi.org/10.1167/tvst.10.2.29
author Yan, Qi
Jiang, Yale
Huang, Heng
Swaroop, Anand
Chew, Emily Y.
Weeks, Daniel E.
Chen, Wei
Ding, Ying
author_sort Yan, Qi
collection PubMed
description PURPOSE: Because age-related macular degeneration (AMD) is a progressive disorder and advanced AMD is currently hard to cure, an accurate and informative prediction of a person's AMD risk using genetic information is desirable for early diagnosis and potential individualized clinical management. The objective of this study was to develop and validate novel prediction models for AMD risk using large genome-wide association studies datasets with different machine learning approaches. METHODS: Genotype data from 32,215 Caucasian individuals with age of ≥50 years from the International AMD Genomics Consortium in dbGaP were used to establish and test prediction models for AMD risk. Four different machine learning approaches—neural network, lasso regression, support vector machine, and random forest—were implemented. A standard logistic regression model using a genetic risk score was also considered. RESULTS: All machine learning–based methods achieved satisfactory performance for predicting advanced AMD cases (vs. normal controls) (area under the curve = 0.81–0.82, Brier score = 0.17–0.18 in a separate test dataset) and any stage AMD (vs. normal controls) (area under the curve = 0.78–0.79, Brier score = 0.18–0.20 in a separate test dataset). The prediction performance was further validated in an independent dataset of 783 subjects from UK Biobank (area under the curve = 0.67). CONCLUSIONS: By applying multiple state-of-art machine learning approaches on large AMD genome-wide association studies datasets, the predictive models we established can provide an accurate estimation of an individual's AMD risk profile based on genetic information along with age. The online prediction interface is available at: https://yanq.shinyapps.io/no_vs_amd_NN/. TRANSLATIONAL RELEVANCE: The accurate and individualized risk prediction model interface will greatly improve early diagnosis and enhance tailored clinical management of AMD.
format Online
Article
Text
id pubmed-7900884
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-7900884 2021-02-26 Transl Vis Sci Technol Article The Association for Research in Vision and Ophthalmology 2021-02-18 /pmc/articles/PMC7900884/ /pubmed/34003914 http://dx.doi.org/10.1167/tvst.10.2.29 Text en Copyright 2021 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
title Genome-Wide Association Studies-Based Machine Learning for Prediction of Age-Related Macular Degeneration Risk
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900884/
https://www.ncbi.nlm.nih.gov/pubmed/34003914
http://dx.doi.org/10.1167/tvst.10.2.29