Cargando…

Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study

Using a genetic risk score (GRS) to predict a phenotype in a target sample can be complicated by missing data on the single nucleotide polymorphisms (SNPs) that comprise the GRS. This is usually addressed by imputation, omission of the SNPs or by replacing the missing SNPs with proxy SNPs. To assess...

Descripción completa

Detalles Bibliográficos
Autores principales: Chagnon, Miguel, O’Loughlin, Jennifer, Engert, James C., Karp, Igor, Sylvestre, Marie-Pierre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6053141/
https://www.ncbi.nlm.nih.gov/pubmed/30024900
http://dx.doi.org/10.1371/journal.pone.0200630
_version_ 1783340764027158528
author Chagnon, Miguel
O’Loughlin, Jennifer
Engert, James C.
Karp, Igor
Sylvestre, Marie-Pierre
author_facet Chagnon, Miguel
O’Loughlin, Jennifer
Engert, James C.
Karp, Igor
Sylvestre, Marie-Pierre
author_sort Chagnon, Miguel
collection PubMed
description Using a genetic risk score (GRS) to predict a phenotype in a target sample can be complicated by missing data on the single nucleotide polymorphisms (SNPs) that comprise the GRS. This is usually addressed by imputation, omission of the SNPs or by replacing the missing SNPs with proxy SNPs. To assess the impact of the omission and proxy approaches on effect size estimation and predictive ability of weighted and unweighted GRS with small numbers of SNPs, we simulated a dichotomous phenotype conditional on real genotype data. We considered scenarios in which the proportion of missing SNPs ranged from 20–70%. We assessed the impact of omitting or replacing missing SNPs on the association between the GRS and phenotype, the corresponding statistical power and the area under the receiver operating curve. Omission resulted in a larger bias towards the null value of the effect size, a smaller predictive ability and greater loss of statistical power than proxy approaches. The predictive ability of a weighted GRS that includes SNPs with large weights depends of the availability of these large-weight SNPs.
format Online
Article
Text
id pubmed-6053141
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60531412018-07-27 Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study Chagnon, Miguel O’Loughlin, Jennifer Engert, James C. Karp, Igor Sylvestre, Marie-Pierre PLoS One Research Article Using a genetic risk score (GRS) to predict a phenotype in a target sample can be complicated by missing data on the single nucleotide polymorphisms (SNPs) that comprise the GRS. This is usually addressed by imputation, omission of the SNPs or by replacing the missing SNPs with proxy SNPs. To assess the impact of the omission and proxy approaches on effect size estimation and predictive ability of weighted and unweighted GRS with small numbers of SNPs, we simulated a dichotomous phenotype conditional on real genotype data. We considered scenarios in which the proportion of missing SNPs ranged from 20–70%. We assessed the impact of omitting or replacing missing SNPs on the association between the GRS and phenotype, the corresponding statistical power and the area under the receiver operating curve. Omission resulted in a larger bias towards the null value of the effect size, a smaller predictive ability and greater loss of statistical power than proxy approaches. The predictive ability of a weighted GRS that includes SNPs with large weights depends of the availability of these large-weight SNPs. Public Library of Science 2018-07-19 /pmc/articles/PMC6053141/ /pubmed/30024900 http://dx.doi.org/10.1371/journal.pone.0200630 Text en © 2018 Chagnon et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chagnon, Miguel
O’Loughlin, Jennifer
Engert, James C.
Karp, Igor
Sylvestre, Marie-Pierre
Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study
title Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study
title_full Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study
title_fullStr Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study
title_full_unstemmed Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study
title_short Missing single nucleotide polymorphisms in Genetic Risk Scores: A simulation study
title_sort missing single nucleotide polymorphisms in genetic risk scores: a simulation study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6053141/
https://www.ncbi.nlm.nih.gov/pubmed/30024900
http://dx.doi.org/10.1371/journal.pone.0200630
work_keys_str_mv AT chagnonmiguel missingsinglenucleotidepolymorphismsingeneticriskscoresasimulationstudy
AT oloughlinjennifer missingsinglenucleotidepolymorphismsingeneticriskscoresasimulationstudy
AT engertjamesc missingsinglenucleotidepolymorphismsingeneticriskscoresasimulationstudy
AT karpigor missingsinglenucleotidepolymorphismsingeneticriskscoresasimulationstudy
AT sylvestremariepierre missingsinglenucleotidepolymorphismsingeneticriskscoresasimulationstudy