Cargando…

Does the inclusion of rare variants improve risk prediction?

Every known link between a genetic variant and blood pressure improves the understanding and potentially the risk assessment of related diseases such as hypertension. Genetic data have become increasingly comprehensive and available for an increasing number of samples. The availability of whole-geno...

Descripción completa

Detalles Bibliográficos
Autores principales: Austin, Erin, Pan, Wei, Shen, Xiaotong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143761/
https://www.ncbi.nlm.nih.gov/pubmed/25519349
http://dx.doi.org/10.1186/1753-6561-8-S1-S94
_version_ 1782331955439730688
author Austin, Erin
Pan, Wei
Shen, Xiaotong
author_facet Austin, Erin
Pan, Wei
Shen, Xiaotong
author_sort Austin, Erin
collection PubMed
description Every known link between a genetic variant and blood pressure improves the understanding and potentially the risk assessment of related diseases such as hypertension. Genetic data have become increasingly comprehensive and available for an increasing number of samples. The availability of whole-genome sequencing data means that statistical genetic models must evolve to meet the challenge of using both rare variants (RVs) and common variants (CVs) to link previously unidentified genome loci to disease-related traits. Penalized regression has two features, variable selection and proportional coefficient shrinkage, that allow researchers to build models tailored to hypothesized characteristics of the genotype-phenotype map. The following work uses the Genetic Analysis Workshop 18 data to investigate the performance of a spectrum of penalized regressions using at first only CVs or only RVs to predict systolic blood pressure (SBP). Next, combinations of CVs and RVs are used to model SBP, and the impact on prediction is quantified. The study demonstrates that penalized regression improves blood pressure prediction for any combination of CVs and RVs compared with maximum likelihood estimation. More significantly, models using both types of variants provide better predictions of SBP than those using only CVs or only RVs. The predictive mean squared error was reduced by up to 11.5% when RVs were added to CV-only penalized regression models. Elastic net regression with equally weighted LASSO and ridge components, in particular, can use large numbers of single-nucleotide polymorphisms to improve prediction.
format Online
Article
Text
id pubmed-4143761
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41437612014-09-02 Does the inclusion of rare variants improve risk prediction? Austin, Erin Pan, Wei Shen, Xiaotong BMC Proc Proceedings Every known link between a genetic variant and blood pressure improves the understanding and potentially the risk assessment of related diseases such as hypertension. Genetic data have become increasingly comprehensive and available for an increasing number of samples. The availability of whole-genome sequencing data means that statistical genetic models must evolve to meet the challenge of using both rare variants (RVs) and common variants (CVs) to link previously unidentified genome loci to disease-related traits. Penalized regression has two features, variable selection and proportional coefficient shrinkage, that allow researchers to build models tailored to hypothesized characteristics of the genotype-phenotype map. The following work uses the Genetic Analysis Workshop 18 data to investigate the performance of a spectrum of penalized regressions using at first only CVs or only RVs to predict systolic blood pressure (SBP). Next, combinations of CVs and RVs are used to model SBP, and the impact on prediction is quantified. The study demonstrates that penalized regression improves blood pressure prediction for any combination of CVs and RVs compared with maximum likelihood estimation. More significantly, models using both types of variants provide better predictions of SBP than those using only CVs or only RVs. The predictive mean squared error was reduced by up to 11.5% when RVs were added to CV-only penalized regression models. Elastic net regression with equally weighted LASSO and ridge components, in particular, can use large numbers of single-nucleotide polymorphisms to improve prediction. BioMed Central 2014-06-17 /pmc/articles/PMC4143761/ /pubmed/25519349 http://dx.doi.org/10.1186/1753-6561-8-S1-S94 Text en Copyright © 2014 Austin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Austin, Erin
Pan, Wei
Shen, Xiaotong
Does the inclusion of rare variants improve risk prediction?
title Does the inclusion of rare variants improve risk prediction?
title_full Does the inclusion of rare variants improve risk prediction?
title_fullStr Does the inclusion of rare variants improve risk prediction?
title_full_unstemmed Does the inclusion of rare variants improve risk prediction?
title_short Does the inclusion of rare variants improve risk prediction?
title_sort does the inclusion of rare variants improve risk prediction?
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4143761/
https://www.ncbi.nlm.nih.gov/pubmed/25519349
http://dx.doi.org/10.1186/1753-6561-8-S1-S94
work_keys_str_mv AT austinerin doestheinclusionofrarevariantsimproveriskprediction
AT panwei doestheinclusionofrarevariantsimproveriskprediction
AT shenxiaotong doestheinclusionofrarevariantsimproveriskprediction