Cargando…
Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
BACKGROUND: Genotype imputation is an important tool for whole-genome prediction as it allows cost reduction of individual genotyping. However, benefits of genotype imputation have been evaluated mostly for linear additive genetic models. In this study we investigated the impact of employing imputed...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333171/ https://www.ncbi.nlm.nih.gov/pubmed/25544265 http://dx.doi.org/10.1186/s12863-014-0149-9 |
_version_ | 1782357987740876800 |
---|---|
author | Felipe, Vivian PS Okut, Hayrettin Gianola, Daniel Silva, Martinho A Rosa, Guilherme JM |
author_facet | Felipe, Vivian PS Okut, Hayrettin Gianola, Daniel Silva, Martinho A Rosa, Guilherme JM |
author_sort | Felipe, Vivian PS |
collection | PubMed |
description | BACKGROUND: Genotype imputation is an important tool for whole-genome prediction as it allows cost reduction of individual genotyping. However, benefits of genotype imputation have been evaluated mostly for linear additive genetic models. In this study we investigated the impact of employing imputed genotypes when using more elaborated models of phenotype prediction. Our hypothesis was that such models would be able to track genetic signals using the observed genotypes only, with no additional information to be gained from imputed genotypes. RESULTS: For the present study, an outbred mice population containing 1,904 individuals and genotypes for 1,809 pre-selected markers was used. The effect of imputation was evaluated for a linear model (the Bayesian LASSO - BL) and for semi and non-parametric models (Reproducing Kernel Hilbert spaces regressions – RKHS, and Bayesian Regularized Artificial Neural Networks – BRANN, respectively). The RKHS method had the best predictive accuracy. Genotype imputation had a similar impact on the effectiveness of BL and RKHS. BRANN predictions were, apparently, more sensitive to imputation errors. In scenarios where the masking rates were 75% and 50%, the genotype imputation was not beneficial. However, genotype imputation incorporated information about important markers and improved predictive ability, especially for body mass index (BMI), when genotype information was sparse (90% masking), and for body weight (BW) when the reference sample for imputation was weakly related to the target population. CONCLUSIONS: In conclusion, genotype imputation is not always helpful for phenotype prediction, and so it should be considered in a case-by-case basis. In summary, factors that can affect the usefulness of genotype imputation for prediction of yet-to-be observed traits are: the imputation accuracy itself, the structure of the population, the genetic architecture of the target trait and also the model used for phenotype prediction. |
format | Online Article Text |
id | pubmed-4333171 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43331712015-02-20 Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data Felipe, Vivian PS Okut, Hayrettin Gianola, Daniel Silva, Martinho A Rosa, Guilherme JM BMC Genet Research Article BACKGROUND: Genotype imputation is an important tool for whole-genome prediction as it allows cost reduction of individual genotyping. However, benefits of genotype imputation have been evaluated mostly for linear additive genetic models. In this study we investigated the impact of employing imputed genotypes when using more elaborated models of phenotype prediction. Our hypothesis was that such models would be able to track genetic signals using the observed genotypes only, with no additional information to be gained from imputed genotypes. RESULTS: For the present study, an outbred mice population containing 1,904 individuals and genotypes for 1,809 pre-selected markers was used. The effect of imputation was evaluated for a linear model (the Bayesian LASSO - BL) and for semi and non-parametric models (Reproducing Kernel Hilbert spaces regressions – RKHS, and Bayesian Regularized Artificial Neural Networks – BRANN, respectively). The RKHS method had the best predictive accuracy. Genotype imputation had a similar impact on the effectiveness of BL and RKHS. BRANN predictions were, apparently, more sensitive to imputation errors. In scenarios where the masking rates were 75% and 50%, the genotype imputation was not beneficial. However, genotype imputation incorporated information about important markers and improved predictive ability, especially for body mass index (BMI), when genotype information was sparse (90% masking), and for body weight (BW) when the reference sample for imputation was weakly related to the target population. CONCLUSIONS: In conclusion, genotype imputation is not always helpful for phenotype prediction, and so it should be considered in a case-by-case basis. In summary, factors that can affect the usefulness of genotype imputation for prediction of yet-to-be observed traits are: the imputation accuracy itself, the structure of the population, the genetic architecture of the target trait and also the model used for phenotype prediction. BioMed Central 2014-12-29 /pmc/articles/PMC4333171/ /pubmed/25544265 http://dx.doi.org/10.1186/s12863-014-0149-9 Text en © Felipe et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Felipe, Vivian PS Okut, Hayrettin Gianola, Daniel Silva, Martinho A Rosa, Guilherme JM Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data |
title | Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data |
title_full | Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data |
title_fullStr | Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data |
title_full_unstemmed | Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data |
title_short | Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data |
title_sort | effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333171/ https://www.ncbi.nlm.nih.gov/pubmed/25544265 http://dx.doi.org/10.1186/s12863-014-0149-9 |
work_keys_str_mv | AT felipevivianps effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata AT okuthayrettin effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata AT gianoladaniel effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata AT silvamartinhoa effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata AT rosaguilhermejm effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata |