Cargando…

Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data

BACKGROUND: Genotype imputation is an important tool for whole-genome prediction as it allows cost reduction of individual genotyping. However, benefits of genotype imputation have been evaluated mostly for linear additive genetic models. In this study we investigated the impact of employing imputed...

Descripción completa

Detalles Bibliográficos
Autores principales: Felipe, Vivian PS, Okut, Hayrettin, Gianola, Daniel, Silva, Martinho A, Rosa, Guilherme JM
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333171/
https://www.ncbi.nlm.nih.gov/pubmed/25544265
http://dx.doi.org/10.1186/s12863-014-0149-9
_version_ 1782357987740876800
author Felipe, Vivian PS
Okut, Hayrettin
Gianola, Daniel
Silva, Martinho A
Rosa, Guilherme JM
author_facet Felipe, Vivian PS
Okut, Hayrettin
Gianola, Daniel
Silva, Martinho A
Rosa, Guilherme JM
author_sort Felipe, Vivian PS
collection PubMed
description BACKGROUND: Genotype imputation is an important tool for whole-genome prediction as it allows cost reduction of individual genotyping. However, benefits of genotype imputation have been evaluated mostly for linear additive genetic models. In this study we investigated the impact of employing imputed genotypes when using more elaborated models of phenotype prediction. Our hypothesis was that such models would be able to track genetic signals using the observed genotypes only, with no additional information to be gained from imputed genotypes. RESULTS: For the present study, an outbred mice population containing 1,904 individuals and genotypes for 1,809 pre-selected markers was used. The effect of imputation was evaluated for a linear model (the Bayesian LASSO - BL) and for semi and non-parametric models (Reproducing Kernel Hilbert spaces regressions – RKHS, and Bayesian Regularized Artificial Neural Networks – BRANN, respectively). The RKHS method had the best predictive accuracy. Genotype imputation had a similar impact on the effectiveness of BL and RKHS. BRANN predictions were, apparently, more sensitive to imputation errors. In scenarios where the masking rates were 75% and 50%, the genotype imputation was not beneficial. However, genotype imputation incorporated information about important markers and improved predictive ability, especially for body mass index (BMI), when genotype information was sparse (90% masking), and for body weight (BW) when the reference sample for imputation was weakly related to the target population. CONCLUSIONS: In conclusion, genotype imputation is not always helpful for phenotype prediction, and so it should be considered in a case-by-case basis. In summary, factors that can affect the usefulness of genotype imputation for prediction of yet-to-be observed traits are: the imputation accuracy itself, the structure of the population, the genetic architecture of the target trait and also the model used for phenotype prediction.
format Online
Article
Text
id pubmed-4333171
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43331712015-02-20 Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data Felipe, Vivian PS Okut, Hayrettin Gianola, Daniel Silva, Martinho A Rosa, Guilherme JM BMC Genet Research Article BACKGROUND: Genotype imputation is an important tool for whole-genome prediction as it allows cost reduction of individual genotyping. However, benefits of genotype imputation have been evaluated mostly for linear additive genetic models. In this study we investigated the impact of employing imputed genotypes when using more elaborated models of phenotype prediction. Our hypothesis was that such models would be able to track genetic signals using the observed genotypes only, with no additional information to be gained from imputed genotypes. RESULTS: For the present study, an outbred mice population containing 1,904 individuals and genotypes for 1,809 pre-selected markers was used. The effect of imputation was evaluated for a linear model (the Bayesian LASSO - BL) and for semi and non-parametric models (Reproducing Kernel Hilbert spaces regressions – RKHS, and Bayesian Regularized Artificial Neural Networks – BRANN, respectively). The RKHS method had the best predictive accuracy. Genotype imputation had a similar impact on the effectiveness of BL and RKHS. BRANN predictions were, apparently, more sensitive to imputation errors. In scenarios where the masking rates were 75% and 50%, the genotype imputation was not beneficial. However, genotype imputation incorporated information about important markers and improved predictive ability, especially for body mass index (BMI), when genotype information was sparse (90% masking), and for body weight (BW) when the reference sample for imputation was weakly related to the target population. CONCLUSIONS: In conclusion, genotype imputation is not always helpful for phenotype prediction, and so it should be considered in a case-by-case basis. In summary, factors that can affect the usefulness of genotype imputation for prediction of yet-to-be observed traits are: the imputation accuracy itself, the structure of the population, the genetic architecture of the target trait and also the model used for phenotype prediction. BioMed Central 2014-12-29 /pmc/articles/PMC4333171/ /pubmed/25544265 http://dx.doi.org/10.1186/s12863-014-0149-9 Text en © Felipe et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Felipe, Vivian PS
Okut, Hayrettin
Gianola, Daniel
Silva, Martinho A
Rosa, Guilherme JM
Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
title Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
title_full Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
title_fullStr Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
title_full_unstemmed Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
title_short Effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
title_sort effect of genotype imputation on genome-enabled prediction of complex traits: an empirical study with mice data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4333171/
https://www.ncbi.nlm.nih.gov/pubmed/25544265
http://dx.doi.org/10.1186/s12863-014-0149-9
work_keys_str_mv AT felipevivianps effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata
AT okuthayrettin effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata
AT gianoladaniel effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata
AT silvamartinhoa effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata
AT rosaguilhermejm effectofgenotypeimputationongenomeenabledpredictionofcomplextraitsanempiricalstudywithmicedata