
A nested mixture model for genomic prediction using whole-genome SNP genotypes

Genomic prediction exploits single nucleotide polymorphisms (SNPs) across the whole genome for predicting genetic merit of selection candidates. In most models for genomic prediction, e.g. BayesA, B, C, R and GBLUP, independence of SNP effects is assumed. However, SNP effects are expected to be loca...


Bibliographic Details
Main authors: Zeng, Jian, Garrick, Dorian, Dekkers, Jack, Fernando, Rohan
Format: Online Article Text
Language: English
Published: Public Library of Science 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862491/
https://www.ncbi.nlm.nih.gov/pubmed/29561877
http://dx.doi.org/10.1371/journal.pone.0194683
author Zeng, Jian
Garrick, Dorian
Dekkers, Jack
Fernando, Rohan
collection PubMed
description Genomic prediction exploits single nucleotide polymorphisms (SNPs) across the whole genome to predict the genetic merit of selection candidates. Most models for genomic prediction, e.g. BayesA, B, C, R and GBLUP, assume that SNP effects are independent. However, SNP effects are expected to be locally dependent given the presence of a nearby QTL, because SNPs surrounding the QTL do not segregate independently. A consequence of ignoring this dependence is that SNPs with small effects may be overly shrunk, e.g. effects from markers with high minor allele frequencies (MAF) that flank QTL with low MAF. A nested mixture model (BayesN) is developed to account for the dependence of effects of closely linked SNPs: in every non-overlapping genomic window, the SNP effects a priori follow either a point mass at zero for all SNPs or a mixture of some SNPs with nonzero effects and others with zero effects. It can be regarded as a parsimonious alternative to the existing antedependence model, antiBayesB, which allows a nonstationary dependence of SNP effects. Illumina 777K BovineHD genotypes from 948 Angus cattle were used to simulate 5,000 offspring, with 4,000 used for training and 1,000 for validation. Scenarios with 300 common (MAF > 0.05) or rare (MAF < 0.05) QTL randomly selected from segregating SNPs were replicated 8 times. SNPs corresponding to QTL were masked from a 600k panel comprising SNPs with MAF > 0.05 or a 50k evenly spaced subset of these. Compared with BayesB and a modified antiBayesB, BayesN improved the accuracy of prediction by up to 2.0% with 50k SNPs and by up to 7.0% with 600k SNPs, with most improvements occurring in the rare QTL scenario. Computing time was reduced by up to 60% with 50k SNPs and by up to 75% with 600k SNPs. BayesN is an accurate and computationally efficient method for genomic prediction with whole-genome SNPs, especially for traits with rare QTL.
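The two-level prior described in the abstract can be illustrated with a minimal sketch that draws SNP effects window by window. This is not the authors' implementation. The function name, the parameter names (`p_window`, `p_snp`, `effect_sd`), the fixed window size, and the choice of a normal distribution for nonzero effects are all assumptions made for illustration; the abstract does not specify them.

```python
import random


def sample_nested_mixture_effects(n_snps, window_size, p_window, p_snp,
                                  effect_sd, seed=0):
    """Draw SNP effects from a two-level (nested) mixture prior.

    Hypothetical parameters (not from the paper):
      p_window  - prior probability that a window contains nonzero effects
      p_snp     - prior probability that a SNP in such a window is nonzero
      effect_sd - standard deviation of the assumed normal nonzero effects
    """
    rng = random.Random(seed)
    effects = []
    # Walk over non-overlapping windows of consecutive SNPs.
    for start in range(0, n_snps, window_size):
        n_in_window = min(window_size, n_snps - start)
        # Level 1: with probability 1 - p_window the whole window is a
        # point mass at zero.
        window_active = rng.random() < p_window
        for _ in range(n_in_window):
            # Level 2: inside an active window, each SNP is nonzero with
            # probability p_snp and zero otherwise.
            if window_active and rng.random() < p_snp:
                effects.append(rng.gauss(0.0, effect_sd))
            else:
                effects.append(0.0)
    return effects


if __name__ == "__main__":
    effects = sample_nested_mixture_effects(
        n_snps=1000, window_size=50, p_window=0.1, p_snp=0.2, effect_sd=1.0
    )
    nonzero = sum(e != 0.0 for e in effects)
    print(f"{nonzero} of {len(effects)} SNP effects are nonzero")
```

In this sketch, nonzero effects cluster inside the few active windows, which is one way to picture the local dependence among closely linked SNPs that the model is meant to capture. An active window can still end up with all effects equal to zero by chance.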
format Online
Article
Text
id pubmed-5862491
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-5862491 2018-03-28 A nested mixture model for genomic prediction using whole-genome SNP genotypes Zeng, Jian Garrick, Dorian Dekkers, Jack Fernando, Rohan PLoS One Research Article Public Library of Science 2018-03-21 /pmc/articles/PMC5862491/ /pubmed/29561877 http://dx.doi.org/10.1371/journal.pone.0194683 Text en © 2018 Zeng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title A nested mixture model for genomic prediction using whole-genome SNP genotypes
topic Research Article