Cargando…
A nested mixture model for genomic prediction using whole-genome SNP genotypes
Genomic prediction exploits single nucleotide polymorphisms (SNPs) across the whole genome for predicting genetic merit of selection candidates. In most models for genomic prediction, e.g. BayesA, B, C, R and GBLUP, independence of SNP effects is assumed. However, SNP effects are expected to be loca...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862491/ https://www.ncbi.nlm.nih.gov/pubmed/29561877 http://dx.doi.org/10.1371/journal.pone.0194683 |
_version_ | 1783308237048643584 |
---|---|
author | Zeng, Jian Garrick, Dorian Dekkers, Jack Fernando, Rohan |
author_facet | Zeng, Jian Garrick, Dorian Dekkers, Jack Fernando, Rohan |
author_sort | Zeng, Jian |
collection | PubMed |
description | Genomic prediction exploits single nucleotide polymorphisms (SNPs) across the whole genome for predicting genetic merit of selection candidates. In most models for genomic prediction, e.g. BayesA, B, C, R and GBLUP, independence of SNP effects is assumed. However, SNP effects are expected to be locally dependent given the presence of a nearby QTL because SNPs surrounding the QTL do not segregate independently. A consequence of ignoring this dependence is that SNPs with small effects may be overly shrunk, e.g. effects from markers with high minor allele frequencies (MAF) that flank QTL with low MAF. A nested mixture model (BayesN) is developed to account for the dependence of effects of SNPs that are closely linked, where the effects of SNPs in every non-overlapping genomic window a priori follow a point mass at zero for all SNPs or a mixture of some SNPs with nonzero effects and others with zero effects. It can be regarded as a parsimonious alternative to the existing antedependence model, antiBayesB, which allow a nonstationary dependence of SNP effects. Illumina 777K BovineHD genotypes from 948 Angus cattle were used to simulate 5,000 offspring, with 4,000 used for training and 1,000 for validation. Scenarios with 300 common (MAF > 0.05) or rare (MAF < 0.05) QTL randomly selected from segregating SNPs were replicated 8 times. SNPs corresponding to QTL were masked from a 600k panel comprising SNPs with MAF > 0.05 or a 50k evenly spaced subset of these. Compared with BayesB and a modified antiBayesB, BayesN improved the accuracy of prediction up to 2.0% with 50k SNPs and up to 7.0% with 600k SNPs, most improvements occurring in the rare QTL scenario. Computing time was reduced up to 60% with 50k SNPs and up to 75% with 600k SNPs. BayesN is an accurate and computationally efficient method for genomic prediction with whole-genome SNPs, especially for traits with rare QTL. |
format | Online Article Text |
id | pubmed-5862491 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-58624912018-03-28 A nested mixture model for genomic prediction using whole-genome SNP genotypes Zeng, Jian Garrick, Dorian Dekkers, Jack Fernando, Rohan PLoS One Research Article Genomic prediction exploits single nucleotide polymorphisms (SNPs) across the whole genome for predicting genetic merit of selection candidates. In most models for genomic prediction, e.g. BayesA, B, C, R and GBLUP, independence of SNP effects is assumed. However, SNP effects are expected to be locally dependent given the presence of a nearby QTL because SNPs surrounding the QTL do not segregate independently. A consequence of ignoring this dependence is that SNPs with small effects may be overly shrunk, e.g. effects from markers with high minor allele frequencies (MAF) that flank QTL with low MAF. A nested mixture model (BayesN) is developed to account for the dependence of effects of SNPs that are closely linked, where the effects of SNPs in every non-overlapping genomic window a priori follow a point mass at zero for all SNPs or a mixture of some SNPs with nonzero effects and others with zero effects. It can be regarded as a parsimonious alternative to the existing antedependence model, antiBayesB, which allow a nonstationary dependence of SNP effects. Illumina 777K BovineHD genotypes from 948 Angus cattle were used to simulate 5,000 offspring, with 4,000 used for training and 1,000 for validation. Scenarios with 300 common (MAF > 0.05) or rare (MAF < 0.05) QTL randomly selected from segregating SNPs were replicated 8 times. SNPs corresponding to QTL were masked from a 600k panel comprising SNPs with MAF > 0.05 or a 50k evenly spaced subset of these. Compared with BayesB and a modified antiBayesB, BayesN improved the accuracy of prediction up to 2.0% with 50k SNPs and up to 7.0% with 600k SNPs, most improvements occurring in the rare QTL scenario. Computing time was reduced up to 60% with 50k SNPs and up to 75% with 600k SNPs. BayesN is an accurate and computationally efficient method for genomic prediction with whole-genome SNPs, especially for traits with rare QTL. Public Library of Science 2018-03-21 /pmc/articles/PMC5862491/ /pubmed/29561877 http://dx.doi.org/10.1371/journal.pone.0194683 Text en © 2018 Zeng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zeng, Jian Garrick, Dorian Dekkers, Jack Fernando, Rohan A nested mixture model for genomic prediction using whole-genome SNP genotypes |
title | A nested mixture model for genomic prediction using whole-genome SNP genotypes |
title_full | A nested mixture model for genomic prediction using whole-genome SNP genotypes |
title_fullStr | A nested mixture model for genomic prediction using whole-genome SNP genotypes |
title_full_unstemmed | A nested mixture model for genomic prediction using whole-genome SNP genotypes |
title_short | A nested mixture model for genomic prediction using whole-genome SNP genotypes |
title_sort | nested mixture model for genomic prediction using whole-genome snp genotypes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862491/ https://www.ncbi.nlm.nih.gov/pubmed/29561877 http://dx.doi.org/10.1371/journal.pone.0194683 |
work_keys_str_mv | AT zengjian anestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes AT garrickdorian anestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes AT dekkersjack anestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes AT fernandorohan anestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes AT zengjian nestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes AT garrickdorian nestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes AT dekkersjack nestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes AT fernandorohan nestedmixturemodelforgenomicpredictionusingwholegenomesnpgenotypes |