Cargando…
The effect of rare alleles on estimated genomic relationships from whole genome sequence data
BACKGROUND: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compar...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4365517/ https://www.ncbi.nlm.nih.gov/pubmed/25887220 http://dx.doi.org/10.1186/s12863-015-0185-0 |
_version_ | 1782362229578924032 |
---|---|
author | Eynard, Sonia E Windig, Jack J Leroy, Grégoire van Binsbergen, Rianne Calus, Mario PL |
author_facet | Eynard, Sonia E Windig, Jack J Leroy, Grégoire van Binsbergen, Rianne Calus, Mario PL |
author_sort | Eynard, Sonia E |
collection | PubMed |
description | BACKGROUND: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleotide Polymorphism (SNP) chips. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull genomes project. To determine the impact of rare alleles on the estimates we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1% and variants with a MAF between 1% and 5%. RESULTS: We observed significant differences between estimated relationships and, although less significantly, inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Computed correlations between pedigree and genomic relationships, within groups with similar relationships, ranged from negative to moderate for both estimated relationships and inbreeding coefficients, but were high between estimates from SNP and WGS (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than from pedigree. Inbreeding coefficients analysis showed that more complete pedigree records lead to higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimates and correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger when accounting for allele frequencies than without accounting for allele frequencies. CONCLUSIONS: Using pedigree data or genomic information, and including or excluding variants with a MAF below 5% showed significant differences in relationship and inbreeding coefficient estimates. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, it can be expected that using WGS instead of SNP can affect selection decision. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0185-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4365517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43655172015-03-20 The effect of rare alleles on estimated genomic relationships from whole genome sequence data Eynard, Sonia E Windig, Jack J Leroy, Grégoire van Binsbergen, Rianne Calus, Mario PL BMC Genet Research Article BACKGROUND: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleotide Polymorphism (SNP) chips. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull genomes project. To determine the impact of rare alleles on the estimates we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1% and variants with a MAF between 1% and 5%. RESULTS: We observed significant differences between estimated relationships and, although less significantly, inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Computed correlations between pedigree and genomic relationships, within groups with similar relationships, ranged from negative to moderate for both estimated relationships and inbreeding coefficients, but were high between estimates from SNP and WGS (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than from pedigree. Inbreeding coefficients analysis showed that more complete pedigree records lead to higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimates and correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger when accounting for allele frequencies than without accounting for allele frequencies. CONCLUSIONS: Using pedigree data or genomic information, and including or excluding variants with a MAF below 5% showed significant differences in relationship and inbreeding coefficient estimates. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, it can be expected that using WGS instead of SNP can affect selection decision. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0185-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-12 /pmc/articles/PMC4365517/ /pubmed/25887220 http://dx.doi.org/10.1186/s12863-015-0185-0 Text en © Eynard et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Eynard, Sonia E Windig, Jack J Leroy, Grégoire van Binsbergen, Rianne Calus, Mario PL The effect of rare alleles on estimated genomic relationships from whole genome sequence data |
title | The effect of rare alleles on estimated genomic relationships from whole genome sequence data |
title_full | The effect of rare alleles on estimated genomic relationships from whole genome sequence data |
title_fullStr | The effect of rare alleles on estimated genomic relationships from whole genome sequence data |
title_full_unstemmed | The effect of rare alleles on estimated genomic relationships from whole genome sequence data |
title_short | The effect of rare alleles on estimated genomic relationships from whole genome sequence data |
title_sort | effect of rare alleles on estimated genomic relationships from whole genome sequence data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4365517/ https://www.ncbi.nlm.nih.gov/pubmed/25887220 http://dx.doi.org/10.1186/s12863-015-0185-0 |
work_keys_str_mv | AT eynardsoniae theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT windigjackj theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT leroygregoire theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT vanbinsbergenrianne theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT calusmariopl theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT eynardsoniae effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT windigjackj effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT leroygregoire effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT vanbinsbergenrianne effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata AT calusmariopl effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata |