Cargando…

The effect of rare alleles on estimated genomic relationships from whole genome sequence data

BACKGROUND: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compar...

Descripción completa

Detalles Bibliográficos
Autores principales: Eynard, Sonia E, Windig, Jack J, Leroy, Grégoire, van Binsbergen, Rianne, Calus, Mario PL
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4365517/
https://www.ncbi.nlm.nih.gov/pubmed/25887220
http://dx.doi.org/10.1186/s12863-015-0185-0
_version_ 1782362229578924032
author Eynard, Sonia E
Windig, Jack J
Leroy, Grégoire
van Binsbergen, Rianne
Calus, Mario PL
author_facet Eynard, Sonia E
Windig, Jack J
Leroy, Grégoire
van Binsbergen, Rianne
Calus, Mario PL
author_sort Eynard, Sonia E
collection PubMed
description BACKGROUND: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleotide Polymorphism (SNP) chips. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull genomes project. To determine the impact of rare alleles on the estimates we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1% and variants with a MAF between 1% and 5%. RESULTS: We observed significant differences between estimated relationships and, although less significantly, inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Computed correlations between pedigree and genomic relationships, within groups with similar relationships, ranged from negative to moderate for both estimated relationships and inbreeding coefficients, but were high between estimates from SNP and WGS (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than from pedigree. Inbreeding coefficients analysis showed that more complete pedigree records lead to higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimates and correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger when accounting for allele frequencies than without accounting for allele frequencies. CONCLUSIONS: Using pedigree data or genomic information, and including or excluding variants with a MAF below 5% showed significant differences in relationship and inbreeding coefficient estimates. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, it can be expected that using WGS instead of SNP can affect selection decision. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0185-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4365517
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43655172015-03-20 The effect of rare alleles on estimated genomic relationships from whole genome sequence data Eynard, Sonia E Windig, Jack J Leroy, Grégoire van Binsbergen, Rianne Calus, Mario PL BMC Genet Research Article BACKGROUND: Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleotide Polymorphism (SNP) chips. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull genomes project. To determine the impact of rare alleles on the estimates we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1% and variants with a MAF between 1% and 5%. RESULTS: We observed significant differences between estimated relationships and, although less significantly, inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Computed correlations between pedigree and genomic relationships, within groups with similar relationships, ranged from negative to moderate for both estimated relationships and inbreeding coefficients, but were high between estimates from SNP and WGS (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than from pedigree. Inbreeding coefficients analysis showed that more complete pedigree records lead to higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimates and correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger when accounting for allele frequencies than without accounting for allele frequencies. CONCLUSIONS: Using pedigree data or genomic information, and including or excluding variants with a MAF below 5% showed significant differences in relationship and inbreeding coefficient estimates. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, it can be expected that using WGS instead of SNP can affect selection decision. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0185-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-12 /pmc/articles/PMC4365517/ /pubmed/25887220 http://dx.doi.org/10.1186/s12863-015-0185-0 Text en © Eynard et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Eynard, Sonia E
Windig, Jack J
Leroy, Grégoire
van Binsbergen, Rianne
Calus, Mario PL
The effect of rare alleles on estimated genomic relationships from whole genome sequence data
title The effect of rare alleles on estimated genomic relationships from whole genome sequence data
title_full The effect of rare alleles on estimated genomic relationships from whole genome sequence data
title_fullStr The effect of rare alleles on estimated genomic relationships from whole genome sequence data
title_full_unstemmed The effect of rare alleles on estimated genomic relationships from whole genome sequence data
title_short The effect of rare alleles on estimated genomic relationships from whole genome sequence data
title_sort effect of rare alleles on estimated genomic relationships from whole genome sequence data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4365517/
https://www.ncbi.nlm.nih.gov/pubmed/25887220
http://dx.doi.org/10.1186/s12863-015-0185-0
work_keys_str_mv AT eynardsoniae theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT windigjackj theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT leroygregoire theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT vanbinsbergenrianne theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT calusmariopl theeffectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT eynardsoniae effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT windigjackj effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT leroygregoire effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT vanbinsbergenrianne effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata
AT calusmariopl effectofrareallelesonestimatedgenomicrelationshipsfromwholegenomesequencedata