Cargando…

Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle

The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of “missing heritability”–the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcri...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonzalez-Recio, Oscar, Daetwyler, Hans D., MacLeod, Iona M., Pryce, Jennie E., Bowman, Phil J., Hayes, Ben J., Goddard, Michael E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4671594/
https://www.ncbi.nlm.nih.gov/pubmed/26642058
http://dx.doi.org/10.1371/journal.pone.0143945
_version_ 1782404423556792320
author Gonzalez-Recio, Oscar
Daetwyler, Hans D.
MacLeod, Iona M.
Pryce, Jennie E.
Bowman, Phil J.
Hayes, Ben J.
Goddard, Michael E.
author_facet Gonzalez-Recio, Oscar
Daetwyler, Hans D.
MacLeod, Iona M.
Pryce, Jennie E.
Bowman, Phil J.
Hayes, Ben J.
Goddard, Michael E.
author_sort Gonzalez-Recio, Oscar
collection PubMed
description The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of “missing heritability”–the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01<MAF<0.05), and 83,856 rare variants (MAF<0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits, however very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits.
format Online
Article
Text
id pubmed-4671594
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46715942015-12-10 Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle Gonzalez-Recio, Oscar Daetwyler, Hans D. MacLeod, Iona M. Pryce, Jennie E. Bowman, Phil J. Hayes, Ben J. Goddard, Michael E. PLoS One Research Article The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of “missing heritability”–the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01<MAF<0.05), and 83,856 rare variants (MAF<0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits, however very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits. Public Library of Science 2015-12-07 /pmc/articles/PMC4671594/ /pubmed/26642058 http://dx.doi.org/10.1371/journal.pone.0143945 Text en © 2015 Gonzalez-Recio et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Gonzalez-Recio, Oscar
Daetwyler, Hans D.
MacLeod, Iona M.
Pryce, Jennie E.
Bowman, Phil J.
Hayes, Ben J.
Goddard, Michael E.
Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle
title Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle
title_full Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle
title_fullStr Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle
title_full_unstemmed Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle
title_short Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle
title_sort rare variants in transcript and potential regulatory regions explain a small percentage of the missing heritability of complex traits in cattle
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4671594/
https://www.ncbi.nlm.nih.gov/pubmed/26642058
http://dx.doi.org/10.1371/journal.pone.0143945
work_keys_str_mv AT gonzalezreciooscar rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT daetwylerhansd rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT macleodionam rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT prycejenniee rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT bowmanphilj rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT hayesbenj rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle
AT goddardmichaele rarevariantsintranscriptandpotentialregulatoryregionsexplainasmallpercentageofthemissingheritabilityofcomplextraitsincattle