Cargando…

Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds

BACKGROUND: Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multiple GWAS...

Descripción completa

Detalles Bibliográficos
Autores principales: van den Berg, Irene, Xiang, Ruidong, Jenko, Janez, Pausch, Hubert, Boussaha, Mekki, Schrooten, Chris, Tribout, Thierry, Gjuvsland, Arne B., Boichard, Didier, Nordbø, Øyvind, Sanchez, Marie-Pierre, Goddard, Mike E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7339598/
https://www.ncbi.nlm.nih.gov/pubmed/32635893
http://dx.doi.org/10.1186/s12711-020-00556-4
_version_ 1783554923261067264
author van den Berg, Irene
Xiang, Ruidong
Jenko, Janez
Pausch, Hubert
Boussaha, Mekki
Schrooten, Chris
Tribout, Thierry
Gjuvsland, Arne B.
Boichard, Didier
Nordbø, Øyvind
Sanchez, Marie-Pierre
Goddard, Mike E.
author_facet van den Berg, Irene
Xiang, Ruidong
Jenko, Janez
Pausch, Hubert
Boussaha, Mekki
Schrooten, Chris
Tribout, Thierry
Gjuvsland, Arne B.
Boichard, Didier
Nordbø, Øyvind
Sanchez, Marie-Pierre
Goddard, Mike E.
author_sort van den Berg, Irene
collection PubMed
description BACKGROUND: Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multiple GWAS and increases the power to detect trait-associated variants without requiring access to data at the individual level of the GWAS mapping cohorts. Because linkage disequilibrium between adjacent markers is conserved only over short distances across breeds, a multi-breed meta-analysis can improve mapping precision. RESULTS: To maximise the power to identify quantitative trait loci (QTL), we combined the results of nine within-population GWAS that used imputed sequence variant genotypes of 94,321 cattle from eight breeds, to perform a large-scale meta-analysis for fat and protein percentage in cattle. The meta-analysis detected (p ≤ 10(−8)) 138 QTL for fat percentage and 176 QTL for protein percentage. This was more than the number of QTL detected in all within-population GWAS together (124 QTL for fat percentage and 104 QTL for protein percentage). Among all the lead variants, 100 QTL for fat percentage and 114 QTL for protein percentage had the same direction of effect in all within-population GWAS. This indicates either persistence of the linkage phase between the causal variant and the lead variant across breeds or that some of the lead variants might indeed be causal or tightly linked with causal variants. The percentage of intergenic variants was substantially lower for significant variants than for non-significant variants, and significant variants had mostly moderate to high minor allele frequencies. Significant variants were also clustered in genes that are known to be relevant for fat and protein percentages in milk. CONCLUSIONS: Our study identified a large number of QTL associated with fat and protein percentage in dairy cattle. We demonstrated that large-scale multi-breed meta-analysis reveals more QTL at the nucleotide resolution than within-population GWAS. Significant variants were more often located in genic regions than non-significant variants and a large part of them was located in potentially regulatory regions.
format Online
Article
Text
id pubmed-7339598
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73395982020-07-09 Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds van den Berg, Irene Xiang, Ruidong Jenko, Janez Pausch, Hubert Boussaha, Mekki Schrooten, Chris Tribout, Thierry Gjuvsland, Arne B. Boichard, Didier Nordbø, Øyvind Sanchez, Marie-Pierre Goddard, Mike E. Genet Sel Evol Research Article BACKGROUND: Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multiple GWAS and increases the power to detect trait-associated variants without requiring access to data at the individual level of the GWAS mapping cohorts. Because linkage disequilibrium between adjacent markers is conserved only over short distances across breeds, a multi-breed meta-analysis can improve mapping precision. RESULTS: To maximise the power to identify quantitative trait loci (QTL), we combined the results of nine within-population GWAS that used imputed sequence variant genotypes of 94,321 cattle from eight breeds, to perform a large-scale meta-analysis for fat and protein percentage in cattle. The meta-analysis detected (p ≤ 10(−8)) 138 QTL for fat percentage and 176 QTL for protein percentage. This was more than the number of QTL detected in all within-population GWAS together (124 QTL for fat percentage and 104 QTL for protein percentage). Among all the lead variants, 100 QTL for fat percentage and 114 QTL for protein percentage had the same direction of effect in all within-population GWAS. This indicates either persistence of the linkage phase between the causal variant and the lead variant across breeds or that some of the lead variants might indeed be causal or tightly linked with causal variants. The percentage of intergenic variants was substantially lower for significant variants than for non-significant variants, and significant variants had mostly moderate to high minor allele frequencies. Significant variants were also clustered in genes that are known to be relevant for fat and protein percentages in milk. CONCLUSIONS: Our study identified a large number of QTL associated with fat and protein percentage in dairy cattle. We demonstrated that large-scale multi-breed meta-analysis reveals more QTL at the nucleotide resolution than within-population GWAS. Significant variants were more often located in genic regions than non-significant variants and a large part of them was located in potentially regulatory regions. BioMed Central 2020-07-07 /pmc/articles/PMC7339598/ /pubmed/32635893 http://dx.doi.org/10.1186/s12711-020-00556-4 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
van den Berg, Irene
Xiang, Ruidong
Jenko, Janez
Pausch, Hubert
Boussaha, Mekki
Schrooten, Chris
Tribout, Thierry
Gjuvsland, Arne B.
Boichard, Didier
Nordbø, Øyvind
Sanchez, Marie-Pierre
Goddard, Mike E.
Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_full Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_fullStr Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_full_unstemmed Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_short Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_sort meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7339598/
https://www.ncbi.nlm.nih.gov/pubmed/32635893
http://dx.doi.org/10.1186/s12711-020-00556-4
work_keys_str_mv AT vandenbergirene metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT xiangruidong metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT jenkojanez metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT pauschhubert metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT boussahamekki metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT schrootenchris metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT triboutthierry metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT gjuvslandarneb metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT boicharddidier metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT nordbøøyvind metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT sanchezmariepierre metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT goddardmikee metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds