Cargando…

Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle

BACKGROUND: Genome-wide association studies (GWAS) are widely used to identify regions of the genome that harbor genetic determinants of quantitative traits. However, the multiple-testing burden from scanning tens of millions of whole-genome sequence variants reduces the power to identify associated...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Zexi, Guldbrandtsen, Bernt, Lund, Mogens Sandø, Sahana, Goutam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511139/
https://www.ncbi.nlm.nih.gov/pubmed/31077144
http://dx.doi.org/10.1186/s12711-019-0463-9
_version_ 1783417525954936832
author Cai, Zexi
Guldbrandtsen, Bernt
Lund, Mogens Sandø
Sahana, Goutam
author_facet Cai, Zexi
Guldbrandtsen, Bernt
Lund, Mogens Sandø
Sahana, Goutam
author_sort Cai, Zexi
collection PubMed
description BACKGROUND: Genome-wide association studies (GWAS) are widely used to identify regions of the genome that harbor genetic determinants of quantitative traits. However, the multiple-testing burden from scanning tens of millions of whole-genome sequence variants reduces the power to identify associated variants, especially if sample size is limited. In addition, factors such as inaccuracy of imputation, complex linkage disequilibrium structures, and multiple closely-located causal variants may result in an identified causative mutation not being the most significant single nucleotide polymorphism in a particular genomic region. Therefore, the use of information from different sources, particularly variant annotations, was proposed to enhance the fine-mapping of causal variants. Here, we tested whether applying significance thresholds based on variant annotation categories increases the power of GWAS compared with a flat Bonferroni multiple-testing correction. RESULTS: Whole-genome sequence variants in dairy cattle were categorized according to type and predicted impact. Then, GWAS between markers and 17 quantitative traits were analyzed for enrichment for association of each annotation category. By using annotation categories that were determined with the variants effect predictor software and datasets indicating regions of open chromatin, “low impact” variants were found to be highly enriched. Moreover, when the variants annotated as “modifier” and not located at open chromatin regions were further classified into different types of potential regulatory elements, the high impact variants, moderate impact variants, variants located in the 3′ and 5′ untranslated regions, and variants located in potential non-coding RNA regions exhibited relatively more enrichment. In contrast, a similar study on human GWAS data reported that enrichment of association signals was highest with high impact variants. We observed an increase in power when these variant category-based significance thresholds were applied for GWAS results on stature in Nordic Holstein cattle, as more candidate genes from previous large GWAS meta-analysis for cattle stature were confirmed. CONCLUSIONS: Use of variant category-based genome-wide significance thresholds can marginally increase the power to detect the candidate genes in cattle. With the continued improvements in annotation of the bovine genome, we anticipate that the growing usefulness of variant category-based significance thresholds will be demonstrated. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-019-0463-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6511139
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65111392019-05-20 Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle Cai, Zexi Guldbrandtsen, Bernt Lund, Mogens Sandø Sahana, Goutam Genet Sel Evol Research Article BACKGROUND: Genome-wide association studies (GWAS) are widely used to identify regions of the genome that harbor genetic determinants of quantitative traits. However, the multiple-testing burden from scanning tens of millions of whole-genome sequence variants reduces the power to identify associated variants, especially if sample size is limited. In addition, factors such as inaccuracy of imputation, complex linkage disequilibrium structures, and multiple closely-located causal variants may result in an identified causative mutation not being the most significant single nucleotide polymorphism in a particular genomic region. Therefore, the use of information from different sources, particularly variant annotations, was proposed to enhance the fine-mapping of causal variants. Here, we tested whether applying significance thresholds based on variant annotation categories increases the power of GWAS compared with a flat Bonferroni multiple-testing correction. RESULTS: Whole-genome sequence variants in dairy cattle were categorized according to type and predicted impact. Then, GWAS between markers and 17 quantitative traits were analyzed for enrichment for association of each annotation category. By using annotation categories that were determined with the variants effect predictor software and datasets indicating regions of open chromatin, “low impact” variants were found to be highly enriched. Moreover, when the variants annotated as “modifier” and not located at open chromatin regions were further classified into different types of potential regulatory elements, the high impact variants, moderate impact variants, variants located in the 3′ and 5′ untranslated regions, and variants located in potential non-coding RNA regions exhibited relatively more enrichment. In contrast, a similar study on human GWAS data reported that enrichment of association signals was highest with high impact variants. We observed an increase in power when these variant category-based significance thresholds were applied for GWAS results on stature in Nordic Holstein cattle, as more candidate genes from previous large GWAS meta-analysis for cattle stature were confirmed. CONCLUSIONS: Use of variant category-based genome-wide significance thresholds can marginally increase the power to detect the candidate genes in cattle. With the continued improvements in annotation of the bovine genome, we anticipate that the growing usefulness of variant category-based significance thresholds will be demonstrated. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-019-0463-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-05-10 /pmc/articles/PMC6511139/ /pubmed/31077144 http://dx.doi.org/10.1186/s12711-019-0463-9 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Cai, Zexi
Guldbrandtsen, Bernt
Lund, Mogens Sandø
Sahana, Goutam
Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle
title Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle
title_full Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle
title_fullStr Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle
title_full_unstemmed Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle
title_short Weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle
title_sort weighting sequence variants based on their annotation increases the power of genome-wide association studies in dairy cattle
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511139/
https://www.ncbi.nlm.nih.gov/pubmed/31077144
http://dx.doi.org/10.1186/s12711-019-0463-9
work_keys_str_mv AT caizexi weightingsequencevariantsbasedontheirannotationincreasesthepowerofgenomewideassociationstudiesindairycattle
AT guldbrandtsenbernt weightingsequencevariantsbasedontheirannotationincreasesthepowerofgenomewideassociationstudiesindairycattle
AT lundmogenssandø weightingsequencevariantsbasedontheirannotationincreasesthepowerofgenomewideassociationstudiesindairycattle
AT sahanagoutam weightingsequencevariantsbasedontheirannotationincreasesthepowerofgenomewideassociationstudiesindairycattle