Cargando…

Estimation of a significance threshold for genome-wide association studies

BACKGROUND: Selection of an appropriate statistical significance threshold in genome-wide association studies is critical to differentiate true positives from false positives and false negatives. Different multiple testing comparison methods have been developed to determine the significance threshol...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaler, Avjinder S., Purcell, Larry C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6664749/
https://www.ncbi.nlm.nih.gov/pubmed/31357925
http://dx.doi.org/10.1186/s12864-019-5992-7
_version_ 1783439949887963136
author Kaler, Avjinder S.
Purcell, Larry C.
author_facet Kaler, Avjinder S.
Purcell, Larry C.
author_sort Kaler, Avjinder S.
collection PubMed
description BACKGROUND: Selection of an appropriate statistical significance threshold in genome-wide association studies is critical to differentiate true positives from false positives and false negatives. Different multiple testing comparison methods have been developed to determine the significance threshold; however, these methods may be overly conservative and may lead to an increase in false negatives. Here, we developed an empirical formula to determine the statistical significance threshold that is based on the marker-based heritability of the trait. To develop a formula for a significance threshold, we used 45 simulated traits in soybean, maize, and rice that varied in both broad sense heritability and the number of QTLs. RESULTS: A formula to determine a significance threshold was developed based on a regression equation that used one independent variable, marker-based heritability, and one response variable, − log(10) (P)-values. For all species, the threshold –log(10) (P)-values increased as both marker-based and broad-sense heritability increased. Higher broad sense heritability in these crops resulted in higher significant threshold values. Among crop species, maize, with a lower linkage disequilibrium pattern, had higher significant threshold values as compared to soybean and rice. CONCLUSIONS: Our formula was less conservative and identified more true positive associations than the false discovery rate and Bonferroni correction methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5992-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6664749
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66647492019-08-05 Estimation of a significance threshold for genome-wide association studies Kaler, Avjinder S. Purcell, Larry C. BMC Genomics Methodology Article BACKGROUND: Selection of an appropriate statistical significance threshold in genome-wide association studies is critical to differentiate true positives from false positives and false negatives. Different multiple testing comparison methods have been developed to determine the significance threshold; however, these methods may be overly conservative and may lead to an increase in false negatives. Here, we developed an empirical formula to determine the statistical significance threshold that is based on the marker-based heritability of the trait. To develop a formula for a significance threshold, we used 45 simulated traits in soybean, maize, and rice that varied in both broad sense heritability and the number of QTLs. RESULTS: A formula to determine a significance threshold was developed based on a regression equation that used one independent variable, marker-based heritability, and one response variable, − log(10) (P)-values. For all species, the threshold –log(10) (P)-values increased as both marker-based and broad-sense heritability increased. Higher broad sense heritability in these crops resulted in higher significant threshold values. Among crop species, maize, with a lower linkage disequilibrium pattern, had higher significant threshold values as compared to soybean and rice. CONCLUSIONS: Our formula was less conservative and identified more true positive associations than the false discovery rate and Bonferroni correction methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5992-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-29 /pmc/articles/PMC6664749/ /pubmed/31357925 http://dx.doi.org/10.1186/s12864-019-5992-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Kaler, Avjinder S.
Purcell, Larry C.
Estimation of a significance threshold for genome-wide association studies
title Estimation of a significance threshold for genome-wide association studies
title_full Estimation of a significance threshold for genome-wide association studies
title_fullStr Estimation of a significance threshold for genome-wide association studies
title_full_unstemmed Estimation of a significance threshold for genome-wide association studies
title_short Estimation of a significance threshold for genome-wide association studies
title_sort estimation of a significance threshold for genome-wide association studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6664749/
https://www.ncbi.nlm.nih.gov/pubmed/31357925
http://dx.doi.org/10.1186/s12864-019-5992-7
work_keys_str_mv AT kaleravjinders estimationofasignificancethresholdforgenomewideassociationstudies
AT purcelllarryc estimationofasignificancethresholdforgenomewideassociationstudies