Cargando…

Robust meta-analysis for large-scale genomic experiments based on an empirical approach

BACKGROUND: Recent high-throughput technologies have opened avenues for simultaneous analyses of thousands of genes. With the availability of a multitude of public databases, one can easily access multiple genomic study results where each study comprises of significance testing results of thousands...

Descripción completa

Detalles Bibliográficos
Autor principal: Sikdar, Sinjini
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8832678/
https://www.ncbi.nlm.nih.gov/pubmed/35144554
http://dx.doi.org/10.1186/s12874-022-01530-y
_version_ 1784648773473927168
author Sikdar, Sinjini
author_facet Sikdar, Sinjini
author_sort Sikdar, Sinjini
collection PubMed
description BACKGROUND: Recent high-throughput technologies have opened avenues for simultaneous analyses of thousands of genes. With the availability of a multitude of public databases, one can easily access multiple genomic study results where each study comprises of significance testing results of thousands of genes. Researchers currently tend to combine this genomic information from these multiple studies in the form of a meta-analysis. As the number of genes involved is very large, the classical meta-analysis approaches need to be updated to acknowledge this large-scale aspect of the data. METHODS: In this article, we discuss how application of standard theoretical null distributional assumptions of the classical meta-analysis methods, such as Fisher’s p-value combination and Stouffer’s Z, can lead to incorrect significant testing results, and we propose a robust meta-analysis method that empirically modifies the individual test statistics and p-values before combining them. RESULTS: Our proposed meta-analysis method performs best in significance testing among several meta-analysis approaches, especially in presence of hidden confounders, as shown through a wide variety of simulation studies and real genomic data analysis. CONCLUSION: The proposed meta-analysis method produces superior meta-analysis results compared to the standard p-value combination approaches for large-scale simultaneous testing in genomic experiments. This is particularly useful in studies with large number of genes where the standard meta-analysis approaches can result in gross false discoveries due to the presence of unobserved confounding variables. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01530-y.
format Online
Article
Text
id pubmed-8832678
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88326782022-02-11 Robust meta-analysis for large-scale genomic experiments based on an empirical approach Sikdar, Sinjini BMC Med Res Methodol Research BACKGROUND: Recent high-throughput technologies have opened avenues for simultaneous analyses of thousands of genes. With the availability of a multitude of public databases, one can easily access multiple genomic study results where each study comprises of significance testing results of thousands of genes. Researchers currently tend to combine this genomic information from these multiple studies in the form of a meta-analysis. As the number of genes involved is very large, the classical meta-analysis approaches need to be updated to acknowledge this large-scale aspect of the data. METHODS: In this article, we discuss how application of standard theoretical null distributional assumptions of the classical meta-analysis methods, such as Fisher’s p-value combination and Stouffer’s Z, can lead to incorrect significant testing results, and we propose a robust meta-analysis method that empirically modifies the individual test statistics and p-values before combining them. RESULTS: Our proposed meta-analysis method performs best in significance testing among several meta-analysis approaches, especially in presence of hidden confounders, as shown through a wide variety of simulation studies and real genomic data analysis. CONCLUSION: The proposed meta-analysis method produces superior meta-analysis results compared to the standard p-value combination approaches for large-scale simultaneous testing in genomic experiments. This is particularly useful in studies with large number of genes where the standard meta-analysis approaches can result in gross false discoveries due to the presence of unobserved confounding variables. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01530-y. BioMed Central 2022-02-10 /pmc/articles/PMC8832678/ /pubmed/35144554 http://dx.doi.org/10.1186/s12874-022-01530-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Sikdar, Sinjini
Robust meta-analysis for large-scale genomic experiments based on an empirical approach
title Robust meta-analysis for large-scale genomic experiments based on an empirical approach
title_full Robust meta-analysis for large-scale genomic experiments based on an empirical approach
title_fullStr Robust meta-analysis for large-scale genomic experiments based on an empirical approach
title_full_unstemmed Robust meta-analysis for large-scale genomic experiments based on an empirical approach
title_short Robust meta-analysis for large-scale genomic experiments based on an empirical approach
title_sort robust meta-analysis for large-scale genomic experiments based on an empirical approach
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8832678/
https://www.ncbi.nlm.nih.gov/pubmed/35144554
http://dx.doi.org/10.1186/s12874-022-01530-y
work_keys_str_mv AT sikdarsinjini robustmetaanalysisforlargescalegenomicexperimentsbasedonanempiricalapproach