Cargando…
Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology
A major limitation of gene expression biomarker studies is that they are not reproducible as they simply do not generalize to larger, real-world, heterogeneous populations. Frequentist multi-cohort gene expression meta-analysis has been frequently used as a solution to this problem to identify bioma...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269905/ https://www.ncbi.nlm.nih.gov/pubmed/35759523 http://dx.doi.org/10.1371/journal.pcbi.1010260 |
_version_ | 1784744335384772608 |
---|---|
author | Kalesinskas, Laurynas Gupta, Sanjana Khatri, Purvesh |
author_facet | Kalesinskas, Laurynas Gupta, Sanjana Khatri, Purvesh |
author_sort | Kalesinskas, Laurynas |
collection | PubMed |
description | A major limitation of gene expression biomarker studies is that they are not reproducible as they simply do not generalize to larger, real-world, heterogeneous populations. Frequentist multi-cohort gene expression meta-analysis has been frequently used as a solution to this problem to identify biomarkers that are truly differentially expressed. However, the frequentist meta-analysis framework has its limitations–it needs at least 4–5 datasets with hundreds of samples, is prone to confounding from outliers and relies on multiple-hypothesis corrected p-values. To address these shortcomings, we have created a Bayesian meta-analysis framework for the analysis of gene expression data. Using real-world data from three different diseases, we show that the Bayesian method is more robust to outliers, creates more informative estimates of between-study heterogeneity, reduces the number of false positive and false negative biomarkers and selects more generalizable biomarkers with less data. We have compared the Bayesian framework to a previously published frequentist framework and have developed a publicly available R package for use. |
format | Online Article Text |
id | pubmed-9269905 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-92699052022-07-09 Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology Kalesinskas, Laurynas Gupta, Sanjana Khatri, Purvesh PLoS Comput Biol Research Article A major limitation of gene expression biomarker studies is that they are not reproducible as they simply do not generalize to larger, real-world, heterogeneous populations. Frequentist multi-cohort gene expression meta-analysis has been frequently used as a solution to this problem to identify biomarkers that are truly differentially expressed. However, the frequentist meta-analysis framework has its limitations–it needs at least 4–5 datasets with hundreds of samples, is prone to confounding from outliers and relies on multiple-hypothesis corrected p-values. To address these shortcomings, we have created a Bayesian meta-analysis framework for the analysis of gene expression data. Using real-world data from three different diseases, we show that the Bayesian method is more robust to outliers, creates more informative estimates of between-study heterogeneity, reduces the number of false positive and false negative biomarkers and selects more generalizable biomarkers with less data. We have compared the Bayesian framework to a previously published frequentist framework and have developed a publicly available R package for use. Public Library of Science 2022-06-27 /pmc/articles/PMC9269905/ /pubmed/35759523 http://dx.doi.org/10.1371/journal.pcbi.1010260 Text en © 2022 Kalesinskas et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Kalesinskas, Laurynas Gupta, Sanjana Khatri, Purvesh Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology |
title | Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology |
title_full | Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology |
title_fullStr | Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology |
title_full_unstemmed | Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology |
title_short | Increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using Bayesian methodology |
title_sort | increasing reproducibility, robustness, and generalizability of biomarker selection from meta-analysis using bayesian methodology |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269905/ https://www.ncbi.nlm.nih.gov/pubmed/35759523 http://dx.doi.org/10.1371/journal.pcbi.1010260 |
work_keys_str_mv | AT kalesinskaslaurynas increasingreproducibilityrobustnessandgeneralizabilityofbiomarkerselectionfrommetaanalysisusingbayesianmethodology AT guptasanjana increasingreproducibilityrobustnessandgeneralizabilityofbiomarkerselectionfrommetaanalysisusingbayesianmethodology AT khatripurvesh increasingreproducibilityrobustnessandgeneralizabilityofbiomarkerselectionfrommetaanalysisusingbayesianmethodology |