Cargando…

Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets

Genome-wide expression profiling has revolutionized biomedical research; vast amounts of expression data from numerous studies of many diseases are now available. Making the best use of this resource in order to better understand disease processes and treatment remains an open challenge. In particul...

Descripción completa

Detalles Bibliográficos
Autores principales: Amar, David, Hait, Tom, Izraeli, Shai, Shamir, Ron
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4652780/
https://www.ncbi.nlm.nih.gov/pubmed/26261215
http://dx.doi.org/10.1093/nar/gkv810
_version_ 1782401819588165632
author Amar, David
Hait, Tom
Izraeli, Shai
Shamir, Ron
author_facet Amar, David
Hait, Tom
Izraeli, Shai
Shamir, Ron
author_sort Amar, David
collection PubMed
description Genome-wide expression profiling has revolutionized biomedical research; vast amounts of expression data from numerous studies of many diseases are now available. Making the best use of this resource in order to better understand disease processes and treatment remains an open challenge. In particular, disease biomarkers detected in case–control studies suffer from low reliability and are only weakly reproducible. Here, we present a systematic integrative analysis methodology to overcome these shortcomings. We assembled and manually curated more than 14 000 expression profiles spanning 48 diseases and 18 expression platforms. We show that when studying a particular disease, judicious utilization of profiles from other diseases and information on disease hierarchy improves classification quality, avoids overoptimistic evaluation of that quality, and enhances disease-specific biomarker discovery. This approach yielded specific biomarkers for 24 of the analyzed diseases. We demonstrate how to combine these biomarkers with large-scale interaction, mutation and drug target data, forming a highly valuable disease summary that suggests novel directions in disease understanding and drug repurposing. Our analysis also estimates the number of samples required to reach a desired level of biomarker stability. This methodology can greatly improve the exploitation of the mountain of expression profiles for better disease analysis.
format Online
Article
Text
id pubmed-4652780
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46527802015-11-25 Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets Amar, David Hait, Tom Izraeli, Shai Shamir, Ron Nucleic Acids Res Computational Biology Genome-wide expression profiling has revolutionized biomedical research; vast amounts of expression data from numerous studies of many diseases are now available. Making the best use of this resource in order to better understand disease processes and treatment remains an open challenge. In particular, disease biomarkers detected in case–control studies suffer from low reliability and are only weakly reproducible. Here, we present a systematic integrative analysis methodology to overcome these shortcomings. We assembled and manually curated more than 14 000 expression profiles spanning 48 diseases and 18 expression platforms. We show that when studying a particular disease, judicious utilization of profiles from other diseases and information on disease hierarchy improves classification quality, avoids overoptimistic evaluation of that quality, and enhances disease-specific biomarker discovery. This approach yielded specific biomarkers for 24 of the analyzed diseases. We demonstrate how to combine these biomarkers with large-scale interaction, mutation and drug target data, forming a highly valuable disease summary that suggests novel directions in disease understanding and drug repurposing. Our analysis also estimates the number of samples required to reach a desired level of biomarker stability. This methodology can greatly improve the exploitation of the mountain of expression profiles for better disease analysis. Oxford University Press 2015-09-18 2015-08-10 /pmc/articles/PMC4652780/ /pubmed/26261215 http://dx.doi.org/10.1093/nar/gkv810 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Amar, David
Hait, Tom
Izraeli, Shai
Shamir, Ron
Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets
title Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets
title_full Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets
title_fullStr Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets
title_full_unstemmed Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets
title_short Integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets
title_sort integrated analysis of numerous heterogeneous gene expression profiles for detecting robust disease-specific biomarkers and proposing drug targets
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4652780/
https://www.ncbi.nlm.nih.gov/pubmed/26261215
http://dx.doi.org/10.1093/nar/gkv810
work_keys_str_mv AT amardavid integratedanalysisofnumerousheterogeneousgeneexpressionprofilesfordetectingrobustdiseasespecificbiomarkersandproposingdrugtargets
AT haittom integratedanalysisofnumerousheterogeneousgeneexpressionprofilesfordetectingrobustdiseasespecificbiomarkersandproposingdrugtargets
AT izraelishai integratedanalysisofnumerousheterogeneousgeneexpressionprofilesfordetectingrobustdiseasespecificbiomarkersandproposingdrugtargets
AT shamirron integratedanalysisofnumerousheterogeneousgeneexpressionprofilesfordetectingrobustdiseasespecificbiomarkersandproposingdrugtargets