Cargando…

Analysis and Computational Dissection of Molecular Signature Multiplicity

Molecular signatures are computational or mathematical models created to diagnose disease and other phenotypes and to predict clinical outcomes and response to treatment. It is widely recognized that molecular signatures constitute one of the most important translational and basic science developmen...

Descripción completa

Detalles Bibliográficos
Autores principales: Statnikov, Alexander, Aliferis, Constantin F.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2873900/
https://www.ncbi.nlm.nih.gov/pubmed/20502670
http://dx.doi.org/10.1371/journal.pcbi.1000790
_version_ 1782181408340443136
author Statnikov, Alexander
Aliferis, Constantin F.
author_facet Statnikov, Alexander
Aliferis, Constantin F.
author_sort Statnikov, Alexander
collection PubMed
description Molecular signatures are computational or mathematical models created to diagnose disease and other phenotypes and to predict clinical outcomes and response to treatment. It is widely recognized that molecular signatures constitute one of the most important translational and basic science developments enabled by recent high-throughput molecular assays. A perplexing phenomenon that characterizes high-throughput data analysis is the ubiquitous multiplicity of molecular signatures. Multiplicity is a special form of data analysis instability in which different analysis methods used on the same data, or different samples from the same population lead to different but apparently maximally predictive signatures. This phenomenon has far-reaching implications for biological discovery and development of next generation patient diagnostics and personalized treatments. Currently the causes and interpretation of signature multiplicity are unknown, and several, often contradictory, conjectures have been made to explain it. We present a formal characterization of signature multiplicity and a new efficient algorithm that offers theoretical guarantees for extracting the set of maximally predictive and non-redundant signatures independent of distribution. The new algorithm identifies exactly the set of optimal signatures in controlled experiments and yields signatures with significantly better predictivity and reproducibility than previous algorithms in human microarray gene expression datasets. Our results shed light on the causes of signature multiplicity, provide computational tools for studying it empirically and introduce a framework for in silico bioequivalence of this important new class of diagnostic and personalized medicine modalities.
format Text
id pubmed-2873900
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28739002010-05-25 Analysis and Computational Dissection of Molecular Signature Multiplicity Statnikov, Alexander Aliferis, Constantin F. PLoS Comput Biol Research Article Molecular signatures are computational or mathematical models created to diagnose disease and other phenotypes and to predict clinical outcomes and response to treatment. It is widely recognized that molecular signatures constitute one of the most important translational and basic science developments enabled by recent high-throughput molecular assays. A perplexing phenomenon that characterizes high-throughput data analysis is the ubiquitous multiplicity of molecular signatures. Multiplicity is a special form of data analysis instability in which different analysis methods used on the same data, or different samples from the same population lead to different but apparently maximally predictive signatures. This phenomenon has far-reaching implications for biological discovery and development of next generation patient diagnostics and personalized treatments. Currently the causes and interpretation of signature multiplicity are unknown, and several, often contradictory, conjectures have been made to explain it. We present a formal characterization of signature multiplicity and a new efficient algorithm that offers theoretical guarantees for extracting the set of maximally predictive and non-redundant signatures independent of distribution. The new algorithm identifies exactly the set of optimal signatures in controlled experiments and yields signatures with significantly better predictivity and reproducibility than previous algorithms in human microarray gene expression datasets. Our results shed light on the causes of signature multiplicity, provide computational tools for studying it empirically and introduce a framework for in silico bioequivalence of this important new class of diagnostic and personalized medicine modalities. Public Library of Science 2010-05-20 /pmc/articles/PMC2873900/ /pubmed/20502670 http://dx.doi.org/10.1371/journal.pcbi.1000790 Text en Statnikov, Aliferis. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Statnikov, Alexander
Aliferis, Constantin F.
Analysis and Computational Dissection of Molecular Signature Multiplicity
title Analysis and Computational Dissection of Molecular Signature Multiplicity
title_full Analysis and Computational Dissection of Molecular Signature Multiplicity
title_fullStr Analysis and Computational Dissection of Molecular Signature Multiplicity
title_full_unstemmed Analysis and Computational Dissection of Molecular Signature Multiplicity
title_short Analysis and Computational Dissection of Molecular Signature Multiplicity
title_sort analysis and computational dissection of molecular signature multiplicity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2873900/
https://www.ncbi.nlm.nih.gov/pubmed/20502670
http://dx.doi.org/10.1371/journal.pcbi.1000790
work_keys_str_mv AT statnikovalexander analysisandcomputationaldissectionofmolecularsignaturemultiplicity
AT aliferisconstantinf analysisandcomputationaldissectionofmolecularsignaturemultiplicity