Cargando…

Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections

BACKGROUND: The promise of modern personalized medicine is to use molecular and clinical information to better diagnose, manage, and treat disease, on an individual patient basis. These functions are predominantly enabled by molecular signatures, which are computational models for predicting phenoty...

Descripción completa

Detalles Bibliográficos
Autores principales: Lytkin, Nikita I., McVoy, Lauren, Weitkamp, Jörn-Hendrik, Aliferis, Constantin F., Statnikov, Alexander
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3105991/
https://www.ncbi.nlm.nih.gov/pubmed/21673802
http://dx.doi.org/10.1371/journal.pone.0020662
_version_ 1782204750141325312
author Lytkin, Nikita I.
McVoy, Lauren
Weitkamp, Jörn-Hendrik
Aliferis, Constantin F.
Statnikov, Alexander
author_facet Lytkin, Nikita I.
McVoy, Lauren
Weitkamp, Jörn-Hendrik
Aliferis, Constantin F.
Statnikov, Alexander
author_sort Lytkin, Nikita I.
collection PubMed
description BACKGROUND: The promise of modern personalized medicine is to use molecular and clinical information to better diagnose, manage, and treat disease, on an individual patient basis. These functions are predominantly enabled by molecular signatures, which are computational models for predicting phenotypes and other responses of interest from high-throughput assay data. Data-analytics is a central component of molecular signature development and can jeopardize the entire process if conducted incorrectly. While exploratory data analysis may tolerate suboptimal protocols, clinical-grade molecular signatures are subject to vastly stricter requirements. Closing the gap between standards for exploratory versus clinically successful molecular signatures entails a thorough understanding of possible biases in the data analysis phase and developing strategies to avoid them. METHODOLOGY AND PRINCIPAL FINDINGS: Using a recently introduced data-analytic protocol as a case study, we provide an in-depth examination of the poorly studied biases of the data-analytic protocols related to signature multiplicity, biomarker redundancy, data preprocessing, and validation of signature reproducibility. The methodology and results presented in this work are aimed at expanding the understanding of these data-analytic biases that affect development of clinically robust molecular signatures. CONCLUSIONS AND SIGNIFICANCE: Several recommendations follow from the current study. First, all molecular signatures of a phenotype should be extracted to the extent possible, in order to provide comprehensive and accurate grounds for understanding disease pathogenesis. Second, redundant genes should generally be removed from final signatures to facilitate reproducibility and decrease manufacturing costs. Third, data preprocessing procedures should be designed so as not to bias biomarker selection. Finally, molecular signatures developed and applied on different phenotypes and populations of patients should be treated with great caution.
format Text
id pubmed-3105991
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31059912011-06-13 Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections Lytkin, Nikita I. McVoy, Lauren Weitkamp, Jörn-Hendrik Aliferis, Constantin F. Statnikov, Alexander PLoS One Research Article BACKGROUND: The promise of modern personalized medicine is to use molecular and clinical information to better diagnose, manage, and treat disease, on an individual patient basis. These functions are predominantly enabled by molecular signatures, which are computational models for predicting phenotypes and other responses of interest from high-throughput assay data. Data-analytics is a central component of molecular signature development and can jeopardize the entire process if conducted incorrectly. While exploratory data analysis may tolerate suboptimal protocols, clinical-grade molecular signatures are subject to vastly stricter requirements. Closing the gap between standards for exploratory versus clinically successful molecular signatures entails a thorough understanding of possible biases in the data analysis phase and developing strategies to avoid them. METHODOLOGY AND PRINCIPAL FINDINGS: Using a recently introduced data-analytic protocol as a case study, we provide an in-depth examination of the poorly studied biases of the data-analytic protocols related to signature multiplicity, biomarker redundancy, data preprocessing, and validation of signature reproducibility. The methodology and results presented in this work are aimed at expanding the understanding of these data-analytic biases that affect development of clinically robust molecular signatures. CONCLUSIONS AND SIGNIFICANCE: Several recommendations follow from the current study. First, all molecular signatures of a phenotype should be extracted to the extent possible, in order to provide comprehensive and accurate grounds for understanding disease pathogenesis. Second, redundant genes should generally be removed from final signatures to facilitate reproducibility and decrease manufacturing costs. Third, data preprocessing procedures should be designed so as not to bias biomarker selection. Finally, molecular signatures developed and applied on different phenotypes and populations of patients should be treated with great caution. Public Library of Science 2011-06-01 /pmc/articles/PMC3105991/ /pubmed/21673802 http://dx.doi.org/10.1371/journal.pone.0020662 Text en Lytkin et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lytkin, Nikita I.
McVoy, Lauren
Weitkamp, Jörn-Hendrik
Aliferis, Constantin F.
Statnikov, Alexander
Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections
title Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections
title_full Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections
title_fullStr Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections
title_full_unstemmed Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections
title_short Expanding the Understanding of Biases in Development of Clinical-Grade Molecular Signatures: A Case Study in Acute Respiratory Viral Infections
title_sort expanding the understanding of biases in development of clinical-grade molecular signatures: a case study in acute respiratory viral infections
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3105991/
https://www.ncbi.nlm.nih.gov/pubmed/21673802
http://dx.doi.org/10.1371/journal.pone.0020662
work_keys_str_mv AT lytkinnikitai expandingtheunderstandingofbiasesindevelopmentofclinicalgrademolecularsignaturesacasestudyinacuterespiratoryviralinfections
AT mcvoylauren expandingtheunderstandingofbiasesindevelopmentofclinicalgrademolecularsignaturesacasestudyinacuterespiratoryviralinfections
AT weitkampjornhendrik expandingtheunderstandingofbiasesindevelopmentofclinicalgrademolecularsignaturesacasestudyinacuterespiratoryviralinfections
AT aliferisconstantinf expandingtheunderstandingofbiasesindevelopmentofclinicalgrademolecularsignaturesacasestudyinacuterespiratoryviralinfections
AT statnikovalexander expandingtheunderstandingofbiasesindevelopmentofclinicalgrademolecularsignaturesacasestudyinacuterespiratoryviralinfections