Cargando…
Multiplatform biomarker identification using a data-driven approach enables single-sample classification
BACKGROUND: High-throughput gene expression profiles have allowed discovery of potential biomarkers enabling early diagnosis, prognosis and developing individualized treatment. However, it remains a challenge to identify a set of reliable and reproducible biomarkers across various gene expression pl...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6868758/ https://www.ncbi.nlm.nih.gov/pubmed/31752658 http://dx.doi.org/10.1186/s12859-019-3140-7 |
_version_ | 1783472337437327360 |
---|---|
author | Zhang, Ling Thapa, Ishwor Haas, Christian Bastola, Dhundy |
author_facet | Zhang, Ling Thapa, Ishwor Haas, Christian Bastola, Dhundy |
author_sort | Zhang, Ling |
collection | PubMed |
description | BACKGROUND: High-throughput gene expression profiles have allowed discovery of potential biomarkers enabling early diagnosis, prognosis and developing individualized treatment. However, it remains a challenge to identify a set of reliable and reproducible biomarkers across various gene expression platforms and laboratories for single sample diagnosis and prognosis. We address this need with our Data-Driven Reference (DDR) approach, which employs stably expressed housekeeping genes as references to eliminate platform-specific biases and non-biological variabilities. RESULTS: Our method identifies biomarkers with “built-in” features, and these features can be interpreted consistently regardless of profiling technology, which enable classification of single-sample independent of platforms. Validation with RNA-seq data of blood platelets shows that DDR achieves the superior performance in classification of six different tumor types as well as molecular target statuses (such as MET or HER2-positive, and mutant KRAS, EGFR or PIK3CA) with smaller sets of biomarkers. We demonstrate on the three microarray datasets that our method is capable of identifying robust biomarkers for subgrouping medulloblastoma samples with data perturbation due to different microarray platforms. In addition to identifying the majority of subgroup-specific biomarkers in CodeSet of nanoString, some potential new biomarkers for subgrouping medulloblastoma were detected by our method. CONCLUSIONS: In this study, we present a simple, yet powerful data-driven method which contributes significantly to identification of robust cross-platform gene signature for disease classification of single-patient to facilitate precision medicine. In addition, our method provides a new strategy for transcriptome analysis. |
format | Online Article Text |
id | pubmed-6868758 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68687582019-12-12 Multiplatform biomarker identification using a data-driven approach enables single-sample classification Zhang, Ling Thapa, Ishwor Haas, Christian Bastola, Dhundy BMC Bioinformatics Methodology Article BACKGROUND: High-throughput gene expression profiles have allowed discovery of potential biomarkers enabling early diagnosis, prognosis and developing individualized treatment. However, it remains a challenge to identify a set of reliable and reproducible biomarkers across various gene expression platforms and laboratories for single sample diagnosis and prognosis. We address this need with our Data-Driven Reference (DDR) approach, which employs stably expressed housekeeping genes as references to eliminate platform-specific biases and non-biological variabilities. RESULTS: Our method identifies biomarkers with “built-in” features, and these features can be interpreted consistently regardless of profiling technology, which enable classification of single-sample independent of platforms. Validation with RNA-seq data of blood platelets shows that DDR achieves the superior performance in classification of six different tumor types as well as molecular target statuses (such as MET or HER2-positive, and mutant KRAS, EGFR or PIK3CA) with smaller sets of biomarkers. We demonstrate on the three microarray datasets that our method is capable of identifying robust biomarkers for subgrouping medulloblastoma samples with data perturbation due to different microarray platforms. In addition to identifying the majority of subgroup-specific biomarkers in CodeSet of nanoString, some potential new biomarkers for subgrouping medulloblastoma were detected by our method. CONCLUSIONS: In this study, we present a simple, yet powerful data-driven method which contributes significantly to identification of robust cross-platform gene signature for disease classification of single-patient to facilitate precision medicine. In addition, our method provides a new strategy for transcriptome analysis. BioMed Central 2019-11-21 /pmc/articles/PMC6868758/ /pubmed/31752658 http://dx.doi.org/10.1186/s12859-019-3140-7 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Zhang, Ling Thapa, Ishwor Haas, Christian Bastola, Dhundy Multiplatform biomarker identification using a data-driven approach enables single-sample classification |
title | Multiplatform biomarker identification using a data-driven approach enables single-sample classification |
title_full | Multiplatform biomarker identification using a data-driven approach enables single-sample classification |
title_fullStr | Multiplatform biomarker identification using a data-driven approach enables single-sample classification |
title_full_unstemmed | Multiplatform biomarker identification using a data-driven approach enables single-sample classification |
title_short | Multiplatform biomarker identification using a data-driven approach enables single-sample classification |
title_sort | multiplatform biomarker identification using a data-driven approach enables single-sample classification |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6868758/ https://www.ncbi.nlm.nih.gov/pubmed/31752658 http://dx.doi.org/10.1186/s12859-019-3140-7 |
work_keys_str_mv | AT zhangling multiplatformbiomarkeridentificationusingadatadrivenapproachenablessinglesampleclassification AT thapaishwor multiplatformbiomarkeridentificationusingadatadrivenapproachenablessinglesampleclassification AT haaschristian multiplatformbiomarkeridentificationusingadatadrivenapproachenablessinglesampleclassification AT bastoladhundy multiplatformbiomarkeridentificationusingadatadrivenapproachenablessinglesampleclassification |