Cargando…
Big data and computational biology strategy for personalized prognosis
The era of big data and precision medicine has led to accumulation of massive datasets of gene expression data and clinical information of patients. For a new patient, we propose that identification of a highly similar reference patient from an existing patient database via similarity matching of bo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Impact Journals LLC
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5130003/ https://www.ncbi.nlm.nih.gov/pubmed/27229533 http://dx.doi.org/10.18632/oncotarget.9571 |
_version_ | 1782470664429502464 |
---|---|
author | Ow, Ghim Siong Tang, Zhiqun Kuznetsov, Vladimir A. |
author_facet | Ow, Ghim Siong Tang, Zhiqun Kuznetsov, Vladimir A. |
author_sort | Ow, Ghim Siong |
collection | PubMed |
description | The era of big data and precision medicine has led to accumulation of massive datasets of gene expression data and clinical information of patients. For a new patient, we propose that identification of a highly similar reference patient from an existing patient database via similarity matching of both clinical and expression data could be useful for predicting the prognostic risk or therapeutic efficacy. Here, we propose a novel methodology to predict disease/treatment outcome via analysis of the similarity between any pair of patients who are each characterized by a certain set of pre-defined biological variables (biomarkers or clinical features) represented initially as a prognostic binary variable vector (PBVV) and subsequently transformed to a prognostic signature vector (PSV). Our analyses revealed that Euclidean distance rather correlation distance measure was effective in defining an unbiased similarity measure calculated between two PSVs. We implemented our methods to high-grade serous ovarian cancer (HGSC) based on a 36-mRNA predictor that was previously shown to stratify patients into 3 distinct prognostic subgroups. We studied and revealed that patient's age, when converted into binary variable, was positively correlated with the overall risk of succumbing to the disease. When applied to an independent testing dataset, the inclusion of age into the molecular predictor provided more robust personalized prognosis of overall survival correlated with the therapeutic response of HGSC and provided benefit for treatment targeting of the tumors in HGSC patients. Finally, our method can be generalized and implemented in many other diseases to accurately predict personalized patients’ outcomes. |
format | Online Article Text |
id | pubmed-5130003 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Impact Journals LLC |
record_format | MEDLINE/PubMed |
spelling | pubmed-51300032016-12-11 Big data and computational biology strategy for personalized prognosis Ow, Ghim Siong Tang, Zhiqun Kuznetsov, Vladimir A. Oncotarget Research Paper The era of big data and precision medicine has led to accumulation of massive datasets of gene expression data and clinical information of patients. For a new patient, we propose that identification of a highly similar reference patient from an existing patient database via similarity matching of both clinical and expression data could be useful for predicting the prognostic risk or therapeutic efficacy. Here, we propose a novel methodology to predict disease/treatment outcome via analysis of the similarity between any pair of patients who are each characterized by a certain set of pre-defined biological variables (biomarkers or clinical features) represented initially as a prognostic binary variable vector (PBVV) and subsequently transformed to a prognostic signature vector (PSV). Our analyses revealed that Euclidean distance rather correlation distance measure was effective in defining an unbiased similarity measure calculated between two PSVs. We implemented our methods to high-grade serous ovarian cancer (HGSC) based on a 36-mRNA predictor that was previously shown to stratify patients into 3 distinct prognostic subgroups. We studied and revealed that patient's age, when converted into binary variable, was positively correlated with the overall risk of succumbing to the disease. When applied to an independent testing dataset, the inclusion of age into the molecular predictor provided more robust personalized prognosis of overall survival correlated with the therapeutic response of HGSC and provided benefit for treatment targeting of the tumors in HGSC patients. Finally, our method can be generalized and implemented in many other diseases to accurately predict personalized patients’ outcomes. Impact Journals LLC 2016-05-24 /pmc/articles/PMC5130003/ /pubmed/27229533 http://dx.doi.org/10.18632/oncotarget.9571 Text en Copyright: © 2016 Ow et al. http://creativecommons.org/licenses/by/2.5/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Paper Ow, Ghim Siong Tang, Zhiqun Kuznetsov, Vladimir A. Big data and computational biology strategy for personalized prognosis |
title | Big data and computational biology strategy for personalized prognosis |
title_full | Big data and computational biology strategy for personalized prognosis |
title_fullStr | Big data and computational biology strategy for personalized prognosis |
title_full_unstemmed | Big data and computational biology strategy for personalized prognosis |
title_short | Big data and computational biology strategy for personalized prognosis |
title_sort | big data and computational biology strategy for personalized prognosis |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5130003/ https://www.ncbi.nlm.nih.gov/pubmed/27229533 http://dx.doi.org/10.18632/oncotarget.9571 |
work_keys_str_mv | AT owghimsiong bigdataandcomputationalbiologystrategyforpersonalizedprognosis AT tangzhiqun bigdataandcomputationalbiologystrategyforpersonalizedprognosis AT kuznetsovvladimira bigdataandcomputationalbiologystrategyforpersonalizedprognosis |