Cargando…

A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies

BACKGROUND: Personalized cancer treatments depend on the determination of a patient's genetic status according to known genetic profiles for which targeted treatments exist. Such genetic profiles must be scientifically validated before they is applied to general patient population. Reproducibil...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Robin, Lin, Xiao, Geng, Haijiang, Li, Zhihui, Li, Jiabing, Lu, Tao, Yan, Fangrong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Impact Journals LLC 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792587/
https://www.ncbi.nlm.nih.gov/pubmed/26556852
_version_ 1782421269646409728
author Li, Robin
Lin, Xiao
Geng, Haijiang
Li, Zhihui
Li, Jiabing
Lu, Tao
Yan, Fangrong
author_facet Li, Robin
Lin, Xiao
Geng, Haijiang
Li, Zhihui
Li, Jiabing
Lu, Tao
Yan, Fangrong
author_sort Li, Robin
collection PubMed
description BACKGROUND: Personalized cancer treatments depend on the determination of a patient's genetic status according to known genetic profiles for which targeted treatments exist. Such genetic profiles must be scientifically validated before they is applied to general patient population. Reproducibility of findings that support such genetic profiles is a fundamental challenge in validation studies. The percentage of overlapping genes (POG) criterion and derivative methods produce unstable and misleading results. Furthermore, in a complex disease, comparisons between different tumor subtypes can produce high POG scores that do not capture the consistencies in the functions. RESULTS: We focused on the quality rather than the quantity of the overlapping genes. We defined the rank value of each gene according to importance or quality by PageRank on basis of a particular topological structure. Then, we used the p-value of the rank-sum of the overlapping genes (PRSOG) to evaluate the quality of reproducibility. Though the POG scores were low in different studies of the same disease, the PRSOG was statistically significant, which suggests that sets of differentially expressed genes might be highly reproducible. CONCLUSIONS: Evaluations of eight datasets from breast cancer, lung cancer and four other disorders indicate that quality-based PRSOG method performs better than a quantity-based method. Our analysis of the components of the sets of overlapping genes supports the utility of the PRSOG method.
format Online
Article
Text
id pubmed-4792587
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Impact Journals LLC
record_format MEDLINE/PubMed
spelling pubmed-47925872016-03-29 A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies Li, Robin Lin, Xiao Geng, Haijiang Li, Zhihui Li, Jiabing Lu, Tao Yan, Fangrong Oncotarget Research Paper BACKGROUND: Personalized cancer treatments depend on the determination of a patient's genetic status according to known genetic profiles for which targeted treatments exist. Such genetic profiles must be scientifically validated before they is applied to general patient population. Reproducibility of findings that support such genetic profiles is a fundamental challenge in validation studies. The percentage of overlapping genes (POG) criterion and derivative methods produce unstable and misleading results. Furthermore, in a complex disease, comparisons between different tumor subtypes can produce high POG scores that do not capture the consistencies in the functions. RESULTS: We focused on the quality rather than the quantity of the overlapping genes. We defined the rank value of each gene according to importance or quality by PageRank on basis of a particular topological structure. Then, we used the p-value of the rank-sum of the overlapping genes (PRSOG) to evaluate the quality of reproducibility. Though the POG scores were low in different studies of the same disease, the PRSOG was statistically significant, which suggests that sets of differentially expressed genes might be highly reproducible. CONCLUSIONS: Evaluations of eight datasets from breast cancer, lung cancer and four other disorders indicate that quality-based PRSOG method performs better than a quantity-based method. Our analysis of the components of the sets of overlapping genes supports the utility of the PRSOG method. Impact Journals LLC 2015-11-09 /pmc/articles/PMC4792587/ /pubmed/26556852 Text en Copyright: © 2015 Li et al. http://creativecommons.org/licenses/by/2.5/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Li, Robin
Lin, Xiao
Geng, Haijiang
Li, Zhihui
Li, Jiabing
Lu, Tao
Yan, Fangrong
A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies
title A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies
title_full A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies
title_fullStr A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies
title_full_unstemmed A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies
title_short A network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies
title_sort network-based method to evaluate quality of reproducibility of differential expression in cancer genomics studies
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4792587/
https://www.ncbi.nlm.nih.gov/pubmed/26556852
work_keys_str_mv AT lirobin anetworkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT linxiao anetworkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT genghaijiang anetworkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT lizhihui anetworkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT lijiabing anetworkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT lutao anetworkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT yanfangrong anetworkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT lirobin networkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT linxiao networkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT genghaijiang networkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT lizhihui networkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT lijiabing networkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT lutao networkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies
AT yanfangrong networkbasedmethodtoevaluatequalityofreproducibilityofdifferentialexpressionincancergenomicsstudies