Cargando…

Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer

BACKGROUND: Quantitative high-throughput data deposited in consortia such as International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) present opportunities and challenges for computational analyses. METHODS: We present a computational strategy to systematically rank and inves...

Descripción completa

Detalles Bibliográficos
Autores principales: Nim, Hieu T., Furtado, Milena B., Ramialison, Mirana, Boyd, Sarah E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5350134/
https://www.ncbi.nlm.nih.gov/pubmed/28361034
http://dx.doi.org/10.3389/fonc.2017.00030
_version_ 1782514597172871168
author Nim, Hieu T.
Furtado, Milena B.
Ramialison, Mirana
Boyd, Sarah E.
author_facet Nim, Hieu T.
Furtado, Milena B.
Ramialison, Mirana
Boyd, Sarah E.
author_sort Nim, Hieu T.
collection PubMed
description BACKGROUND: Quantitative high-throughput data deposited in consortia such as International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) present opportunities and challenges for computational analyses. METHODS: We present a computational strategy to systematically rank and investigate a large number (2(10)–2(20)) of clinically testable gene sets, using combinatorial gene subset generation and disease-free survival (DFS) analyses. This approach integrates protein–protein interaction networks, gene expression, DNA methylation, and copy number data, in association with DFS profiles from patient clinical records. RESULTS: As a case study, we applied this pipeline to systematically analyze the role of ALDH1A2 in prostate cancer (PCa). We have previously found this gene to have multiple roles in disease and homeostasis, and here we investigate the role of the associated ALDH1A2 gene/protein networks in PCa, using our methodology in combination with PCa patient clinical profiles from ICGC and TCGA databases. Relationships between gene signatures and relapse were analyzed using Kaplan–Meier (KM) log-rank analysis and multivariable Cox regression. Relative expression versus pooled mean from diploid population was used for z-statistics calculation. Gene/protein interaction network analyses generated 11 core genes associated with ALDH1A2; combinatorial ranking of the power set of these core genes identified two gene sets (out of 2(11) − 1 = 2,047 combinations) with significant correlation with disease relapse (KM log rank p < 0.05). For the more significant of these two sets, referred to as the optimal gene set (OGS), patients have median survival 62.7 months with OGS alterations compared to >150 months without OGS alterations (p = 0.0248, hazard ratio = 2.213, 95% confidence interval = 1.1–4.098). Two genes comprising OGS (CYP26A1 and RDH10) are strongly associated with ALDH1A2 in the retinoic acid (RA) pathways, suggesting a major role of RA signaling in early PCa progression. Our pipeline complements human expertise in the search for prognostic biomarkers in large-scale datasets.
format Online
Article
Text
id pubmed-5350134
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-53501342017-03-30 Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer Nim, Hieu T. Furtado, Milena B. Ramialison, Mirana Boyd, Sarah E. Front Oncol Oncology BACKGROUND: Quantitative high-throughput data deposited in consortia such as International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) present opportunities and challenges for computational analyses. METHODS: We present a computational strategy to systematically rank and investigate a large number (2(10)–2(20)) of clinically testable gene sets, using combinatorial gene subset generation and disease-free survival (DFS) analyses. This approach integrates protein–protein interaction networks, gene expression, DNA methylation, and copy number data, in association with DFS profiles from patient clinical records. RESULTS: As a case study, we applied this pipeline to systematically analyze the role of ALDH1A2 in prostate cancer (PCa). We have previously found this gene to have multiple roles in disease and homeostasis, and here we investigate the role of the associated ALDH1A2 gene/protein networks in PCa, using our methodology in combination with PCa patient clinical profiles from ICGC and TCGA databases. Relationships between gene signatures and relapse were analyzed using Kaplan–Meier (KM) log-rank analysis and multivariable Cox regression. Relative expression versus pooled mean from diploid population was used for z-statistics calculation. Gene/protein interaction network analyses generated 11 core genes associated with ALDH1A2; combinatorial ranking of the power set of these core genes identified two gene sets (out of 2(11) − 1 = 2,047 combinations) with significant correlation with disease relapse (KM log rank p < 0.05). For the more significant of these two sets, referred to as the optimal gene set (OGS), patients have median survival 62.7 months with OGS alterations compared to >150 months without OGS alterations (p = 0.0248, hazard ratio = 2.213, 95% confidence interval = 1.1–4.098). Two genes comprising OGS (CYP26A1 and RDH10) are strongly associated with ALDH1A2 in the retinoic acid (RA) pathways, suggesting a major role of RA signaling in early PCa progression. Our pipeline complements human expertise in the search for prognostic biomarkers in large-scale datasets. Frontiers Media S.A. 2017-03-15 /pmc/articles/PMC5350134/ /pubmed/28361034 http://dx.doi.org/10.3389/fonc.2017.00030 Text en Copyright © 2017 Nim, Furtado, Ramialison and Boyd. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Nim, Hieu T.
Furtado, Milena B.
Ramialison, Mirana
Boyd, Sarah E.
Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer
title Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer
title_full Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer
title_fullStr Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer
title_full_unstemmed Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer
title_short Combinatorial Ranking of Gene Sets to Predict Disease Relapse: The Retinoic Acid Pathway in Early Prostate Cancer
title_sort combinatorial ranking of gene sets to predict disease relapse: the retinoic acid pathway in early prostate cancer
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5350134/
https://www.ncbi.nlm.nih.gov/pubmed/28361034
http://dx.doi.org/10.3389/fonc.2017.00030
work_keys_str_mv AT nimhieut combinatorialrankingofgenesetstopredictdiseaserelapsetheretinoicacidpathwayinearlyprostatecancer
AT furtadomilenab combinatorialrankingofgenesetstopredictdiseaserelapsetheretinoicacidpathwayinearlyprostatecancer
AT ramialisonmirana combinatorialrankingofgenesetstopredictdiseaserelapsetheretinoicacidpathwayinearlyprostatecancer
AT boydsarahe combinatorialrankingofgenesetstopredictdiseaserelapsetheretinoicacidpathwayinearlyprostatecancer