Cargando…
Random forest-based modelling to detect biomarkers for prostate cancer progression
BACKGROUND: The clinical course of prostate cancer (PCa) is highly variable, demanding an individualized approach to therapy. Overtreatment of indolent PCa cases, which likely do not progress to aggressive stages, may be associated with severe side effects and considerable costs. These could be avoi...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805338/ https://www.ncbi.nlm.nih.gov/pubmed/31640781 http://dx.doi.org/10.1186/s13148-019-0736-8 |
_version_ | 1783461359367749632 |
---|---|
author | Toth, Reka Schiffmann, Heiko Hube-Magg, Claudia Büscheck, Franziska Höflmayer, Doris Weidemann, Sören Lebok, Patrick Fraune, Christoph Minner, Sarah Schlomm, Thorsten Sauter, Guido Plass, Christoph Assenov, Yassen Simon, Ronald Meiners, Jan Gerhäuser, Clarissa |
author_facet | Toth, Reka Schiffmann, Heiko Hube-Magg, Claudia Büscheck, Franziska Höflmayer, Doris Weidemann, Sören Lebok, Patrick Fraune, Christoph Minner, Sarah Schlomm, Thorsten Sauter, Guido Plass, Christoph Assenov, Yassen Simon, Ronald Meiners, Jan Gerhäuser, Clarissa |
author_sort | Toth, Reka |
collection | PubMed |
description | BACKGROUND: The clinical course of prostate cancer (PCa) is highly variable, demanding an individualized approach to therapy. Overtreatment of indolent PCa cases, which likely do not progress to aggressive stages, may be associated with severe side effects and considerable costs. These could be avoided by utilizing robust prognostic markers to guide treatment decisions. RESULTS: We present a random forest-based classification model to predict aggressive behaviour of prostate cancer. DNA methylation changes between PCa cases with good or poor prognosis (discovery cohort with n = 70) were used as input. DNA was extracted from formalin-fixed tumour tissue, and genome-wide DNA methylation differences between both groups were assessed using Illumina HumanMethylation450 arrays. For the random forest-based modelling, the discovery cohort was randomly split into a training (80%) and a test set (20%). Our methylation-based classifier demonstrated excellent performance in discriminating prognosis subgroups in the test set (Kaplan-Meier survival analyses with log-rank p value < 0.0001). The area under the receiver operating characteristic curve (AUC) for the sensitivity analysis was 95%. Using the ICGC cohort of early- and late-onset prostate cancer (n = 222) and the TCGA PRAD cohort (n = 477) for external validation, AUCs for sensitivity analyses were 77.1% and 68.7%, respectively. Cancer progression-related DNA hypomethylation was frequently located in ‘partially methylated domains’ (PMDs)—large-scale genomic areas with progressive loss of DNA methylation linked to mitotic cell division. We selected several candidate genes with differential methylation in gene promoter regions for additional validation at the protein expression level by immunohistochemistry in > 12,000 tissue micro-arrayed PCa cases. Loss of ZIC2 protein expression was associated with poor prognosis and correlated with significantly shorter time to biochemical recurrence. The prognostic value of ZIC2 proved to be independent from established clinicopathological variables including Gleason grade, tumour stage, nodal stage and prostate-specific-antigen. CONCLUSIONS: Our results highlight the prognostic relevance of methylation loss in PMD regions, as well as of several candidate genes not previously associated with PCa progression. Our robust and externally validated PCa classification model either directly or via protein expression analyses of the identified top-ranked candidate genes will support the clinical management of prostate cancer. |
format | Online Article Text |
id | pubmed-6805338 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68053382019-10-24 Random forest-based modelling to detect biomarkers for prostate cancer progression Toth, Reka Schiffmann, Heiko Hube-Magg, Claudia Büscheck, Franziska Höflmayer, Doris Weidemann, Sören Lebok, Patrick Fraune, Christoph Minner, Sarah Schlomm, Thorsten Sauter, Guido Plass, Christoph Assenov, Yassen Simon, Ronald Meiners, Jan Gerhäuser, Clarissa Clin Epigenetics Research BACKGROUND: The clinical course of prostate cancer (PCa) is highly variable, demanding an individualized approach to therapy. Overtreatment of indolent PCa cases, which likely do not progress to aggressive stages, may be associated with severe side effects and considerable costs. These could be avoided by utilizing robust prognostic markers to guide treatment decisions. RESULTS: We present a random forest-based classification model to predict aggressive behaviour of prostate cancer. DNA methylation changes between PCa cases with good or poor prognosis (discovery cohort with n = 70) were used as input. DNA was extracted from formalin-fixed tumour tissue, and genome-wide DNA methylation differences between both groups were assessed using Illumina HumanMethylation450 arrays. For the random forest-based modelling, the discovery cohort was randomly split into a training (80%) and a test set (20%). Our methylation-based classifier demonstrated excellent performance in discriminating prognosis subgroups in the test set (Kaplan-Meier survival analyses with log-rank p value < 0.0001). The area under the receiver operating characteristic curve (AUC) for the sensitivity analysis was 95%. Using the ICGC cohort of early- and late-onset prostate cancer (n = 222) and the TCGA PRAD cohort (n = 477) for external validation, AUCs for sensitivity analyses were 77.1% and 68.7%, respectively. Cancer progression-related DNA hypomethylation was frequently located in ‘partially methylated domains’ (PMDs)—large-scale genomic areas with progressive loss of DNA methylation linked to mitotic cell division. We selected several candidate genes with differential methylation in gene promoter regions for additional validation at the protein expression level by immunohistochemistry in > 12,000 tissue micro-arrayed PCa cases. Loss of ZIC2 protein expression was associated with poor prognosis and correlated with significantly shorter time to biochemical recurrence. The prognostic value of ZIC2 proved to be independent from established clinicopathological variables including Gleason grade, tumour stage, nodal stage and prostate-specific-antigen. CONCLUSIONS: Our results highlight the prognostic relevance of methylation loss in PMD regions, as well as of several candidate genes not previously associated with PCa progression. Our robust and externally validated PCa classification model either directly or via protein expression analyses of the identified top-ranked candidate genes will support the clinical management of prostate cancer. BioMed Central 2019-10-22 /pmc/articles/PMC6805338/ /pubmed/31640781 http://dx.doi.org/10.1186/s13148-019-0736-8 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Toth, Reka Schiffmann, Heiko Hube-Magg, Claudia Büscheck, Franziska Höflmayer, Doris Weidemann, Sören Lebok, Patrick Fraune, Christoph Minner, Sarah Schlomm, Thorsten Sauter, Guido Plass, Christoph Assenov, Yassen Simon, Ronald Meiners, Jan Gerhäuser, Clarissa Random forest-based modelling to detect biomarkers for prostate cancer progression |
title | Random forest-based modelling to detect biomarkers for prostate cancer progression |
title_full | Random forest-based modelling to detect biomarkers for prostate cancer progression |
title_fullStr | Random forest-based modelling to detect biomarkers for prostate cancer progression |
title_full_unstemmed | Random forest-based modelling to detect biomarkers for prostate cancer progression |
title_short | Random forest-based modelling to detect biomarkers for prostate cancer progression |
title_sort | random forest-based modelling to detect biomarkers for prostate cancer progression |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805338/ https://www.ncbi.nlm.nih.gov/pubmed/31640781 http://dx.doi.org/10.1186/s13148-019-0736-8 |
work_keys_str_mv | AT tothreka randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT schiffmannheiko randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT hubemaggclaudia randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT buscheckfranziska randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT hoflmayerdoris randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT weidemannsoren randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT lebokpatrick randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT fraunechristoph randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT minnersarah randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT schlommthorsten randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT sauterguido randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT plasschristoph randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT assenovyassen randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT simonronald randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT meinersjan randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression AT gerhauserclarissa randomforestbasedmodellingtodetectbiomarkersforprostatecancerprogression |