Cargando…
Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and fi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716055/ https://www.ncbi.nlm.nih.gov/pubmed/34965262 http://dx.doi.org/10.1371/journal.pone.0261625 |
_version_ | 1784624241751097344 |
---|---|
author | Mohammed, Mohanad Mboya, Innocent B. Mwambi, Henry Elbashir, Murtada K. Omolo, Bernard |
author_facet | Mohammed, Mohanad Mboya, Innocent B. Mwambi, Henry Elbashir, Murtada K. Omolo, Bernard |
author_sort | Mohammed, Mohanad |
collection | PubMed |
description | Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and find a robust signature for predicting CRC overall survival. We used stepwise regression to develop Cox PH model to analyse 54 common differentially expressed genes from three mutations. RSF is applied using log-rank and log-rank-score based on 5000 survival trees, and therefore, variables important obtained to find the genes that are most influential for CRC survival. We compared the predictive performance of the Cox PH model and RSF for early CRC detection and diagnosis. The results indicate that SLC9A8, IER5, ARSJ, ANKRD27, and PIPOX genes were significantly associated with the CRC overall survival. In addition, age, sex, and stages are also affecting the CRC overall survival. The RSF model using log-rank is better than log-rank-score, while log-rank-score needed more trees to stabilize. Overall, the imputation of missing values enhanced the model’s predictive performance. In addition, Cox PH predictive performance was better than RSF. |
format | Online Article Text |
id | pubmed-8716055 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-87160552021-12-30 Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data Mohammed, Mohanad Mboya, Innocent B. Mwambi, Henry Elbashir, Murtada K. Omolo, Bernard PLoS One Research Article Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and find a robust signature for predicting CRC overall survival. We used stepwise regression to develop Cox PH model to analyse 54 common differentially expressed genes from three mutations. RSF is applied using log-rank and log-rank-score based on 5000 survival trees, and therefore, variables important obtained to find the genes that are most influential for CRC survival. We compared the predictive performance of the Cox PH model and RSF for early CRC detection and diagnosis. The results indicate that SLC9A8, IER5, ARSJ, ANKRD27, and PIPOX genes were significantly associated with the CRC overall survival. In addition, age, sex, and stages are also affecting the CRC overall survival. The RSF model using log-rank is better than log-rank-score, while log-rank-score needed more trees to stabilize. Overall, the imputation of missing values enhanced the model’s predictive performance. In addition, Cox PH predictive performance was better than RSF. Public Library of Science 2021-12-29 /pmc/articles/PMC8716055/ /pubmed/34965262 http://dx.doi.org/10.1371/journal.pone.0261625 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Mohammed, Mohanad Mboya, Innocent B. Mwambi, Henry Elbashir, Murtada K. Omolo, Bernard Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data |
title | Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data |
title_full | Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data |
title_fullStr | Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data |
title_full_unstemmed | Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data |
title_short | Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data |
title_sort | predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716055/ https://www.ncbi.nlm.nih.gov/pubmed/34965262 http://dx.doi.org/10.1371/journal.pone.0261625 |
work_keys_str_mv | AT mohammedmohanad predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata AT mboyainnocentb predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata AT mwambihenry predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata AT elbashirmurtadak predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata AT omolobernard predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata |