Cargando…

Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data

Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and fi...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammed, Mohanad, Mboya, Innocent B., Mwambi, Henry, Elbashir, Murtada K., Omolo, Bernard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716055/
https://www.ncbi.nlm.nih.gov/pubmed/34965262
http://dx.doi.org/10.1371/journal.pone.0261625
_version_ 1784624241751097344
author Mohammed, Mohanad
Mboya, Innocent B.
Mwambi, Henry
Elbashir, Murtada K.
Omolo, Bernard
author_facet Mohammed, Mohanad
Mboya, Innocent B.
Mwambi, Henry
Elbashir, Murtada K.
Omolo, Bernard
author_sort Mohammed, Mohanad
collection PubMed
description Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and find a robust signature for predicting CRC overall survival. We used stepwise regression to develop Cox PH model to analyse 54 common differentially expressed genes from three mutations. RSF is applied using log-rank and log-rank-score based on 5000 survival trees, and therefore, variables important obtained to find the genes that are most influential for CRC survival. We compared the predictive performance of the Cox PH model and RSF for early CRC detection and diagnosis. The results indicate that SLC9A8, IER5, ARSJ, ANKRD27, and PIPOX genes were significantly associated with the CRC overall survival. In addition, age, sex, and stages are also affecting the CRC overall survival. The RSF model using log-rank is better than log-rank-score, while log-rank-score needed more trees to stabilize. Overall, the imputation of missing values enhanced the model’s predictive performance. In addition, Cox PH predictive performance was better than RSF.
format Online
Article
Text
id pubmed-8716055
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87160552021-12-30 Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data Mohammed, Mohanad Mboya, Innocent B. Mwambi, Henry Elbashir, Murtada K. Omolo, Bernard PLoS One Research Article Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and find a robust signature for predicting CRC overall survival. We used stepwise regression to develop Cox PH model to analyse 54 common differentially expressed genes from three mutations. RSF is applied using log-rank and log-rank-score based on 5000 survival trees, and therefore, variables important obtained to find the genes that are most influential for CRC survival. We compared the predictive performance of the Cox PH model and RSF for early CRC detection and diagnosis. The results indicate that SLC9A8, IER5, ARSJ, ANKRD27, and PIPOX genes were significantly associated with the CRC overall survival. In addition, age, sex, and stages are also affecting the CRC overall survival. The RSF model using log-rank is better than log-rank-score, while log-rank-score needed more trees to stabilize. Overall, the imputation of missing values enhanced the model’s predictive performance. In addition, Cox PH predictive performance was better than RSF. Public Library of Science 2021-12-29 /pmc/articles/PMC8716055/ /pubmed/34965262 http://dx.doi.org/10.1371/journal.pone.0261625 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Mohammed, Mohanad
Mboya, Innocent B.
Mwambi, Henry
Elbashir, Murtada K.
Omolo, Bernard
Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
title Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
title_full Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
title_fullStr Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
title_full_unstemmed Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
title_short Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
title_sort predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716055/
https://www.ncbi.nlm.nih.gov/pubmed/34965262
http://dx.doi.org/10.1371/journal.pone.0261625
work_keys_str_mv AT mohammedmohanad predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata
AT mboyainnocentb predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata
AT mwambihenry predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata
AT elbashirmurtadak predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata
AT omolobernard predictorsofcolorectalcancersurvivalusingcoxregressionandrandomsurvivalforestsmodelsbasedongeneexpressiondata