Cargando…

Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice

BACKGROUND: Colon cancer is one of the leading causes of cancer deaths in the USA and around the world. Molecular level characters, such as gene expression levels and mutations, may provide profound information for precision treatment apart from pathological indicators. Transcription factors functio...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jiannan, Dong, Chuanpeng, Jiang, Guanglong, Lu, Xiaoyu, Liu, Yunlong, Wu, Huanmei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7504662/
https://www.ncbi.nlm.nih.gov/pubmed/32957968
http://dx.doi.org/10.1186/s12920-020-00775-0
_version_ 1783584675425419264
author Liu, Jiannan
Dong, Chuanpeng
Jiang, Guanglong
Lu, Xiaoyu
Liu, Yunlong
Wu, Huanmei
author_facet Liu, Jiannan
Dong, Chuanpeng
Jiang, Guanglong
Lu, Xiaoyu
Liu, Yunlong
Wu, Huanmei
author_sort Liu, Jiannan
collection PubMed
description BACKGROUND: Colon cancer is one of the leading causes of cancer deaths in the USA and around the world. Molecular level characters, such as gene expression levels and mutations, may provide profound information for precision treatment apart from pathological indicators. Transcription factors function as critical regulators in all aspects of cell life, but transcription factors-based biomarkers for colon cancer prognosis were still rare and necessary. METHODS: We implemented an innovative process to select the transcription factors variables and evaluate the prognostic prediction power by combining the Cox PH model with the random forest algorithm. We picked five top-ranked transcription factors and built a prediction model by using Cox PH regression. Using Kaplan-Meier analysis, we validated our predictive model on four independent publicly available datasets (GSE39582, GSE17536, GSE37892, and GSE17537) from the GEO database, consisting of 925 colon cancer patients. RESULTS: A five-transcription-factors based predictive model for colon cancer prognosis has been developed by using TCGA colon cancer patient data. Five transcription factors identified for the predictive model is HOXC9, ZNF556, HEYL, HOXC4 and HOXC6. The prediction power of the model is validated with four GEO datasets consisting of 1584 patient samples. Kaplan-Meier curve and log-rank tests were conducted on both training and validation datasets, the difference of overall survival time between predicted low and high-risk groups can be clearly observed. Gene set enrichment analysis was performed to further investigate the difference between low and high-risk groups in the gene pathway level. The biological meaning was interpreted. Overall, our results prove our prediction model has a strong prediction power on colon cancer prognosis. CONCLUSIONS: Transcription factors can be used to construct colon cancer prognostic signatures with strong prediction power. The variable selection process used in this study has the potential to be implemented in the prognostic signature discovery of other cancer types. Our five TF-based predictive model would help with understanding the hidden relationship between colon cancer patient survival and transcription factor activities. It will also provide more insights into the precision treatment of colon cancer patients from a genomic information perspective.
format Online
Article
Text
id pubmed-7504662
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75046622020-09-23 Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice Liu, Jiannan Dong, Chuanpeng Jiang, Guanglong Lu, Xiaoyu Liu, Yunlong Wu, Huanmei BMC Med Genomics Research BACKGROUND: Colon cancer is one of the leading causes of cancer deaths in the USA and around the world. Molecular level characters, such as gene expression levels and mutations, may provide profound information for precision treatment apart from pathological indicators. Transcription factors function as critical regulators in all aspects of cell life, but transcription factors-based biomarkers for colon cancer prognosis were still rare and necessary. METHODS: We implemented an innovative process to select the transcription factors variables and evaluate the prognostic prediction power by combining the Cox PH model with the random forest algorithm. We picked five top-ranked transcription factors and built a prediction model by using Cox PH regression. Using Kaplan-Meier analysis, we validated our predictive model on four independent publicly available datasets (GSE39582, GSE17536, GSE37892, and GSE17537) from the GEO database, consisting of 925 colon cancer patients. RESULTS: A five-transcription-factors based predictive model for colon cancer prognosis has been developed by using TCGA colon cancer patient data. Five transcription factors identified for the predictive model is HOXC9, ZNF556, HEYL, HOXC4 and HOXC6. The prediction power of the model is validated with four GEO datasets consisting of 1584 patient samples. Kaplan-Meier curve and log-rank tests were conducted on both training and validation datasets, the difference of overall survival time between predicted low and high-risk groups can be clearly observed. Gene set enrichment analysis was performed to further investigate the difference between low and high-risk groups in the gene pathway level. The biological meaning was interpreted. Overall, our results prove our prediction model has a strong prediction power on colon cancer prognosis. CONCLUSIONS: Transcription factors can be used to construct colon cancer prognostic signatures with strong prediction power. The variable selection process used in this study has the potential to be implemented in the prognostic signature discovery of other cancer types. Our five TF-based predictive model would help with understanding the hidden relationship between colon cancer patient survival and transcription factor activities. It will also provide more insights into the precision treatment of colon cancer patients from a genomic information perspective. BioMed Central 2020-09-21 /pmc/articles/PMC7504662/ /pubmed/32957968 http://dx.doi.org/10.1186/s12920-020-00775-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liu, Jiannan
Dong, Chuanpeng
Jiang, Guanglong
Lu, Xiaoyu
Liu, Yunlong
Wu, Huanmei
Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice
title Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice
title_full Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice
title_fullStr Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice
title_full_unstemmed Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice
title_short Transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice
title_sort transcription factor expression as a predictor of colon cancer prognosis: a machine learning practice
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7504662/
https://www.ncbi.nlm.nih.gov/pubmed/32957968
http://dx.doi.org/10.1186/s12920-020-00775-0
work_keys_str_mv AT liujiannan transcriptionfactorexpressionasapredictorofcoloncancerprognosisamachinelearningpractice
AT dongchuanpeng transcriptionfactorexpressionasapredictorofcoloncancerprognosisamachinelearningpractice
AT jiangguanglong transcriptionfactorexpressionasapredictorofcoloncancerprognosisamachinelearningpractice
AT luxiaoyu transcriptionfactorexpressionasapredictorofcoloncancerprognosisamachinelearningpractice
AT liuyunlong transcriptionfactorexpressionasapredictorofcoloncancerprognosisamachinelearningpractice
AT wuhuanmei transcriptionfactorexpressionasapredictorofcoloncancerprognosisamachinelearningpractice