Cargando…

Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages

Breast cancer (BC), as one of the leading causes of death among women, comprises several subtypes with controversial and poor prognosis. Considering the TNM (tumor, lymph node, metastasis) based classification for staging of breast cancer, it is essential to diagnose the disease at early stages. The...

Descripción completa

Detalles Bibliográficos
Autores principales: Amjad, Elham, Asnaashari, Solmaz, Sokouti, Babak, Dastmalchi, Siavoush
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331704/
https://www.ncbi.nlm.nih.gov/pubmed/32616754
http://dx.doi.org/10.1038/s41598-020-67643-w
_version_ 1783553386035019776
author Amjad, Elham
Asnaashari, Solmaz
Sokouti, Babak
Dastmalchi, Siavoush
author_facet Amjad, Elham
Asnaashari, Solmaz
Sokouti, Babak
Dastmalchi, Siavoush
author_sort Amjad, Elham
collection PubMed
description Breast cancer (BC), as one of the leading causes of death among women, comprises several subtypes with controversial and poor prognosis. Considering the TNM (tumor, lymph node, metastasis) based classification for staging of breast cancer, it is essential to diagnose the disease at early stages. The present study aims to take advantage of the systems biology approach on genome wide gene expression profiling datasets to identify the potential biomarkers involved at stage I, stage II, stage III, and stage IV as well as in the integrated group. Three HER2-negative breast cancer microarray datasets were retrieved from the GEO database, including normal, stage I, stage II, stage III, and stage IV samples. Additionally, one dataset was also extracted to test the developed predictive models trained on the three datasets. The analysis of gene expression profiles to identify differentially expressed genes (DEGs) was performed after preprocessing and normalization of data. Then, statistically significant prioritized DEGs were used to construct protein–protein interaction networks for the stages for module analysis and biomarker identification. Furthermore, the prioritized DEGs were used to determine the involved GO enrichment and KEGG signaling pathways at various stages of the breast cancer. The recurrence survival rate analysis of the identified gene biomarkers was conducted based on Kaplan–Meier methodology. Furthermore, the identified genes were validated not only by using several classification models but also through screening the experimental literature reports on the target genes. Fourteen (21 genes), nine (17 genes), eight (10 genes), four (7 genes), and six (8 genes) gene modules (total of 53 unique genes out of 63 genes with involving those with the same connectivity degree) were identified for stage I, stage II, stage III, stage IV, and the integrated group. Moreover, SMC4, FN1, FOS, JUN, and KIF11 and RACGAP1 genes with the highest connectivity degrees were in module 1 for abovementioned stages, respectively. The biological processes, cellular components, and molecular functions were demonstrated for outcomes of GO analysis and KEGG pathway assessment. Additionally, the Kaplan–Meier analysis revealed that 33 genes were found to be significant while considering the recurrence-free survival rate as an alternative to overall survival rate. Furthermore, the machine learning calcification models show good performance on the determined biomarkers. Moreover, the literature reports have confirmed all of the identified gene biomarkers for breast cancer. According to the literature evidence, the identified hub genes are highly correlated with HER2-negative breast cancer. The 53-mRNA signature might be a potential gene set for TNM based stages as well as possible therapeutics with potentially good performance in predicting and managing recurrence-free survival rates at stages I, II, III, and IV as well as in the integrated group. Moreover, the identified genes for the TNM-based stages can also be used as mRNA profile signatures to determine the current stage of the breast cancer.
format Online
Article
Text
id pubmed-7331704
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73317042020-07-06 Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages Amjad, Elham Asnaashari, Solmaz Sokouti, Babak Dastmalchi, Siavoush Sci Rep Original Research Breast cancer (BC), as one of the leading causes of death among women, comprises several subtypes with controversial and poor prognosis. Considering the TNM (tumor, lymph node, metastasis) based classification for staging of breast cancer, it is essential to diagnose the disease at early stages. The present study aims to take advantage of the systems biology approach on genome wide gene expression profiling datasets to identify the potential biomarkers involved at stage I, stage II, stage III, and stage IV as well as in the integrated group. Three HER2-negative breast cancer microarray datasets were retrieved from the GEO database, including normal, stage I, stage II, stage III, and stage IV samples. Additionally, one dataset was also extracted to test the developed predictive models trained on the three datasets. The analysis of gene expression profiles to identify differentially expressed genes (DEGs) was performed after preprocessing and normalization of data. Then, statistically significant prioritized DEGs were used to construct protein–protein interaction networks for the stages for module analysis and biomarker identification. Furthermore, the prioritized DEGs were used to determine the involved GO enrichment and KEGG signaling pathways at various stages of the breast cancer. The recurrence survival rate analysis of the identified gene biomarkers was conducted based on Kaplan–Meier methodology. Furthermore, the identified genes were validated not only by using several classification models but also through screening the experimental literature reports on the target genes. Fourteen (21 genes), nine (17 genes), eight (10 genes), four (7 genes), and six (8 genes) gene modules (total of 53 unique genes out of 63 genes with involving those with the same connectivity degree) were identified for stage I, stage II, stage III, stage IV, and the integrated group. Moreover, SMC4, FN1, FOS, JUN, and KIF11 and RACGAP1 genes with the highest connectivity degrees were in module 1 for abovementioned stages, respectively. The biological processes, cellular components, and molecular functions were demonstrated for outcomes of GO analysis and KEGG pathway assessment. Additionally, the Kaplan–Meier analysis revealed that 33 genes were found to be significant while considering the recurrence-free survival rate as an alternative to overall survival rate. Furthermore, the machine learning calcification models show good performance on the determined biomarkers. Moreover, the literature reports have confirmed all of the identified gene biomarkers for breast cancer. According to the literature evidence, the identified hub genes are highly correlated with HER2-negative breast cancer. The 53-mRNA signature might be a potential gene set for TNM based stages as well as possible therapeutics with potentially good performance in predicting and managing recurrence-free survival rates at stages I, II, III, and IV as well as in the integrated group. Moreover, the identified genes for the TNM-based stages can also be used as mRNA profile signatures to determine the current stage of the breast cancer. Nature Publishing Group UK 2020-07-02 /pmc/articles/PMC7331704/ /pubmed/32616754 http://dx.doi.org/10.1038/s41598-020-67643-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Original Research
Amjad, Elham
Asnaashari, Solmaz
Sokouti, Babak
Dastmalchi, Siavoush
Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages
title Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages
title_full Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages
title_fullStr Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages
title_full_unstemmed Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages
title_short Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages
title_sort systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with tnm-based clinical stages
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331704/
https://www.ncbi.nlm.nih.gov/pubmed/32616754
http://dx.doi.org/10.1038/s41598-020-67643-w
work_keys_str_mv AT amjadelham systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages
AT asnaasharisolmaz systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages
AT sokoutibabak systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages
AT dastmalchisiavoush systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages