Cargando…
Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages
Breast cancer (BC), as one of the leading causes of death among women, comprises several subtypes with controversial and poor prognosis. Considering the TNM (tumor, lymph node, metastasis) based classification for staging of breast cancer, it is essential to diagnose the disease at early stages. The...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331704/ https://www.ncbi.nlm.nih.gov/pubmed/32616754 http://dx.doi.org/10.1038/s41598-020-67643-w |
_version_ | 1783553386035019776 |
---|---|
author | Amjad, Elham Asnaashari, Solmaz Sokouti, Babak Dastmalchi, Siavoush |
author_facet | Amjad, Elham Asnaashari, Solmaz Sokouti, Babak Dastmalchi, Siavoush |
author_sort | Amjad, Elham |
collection | PubMed |
description | Breast cancer (BC), as one of the leading causes of death among women, comprises several subtypes with controversial and poor prognosis. Considering the TNM (tumor, lymph node, metastasis) based classification for staging of breast cancer, it is essential to diagnose the disease at early stages. The present study aims to take advantage of the systems biology approach on genome wide gene expression profiling datasets to identify the potential biomarkers involved at stage I, stage II, stage III, and stage IV as well as in the integrated group. Three HER2-negative breast cancer microarray datasets were retrieved from the GEO database, including normal, stage I, stage II, stage III, and stage IV samples. Additionally, one dataset was also extracted to test the developed predictive models trained on the three datasets. The analysis of gene expression profiles to identify differentially expressed genes (DEGs) was performed after preprocessing and normalization of data. Then, statistically significant prioritized DEGs were used to construct protein–protein interaction networks for the stages for module analysis and biomarker identification. Furthermore, the prioritized DEGs were used to determine the involved GO enrichment and KEGG signaling pathways at various stages of the breast cancer. The recurrence survival rate analysis of the identified gene biomarkers was conducted based on Kaplan–Meier methodology. Furthermore, the identified genes were validated not only by using several classification models but also through screening the experimental literature reports on the target genes. Fourteen (21 genes), nine (17 genes), eight (10 genes), four (7 genes), and six (8 genes) gene modules (total of 53 unique genes out of 63 genes with involving those with the same connectivity degree) were identified for stage I, stage II, stage III, stage IV, and the integrated group. Moreover, SMC4, FN1, FOS, JUN, and KIF11 and RACGAP1 genes with the highest connectivity degrees were in module 1 for abovementioned stages, respectively. The biological processes, cellular components, and molecular functions were demonstrated for outcomes of GO analysis and KEGG pathway assessment. Additionally, the Kaplan–Meier analysis revealed that 33 genes were found to be significant while considering the recurrence-free survival rate as an alternative to overall survival rate. Furthermore, the machine learning calcification models show good performance on the determined biomarkers. Moreover, the literature reports have confirmed all of the identified gene biomarkers for breast cancer. According to the literature evidence, the identified hub genes are highly correlated with HER2-negative breast cancer. The 53-mRNA signature might be a potential gene set for TNM based stages as well as possible therapeutics with potentially good performance in predicting and managing recurrence-free survival rates at stages I, II, III, and IV as well as in the integrated group. Moreover, the identified genes for the TNM-based stages can also be used as mRNA profile signatures to determine the current stage of the breast cancer. |
format | Online Article Text |
id | pubmed-7331704 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-73317042020-07-06 Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages Amjad, Elham Asnaashari, Solmaz Sokouti, Babak Dastmalchi, Siavoush Sci Rep Original Research Breast cancer (BC), as one of the leading causes of death among women, comprises several subtypes with controversial and poor prognosis. Considering the TNM (tumor, lymph node, metastasis) based classification for staging of breast cancer, it is essential to diagnose the disease at early stages. The present study aims to take advantage of the systems biology approach on genome wide gene expression profiling datasets to identify the potential biomarkers involved at stage I, stage II, stage III, and stage IV as well as in the integrated group. Three HER2-negative breast cancer microarray datasets were retrieved from the GEO database, including normal, stage I, stage II, stage III, and stage IV samples. Additionally, one dataset was also extracted to test the developed predictive models trained on the three datasets. The analysis of gene expression profiles to identify differentially expressed genes (DEGs) was performed after preprocessing and normalization of data. Then, statistically significant prioritized DEGs were used to construct protein–protein interaction networks for the stages for module analysis and biomarker identification. Furthermore, the prioritized DEGs were used to determine the involved GO enrichment and KEGG signaling pathways at various stages of the breast cancer. The recurrence survival rate analysis of the identified gene biomarkers was conducted based on Kaplan–Meier methodology. Furthermore, the identified genes were validated not only by using several classification models but also through screening the experimental literature reports on the target genes. Fourteen (21 genes), nine (17 genes), eight (10 genes), four (7 genes), and six (8 genes) gene modules (total of 53 unique genes out of 63 genes with involving those with the same connectivity degree) were identified for stage I, stage II, stage III, stage IV, and the integrated group. Moreover, SMC4, FN1, FOS, JUN, and KIF11 and RACGAP1 genes with the highest connectivity degrees were in module 1 for abovementioned stages, respectively. The biological processes, cellular components, and molecular functions were demonstrated for outcomes of GO analysis and KEGG pathway assessment. Additionally, the Kaplan–Meier analysis revealed that 33 genes were found to be significant while considering the recurrence-free survival rate as an alternative to overall survival rate. Furthermore, the machine learning calcification models show good performance on the determined biomarkers. Moreover, the literature reports have confirmed all of the identified gene biomarkers for breast cancer. According to the literature evidence, the identified hub genes are highly correlated with HER2-negative breast cancer. The 53-mRNA signature might be a potential gene set for TNM based stages as well as possible therapeutics with potentially good performance in predicting and managing recurrence-free survival rates at stages I, II, III, and IV as well as in the integrated group. Moreover, the identified genes for the TNM-based stages can also be used as mRNA profile signatures to determine the current stage of the breast cancer. Nature Publishing Group UK 2020-07-02 /pmc/articles/PMC7331704/ /pubmed/32616754 http://dx.doi.org/10.1038/s41598-020-67643-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Original Research Amjad, Elham Asnaashari, Solmaz Sokouti, Babak Dastmalchi, Siavoush Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages |
title | Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages |
title_full | Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages |
title_fullStr | Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages |
title_full_unstemmed | Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages |
title_short | Systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with TNM-based clinical stages |
title_sort | systems biology comprehensive analysis on breast cancer for identification of key gene modules and genes associated with tnm-based clinical stages |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331704/ https://www.ncbi.nlm.nih.gov/pubmed/32616754 http://dx.doi.org/10.1038/s41598-020-67643-w |
work_keys_str_mv | AT amjadelham systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages AT asnaasharisolmaz systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages AT sokoutibabak systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages AT dastmalchisiavoush systemsbiologycomprehensiveanalysisonbreastcancerforidentificationofkeygenemodulesandgenesassociatedwithtnmbasedclinicalstages |