Cargando…

Identification of Key Genes and Key Pathways in Breast Cancer Based on Machine Learning

BACKGROUND: Breast cancer is one of the most common malignant tumors among women worldwide. This study aimed to screen key genes and pathways for breast cancer diagnosis and treatment. MATERIAL/METHODS: We obtained public data from the NCBI GEO database. The data were divided into a control group (n...

Descripción completa

Detalles Bibliográficos
Autores principales: Bao, Shurui, He, Guijin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: International Scientific Literature, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9145905/
https://www.ncbi.nlm.nih.gov/pubmed/35607268
http://dx.doi.org/10.12659/MSM.935515
Descripción
Sumario:BACKGROUND: Breast cancer is one of the most common malignant tumors among women worldwide. This study aimed to screen key genes and pathways for breast cancer diagnosis and treatment. MATERIAL/METHODS: We obtained public data from the NCBI GEO database. The data were divided into a control group (normal breast tissue) and a treatment group (breast cancer tissue). We screened 32 differentially expressed genes (DEGs) between normal breast and cancerous tissues and used GO analysis and GSEA to identify the key pathways. We then combined LASSO and SVM-RFE analyses to screen key genes, and used CIBERSORT to obtain the proportion of 22 types of immune cells. The relationships between key genes and immune-infiltrating cells were further explored. RESULTS: We screened 32 DEGs from the 2 groups, including 27 downregulated genes and 5 upregulated genes. GO analysis indicated that the DEGs were mainly correlated with collagen-containing extracellular matrix (ECM), Wnt signaling pathway, and glycosaminoglycan binding. GSEA indicated that the treatment group was correlated with chromosome segregation and cell cycle while the control group was correlated with cornification, intermediate filament, and nuclear transcription. Through machine learning, SYNM, TGFBR3, and COL10A1 were screened as key genes. Numbers of CD8 T cells, gamma delta T cells, and M1 macrophages were significantly higher, while monocytes and follicular helper-T cells were significantly lower in the treatment group. The downregulated genes, SYNM and TGFBR3, were positively correlated with CD8 T cells and monocytes, but were negatively correlated with gamma delta T cells and M1 macrophages. The upregulated gene, COL10A1, was positively correlated with gamma delta T cells and M1 macrophages, and was negatively correlated with CD8 T cells, monocytes, and follicular helper-T cells. CONCLUSIONS: SYNM, TGFBR3, and COL10A1 are diagnostic genes of breast cancer. They affect breast cancer cells by modulating immune-infiltrating cells.