Cargando…
Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
BACKGROUND: Bronchopulmonary Dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in the disease is not yet clear. Machine learning, the latest tool for the analysis of biological samples, is still rela...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10105406/ https://www.ncbi.nlm.nih.gov/pubmed/37060021 http://dx.doi.org/10.1186/s12911-023-02163-x |
_version_ | 1785026204358672384 |
---|---|
author | Jia, Mingxuan Li, Jieyi Zhang, Jingying Wei, Ningjing Yin, Yating Chen, Hui Yan, Shixing Wang, Yong |
author_facet | Jia, Mingxuan Li, Jieyi Zhang, Jingying Wei, Ningjing Yin, Yating Chen, Hui Yan, Shixing Wang, Yong |
author_sort | Jia, Mingxuan |
collection | PubMed |
description | BACKGROUND: Bronchopulmonary Dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in the disease is not yet clear. Machine learning, the latest tool for the analysis of biological samples, is still relatively rarely used for in-depth analysis and prediction of diseases. METHODS AND RESULTS: First, the differential expression of cuproptosis-related genes (CRGs) in the GSE108754 dataset was extracted and the heat map showed that the expression of NFE2L2 gene was significantly higher in the control group whereas the expression of GLS gene was significantly higher in the treatment group. Chromosome location analysis showed that both the genes were positively correlated and associated with chromosome 2. The results of immune infiltration and immune cell differential analysis showed differences in the four immune cells, significantly in Monocytes cells. Five new pathways were analyzed through two subgroups based on consistent clustering of CRG expression. Weighted correlation network analysis (WGCNA) set the screening condition to the top 25% to obtain the disease signature genes. Four machine learning algorithms: Generalized Linear Models (GLM), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGB) were used to screen the disease signature genes, and the final five marker genes for disease prediction. The models constructed by GLM method were proved to be more accurate in the validation of two datasets, GSE190215 and GSE188944. CONCLUSION: We eventually identified two copper death-associated genes, NFE2L2 and GLS. A machine learning model-GLM was constructed to predict the prevalence of BPD disease, and five disease signature genes NFATC3, ERMN, PLA2G4A, MTMR9LP and LOC440700 were identified. These genes that were bioinformatics analyzed could be potential targets for identifying BPD disease and treatment. |
format | Online Article Text |
id | pubmed-10105406 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101054062023-04-16 Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning Jia, Mingxuan Li, Jieyi Zhang, Jingying Wei, Ningjing Yin, Yating Chen, Hui Yan, Shixing Wang, Yong BMC Med Inform Decis Mak Research BACKGROUND: Bronchopulmonary Dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in the disease is not yet clear. Machine learning, the latest tool for the analysis of biological samples, is still relatively rarely used for in-depth analysis and prediction of diseases. METHODS AND RESULTS: First, the differential expression of cuproptosis-related genes (CRGs) in the GSE108754 dataset was extracted and the heat map showed that the expression of NFE2L2 gene was significantly higher in the control group whereas the expression of GLS gene was significantly higher in the treatment group. Chromosome location analysis showed that both the genes were positively correlated and associated with chromosome 2. The results of immune infiltration and immune cell differential analysis showed differences in the four immune cells, significantly in Monocytes cells. Five new pathways were analyzed through two subgroups based on consistent clustering of CRG expression. Weighted correlation network analysis (WGCNA) set the screening condition to the top 25% to obtain the disease signature genes. Four machine learning algorithms: Generalized Linear Models (GLM), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGB) were used to screen the disease signature genes, and the final five marker genes for disease prediction. The models constructed by GLM method were proved to be more accurate in the validation of two datasets, GSE190215 and GSE188944. CONCLUSION: We eventually identified two copper death-associated genes, NFE2L2 and GLS. A machine learning model-GLM was constructed to predict the prevalence of BPD disease, and five disease signature genes NFATC3, ERMN, PLA2G4A, MTMR9LP and LOC440700 were identified. These genes that were bioinformatics analyzed could be potential targets for identifying BPD disease and treatment. BioMed Central 2023-04-14 /pmc/articles/PMC10105406/ /pubmed/37060021 http://dx.doi.org/10.1186/s12911-023-02163-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Jia, Mingxuan Li, Jieyi Zhang, Jingying Wei, Ningjing Yin, Yating Chen, Hui Yan, Shixing Wang, Yong Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning |
title | Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning |
title_full | Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning |
title_fullStr | Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning |
title_full_unstemmed | Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning |
title_short | Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning |
title_sort | identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10105406/ https://www.ncbi.nlm.nih.gov/pubmed/37060021 http://dx.doi.org/10.1186/s12911-023-02163-x |
work_keys_str_mv | AT jiamingxuan identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning AT lijieyi identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning AT zhangjingying identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning AT weiningjing identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning AT yinyating identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning AT chenhui identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning AT yanshixing identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning AT wangyong identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning |