Cargando…

Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning

BACKGROUND: Bronchopulmonary Dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in the disease is not yet clear. Machine learning, the latest tool for the analysis of biological samples, is still rela...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Mingxuan, Li, Jieyi, Zhang, Jingying, Wei, Ningjing, Yin, Yating, Chen, Hui, Yan, Shixing, Wang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10105406/
https://www.ncbi.nlm.nih.gov/pubmed/37060021
http://dx.doi.org/10.1186/s12911-023-02163-x
_version_ 1785026204358672384
author Jia, Mingxuan
Li, Jieyi
Zhang, Jingying
Wei, Ningjing
Yin, Yating
Chen, Hui
Yan, Shixing
Wang, Yong
author_facet Jia, Mingxuan
Li, Jieyi
Zhang, Jingying
Wei, Ningjing
Yin, Yating
Chen, Hui
Yan, Shixing
Wang, Yong
author_sort Jia, Mingxuan
collection PubMed
description BACKGROUND: Bronchopulmonary Dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in the disease is not yet clear. Machine learning, the latest tool for the analysis of biological samples, is still relatively rarely used for in-depth analysis and prediction of diseases. METHODS AND RESULTS: First, the differential expression of cuproptosis-related genes (CRGs) in the GSE108754 dataset was extracted and the heat map showed that the expression of NFE2L2 gene was significantly higher in the control group whereas the expression of GLS gene was significantly higher in the treatment group. Chromosome location analysis showed that both the genes were positively correlated and associated with chromosome 2. The results of immune infiltration and immune cell differential analysis showed differences in the four immune cells, significantly in Monocytes cells. Five new pathways were analyzed through two subgroups based on consistent clustering of CRG expression. Weighted correlation network analysis (WGCNA) set the screening condition to the top 25% to obtain the disease signature genes. Four machine learning algorithms: Generalized Linear Models (GLM), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGB) were used to screen the disease signature genes, and the final five marker genes for disease prediction. The models constructed by GLM method were proved to be more accurate in the validation of two datasets, GSE190215 and GSE188944. CONCLUSION: We eventually identified two copper death-associated genes, NFE2L2 and GLS. A machine learning model-GLM was constructed to predict the prevalence of BPD disease, and five disease signature genes NFATC3, ERMN, PLA2G4A, MTMR9LP and LOC440700 were identified. These genes that were bioinformatics analyzed could be potential targets for identifying BPD disease and treatment.
format Online
Article
Text
id pubmed-10105406
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101054062023-04-16 Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning Jia, Mingxuan Li, Jieyi Zhang, Jingying Wei, Ningjing Yin, Yating Chen, Hui Yan, Shixing Wang, Yong BMC Med Inform Decis Mak Research BACKGROUND: Bronchopulmonary Dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in the disease is not yet clear. Machine learning, the latest tool for the analysis of biological samples, is still relatively rarely used for in-depth analysis and prediction of diseases. METHODS AND RESULTS: First, the differential expression of cuproptosis-related genes (CRGs) in the GSE108754 dataset was extracted and the heat map showed that the expression of NFE2L2 gene was significantly higher in the control group whereas the expression of GLS gene was significantly higher in the treatment group. Chromosome location analysis showed that both the genes were positively correlated and associated with chromosome 2. The results of immune infiltration and immune cell differential analysis showed differences in the four immune cells, significantly in Monocytes cells. Five new pathways were analyzed through two subgroups based on consistent clustering of CRG expression. Weighted correlation network analysis (WGCNA) set the screening condition to the top 25% to obtain the disease signature genes. Four machine learning algorithms: Generalized Linear Models (GLM), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGB) were used to screen the disease signature genes, and the final five marker genes for disease prediction. The models constructed by GLM method were proved to be more accurate in the validation of two datasets, GSE190215 and GSE188944. CONCLUSION: We eventually identified two copper death-associated genes, NFE2L2 and GLS. A machine learning model-GLM was constructed to predict the prevalence of BPD disease, and five disease signature genes NFATC3, ERMN, PLA2G4A, MTMR9LP and LOC440700 were identified. These genes that were bioinformatics analyzed could be potential targets for identifying BPD disease and treatment. BioMed Central 2023-04-14 /pmc/articles/PMC10105406/ /pubmed/37060021 http://dx.doi.org/10.1186/s12911-023-02163-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Jia, Mingxuan
Li, Jieyi
Zhang, Jingying
Wei, Ningjing
Yin, Yating
Chen, Hui
Yan, Shixing
Wang, Yong
Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
title Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
title_full Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
title_fullStr Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
title_full_unstemmed Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
title_short Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
title_sort identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10105406/
https://www.ncbi.nlm.nih.gov/pubmed/37060021
http://dx.doi.org/10.1186/s12911-023-02163-x
work_keys_str_mv AT jiamingxuan identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning
AT lijieyi identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning
AT zhangjingying identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning
AT weiningjing identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning
AT yinyating identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning
AT chenhui identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning
AT yanshixing identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning
AT wangyong identificationandvalidationofcuproptosisrelatedgenesandsignaturemarkersinbronchopulmonarydysplasiadiseaseusingbioinformaticsanalysisandmachinelearning