Cargando…
Machine learning based refined differential gene expression analysis of pediatric sepsis
BACKGROUND: Differential expression (DE) analysis of transcriptomic data enables genome-wide analysis of gene expression changes associated with biological conditions of interest. Such analysis often provides a wide list of genes that are differentially expressed between two or more groups. In gener...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7453705/ https://www.ncbi.nlm.nih.gov/pubmed/32859206 http://dx.doi.org/10.1186/s12920-020-00771-4 |
_version_ | 1783575402546987008 |
---|---|
author | Abbas, Mostafa EL-Manzalawy, Yasser |
author_facet | Abbas, Mostafa EL-Manzalawy, Yasser |
author_sort | Abbas, Mostafa |
collection | PubMed |
description | BACKGROUND: Differential expression (DE) analysis of transcriptomic data enables genome-wide analysis of gene expression changes associated with biological conditions of interest. Such analysis often provides a wide list of genes that are differentially expressed between two or more groups. In general, identified differentially expressed genes (DEGs) can be subject to further downstream analysis for obtaining more biological insights such as determining enriched functional pathways or gene ontologies. Furthermore, DEGs are treated as candidate biomarkers and a small set of DEGs might be identified as biomarkers using either biological knowledge or data-driven approaches. METHODS: In this work, we present a novel approach for identifying biomarkers from a list of DEGs by re-ranking them according to the Minimum Redundancy Maximum Relevance (MRMR) criteria using repeated cross-validation feature selection procedure. RESULTS: Using gene expression profiles for 199 children with sepsis and septic shock, we identify 108 DEGs and propose a 10-gene signature for reliably predicting pediatric sepsis mortality with an estimated Area Under ROC Curve (AUC) score of 0.89. CONCLUSIONS: Machine learning based refinement of DE analysis is a promising tool for prioritizing DEGs and discovering biomarkers from gene expression profiles. Moreover, our reported 10-gene signature for pediatric sepsis mortality may facilitate the development of reliable diagnosis and prognosis biomarkers for sepsis. |
format | Online Article Text |
id | pubmed-7453705 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-74537052020-08-28 Machine learning based refined differential gene expression analysis of pediatric sepsis Abbas, Mostafa EL-Manzalawy, Yasser BMC Med Genomics Technical Advance BACKGROUND: Differential expression (DE) analysis of transcriptomic data enables genome-wide analysis of gene expression changes associated with biological conditions of interest. Such analysis often provides a wide list of genes that are differentially expressed between two or more groups. In general, identified differentially expressed genes (DEGs) can be subject to further downstream analysis for obtaining more biological insights such as determining enriched functional pathways or gene ontologies. Furthermore, DEGs are treated as candidate biomarkers and a small set of DEGs might be identified as biomarkers using either biological knowledge or data-driven approaches. METHODS: In this work, we present a novel approach for identifying biomarkers from a list of DEGs by re-ranking them according to the Minimum Redundancy Maximum Relevance (MRMR) criteria using repeated cross-validation feature selection procedure. RESULTS: Using gene expression profiles for 199 children with sepsis and septic shock, we identify 108 DEGs and propose a 10-gene signature for reliably predicting pediatric sepsis mortality with an estimated Area Under ROC Curve (AUC) score of 0.89. CONCLUSIONS: Machine learning based refinement of DE analysis is a promising tool for prioritizing DEGs and discovering biomarkers from gene expression profiles. Moreover, our reported 10-gene signature for pediatric sepsis mortality may facilitate the development of reliable diagnosis and prognosis biomarkers for sepsis. BioMed Central 2020-08-28 /pmc/articles/PMC7453705/ /pubmed/32859206 http://dx.doi.org/10.1186/s12920-020-00771-4 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Technical Advance Abbas, Mostafa EL-Manzalawy, Yasser Machine learning based refined differential gene expression analysis of pediatric sepsis |
title | Machine learning based refined differential gene expression analysis of pediatric sepsis |
title_full | Machine learning based refined differential gene expression analysis of pediatric sepsis |
title_fullStr | Machine learning based refined differential gene expression analysis of pediatric sepsis |
title_full_unstemmed | Machine learning based refined differential gene expression analysis of pediatric sepsis |
title_short | Machine learning based refined differential gene expression analysis of pediatric sepsis |
title_sort | machine learning based refined differential gene expression analysis of pediatric sepsis |
topic | Technical Advance |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7453705/ https://www.ncbi.nlm.nih.gov/pubmed/32859206 http://dx.doi.org/10.1186/s12920-020-00771-4 |
work_keys_str_mv | AT abbasmostafa machinelearningbasedrefineddifferentialgeneexpressionanalysisofpediatricsepsis AT elmanzalawyyasser machinelearningbasedrefineddifferentialgeneexpressionanalysisofpediatricsepsis |