Cargando…

Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data

Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learni...

Descripción completa

Detalles Bibliográficos
Autores principales: Yones, Sara A., Annett, Alva, Stoll, Patricia, Diamanti, Klev, Holmfeldt, Linda, Barrenäs, Carl Fredrik, Meadows, Jennifer R. S., Komorowski, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9076598/
https://www.ncbi.nlm.nih.gov/pubmed/35523803
http://dx.doi.org/10.1038/s41598-022-10853-1
_version_ 1784701961246867456
author Yones, Sara A.
Annett, Alva
Stoll, Patricia
Diamanti, Klev
Holmfeldt, Linda
Barrenäs, Carl Fredrik
Meadows, Jennifer R. S.
Komorowski, Jan
author_facet Yones, Sara A.
Annett, Alva
Stoll, Patricia
Diamanti, Klev
Holmfeldt, Linda
Barrenäs, Carl Fredrik
Meadows, Jennifer R. S.
Komorowski, Jan
author_sort Yones, Sara A.
collection PubMed
description Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learning (RBML) models and rule networks (RN) to an existing paediatric Systemic Lupus Erythematosus (SLE) blood expression dataset, with the goal of developing gene networks to separate low and high disease activity (DA1 and DA3). The resultant model had an 81% accuracy to distinguish between DA1 and DA3, with unsupervised hierarchical clustering revealing additional subgroups indicative of the immune axis involved or state of disease flare. These subgroups correlated with clinical variables, suggesting that the gene sets identified may further the understanding of gene networks that act in concert to drive disease progression. This included roles for genes (i) induced by interferons (IFI35 and OTOF), (ii) key to SLE cell types (KLRB1 encoding CD161), or (iii) with roles in autophagy and NF-κB pathway responses (CKAP4). As demonstrated here, RBML approaches have the potential to reveal novel gene patterns from within a heterogeneous disease, facilitating patient clinical and therapeutic stratification.
format Online
Article
Text
id pubmed-9076598
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-90765982022-05-08 Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data Yones, Sara A. Annett, Alva Stoll, Patricia Diamanti, Klev Holmfeldt, Linda Barrenäs, Carl Fredrik Meadows, Jennifer R. S. Komorowski, Jan Sci Rep Article Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learning (RBML) models and rule networks (RN) to an existing paediatric Systemic Lupus Erythematosus (SLE) blood expression dataset, with the goal of developing gene networks to separate low and high disease activity (DA1 and DA3). The resultant model had an 81% accuracy to distinguish between DA1 and DA3, with unsupervised hierarchical clustering revealing additional subgroups indicative of the immune axis involved or state of disease flare. These subgroups correlated with clinical variables, suggesting that the gene sets identified may further the understanding of gene networks that act in concert to drive disease progression. This included roles for genes (i) induced by interferons (IFI35 and OTOF), (ii) key to SLE cell types (KLRB1 encoding CD161), or (iii) with roles in autophagy and NF-κB pathway responses (CKAP4). As demonstrated here, RBML approaches have the potential to reveal novel gene patterns from within a heterogeneous disease, facilitating patient clinical and therapeutic stratification. Nature Publishing Group UK 2022-05-06 /pmc/articles/PMC9076598/ /pubmed/35523803 http://dx.doi.org/10.1038/s41598-022-10853-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yones, Sara A.
Annett, Alva
Stoll, Patricia
Diamanti, Klev
Holmfeldt, Linda
Barrenäs, Carl Fredrik
Meadows, Jennifer R. S.
Komorowski, Jan
Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data
title Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data
title_full Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data
title_fullStr Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data
title_full_unstemmed Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data
title_short Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data
title_sort interpretable machine learning identifies paediatric systemic lupus erythematosus subtypes based on gene expression data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9076598/
https://www.ncbi.nlm.nih.gov/pubmed/35523803
http://dx.doi.org/10.1038/s41598-022-10853-1
work_keys_str_mv AT yonessaraa interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata
AT annettalva interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata
AT stollpatricia interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata
AT diamantiklev interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata
AT holmfeldtlinda interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata
AT barrenascarlfredrik interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata
AT meadowsjenniferrs interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata
AT komorowskijan interpretablemachinelearningidentifiespaediatricsystemiclupuserythematosussubtypesbasedongeneexpressiondata