Cargando…

An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients

Machine learning (ML) has the potential to identify subsets of patients with distinct phenotypes from gene expression data. However, phenotype prediction using ML has often relied on identifying important genes without a systems biology context. To address this, we created an interpretable ML approa...

Descripción completa

Detalles Bibliográficos
Autores principales: Leventhal, Emily L., Daamen, Andrea R., Grammer, Amrie C., Lipsky, Peter E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10582499/
https://www.ncbi.nlm.nih.gov/pubmed/37860757
http://dx.doi.org/10.1016/j.isci.2023.108042
_version_ 1785122345433694208
author Leventhal, Emily L.
Daamen, Andrea R.
Grammer, Amrie C.
Lipsky, Peter E.
author_facet Leventhal, Emily L.
Daamen, Andrea R.
Grammer, Amrie C.
Lipsky, Peter E.
author_sort Leventhal, Emily L.
collection PubMed
description Machine learning (ML) has the potential to identify subsets of patients with distinct phenotypes from gene expression data. However, phenotype prediction using ML has often relied on identifying important genes without a systems biology context. To address this, we created an interpretable ML approach based on blood transcriptomics to predict phenotype in systemic lupus erythematosus (SLE), a heterogeneous autoimmune disease. We employed a sequential grouped feature importance algorithm to assess the performance of gene sets, including immune and metabolic pathways and cell types, known to be abnormal in SLE in predicting disease activity and organ involvement. Gene sets related to interferon, tumor necrosis factor, the mitoribosome, and T cell activation were the best predictors of phenotype with excellent performance. These results suggest potential relationships between the molecular pathways identified in each model and manifestations of SLE. This ML approach to phenotype prediction can be applied to other diseases and tissues.
format Online
Article
Text
id pubmed-10582499
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105824992023-10-19 An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients Leventhal, Emily L. Daamen, Andrea R. Grammer, Amrie C. Lipsky, Peter E. iScience Article Machine learning (ML) has the potential to identify subsets of patients with distinct phenotypes from gene expression data. However, phenotype prediction using ML has often relied on identifying important genes without a systems biology context. To address this, we created an interpretable ML approach based on blood transcriptomics to predict phenotype in systemic lupus erythematosus (SLE), a heterogeneous autoimmune disease. We employed a sequential grouped feature importance algorithm to assess the performance of gene sets, including immune and metabolic pathways and cell types, known to be abnormal in SLE in predicting disease activity and organ involvement. Gene sets related to interferon, tumor necrosis factor, the mitoribosome, and T cell activation were the best predictors of phenotype with excellent performance. These results suggest potential relationships between the molecular pathways identified in each model and manifestations of SLE. This ML approach to phenotype prediction can be applied to other diseases and tissues. Elsevier 2023-09-25 /pmc/articles/PMC10582499/ /pubmed/37860757 http://dx.doi.org/10.1016/j.isci.2023.108042 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Leventhal, Emily L.
Daamen, Andrea R.
Grammer, Amrie C.
Lipsky, Peter E.
An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients
title An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients
title_full An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients
title_fullStr An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients
title_full_unstemmed An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients
title_short An interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients
title_sort interpretable machine learning pipeline based on transcriptomics predicts phenotypes of lupus patients
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10582499/
https://www.ncbi.nlm.nih.gov/pubmed/37860757
http://dx.doi.org/10.1016/j.isci.2023.108042
work_keys_str_mv AT leventhalemilyl aninterpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients
AT daamenandrear aninterpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients
AT grammeramriec aninterpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients
AT lipskypetere aninterpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients
AT leventhalemilyl interpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients
AT daamenandrear interpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients
AT grammeramriec interpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients
AT lipskypetere interpretablemachinelearningpipelinebasedontranscriptomicspredictsphenotypesoflupuspatients