Cargando…

Multi-omics subtyping pipeline for chronic obstructive pulmonary disease

Chronic Obstructive Pulmonary Disease (COPD) is the third leading cause of mortality in the United States; however, COPD has heterogeneous clinical phenotypes. This is the first large scale attempt which uses transcriptomics, proteomics, and metabolomics (multi-omics) to determine whether there are...

Descripción completa

Detalles Bibliográficos
Autores principales: Gillenwater, Lucas A., Helmi, Shahab, Stene, Evan, Pratte, Katherine A., Zhuang, Yonghua, Schuyler, Ronald P., Lange, Leslie, Castaldi, Peter J., Hersh, Craig P., Banaei-Kashani, Farnoush, Bowler, Russell P., Kechris, Katerina J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8386883/
https://www.ncbi.nlm.nih.gov/pubmed/34432807
http://dx.doi.org/10.1371/journal.pone.0255337
_version_ 1783742339870621696
author Gillenwater, Lucas A.
Helmi, Shahab
Stene, Evan
Pratte, Katherine A.
Zhuang, Yonghua
Schuyler, Ronald P.
Lange, Leslie
Castaldi, Peter J.
Hersh, Craig P.
Banaei-Kashani, Farnoush
Bowler, Russell P.
Kechris, Katerina J.
author_facet Gillenwater, Lucas A.
Helmi, Shahab
Stene, Evan
Pratte, Katherine A.
Zhuang, Yonghua
Schuyler, Ronald P.
Lange, Leslie
Castaldi, Peter J.
Hersh, Craig P.
Banaei-Kashani, Farnoush
Bowler, Russell P.
Kechris, Katerina J.
author_sort Gillenwater, Lucas A.
collection PubMed
description Chronic Obstructive Pulmonary Disease (COPD) is the third leading cause of mortality in the United States; however, COPD has heterogeneous clinical phenotypes. This is the first large scale attempt which uses transcriptomics, proteomics, and metabolomics (multi-omics) to determine whether there are molecularly defined clusters with distinct clinical phenotypes that may underlie the clinical heterogeneity. Subjects included 3,278 subjects from the COPDGene cohort with at least one of the following profiles: whole blood transcriptomes (2,650 subjects); plasma proteomes (1,013 subjects); and plasma metabolomes (1,136 subjects). 489 subjects had all three contemporaneous -omics profiles. Autoencoder embeddings were performed individually for each -omics dataset. Embeddings underwent subspace clustering using MineClus, either individually by -omics or combined, followed by recursive feature selection based on Support Vector Machines. Clusters were tested for associations with clinical variables. Optimal single -omics clustering typically resulted in two clusters. Although there was overlap for individual -omics cluster membership, each -omics cluster tended to be defined by unique molecular pathways. For example, prominent molecular features of the metabolome-based clustering included sphingomyelin, while key molecular features of the transcriptome-based clusters were related to immune and bacterial responses. We also found that when we integrated the -omics data at a later stage, we identified subtypes that varied based on age, severity of disease, in addition to diffusing capacity of the lungs for carbon monoxide, and precent on atrial fibrillation. In contrast, when we integrated the -omics data at an earlier stage by treating all data sets equally, there were no clinical differences between subtypes. Similar to clinical clustering, which has revealed multiple heterogenous clinical phenotypes, we show that transcriptomics, proteomics, and metabolomics tend to define clusters of COPD patients with different clinical characteristics. Thus, integrating these different -omics data sets affords additional insight into the molecular nature of COPD and its heterogeneity.
format Online
Article
Text
id pubmed-8386883
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83868832021-08-26 Multi-omics subtyping pipeline for chronic obstructive pulmonary disease Gillenwater, Lucas A. Helmi, Shahab Stene, Evan Pratte, Katherine A. Zhuang, Yonghua Schuyler, Ronald P. Lange, Leslie Castaldi, Peter J. Hersh, Craig P. Banaei-Kashani, Farnoush Bowler, Russell P. Kechris, Katerina J. PLoS One Research Article Chronic Obstructive Pulmonary Disease (COPD) is the third leading cause of mortality in the United States; however, COPD has heterogeneous clinical phenotypes. This is the first large scale attempt which uses transcriptomics, proteomics, and metabolomics (multi-omics) to determine whether there are molecularly defined clusters with distinct clinical phenotypes that may underlie the clinical heterogeneity. Subjects included 3,278 subjects from the COPDGene cohort with at least one of the following profiles: whole blood transcriptomes (2,650 subjects); plasma proteomes (1,013 subjects); and plasma metabolomes (1,136 subjects). 489 subjects had all three contemporaneous -omics profiles. Autoencoder embeddings were performed individually for each -omics dataset. Embeddings underwent subspace clustering using MineClus, either individually by -omics or combined, followed by recursive feature selection based on Support Vector Machines. Clusters were tested for associations with clinical variables. Optimal single -omics clustering typically resulted in two clusters. Although there was overlap for individual -omics cluster membership, each -omics cluster tended to be defined by unique molecular pathways. For example, prominent molecular features of the metabolome-based clustering included sphingomyelin, while key molecular features of the transcriptome-based clusters were related to immune and bacterial responses. We also found that when we integrated the -omics data at a later stage, we identified subtypes that varied based on age, severity of disease, in addition to diffusing capacity of the lungs for carbon monoxide, and precent on atrial fibrillation. In contrast, when we integrated the -omics data at an earlier stage by treating all data sets equally, there were no clinical differences between subtypes. Similar to clinical clustering, which has revealed multiple heterogenous clinical phenotypes, we show that transcriptomics, proteomics, and metabolomics tend to define clusters of COPD patients with different clinical characteristics. Thus, integrating these different -omics data sets affords additional insight into the molecular nature of COPD and its heterogeneity. Public Library of Science 2021-08-25 /pmc/articles/PMC8386883/ /pubmed/34432807 http://dx.doi.org/10.1371/journal.pone.0255337 Text en © 2021 Gillenwater et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gillenwater, Lucas A.
Helmi, Shahab
Stene, Evan
Pratte, Katherine A.
Zhuang, Yonghua
Schuyler, Ronald P.
Lange, Leslie
Castaldi, Peter J.
Hersh, Craig P.
Banaei-Kashani, Farnoush
Bowler, Russell P.
Kechris, Katerina J.
Multi-omics subtyping pipeline for chronic obstructive pulmonary disease
title Multi-omics subtyping pipeline for chronic obstructive pulmonary disease
title_full Multi-omics subtyping pipeline for chronic obstructive pulmonary disease
title_fullStr Multi-omics subtyping pipeline for chronic obstructive pulmonary disease
title_full_unstemmed Multi-omics subtyping pipeline for chronic obstructive pulmonary disease
title_short Multi-omics subtyping pipeline for chronic obstructive pulmonary disease
title_sort multi-omics subtyping pipeline for chronic obstructive pulmonary disease
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8386883/
https://www.ncbi.nlm.nih.gov/pubmed/34432807
http://dx.doi.org/10.1371/journal.pone.0255337
work_keys_str_mv AT gillenwaterlucasa multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT helmishahab multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT steneevan multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT prattekatherinea multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT zhuangyonghua multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT schuylerronaldp multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT langeleslie multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT castaldipeterj multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT hershcraigp multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT banaeikashanifarnoush multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT bowlerrussellp multiomicssubtypingpipelineforchronicobstructivepulmonarydisease
AT kechriskaterinaj multiomicssubtypingpipelineforchronicobstructivepulmonarydisease