Cargando…

PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation

Biomarkers predict World Trade Center-Lung Injury (WTC-LI); however, there remains unaddressed multicollinearity in our serum cytokines, chemokines, and high-throughput platform datasets used to phenotype WTC-disease. To address this concern, we used automated, machine-learning, high-dimensional dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Crowley, George, Kim, James, Kwon, Sophia, Lam, Rachel, Prezant, David J., Liu, Mengling, Nolan, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8328304/
https://www.ncbi.nlm.nih.gov/pubmed/34288906
http://dx.doi.org/10.1371/journal.pcbi.1009144
_version_ 1783732280428068864
author Crowley, George
Kim, James
Kwon, Sophia
Lam, Rachel
Prezant, David J.
Liu, Mengling
Nolan, Anna
author_facet Crowley, George
Kim, James
Kwon, Sophia
Lam, Rachel
Prezant, David J.
Liu, Mengling
Nolan, Anna
author_sort Crowley, George
collection PubMed
description Biomarkers predict World Trade Center-Lung Injury (WTC-LI); however, there remains unaddressed multicollinearity in our serum cytokines, chemokines, and high-throughput platform datasets used to phenotype WTC-disease. To address this concern, we used automated, machine-learning, high-dimensional data pruning, and validated identified biomarkers. The parent cohort consisted of male, never-smoking firefighters with WTC-LI (FEV(1, %Pred)< lower limit of normal (LLN); n = 100) and controls (n = 127) and had their biomarkers assessed. Cases and controls (n = 15/group) underwent untargeted metabolomics, then feature selection performed on metabolites, cytokines, chemokines, and clinical data. Cytokines, chemokines, and clinical biomarkers were validated in the non-overlapping parent-cohort via binary logistic regression with 5-fold cross validation. Random forests of metabolites (n = 580), clinical biomarkers (n = 5), and previously assayed cytokines, chemokines (n = 106) identified that the top 5% of biomarkers important to class separation included pigment epithelium-derived factor (PEDF), macrophage derived chemokine (MDC), systolic blood pressure, macrophage inflammatory protein-4 (MIP-4), growth-regulated oncogene protein (GRO), monocyte chemoattractant protein-1 (MCP-1), apolipoprotein-AII (Apo-AII), cell membrane metabolites (sphingolipids, phospholipids), and branched-chain amino acids. Validated models via confounder-adjusted (age on 9/11, BMI, exposure, and pre-9/11 FEV(1, %Pred)) binary logistic regression had AUC(ROC) [0.90(0.84–0.96)]. Decreased PEDF and MIP-4, and increased Apo-AII were associated with increased odds of WTC-LI. Increased GRO, MCP-1, and simultaneously decreased MDC were associated with decreased odds of WTC-LI. In conclusion, automated data pruning identified novel WTC-LI biomarkers; performance was validated in an independent cohort. One biomarker—PEDF, an antiangiogenic agent—is a novel, predictive biomarker of particulate-matter-related lung disease. Other biomarkers—GRO, MCP-1, MDC, MIP-4—reveal immune cell involvement in WTC-LI pathogenesis. Findings of our automated biomarker identification warrant further investigation into these potential pharmacotherapy targets.
format Online
Article
Text
id pubmed-8328304
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83283042021-08-03 PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation Crowley, George Kim, James Kwon, Sophia Lam, Rachel Prezant, David J. Liu, Mengling Nolan, Anna PLoS Comput Biol Research Article Biomarkers predict World Trade Center-Lung Injury (WTC-LI); however, there remains unaddressed multicollinearity in our serum cytokines, chemokines, and high-throughput platform datasets used to phenotype WTC-disease. To address this concern, we used automated, machine-learning, high-dimensional data pruning, and validated identified biomarkers. The parent cohort consisted of male, never-smoking firefighters with WTC-LI (FEV(1, %Pred)< lower limit of normal (LLN); n = 100) and controls (n = 127) and had their biomarkers assessed. Cases and controls (n = 15/group) underwent untargeted metabolomics, then feature selection performed on metabolites, cytokines, chemokines, and clinical data. Cytokines, chemokines, and clinical biomarkers were validated in the non-overlapping parent-cohort via binary logistic regression with 5-fold cross validation. Random forests of metabolites (n = 580), clinical biomarkers (n = 5), and previously assayed cytokines, chemokines (n = 106) identified that the top 5% of biomarkers important to class separation included pigment epithelium-derived factor (PEDF), macrophage derived chemokine (MDC), systolic blood pressure, macrophage inflammatory protein-4 (MIP-4), growth-regulated oncogene protein (GRO), monocyte chemoattractant protein-1 (MCP-1), apolipoprotein-AII (Apo-AII), cell membrane metabolites (sphingolipids, phospholipids), and branched-chain amino acids. Validated models via confounder-adjusted (age on 9/11, BMI, exposure, and pre-9/11 FEV(1, %Pred)) binary logistic regression had AUC(ROC) [0.90(0.84–0.96)]. Decreased PEDF and MIP-4, and increased Apo-AII were associated with increased odds of WTC-LI. Increased GRO, MCP-1, and simultaneously decreased MDC were associated with decreased odds of WTC-LI. In conclusion, automated data pruning identified novel WTC-LI biomarkers; performance was validated in an independent cohort. One biomarker—PEDF, an antiangiogenic agent—is a novel, predictive biomarker of particulate-matter-related lung disease. Other biomarkers—GRO, MCP-1, MDC, MIP-4—reveal immune cell involvement in WTC-LI pathogenesis. Findings of our automated biomarker identification warrant further investigation into these potential pharmacotherapy targets. Public Library of Science 2021-07-21 /pmc/articles/PMC8328304/ /pubmed/34288906 http://dx.doi.org/10.1371/journal.pcbi.1009144 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Crowley, George
Kim, James
Kwon, Sophia
Lam, Rachel
Prezant, David J.
Liu, Mengling
Nolan, Anna
PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation
title PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation
title_full PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation
title_fullStr PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation
title_full_unstemmed PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation
title_short PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation
title_sort pedf, a pleiotropic wtc-li biomarker: machine learning biomarker identification and validation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8328304/
https://www.ncbi.nlm.nih.gov/pubmed/34288906
http://dx.doi.org/10.1371/journal.pcbi.1009144
work_keys_str_mv AT crowleygeorge pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation
AT kimjames pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation
AT kwonsophia pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation
AT lamrachel pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation
AT prezantdavidj pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation
AT liumengling pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation
AT nolananna pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation