Cargando…
PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation
Biomarkers predict World Trade Center-Lung Injury (WTC-LI); however, there remains unaddressed multicollinearity in our serum cytokines, chemokines, and high-throughput platform datasets used to phenotype WTC-disease. To address this concern, we used automated, machine-learning, high-dimensional dat...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8328304/ https://www.ncbi.nlm.nih.gov/pubmed/34288906 http://dx.doi.org/10.1371/journal.pcbi.1009144 |
_version_ | 1783732280428068864 |
---|---|
author | Crowley, George Kim, James Kwon, Sophia Lam, Rachel Prezant, David J. Liu, Mengling Nolan, Anna |
author_facet | Crowley, George Kim, James Kwon, Sophia Lam, Rachel Prezant, David J. Liu, Mengling Nolan, Anna |
author_sort | Crowley, George |
collection | PubMed |
description | Biomarkers predict World Trade Center-Lung Injury (WTC-LI); however, there remains unaddressed multicollinearity in our serum cytokines, chemokines, and high-throughput platform datasets used to phenotype WTC-disease. To address this concern, we used automated, machine-learning, high-dimensional data pruning, and validated identified biomarkers. The parent cohort consisted of male, never-smoking firefighters with WTC-LI (FEV(1, %Pred)< lower limit of normal (LLN); n = 100) and controls (n = 127) and had their biomarkers assessed. Cases and controls (n = 15/group) underwent untargeted metabolomics, then feature selection performed on metabolites, cytokines, chemokines, and clinical data. Cytokines, chemokines, and clinical biomarkers were validated in the non-overlapping parent-cohort via binary logistic regression with 5-fold cross validation. Random forests of metabolites (n = 580), clinical biomarkers (n = 5), and previously assayed cytokines, chemokines (n = 106) identified that the top 5% of biomarkers important to class separation included pigment epithelium-derived factor (PEDF), macrophage derived chemokine (MDC), systolic blood pressure, macrophage inflammatory protein-4 (MIP-4), growth-regulated oncogene protein (GRO), monocyte chemoattractant protein-1 (MCP-1), apolipoprotein-AII (Apo-AII), cell membrane metabolites (sphingolipids, phospholipids), and branched-chain amino acids. Validated models via confounder-adjusted (age on 9/11, BMI, exposure, and pre-9/11 FEV(1, %Pred)) binary logistic regression had AUC(ROC) [0.90(0.84–0.96)]. Decreased PEDF and MIP-4, and increased Apo-AII were associated with increased odds of WTC-LI. Increased GRO, MCP-1, and simultaneously decreased MDC were associated with decreased odds of WTC-LI. In conclusion, automated data pruning identified novel WTC-LI biomarkers; performance was validated in an independent cohort. One biomarker—PEDF, an antiangiogenic agent—is a novel, predictive biomarker of particulate-matter-related lung disease. Other biomarkers—GRO, MCP-1, MDC, MIP-4—reveal immune cell involvement in WTC-LI pathogenesis. Findings of our automated biomarker identification warrant further investigation into these potential pharmacotherapy targets. |
format | Online Article Text |
id | pubmed-8328304 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83283042021-08-03 PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation Crowley, George Kim, James Kwon, Sophia Lam, Rachel Prezant, David J. Liu, Mengling Nolan, Anna PLoS Comput Biol Research Article Biomarkers predict World Trade Center-Lung Injury (WTC-LI); however, there remains unaddressed multicollinearity in our serum cytokines, chemokines, and high-throughput platform datasets used to phenotype WTC-disease. To address this concern, we used automated, machine-learning, high-dimensional data pruning, and validated identified biomarkers. The parent cohort consisted of male, never-smoking firefighters with WTC-LI (FEV(1, %Pred)< lower limit of normal (LLN); n = 100) and controls (n = 127) and had their biomarkers assessed. Cases and controls (n = 15/group) underwent untargeted metabolomics, then feature selection performed on metabolites, cytokines, chemokines, and clinical data. Cytokines, chemokines, and clinical biomarkers were validated in the non-overlapping parent-cohort via binary logistic regression with 5-fold cross validation. Random forests of metabolites (n = 580), clinical biomarkers (n = 5), and previously assayed cytokines, chemokines (n = 106) identified that the top 5% of biomarkers important to class separation included pigment epithelium-derived factor (PEDF), macrophage derived chemokine (MDC), systolic blood pressure, macrophage inflammatory protein-4 (MIP-4), growth-regulated oncogene protein (GRO), monocyte chemoattractant protein-1 (MCP-1), apolipoprotein-AII (Apo-AII), cell membrane metabolites (sphingolipids, phospholipids), and branched-chain amino acids. Validated models via confounder-adjusted (age on 9/11, BMI, exposure, and pre-9/11 FEV(1, %Pred)) binary logistic regression had AUC(ROC) [0.90(0.84–0.96)]. Decreased PEDF and MIP-4, and increased Apo-AII were associated with increased odds of WTC-LI. Increased GRO, MCP-1, and simultaneously decreased MDC were associated with decreased odds of WTC-LI. In conclusion, automated data pruning identified novel WTC-LI biomarkers; performance was validated in an independent cohort. One biomarker—PEDF, an antiangiogenic agent—is a novel, predictive biomarker of particulate-matter-related lung disease. Other biomarkers—GRO, MCP-1, MDC, MIP-4—reveal immune cell involvement in WTC-LI pathogenesis. Findings of our automated biomarker identification warrant further investigation into these potential pharmacotherapy targets. Public Library of Science 2021-07-21 /pmc/articles/PMC8328304/ /pubmed/34288906 http://dx.doi.org/10.1371/journal.pcbi.1009144 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Crowley, George Kim, James Kwon, Sophia Lam, Rachel Prezant, David J. Liu, Mengling Nolan, Anna PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation |
title | PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation |
title_full | PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation |
title_fullStr | PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation |
title_full_unstemmed | PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation |
title_short | PEDF, a pleiotropic WTC-LI biomarker: Machine learning biomarker identification and validation |
title_sort | pedf, a pleiotropic wtc-li biomarker: machine learning biomarker identification and validation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8328304/ https://www.ncbi.nlm.nih.gov/pubmed/34288906 http://dx.doi.org/10.1371/journal.pcbi.1009144 |
work_keys_str_mv | AT crowleygeorge pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation AT kimjames pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation AT kwonsophia pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation AT lamrachel pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation AT prezantdavidj pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation AT liumengling pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation AT nolananna pedfapleiotropicwtclibiomarkermachinelearningbiomarkeridentificationandvalidation |