Cargando…

Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures

BACKGROUND: Descending thoracic aortic aneurysms and dissections can go undetected until severe and catastrophic, and few clinical indices exist to screen for aneurysms or predict risk of dissection. METHODS: This study generated a plasma proteomic dataset from 75 patients with descending type B dis...

Descripción completa

Detalles Bibliográficos
Autores principales: Momenzadeh, Amanda, Kreimer, Simion, Guo, Dongchuan, Ayres, Matthew, Berman, Daniel, Chyu, Kuang-Yuh, Shah, Prediman K, Milewicz, Dianna, Azizzadeh, Ali, Meyer, Jesse G., Parker, Sarah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168345/
https://www.ncbi.nlm.nih.gov/pubmed/37162892
http://dx.doi.org/10.1101/2023.04.26.538468
_version_ 1785038838268166144
author Momenzadeh, Amanda
Kreimer, Simion
Guo, Dongchuan
Ayres, Matthew
Berman, Daniel
Chyu, Kuang-Yuh
Shah, Prediman K
Milewicz, Dianna
Azizzadeh, Ali
Meyer, Jesse G.
Parker, Sarah
author_facet Momenzadeh, Amanda
Kreimer, Simion
Guo, Dongchuan
Ayres, Matthew
Berman, Daniel
Chyu, Kuang-Yuh
Shah, Prediman K
Milewicz, Dianna
Azizzadeh, Ali
Meyer, Jesse G.
Parker, Sarah
author_sort Momenzadeh, Amanda
collection PubMed
description BACKGROUND: Descending thoracic aortic aneurysms and dissections can go undetected until severe and catastrophic, and few clinical indices exist to screen for aneurysms or predict risk of dissection. METHODS: This study generated a plasma proteomic dataset from 75 patients with descending type B dissection (Type B) and 62 patients with descending thoracic aortic aneurysm (DTAA). Standard statistical approaches were compared to supervised machine learning (ML) algorithms to distinguish Type B from DTAA cases. Quantitatively similar proteins were clustered based on linkage distance from hierarchical clustering and ML models were trained with uncorrelated protein lists across various linkage distances with hyperparameter optimization using 5-fold cross validation. Permutation importance (PI) was used for ranking the most important predictor proteins of ML classification between disease states and the proteins among the top 10 PI protein groups were submitted for pathway analysis. RESULTS: Of the 1,549 peptides and 198 proteins used in this study, no peptides and only one protein, hemopexin (HPX), were significantly different at an adjusted p-value <0.01 between Type B and DTAA cases. The highest performing model on the training set (Support Vector Classifier) and its corresponding linkage distance (0.5) were used for evaluation of the test set, yielding a precision-recall area under the curve of 0.7 to classify between Type B from DTAA cases. The five proteins with the highest PI scores were immunoglobulin heavy variable 6–1 (IGHV6-1), lecithin-cholesterol acyltransferase (LCAT), coagulation factor 12 (F12), HPX, and immunoglobulin heavy variable 4–4 (IGHV4-4). All proteins from the top 10 most important correlated groups generated the following significantly enriched pathways in the plasma of Type B versus DTAA patients: complement activation, humoral immune response, and blood coagulation. CONCLUSIONS: We conclude that ML may be useful in differentiating the plasma proteome of highly similar disease states that would otherwise not be distinguishable using statistics, and, in such cases, ML may enable prioritizing important proteins for model prediction.
format Online
Article
Text
id pubmed-10168345
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-101683452023-05-10 Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures Momenzadeh, Amanda Kreimer, Simion Guo, Dongchuan Ayres, Matthew Berman, Daniel Chyu, Kuang-Yuh Shah, Prediman K Milewicz, Dianna Azizzadeh, Ali Meyer, Jesse G. Parker, Sarah bioRxiv Article BACKGROUND: Descending thoracic aortic aneurysms and dissections can go undetected until severe and catastrophic, and few clinical indices exist to screen for aneurysms or predict risk of dissection. METHODS: This study generated a plasma proteomic dataset from 75 patients with descending type B dissection (Type B) and 62 patients with descending thoracic aortic aneurysm (DTAA). Standard statistical approaches were compared to supervised machine learning (ML) algorithms to distinguish Type B from DTAA cases. Quantitatively similar proteins were clustered based on linkage distance from hierarchical clustering and ML models were trained with uncorrelated protein lists across various linkage distances with hyperparameter optimization using 5-fold cross validation. Permutation importance (PI) was used for ranking the most important predictor proteins of ML classification between disease states and the proteins among the top 10 PI protein groups were submitted for pathway analysis. RESULTS: Of the 1,549 peptides and 198 proteins used in this study, no peptides and only one protein, hemopexin (HPX), were significantly different at an adjusted p-value <0.01 between Type B and DTAA cases. The highest performing model on the training set (Support Vector Classifier) and its corresponding linkage distance (0.5) were used for evaluation of the test set, yielding a precision-recall area under the curve of 0.7 to classify between Type B from DTAA cases. The five proteins with the highest PI scores were immunoglobulin heavy variable 6–1 (IGHV6-1), lecithin-cholesterol acyltransferase (LCAT), coagulation factor 12 (F12), HPX, and immunoglobulin heavy variable 4–4 (IGHV4-4). All proteins from the top 10 most important correlated groups generated the following significantly enriched pathways in the plasma of Type B versus DTAA patients: complement activation, humoral immune response, and blood coagulation. CONCLUSIONS: We conclude that ML may be useful in differentiating the plasma proteome of highly similar disease states that would otherwise not be distinguishable using statistics, and, in such cases, ML may enable prioritizing important proteins for model prediction. Cold Spring Harbor Laboratory 2023-10-23 /pmc/articles/PMC10168345/ /pubmed/37162892 http://dx.doi.org/10.1101/2023.04.26.538468 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Momenzadeh, Amanda
Kreimer, Simion
Guo, Dongchuan
Ayres, Matthew
Berman, Daniel
Chyu, Kuang-Yuh
Shah, Prediman K
Milewicz, Dianna
Azizzadeh, Ali
Meyer, Jesse G.
Parker, Sarah
Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures
title Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures
title_full Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures
title_fullStr Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures
title_full_unstemmed Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures
title_short Differentiation between Descending Thoracic Aortic Diseases using Machine Learning and Plasma Proteomic Signatures
title_sort differentiation between descending thoracic aortic diseases using machine learning and plasma proteomic signatures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168345/
https://www.ncbi.nlm.nih.gov/pubmed/37162892
http://dx.doi.org/10.1101/2023.04.26.538468
work_keys_str_mv AT momenzadehamanda differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT kreimersimion differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT guodongchuan differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT ayresmatthew differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT bermandaniel differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT chyukuangyuh differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT shahpredimank differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT milewiczdianna differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT azizzadehali differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT meyerjesseg differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures
AT parkersarah differentiationbetweendescendingthoracicaorticdiseasesusingmachinelearningandplasmaproteomicsignatures