Cargando…

Platelet-Based Liquid Biopsies through the Lens of Machine Learning

SIMPLE SUMMARY: Liquid biopsies are a non-invasive way to diagnose and monitor cancer using blood tests. Machine learning can help understand the genetic data from these tests, but it is challenging to validate clinical applications. In our study, we first compiled a large-scale dataset for cancer c...

Descripción completa

Detalles Bibliográficos
Autores principales: Cygert, Sebastian, Pastuszak, Krzysztof, Górski, Franciszek, Sieczczyński, Michał, Juszczyk, Piotr, Rutkowski, Antoni, Lewalski, Sebastian, Różański, Robert, Jopek, Maksym Albin, Jassem, Jacek, Czyżewski, Andrzej, Wurdinger, Thomas, Best, Myron G., Żaczek, Anna J., Supernat, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136732/
https://www.ncbi.nlm.nih.gov/pubmed/37190262
http://dx.doi.org/10.3390/cancers15082336
_version_ 1785032289924677632
author Cygert, Sebastian
Pastuszak, Krzysztof
Górski, Franciszek
Sieczczyński, Michał
Juszczyk, Piotr
Rutkowski, Antoni
Lewalski, Sebastian
Różański, Robert
Jopek, Maksym Albin
Jassem, Jacek
Czyżewski, Andrzej
Wurdinger, Thomas
Best, Myron G.
Żaczek, Anna J.
Supernat, Anna
author_facet Cygert, Sebastian
Pastuszak, Krzysztof
Górski, Franciszek
Sieczczyński, Michał
Juszczyk, Piotr
Rutkowski, Antoni
Lewalski, Sebastian
Różański, Robert
Jopek, Maksym Albin
Jassem, Jacek
Czyżewski, Andrzej
Wurdinger, Thomas
Best, Myron G.
Żaczek, Anna J.
Supernat, Anna
author_sort Cygert, Sebastian
collection PubMed
description SIMPLE SUMMARY: Liquid biopsies are a non-invasive way to diagnose and monitor cancer using blood tests. Machine learning can help understand the genetic data from these tests, but it is challenging to validate clinical applications. In our study, we first compiled a large-scale dataset for cancer classification. Then, we extracted relevant features from the data and performed a binary classification, with the prediction outcome of either a sample collected from a cancer patient or a sample collected from an asymptomatic control. We used different convolutional neural networks (CNNs) and boosting methods to evaluate the classification performance. We have obtained an impressive result of 0.96 area under the curve. Finally, we tested the robustness of the models using test data from novel hospitals and performed data inspection to find the most relevant features for the prediction. Our work proves the great potential of using liquid biopsies for cancer patient classification. ABSTRACT: Liquid biopsies offer minimally invasive diagnosis and monitoring of cancer disease. This biosource is often analyzed using sequencing, which generates highly complex data that can be used using machine learning tools. Nevertheless, validating the clinical applications of such methods is challenging. It requires: (a) using data from many patients; (b) verifying potential bias concerning sample collection; and (c) adding interpretability to the model. In this work, we have used RNA sequencing data of tumor-educated platelets (TEPs) and performed a binary classification (cancer vs. no-cancer). First, we compiled a large-scale dataset with more than a thousand donors. Further, we used different convolutional neural networks (CNNs) and boosting methods to evaluate the classifier performance. We have obtained an impressive result of 0.96 area under the curve. We then identified different clusters of splice variants using expert knowledge from the Kyoto Encyclopedia of Genes and Genomes (KEGG). Employing boosting algorithms, we identified the features with the highest predictive power. Finally, we tested the robustness of the models using test data from novel hospitals. Notably, we did not observe any decrease in model performance. Our work proves the great potential of using TEP data for cancer patient classification and opens the avenue for profound cancer diagnostics.
format Online
Article
Text
id pubmed-10136732
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101367322023-04-28 Platelet-Based Liquid Biopsies through the Lens of Machine Learning Cygert, Sebastian Pastuszak, Krzysztof Górski, Franciszek Sieczczyński, Michał Juszczyk, Piotr Rutkowski, Antoni Lewalski, Sebastian Różański, Robert Jopek, Maksym Albin Jassem, Jacek Czyżewski, Andrzej Wurdinger, Thomas Best, Myron G. Żaczek, Anna J. Supernat, Anna Cancers (Basel) Article SIMPLE SUMMARY: Liquid biopsies are a non-invasive way to diagnose and monitor cancer using blood tests. Machine learning can help understand the genetic data from these tests, but it is challenging to validate clinical applications. In our study, we first compiled a large-scale dataset for cancer classification. Then, we extracted relevant features from the data and performed a binary classification, with the prediction outcome of either a sample collected from a cancer patient or a sample collected from an asymptomatic control. We used different convolutional neural networks (CNNs) and boosting methods to evaluate the classification performance. We have obtained an impressive result of 0.96 area under the curve. Finally, we tested the robustness of the models using test data from novel hospitals and performed data inspection to find the most relevant features for the prediction. Our work proves the great potential of using liquid biopsies for cancer patient classification. ABSTRACT: Liquid biopsies offer minimally invasive diagnosis and monitoring of cancer disease. This biosource is often analyzed using sequencing, which generates highly complex data that can be used using machine learning tools. Nevertheless, validating the clinical applications of such methods is challenging. It requires: (a) using data from many patients; (b) verifying potential bias concerning sample collection; and (c) adding interpretability to the model. In this work, we have used RNA sequencing data of tumor-educated platelets (TEPs) and performed a binary classification (cancer vs. no-cancer). First, we compiled a large-scale dataset with more than a thousand donors. Further, we used different convolutional neural networks (CNNs) and boosting methods to evaluate the classifier performance. We have obtained an impressive result of 0.96 area under the curve. We then identified different clusters of splice variants using expert knowledge from the Kyoto Encyclopedia of Genes and Genomes (KEGG). Employing boosting algorithms, we identified the features with the highest predictive power. Finally, we tested the robustness of the models using test data from novel hospitals. Notably, we did not observe any decrease in model performance. Our work proves the great potential of using TEP data for cancer patient classification and opens the avenue for profound cancer diagnostics. MDPI 2023-04-17 /pmc/articles/PMC10136732/ /pubmed/37190262 http://dx.doi.org/10.3390/cancers15082336 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cygert, Sebastian
Pastuszak, Krzysztof
Górski, Franciszek
Sieczczyński, Michał
Juszczyk, Piotr
Rutkowski, Antoni
Lewalski, Sebastian
Różański, Robert
Jopek, Maksym Albin
Jassem, Jacek
Czyżewski, Andrzej
Wurdinger, Thomas
Best, Myron G.
Żaczek, Anna J.
Supernat, Anna
Platelet-Based Liquid Biopsies through the Lens of Machine Learning
title Platelet-Based Liquid Biopsies through the Lens of Machine Learning
title_full Platelet-Based Liquid Biopsies through the Lens of Machine Learning
title_fullStr Platelet-Based Liquid Biopsies through the Lens of Machine Learning
title_full_unstemmed Platelet-Based Liquid Biopsies through the Lens of Machine Learning
title_short Platelet-Based Liquid Biopsies through the Lens of Machine Learning
title_sort platelet-based liquid biopsies through the lens of machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136732/
https://www.ncbi.nlm.nih.gov/pubmed/37190262
http://dx.doi.org/10.3390/cancers15082336
work_keys_str_mv AT cygertsebastian plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT pastuszakkrzysztof plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT gorskifranciszek plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT sieczczynskimichał plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT juszczykpiotr plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT rutkowskiantoni plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT lewalskisebastian plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT rozanskirobert plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT jopekmaksymalbin plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT jassemjacek plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT czyzewskiandrzej plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT wurdingerthomas plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT bestmyrong plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT zaczekannaj plateletbasedliquidbiopsiesthroughthelensofmachinelearning
AT supernatanna plateletbasedliquidbiopsiesthroughthelensofmachinelearning