Cargando…

Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data

MOTIVATION: The application of machine learning (ML) techniques in the medical field has demonstrated both successes and challenges in the precision medicine era. The ability to accurately classify a subject as a potential responder versus a nonresponder to a given therapy is still an active area of...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ferrato, Mauricio H, Marsh, Adam G, Franke, Karl R, Huang, Benjamin J, Kolb, E Anders, DeRyckere, Deborah, Grahm, Douglas K, Chandrasekaran, Sunita, Crowgey, Erin L
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10209528/ https://www.ncbi.nlm.nih.gov/pubmed/37250111 http://dx.doi.org/10.1093/bioadv/vbad034

_version_	1785046894786904064
author	Ferrato, Mauricio H Marsh, Adam G Franke, Karl R Huang, Benjamin J Kolb, E Anders DeRyckere, Deborah Grahm, Douglas K Chandrasekaran, Sunita Crowgey, Erin L
author_facet	Ferrato, Mauricio H Marsh, Adam G Franke, Karl R Huang, Benjamin J Kolb, E Anders DeRyckere, Deborah Grahm, Douglas K Chandrasekaran, Sunita Crowgey, Erin L
author_sort	Ferrato, Mauricio H
collection	PubMed
description	MOTIVATION: The application of machine learning (ML) techniques in the medical field has demonstrated both successes and challenges in the precision medicine era. The ability to accurately classify a subject as a potential responder versus a nonresponder to a given therapy is still an active area of research pushing the field to create new approaches for applying machine-learning techniques. In this study, we leveraged publicly available data through the BeatAML initiative. Specifically, we used gene count data, generated via RNA-seq, from 451 individuals matched with ex vivo data generated from treatment with RTK-type-III inhibitors. Three feature selection techniques were tested, principal component analysis, Shapley Additive Explanation (SHAP) technique and differential gene expression analysis, with three different classifiers, XGBoost, LightGBM and random forest (RF). Sensitivity versus specificity was analyzed using the area under the curve (AUC)-receiver operating curves (ROCs) for every model developed. RESULTS: Our work demonstrated that feature selection technique, rather than the classifier, had the greatest impact on model performance. The SHAP technique outperformed the other feature selection techniques and was able to with high accuracy predict outcome response, with the highest performing model: Foretinib with 89% AUC using the SHAP technique and RF classifier. Our ML pipelines demonstrate that at the time of diagnosis, a transcriptomics signature exists that can potentially predict response to treatment, demonstrating the potential of using ML applications in precision medicine efforts. AVAILABILITY AND IMPLEMENTATION: https://github.com/UD-CRPL/RCDML. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format	Online Article Text
id	pubmed-10209528
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-102095282023-05-26 Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data Ferrato, Mauricio H Marsh, Adam G Franke, Karl R Huang, Benjamin J Kolb, E Anders DeRyckere, Deborah Grahm, Douglas K Chandrasekaran, Sunita Crowgey, Erin L Bioinform Adv Original Paper MOTIVATION: The application of machine learning (ML) techniques in the medical field has demonstrated both successes and challenges in the precision medicine era. The ability to accurately classify a subject as a potential responder versus a nonresponder to a given therapy is still an active area of research pushing the field to create new approaches for applying machine-learning techniques. In this study, we leveraged publicly available data through the BeatAML initiative. Specifically, we used gene count data, generated via RNA-seq, from 451 individuals matched with ex vivo data generated from treatment with RTK-type-III inhibitors. Three feature selection techniques were tested, principal component analysis, Shapley Additive Explanation (SHAP) technique and differential gene expression analysis, with three different classifiers, XGBoost, LightGBM and random forest (RF). Sensitivity versus specificity was analyzed using the area under the curve (AUC)-receiver operating curves (ROCs) for every model developed. RESULTS: Our work demonstrated that feature selection technique, rather than the classifier, had the greatest impact on model performance. The SHAP technique outperformed the other feature selection techniques and was able to with high accuracy predict outcome response, with the highest performing model: Foretinib with 89% AUC using the SHAP technique and RF classifier. Our ML pipelines demonstrate that at the time of diagnosis, a transcriptomics signature exists that can potentially predict response to treatment, demonstrating the potential of using ML applications in precision medicine efforts. AVAILABILITY AND IMPLEMENTATION: https://github.com/UD-CRPL/RCDML. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2023-03-22 /pmc/articles/PMC10209528/ /pubmed/37250111 http://dx.doi.org/10.1093/bioadv/vbad034 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Paper Ferrato, Mauricio H Marsh, Adam G Franke, Karl R Huang, Benjamin J Kolb, E Anders DeRyckere, Deborah Grahm, Douglas K Chandrasekaran, Sunita Crowgey, Erin L Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data
title	Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data
title_full	Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data
title_fullStr	Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data
title_full_unstemmed	Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data
title_short	Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data
title_sort	machine learning classifier approaches for predicting response to rtk-type-iii inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10209528/ https://www.ncbi.nlm.nih.gov/pubmed/37250111 http://dx.doi.org/10.1093/bioadv/vbad034
work_keys_str_mv	AT ferratomauricioh machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT marshadamg machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT frankekarlr machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT huangbenjaminj machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT kolbeanders machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT deryckeredeborah machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT grahmdouglask machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT chandrasekaransunita machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata AT crowgeyerinl machinelearningclassifierapproachesforpredictingresponsetortktypeiiiinhibitorsdemonstratehighaccuracyusingtranscriptomicsignaturesandexvivodata

Machine learning classifier approaches for predicting response to RTK-type-III inhibitors demonstrate high accuracy using transcriptomic signatures and ex vivo data

Ejemplares similares