Cargando…

Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.

PURPOSE: Meta-analyses failed to accurately identify patients with non-metastatic breast cancer who are likely to benefit from chemotherapy, and metabolomics could provide new answers. In our previous published work, patients were clustered using five different unsupervised machine learning (ML) met...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bailleux, Caroline, Chardin, David, Guigonis, Jean-Marie, Ferrero, Jean-Marc, Chateau, Yann, Humbert, Olivier, Pourcher, Thierry, Gal, Jocelyn
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Research Network of Computational and Structural Biotechnology 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10618114/ https://www.ncbi.nlm.nih.gov/pubmed/37920813 http://dx.doi.org/10.1016/j.csbj.2023.10.033

_version_	1785129707531927552
author	Bailleux, Caroline Chardin, David Guigonis, Jean-Marie Ferrero, Jean-Marc Chateau, Yann Humbert, Olivier Pourcher, Thierry Gal, Jocelyn
author_facet	Bailleux, Caroline Chardin, David Guigonis, Jean-Marie Ferrero, Jean-Marc Chateau, Yann Humbert, Olivier Pourcher, Thierry Gal, Jocelyn
author_sort	Bailleux, Caroline
collection	PubMed
description	PURPOSE: Meta-analyses failed to accurately identify patients with non-metastatic breast cancer who are likely to benefit from chemotherapy, and metabolomics could provide new answers. In our previous published work, patients were clustered using five different unsupervised machine learning (ML) methods resulting in the identification of three clusters with distinct clinical and simulated survival data. The objective of this study was to evaluate the survival outcomes, with extended follow-up, using the same 5 different methods of unsupervised machine learning. EXPERIMENTAL DESIGN: Forty-nine patients, diagnosed between 2013 and 2016, with non-metastatic BC were included retrospectively. Median follow-up was extended to 85.8 months. 449 metabolites were extracted from tumor resection samples by combined Liquid chromatography-mass spectrometry (LC–MS). Survival analyses were reported grouping together Cluster 1 and 2 versus cluster 3. Bootstrap optimization was applied. RESULTS: PCA k-means, K-sparse and Spectral clustering were the most effective methods to predict 2-year progression-free survival with bootstrap optimization (PFSb); as bootstrap example, with PCA k-means method, PFSb were 94% for cluster 1&2 versus 82% for cluster 3 (p = 0.01). PCA k-means method performed best, with higher reproducibility (mean HR=2 (95%CI [1.4–2.7]); probability of p ≤ 0.05 85%). Cancer-specific survival (CSS) and overall survival (OS) analyses highlighted a discrepancy between the 5 ML unsupervised methods. CONCLUSION: Our study is a proof-of-principle that it is possible to use unsupervised ML methods on metabolomic data to predict PFS survival outcomes, with the best performance for PCA k-means. A larger population study is needed to draw conclusions from CSS and OS analyses.
format	Online Article Text
id	pubmed-10618114
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Research Network of Computational and Structural Biotechnology
record_format	MEDLINE/PubMed
spelling	pubmed-106181142023-11-02 Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data. Bailleux, Caroline Chardin, David Guigonis, Jean-Marie Ferrero, Jean-Marc Chateau, Yann Humbert, Olivier Pourcher, Thierry Gal, Jocelyn Comput Struct Biotechnol J Research Article PURPOSE: Meta-analyses failed to accurately identify patients with non-metastatic breast cancer who are likely to benefit from chemotherapy, and metabolomics could provide new answers. In our previous published work, patients were clustered using five different unsupervised machine learning (ML) methods resulting in the identification of three clusters with distinct clinical and simulated survival data. The objective of this study was to evaluate the survival outcomes, with extended follow-up, using the same 5 different methods of unsupervised machine learning. EXPERIMENTAL DESIGN: Forty-nine patients, diagnosed between 2013 and 2016, with non-metastatic BC were included retrospectively. Median follow-up was extended to 85.8 months. 449 metabolites were extracted from tumor resection samples by combined Liquid chromatography-mass spectrometry (LC–MS). Survival analyses were reported grouping together Cluster 1 and 2 versus cluster 3. Bootstrap optimization was applied. RESULTS: PCA k-means, K-sparse and Spectral clustering were the most effective methods to predict 2-year progression-free survival with bootstrap optimization (PFSb); as bootstrap example, with PCA k-means method, PFSb were 94% for cluster 1&2 versus 82% for cluster 3 (p = 0.01). PCA k-means method performed best, with higher reproducibility (mean HR=2 (95%CI [1.4–2.7]); probability of p ≤ 0.05 85%). Cancer-specific survival (CSS) and overall survival (OS) analyses highlighted a discrepancy between the 5 ML unsupervised methods. CONCLUSION: Our study is a proof-of-principle that it is possible to use unsupervised ML methods on metabolomic data to predict PFS survival outcomes, with the best performance for PCA k-means. A larger population study is needed to draw conclusions from CSS and OS analyses. Research Network of Computational and Structural Biotechnology 2023-10-19 /pmc/articles/PMC10618114/ /pubmed/37920813 http://dx.doi.org/10.1016/j.csbj.2023.10.033 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle	Research Article Bailleux, Caroline Chardin, David Guigonis, Jean-Marie Ferrero, Jean-Marc Chateau, Yann Humbert, Olivier Pourcher, Thierry Gal, Jocelyn Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.
title	Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.
title_full	Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.
title_fullStr	Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.
title_full_unstemmed	Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.
title_short	Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.
title_sort	survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10618114/ https://www.ncbi.nlm.nih.gov/pubmed/37920813 http://dx.doi.org/10.1016/j.csbj.2023.10.033
work_keys_str_mv	AT bailleuxcaroline survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata AT chardindavid survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata AT guigonisjeanmarie survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata AT ferrerojeanmarc survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata AT chateauyann survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata AT humbertolivier survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata AT pourcherthierry survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata AT galjocelyn survivalanalysisofpatientgroupsdefinedbyunsupervisedmachinelearningclusteringmethodsbasedonpatientmetabolomicdata

Survival analysis of patient groups defined by unsupervised machine learning clustering methods based on patient metabolomic data.

Ejemplares similares