Cargando…

MapperPlus: Agnostic clustering of high-dimension data for precision medicine

One of the goals of precision medicine is to classify patients into subgroups that differ in their susceptibility and response to a disease, thereby enabling tailored treatments for each subgroup. Therefore, there is a great need to identify distinctive clusters of patients from patient data. There...

Descripción completa

Detalles Bibliográficos
Autores principales: Datta, Esha, Ballal, Aditya, López, Javier E., Izu, Leighton T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411786/
https://www.ncbi.nlm.nih.gov/pubmed/37556425
http://dx.doi.org/10.1371/journal.pdig.0000307
_version_ 1785086746336165888
author Datta, Esha
Ballal, Aditya
López, Javier E.
Izu, Leighton T.
author_facet Datta, Esha
Ballal, Aditya
López, Javier E.
Izu, Leighton T.
author_sort Datta, Esha
collection PubMed
description One of the goals of precision medicine is to classify patients into subgroups that differ in their susceptibility and response to a disease, thereby enabling tailored treatments for each subgroup. Therefore, there is a great need to identify distinctive clusters of patients from patient data. There are three key challenges to three key challenges of patient stratification: 1) the unknown number of clusters, 2) the need for assessing cluster validity, and 3) the clinical interpretability. We developed MapperPlus, a novel unsupervised clustering pipeline, that directly addresses these challenges. It extends the topological Mapper technique and blends it with two random-walk algorithms to automatically detect disjoint subgroups in patient data. We demonstrate that MapperPlus outperforms traditional agnostic clustering methods in key accuracy/performance metrics by testing its performance on publicly available medical and non-medical data set. We also demonstrate the predictive power of MapperPlus in a medical dataset of pediatric stem cell transplant patients where a number of cluster is unknown. Here, MapperPlus stratifies the patient population into clusters with distinctive survival rates. The MapperPlus software is open-source and publicly available.
format Online
Article
Text
id pubmed-10411786
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104117862023-08-10 MapperPlus: Agnostic clustering of high-dimension data for precision medicine Datta, Esha Ballal, Aditya López, Javier E. Izu, Leighton T. PLOS Digit Health Research Article One of the goals of precision medicine is to classify patients into subgroups that differ in their susceptibility and response to a disease, thereby enabling tailored treatments for each subgroup. Therefore, there is a great need to identify distinctive clusters of patients from patient data. There are three key challenges to three key challenges of patient stratification: 1) the unknown number of clusters, 2) the need for assessing cluster validity, and 3) the clinical interpretability. We developed MapperPlus, a novel unsupervised clustering pipeline, that directly addresses these challenges. It extends the topological Mapper technique and blends it with two random-walk algorithms to automatically detect disjoint subgroups in patient data. We demonstrate that MapperPlus outperforms traditional agnostic clustering methods in key accuracy/performance metrics by testing its performance on publicly available medical and non-medical data set. We also demonstrate the predictive power of MapperPlus in a medical dataset of pediatric stem cell transplant patients where a number of cluster is unknown. Here, MapperPlus stratifies the patient population into clusters with distinctive survival rates. The MapperPlus software is open-source and publicly available. Public Library of Science 2023-08-09 /pmc/articles/PMC10411786/ /pubmed/37556425 http://dx.doi.org/10.1371/journal.pdig.0000307 Text en © 2023 Datta et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Datta, Esha
Ballal, Aditya
López, Javier E.
Izu, Leighton T.
MapperPlus: Agnostic clustering of high-dimension data for precision medicine
title MapperPlus: Agnostic clustering of high-dimension data for precision medicine
title_full MapperPlus: Agnostic clustering of high-dimension data for precision medicine
title_fullStr MapperPlus: Agnostic clustering of high-dimension data for precision medicine
title_full_unstemmed MapperPlus: Agnostic clustering of high-dimension data for precision medicine
title_short MapperPlus: Agnostic clustering of high-dimension data for precision medicine
title_sort mapperplus: agnostic clustering of high-dimension data for precision medicine
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411786/
https://www.ncbi.nlm.nih.gov/pubmed/37556425
http://dx.doi.org/10.1371/journal.pdig.0000307
work_keys_str_mv AT dattaesha mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine
AT ballaladitya mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine
AT lopezjaviere mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine
AT izuleightont mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine