Cargando…
MapperPlus: Agnostic clustering of high-dimension data for precision medicine
One of the goals of precision medicine is to classify patients into subgroups that differ in their susceptibility and response to a disease, thereby enabling tailored treatments for each subgroup. Therefore, there is a great need to identify distinctive clusters of patients from patient data. There...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411786/ https://www.ncbi.nlm.nih.gov/pubmed/37556425 http://dx.doi.org/10.1371/journal.pdig.0000307 |
_version_ | 1785086746336165888 |
---|---|
author | Datta, Esha Ballal, Aditya López, Javier E. Izu, Leighton T. |
author_facet | Datta, Esha Ballal, Aditya López, Javier E. Izu, Leighton T. |
author_sort | Datta, Esha |
collection | PubMed |
description | One of the goals of precision medicine is to classify patients into subgroups that differ in their susceptibility and response to a disease, thereby enabling tailored treatments for each subgroup. Therefore, there is a great need to identify distinctive clusters of patients from patient data. There are three key challenges to three key challenges of patient stratification: 1) the unknown number of clusters, 2) the need for assessing cluster validity, and 3) the clinical interpretability. We developed MapperPlus, a novel unsupervised clustering pipeline, that directly addresses these challenges. It extends the topological Mapper technique and blends it with two random-walk algorithms to automatically detect disjoint subgroups in patient data. We demonstrate that MapperPlus outperforms traditional agnostic clustering methods in key accuracy/performance metrics by testing its performance on publicly available medical and non-medical data set. We also demonstrate the predictive power of MapperPlus in a medical dataset of pediatric stem cell transplant patients where a number of cluster is unknown. Here, MapperPlus stratifies the patient population into clusters with distinctive survival rates. The MapperPlus software is open-source and publicly available. |
format | Online Article Text |
id | pubmed-10411786 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-104117862023-08-10 MapperPlus: Agnostic clustering of high-dimension data for precision medicine Datta, Esha Ballal, Aditya López, Javier E. Izu, Leighton T. PLOS Digit Health Research Article One of the goals of precision medicine is to classify patients into subgroups that differ in their susceptibility and response to a disease, thereby enabling tailored treatments for each subgroup. Therefore, there is a great need to identify distinctive clusters of patients from patient data. There are three key challenges to three key challenges of patient stratification: 1) the unknown number of clusters, 2) the need for assessing cluster validity, and 3) the clinical interpretability. We developed MapperPlus, a novel unsupervised clustering pipeline, that directly addresses these challenges. It extends the topological Mapper technique and blends it with two random-walk algorithms to automatically detect disjoint subgroups in patient data. We demonstrate that MapperPlus outperforms traditional agnostic clustering methods in key accuracy/performance metrics by testing its performance on publicly available medical and non-medical data set. We also demonstrate the predictive power of MapperPlus in a medical dataset of pediatric stem cell transplant patients where a number of cluster is unknown. Here, MapperPlus stratifies the patient population into clusters with distinctive survival rates. The MapperPlus software is open-source and publicly available. Public Library of Science 2023-08-09 /pmc/articles/PMC10411786/ /pubmed/37556425 http://dx.doi.org/10.1371/journal.pdig.0000307 Text en © 2023 Datta et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Datta, Esha Ballal, Aditya López, Javier E. Izu, Leighton T. MapperPlus: Agnostic clustering of high-dimension data for precision medicine |
title | MapperPlus: Agnostic clustering of high-dimension data for precision medicine |
title_full | MapperPlus: Agnostic clustering of high-dimension data for precision medicine |
title_fullStr | MapperPlus: Agnostic clustering of high-dimension data for precision medicine |
title_full_unstemmed | MapperPlus: Agnostic clustering of high-dimension data for precision medicine |
title_short | MapperPlus: Agnostic clustering of high-dimension data for precision medicine |
title_sort | mapperplus: agnostic clustering of high-dimension data for precision medicine |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411786/ https://www.ncbi.nlm.nih.gov/pubmed/37556425 http://dx.doi.org/10.1371/journal.pdig.0000307 |
work_keys_str_mv | AT dattaesha mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine AT ballaladitya mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine AT lopezjaviere mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine AT izuleightont mapperplusagnosticclusteringofhighdimensiondataforprecisionmedicine |