Cargando…
The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
MOTIVATION: The advent of highly multiplexed in situ imaging cytometry assays has revolutionized the study of cellular systems, offering unparalleled detail in observing cellular activities and characteristics. These assays provide comprehensive insights by concurrently profiling the spatial distrib...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625459/ https://www.ncbi.nlm.nih.gov/pubmed/37928340 http://dx.doi.org/10.1093/bioadv/vbad141 |
_version_ | 1785131137792737280 |
---|---|
author | Willie, Elijah Yang, Pengyi Patrick, Ellis |
author_facet | Willie, Elijah Yang, Pengyi Patrick, Ellis |
author_sort | Willie, Elijah |
collection | PubMed |
description | MOTIVATION: The advent of highly multiplexed in situ imaging cytometry assays has revolutionized the study of cellular systems, offering unparalleled detail in observing cellular activities and characteristics. These assays provide comprehensive insights by concurrently profiling the spatial distribution and molecular features of numerous cells. In navigating this complex data landscape, unsupervised machine learning techniques, particularly clustering algorithms, have become essential tools. They enable the identification and categorization of cell types and subsets based on their molecular characteristics. Despite their widespread adoption, most clustering algorithms in use were initially developed for cell suspension technologies, leading to a potential mismatch in application. There is a critical gap in the systematic evaluation of these methods, particularly in determining the properties that make them optimal for in situ imaging assays. Addressing this gap is vital for ensuring accurate, reliable analyses and fostering advancements in cellular biology research. RESULTS: In our extensive investigation, we evaluated a range of similarity metrics, which are crucial in determining the relationships between cells during the clustering process. Our findings reveal substantial variations in clustering performance, contingent on the similarity metric employed. These variations underscore the importance of selecting appropriate metrics to ensure accurate cell type and subset identification. In response to these challenges, we introduce FuseSOM, a novel ensemble clustering algorithm that integrates hierarchical multiview learning of similarity metrics with self-organizing maps. Through a rigorous stratified subsampling analysis framework, we demonstrate that FuseSOM outperforms existing best-practice clustering methods specifically tailored for in situ imaging cytometry data. Our work not only provides critical insights into the performance of clustering algorithms in this novel context but also offers a robust solution, paving the way for more accurate and reliable in situ imaging cytometry data analysis. AVAILABILITY AND IMPLEMENTATION: The FuseSOM R package is available on Bioconductor and is available under the GPL-3 license. All the codes for the analysis performed can be found at Github. |
format | Online Article Text |
id | pubmed-10625459 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106254592023-11-05 The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data Willie, Elijah Yang, Pengyi Patrick, Ellis Bioinform Adv Original Article MOTIVATION: The advent of highly multiplexed in situ imaging cytometry assays has revolutionized the study of cellular systems, offering unparalleled detail in observing cellular activities and characteristics. These assays provide comprehensive insights by concurrently profiling the spatial distribution and molecular features of numerous cells. In navigating this complex data landscape, unsupervised machine learning techniques, particularly clustering algorithms, have become essential tools. They enable the identification and categorization of cell types and subsets based on their molecular characteristics. Despite their widespread adoption, most clustering algorithms in use were initially developed for cell suspension technologies, leading to a potential mismatch in application. There is a critical gap in the systematic evaluation of these methods, particularly in determining the properties that make them optimal for in situ imaging assays. Addressing this gap is vital for ensuring accurate, reliable analyses and fostering advancements in cellular biology research. RESULTS: In our extensive investigation, we evaluated a range of similarity metrics, which are crucial in determining the relationships between cells during the clustering process. Our findings reveal substantial variations in clustering performance, contingent on the similarity metric employed. These variations underscore the importance of selecting appropriate metrics to ensure accurate cell type and subset identification. In response to these challenges, we introduce FuseSOM, a novel ensemble clustering algorithm that integrates hierarchical multiview learning of similarity metrics with self-organizing maps. Through a rigorous stratified subsampling analysis framework, we demonstrate that FuseSOM outperforms existing best-practice clustering methods specifically tailored for in situ imaging cytometry data. Our work not only provides critical insights into the performance of clustering algorithms in this novel context but also offers a robust solution, paving the way for more accurate and reliable in situ imaging cytometry data analysis. AVAILABILITY AND IMPLEMENTATION: The FuseSOM R package is available on Bioconductor and is available under the GPL-3 license. All the codes for the analysis performed can be found at Github. Oxford University Press 2023-10-09 /pmc/articles/PMC10625459/ /pubmed/37928340 http://dx.doi.org/10.1093/bioadv/vbad141 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Willie, Elijah Yang, Pengyi Patrick, Ellis The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data |
title | The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data |
title_full | The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data |
title_fullStr | The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data |
title_full_unstemmed | The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data |
title_short | The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data |
title_sort | impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625459/ https://www.ncbi.nlm.nih.gov/pubmed/37928340 http://dx.doi.org/10.1093/bioadv/vbad141 |
work_keys_str_mv | AT willieelijah theimpactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata AT yangpengyi theimpactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata AT patrickellis theimpactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata AT willieelijah impactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata AT yangpengyi impactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata AT patrickellis impactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata |