Cargando…

The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data

MOTIVATION: The advent of highly multiplexed in situ imaging cytometry assays has revolutionized the study of cellular systems, offering unparalleled detail in observing cellular activities and characteristics. These assays provide comprehensive insights by concurrently profiling the spatial distrib...

Descripción completa

Detalles Bibliográficos
Autores principales: Willie, Elijah, Yang, Pengyi, Patrick, Ellis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625459/
https://www.ncbi.nlm.nih.gov/pubmed/37928340
http://dx.doi.org/10.1093/bioadv/vbad141
_version_ 1785131137792737280
author Willie, Elijah
Yang, Pengyi
Patrick, Ellis
author_facet Willie, Elijah
Yang, Pengyi
Patrick, Ellis
author_sort Willie, Elijah
collection PubMed
description MOTIVATION: The advent of highly multiplexed in situ imaging cytometry assays has revolutionized the study of cellular systems, offering unparalleled detail in observing cellular activities and characteristics. These assays provide comprehensive insights by concurrently profiling the spatial distribution and molecular features of numerous cells. In navigating this complex data landscape, unsupervised machine learning techniques, particularly clustering algorithms, have become essential tools. They enable the identification and categorization of cell types and subsets based on their molecular characteristics. Despite their widespread adoption, most clustering algorithms in use were initially developed for cell suspension technologies, leading to a potential mismatch in application. There is a critical gap in the systematic evaluation of these methods, particularly in determining the properties that make them optimal for in situ imaging assays. Addressing this gap is vital for ensuring accurate, reliable analyses and fostering advancements in cellular biology research. RESULTS: In our extensive investigation, we evaluated a range of similarity metrics, which are crucial in determining the relationships between cells during the clustering process. Our findings reveal substantial variations in clustering performance, contingent on the similarity metric employed. These variations underscore the importance of selecting appropriate metrics to ensure accurate cell type and subset identification. In response to these challenges, we introduce FuseSOM, a novel ensemble clustering algorithm that integrates hierarchical multiview learning of similarity metrics with self-organizing maps. Through a rigorous stratified subsampling analysis framework, we demonstrate that FuseSOM outperforms existing best-practice clustering methods specifically tailored for in situ imaging cytometry data. Our work not only provides critical insights into the performance of clustering algorithms in this novel context but also offers a robust solution, paving the way for more accurate and reliable in situ imaging cytometry data analysis. AVAILABILITY AND IMPLEMENTATION: The FuseSOM R package is available on Bioconductor and is available under the GPL-3 license. All the codes for the analysis performed can be found at Github.
format Online
Article
Text
id pubmed-10625459
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106254592023-11-05 The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data Willie, Elijah Yang, Pengyi Patrick, Ellis Bioinform Adv Original Article MOTIVATION: The advent of highly multiplexed in situ imaging cytometry assays has revolutionized the study of cellular systems, offering unparalleled detail in observing cellular activities and characteristics. These assays provide comprehensive insights by concurrently profiling the spatial distribution and molecular features of numerous cells. In navigating this complex data landscape, unsupervised machine learning techniques, particularly clustering algorithms, have become essential tools. They enable the identification and categorization of cell types and subsets based on their molecular characteristics. Despite their widespread adoption, most clustering algorithms in use were initially developed for cell suspension technologies, leading to a potential mismatch in application. There is a critical gap in the systematic evaluation of these methods, particularly in determining the properties that make them optimal for in situ imaging assays. Addressing this gap is vital for ensuring accurate, reliable analyses and fostering advancements in cellular biology research. RESULTS: In our extensive investigation, we evaluated a range of similarity metrics, which are crucial in determining the relationships between cells during the clustering process. Our findings reveal substantial variations in clustering performance, contingent on the similarity metric employed. These variations underscore the importance of selecting appropriate metrics to ensure accurate cell type and subset identification. In response to these challenges, we introduce FuseSOM, a novel ensemble clustering algorithm that integrates hierarchical multiview learning of similarity metrics with self-organizing maps. Through a rigorous stratified subsampling analysis framework, we demonstrate that FuseSOM outperforms existing best-practice clustering methods specifically tailored for in situ imaging cytometry data. Our work not only provides critical insights into the performance of clustering algorithms in this novel context but also offers a robust solution, paving the way for more accurate and reliable in situ imaging cytometry data analysis. AVAILABILITY AND IMPLEMENTATION: The FuseSOM R package is available on Bioconductor and is available under the GPL-3 license. All the codes for the analysis performed can be found at Github. Oxford University Press 2023-10-09 /pmc/articles/PMC10625459/ /pubmed/37928340 http://dx.doi.org/10.1093/bioadv/vbad141 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Willie, Elijah
Yang, Pengyi
Patrick, Ellis
The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
title The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
title_full The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
title_fullStr The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
title_full_unstemmed The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
title_short The impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
title_sort impact of similarity metrics on cell-type clustering in highly multiplexed in situ imaging cytometry data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10625459/
https://www.ncbi.nlm.nih.gov/pubmed/37928340
http://dx.doi.org/10.1093/bioadv/vbad141
work_keys_str_mv AT willieelijah theimpactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata
AT yangpengyi theimpactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata
AT patrickellis theimpactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata
AT willieelijah impactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata
AT yangpengyi impactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata
AT patrickellis impactofsimilaritymetricsoncelltypeclusteringinhighlymultiplexedinsituimagingcytometrydata