Cargando…

SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data

MOTIVATION: Recent advances in spatial proteomics technologies have enabled the profiling of dozens of proteins in thousands of single cells in situ. This has created the opportunity to move beyond quantifying the composition of cell types in tissue, and instead probe the spatial relationships betwe...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Eric, Chern, Kevin, Nissen, Michael, Wang, Xuehai, Huang, Chris, Gandhi, Anita K, Bouchard-Côté, Alexandre, Weng, Andrew P, Roth, Andrew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311307/
https://www.ncbi.nlm.nih.gov/pubmed/37387130
http://dx.doi.org/10.1093/bioinformatics/btad242
_version_ 1785066715351089152
author Lee, Eric
Chern, Kevin
Nissen, Michael
Wang, Xuehai
Huang, Chris
Gandhi, Anita K
Bouchard-Côté, Alexandre
Weng, Andrew P
Roth, Andrew
author_facet Lee, Eric
Chern, Kevin
Nissen, Michael
Wang, Xuehai
Huang, Chris
Gandhi, Anita K
Bouchard-Côté, Alexandre
Weng, Andrew P
Roth, Andrew
author_sort Lee, Eric
collection PubMed
description MOTIVATION: Recent advances in spatial proteomics technologies have enabled the profiling of dozens of proteins in thousands of single cells in situ. This has created the opportunity to move beyond quantifying the composition of cell types in tissue, and instead probe the spatial relationships between cells. However, most current methods for clustering data from these assays only consider the expression values of cells and ignore the spatial context. Furthermore, existing approaches do not account for prior information about the expected cell populations in a sample. RESULTS: To address these shortcomings, we developed SpatialSort, a spatially aware Bayesian clustering approach that allows for the incorporation of prior biological knowledge. Our method is able to account for the affinities of cells of different types to neighbour in space, and by incorporating prior information about expected cell populations, it is able to simultaneously improve clustering accuracy and perform automated annotation of clusters. Using synthetic and real data, we show that by using spatial and prior information SpatialSort improves clustering accuracy. We also demonstrate how SpatialSort can perform label transfer between spatial and nonspatial modalities through the analysis of a real world diffuse large B-cell lymphoma dataset. AVAILABILITY AND IMPLEMENTATION: Source code is available on Github at: https://github.com/Roth-Lab/SpatialSort.
format Online
Article
Text
id pubmed-10311307
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103113072023-07-01 SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data Lee, Eric Chern, Kevin Nissen, Michael Wang, Xuehai Huang, Chris Gandhi, Anita K Bouchard-Côté, Alexandre Weng, Andrew P Roth, Andrew Bioinformatics Biomedical Informatics MOTIVATION: Recent advances in spatial proteomics technologies have enabled the profiling of dozens of proteins in thousands of single cells in situ. This has created the opportunity to move beyond quantifying the composition of cell types in tissue, and instead probe the spatial relationships between cells. However, most current methods for clustering data from these assays only consider the expression values of cells and ignore the spatial context. Furthermore, existing approaches do not account for prior information about the expected cell populations in a sample. RESULTS: To address these shortcomings, we developed SpatialSort, a spatially aware Bayesian clustering approach that allows for the incorporation of prior biological knowledge. Our method is able to account for the affinities of cells of different types to neighbour in space, and by incorporating prior information about expected cell populations, it is able to simultaneously improve clustering accuracy and perform automated annotation of clusters. Using synthetic and real data, we show that by using spatial and prior information SpatialSort improves clustering accuracy. We also demonstrate how SpatialSort can perform label transfer between spatial and nonspatial modalities through the analysis of a real world diffuse large B-cell lymphoma dataset. AVAILABILITY AND IMPLEMENTATION: Source code is available on Github at: https://github.com/Roth-Lab/SpatialSort. Oxford University Press 2023-06-30 /pmc/articles/PMC10311307/ /pubmed/37387130 http://dx.doi.org/10.1093/bioinformatics/btad242 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Biomedical Informatics
Lee, Eric
Chern, Kevin
Nissen, Michael
Wang, Xuehai
Huang, Chris
Gandhi, Anita K
Bouchard-Côté, Alexandre
Weng, Andrew P
Roth, Andrew
SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data
title SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data
title_full SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data
title_fullStr SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data
title_full_unstemmed SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data
title_short SpatialSort: a Bayesian model for clustering and cell population annotation of spatial proteomics data
title_sort spatialsort: a bayesian model for clustering and cell population annotation of spatial proteomics data
topic Biomedical Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10311307/
https://www.ncbi.nlm.nih.gov/pubmed/37387130
http://dx.doi.org/10.1093/bioinformatics/btad242
work_keys_str_mv AT leeeric spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT chernkevin spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT nissenmichael spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT wangxuehai spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT huangchris spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT gandhianitak spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT bouchardcotealexandre spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT wengandrewp spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata
AT rothandrew spatialsortabayesianmodelforclusteringandcellpopulationannotationofspatialproteomicsdata