Cargando…

CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis

BACKGROUND: The rise of spatial transcriptomics technologies is leading to new insights about how gene regulation happens in a spatial context. Determining which genes are expressed in similar spatial patterns can reveal gene regulatory relationships across cell types in a tissue. However, many curr...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Yang, McCord, Rachel Patton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8351440/
https://www.ncbi.nlm.nih.gov/pubmed/34372758
http://dx.doi.org/10.1186/s12859-021-04314-1
_version_ 1783735976403664896
author Xu, Yang
McCord, Rachel Patton
author_facet Xu, Yang
McCord, Rachel Patton
author_sort Xu, Yang
collection PubMed
description BACKGROUND: The rise of spatial transcriptomics technologies is leading to new insights about how gene regulation happens in a spatial context. Determining which genes are expressed in similar spatial patterns can reveal gene regulatory relationships across cell types in a tissue. However, many current analysis methods do not take full advantage of the spatial organization of the data, instead treating pixels as independent features. Here, we present CoSTA: a novel approach to learn spatial similarities between gene expression matrices via convolutional neural network (ConvNet) clustering. RESULTS: By analyzing simulated and previously published spatial transcriptomics data, we demonstrate that CoSTA learns spatial relationships between genes in a way that emphasizes broader spatial patterns rather than pixel-level correlation. CoSTA provides a quantitative measure of expression pattern similarity between each pair of genes rather than only classifying genes into categories. We find that CoSTA identifies narrower, but biologically relevant, sets of significantly related genes as compared to other approaches. CONCLUSIONS: The deep learning CoSTA approach provides a different angle to spatial transcriptomics analysis by focusing on the shape of expression patterns, using more information about the positions of neighboring pixels than would an overlap or pixel correlation approach. CoSTA can be applied to any spatial transcriptomics data represented in matrix form and may have future applications to datasets such as histology in which images of different genes are from similar but not identical biological sections. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04314-1.
format Online
Article
Text
id pubmed-8351440
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-83514402021-08-10 CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis Xu, Yang McCord, Rachel Patton BMC Bioinformatics Research Article BACKGROUND: The rise of spatial transcriptomics technologies is leading to new insights about how gene regulation happens in a spatial context. Determining which genes are expressed in similar spatial patterns can reveal gene regulatory relationships across cell types in a tissue. However, many current analysis methods do not take full advantage of the spatial organization of the data, instead treating pixels as independent features. Here, we present CoSTA: a novel approach to learn spatial similarities between gene expression matrices via convolutional neural network (ConvNet) clustering. RESULTS: By analyzing simulated and previously published spatial transcriptomics data, we demonstrate that CoSTA learns spatial relationships between genes in a way that emphasizes broader spatial patterns rather than pixel-level correlation. CoSTA provides a quantitative measure of expression pattern similarity between each pair of genes rather than only classifying genes into categories. We find that CoSTA identifies narrower, but biologically relevant, sets of significantly related genes as compared to other approaches. CONCLUSIONS: The deep learning CoSTA approach provides a different angle to spatial transcriptomics analysis by focusing on the shape of expression patterns, using more information about the positions of neighboring pixels than would an overlap or pixel correlation approach. CoSTA can be applied to any spatial transcriptomics data represented in matrix form and may have future applications to datasets such as histology in which images of different genes are from similar but not identical biological sections. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04314-1. BioMed Central 2021-08-09 /pmc/articles/PMC8351440/ /pubmed/34372758 http://dx.doi.org/10.1186/s12859-021-04314-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Xu, Yang
McCord, Rachel Patton
CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis
title CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis
title_full CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis
title_fullStr CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis
title_full_unstemmed CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis
title_short CoSTA: unsupervised convolutional neural network learning for spatial transcriptomics analysis
title_sort costa: unsupervised convolutional neural network learning for spatial transcriptomics analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8351440/
https://www.ncbi.nlm.nih.gov/pubmed/34372758
http://dx.doi.org/10.1186/s12859-021-04314-1
work_keys_str_mv AT xuyang costaunsupervisedconvolutionalneuralnetworklearningforspatialtranscriptomicsanalysis
AT mccordrachelpatton costaunsupervisedconvolutionalneuralnetworklearningforspatialtranscriptomicsanalysis