Cargando…

Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain

High-throughput, spatially resolved gene expression techniques are poised to be transformative across biology by overcoming a central limitation in single-cell biology: the lack of information on relationships that organize the cells into the functional groupings characteristic of tissues in complex...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Shaina, Ortiz, Cantin, Fürth, Daniel, Fischer, Stephan, Meletis, Konstantinos, Zador, Anthony, Gillis, Jesse
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321401/
https://www.ncbi.nlm.nih.gov/pubmed/34280183
http://dx.doi.org/10.1371/journal.pbio.3001341
_version_ 1783730842395213824
author Lu, Shaina
Ortiz, Cantin
Fürth, Daniel
Fischer, Stephan
Meletis, Konstantinos
Zador, Anthony
Gillis, Jesse
author_facet Lu, Shaina
Ortiz, Cantin
Fürth, Daniel
Fischer, Stephan
Meletis, Konstantinos
Zador, Anthony
Gillis, Jesse
author_sort Lu, Shaina
collection PubMed
description High-throughput, spatially resolved gene expression techniques are poised to be transformative across biology by overcoming a central limitation in single-cell biology: the lack of information on relationships that organize the cells into the functional groupings characteristic of tissues in complex multicellular organisms. Spatial expression is particularly interesting in the mammalian brain, which has a highly defined structure, strong spatial constraint in its organization, and detailed multimodal phenotypes for cells and ensembles of cells that can be linked to mesoscale properties such as projection patterns, and from there, to circuits generating behavior. However, as with any type of expression data, cross-dataset benchmarking of spatial data is a crucial first step. Here, we assess the replicability, with reference to canonical brain subdivisions, between the Allen Institute’s in situ hybridization data from the adult mouse brain (Allen Brain Atlas (ABA)) and a similar dataset collected using spatial transcriptomics (ST). With the advent of tractable spatial techniques, for the first time, we are able to benchmark the Allen Institute’s whole-brain, whole-transcriptome spatial expression dataset with a second independent dataset that similarly spans the whole brain and transcriptome. We use regularized linear regression (LASSO), linear regression, and correlation-based feature selection in a supervised learning framework to classify expression samples relative to their assayed location. We show that Allen Reference Atlas labels are classifiable using transcription in both data sets, but that performance is higher in the ABA than in ST. Furthermore, models trained in one dataset and tested in the opposite dataset do not reproduce classification performance bidirectionally. While an identifying expression profile can be found for a given brain area, it does not generalize to the opposite dataset. In general, we found that canonical brain area labels are classifiable in gene expression space within dataset and that our observed performance is not merely reflecting physical distance in the brain. However, we also show that cross-platform classification is not robust. Emerging spatial datasets from the mouse brain will allow further characterization of cross-dataset replicability ultimately providing a valuable reference set for understanding the cell biology of the brain.
format Online
Article
Text
id pubmed-8321401
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83214012021-07-31 Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain Lu, Shaina Ortiz, Cantin Fürth, Daniel Fischer, Stephan Meletis, Konstantinos Zador, Anthony Gillis, Jesse PLoS Biol Methods and Resources High-throughput, spatially resolved gene expression techniques are poised to be transformative across biology by overcoming a central limitation in single-cell biology: the lack of information on relationships that organize the cells into the functional groupings characteristic of tissues in complex multicellular organisms. Spatial expression is particularly interesting in the mammalian brain, which has a highly defined structure, strong spatial constraint in its organization, and detailed multimodal phenotypes for cells and ensembles of cells that can be linked to mesoscale properties such as projection patterns, and from there, to circuits generating behavior. However, as with any type of expression data, cross-dataset benchmarking of spatial data is a crucial first step. Here, we assess the replicability, with reference to canonical brain subdivisions, between the Allen Institute’s in situ hybridization data from the adult mouse brain (Allen Brain Atlas (ABA)) and a similar dataset collected using spatial transcriptomics (ST). With the advent of tractable spatial techniques, for the first time, we are able to benchmark the Allen Institute’s whole-brain, whole-transcriptome spatial expression dataset with a second independent dataset that similarly spans the whole brain and transcriptome. We use regularized linear regression (LASSO), linear regression, and correlation-based feature selection in a supervised learning framework to classify expression samples relative to their assayed location. We show that Allen Reference Atlas labels are classifiable using transcription in both data sets, but that performance is higher in the ABA than in ST. Furthermore, models trained in one dataset and tested in the opposite dataset do not reproduce classification performance bidirectionally. While an identifying expression profile can be found for a given brain area, it does not generalize to the opposite dataset. In general, we found that canonical brain area labels are classifiable in gene expression space within dataset and that our observed performance is not merely reflecting physical distance in the brain. However, we also show that cross-platform classification is not robust. Emerging spatial datasets from the mouse brain will allow further characterization of cross-dataset replicability ultimately providing a valuable reference set for understanding the cell biology of the brain. Public Library of Science 2021-07-19 /pmc/articles/PMC8321401/ /pubmed/34280183 http://dx.doi.org/10.1371/journal.pbio.3001341 Text en © 2021 Lu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Methods and Resources
Lu, Shaina
Ortiz, Cantin
Fürth, Daniel
Fischer, Stephan
Meletis, Konstantinos
Zador, Anthony
Gillis, Jesse
Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
title Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
title_full Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
title_fullStr Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
title_full_unstemmed Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
title_short Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
title_sort assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
topic Methods and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321401/
https://www.ncbi.nlm.nih.gov/pubmed/34280183
http://dx.doi.org/10.1371/journal.pbio.3001341
work_keys_str_mv AT lushaina assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain
AT ortizcantin assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain
AT furthdaniel assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain
AT fischerstephan assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain
AT meletiskonstantinos assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain
AT zadoranthony assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain
AT gillisjesse assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain