Cargando…
Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain
High-throughput, spatially resolved gene expression techniques are poised to be transformative across biology by overcoming a central limitation in single-cell biology: the lack of information on relationships that organize the cells into the functional groupings characteristic of tissues in complex...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321401/ https://www.ncbi.nlm.nih.gov/pubmed/34280183 http://dx.doi.org/10.1371/journal.pbio.3001341 |
_version_ | 1783730842395213824 |
---|---|
author | Lu, Shaina Ortiz, Cantin Fürth, Daniel Fischer, Stephan Meletis, Konstantinos Zador, Anthony Gillis, Jesse |
author_facet | Lu, Shaina Ortiz, Cantin Fürth, Daniel Fischer, Stephan Meletis, Konstantinos Zador, Anthony Gillis, Jesse |
author_sort | Lu, Shaina |
collection | PubMed |
description | High-throughput, spatially resolved gene expression techniques are poised to be transformative across biology by overcoming a central limitation in single-cell biology: the lack of information on relationships that organize the cells into the functional groupings characteristic of tissues in complex multicellular organisms. Spatial expression is particularly interesting in the mammalian brain, which has a highly defined structure, strong spatial constraint in its organization, and detailed multimodal phenotypes for cells and ensembles of cells that can be linked to mesoscale properties such as projection patterns, and from there, to circuits generating behavior. However, as with any type of expression data, cross-dataset benchmarking of spatial data is a crucial first step. Here, we assess the replicability, with reference to canonical brain subdivisions, between the Allen Institute’s in situ hybridization data from the adult mouse brain (Allen Brain Atlas (ABA)) and a similar dataset collected using spatial transcriptomics (ST). With the advent of tractable spatial techniques, for the first time, we are able to benchmark the Allen Institute’s whole-brain, whole-transcriptome spatial expression dataset with a second independent dataset that similarly spans the whole brain and transcriptome. We use regularized linear regression (LASSO), linear regression, and correlation-based feature selection in a supervised learning framework to classify expression samples relative to their assayed location. We show that Allen Reference Atlas labels are classifiable using transcription in both data sets, but that performance is higher in the ABA than in ST. Furthermore, models trained in one dataset and tested in the opposite dataset do not reproduce classification performance bidirectionally. While an identifying expression profile can be found for a given brain area, it does not generalize to the opposite dataset. In general, we found that canonical brain area labels are classifiable in gene expression space within dataset and that our observed performance is not merely reflecting physical distance in the brain. However, we also show that cross-platform classification is not robust. Emerging spatial datasets from the mouse brain will allow further characterization of cross-dataset replicability ultimately providing a valuable reference set for understanding the cell biology of the brain. |
format | Online Article Text |
id | pubmed-8321401 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83214012021-07-31 Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain Lu, Shaina Ortiz, Cantin Fürth, Daniel Fischer, Stephan Meletis, Konstantinos Zador, Anthony Gillis, Jesse PLoS Biol Methods and Resources High-throughput, spatially resolved gene expression techniques are poised to be transformative across biology by overcoming a central limitation in single-cell biology: the lack of information on relationships that organize the cells into the functional groupings characteristic of tissues in complex multicellular organisms. Spatial expression is particularly interesting in the mammalian brain, which has a highly defined structure, strong spatial constraint in its organization, and detailed multimodal phenotypes for cells and ensembles of cells that can be linked to mesoscale properties such as projection patterns, and from there, to circuits generating behavior. However, as with any type of expression data, cross-dataset benchmarking of spatial data is a crucial first step. Here, we assess the replicability, with reference to canonical brain subdivisions, between the Allen Institute’s in situ hybridization data from the adult mouse brain (Allen Brain Atlas (ABA)) and a similar dataset collected using spatial transcriptomics (ST). With the advent of tractable spatial techniques, for the first time, we are able to benchmark the Allen Institute’s whole-brain, whole-transcriptome spatial expression dataset with a second independent dataset that similarly spans the whole brain and transcriptome. We use regularized linear regression (LASSO), linear regression, and correlation-based feature selection in a supervised learning framework to classify expression samples relative to their assayed location. We show that Allen Reference Atlas labels are classifiable using transcription in both data sets, but that performance is higher in the ABA than in ST. Furthermore, models trained in one dataset and tested in the opposite dataset do not reproduce classification performance bidirectionally. While an identifying expression profile can be found for a given brain area, it does not generalize to the opposite dataset. In general, we found that canonical brain area labels are classifiable in gene expression space within dataset and that our observed performance is not merely reflecting physical distance in the brain. However, we also show that cross-platform classification is not robust. Emerging spatial datasets from the mouse brain will allow further characterization of cross-dataset replicability ultimately providing a valuable reference set for understanding the cell biology of the brain. Public Library of Science 2021-07-19 /pmc/articles/PMC8321401/ /pubmed/34280183 http://dx.doi.org/10.1371/journal.pbio.3001341 Text en © 2021 Lu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Methods and Resources Lu, Shaina Ortiz, Cantin Fürth, Daniel Fischer, Stephan Meletis, Konstantinos Zador, Anthony Gillis, Jesse Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain |
title | Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain |
title_full | Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain |
title_fullStr | Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain |
title_full_unstemmed | Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain |
title_short | Assessing the replicability of spatial gene expression using atlas data from the adult mouse brain |
title_sort | assessing the replicability of spatial gene expression using atlas data from the adult mouse brain |
topic | Methods and Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321401/ https://www.ncbi.nlm.nih.gov/pubmed/34280183 http://dx.doi.org/10.1371/journal.pbio.3001341 |
work_keys_str_mv | AT lushaina assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain AT ortizcantin assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain AT furthdaniel assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain AT fischerstephan assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain AT meletiskonstantinos assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain AT zadoranthony assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain AT gillisjesse assessingthereplicabilityofspatialgeneexpressionusingatlasdatafromtheadultmousebrain |