Cargando…

Cell type discovery and representation in the era of high-content single cell phenotyping

BACKGROUND: A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biom...

Descripción completa

Detalles Bibliográficos
Autores principales: Bakken, Trygve, Cowell, Lindsay, Aevermann, Brian D., Novotny, Mark, Hodge, Rebecca, Miller, Jeremy A., Lee, Alexandra, Chang, Ivan, McCorrison, Jamison, Pulendran, Bali, Qian, Yu, Schork, Nicholas J., Lasken, Roger S., Lein, Ed S., Scheuermann, Richard H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763450/
https://www.ncbi.nlm.nih.gov/pubmed/29322913
http://dx.doi.org/10.1186/s12859-017-1977-1
_version_ 1783291890979831808
author Bakken, Trygve
Cowell, Lindsay
Aevermann, Brian D.
Novotny, Mark
Hodge, Rebecca
Miller, Jeremy A.
Lee, Alexandra
Chang, Ivan
McCorrison, Jamison
Pulendran, Bali
Qian, Yu
Schork, Nicholas J.
Lasken, Roger S.
Lein, Ed S.
Scheuermann, Richard H.
author_facet Bakken, Trygve
Cowell, Lindsay
Aevermann, Brian D.
Novotny, Mark
Hodge, Rebecca
Miller, Jeremy A.
Lee, Alexandra
Chang, Ivan
McCorrison, Jamison
Pulendran, Bali
Qian, Yu
Schork, Nicholas J.
Lasken, Roger S.
Lein, Ed S.
Scheuermann, Richard H.
author_sort Bakken, Trygve
collection PubMed
description BACKGROUND: A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biomarkers of a variety of different disease states and therapeutic/vaccine responses. In order to serve as a reference for cell type representation, the Cell Ontology has been developed to provide a standard nomenclature of defined cell types for comparative analysis and biomarker discovery. Historically, these cell types have been defined based on unique cellular shapes and structures, anatomic locations, and marker protein expression. However, we are now experiencing a revolution in cellular characterization resulting from the application of new high-throughput, high-content cytometry and sequencing technologies. The resulting explosion in the number of distinct cell types being identified is challenging the current paradigm for cell type definition in the Cell Ontology. RESULTS: In this paper, we provide examples of state-of-the-art cellular biomarker characterization using high-content cytometry and single cell RNA sequencing, and present strategies for standardized cell type representations based on the data outputs from these cutting-edge technologies, including “context annotations” in the form of standardized experiment metadata about the specimen source analyzed and marker genes that serve as the most useful features in machine learning-based cell type classification models. We also propose a statistical strategy for comparing new experiment data to these standardized cell type representations. CONCLUSION: The advent of high-throughput/high-content single cell technologies is leading to an explosion in the number of distinct cell types being identified. It will be critical for the bioinformatics community to develop and adopt data standard conventions that will be compatible with these new technologies and support the data representation needs of the research community. The proposals enumerated here will serve as a useful starting point to address these challenges.
format Online
Article
Text
id pubmed-5763450
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57634502018-01-17 Cell type discovery and representation in the era of high-content single cell phenotyping Bakken, Trygve Cowell, Lindsay Aevermann, Brian D. Novotny, Mark Hodge, Rebecca Miller, Jeremy A. Lee, Alexandra Chang, Ivan McCorrison, Jamison Pulendran, Bali Qian, Yu Schork, Nicholas J. Lasken, Roger S. Lein, Ed S. Scheuermann, Richard H. BMC Bioinformatics Research BACKGROUND: A fundamental characteristic of multicellular organisms is the specialization of functional cell types through the process of differentiation. These specialized cell types not only characterize the normal functioning of different organs and tissues, they can also be used as cellular biomarkers of a variety of different disease states and therapeutic/vaccine responses. In order to serve as a reference for cell type representation, the Cell Ontology has been developed to provide a standard nomenclature of defined cell types for comparative analysis and biomarker discovery. Historically, these cell types have been defined based on unique cellular shapes and structures, anatomic locations, and marker protein expression. However, we are now experiencing a revolution in cellular characterization resulting from the application of new high-throughput, high-content cytometry and sequencing technologies. The resulting explosion in the number of distinct cell types being identified is challenging the current paradigm for cell type definition in the Cell Ontology. RESULTS: In this paper, we provide examples of state-of-the-art cellular biomarker characterization using high-content cytometry and single cell RNA sequencing, and present strategies for standardized cell type representations based on the data outputs from these cutting-edge technologies, including “context annotations” in the form of standardized experiment metadata about the specimen source analyzed and marker genes that serve as the most useful features in machine learning-based cell type classification models. We also propose a statistical strategy for comparing new experiment data to these standardized cell type representations. CONCLUSION: The advent of high-throughput/high-content single cell technologies is leading to an explosion in the number of distinct cell types being identified. It will be critical for the bioinformatics community to develop and adopt data standard conventions that will be compatible with these new technologies and support the data representation needs of the research community. The proposals enumerated here will serve as a useful starting point to address these challenges. BioMed Central 2017-12-21 /pmc/articles/PMC5763450/ /pubmed/29322913 http://dx.doi.org/10.1186/s12859-017-1977-1 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Bakken, Trygve
Cowell, Lindsay
Aevermann, Brian D.
Novotny, Mark
Hodge, Rebecca
Miller, Jeremy A.
Lee, Alexandra
Chang, Ivan
McCorrison, Jamison
Pulendran, Bali
Qian, Yu
Schork, Nicholas J.
Lasken, Roger S.
Lein, Ed S.
Scheuermann, Richard H.
Cell type discovery and representation in the era of high-content single cell phenotyping
title Cell type discovery and representation in the era of high-content single cell phenotyping
title_full Cell type discovery and representation in the era of high-content single cell phenotyping
title_fullStr Cell type discovery and representation in the era of high-content single cell phenotyping
title_full_unstemmed Cell type discovery and representation in the era of high-content single cell phenotyping
title_short Cell type discovery and representation in the era of high-content single cell phenotyping
title_sort cell type discovery and representation in the era of high-content single cell phenotyping
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763450/
https://www.ncbi.nlm.nih.gov/pubmed/29322913
http://dx.doi.org/10.1186/s12859-017-1977-1
work_keys_str_mv AT bakkentrygve celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT cowelllindsay celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT aevermannbriand celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT novotnymark celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT hodgerebecca celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT millerjeremya celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT leealexandra celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT changivan celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT mccorrisonjamison celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT pulendranbali celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT qianyu celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT schorknicholasj celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT laskenrogers celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT leineds celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping
AT scheuermannrichardh celltypediscoveryandrepresentationintheeraofhighcontentsinglecellphenotyping