
Machine learning predicts putative hematopoietic stem cells within large single-cell transcriptomics data sets

Hematopoietic stem cells (HSCs) are an essential source and reservoir for normal hematopoiesis, and their function is compromised in many blood disorders. HSC research has benefitted from the recent development of single-cell molecular profiling technologies, where single-cell RNA sequencing (scRNA-...

Bibliographic Details
Main authors: Hamey, Fiona K., Göttgens, Berthold
Format: Online Article Text
Language: English
Published: Elsevier Science Inc 2019
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6900257/
https://www.ncbi.nlm.nih.gov/pubmed/31513832
http://dx.doi.org/10.1016/j.exphem.2019.08.009
_version_ 1783477315909451776
author Hamey, Fiona K.
Göttgens, Berthold
collection PubMed
description Hematopoietic stem cells (HSCs) are an essential source and reservoir for normal hematopoiesis, and their function is compromised in many blood disorders. HSC research has benefitted from the recent development of single-cell molecular profiling technologies, where single-cell RNA sequencing (scRNA-seq) in particular has rapidly become an established method to profile HSCs and related hematopoietic populations. The classic definition of HSCs relies on transplantation assays, which have been used to validate HSC function for cell populations defined by flow cytometry. Flow cytometry information for single cells, however, is not available for many new high-throughput scRNA-seq methods, thus highlighting an urgent need for the establishment of alternative ways to pinpoint the likely HSCs within large scRNA-seq data sets. To address this, we tested a range of machine learning approaches and developed a tool, hscScore, to score single-cell transcriptomes from murine bone marrow based on their similarity to gene expression profiles of validated HSCs. We evaluated hscScore across scRNA-seq data from different laboratories, which allowed us to establish a robust method that functions across different technologies. To facilitate broad adoption of hscScore by the wider hematopoiesis community, we have made the trained model and example code freely available online. In summary, our method hscScore provides fast identification of mouse bone marrow HSCs from scRNA-seq measurements and represents a broadly useful tool for analysis of single-cell gene expression data.
format Online
Article
Text
id pubmed-6900257
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Elsevier Science Inc
record_format MEDLINE/PubMed
spelling pubmed-6900257 2020-01-21 Hamey, Fiona K. Göttgens, Berthold Exp Hematol Article Elsevier Science Inc 2019-10 /pmc/articles/PMC6900257/ /pubmed/31513832 http://dx.doi.org/10.1016/j.exphem.2019.08.009 Text en © 2019 ISEH -- Society for Hematology and Stem Cells. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title Machine learning predicts putative hematopoietic stem cells within large single-cell transcriptomics data sets
topic Article