Cargando…

A simple, scalable approach to building a cross-platform transcriptome atlas

Gene expression atlases have transformed our understanding of the development, composition and function of human tissues. New technologies promise improved cellular or molecular resolution, and have led to the identification of new cell types, or better defined cell states. But as new technologies e...

Descripción completa

Detalles Bibliográficos
Autores principales: Angel, Paul W., Rajab, Nadia, Deng, Yidi, Pacheco, Chris M., Chen, Tyrone, Lê Cao, Kim-Anh, Choi, Jarny, Wells, Christine A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7544119/
https://www.ncbi.nlm.nih.gov/pubmed/32986694
http://dx.doi.org/10.1371/journal.pcbi.1008219
_version_ 1783591794197397504
author Angel, Paul W.
Rajab, Nadia
Deng, Yidi
Pacheco, Chris M.
Chen, Tyrone
Lê Cao, Kim-Anh
Choi, Jarny
Wells, Christine A.
author_facet Angel, Paul W.
Rajab, Nadia
Deng, Yidi
Pacheco, Chris M.
Chen, Tyrone
Lê Cao, Kim-Anh
Choi, Jarny
Wells, Christine A.
author_sort Angel, Paul W.
collection PubMed
description Gene expression atlases have transformed our understanding of the development, composition and function of human tissues. New technologies promise improved cellular or molecular resolution, and have led to the identification of new cell types, or better defined cell states. But as new technologies emerge, information derived on old platforms becomes obsolete. We demonstrate that it is possible to combine a large number of different profiling experiments summarised from dozens of laboratories and representing hundreds of donors, to create an integrated molecular map of human tissue. As an example, we combine 850 samples from 38 platforms to build an integrated atlas of human blood cells. We achieve robust and unbiased cell type clustering using a variance partitioning method, selecting genes with low platform bias relative to biological variation. Other than an initial rescaling, no other transformation to the primary data is applied through batch correction or renormalisation. Additional data, including single-cell datasets, can be projected for comparison, classification and annotation. The resulting atlas provides a multi-scaled approach to visualise and analyse the relationships between sets of genes and blood cell lineages, including the maturation and activation of leukocytes in vivo and in vitro. In allowing for data integration across hundreds of studies, we address a key reproduciblity challenge which is faced by any new technology. This allows us to draw on the deep phenotypes and functional annotations that accompany traditional profiling methods, and provide important context to the high cellular resolution of single cell profiling. Here, we have implemented the blood atlas in the open access Stemformatics.org platform, drawing on its extensive collection of curated transcriptome data. The method is simple, scalable and amenable for rapid deployment in other biological systems or computational workflows.
format Online
Article
Text
id pubmed-7544119
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75441192020-10-19 A simple, scalable approach to building a cross-platform transcriptome atlas Angel, Paul W. Rajab, Nadia Deng, Yidi Pacheco, Chris M. Chen, Tyrone Lê Cao, Kim-Anh Choi, Jarny Wells, Christine A. PLoS Comput Biol Research Article Gene expression atlases have transformed our understanding of the development, composition and function of human tissues. New technologies promise improved cellular or molecular resolution, and have led to the identification of new cell types, or better defined cell states. But as new technologies emerge, information derived on old platforms becomes obsolete. We demonstrate that it is possible to combine a large number of different profiling experiments summarised from dozens of laboratories and representing hundreds of donors, to create an integrated molecular map of human tissue. As an example, we combine 850 samples from 38 platforms to build an integrated atlas of human blood cells. We achieve robust and unbiased cell type clustering using a variance partitioning method, selecting genes with low platform bias relative to biological variation. Other than an initial rescaling, no other transformation to the primary data is applied through batch correction or renormalisation. Additional data, including single-cell datasets, can be projected for comparison, classification and annotation. The resulting atlas provides a multi-scaled approach to visualise and analyse the relationships between sets of genes and blood cell lineages, including the maturation and activation of leukocytes in vivo and in vitro. In allowing for data integration across hundreds of studies, we address a key reproduciblity challenge which is faced by any new technology. This allows us to draw on the deep phenotypes and functional annotations that accompany traditional profiling methods, and provide important context to the high cellular resolution of single cell profiling. Here, we have implemented the blood atlas in the open access Stemformatics.org platform, drawing on its extensive collection of curated transcriptome data. The method is simple, scalable and amenable for rapid deployment in other biological systems or computational workflows. Public Library of Science 2020-09-28 /pmc/articles/PMC7544119/ /pubmed/32986694 http://dx.doi.org/10.1371/journal.pcbi.1008219 Text en © 2020 Angel et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Angel, Paul W.
Rajab, Nadia
Deng, Yidi
Pacheco, Chris M.
Chen, Tyrone
Lê Cao, Kim-Anh
Choi, Jarny
Wells, Christine A.
A simple, scalable approach to building a cross-platform transcriptome atlas
title A simple, scalable approach to building a cross-platform transcriptome atlas
title_full A simple, scalable approach to building a cross-platform transcriptome atlas
title_fullStr A simple, scalable approach to building a cross-platform transcriptome atlas
title_full_unstemmed A simple, scalable approach to building a cross-platform transcriptome atlas
title_short A simple, scalable approach to building a cross-platform transcriptome atlas
title_sort simple, scalable approach to building a cross-platform transcriptome atlas
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7544119/
https://www.ncbi.nlm.nih.gov/pubmed/32986694
http://dx.doi.org/10.1371/journal.pcbi.1008219
work_keys_str_mv AT angelpaulw asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT rajabnadia asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT dengyidi asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT pachecochrism asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT chentyrone asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT lecaokimanh asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT choijarny asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT wellschristinea asimplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT angelpaulw simplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT rajabnadia simplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT dengyidi simplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT pachecochrism simplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT chentyrone simplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT lecaokimanh simplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT choijarny simplescalableapproachtobuildingacrossplatformtranscriptomeatlas
AT wellschristinea simplescalableapproachtobuildingacrossplatformtranscriptomeatlas