Cargando…

The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data

BACKGROUND: The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g., cell flow cytometry) are costly, time consuming and have potential to introduce bias. Co...

Descripción completa

Detalles Bibliográficos
Autores principales: Nadel, Brian B, Lopez, David, Montoya, Dennis J, Ma, Feiyang, Waddel, Hannah, Khan, Misha M, Mangul, Serghei, Pellegrini, Matteo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931818/
https://www.ncbi.nlm.nih.gov/pubmed/33590863
http://dx.doi.org/10.1093/gigascience/giab002
_version_ 1783660364832964608
author Nadel, Brian B
Lopez, David
Montoya, Dennis J
Ma, Feiyang
Waddel, Hannah
Khan, Misha M
Mangul, Serghei
Pellegrini, Matteo
author_facet Nadel, Brian B
Lopez, David
Montoya, Dennis J
Ma, Feiyang
Waddel, Hannah
Khan, Misha M
Mangul, Serghei
Pellegrini, Matteo
author_sort Nadel, Brian B
collection PubMed
description BACKGROUND: The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g., cell flow cytometry) are costly, time consuming and have potential to introduce bias. Computational approaches that use expression data to infer cell type abundance offer an alternative solution. While these methods have gained popularity, most fail to produce accurate predictions for the full range of platforms currently used by researchers or for the wide variety of tissue types often studied. RESULTS: We present the Gene Expression Deconvolution Interactive Tool (GEDIT), a flexible tool that utilizes gene expression data to accurately predict cell type abundances. Using both simulated and experimental data, we extensively evaluate the performance of GEDIT and demonstrate that it returns robust results under a wide variety of conditions. These conditions include multiple platforms (microarray and RNA-seq), tissue types (blood and stromal), and species (human and mouse). Finally, we provide reference data from 8 sources spanning a broad range of stromal and hematopoietic types in both human and mouse. GEDIT also accepts user-submitted reference data, thus allowing the estimation of any cell type or subtype, provided that reference data are available. CONCLUSIONS: GEDIT is a powerful method for evaluating the cell type composition of tissue samples and provides excellent accuracy and versatility compared to similar tools. The reference database provided here also allows users to obtain estimates for a wide variety of tissue samples without having to provide their own data.
format Online
Article
Text
id pubmed-7931818
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-79318182021-03-09 The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data Nadel, Brian B Lopez, David Montoya, Dennis J Ma, Feiyang Waddel, Hannah Khan, Misha M Mangul, Serghei Pellegrini, Matteo Gigascience Research BACKGROUND: The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g., cell flow cytometry) are costly, time consuming and have potential to introduce bias. Computational approaches that use expression data to infer cell type abundance offer an alternative solution. While these methods have gained popularity, most fail to produce accurate predictions for the full range of platforms currently used by researchers or for the wide variety of tissue types often studied. RESULTS: We present the Gene Expression Deconvolution Interactive Tool (GEDIT), a flexible tool that utilizes gene expression data to accurately predict cell type abundances. Using both simulated and experimental data, we extensively evaluate the performance of GEDIT and demonstrate that it returns robust results under a wide variety of conditions. These conditions include multiple platforms (microarray and RNA-seq), tissue types (blood and stromal), and species (human and mouse). Finally, we provide reference data from 8 sources spanning a broad range of stromal and hematopoietic types in both human and mouse. GEDIT also accepts user-submitted reference data, thus allowing the estimation of any cell type or subtype, provided that reference data are available. CONCLUSIONS: GEDIT is a powerful method for evaluating the cell type composition of tissue samples and provides excellent accuracy and versatility compared to similar tools. The reference database provided here also allows users to obtain estimates for a wide variety of tissue samples without having to provide their own data. Oxford University Press 2021-02-16 /pmc/articles/PMC7931818/ /pubmed/33590863 http://dx.doi.org/10.1093/gigascience/giab002 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Nadel, Brian B
Lopez, David
Montoya, Dennis J
Ma, Feiyang
Waddel, Hannah
Khan, Misha M
Mangul, Serghei
Pellegrini, Matteo
The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data
title The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data
title_full The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data
title_fullStr The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data
title_full_unstemmed The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data
title_short The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data
title_sort gene expression deconvolution interactive tool (gedit): accurate cell type quantification from gene expression data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931818/
https://www.ncbi.nlm.nih.gov/pubmed/33590863
http://dx.doi.org/10.1093/gigascience/giab002
work_keys_str_mv AT nadelbrianb thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT lopezdavid thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT montoyadennisj thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT mafeiyang thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT waddelhannah thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT khanmisham thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT mangulserghei thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT pellegrinimatteo thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT nadelbrianb geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT lopezdavid geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT montoyadennisj geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT mafeiyang geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT waddelhannah geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT khanmisham geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT mangulserghei geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata
AT pellegrinimatteo geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata