Cargando…
The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data
BACKGROUND: The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g., cell flow cytometry) are costly, time consuming and have potential to introduce bias. Co...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931818/ https://www.ncbi.nlm.nih.gov/pubmed/33590863 http://dx.doi.org/10.1093/gigascience/giab002 |
_version_ | 1783660364832964608 |
---|---|
author | Nadel, Brian B Lopez, David Montoya, Dennis J Ma, Feiyang Waddel, Hannah Khan, Misha M Mangul, Serghei Pellegrini, Matteo |
author_facet | Nadel, Brian B Lopez, David Montoya, Dennis J Ma, Feiyang Waddel, Hannah Khan, Misha M Mangul, Serghei Pellegrini, Matteo |
author_sort | Nadel, Brian B |
collection | PubMed |
description | BACKGROUND: The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g., cell flow cytometry) are costly, time consuming and have potential to introduce bias. Computational approaches that use expression data to infer cell type abundance offer an alternative solution. While these methods have gained popularity, most fail to produce accurate predictions for the full range of platforms currently used by researchers or for the wide variety of tissue types often studied. RESULTS: We present the Gene Expression Deconvolution Interactive Tool (GEDIT), a flexible tool that utilizes gene expression data to accurately predict cell type abundances. Using both simulated and experimental data, we extensively evaluate the performance of GEDIT and demonstrate that it returns robust results under a wide variety of conditions. These conditions include multiple platforms (microarray and RNA-seq), tissue types (blood and stromal), and species (human and mouse). Finally, we provide reference data from 8 sources spanning a broad range of stromal and hematopoietic types in both human and mouse. GEDIT also accepts user-submitted reference data, thus allowing the estimation of any cell type or subtype, provided that reference data are available. CONCLUSIONS: GEDIT is a powerful method for evaluating the cell type composition of tissue samples and provides excellent accuracy and versatility compared to similar tools. The reference database provided here also allows users to obtain estimates for a wide variety of tissue samples without having to provide their own data. |
format | Online Article Text |
id | pubmed-7931818 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-79318182021-03-09 The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data Nadel, Brian B Lopez, David Montoya, Dennis J Ma, Feiyang Waddel, Hannah Khan, Misha M Mangul, Serghei Pellegrini, Matteo Gigascience Research BACKGROUND: The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g., cell flow cytometry) are costly, time consuming and have potential to introduce bias. Computational approaches that use expression data to infer cell type abundance offer an alternative solution. While these methods have gained popularity, most fail to produce accurate predictions for the full range of platforms currently used by researchers or for the wide variety of tissue types often studied. RESULTS: We present the Gene Expression Deconvolution Interactive Tool (GEDIT), a flexible tool that utilizes gene expression data to accurately predict cell type abundances. Using both simulated and experimental data, we extensively evaluate the performance of GEDIT and demonstrate that it returns robust results under a wide variety of conditions. These conditions include multiple platforms (microarray and RNA-seq), tissue types (blood and stromal), and species (human and mouse). Finally, we provide reference data from 8 sources spanning a broad range of stromal and hematopoietic types in both human and mouse. GEDIT also accepts user-submitted reference data, thus allowing the estimation of any cell type or subtype, provided that reference data are available. CONCLUSIONS: GEDIT is a powerful method for evaluating the cell type composition of tissue samples and provides excellent accuracy and versatility compared to similar tools. The reference database provided here also allows users to obtain estimates for a wide variety of tissue samples without having to provide their own data. Oxford University Press 2021-02-16 /pmc/articles/PMC7931818/ /pubmed/33590863 http://dx.doi.org/10.1093/gigascience/giab002 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Nadel, Brian B Lopez, David Montoya, Dennis J Ma, Feiyang Waddel, Hannah Khan, Misha M Mangul, Serghei Pellegrini, Matteo The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data |
title | The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data |
title_full | The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data |
title_fullStr | The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data |
title_full_unstemmed | The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data |
title_short | The Gene Expression Deconvolution Interactive Tool (GEDIT): accurate cell type quantification from gene expression data |
title_sort | gene expression deconvolution interactive tool (gedit): accurate cell type quantification from gene expression data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931818/ https://www.ncbi.nlm.nih.gov/pubmed/33590863 http://dx.doi.org/10.1093/gigascience/giab002 |
work_keys_str_mv | AT nadelbrianb thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT lopezdavid thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT montoyadennisj thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT mafeiyang thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT waddelhannah thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT khanmisham thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT mangulserghei thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT pellegrinimatteo thegeneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT nadelbrianb geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT lopezdavid geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT montoyadennisj geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT mafeiyang geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT waddelhannah geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT khanmisham geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT mangulserghei geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata AT pellegrinimatteo geneexpressiondeconvolutioninteractivetoolgeditaccuratecelltypequantificationfromgeneexpressiondata |