Cargando…

DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples

While our understanding of cellular and molecular processes has grown exponentially, issues related to the cell microenvironment and cellular heterogeneity have sparked a new debate concerning the cell identity. Cell composition (chromatin and nuclear architecture) poses a strong risk for dynamic ch...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Huamei, Sharma, Amit, Luo, Kun, Qin, Zhaohui S., Sun, Xiao, Liu, Hongde
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7269180/
https://www.ncbi.nlm.nih.gov/pubmed/32547592
http://dx.doi.org/10.3389/fgene.2020.00392
_version_ 1783541734978879488
author Li, Huamei
Sharma, Amit
Luo, Kun
Qin, Zhaohui S.
Sun, Xiao
Liu, Hongde
author_facet Li, Huamei
Sharma, Amit
Luo, Kun
Qin, Zhaohui S.
Sun, Xiao
Liu, Hongde
author_sort Li, Huamei
collection PubMed
description While our understanding of cellular and molecular processes has grown exponentially, issues related to the cell microenvironment and cellular heterogeneity have sparked a new debate concerning the cell identity. Cell composition (chromatin and nuclear architecture) poses a strong risk for dynamic changes in the diseased condition. Since chromatin accessibility patterns play a major role in human diseases, it is therefore anticipated that a deconvolution tool based on open chromatin data will provide better performance in identifying cell composition. Herein, we have designed the deconvolution tool “DeconPeaker,” which can precisely define the uniqueness among subpopulations of cells using open chromatin datasets. Using this tool, we simultaneously evaluated chromatin accessibility and gene expression datasets to estimate cell types and their respective proportions in a mixture of samples. In comparison to other known deconvolution methods, we observed the lowest average root-mean-square error (RMSE = 0.042) and the highest average correlation coefficient (r = 0.919) between the prediction and “true” proportion. As a proof-of-concept, we also tested chromatin accessibility data from acute myeloid leukemia (AML) and successfully obtained unique cell types associated with AML progression. Furthermore, we showed that chromatin accessibility represents more essential characteristics in the identification of cell types than gene expression. Taken together, DeconPeaker as a powerful tool has the potential to combine different datasets (primarily, chromatin accessibility and gene expression) and define different cell types in mixtures. The Python package of DeconPeaker is now available at https://github.com/lihuamei/DeconPeaker.
format Online
Article
Text
id pubmed-7269180
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-72691802020-06-15 DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples Li, Huamei Sharma, Amit Luo, Kun Qin, Zhaohui S. Sun, Xiao Liu, Hongde Front Genet Genetics While our understanding of cellular and molecular processes has grown exponentially, issues related to the cell microenvironment and cellular heterogeneity have sparked a new debate concerning the cell identity. Cell composition (chromatin and nuclear architecture) poses a strong risk for dynamic changes in the diseased condition. Since chromatin accessibility patterns play a major role in human diseases, it is therefore anticipated that a deconvolution tool based on open chromatin data will provide better performance in identifying cell composition. Herein, we have designed the deconvolution tool “DeconPeaker,” which can precisely define the uniqueness among subpopulations of cells using open chromatin datasets. Using this tool, we simultaneously evaluated chromatin accessibility and gene expression datasets to estimate cell types and their respective proportions in a mixture of samples. In comparison to other known deconvolution methods, we observed the lowest average root-mean-square error (RMSE = 0.042) and the highest average correlation coefficient (r = 0.919) between the prediction and “true” proportion. As a proof-of-concept, we also tested chromatin accessibility data from acute myeloid leukemia (AML) and successfully obtained unique cell types associated with AML progression. Furthermore, we showed that chromatin accessibility represents more essential characteristics in the identification of cell types than gene expression. Taken together, DeconPeaker as a powerful tool has the potential to combine different datasets (primarily, chromatin accessibility and gene expression) and define different cell types in mixtures. The Python package of DeconPeaker is now available at https://github.com/lihuamei/DeconPeaker. Frontiers Media S.A. 2020-06-08 /pmc/articles/PMC7269180/ /pubmed/32547592 http://dx.doi.org/10.3389/fgene.2020.00392 Text en Copyright © 2020 Li, Sharma, Luo, Qin, Sun and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Huamei
Sharma, Amit
Luo, Kun
Qin, Zhaohui S.
Sun, Xiao
Liu, Hongde
DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples
title DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples
title_full DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples
title_fullStr DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples
title_full_unstemmed DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples
title_short DeconPeaker, a Deconvolution Model to Identify Cell Types Based on Chromatin Accessibility in ATAC-Seq Data of Mixture Samples
title_sort deconpeaker, a deconvolution model to identify cell types based on chromatin accessibility in atac-seq data of mixture samples
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7269180/
https://www.ncbi.nlm.nih.gov/pubmed/32547592
http://dx.doi.org/10.3389/fgene.2020.00392
work_keys_str_mv AT lihuamei deconpeakeradeconvolutionmodeltoidentifycelltypesbasedonchromatinaccessibilityinatacseqdataofmixturesamples
AT sharmaamit deconpeakeradeconvolutionmodeltoidentifycelltypesbasedonchromatinaccessibilityinatacseqdataofmixturesamples
AT luokun deconpeakeradeconvolutionmodeltoidentifycelltypesbasedonchromatinaccessibilityinatacseqdataofmixturesamples
AT qinzhaohuis deconpeakeradeconvolutionmodeltoidentifycelltypesbasedonchromatinaccessibilityinatacseqdataofmixturesamples
AT sunxiao deconpeakeradeconvolutionmodeltoidentifycelltypesbasedonchromatinaccessibilityinatacseqdataofmixturesamples
AT liuhongde deconpeakeradeconvolutionmodeltoidentifycelltypesbasedonchromatinaccessibilityinatacseqdataofmixturesamples