Cargando…
GiniClust3: a fast and memory-efficient tool for rare cell type identification
BACKGROUND: With the rapid development of single-cell RNA sequencing technology, it is possible to dissect cell-type composition at high resolution. A number of methods have been developed with the purpose to identify rare cell types. However, existing methods are still not scalable to large dataset...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7183612/ https://www.ncbi.nlm.nih.gov/pubmed/32334526 http://dx.doi.org/10.1186/s12859-020-3482-1 |
_version_ | 1783526453816590336 |
---|---|
author | Dong, Rui Yuan, Guo-Cheng |
author_facet | Dong, Rui Yuan, Guo-Cheng |
author_sort | Dong, Rui |
collection | PubMed |
description | BACKGROUND: With the rapid development of single-cell RNA sequencing technology, it is possible to dissect cell-type composition at high resolution. A number of methods have been developed with the purpose to identify rare cell types. However, existing methods are still not scalable to large datasets, limiting their utility. To overcome this limitation, we present a new software package, called GiniClust3, which is an extension of GiniClust2 and significantly faster and memory-efficient than previous versions. RESULTS: Using GiniClust3, it only takes about 7 h to identify both common and rare cell clusters from a dataset that contains more than one million cells. Cell type mapping and perturbation analyses show that GiniClust3 could robustly identify cell clusters. CONCLUSIONS: Taken together, these results suggest that GiniClust3 is a powerful tool to identify both common and rare cell population and can handle large dataset. GiniCluster3 is implemented in the open-source python package and available at https://github.com/rdong08/GiniClust3. |
format | Online Article Text |
id | pubmed-7183612 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-71836122020-04-29 GiniClust3: a fast and memory-efficient tool for rare cell type identification Dong, Rui Yuan, Guo-Cheng BMC Bioinformatics Software BACKGROUND: With the rapid development of single-cell RNA sequencing technology, it is possible to dissect cell-type composition at high resolution. A number of methods have been developed with the purpose to identify rare cell types. However, existing methods are still not scalable to large datasets, limiting their utility. To overcome this limitation, we present a new software package, called GiniClust3, which is an extension of GiniClust2 and significantly faster and memory-efficient than previous versions. RESULTS: Using GiniClust3, it only takes about 7 h to identify both common and rare cell clusters from a dataset that contains more than one million cells. Cell type mapping and perturbation analyses show that GiniClust3 could robustly identify cell clusters. CONCLUSIONS: Taken together, these results suggest that GiniClust3 is a powerful tool to identify both common and rare cell population and can handle large dataset. GiniCluster3 is implemented in the open-source python package and available at https://github.com/rdong08/GiniClust3. BioMed Central 2020-04-25 /pmc/articles/PMC7183612/ /pubmed/32334526 http://dx.doi.org/10.1186/s12859-020-3482-1 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Dong, Rui Yuan, Guo-Cheng GiniClust3: a fast and memory-efficient tool for rare cell type identification |
title | GiniClust3: a fast and memory-efficient tool for rare cell type identification |
title_full | GiniClust3: a fast and memory-efficient tool for rare cell type identification |
title_fullStr | GiniClust3: a fast and memory-efficient tool for rare cell type identification |
title_full_unstemmed | GiniClust3: a fast and memory-efficient tool for rare cell type identification |
title_short | GiniClust3: a fast and memory-efficient tool for rare cell type identification |
title_sort | giniclust3: a fast and memory-efficient tool for rare cell type identification |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7183612/ https://www.ncbi.nlm.nih.gov/pubmed/32334526 http://dx.doi.org/10.1186/s12859-020-3482-1 |
work_keys_str_mv | AT dongrui giniclust3afastandmemoryefficienttoolforrarecelltypeidentification AT yuanguocheng giniclust3afastandmemoryefficienttoolforrarecelltypeidentification |