Cargando…
CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing
BACKGROUND: Although both copy number variations (CNVs) and single nucleotide variations (SNVs) detected by single-cell RNA sequencing (scRNA-seq) are used to study intratumor heterogeneity and detect clonal groups, a software that integrates these two types of data in the same cells is unavailable....
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8939069/ https://www.ncbi.nlm.nih.gov/pubmed/35313800 http://dx.doi.org/10.1186/s12859-022-04625-x |
_version_ | 1784672668235071488 |
---|---|
author | Kannan, Jeerthi Mathews, Liza Wu, Zhijie Young, Neal S. Gao, Shouguo |
author_facet | Kannan, Jeerthi Mathews, Liza Wu, Zhijie Young, Neal S. Gao, Shouguo |
author_sort | Kannan, Jeerthi |
collection | PubMed |
description | BACKGROUND: Although both copy number variations (CNVs) and single nucleotide variations (SNVs) detected by single-cell RNA sequencing (scRNA-seq) are used to study intratumor heterogeneity and detect clonal groups, a software that integrates these two types of data in the same cells is unavailable. RESULTS: We developed Clonal Architecture with Integration of SNV and CNV (CAISC), an R package for scRNA-seq data analysis that clusters single cells into distinct subclones by integrating CNV and SNV genotype matrices using an entropy weighted approach. The performance of CAISC was tested on simulation data and four real datasets, which confirmed its high accuracy in sub-clonal identification and assignment, including subclones which cannot be identified using one type of data alone. Furthermore, integration of SNV and CNV allowed for accurate examination of expression changes between subclones, as demonstrated by the results from trisomy 8 clones of the myelodysplastic syndromes (MDS) dataset. CONCLUSIONS: CAISC is a powerful tool for integration of CNV and SNV data from scRNA-seq to identify clonal clusters with better accuracy than obtained from a single type of data. CAISC allows users to interactively examine clonal assignments. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04625-x. |
format | Online Article Text |
id | pubmed-8939069 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-89390692022-03-23 CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing Kannan, Jeerthi Mathews, Liza Wu, Zhijie Young, Neal S. Gao, Shouguo BMC Bioinformatics Methodology BACKGROUND: Although both copy number variations (CNVs) and single nucleotide variations (SNVs) detected by single-cell RNA sequencing (scRNA-seq) are used to study intratumor heterogeneity and detect clonal groups, a software that integrates these two types of data in the same cells is unavailable. RESULTS: We developed Clonal Architecture with Integration of SNV and CNV (CAISC), an R package for scRNA-seq data analysis that clusters single cells into distinct subclones by integrating CNV and SNV genotype matrices using an entropy weighted approach. The performance of CAISC was tested on simulation data and four real datasets, which confirmed its high accuracy in sub-clonal identification and assignment, including subclones which cannot be identified using one type of data alone. Furthermore, integration of SNV and CNV allowed for accurate examination of expression changes between subclones, as demonstrated by the results from trisomy 8 clones of the myelodysplastic syndromes (MDS) dataset. CONCLUSIONS: CAISC is a powerful tool for integration of CNV and SNV data from scRNA-seq to identify clonal clusters with better accuracy than obtained from a single type of data. CAISC allows users to interactively examine clonal assignments. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04625-x. BioMed Central 2022-03-21 /pmc/articles/PMC8939069/ /pubmed/35313800 http://dx.doi.org/10.1186/s12859-022-04625-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Kannan, Jeerthi Mathews, Liza Wu, Zhijie Young, Neal S. Gao, Shouguo CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing |
title | CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing |
title_full | CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing |
title_fullStr | CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing |
title_full_unstemmed | CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing |
title_short | CAISC: A software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell RNA sequencing |
title_sort | caisc: a software to integrate copy number variations and single nucleotide mutations for genetic heterogeneity profiling and subclone detection by single-cell rna sequencing |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8939069/ https://www.ncbi.nlm.nih.gov/pubmed/35313800 http://dx.doi.org/10.1186/s12859-022-04625-x |
work_keys_str_mv | AT kannanjeerthi caiscasoftwaretointegratecopynumbervariationsandsinglenucleotidemutationsforgeneticheterogeneityprofilingandsubclonedetectionbysinglecellrnasequencing AT mathewsliza caiscasoftwaretointegratecopynumbervariationsandsinglenucleotidemutationsforgeneticheterogeneityprofilingandsubclonedetectionbysinglecellrnasequencing AT wuzhijie caiscasoftwaretointegratecopynumbervariationsandsinglenucleotidemutationsforgeneticheterogeneityprofilingandsubclonedetectionbysinglecellrnasequencing AT youngneals caiscasoftwaretointegratecopynumbervariationsandsinglenucleotidemutationsforgeneticheterogeneityprofilingandsubclonedetectionbysinglecellrnasequencing AT gaoshouguo caiscasoftwaretointegratecopynumbervariationsandsinglenucleotidemutationsforgeneticheterogeneityprofilingandsubclonedetectionbysinglecellrnasequencing |