Cargando…
bmVAE: a variational autoencoder method for clustering single-cell mutation data
MOTIVATION: Genetic intra-tumor heterogeneity (ITH) characterizes the differences in genomic variations between tumor clones, and accurately unmasking ITH is important for personalized cancer therapy. Single-cell DNA sequencing now emerges as a powerful means for deciphering underlying ITH based on...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825778/ https://www.ncbi.nlm.nih.gov/pubmed/36478203 http://dx.doi.org/10.1093/bioinformatics/btac790 |
_version_ | 1784866696669954048 |
---|---|
author | Yan, Jiaqian Ma, Ming Yu, Zhenhua |
author_facet | Yan, Jiaqian Ma, Ming Yu, Zhenhua |
author_sort | Yan, Jiaqian |
collection | PubMed |
description | MOTIVATION: Genetic intra-tumor heterogeneity (ITH) characterizes the differences in genomic variations between tumor clones, and accurately unmasking ITH is important for personalized cancer therapy. Single-cell DNA sequencing now emerges as a powerful means for deciphering underlying ITH based on point mutations of single cells. However, detecting tumor clones from single-cell mutation data remains challenging due to the error-prone and discrete nature of the data. RESULTS: We introduce bmVAE, a bioinformatics tool for learning low-dimensional latent representation of single cell based on a variational autoencoder and then clustering cells into subpopulations in the latent space. bmVAE takes single-cell binary mutation data as inputs, and outputs inferred cell subpopulations as well as their genotypes. To achieve this, the bmVAE framework is designed to consist of three modules including dimensionality reduction, cell clustering and genotype estimation. We assess the method on various synthetic datasets where different factors including false negative rate, data size and data heterogeneity are considered in simulation, and further demonstrate its effectiveness on two real datasets. The results suggest bmVAE is highly effective in reasoning ITH, and performs competitive to existing methods. AVAILABILITY AND IMPLEMENTATION: bmVAE is freely available at https://github.com/zhyu-lab/bmvae. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9825778 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98257782023-01-10 bmVAE: a variational autoencoder method for clustering single-cell mutation data Yan, Jiaqian Ma, Ming Yu, Zhenhua Bioinformatics Original Paper MOTIVATION: Genetic intra-tumor heterogeneity (ITH) characterizes the differences in genomic variations between tumor clones, and accurately unmasking ITH is important for personalized cancer therapy. Single-cell DNA sequencing now emerges as a powerful means for deciphering underlying ITH based on point mutations of single cells. However, detecting tumor clones from single-cell mutation data remains challenging due to the error-prone and discrete nature of the data. RESULTS: We introduce bmVAE, a bioinformatics tool for learning low-dimensional latent representation of single cell based on a variational autoencoder and then clustering cells into subpopulations in the latent space. bmVAE takes single-cell binary mutation data as inputs, and outputs inferred cell subpopulations as well as their genotypes. To achieve this, the bmVAE framework is designed to consist of three modules including dimensionality reduction, cell clustering and genotype estimation. We assess the method on various synthetic datasets where different factors including false negative rate, data size and data heterogeneity are considered in simulation, and further demonstrate its effectiveness on two real datasets. The results suggest bmVAE is highly effective in reasoning ITH, and performs competitive to existing methods. AVAILABILITY AND IMPLEMENTATION: bmVAE is freely available at https://github.com/zhyu-lab/bmvae. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-12-07 /pmc/articles/PMC9825778/ /pubmed/36478203 http://dx.doi.org/10.1093/bioinformatics/btac790 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Yan, Jiaqian Ma, Ming Yu, Zhenhua bmVAE: a variational autoencoder method for clustering single-cell mutation data |
title | bmVAE: a variational autoencoder method for clustering single-cell mutation data |
title_full | bmVAE: a variational autoencoder method for clustering single-cell mutation data |
title_fullStr | bmVAE: a variational autoencoder method for clustering single-cell mutation data |
title_full_unstemmed | bmVAE: a variational autoencoder method for clustering single-cell mutation data |
title_short | bmVAE: a variational autoencoder method for clustering single-cell mutation data |
title_sort | bmvae: a variational autoencoder method for clustering single-cell mutation data |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825778/ https://www.ncbi.nlm.nih.gov/pubmed/36478203 http://dx.doi.org/10.1093/bioinformatics/btac790 |
work_keys_str_mv | AT yanjiaqian bmvaeavariationalautoencodermethodforclusteringsinglecellmutationdata AT maming bmvaeavariationalautoencodermethodforclusteringsinglecellmutationdata AT yuzhenhua bmvaeavariationalautoencodermethodforclusteringsinglecellmutationdata |