Cargando…

bmVAE: a variational autoencoder method for clustering single-cell mutation data

MOTIVATION: Genetic intra-tumor heterogeneity (ITH) characterizes the differences in genomic variations between tumor clones, and accurately unmasking ITH is important for personalized cancer therapy. Single-cell DNA sequencing now emerges as a powerful means for deciphering underlying ITH based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Jiaqian, Ma, Ming, Yu, Zhenhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825778/
https://www.ncbi.nlm.nih.gov/pubmed/36478203
http://dx.doi.org/10.1093/bioinformatics/btac790
_version_ 1784866696669954048
author Yan, Jiaqian
Ma, Ming
Yu, Zhenhua
author_facet Yan, Jiaqian
Ma, Ming
Yu, Zhenhua
author_sort Yan, Jiaqian
collection PubMed
description MOTIVATION: Genetic intra-tumor heterogeneity (ITH) characterizes the differences in genomic variations between tumor clones, and accurately unmasking ITH is important for personalized cancer therapy. Single-cell DNA sequencing now emerges as a powerful means for deciphering underlying ITH based on point mutations of single cells. However, detecting tumor clones from single-cell mutation data remains challenging due to the error-prone and discrete nature of the data. RESULTS: We introduce bmVAE, a bioinformatics tool for learning low-dimensional latent representation of single cell based on a variational autoencoder and then clustering cells into subpopulations in the latent space. bmVAE takes single-cell binary mutation data as inputs, and outputs inferred cell subpopulations as well as their genotypes. To achieve this, the bmVAE framework is designed to consist of three modules including dimensionality reduction, cell clustering and genotype estimation. We assess the method on various synthetic datasets where different factors including false negative rate, data size and data heterogeneity are considered in simulation, and further demonstrate its effectiveness on two real datasets. The results suggest bmVAE is highly effective in reasoning ITH, and performs competitive to existing methods. AVAILABILITY AND IMPLEMENTATION: bmVAE is freely available at https://github.com/zhyu-lab/bmvae. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9825778
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98257782023-01-10 bmVAE: a variational autoencoder method for clustering single-cell mutation data Yan, Jiaqian Ma, Ming Yu, Zhenhua Bioinformatics Original Paper MOTIVATION: Genetic intra-tumor heterogeneity (ITH) characterizes the differences in genomic variations between tumor clones, and accurately unmasking ITH is important for personalized cancer therapy. Single-cell DNA sequencing now emerges as a powerful means for deciphering underlying ITH based on point mutations of single cells. However, detecting tumor clones from single-cell mutation data remains challenging due to the error-prone and discrete nature of the data. RESULTS: We introduce bmVAE, a bioinformatics tool for learning low-dimensional latent representation of single cell based on a variational autoencoder and then clustering cells into subpopulations in the latent space. bmVAE takes single-cell binary mutation data as inputs, and outputs inferred cell subpopulations as well as their genotypes. To achieve this, the bmVAE framework is designed to consist of three modules including dimensionality reduction, cell clustering and genotype estimation. We assess the method on various synthetic datasets where different factors including false negative rate, data size and data heterogeneity are considered in simulation, and further demonstrate its effectiveness on two real datasets. The results suggest bmVAE is highly effective in reasoning ITH, and performs competitive to existing methods. AVAILABILITY AND IMPLEMENTATION: bmVAE is freely available at https://github.com/zhyu-lab/bmvae. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-12-07 /pmc/articles/PMC9825778/ /pubmed/36478203 http://dx.doi.org/10.1093/bioinformatics/btac790 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Yan, Jiaqian
Ma, Ming
Yu, Zhenhua
bmVAE: a variational autoencoder method for clustering single-cell mutation data
title bmVAE: a variational autoencoder method for clustering single-cell mutation data
title_full bmVAE: a variational autoencoder method for clustering single-cell mutation data
title_fullStr bmVAE: a variational autoencoder method for clustering single-cell mutation data
title_full_unstemmed bmVAE: a variational autoencoder method for clustering single-cell mutation data
title_short bmVAE: a variational autoencoder method for clustering single-cell mutation data
title_sort bmvae: a variational autoencoder method for clustering single-cell mutation data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825778/
https://www.ncbi.nlm.nih.gov/pubmed/36478203
http://dx.doi.org/10.1093/bioinformatics/btac790
work_keys_str_mv AT yanjiaqian bmvaeavariationalautoencodermethodforclusteringsinglecellmutationdata
AT maming bmvaeavariationalautoencodermethodforclusteringsinglecellmutationdata
AT yuzhenhua bmvaeavariationalautoencodermethodforclusteringsinglecellmutationdata