Cargando…
Identifying tumor clones in sparse single-cell mutation data
MOTIVATION: Recent single-cell DNA sequencing technologies enable whole-genome sequencing of hundreds to thousands of individual cells. However, these technologies have ultra-low sequencing coverage (<0.5× per cell) which has limited their use to the analysis of large copy-number aberrations (CNA...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355247/ https://www.ncbi.nlm.nih.gov/pubmed/32657385 http://dx.doi.org/10.1093/bioinformatics/btaa449 |
_version_ | 1783558236229599232 |
---|---|
author | Myers, Matthew A Zaccaria, Simone Raphael, Benjamin J |
author_facet | Myers, Matthew A Zaccaria, Simone Raphael, Benjamin J |
author_sort | Myers, Matthew A |
collection | PubMed |
description | MOTIVATION: Recent single-cell DNA sequencing technologies enable whole-genome sequencing of hundreds to thousands of individual cells. However, these technologies have ultra-low sequencing coverage (<0.5× per cell) which has limited their use to the analysis of large copy-number aberrations (CNAs) in individual cells. While CNAs are useful markers in cancer studies, single-nucleotide mutations are equally important, both in cancer studies and in other applications. However, ultra-low coverage sequencing yields single-nucleotide mutation data that are too sparse for current single-cell analysis methods. RESULTS: We introduce SBMClone, a method to infer clusters of cells, or clones, that share groups of somatic single-nucleotide mutations. SBMClone uses a stochastic block model to overcome sparsity in ultra-low coverage single-cell sequencing data, and we show that SBMClone accurately infers the true clonal composition on simulated datasets with coverage at low as [Formula: see text]. We applied SBMClone to single-cell whole-genome sequencing data from two breast cancer patients obtained using two different sequencing technologies. On the first patient, sequenced using the 10X Genomics CNV solution with sequencing coverage [Formula: see text] , SBMClone recovers the major clonal composition when incorporating a small amount of additional information. On the second patient, where pre- and post-treatment tumor samples were sequenced using DOP-PCR with sequencing coverage [Formula: see text] , SBMClone shows that tumor cells are present in the post-treatment sample, contrary to published analysis of this dataset. AVAILABILITY AND IMPLEMENTATION: SBMClone is available on the GitHub repository https://github.com/raphael-group/SBMClone. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7355247 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73552472020-07-16 Identifying tumor clones in sparse single-cell mutation data Myers, Matthew A Zaccaria, Simone Raphael, Benjamin J Bioinformatics Genomic Variation Analysis MOTIVATION: Recent single-cell DNA sequencing technologies enable whole-genome sequencing of hundreds to thousands of individual cells. However, these technologies have ultra-low sequencing coverage (<0.5× per cell) which has limited their use to the analysis of large copy-number aberrations (CNAs) in individual cells. While CNAs are useful markers in cancer studies, single-nucleotide mutations are equally important, both in cancer studies and in other applications. However, ultra-low coverage sequencing yields single-nucleotide mutation data that are too sparse for current single-cell analysis methods. RESULTS: We introduce SBMClone, a method to infer clusters of cells, or clones, that share groups of somatic single-nucleotide mutations. SBMClone uses a stochastic block model to overcome sparsity in ultra-low coverage single-cell sequencing data, and we show that SBMClone accurately infers the true clonal composition on simulated datasets with coverage at low as [Formula: see text]. We applied SBMClone to single-cell whole-genome sequencing data from two breast cancer patients obtained using two different sequencing technologies. On the first patient, sequenced using the 10X Genomics CNV solution with sequencing coverage [Formula: see text] , SBMClone recovers the major clonal composition when incorporating a small amount of additional information. On the second patient, where pre- and post-treatment tumor samples were sequenced using DOP-PCR with sequencing coverage [Formula: see text] , SBMClone shows that tumor cells are present in the post-treatment sample, contrary to published analysis of this dataset. AVAILABILITY AND IMPLEMENTATION: SBMClone is available on the GitHub repository https://github.com/raphael-group/SBMClone. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355247/ /pubmed/32657385 http://dx.doi.org/10.1093/bioinformatics/btaa449 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Genomic Variation Analysis Myers, Matthew A Zaccaria, Simone Raphael, Benjamin J Identifying tumor clones in sparse single-cell mutation data |
title | Identifying tumor clones in sparse single-cell mutation data |
title_full | Identifying tumor clones in sparse single-cell mutation data |
title_fullStr | Identifying tumor clones in sparse single-cell mutation data |
title_full_unstemmed | Identifying tumor clones in sparse single-cell mutation data |
title_short | Identifying tumor clones in sparse single-cell mutation data |
title_sort | identifying tumor clones in sparse single-cell mutation data |
topic | Genomic Variation Analysis |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355247/ https://www.ncbi.nlm.nih.gov/pubmed/32657385 http://dx.doi.org/10.1093/bioinformatics/btaa449 |
work_keys_str_mv | AT myersmatthewa identifyingtumorclonesinsparsesinglecellmutationdata AT zaccariasimone identifyingtumorclonesinsparsesinglecellmutationdata AT raphaelbenjaminj identifyingtumorclonesinsparsesinglecellmutationdata |