Cargando…

SCSilicon: a tool for synthetic single-cell DNA sequencing data generation

BACKGROUND: Single-cell DNA sequencing is getting indispensable in the study of cell-specific cancer genomics. The performance of computational tools that tackle single-cell genome aberrations may be nevertheless undervalued or overvalued, owing to the insufficient size of benchmarking data. In sili...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Xikang, Chen, Lingxi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9092674/
https://www.ncbi.nlm.nih.gov/pubmed/35546390
http://dx.doi.org/10.1186/s12864-022-08566-w
_version_ 1784705181736239104
author Feng, Xikang
Chen, Lingxi
author_facet Feng, Xikang
Chen, Lingxi
author_sort Feng, Xikang
collection PubMed
description BACKGROUND: Single-cell DNA sequencing is getting indispensable in the study of cell-specific cancer genomics. The performance of computational tools that tackle single-cell genome aberrations may be nevertheless undervalued or overvalued, owing to the insufficient size of benchmarking data. In silicon simulation is a cost-effective approach to generate as many single-cell genomes as possible in a controlled manner to make reliable and valid benchmarking. RESULTS: This study proposes a new tool, SCSilicon, which efficiently generates single-cell in silicon DNA reads with minimum manual intervention. SCSilicon automatically creates a set of genomic aberrations, including SNP, SNV, Indel, and CNV. Besides, SCSilicon yields the ground truth of CNV segmentation breakpoints and subclone cell labels. We have manually inspected a series of synthetic variations. We conducted a sanity check of the start-of-the-art single-cell CNV callers and found SCYN was the most robust one. CONCLUSIONS: SCSilicon is a user-friendly software package for users to develop and benchmark single-cell CNV callers. Source code of SCSilicon is available at https://github.com/xikanfeng2/SCSilicon. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-022-08566-w).
format Online
Article
Text
id pubmed-9092674
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90926742022-05-12 SCSilicon: a tool for synthetic single-cell DNA sequencing data generation Feng, Xikang Chen, Lingxi BMC Genomics Software BACKGROUND: Single-cell DNA sequencing is getting indispensable in the study of cell-specific cancer genomics. The performance of computational tools that tackle single-cell genome aberrations may be nevertheless undervalued or overvalued, owing to the insufficient size of benchmarking data. In silicon simulation is a cost-effective approach to generate as many single-cell genomes as possible in a controlled manner to make reliable and valid benchmarking. RESULTS: This study proposes a new tool, SCSilicon, which efficiently generates single-cell in silicon DNA reads with minimum manual intervention. SCSilicon automatically creates a set of genomic aberrations, including SNP, SNV, Indel, and CNV. Besides, SCSilicon yields the ground truth of CNV segmentation breakpoints and subclone cell labels. We have manually inspected a series of synthetic variations. We conducted a sanity check of the start-of-the-art single-cell CNV callers and found SCYN was the most robust one. CONCLUSIONS: SCSilicon is a user-friendly software package for users to develop and benchmark single-cell CNV callers. Source code of SCSilicon is available at https://github.com/xikanfeng2/SCSilicon. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-022-08566-w). BioMed Central 2022-05-11 /pmc/articles/PMC9092674/ /pubmed/35546390 http://dx.doi.org/10.1186/s12864-022-08566-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Feng, Xikang
Chen, Lingxi
SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
title SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
title_full SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
title_fullStr SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
title_full_unstemmed SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
title_short SCSilicon: a tool for synthetic single-cell DNA sequencing data generation
title_sort scsilicon: a tool for synthetic single-cell dna sequencing data generation
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9092674/
https://www.ncbi.nlm.nih.gov/pubmed/35546390
http://dx.doi.org/10.1186/s12864-022-08566-w
work_keys_str_mv AT fengxikang scsiliconatoolforsyntheticsinglecelldnasequencingdatageneration
AT chenlingxi scsiliconatoolforsyntheticsinglecelldnasequencingdatageneration