Cargando…

SCSsim: an integrated tool for simulating single-cell genome sequencing data

MOTIVATION: Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators a...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Zhenhua, Du, Fang, Sun, Xuehong, Li, Ao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703785/
https://www.ncbi.nlm.nih.gov/pubmed/31584615
http://dx.doi.org/10.1093/bioinformatics/btz713
_version_ 1783616695945920512
author Yu, Zhenhua
Du, Fang
Sun, Xuehong
Li, Ao
author_facet Yu, Zhenhua
Du, Fang
Sun, Xuehong
Li, Ao
author_sort Yu, Zhenhua
collection PubMed
description MOTIVATION: Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators are free of whole-genome amplification involved in SCS technique and therefore not suited for generating SCS datasets. We develop a new software package (SCSsim) that can efficiently simulate SCS datasets in a parallel fashion with minimal user intervention. SCSsim first constructs the genome sequence of single cell by mimicking a complement of genomic variations under user-controlled manner, and then amplifies the genome according to MALBAC technique and finally yields sequencing reads from the amplified products based on inferred sequencing profiles. Comprehensive evaluation in simulating different ADO rates, variation detection efficiency and genome coverage demonstrates that SCSsim is a very useful tool in mimicking single-cell sequencing data with high efficiency. AVAILABILITY AND IMPLEMENTATION: SCSsim is freely available at https://github.com/qasimyu/scssim. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7703785
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77037852020-12-07 SCSsim: an integrated tool for simulating single-cell genome sequencing data Yu, Zhenhua Du, Fang Sun, Xuehong Li, Ao Bioinformatics Applications Notes MOTIVATION: Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators are free of whole-genome amplification involved in SCS technique and therefore not suited for generating SCS datasets. We develop a new software package (SCSsim) that can efficiently simulate SCS datasets in a parallel fashion with minimal user intervention. SCSsim first constructs the genome sequence of single cell by mimicking a complement of genomic variations under user-controlled manner, and then amplifies the genome according to MALBAC technique and finally yields sequencing reads from the amplified products based on inferred sequencing profiles. Comprehensive evaluation in simulating different ADO rates, variation detection efficiency and genome coverage demonstrates that SCSsim is a very useful tool in mimicking single-cell sequencing data with high efficiency. AVAILABILITY AND IMPLEMENTATION: SCSsim is freely available at https://github.com/qasimyu/scssim. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-02-15 2019-09-17 /pmc/articles/PMC7703785/ /pubmed/31584615 http://dx.doi.org/10.1093/bioinformatics/btz713 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Yu, Zhenhua
Du, Fang
Sun, Xuehong
Li, Ao
SCSsim: an integrated tool for simulating single-cell genome sequencing data
title SCSsim: an integrated tool for simulating single-cell genome sequencing data
title_full SCSsim: an integrated tool for simulating single-cell genome sequencing data
title_fullStr SCSsim: an integrated tool for simulating single-cell genome sequencing data
title_full_unstemmed SCSsim: an integrated tool for simulating single-cell genome sequencing data
title_short SCSsim: an integrated tool for simulating single-cell genome sequencing data
title_sort scssim: an integrated tool for simulating single-cell genome sequencing data
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703785/
https://www.ncbi.nlm.nih.gov/pubmed/31584615
http://dx.doi.org/10.1093/bioinformatics/btz713
work_keys_str_mv AT yuzhenhua scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata
AT dufang scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata
AT sunxuehong scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata
AT liao scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata