Cargando…
SCSsim: an integrated tool for simulating single-cell genome sequencing data
MOTIVATION: Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators a...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703785/ https://www.ncbi.nlm.nih.gov/pubmed/31584615 http://dx.doi.org/10.1093/bioinformatics/btz713 |
_version_ | 1783616695945920512 |
---|---|
author | Yu, Zhenhua Du, Fang Sun, Xuehong Li, Ao |
author_facet | Yu, Zhenhua Du, Fang Sun, Xuehong Li, Ao |
author_sort | Yu, Zhenhua |
collection | PubMed |
description | MOTIVATION: Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators are free of whole-genome amplification involved in SCS technique and therefore not suited for generating SCS datasets. We develop a new software package (SCSsim) that can efficiently simulate SCS datasets in a parallel fashion with minimal user intervention. SCSsim first constructs the genome sequence of single cell by mimicking a complement of genomic variations under user-controlled manner, and then amplifies the genome according to MALBAC technique and finally yields sequencing reads from the amplified products based on inferred sequencing profiles. Comprehensive evaluation in simulating different ADO rates, variation detection efficiency and genome coverage demonstrates that SCSsim is a very useful tool in mimicking single-cell sequencing data with high efficiency. AVAILABILITY AND IMPLEMENTATION: SCSsim is freely available at https://github.com/qasimyu/scssim. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7703785 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-77037852020-12-07 SCSsim: an integrated tool for simulating single-cell genome sequencing data Yu, Zhenhua Du, Fang Sun, Xuehong Li, Ao Bioinformatics Applications Notes MOTIVATION: Allele dropout (ADO) and unbalanced amplification of alleles are main technical issues of single-cell sequencing (SCS), and effectively emulating these issues is necessary for reliably benchmarking SCS-based bioinformatics tools. Unfortunately, currently available sequencing simulators are free of whole-genome amplification involved in SCS technique and therefore not suited for generating SCS datasets. We develop a new software package (SCSsim) that can efficiently simulate SCS datasets in a parallel fashion with minimal user intervention. SCSsim first constructs the genome sequence of single cell by mimicking a complement of genomic variations under user-controlled manner, and then amplifies the genome according to MALBAC technique and finally yields sequencing reads from the amplified products based on inferred sequencing profiles. Comprehensive evaluation in simulating different ADO rates, variation detection efficiency and genome coverage demonstrates that SCSsim is a very useful tool in mimicking single-cell sequencing data with high efficiency. AVAILABILITY AND IMPLEMENTATION: SCSsim is freely available at https://github.com/qasimyu/scssim. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-02-15 2019-09-17 /pmc/articles/PMC7703785/ /pubmed/31584615 http://dx.doi.org/10.1093/bioinformatics/btz713 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Applications Notes Yu, Zhenhua Du, Fang Sun, Xuehong Li, Ao SCSsim: an integrated tool for simulating single-cell genome sequencing data |
title | SCSsim: an integrated tool for simulating single-cell genome sequencing data |
title_full | SCSsim: an integrated tool for simulating single-cell genome sequencing data |
title_fullStr | SCSsim: an integrated tool for simulating single-cell genome sequencing data |
title_full_unstemmed | SCSsim: an integrated tool for simulating single-cell genome sequencing data |
title_short | SCSsim: an integrated tool for simulating single-cell genome sequencing data |
title_sort | scssim: an integrated tool for simulating single-cell genome sequencing data |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703785/ https://www.ncbi.nlm.nih.gov/pubmed/31584615 http://dx.doi.org/10.1093/bioinformatics/btz713 |
work_keys_str_mv | AT yuzhenhua scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata AT dufang scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata AT sunxuehong scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata AT liao scssimanintegratedtoolforsimulatingsinglecellgenomesequencingdata |